This application is a U.S. National Phase of International Patent Application No. PCT/JP2015/073464 filed on Aug. 21, 2015, which claims priority benefit of Japanese Patent Application No. JP 2014-177172 filed in the Japan Patent Office on Sep. 1, 2014. Each of the above-referenced applications is hereby incorporated herein by reference it its entirety.
The present technology relates to an audio processing apparatus, and relates particularly to an audio processing apparatus capable of appropriately converting 7.1-ch audio data into 2-ch audio data.
In an MPEG4 audio standard (ISO/IEC 14496-3:2009/Amd 4:2013), a description method of 7.1-ch Advanced Audio Coding (AAC) and a downmixing method for reducing the number of channels are standardized (for example, see Non-Patent Document 1).
In the above-described standard, however, although a downmixing method for converting 7.1-ch audio data into 5.1 ch is defined, a method for downmixing 7.1-ch audio data to 2-ch audio data is not defined.
For this reason, it is necessary to apply a conventional downmixing method for converting 5.1-ch audio data into 2 ch. In other words, in order to downmix 7.1-ch audio data to 2-ch audio data, it is necessary to downmix the 7.1-ch audio data to 5.1-ch audio data on the basis of the standard, and to further downmix the 5.1-ch audio data thus downmixed to the 2-ch audio data.
As a result, the process becomes cumbersome, and in addition, a total power amount, a power ratio between channels, or a localization position after downmixing in the audio data may be changed and thereby there may be a case where the 7.1-ch audio data cannot be appropriately downmixed to the 2-ch audio data.
The present technology makes it possible to directly convert 7.1-ch audio data into 2-ch audio data and to adjust a total power amount to be appropriate, the same amount as that before downmixing.
An audio processing apparatus according to a first aspect of the present technology includes a coefficient unit and a conversion unit. The coefficient unit stores a coefficient for directly downmixing audio data corresponding to a 7.1-ch speaker system to audio data corresponding to the 2-ch speaker system, specified by a Moving Picture Experts Group 4 (MPEG4) audio standard. The conversion unit directly downmixes, using the coefficient stored in the coefficient unit, the audio data corresponding to the 7.1-ch speaker system to the audio data corresponding to the 2-ch speaker system.
The MPEG4 audio standard may be ISO/IEC 14496-3:2009/Amd 4:2013.
The coefficient can include a third coefficient for downmixing, using a first coefficient for downmixing audio data corresponding to a 7.1-ch speaker system to audio data corresponding to a 5.1-ch speaker system, specified by the Moving Picture Experts Group 4 (MPEG4) audio standard, and a second coefficient for downmixing audio data corresponding to a 5.1-ch speaker system to audio data corresponding to a 2-ch speaker system, specified by the standard, the audio data corresponding to the 7.1-ch speaker system to the audio data corresponding to the 2-ch speaker system. The conversion unit can be configured to directly downmix, using the third coefficient stored in the coefficient unit, the audio data corresponding to the 7.1-ch speaker system to the audio data corresponding to the 2-ch speaker system.
The conversion unit can be configured to directly downmix the audio data corresponding to the 7.1-ch speaker system to the audio data corresponding to the 2-ch speaker system by making the sum of power and a power ratio between channels in the audio data corresponding to the 7.1-ch speaker system and the sum of power and a power ratio between channels in the audio data corresponding to the 2-ch speaker system equal, respectively.
The 7.1-ch speaker system may be 7.1-ch back.
The conversion unit can be configured to set a scaling coefficient which makes the sum of power and a power ratio between channels in the audio data corresponding to the 7.1-ch speaker system and the sum of power and a power ratio between channels in the audio data corresponding to the 2-ch speaker system equal, respectively, to make the sum of power and the power ratio between channels in the audio data corresponding to the 7.1-ch speaker system and the sum of power and the power ratio between channels in the audio data corresponding to the 2-ch speaker system equal, respectively, by the scaling coefficient and the coefficient, and to directly downmix the audio data corresponding to the 7.1-ch speaker system to the audio data corresponding to the 2-ch speaker system.
The scaling coefficient can include a first scaling coefficient which adjusts power of audio data output from a rear surround speaker.
The scaling coefficient can include a first scaling coefficient which adjusts power of audio data output from a rear surround speaker, and a second scaling coefficient which adjusts power of audio data output from a surround speaker.
The 7.1-ch speaker system may be 7.1-ch front.
The conversion unit can be configured to directly downmix the audio data corresponding to the 7.1-ch speaker system to the audio data corresponding to the 2-ch speaker system such that the sum of power and a power ratio between channels in the audio data corresponding to the 7.1-ch speaker system and the sum of power and a power ratio between channels in the audio data corresponding to the 2-ch speaker system are made equal, respectively.
The coefficient unit can be configured to include a coefficient unit which stores a coefficient for directly downmixing the audio data corresponding to the 7.1 ch-speaker system to the audio data corresponding to the 2-ch speaker system, in accordance with an arrangement of speakers which constitute the 7.1-ch front, such that the sum of power and the power ratio between channels in the audio data corresponding to the 7.1-ch speaker system and the sum of power and the power ratio between channels in the audio data corresponding to the 2-ch speaker system are made equal, respectively. The conversion unit can be configured to directly downmix, using the coefficient stored in the coefficient unit, the audio data corresponding to the 7.1-ch speaker system to the audio data corresponding to the 2-ch speaker system such that the sums of power and the power ratios between channels are respectively made equal therebetween.
The coefficient unit can be configured to store a third coefficient for downmixing, using a first coefficient for downmixing audio data corresponding to a 7.1-ch speaker system to audio data corresponding to a 5.1-ch speaker system, specified by the Moving Picture Experts Group 4 (MPEG4) audio standard, and a second coefficient for downmixing audio data corresponding to a 5.1-ch speaker system to audio data corresponding to a 2-ch speaker system, specified by the standard, the audio data corresponding to the 7.1-ch speaker system to the audio data corresponding to the 2-ch speaker system. The conversion unit can be configured to directly downmix, using the third coefficient stored in the coefficient unit, the audio data corresponding to the 7.1-ch speaker system to the audio data corresponding to the 2-ch speaker system such that the sums of power and the power ratios between channels are respectively made equal therebetween.
The conversion unit can be configured to set a scaling coefficient which makes the sum of power and the power ratio between channels in the audio data corresponding to the 7.1-ch speaker system and the sum of power and the power ratio between channels in the audio data corresponding to the 2-ch speaker system equal, to make the sum of power and the power ratio between channels in the audio data corresponding to the 7.1-ch speaker system and the sum of power and the power ratio between channels in the audio data corresponding to the 2-ch speaker system equal, respectively, by the scaling coefficient and the coefficient, and to directly downmix the audio data corresponding to the 7.1-ch speaker system to the audio data corresponding to the 2-ch speaker system.
The 7.1-ch speaker system may be 7.1-ch top.
The coefficient unit can be configured to store a third coefficient for downmixing, using a first coefficient for downmixing audio data corresponding to a 7.1-ch speaker system to audio data corresponding to a 5.1-ch speaker system, specified by the Moving Picture Experts Group 4 (MPEG4) audio standard, and a second coefficient for downmixing audio data corresponding to a 5.1-ch speaker system to audio data corresponding to a 2-ch speaker system, specified by the standard, the audio data corresponding to the 7.1-ch speaker system to the audio data corresponding to the 2-ch speaker system. The conversion unit can be configured to directly downmix, using the third coefficient stored in the coefficient unit, the audio data corresponding to the 7.1-ch speaker system to the audio data corresponding to the 2-ch speaker system such that the sums of power and the power ratios between channels are respectively made equal therebetween.
The conversion unit can be configured to set a scaling coefficient which makes the sum of power and the power ratio between channels in the audio data corresponding to the 7.1-ch speaker system and the sum of power and the power ratio between channels in the audio data corresponding to the 2-ch speaker system equal, respectively, to make the sum of power and the power ratio between channels in the audio data corresponding to the 7.1-ch speaker system and the sum of power and the power ratio between channels in the audio data corresponding to the 2-ch speaker system equal, respectively, by the scaling coefficient and the coefficient, and to downmix the audio data corresponding to the 7.1-ch speaker system to the audio data corresponding to the 2-ch speaker system.
An audio processing apparatus according to a second aspect of the present technology includes a first conversion unit, a second conversion unit, a first coefficient unit, and a second coefficient unit. The first conversion unit downmixes audio data corresponding to a 7.1-ch speaker system to audio data corresponding to the 5.1-ch speaker system, specified by a Moving Picture Experts Group 4 (MPEG4) audio standard. The second conversion unit downmixes the audio data corresponding to the 5.1-ch speaker system downmixed by the first conversion unit to audio data corresponding to the 2-ch speaker system. The first coefficient unit stores a first coefficient for performing, when the audio data corresponding to the 5.1-ch speaker system is finally output, downmixing to the audio data corresponding to the 5.1-ch speaker system. The second coefficient unit stores a second coefficient for performing, when the audio data corresponding to the 2-ch speaker system is finally output, downmixing to the audio data corresponding to the 5.1-ch speaker system. When the audio data corresponding to the 7.1-ch speaker system is finally downmixed to the audio data corresponding to the 2-ch speaker system and output, the first conversion unit downmixes the audio data corresponding to the 7.1-ch speaker system to the audio data corresponding to the 2-ch speaker system, using the second coefficient which is stored in the second coefficient unit and makes the sum of power, a power ratio between channels, and a localization position after downmixing in the audio data corresponding to the 7.1-ch speaker system and the sum of power, a power ratio between channels, and a localization position after downmixing in the audio data corresponding to the 2-ch speaker system to be finally output equal, respectively.
The 7.1-ch speaker system may be 7.1-ch front.
In the first aspect of the present technology, a coefficient for directly downmixing audio data corresponding to a 7.1-ch speaker system to audio data corresponding to the 2-ch speaker system, specified by the Moving Picture Experts Group 4 (MPEG4) audio standard, is stored. The audio data corresponding to the 7.1-ch speaker system is directly downmixed to the audio data corresponding to the 2-ch speaker system using the stored coefficient.
In the second aspect of the present technology, a first coefficient is stored which is for performing downmixing to audio data corresponding to the 5.1-ch speaker system when the audio data corresponding to the 5.1-ch speaker system is finally output, and the second coefficient is stored which is for performing downmixing to audio data corresponding to the 5.1-ch speaker system when audio data corresponding to the 2-ch speaker system is finally output, in a case where audio data corresponding to a 7.1-ch speaker system is downmixed to audio data corresponding to the 5.1-ch speaker system, and the downmixed audio data corresponding to the 5.1-ch speaker system is downmixed to audio data corresponding to the 2-ch speaker system, which are specified by the Moving Picture Experts Group (MPEG4) audio standard. When the audio data corresponding to the 7.1-ch speaker system is finally downmixed to the audio data corresponding to the 2-ch speaker system and output, the audio data corresponding to the 7.1-ch speaker system is downmixed to the audio data corresponding to the 2-ch speaker system, using the second coefficient which makes the sum of power, a power ratio between channels, and a localization position after downmixing in the audio data corresponding to the 7.1-ch speaker system and the sum of power, a power ratio between channels, and a localization position after downmixing in the audio data corresponding to the 2-ch speaker system to be finally output equal, respectively.
The respective audio processing apparatuses according to the first aspect and the second aspect of the present technology may be independent apparatuses or blocks which function as audio processing apparatuses.
According to an aspect of the present technology, audio data corresponding to a 7.1-ch speaker system can be appropriately downmixed to audio data corresponding to a 2-ch speaker system.
<7.1-ch back>
In other words, the arrangement of the speakers in
As illustrated in
As illustrated in
As illustrated in
A 7.1-ch speaker system is constituted by a combination of six speakers arranged symmetrically which include the low-sound speaker LFE and the center speaker C among the group of speakers illustrated in
For example, the 7.1-ch speaker system may be constituted by, in addition to the low-sound speaker LFE and the center speaker C surrounded by dotted lines in
<Conventional Conversion Method in 7.1-ch Back>
Next, with reference to
In other words, the converting apparatus in
The 5.1-ch downmixing unit 11 converts 7.1-ch audio data into 5.1-ch audio data through a multiply-add operation using a coefficient stored in the 5.1-ch downmixing coefficient unit 12, and performs output to the 2-ch downmixing unit 13.
The 2-ch downmixing unit 13 converts 5.1-ch audio data into 2-ch audio data through a multiply-add operation using a coefficient stored in the 2-ch downmixing coefficient unit 14, and performs output.
In a case where 7.1-ch back audio data is input as illustrated in the top portion of
Here, in
In addition, regarding the 5.1-ch audio data converted by the 5.1-ch downmixing unit 11 on the basis of the audio data of the 7.1-ch back speaker system, audio data output from the center speaker C is referred to as audio data C′, audio data output from the left and right speakers L and R are referred to as audio data L′ and R′, respectively, and audio data output from the left and right surround speakers Ls and Rs are referred to as audio data Ls′ and Rs′, respectively.
Furthermore, audio data output from the 2-ch left and right speakers L and R, which have been converted by the 2-ch downmixing unit 13 on the basis of the audio data of the 5.1-ch speaker system, are referred to as audio data Lo and Ro, respectively.
In other words, the 5.1-ch downmixing unit 11 reads a required coefficient from the 5.1-ch downmixing coefficient unit 12, and executes an operation represented by the following expression (1), thereby converting the 7.1-ch back audio data into the 5.1-ch audio data.
C′=C
L′=L
R′=R
Ls′=d1×Ls+d2×Lsr
Rs′=d1×Rs+d2×Rsr
LFE′=LFE (1)
Here, C, L, R, Ls, Rs, Lsr, Rsr, and LFE are audio data respectively output from the center speaker C, the left and right speakers L and R, the left and right surround speakers Ls and Rs, the left and right rear surround speakers Lsr and Rsr, and the low-sound speaker LFE, which constitute the 7.1-ch back. In addition, C′, L′, R′, Ls′, Rs′, and LFE′ are audio data respectively output from the center speaker C, the left and right speakers L and R, the left and right surround speakers Ls and Rs, and the low-sound speaker LFE, which constitute the 5.1 ch. d1 and d2 are coefficients specified by ISO/IEC 14496-3:2009/Amd 4:2013.
In other words, the 5.1-ch downmixing unit 11 reads a coefficient from the 5.1-ch downmixing coefficient unit 12, and multiplies respective audio data of the center speaker C and the left and right speakers L and R by a coefficient of 1.0 to perform conversion, thereby obtaining audio data C′, L′, and R′. In addition, the 5.1-ch downmixing unit 11 multiplies respective audio data of the left and right surround speakers Ls and Rs and the left and right rear surround speakers Lsr and Rsr by the coefficients d1 and d2, to obtain products and sums thereof, thereby obtaining audio data Ls′ and Rs′ of the left and right surround speakers Ls and Rs.
By the conversion process as described above, the 7.1-ch back audio data is converted into the 5.1-ch audio data.
Furthermore, the 2-ch downmixing unit 13 reads a coefficient from the 2-ch downmixing coefficient unit 14, and performs a multiply-add operation to the 5.1-ch audio data, thereby performing conversion into 2-ch audio data. In more detail, the 2-ch downmixing unit 13 converts the 5.1-ch audio data into the 2-ch audio data through an operation represented by the following expression (2).
Lo=a×Ls′+L′+b×C′
Ro=a×Rs′+R′+b×C′ (2)
Here, C′, L′, R′, Ls′, and Rs′ are audio data respectively output from the center speaker C, the left and right speakers L and R, and the left and right surround speakers Ls and Rs, which constitute the 5.1 ch. In addition, Lo and Ro are audio data output from the left and right speakers L and R, respectively, and are 2-ch audio data. Furthermore, a and b are coefficients specified by ISO/IEC 14496-3:2009/Amd 4:2013.
As described above, a two-stage operation process has been conventionally required for converting 7.1-ch audio data into 2-ch audio data, in which conversion into 5.1-ch audio data is first performed and then the converted 5.1-ch audio data is converted into 2-ch audio data. It should be noted that the coefficients used for the operations of the above expressions (1) and (2) are given by way of example only. For example, when forming a sound image in an acoustic space, a coefficient is a combination of various values, and therefore, a coefficient other than the above coefficients may be applied.
<First Embodiment of Converting Apparatus to which Present Technology is Applied>
Next, with reference to
As described above, the two-stage operation process has been conventionally required for converting 7.1-ch audio data into 2-ch audio data, in which conversion into 5.1-ch audio data is first performed and then the converted 5.1-ch audio data is converted into 2-ch audio data, and consequently, the process is cumbersome. Therefore, 7.1-ch audio data is directly converted into 2-ch audio data in the present technology.
In more detail, the converting apparatus includes a 2-ch downmixing unit 21, a 2-ch downmixing coefficient unit 22, a 5.1-ch downmixing unit 23, and 5.1-ch downmixing coefficient unit 24, as illustrated in
The 2-ch downmixing unit 21 reads a coefficient stored in the 2-ch downmixing coefficient unit 22, and performs a multiply-add operation to 7.1-ch audio data, thereby performing conversion into 2-ch audio data through one operation. In other words, the 7.1-ch audio data is directly downmixed to the 2-ch audio data via no 5.1-ch audio data.
In more detail, as illustrated in
Lo=a′×Ls+a″×Lsr+L+b×C
Ro=a′×Rs+a″×Rsr+R+b×C (3)
Here, Lo and Ro are audio data respectively output from the left and right speakers L and R, and are 2-ch audio data. C, L, R, Ls, Rs, Lsr, and Rsr are audio data respectively output from the center speaker C, the left and right speakers L and R, the left and right surround speakers Ls and Rs, and the left and right rear surround speakers Lsr and Rsr, which constitute the 7.1-ch back.
Furthermore, the coefficients a′ and a″ satisfy expressions a′=a×d1 and a″=a×d2, respectively.
In other words, the operation represented by the expression (3) is obtained by substituting the expression (2) in the expression (1).
The above process makes it possible for the converting apparatus to which the present technology is applied to perform conversion into 2-ch audio data through one operation process, while two operation processes are required when converting 7.1-ch audio data into 2-ch audio data in a conventional configuration.
<First Variation>
The example has been described above in which 7.1-ch audio data is converted into 2-ch audio data through one operation by combining coefficients required for the conventional two operations. When using the operation, however, there may be a case where the sum of power and a power ratio between channels in the converted 2-ch audio data are not consistent with those of the 7.1-ch audio data before conversion.
For example, in 2-ch audio data, respective power P (Lo) and power P (Ro) of audio data Lo and Ro output from the left and right speakers are obtained through an operation as represented by the following expression (4).
P(Lo)=(a′)2×(Ls)2+(a″)2×(Lsr)2+(L)2+(b)2×(C)2
P(Ro)=(a′)2×(Rs)2+(a″)2×(Rsr)2+(R)2+(b)2×(C)2 (4)
Accordingly, power P (All_2ch) in the 2-ch audio data satisfies the following expression (5).
Meanwhile, power P (All_7.1ch) in the 7.1-ch audio data is represented by the following expression (6).
P(All_7.1ch)=(C)2+(L)2+(R)2+(Ls)2+(Rs)2+(Lsr)2+(Rsr)2 (6)
In other words, the power P (All_2ch) in the 2-ch audio data and the power P (All_7.1ch) in the 7.1-ch audio data are different from each other.
Therefore, a correction scaling coefficient is set such that the power P (All_2ch) in the 2-ch audio data is made equal to the power P (All_7.1ch) in the 7.1-ch audio data.
The scaling coefficient is a coefficient which makes the power P (All_2ch) in the 2-ch audio data which satisfies the above-described expression (5) consistent with the power P (All_7.1 ch) in the 7.1-ch audio data represented by the above-described expression (6).
In other words, the expression (5) is different from the expression (6) in a point that coefficients of (Ls)2, (Rs)2, (Lsr)2, and (Rsr)2 are not 1 but ½. Therefore, a scaling coefficient is set as a coefficient which adjusts the coefficient to be 1.
As indicated by the following expression (7), a scaling coefficient β1 for adjusting power of audio data of the left and right surround speakers Ls and Rs, and a scaling coefficient β2 for adjusting power of audio data of the left and right rear surround speakers Lsr and Rsr are set.
More specifically, when the coefficients d1, d2, and a change in a range of 1, 1/√2 (=0.7071), and ½ (=0.5), the scaling coefficients β1 and β2 are set as illustrated in
For example, as illustrated in
By setting the scaling coefficients as described above, the 2-ch downmixing unit 21 reduces the two operation processes to one operation process and performs downmixing to 2-ch audio data in which the sum of power and a power ratio between channels are equal to the sum of power and a power ratio between channels in 7.1-ch audio data. As a result, when downmixing 7.1-ch audio data to 2-ch audio data, conventionally required two operation processes can be reduced to one operation process and downmixing can be performed while keeping the sum of power and the power ratio between channels equal to those before downmixing.
<Second Variation>
The example has been described above in which the scaling coefficients β1 and β2 are set in the left and right surround speakers Ls and Rs and the left and right rear surround speakers Lsr and Rsr, respectively, and a change in power occurring when performing downmixing to 2-ch audio data is adjusted. However, when output of the left and right rear surround speakers Lsr and Rsr provided at the rear is made equal to output of the left and right speakers L and R provided at the front, the audio is louder than the audio originally heard, due to the ear shape inherent in humans. In other words, a human ear must hear audio occurring at the rear as audio quieter than that occurring at the front.
Therefore, for the purpose of adjusting the above, only a scaling coefficient α may be set as illustrated in
By doing so, it is possible to appropriately adjust power and to downmix 7.1-ch audio data to 2-ch audio data through one operation. It should be noted that
<7.1-ch front>
The example has been described above in which 7.1-ch back audio data is converted into 2-ch audio data through one operation. However, as indicated by dotted lines in
<Conventional Conversion Method in 7.1-ch front>
In that case, the 5.1-ch downmixing unit 11 converts 7.1-ch front audio data into 5.1-ch audio data as illustrated in the top portion and the middle portion in
C′=C+(Lc+Rc)×e1
L′=L+Lc×e2
R′=R+Rc×e2
Ls′=Ls
Rs′=Rs
LFE′=LFE (8)
Here, C, L, R, Ls, Rs, Lc, Rc, and LFE are audio data respectively output from the center speaker C, the left and right speakers L and R, the left and right surround speakers Ls and Rs, the left and right center speakers Lc and Rc, and the low-sound speaker LFE, which constitute the 7.1-ch front. In addition, C′, L′, R′, Ls′, Rs′, and LFE′ are audio data respectively output from the center speaker C, the left and right speakers L and R, the left and right surround speakers Ls and Rs, and the low-sound speaker LFE, which constitute the 5.1 ch. Furthermore, e1 and e2 are coefficients specified by ISO/IEC 14496-3:2009/Amd 4:2013.
In other words, the 5.1-ch downmixing unit 11 reads a coefficient from the 5.1-ch downmixing coefficient unit 12, and performs an operation in which audio data of the center speaker C is multiplied by a coefficient of 1.0, the sum of audio data Lc and Rc of the left and right center speakers is obtained and multiplied by the coefficient e1, and the values thus obtained are added, thereby performing conversion into audio data C′. In addition, the 5.1-ch downmixing unit 11 reads a coefficient from the 5.1-ch downmixing coefficient unit 12, and performs an operation in which audio data of the left and right speakers L and R are multiplied by a coefficient of 1.0, respective audio data which are audio data Lc and Rc of the left and right center speakers are multiplied by the coefficient e2, and the values thus obtained are added, thereby performing conversion into audio data L′ and R′. Furthermore, the 5.1-ch downmixing unit 11 multiplies respective audio data of the left and right surround speakers Ls and Rs and the low-sound speaker LFE by 1.0 as a coefficient to obtain respective audio data Ls′, Rs′, and LFE′ of the left and right surround speakers Ls and Rs and the low-sound speaker LFE.
By the conversion process as described above, 7.1-ch front audio data is converted into 5.1-ch audio data. It should be noted that since the process for converting the 5.1-ch audio data into the 2-ch audio data illustrated in the middle portion and the lower portion in
In the meantime, even when 7.1-ch front audio data is converted into 2-ch audio data by the above process, the 7.1-ch audio data and the 2-ch audio data have different power.
In other words, when 7.1-ch front audio data is converted into 5.1-ch audio data on the basis of an operation result of the expression (8), power P (All_5.1ch) thereof is obtained through an operation as represented by the following expression (9).
It should be noted that both of the coefficients e1 and e2 are 1/√2.
In other words, when conversion is performed to downmix 7.1-ch front audio data to 5.1 ch, there occur no change in the sum of power and a power ratio between channels.
On the other hand, when 5.1-ch audio data converted from 7.1-ch front audio data is converted into 2-ch audio data, power P (All_2ch) thereof is obtained through an operation as represented by the following expression (10). It should be noted that both of the coefficients e1 and e2 are 1/√2, and the coefficient a is equal to 1.0 and the coefficient b is equal to 1/√2.
In other words, as indicated by the expression (10), it is indicated that the power increases through the conversion for downmixing 5.1-ch audio data to 2-ch audio data. In addition, it can be seen from coefficients of (Lc)2 and (Rc)2 being larger than 1 that the power ratio between channels is changed.
In addition, when 7.1-ch front audio data is converted into 2-ch audio data by the above technique, audio data of the left center speaker Lc is localized on audio data of the left speaker L, and audio data of the right center speaker Rc is localized on audio data of the right speaker R.
In other words, for example, power P (L to Lc) from the left speaker L to the left center speaker Lc is (1/√2+½)2, whereas power P (R to Lc) from the right speaker R to the left center speaker Lc is (½)2. Accordingly, since the power P (L to Lc) from the left speaker L to the left center speaker Lc is approximately 23 times larger than the power P (R to Lc) from the right speaker R to the left center speaker Lc, localization occurs substantially on the left speaker L.
<Second Embodiment of Converting Apparatus to which Present Technology is Applied>
Therefore, the 5-ch downmixing coefficient unit 24 is caused to include the same coefficients as those described above, and the 2-ch downmixing coefficient unit 22 is caused to store such coefficients as illustrated in
Lt=Ls+L+k2×Lc+k4×C+k5×Rc
Rt=Rs+R+k3×Rc+k1×C+k0×Lc (11)
Here, k0=k5=½, k1=k4=1/√2, and k2=k3=√3/2 are satisfied.
<Grounds for Deriving Coefficients k0 to k5>
Here, grounds for deriving coefficients k0 to k5 will be described.
The coefficients k0 and k2 with respect to audio data Lc of the left center speaker Lc are set so as to obtain a power ratio of 3:1 when the audio data Lc of the left center speaker Lc is mixed with audio data L and R of the left and right speakers L and R. In other words, selection is performed such that the audio data Lc of the left center speaker Lc after downmixing is localized on the same position as a reproduction position before downmixing. In other words, it is assumed that the left and right speakers L and R, the left and right center speakers Lc and Rc, and the center speaker C are each arranged at equal intervals in a vertical direction with respect to a direction directly opposite to the user P. For this reason, a power ratio is set so as to correspond at 3:1 by a ratio of physical distance.
In other words, since expressions (k0)2:(k2)2=3:1 and (k0)2+(k2)2=1 are satisfied, when the coefficients k0 and k2 are solved on the basis of the constraint, the coefficients are obtained as k0=½ and k2=√3/2.
Similarly, the coefficients k3 and k5 with respect to audio data Rc of the right center speaker Rc are set so as to obtain a power ratio of 1:3 when the audio data Rc of the right center speaker Rc is mixed with audio data L and R of the left and right speakers L and R. In other words, selection is performed such that the audio data Rc of the right center speaker Rc after downmixing is localized on the same position as a reproduction position before downmixing. In other words, it is assumed that the left and right speakers L and R, the left and right center speakers Lc and Rc, and the center speaker C are each arranged at equal intervals in a vertical direction with respect to a direction directly opposite to the user P. For this reason, a power ratio is set so as to correspond at 1:3 by a ratio of physical distance.
In other words, since expressions (k3)2:(k5)2=1:3 and (k3)2+(k5)2=1 are satisfied, when the coefficients k3 and k5 are solved on the basis of the constraint, the coefficients are obtained as k3=√3/2 and k5=½.
In addition, regarding the coefficients k4 and k1 of audio data C of the center speaker C, the coefficients are determined so as to set a power ratio such that the audio data of the center speaker C corresponds at 1:1 with respect to 2-ch left and right speakers Lt and Rt.
In other words, since expressions (k4)2:(k1)2=1:1 and (k4)2+(k1)2=1 are satisfied, when the coefficients k1 and k4 are solved on the basis of the constraint, the coefficients are obtained as k1=1/√2 and k4=1/√2.
In other words, coefficients k0 to k6 are set in accordance with the arrangement of each speaker in this example. Consequently, a change in power is prevented from occurring between before and after downmixing. As a result, it is possible to realize downmixing with good power balance in accordance with the arrangement of the speakers, while suppressing the change in power between before and after downmixing.
<Third Variation>
The conversion process has been described above in which 7.1-ch front audio data is downmixed to 2-ch audio data through one operation. However, a coefficient for converting 7.1-ch front audio data into 5.1 ch and performing output, and a coefficient for finally performing conversion into 2-ch audio data after conversion into 5.1 ch and performing output may be set separately.
In other words, when finally performing downmixing to 5.1-ch audio data in the converting apparatus in
Alternatively, when finally performing downmixing to 2-ch audio data, the 5-ch downmixing unit 31 reads a coefficient stored in a 2-ch outputting and 5-ch downmixing coefficient unit 33, downmixes 7.1-ch audio data to 5.1 ch through a multiply-add operation, and performs output to a 2-ch downmixing unit 34.
The 2-ch downmixing unit 34 reads a coefficient for performing conversion into 2-ch audio data from a 2-ch downmixing coefficient unit 35, and downmixes the audio data downmixed from 5.1-ch to 2-ch audio data.
Coefficients for finally performing downmixing to 2-ch audio data are those illustrated in
In other words, in order to make power P (All_2ch) in the left and right speakers Lt and Rt equal to power P (All_7.1ch) in 7.1-ch audio data to be input, each of coefficients K14 and K15 is set to 1/√2 such that power distribution of the center speaker CC to the left and right speakers Lt and Rt becomes 1:1.
Furthermore, each of coefficients k10 and k12 is set to 1/√(2+√2) such that power of audio data of the 7.1-ch left center speaker Lc is distributed to the 5.1-ch left speaker LL and the 5.1-ch center speaker CC at 1:1.
Similarly, each of coefficients k11 and k13 is set to 1/√(2+√2) such that power of audio data of the 7.1-ch right center speaker Rc is distributed to the 5.1-ch right speaker RR and the 5.1-ch center speaker CC at 1:1.
As described above, coefficients for performing downmixing to 5.1 ch are switched and used in accordance with whether 7.1-ch audio data as input data is finally output as 5.1-ch audio data or as 2-ch audio data, and thereby similar power to that of 7.1-ch audio data as input data and a power balance can be obtained in any downmixing.
<Fourth Variation>
The example has been described above in which a coefficient specified by ISO/IEC 14496-3:2009/Amd 4:2013 is not used. However, the sum of power and a power ratio between channels may be adjusted to be constant by using coefficients specified by ISO/IEC 14496-3:2009/Amd 4:2013 and then setting a scaling coefficient.
In other words, the configuration of the converting apparatus in that case is the configuration in
Lo=a×Ls+L+a′×Lc×β+b×C+a″×Rc×β
Ro=a×Rs+R+a′×Rc×β+b×C+a″×Lc×β (12)
Here, the coefficient a′ satisfies an expression a′=b×e2+b×e1, the coefficient a″ satisfies an expression a″=b×e1, and β is a scaling coefficient.
Accordingly, when the coefficient e1=e2=b=1/√2 and a=1.0 are satisfied, for example, the audio data of left and right speakers Lo and Ro are represented by the following expression (13).
At that time, power P (Lo) and power P (Ro) are respectively represented by the following expression (14).
P(Lo)==(Ls)2+L2+(Lc)2×β2+(½)×C2+¼×(Rc)2×β2
P(Ro)==(Rs)2+R2+(Rc)2×β2+(½)×C2+¼×(Lc)2×β2 (14)
Accordingly, as indicated by the following expression (15), the scaling coefficient β is set such that power P (All_2ch) in 2-ch audio data is made equal to power P (All_7.1ch) in 7.1-ch audio data. For example, in the case of the expression (14), the scaling coefficient β is set to be equal to 2/√5 as indicated by the following expression (15).
Consequently, an expression 5/4×β2=1 is satisfied in order to obtain power equal to the power P (All_7.1ch) in the 7.1-ch audio data, and therefore, the scaling coefficient β=2/√5 is satisfied.
The above process makes it possible to perform downmixing such that the power P (All_2ch) in the 2-ch audio data is made equal to the power P (All_7.1ch) in the 7.1-ch audio data by using the scaling coefficient β even when coefficients specified by ISO/IEC 14496-3:2009/Amd 4:2013 are used.
<Fifth Variation>
The example has been described above in which the scaling coefficient β is set in the audio data of the left and right center speakers Lc and Rc. However, a scaling coefficient β11 may be further added which sets a power ratio in each audio data of the left and right center speakers Lc and Rc.
In other words, the scaling coefficient β11 is set, for example, as indicated by the following expression (16).
P(Lo)==(Ls)2+L2+(Lc)2×β2+(½)×C2+¼×(Rc)2×β2×(β11)2
P(Ro)==(Rs)2+R2+(Rc)2×β2+(½)×C2+¼×(Lc)2×β2×(β11)2 (16)
Accordingly, power in 2-ch audio data is represented by the following expression (17).
Consequently, an expression β2×(1+¼×(β11)2)=1 is satisfied in order to obtain power equal to the power P (All_7.1ch) in the 7.1-ch audio data, and therefore, when the scaling coefficient β11=2/√3 is satisfied, for example, the scaling coefficient β=√3/2 is satisfied.
In
With the scaling coefficient β11 set as described above, it is possible to eliminate a power change between before and after downmixing to realize downmixing with good power balance.
<7.1-ch top>
The example has been described above in which audio data of a 7.1-ch front speaker system is converted into 2-ch audio data. However, audio data of a 7.1-ch speaker system including left and right top speakers Lv and Rv instead of the left and right center speakers Lc and Rc at the rear, as illustrated in dotted lines in
<Conventional Conversion Method in 7.1-ch top>
In that case, the 5.1-ch downmixing unit 11 converts 7.1-ch top audio data into 5.1-ch audio data by executing an operation represented by the following expression (18), as illustrated in the top portion to the middle portion in
C′=C
L′=L×f1+Lv×f2
R′=R×f1+Rv×f2
Ls′=Ls
Rs′=Rs
LFE′=LFE (18)
Here, C, L, R, Ls, Rs, Lv, Rv, and LFE are audio data respectively output from the center speaker C, the left and right speakers L and R, the left and right surround speakers Ls and Rs, the left and right top speakers Lv and Rv, and the low-sound speaker LFE, which constitute the 7.1-ch top. In addition, C′, L′, R′, Ls′, Rs′, and LFE′ are audio data respectively output from the center speaker C, the left and right speakers L and R, the left and right surround speakers Ls and Rs, and the low-sound speaker LFE, which constitute the 5.1 ch. Furthermore, f1 and f2 are coefficients specified by ISO/IEC 14496-3:2009/Amd 4:2013.
In other words, the 5.1-ch downmixing unit 11 reads a coefficient from the 5.1-ch downmixing coefficient unit 12, and performs an operation by multiplying audio data of the center speaker C by a coefficient of 1.0, thereby performing conversion directly into audio data C′. In addition, the 5.1-ch downmixing unit 11 reads a coefficient from the 5.1-ch downmixing coefficient unit 12, and performs an operation in which audio data of the left and right speakers L and R are multiplied by the coefficient f1, respective audio data Lv and Rv of the left and right top speakers are multiplied by the coefficient f2, and the values thus obtained are added, thereby performing conversion into audio data L′ and R′. Furthermore, the 5.1-ch downmixing unit 11 multiplies respective audio data of the left and right surround speakers Ls and Rs and the low-sound speaker LFE by 1.0 as a coefficient to obtain respective audio data Ls′, Rs′, and LFE′ of the left and right surround speakers Ls and Rs and the low-sound speaker LFE.
By the conversion process as described above, 7.1-ch top audio data is converted into 5.1-ch audio data. It should be noted that the process for converting 5.1-ch audio data into 2-ch audio data illustrated in the middle portion and the lower portion in
Lo=a×Ls+f1×L+f2×Lv+b×C
Ro=a×Rs+f1×R+f2×Rv+b×C (19)
Through the operation of the above-described expression (19), conversion is realized in which 7.1-ch audio data is downmixed to 2-ch audio data as illustrated in
However, even when 7.1-ch top audio data is converted into 2-ch audio data by the above process, the sum of power and a power ratio between channels of the 7.1-ch audio data and those of the 2-ch audio data are different from each other.
In other words, when 7.1-ch front audio data is converted into 2-ch audio data on the basis of an operation result of the expression (18), power P (All_2ch) thereof is obtained through an operation as represented by the following expression (20). It should be noted that the coefficient a=1.0 and the coefficient f1=f2=b=1/√2 are satisfied here.
In other words, as indicated by the expression (20), it is indicated that the power decreases through the conversion in which 7.1-ch audio data is downmixed to 2-ch audio data.
<Sixth Variation>
Therefore, the 5-ch downmixing unit 23 sets a correction scaling coefficient such that power P (All_2ch) in 2-ch audio data is made equal to power P (All_7.1ch) in 7.1-ch top audio data.
The scaling coefficient is a coefficient which makes the power P (All_2ch) in the 2-ch audio data which satisfies the above-described expression (20) consistent with the power P (All_7.1ch) in the 7.1-ch top audio data.
In other words, in the expression (20), a difference from the power P (All_7.1ch) in the 7.1-ch top audio data resides in a point that coefficients of L2, R2, (Lv)2, and (Rv)2 are not 1 but ½. Therefore, coefficients for adjusting the coefficients to be 1 are set.
As indicated by the following expression (21), a scaling coefficient β21 is set as a coefficient which adjusts power of audio data L and R of the left and right speakers L and R, and a scaling coefficient β22 is set as a coefficient which adjusts audio data Lv and Rv of the left and right top speakers Lv and Rv.
More specifically, when the coefficients f1 and f2 change in a range of 1, 1/√2 (=0.7071), and ½ (=0.5), the scaling coefficients β21 and β22 are set as illustrated in
For example, as illustrated in
By setting the scaling coefficients as described above, it is possible to perform conversion into 2-ch audio data of which power is equal to power of 7.1-ch top audio data even when two operation processes are reduced to one operation process.
The above process makes it possible to realize, in any of 7.1-ch back, 7.1-ch front, and 7.1-ch top, a conversion process in which direct downmixing to 2 ch is performed through one operation via no 5.1-ch audio data, and to perform downmixing while maintaining power before the downmixing.
In the meantime, it is possible to cause not only hardware, but also software to execute the above-described series of processes. When software is caused to execute the series of processes, a program constituting the software is installed from a recording medium, in a computer incorporated in dedicated hardware, or, for example, a general-purpose personal computer capable of executing various kinds of functions by installing various kinds of programs.
An input unit 1006, an output unit 1007, a storage unit 1008, and a communication unit 1009 are connected to the input/output interface 1005. The input unit 1006 includes an input device such as a key board and a mouse with which a user inputs an operation command. The output unit 1007 outputs a processing operation screen or an image of a processing result to a display device. The storage unit 1008 includes a hard disk drive which stores a program and various kinds of data. The communication unit 1009 includes a local area network (LAN) adapter and executes a communication process through a network typified by the internet. In addition, a drive 1010 is connected thereto. The drive 1010 reads/writes data with respect to a removable medium 1011 such as a magnetic disk (including a flexible disk), an optical disc (including a compact disc-read only memory (CD-ROM), and a digital versatile disc (DVD)), a magneto-optical disc (including a mini disc (MD)), or a semiconductor memory.
The CPU 1001 executes various processes in accordance with a program stored in the ROM 1002 or a program read from the removable medium 1011 such as a magnetic disk, an optical disc, a magneto-optical disc, or a semiconductor memory, installed in the storage unit 1008, and loaded to the RAM 1003 from the storage unit 1008. In addition, the RAM 1003 appropriately stores data required for the CPU 1001 to execute various processes.
In a computer with the above configuration, the CPU 1001 loads, for example, a program stored in the storage unit 1008 to the RAM 1003 through the input/output interface 1005 and the bus 1004 to execute the program, and thereby the above-described series of processes is performed.
The program executed by the computer (CPU 1001) can be recorded, for example, in the removable medium 1011 as a package medium and provided. In addition, the program can be provided through a wired or wireless transmission medium such as a local area network, the internet, or digital satellite broadcasting.
In the computer, a program can be installed in the storage unit 1008 through the input/output interface 1005 by inserting the removable medium 1011 into the drive 1010. In addition, the program can be received by the communication unit 1009 through the wired or wireless transmission medium and installed in the storage unit 1008. Besides, the program can be installed in advance in the ROM 1002 or the storage unit 1008.
It should be noted that the program executed by the computer may be a program with which processes are performed time-sequentially along the order of description herein, or may be a program with which processes are performed in parallel or at necessary timing, for example, when a call is made.
In addition, a system as used herein means a collection of a plurality of components (such as an apparatus and a module (part)), irrespective of whether all components are included in the same housing. Accordingly, both of multiple apparatuses accommodated in separate housings and connected through a network, and one apparatus in which a plurality of modules is accommodated in one housing are systems.
It should be noted that the embodiment of the present technology is not limited to the embodiments described above. Various modifications may be made without departing from the gist of the present technology.
For example, the present technology may have a cloud computing configuration in which a plurality of apparatuses shares one function through a network, and jointly performs a process.
In addition, each step described with the above flowchart can be executed by one apparatus, and in addition, can be shared and executed by a plurality of apparatuses.
Furthermore, when a plurality of processes is included in one step, the plurality of processes included in the one step can be executed by one apparatus, and in addition, can be shared and executed by a plurality of apparatuses.
It should be noted that the present technology may have the following configurations.
(1) An audio processing apparatus including:
a coefficient unit that stores a coefficient for directly downmixing audio data corresponding to a 7.1-ch speaker system to audio data corresponding to the 2-ch speaker system, specified by a Moving Picture Experts Group 4 (MPEG4) audio standard; and
a conversion unit that directly downmixes, using the coefficient stored in the coefficient unit, the audio data corresponding to the 7.1-ch speaker system to the audio data corresponding to the 2-ch speaker system.
(2) The audio processing apparatus according to (1),
wherein the MPEG4 audio standard is ISO/IEC 14496-3:2009/Amd 4:2013.
(3) The audio processing apparatus according to (1),
wherein the coefficient includes a third coefficient for downmixing, using a first coefficient for downmixing audio data corresponding to a 7.1-ch speaker system to audio data corresponding to a 5.1-ch speaker system, specified by the Moving Picture Experts Group 4 (MPEG4) audio standard, and a second coefficient for downmixing audio data corresponding to a 5.1-ch speaker system to audio data corresponding to a 2-ch speaker system, specified by the standard, the audio data corresponding to the 7.1-ch speaker system to the audio data corresponding to the 2-ch speaker system, and
the conversion unit directly downmixes, using the third coefficient stored in the coefficient unit, the audio data corresponding to the 7.1-ch speaker system to the audio data corresponding to the 2-ch speaker system.
(4) The audio processing apparatus according to (1),
wherein the conversion unit directly downmixes the audio data corresponding to the 7.1-ch speaker system to the audio data corresponding to the 2-ch speaker system by making the sum of power and a power ratio between channels in the audio data corresponding to the 7.1-ch speaker system and the sum of power and a power ratio between channels in the audio data corresponding to the 2-ch speaker system equal, respectively.
(5) The audio processing apparatus according to (1),
wherein the 7.1-ch speaker system is 7.1-ch back. (6) The audio processing apparatus according to (5), wherein the conversion unit sets a scaling coefficient that makes the sum of power and a power ratio between channels in the audio data corresponding to the 7.1-ch speaker system and the sum of power and a power ratio between channels in the audio data corresponding to the 2-ch speaker system equal, respectively, makes the sum of power and the power ratio between channels in the audio data corresponding to the 7.1-ch speaker system and the sum of power and the power ratio between channels in the audio data corresponding to the 2-ch speaker system equal, respectively, by the scaling coefficient and the coefficient, and directly downmixes the audio data corresponding to the 7.1-ch speaker system to the audio data corresponding to the 2-ch speaker system.
(7) The audio processing apparatus according to (6),
wherein the scaling coefficient includes a first scaling coefficient that adjusts power of audio data output from a rear surround speaker.
(8) The audio processing apparatus according to (6),
wherein the scaling coefficient includes a first scaling coefficient that adjusts power of audio data output from a rear surround speaker, and a second scaling coefficient that adjusts power of audio data output from a surround speaker.
(9) The audio processing apparatus according to (1),
wherein the 7.1-ch speaker system is 7.1-ch front.
(10) The audio processing apparatus according to (9),
wherein the conversion unit directly downmixes the audio data corresponding to the 7.1-ch speaker system to the audio data corresponding to the 2-ch speaker system such that the sum of power and a power ratio between channels in the audio data corresponding to the 7.1-ch speaker system and the sum of power and a power ratio between channels in the audio data corresponding to the 2-ch speaker system are made equal, respectively.
(11) The audio processing apparatus according to (10),
wherein the coefficient unit includes a coefficient unit that stores a coefficient for directly downmixing the audio data corresponding to the 7.1 ch-speaker system to the audio data corresponding to the 2-ch speaker system, in accordance with an arrangement of speakers that constitute the 7.1-ch front, such that the sum of power and the power ratio between channels in the audio data corresponding to the 7.1-ch speaker system and the sum of power and the power ratio between channels in the audio data corresponding to the 2-ch speaker system are made equal, respectively, and
the conversion unit directly downmixes, using the coefficient stored in the coefficient unit, the audio data corresponding to the 7.1-ch speaker system to the audio data corresponding to the 2-ch speaker system such that the sums of power and the power ratios between channels are respectively made equal therebetween.
(12) The audio processing apparatus according to (10),
wherein the coefficient unit stores a third coefficient for downmixing, using a first coefficient for downmixing audio data corresponding to a 7.1-ch speaker system to audio data corresponding to a 5.1-ch speaker system, specified by the Moving Picture Experts Group 4 (MPEG4) audio standard, and a second coefficient for downmixing audio data corresponding to a 5.1-ch speaker system to audio data corresponding to a 2-ch speaker system, specified by the standard, the audio data corresponding to the 7.1-ch speaker system to the audio data corresponding to the 2-ch speaker system, and
the conversion unit directly downmixes, using the third coefficient stored in the coefficient unit, the audio data corresponding to the 7.1-ch speaker system to the audio data corresponding to the 2-ch speaker system such that the sums of power and the power ratios between channels are respectively made equal therebetween.
(13) The audio processing apparatus according to (12),
wherein the conversion unit sets a scaling coefficient that makes the sum of power and the power ratio between channels in the audio data corresponding to the 7.1-ch speaker system and the sum of power and the power ratio between channels in the audio data corresponding to the 2-ch speaker system equal, respectively, makes the sum of power and the power ratio between channels in the audio data corresponding to the 7.1-ch speaker system and the sum of power and the power ratio between channels in the audio data corresponding to the 2-ch speaker system equal, respectively, by the scaling coefficient and the coefficient, and directly downmixes the audio data corresponding to the 7.1-ch speaker system to the audio data corresponding to the 2-ch speaker system.
(14) The audio processing apparatus according to (1),
wherein the 7.1-ch speaker system is 7.1-ch top.
(15) The audio processing apparatus according to (14),
wherein the coefficient unit stores a third coefficient for downmixing, using a first coefficient for downmixing audio data corresponding to a 7.1-ch speaker system to audio data corresponding to a 5.1-ch speaker system, specified by the Moving Picture Experts Group 4 (MPEG4) audio standard, and a second coefficient for downmixing audio data corresponding to a 5.1-ch speaker system to audio data corresponding to a 2-ch speaker system, specified by the standard, the audio data corresponding to the 7.1-ch speaker system to the audio data corresponding to the 2-ch speaker system, and
the conversion unit directly downmixes, using the third coefficient stored in the coefficient unit, the audio data corresponding to the 7.1-ch speaker system to the audio data corresponding to the 2-ch speaker system such that the sums of power and the power ratios between channels are respectively made equal therebetween.
(16) The audio processing apparatus according to (15),
wherein the conversion unit sets a scaling coefficient that makes the sum of power and the power ratio between channels in the audio data corresponding to the 7.1-ch speaker system and the sum of power and the power ratio between channels in the audio data corresponding to the 2-ch speaker system equal, respectively, makes the sum of power and the power ratio between channels in the audio data corresponding to the 7.1-ch speaker system and the sum of power and the power ratio between channels in the audio data corresponding to the 2-ch speaker system equal, respectively, by the scaling coefficient and the coefficient, and downmixes the audio data corresponding to the 7.1-ch speaker system to the audio data corresponding to the 2-ch speaker system.
(17) An audio processing apparatus including:
a first conversion unit that downmixes audio data corresponding to a 7.1-ch speaker system to audio data corresponding to the 5.1-ch speaker system, specified by a Moving Picture Experts Group 4 (MPEG4) audio standard;
a second conversion unit that downmixes the audio data corresponding to the 5.1-ch speaker system downmixed by the first conversion unit to audio data corresponding to the 2-ch speaker system;
a first coefficient unit that stores a first coefficient for performing, when the audio data corresponding to the 5.1-ch speaker system is finally output, downmixing to the audio data corresponding to the 5.1-ch speaker system; and
a second coefficient unit that stores a second coefficient for performing, when the audio data corresponding to the 2-ch speaker system is finally output, downmixing to the audio data corresponding to the 5.1-ch speaker system,
wherein when the audio data corresponding to the 7.1-ch speaker system is finally downmixed to the audio data corresponding to the 2-ch speaker system and output, the first conversion unit downmixes the audio data corresponding to the 7.1-ch speaker system to the audio data corresponding to the 2-ch speaker system, using the second coefficient that is stored in the second coefficient unit and makes the sum of power, a power ratio between channels, and a localization position after downmixing in the audio data corresponding to the 7.1-ch speaker system and the sum of power, a power ratio between channels, and a localization position after downmixing in the audio data corresponding to the 2-ch speaker system to be finally output equal, respectively.
(18) The audio processing apparatus according to (17),
wherein the 7.1-ch speaker system is 7.1-ch front.
Number | Date | Country | Kind |
---|---|---|---|
2014-177172 | Sep 2014 | JP | national |
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/JP2015/073464 | 8/21/2015 | WO | 00 |
Publishing Document | Publishing Date | Country | Kind |
---|---|---|---|
WO2016/035567 | 3/10/2016 | WO | A |
Number | Name | Date | Kind |
---|---|---|---|
5426702 | Aarts | Jun 1995 | A |
20040032960 | Griesinger | Feb 2004 | A1 |
20100324915 | Seo | Dec 2010 | A1 |
20150030160 | Lee | Jan 2015 | A1 |
Number | Date | Country |
---|---|---|
2008-509600 | Mar 2008 | JP |
2008-535015 | Aug 2008 | JP |
2014007096 | Jan 2014 | WO |
Entry |
---|
“Information Technology—Coding of Audio-Visual Objects”, Part 3: Audio, ISO/IEC14496-3:2009/Amd.4:2013, Apr. 1, 2015, pp. 04. |
“Information technology—Coding of audio-visual objects, Part 3: Audio”, ISO/IEC14496-3:2009/Amd.4:2013 Technical Corrigendum 1, Apr. 2015, 04 pages. |
Number | Date | Country | |
---|---|---|---|
20170257720 A1 | Sep 2017 | US |