The present invention will further be explained below with reference to exemplary embodiments illustrated in the accompanying drawings, in which:
The Prior Art device 1′ shown in
The number of output channels (N outputs 6) is greater than the number of input channels (M inputs 5). Exemplary values are N=6 and M=2, as when a stereo audio signal is converted into a 5.1 audio signal, or N=2 and M=1, as when a stereo signal is encoded as a mono signal plus additional information, although other values of M and N are also possible. The output channels typically have (mutual) correlations defined by parameters fed to the upmix unit 4. To produce output channels having the desired correlations, a set of mutually uncorrelated channels is derived from the input channels. To this end, decorrelation units 3 are coupled to each input 5 so as to produce sets of uncorrelated input channels. The actual number of decorrelation filters, which are well known in the art, may vary and is not limited to the number shown in the drawings.
The decorrelation units 31, . . . , 39 typically include filters having all-pass characteristics. Such filters substantially maintain the spectral envelope of the audio signal. However, the all-pass characteristics have the disadvantage of introducing a time delay. In addition, they often cause a “smearing” of the input signal, that is, the temporal envelope of the decorrelated signal is less well-defined than the temporal envelope of the original signal. Both the time delay and the “smearing” result in a discrepancy between the audio signal and the corresponding parameters: some signal parts (that is, time segments of the signal produced by decorrelation filters) reach the upmix unit later than the corresponding parameters. As a result, the wrong parameters are applied to these signal parts and the audio signal is processed incorrectly, leading to a perceptible signal distortion, for example cross-talk. It will be understood that this is highly undesirable.
It is noted that the parameters could be delayed (e.g. be a delay unit) so as to better match the timing of the parameters and the signals. However, the upmix unit 4 also receives the un-decorrelated input signals, which have not been delayed. In addition, the “smearing” may be frequency-dependent. As a result, it is difficult to match the parameters and the corresponding signal parts.
The present invention solves this problem by processing the audio signal prior to the decorrelation. That is, a substantial part of the signal processing is performed before the audio signal is fed to the decorrelation filters. In this way, the mismatch caused by the decorrelation filters is largely avoided.
The device 1 according to the present invention and illustrated merely by way of non-limiting example in
The pre-processing unit 2 receives the M input channels of the audio signal through the M inputs 5. The unit 2 also receives parameters relating to the audio signal, which are indicative of desired signal properties. Using these parameters, the pre-processing unit 2 performs signal processing such as adjusting the power ratios of the audio channels and predicting some audio channels on the basis of other audio channels. As a result, power ratio adjustment and prediction are carried out without being influenced by the decorrelation filters 3, and any time mismatch between the audio signal and the parameters relating to these operations is avoided.
It will be understood that not all signal processing can be performed by the pre-processing unit. Setting the desired correlations of the audio channels typically requires the availability of uncorrelated channels as produced by the decorrelation filters 3. Accordingly, setting the correlations is performed by the upmix unit 4. In addition, additional signal adjustments may be made by the upmix unit 4, such as an additional adjustment of the power levels of the audio channels. In this case, the power adjustment may be carried out in both the pre-processing unit 2 and the upmix unit 4, although it is very well possible to perform this operation in only one of these units.
An additional advantage of the present invention is the possibility to choose which of the units 2 and 4 is best suitable for performing a certain signal processing operation. By providing two units (2 and 4) instead of a single unit (4), a greater design flexibility is achieved, and the unfavorable effects of the decorrelation units can be avoided to the greatest extent possible.
In the preferred embodiments of the present invention, the pre-processing unit 2 and the upmix unit 4 are both time-variant: their signal processing properties are controlled by signal parameters which may vary in time. The decorrelation filters 3, however, are preferably time-invariant: their properties are not time-dependent and are preferably not controlled by signal parameters that vary over time. Embodiments can be envisaged in which either the pre-processing unit 2 or the upmix unit 4 is time-invariant.
In further advantageous embodiments, the processing performed by the pre-processing unit 2 and/or the upmix unit 4 is frequency-dependent: the signal processing properties of these units may be controlled by parameters which vary in dependence of the frequency.
As mentioned above, the number of output channels (N) is greater than the number of input channels (M). For example, there may be two input channels and five or six output channels, or there may be a single input channel and two or more output channels, although other combinations are possible.
It is also possible that the number of output channels 6 is equal to the number of input channels 5 (that is, M=N), in which case the device of the present invention provides a remix of the audio channels. This may be useful to adjust certain signal properties and to enhance the audio signal.
It is noted that the audio signal may be constituted by a series of signal parts contained in consecutive time segments. Such time segments may be time frames or other units defining a time-limited signal part. Due to the decorrelation units the synchronization between the time segments and the corresponding parameters may be lost. This problem is solved by the present invention.
A merely exemplary embodiment of the device of the present invention is shown in more detail in
A (first) gain unit 21 having a gain G1 could be added between the input terminal and the first decorrelation unit 31 but has been omitted from the embodiment shown where the first gain G1 is equal to 1.
The upmix unit 4 comprises, in the example shown, three mixing units 41, 42 and 43 which mix the input channel and its three auxiliary channels to produce four output channels Lf (Left front), Ls (Left surround), Rf (Right front) and Rs (Right surround). The mixing unit 41 receives the (time-dependent) parameters IID_lr (Inter-channel Intensity Difference left-right) and ICC_lr (Inter-channel Cross-Correlation left-right), the mixing unit 42 receives the (time-dependent) parameters IID_l (Inter-channel Intensity Difference left front-left surround) and ICC_l (Inter-channel Cross-Correlation left front-left surround), while the mixing unit 43 receives the (time-dependent) parameters IID_r (Inter-channel Intensity Difference right front-right surround) and ICC_r (Inter-channel Cross-Correlation right front-right surround).
The parameters mentioned above are typically used in a so-called mixing matrix to determine the desired output signals. For example, the output signals Rf (Right front) and Rs (Right surround) may be determined by a mixing matrix M of mixing unit 43:
where the matrix M has coefficients m11 . . . m22, and where H3(G3·S)=S3 is the output signal of decorrelation unit 33. The normalized correlation coefficient ICC of the signals Rf and Rs is given by:
where σ2x is the power of signal x. The intensity ratio IID is given by:
As the total power should be unaltered, it follows that:
σR2=m112 σR2+m122 σS32+m212 σR2+m222 σH32 (4)
It has been found that the further constraint m12=−m22 is effective. In other words, the power of the intermediate signal (auxiliary channel) S3 in both signals Rf and Rs is equal but has opposite signs (anti-phase). If m12=−m22 holds, the factors m12 and m22 can be moved upstream of decorrelator unit 33, for example to gain unit 23, to allow processing prior to decorrelation. Equation (1) can then be rewritten as:
Equation (1′) can be generalized using a parameter c:
For c=1 all time-variant processing of the decorrelator signal path is performed upstream of the decorrelator, while for c=G3·m12 all time-variant processing of the decorrelator signal path is performed downstream of the decorrelator. In accordance with the present invention, the parameter c will preferably have a value approximately or substantially equal to 1.
In the exemplary embodiment described above the upmix unit 4 sets both the cross-correlation and the intensity difference of the four output channels. This is, of course, not essential and in some embodiments the inter-channel intensity may be set in the pre-processing unit 2. This may be accomplished by performing all mixing operations in the pre-processing unit 2, for example directly using the input signal S.
It can be seen from
Another example of a device 1 according to the present invention is illustrated in
As can be seen from
An exemplary stereo decoder in accordance with the present invention is illustrated in
An audio system 10 according to the present invention is schematically illustrated in
Accordingly, the present invention may be used in audio amplifiers and/or systems. Such audio systems may include one or more audio sources, an amplifier and loudspeaker units or their equivalents. The audio sources may include a CD player, a DVD player, an MP3 or AAC player, a radio tuner, a hard disk, and/or other sources. The audio system may be incorporated in an entertainment center or in a computer system.
As discussed above, the present invention provides both a device and a method. The method steps are evident from
The present invention is based upon the insight that the time delay and possible “smearing” caused by the decorrelation in an audio decoder may cause temporal alignment discrepancies between the signal parameters and the corresponding signal parts. The present invention benefits from the further insight that this discrepancy can be eliminated, at least for certain signal processing operations, by carrying out these operations prior to the decorrelation.
It is noted that any terms used in this document should not be construed so as to limit the scope of the present invention. In particular, the words “comprise(s)” and “comprising” are not meant to exclude any elements not specifically stated. Single (circuit) elements may be substituted with multiple (circuit) elements or with their equivalents.
It will be understood by those skilled in the art that the present invention is not limited to the embodiments illustrated above and that many modifications and additions may be made without departing from the scope of the invention as defined in the appending claims.
Number | Date | Country | Kind |
---|---|---|---|
04103370.5 | Jul 2004 | EP | regional |
05103072.6 | Apr 2005 | EP | regional |
Filing Document | Filing Date | Country | Kind | 371c Date |
---|---|---|---|---|
PCT/IB05/52293 | 7/11/2005 | WO | 00 | 1/9/2007 |