The present invention relates to a device for processing multi-channel audio signals. The invention also relates to a method for processing multi-channel audio signals and to a computer-readable storage medium.
Audio signals are usually optimized for reproduction in a standardized environment. Multi-channel audio signals in particular require loudspeakers at defined positions relative to a single listener in order to produce the intended spatial image. The listener's ideal position is known as the sweet spot. However, in some cases, the audio reproduction is aimed at two listeners, such as in a car. The listeners, in this case, have defined positions, but are usually not in the sweet spot. Thus, it may be desirable to optimize audio signals for reproduction in such an environment.
Creating a proper spatial image in cars is difficult. In a standardized environment such as a recording studio, the listener can be placed in the sweet spot where the distances and the angles between the listener and the speakers are as prescribed, e.g., symmetrical. In cars with usually two or more seats in the front, however, this is not possible. While most cars have left and right speakers, a simple and effective approach is to add a center speaker, so that imaging becomes more symmetrical. In the simplest form, as shown in
A new set of challenges arises with the introduction of immersive audio formats into the car. The most commonly used immersive audio formats, such as 5.1 Surround sound, 7.1 Surround sound, Dolby Surround, Dolby Atmos, Auro 3D, and MPEG-H, have three front channels: Front Left, Front Center, and Front Right. The immersive audio formats usually have additional channels, but these are not considered here. A simple approach is just to play the left and right channels on left and right speakers and use the center channel to create two virtual centers or phantom centers. This can be achieved by an arrangement as shown in
However, while this approach works for some scenarios, it does not work for others. In particular, music may be mixed in different ways. Usually, information that should be perceived in front of the listener, e.g., the main vocals, is mixed into the center channel. In another commonly used mixing style, however, such information is mixed into the left and right channels. In a studio environment, this works well since it creates a phantom center for the listener's perception. Moreover, it can have the effect of the instruments blending in better with the voice, which is why many sound engineers choose this approach during the mixing process. However, in an environment where the listener is not located in the sweet spot, such as in a car, the image of the voice will not be centered directly in front in this case, but moved outwards to either the left (for the driver/passenger on the left) or the right (for the passenger/driver on the right). Moreover, the music is usually not tagged to indicate which mixing style was applied. It is therefore difficult to enable a correct reproduction of either style.
An object of the present invention is therefore to provide a solution for the problems mentioned above.
As described in the following, the invention solves the problem and is suitable for creating two phantom centers, one in front of each listening position. Advantageously, the solution works regardless of the mixing style. The input multi-channel audio signals can have a conventional format like, for example, one of 5.1 Surround sound, 7.1 Surround sound, Dolby Surround, Dolby Atmos, Auro 3D, and MPEG-H.
In an embodiment, a method for processing multi-channel audio signals that include at least a left channel, a right channel, and a center channel comprises performing a center extraction for the left channel and right channel, wherein a left remainder signal and a right remainder signal remain, adding the extracted center signal to the center channel to obtain an enhanced center channel, and adding the enhanced center channel to both the left remainder signal and the right remainder signal. For reproduction, the left and right remainder signals, each with the enhanced center channel added, are then provided to left and right speakers, and the enhanced center channel is provided to a center speaker.
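As a minimal illustrative sketch of the signal flow described in this embodiment, the following Python code uses a deliberately simple placeholder for the center extraction: the mid signal 0.5 * (L + R) is taken as the extracted center, and the remainders are what is left of each channel after subtracting it. This placeholder, the function names `extract_center` and `process_front_channels`, and the representation of channels as lists of samples are assumptions made for illustration only; the description does not prescribe a particular center extraction algorithm.

```python
from typing import List, Sequence, Tuple

Signal = List[float]


def extract_center(left: Sequence[float],
                   right: Sequence[float]) -> Tuple[Signal, Signal, Signal]:
    """Placeholder center extraction (assumption, not the described unit).

    Takes the mid signal 0.5 * (L + R) as the extracted center and returns
    it together with the left and right remainder signals.
    """
    if len(left) != len(right):
        raise ValueError("left and right channels must have equal length")
    center = [0.5 * (l + r) for l, r in zip(left, right)]
    left_rem = [l - c for l, c in zip(left, center)]
    right_rem = [r - c for r, c in zip(right, center)]
    return center, left_rem, right_rem


def process_front_channels(left: Sequence[float],
                           right: Sequence[float],
                           center: Sequence[float]) -> Tuple[Signal, Signal, Signal]:
    """Return (enhanced_left, enhanced_right, enhanced_center)."""
    if len(center) != len(left):
        raise ValueError("center channel must match left/right length")
    extracted, left_rem, right_rem = extract_center(left, right)
    # Add the extracted center signal to the center channel.
    enhanced_center = [c + e for c, e in zip(center, extracted)]
    # Add the enhanced center channel to each remainder signal.
    enhanced_left = [x + ec for x, ec in zip(left_rem, enhanced_center)]
    enhanced_right = [x + ec for x, ec in zip(right_rem, enhanced_center)]
    return enhanced_left, enhanced_right, enhanced_center


# Vocals mixed into left and right only, versus vocals mixed into center only.
phantom_mix = process_front_channels([1.0, 0.5], [1.0, 0.5], [0.0, 0.0])
center_mix = process_front_channels([0.0, 0.0], [0.0, 0.0], [1.0, 0.5])
```

With this placeholder, a signal mixed equally into the left and right channels and the same signal mixed only into the center channel yield identical outputs, which illustrates the independence from the mixing style. A practical center extraction would typically be more selective than the mid signal used here.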
In a further embodiment, the invention relates to a device for processing multi-channel audio signals that include at least a left channel, a right channel, and a center channel. The device comprises a center extraction unit adapted for extracting a center signal from the left channel and right channel, wherein a left remainder signal and a right remainder signal remain. The device further comprises a first summation unit for adding the extracted center signal to the center channel to obtain an enhanced center channel and two more summation units for adding the enhanced center channel to the left and the right remainder signal, respectively. The device provides, on respective outputs, the enhanced center channel and the respective summation results of the left and the right remainder signals with the enhanced center channel.
In yet a further embodiment, the invention relates to a computer-readable storage device having stored thereon instructions that, when executed on a computer, cause the computer to perform the method as described above.
Further advantageous embodiments are disclosed in the detailed description below.
Details and further advantageous embodiments of the present invention may be better understood by reference to the accompanying figures, which show in
Each of the center extraction unit 410 and the summation units S42, S43, S44 may be implemented by one or more hardware elements, such as one or more processors and/or adders, that may but do not need to be configurable by software.
The enhanced left channel 440L, enhanced right channel 430R, and enhanced center channel 420C are provided to respective outputs of the device. They may be fed to respective loudspeakers LSL, LSC, LSR positioned near two listening positions P1, P2 as follows: A first speaker LSL is positioned to the left and in front of the listening positions P1, P2. A second speaker LSR is positioned to the right and in front of the two listening positions P1, P2. Finally, a third speaker LSC is positioned in the middle and in front of the two listening positions P1, P2. For example, the listening positions P1, P2 may be two adjacent seats in a car, particularly the driver seat and the passenger seat. However, the listening positions can also be located in other, similar environments. Advantageously, the arrangement provides two phantom centers, one in front of each listening position, for all audio information that should be perceived in front of each listener.
The multi-channel audio signal may comprise analog or digital audio signals. Further, it may also comprise one or more additional audio channels, which may be provided to one or more further speakers, e.g., to the side of or behind the listening positions P1, P2. These are not considered here. All processing mentioned above, except for the center extraction, may be performed in the analog domain. In particular, the summation units may perform analog summation or simple superposition of signals. In the case of analog audio input signals, additional analog-to-digital converters (ADC, not shown) for digitizing at least the left and right audio channels L, R are included. If the summation units S42-S44 perform analog summation, additional digital-to-analog converters (DAC, not shown) are also provided for converting the output signals of the center extraction unit 410 into analog signals. The ADCs and/or the DACs may also be part of the center extraction unit 410. Alternatively, the processing may also be performed entirely in the digital domain. In this case, either the input audio signal may be a digital signal, or the device may have a digitization stage (ADC) for digitizing all analog input signals. In the case of digital processing, the device may optionally also comprise a DAC for obtaining analog output signals.
In one embodiment, the invention relates to a system comprising a device for processing multi-channel audio signals as described above and at least three speakers positioned relative to two listening positions as described above.
In one embodiment, the invention relates to a method for audio processing, and in particular for processing multi-channel audio signals that comprise at least a left channel L, a right channel R, and a center channel C.
In particular, the enhanced left channel 440L can be provided to a first speaker LSL positioned to the left and in front of two listening positions P1, P2. Likewise, the enhanced right channel 430R can be provided to a second speaker LSR positioned to the right and in front of the two listening positions P1, P2. Finally, the enhanced center channel 420C can be provided to a third speaker LSC positioned substantially in the middle and in front of said two listening positions P1, P2. Optionally, the enhanced channel signals 440L, 430R, 420C can be fed to additional processing units, such as speaker management and delay adjustment, before being fed to the corresponding physical speakers.
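The optional delay adjustment mentioned above could, for digital signals, be sketched as a per-channel delay by a whole number of samples. The function name `apply_delay`, the sample-list representation, and the example delay values are hypothetical and serve only as an illustration; the description does not specify how the delay adjustment is realized.

```python
from typing import List, Sequence


def apply_delay(signal: Sequence[float], delay_samples: int) -> List[float]:
    """Delay a signal by an integer number of samples, keeping its length.

    Leading samples are filled with silence (0.0); samples shifted past
    the end of the block are dropped.
    """
    if delay_samples < 0:
        raise ValueError("delay_samples must be non-negative")
    n = min(delay_samples, len(signal))
    return [0.0] * n + list(signal[:len(signal) - n])


# Hypothetical per-speaker delays in samples (illustrative values only).
delays = {"LSL": 0, "LSC": 1, "LSR": 2}
block = [1.0, 2.0, 3.0]
delayed = {name: apply_delay(block, d) for name, d in delays.items()}
```

In this sketch, each enhanced channel would be passed through `apply_delay` with its own delay value before being fed to the corresponding speaker.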
The invention is particularly advantageous for correctly processing multi-channel audio signals, independent of how they are mixed, and in cases where neither of two listeners can be located in the conventional sweet spot. That is, the sound that is meant to be perceived in front of the listener will be perceived in the intended way for each of the two listeners, whether the center information is mixed into the center channel or distributed to the left and right channels. Even intermediate solutions where the center information is partly mixed into the center channel and partly distributed can be reproduced as intended. In each case, two phantom centers are created, one for each listener. This means that improved sound reproduction, e.g., in cars, is possible. However, the invention can also be used in other environments like home cinema, trains, public spaces, etc. It may also be adapted for audio formats with more than three speakers in the front.
While various embodiments have been described, it is clear that combinations of features of different embodiments may be possible, even if not expressly mentioned herein. Accordingly, such combinations are considered to be within the scope of the present invention.