Aspects disclosed herein generally relate to an apparatus and method for providing stereo image enhancement of a multi-channel loudspeaker setup in accordance with one aspect. In one example, the apparatus and method for providing the stereo image enhancement of the multi-channel loudspeaker setup may be performed via mono and side audio decoding and amplitude panning. These aspects and others will be discussed in more detail below.
Stereo widening is generally required if an available loudspeaker setup is insufficient to reproduce a satisfactory stereo width and proper image/stage. For devices with small loudspeaker dimensions, for example, such as laptops or mobile devices, stereo widening helps to achieve sufficient stereo perception despite the small dimensions. Stereo widening may also improve the stereo distribution in larger playback systems. In asymmetrical listening situations, such as in a vehicle, a problem or issue generally arises when the listener is positioned too close to one of the loudspeakers and the stereo width is unnaturally narrow.
In at least one embodiment, an audio system for providing stereo image enhancement for a multi-channel loudspeaker is provided. The audio system includes memory and at least one controller. The at least one controller includes the memory and is programmed to receive an audio input including a first audio component and a second audio component from an audio playback device and to encode the first audio component and the second audio component to generate a first encoded audio component and a second encoded audio component. The at least one controller is programmed to apply a first gain to at least one of the first encoded audio component and the second encoded audio component to generate at least one of a first increased encoded audio component and a second increased encoded audio component and to decode the at least one of the first increased encoded audio component and the second increased encoded audio component to generate at least one of a first decoded audio component and a second decoded audio component. The at least one controller is further programmed to amplitude pan the at least one of the first decoded audio component and the second decoded audio component to increase a stereo width for an audio output signal transmitted by a first loudspeaker and a second loudspeaker.
In at least one embodiment, a method for providing stereo image enhancement for a multi-channel loudspeaker is provided. The method includes transmitting an audio output signal in a listening environment via a first loudspeaker and a second loudspeaker and receiving an audio input including a first audio component and a second audio component from an audio playback device. The method further includes encoding the first audio component and the second audio component to generate a first encoded audio component and a second encoded audio component and applying a first gain to at least one of the first encoded audio component and the second encoded audio component to generate at least one of a first increased encoded audio component and a second increased encoded audio component. The method further includes decoding the at least one of the first increased encoded audio component and the second increased encoded audio component to generate at least one of a first decoded audio component and a second decoded audio component and amplitude panning the at least one of the first decoded audio component and the second decoded audio component to increase a stereo width for the audio output signal transmitted by the first loudspeaker and the second loudspeaker.
In at least one embodiment, a computer-program product embodied in a non-transitory computer readable medium that is programmed for providing stereo image enhancement for a multi-channel loudspeaker is provided. The computer-program product includes instructions for receiving an audio input including a first audio component and a second audio component from an audio playback device and encoding the first audio component and the second audio component to generate a first encoded audio component and a second encoded audio component. The computer-program product includes instructions for applying a first gain to at least one of the first encoded audio component and the second encoded audio component to generate at least one of a first increased encoded audio component and a second increased encoded audio component and decoding the at least one of the first increased encoded audio component and the second increased encoded audio component to generate at least one of a first decoded audio component and a second decoded audio component. The computer-program product includes instructions for amplitude panning the at least one of the first decoded audio component and the second decoded audio component to increase a stereo width for an audio output signal transmitted by a first loudspeaker and a second loudspeaker.
The embodiments of the present disclosure are pointed out with particularity in the appended claims. However, other features of the various embodiments will become more apparent and will be best understood by referring to the following detailed description in conjunction with the accompany drawings in which:
As required, detailed embodiments of the present invention are disclosed herein; however, it is to be understood that the disclosed embodiments are merely exemplary of the invention that may be embodied in various and alternative forms. The figures are not necessarily to scale; some features may be exaggerated or minimized to show details of particular components. Therefore, specific structural and functional details disclosed herein are not to be interpreted as limiting, but merely as a representative basis for teaching one skilled in the art to variously employ the present invention.
It is recognized that the controllers as disclosed herein may include various microprocessors, integrated circuits, memory devices (e.g., FLASH, random access memory (RAM), read only memory (ROM), electrically programmable read only memory (EPROM), electrically erasable programmable read only memory (EEPROM), or other suitable variants thereof), and software which co-act with one another to perform operation(s) disclosed herein. In addition, such controllers as disclosed utilizes one or more microprocessors to execute a computer-program that is embodied in a non-transitory computer readable medium that is programmed to perform any number of the functions as disclosed. Further, the controller(s) as provided herein includes a housing and the various number of microprocessors, integrated circuits, and memory devices ((e.g., FLASH, random access memory (RAM), read only memory (ROM), electrically programmable read only memory (EPROM), electrically erasable programmable read only memory (EEPROM)) positioned within the housing. The controller(s) as disclosed also include hardware-based inputs and outputs for receiving and transmitting data, respectively from and to other hardware-based devices as discussed herein.
Stereo widening may be necessary if the available loudspeaker setup is insufficient to reproduce a satisfactory stereo width and proper image/stage. For devices with small loudspeaker dimensions, for example, laptops or mobile devices, stereo widening helps to achieve sufficient stereo perception despite the small dimensions. Stereo widening may also improve the stereo distribution in larger playback systems. In asymmetrical listening situations, such as in a vehicle, a problem that may occur is when the listener is too close to one of the loudspeakers and the stereo width is unnaturally narrow. Aspects disclosed herein enables a stereo base to be expanded beyond the physical distance of the loudspeakers positioned in a listening environment. Additionally, aspects disclosed herein enables the position of a phantom (or virtual) center loudspeaker to be adjusted (or moved) in an asymmetrical listening position. In this case, a proper and stable stereo and stage imaging audio experience may be achieved.
Aspects disclosed herein may also provide various processing operations such as splitting up a stereo signal into a side signal (or part) (e.g., de-correlated content) and a mono signal (or part) (e.g., correlated content) and using amplitude panning to shift the side signal to rear loudspeakers while the mono signal is panned to a phantom (or virtual) center loudspeaker. In this case, the widening effect can be applied to outer signal parts (or outer side signals) only (e.g., instruments that are panned to the left or right or decorrelated signal content like reverberation, etc.) while the center content (or mono content) may be kept stable (e.g., the solo instrument, mono speaker or singer, etc.). With this aspect, it is possible to emphasize reverberation and side-panned content to draw attention toward the stereo content. The center image stability may be controlled by the level of the mono signal. In addition, the position of the phantom center may be modified which may be useful for asymmetric listening positions such as in a vehicle. Compared to stereo widening implementations that are based on de-correlation and correlated audio signals, the aspects provided herein may avoid comb-filtering artifacts, smearing of the stereo image and incorrect localization of the phantom center.
The stereo content (or stereo audio signal) may be divided into two parts, a stereo part and a mono part. The stereo part and mono part may be the only two channels that can be weighted differently. Once these channels are encoded and then later decoded prior to being transmitted from a loudspeaker, the decoded signal is translated into a left/right signal, which is performed by using the two parts (e.g., stereo and mono) together. However, it is possible to use the mono part separately and either run the mono part in a separate (third) channel or to merge the mono part with the stereo part in such a way that the mono content (correlated part) is no longer perceived in the middle, but off center, for example, further to the left. The transformation from left/right into mono/stereo parts is a lossless transformation and represents a different (two-channel) representation of the signal.
The aspects disclosed herein may provide the following: (1) a stable and adjustable phantom center position, (2) less comb filtering and phasing artifacts compared to conventional stereo widening implementations, (3) improved widening effect for larger loudspeaker setups (e.g., vehicle, HiFi, etc.); (4) the possible re-use of existing audio processing objects; and (5) low machine instructions per second (MIPS)/micro-electro-mechanical (MEM) footprint. It is recognized that a minimum 4-channel loudspeaker setup may be used in connection with the system and method described herein.
The loudspeaker array 104 generally includes a plurality of loudspeakers 104a-104d (see
Referring now to
The M/S encoding block 204 generates mono audio (M) and side audio (S) based on the left audio data (L) and the right audio data (R). For example, the M/S encoding block 204 adds the left audio data (L) to the right audio data (R) to generate the mono audio (M) that is output therefrom. Similarly, the M/S encoding block 204 subtracts the left audio data (L) from the mono output to generate side audio data (S). It is recognized that instead of M/S processing, other processing techniques may include or be added to M/S processing which includes shuffling. The side gain block 206 applies a gain to the side audio (S) as output from the M/S encoding block 204. The mono gain block 208 applies a gain to the mono audio (M). It is recognized that the mono gain block 208 may apply small or low gain to the mono audio (M) (i.e., almost a unity gain). It is also recognized that the mono gain block 208 may also decrease the loudness of the mono audio (M) if the gain block 208 is set to a fraction. The amount of gain applied by the mono gain block 208 may vary based on the desired criteria of a particular implementation. The side gain block 206 applies a gain (e.g., a large gain) to the side audio (S) to increase the loudness of the side audio (S). In another example, the controller 105 may adjust a frequency response of the output from the side gain block 206 prior to being received at the M/S decoding block 212 to obtain or increase the side or stereo perception. Additionally, the phantom center gain block 210 applies a gain to the mono audio (M) that is independent of the gain applied to the mono audio (M) from the mono gain block 208. In this case, the phantom center gain block 210 may apply a larger gain than that applied by the mono gain block 208 to provide for an independent mono audio (M) that bypasses the M/S decoding block 212. The phantom center gain block 210 increases the loudness of the mono audio (M) which is fed to the amplitude panning block 214.
The M/S decoding block 212 decodes the increased side audio (S) (e.g., as provided by the side gain block 206) and decodes the mono audio (M) as output by the mono gain block 208. The M/S decoding block 212 then generates left audio data (L) and the right audio data (R) based on the increased side audio (S) and the mono audio (M). The left audio data (L) and the right audio data (R) as generated by the M/S decoding block 212 is increased given that such data is based on an increase to the side audio (S) by the side gain block 206.
The M/S decoding block 212 generates the left audio data (L) by adding the mono audio (M) to the side audio (S). The M/S decoding block 212 generates the right audio data (R) by subtracting the side audio (S) from the mono audio (M). The amplitude panning block 214 receives the increased left audio data (L) and the increased right audio data (R) in addition to the increased mono audio (M) provided by the phantom center gain block 210. The amplitude panning block 214 applies amplitude panning to the increased left and right audio data (L) and (R) to shift such audio data to the rear loudspeakers 104c and 104d while the mono audio (M) provided by the phantom center gain block 210 is panned to provide the phantom center loudspeaker 112 at the position (or location) 156 along the first axis 158 to provide the center channel for the listener 108.
Amplitude panning generally corresponds to the amplitude (playback level) of an audio output from a loudspeaker 104 that changes based on an angle. A phantom source generally requires two loudspeakers (e.g., in our case loudspeakers 104a and 104b). If a mono sound source is played through the two loudspeakers 104a and 104b (e.g., both loudspeakers 104a and 104b play the same), the phantom sound source (or phantom center loudspeaker 112 is created, for example, in the middle, between the two loudspeakers 104a and 104b. If the playback level of one loudspeaker 104a is increased and that of the other loudspeaker 104b is reduced, the phantom center loudspeaker 112 moves in the direction toward the louder loudspeaker (e.g., the loudspeaker 104a). Amplitude panning generally entails that one can place a sound source between the speakers by changing the playback level of two speakers. This may be accomplished with a mono center signal or with a stereo signal (for example the left channel).
A control input angle is provided to the amplitude panning block 214 to adjust an angle of audio transmission for the front left loudspeaker 104a and the front right loudspeaker 104b, respectively. Due to the amplitude panning as employed herein, an engineer (or end user) has the ability to input the control input angle (via a user interface that is either positioned on the audio playback device 102 or on another controller in communication with the audio playback device 102). Such a control angle enables the user to specify the exact degree on the polar coordinate at which the phantom center loudspeaker 112 transmits a center channel audio signal. If the signal that is normally only played from one loudspeaker (for example the left stereo channel from the left front loudspeaker 104a) is played through two loudspeakers (for example from the left front and left rear loudspeakers 104a and 104c), it appears as a phantom sound source (or phantom center loudspeaker 112) between the two loudspeakers 104a and 104b. In the example as set forth herein, the left stereo signal detaches from the position of the physical loudspeaker (front left) 104a and moves towards the rear left loudspeaker 104c. This increases the stereo width, and the left “end” of the stereo image moves further to the left and to the rear. Due to the amplitude panning method, the controller 105 enables the engineer to place the virtual speakers (or phantom sound sources) to the exact degree.
The panning correction block 216 enables an adjustment (e.g., linear increase or decrease) of the gain after the amplitude panning is performed. The routing matrix 218 defines which audio is transmitted to the various loudspeakers 104a-104d (e.g., right/left and front/rear). In theory, amplitude panning algorithms match the perceived position of a sound source. In practice, however, this may not be the case, since the listening room adds reflections or the loudspeakers 104a-104d do not have an optimal transfer function or are not placed at the same distance from the listener 108. As a result, a sound source that is supposed to come from 30°, for example, is not perceived at this location. The panning correction block 216 may compensate for this condition. For example, such compensation may occur by adjusting the level of the two loudspeakers 104a-104d at 30°. The controller 105 is therefore dynamically adapted to the set angle (e.g., angle control input). For example, the controller 105 may include gains, delays or digital filters that act based on the angle and compensate for the perceived localization problems. The speaker parameters block 220 adjusts or balances the spectral behavior of the audio output for the loudspeakers 104a-104d.
In operation 251, the controller 105 receive left audio data (L) (e.g., a first audio component) and right audio data (R) (e.g., a second audio component) from the audio playback device.
In operation 252, the controller 105 increases a gain (or loudness) of left audio data (L) and right audio data (R) as received from the audio playback device 102.
In operation 254, the controller 105 generates mono audio data (M) and side audio data (S) based on the increased gain of the left audio data (L) and the right audio data (R). For example, the controller 105 encodes the left audio data (L) and the right audio data (R). By encoding, the controller 105 adds the left audio data (L) to the right audio data (R) to generate the mono audio data (M) (e.g., a first encoded audio component). Further, by encoding, the controller 105 subtracts the left audio data (L) from the mono audio data (M) to generate side audio data (S) (e.g., a first encoded audio component).
In operation 256, the controller 105 applies a gain to the mono audio data (M) and to the side audio data (S) to generate an increased mono audio data (M) and an increased side audio data (S).
In operation 258, the controller 105 decodes the increased side audio data (S) and decodes the increased mono audio data (M). The M/S decoding block 212 then generates the left audio data (L) and the right audio data (R) based on the increased side audio data (S) and the increased mono audio data (M). The controller 105 generates the left audio data (L) by adding the increased mono audio data (M) to the side audio data (S). The controller 105 generates the right audio data (R) by subtracting the increased side audio data (S) from the increased mono audio data (M).
In operation 260, the controller 105 applies a gain only to the mono audio data (M) that is larger than the gain applied in operation 256 to generate phantom mono audio data (M).
In operation 262, the controller 105 performs amplitude panning to the left audio data (L) and the right audio data (R) to shift an audio output signal during playback and establishing the phantom center sound source 112 (or the phantom (or virtual) center loudspeaker 112).
In operation 264, the controller 105 receives the control input (or control angle) to adjust an angle of audio transmission for the left and right loudspeakers 104a and 104b. The control angle also enables the user to specify the exact degree on the polar coordinate at which the phantom center loudspeaker 112 transmits a center channel audio signal.
In operation 266, the controller 105 provides an adjustment (e.g., linear increase or decrease) of the gain applied to the audio output after the amplitude panning is performed.
In the event the control angle is set to roughly −80 degrees, such a control angle enables both the front left loudspeaker 104a and the rear loudspeaker 104c to provide a loudspeaker level (or linear amplitude) of 0.7. In the event the control angle is set to roughly −50 degrees, such a control angle enables the front left loudspeaker 104a to provide a loudspeaker level of 0.96 (or close to 1.0) and the rear left loudspeaker 104c to provide a loudspeaker level of close to 0.2. In the event the control angle is set to roughly +80 degrees, such a control angle enables both the front right loudspeaker 104b and the rear right loudspeaker 104d to provide a loudspeaker level of 0.7. In the event the control angle is set to roughly +50 degrees, such a control angle enables the front right loudspeaker 104b to provide a loudspeaker level of 0.96 (or close to 1.0) and the rear right loudspeaker 104d to provide a loudspeaker level of close to 0.2. It is recognized that an audio engineer may configure or set the control angles after tuning the audio system. However, in other embodiments, a user interface may be provided to enable users to adjust the degree of stereo widening to suit his/her audio playback preferences.
The plot 300 of
The second set of controls 404 includes a left loudspeaker control, a phantom center control, and a right loudspeaker control. The left loudspeaker control controls panning of a left M/S encoded signal (e.g., may be limited to −90 . . . 0 degrees). The right loudspeaker control controls panning of a right M/S encoded signal (e.g., may be limited to 0 . . . +90 degrees). The phantom center control controls a shift of a phantom center (e.g., may be limited to +/−20 degrees).
The third set of controls 406 includes a mixer and panning look up tables (LUTs). The mixer controls the output gain of physical loudspeakers and a phantom center. The panning LUTs opens LUTs including amplitude vectors. The LUTs include amplitude vectors and define amplitude values that are to be calculated at each corresponding angle. It is also recognized that real-time calculations may be performed in determining the amplitude vectors for corresponding control angles.
While exemplary embodiments are described above, it is not intended that these embodiments describe all possible forms of the invention. Rather, the words used in the specification are words of description rather than limitation, and it is understood that various changes may be made without departing from the spirit and scope of the invention. Additionally, the features of various implementing embodiments may be combined to form further embodiments of the invention.
This application claims the benefit of U.S. provisional application Ser. No. 63/167,952 filed Mar. 30, 2021, the disclosure of which is hereby incorporated in its entirety by reference herein.
Number | Name | Date | Kind |
---|---|---|---|
9983847 | Lakkundi et al. | May 2018 | B2 |
20150194158 | Oh | Jul 2015 | A1 |
20210112340 | Seldess | Apr 2021 | A1 |
Number | Date | Country | |
---|---|---|---|
20220322020 A1 | Oct 2022 | US |
Number | Date | Country | |
---|---|---|---|
63167952 | Mar 2021 | US |