The present disclosure relates to digital sound processing, and more particularly to generating a soundstage in front of a listener with a simple speaker architecture.
A soundstage is an imaginary three-dimensional space that allows a listener to hear the location of sounds. A wide soundstage centered on the listener is desired for a compelling listening experience. Generally, this is effectively accomplished by using many speakers. However, a large number of speakers requires complex signal processing methods to achieve the desired listening experience. In certain listening environments, for example in an automotive vehicle, a large number of speakers is not practical in terms of space, weight, and cost considerations. Further, complex signal processing methods require high powered and expensive processors. In vehicle listening environments there are fewer speakers than the number typically found in a room or theater, and the speakers are positioned in fixed locations in the vehicle. Weight, packaging constraints, and processing power are all factors that, ideally, are reduced and kept to a minimum as much as possible in automotive applications.
There is a need for generating a virtual sound source in front of, to the left of and to the right of the listener in an automotive vehicle that creates a soundstage spanning from left to right in the vehicle using a speaker array having only two or three speakers at the center of the vehicle and minimal signal processing.
A system for generating a virtual soundstage in a listening environment having a compact speaker array centrally positioned in a listening environment in front of a listener, a center of the compact speaker array coincides with a center of the listening environment, the compact speaker array has at least first and second speakers. A signal processing unit is configured to receive an incoming audio signal, to process left and right channel signals of the incoming audio signal to generate a null, and to steer the null toward one ear of a listener thereby generating virtual sound sources for left, right and center. The virtual sound source is offset from the center of the listening environment, for example, in front of, on the left of, and on the right of the listener or the listening environment.
In one or more embodiments, the signal processing unit is configured to feed an inverted signal to one of the speakers in the compact speaker array to generate the null. In one or more embodiments, the null is steered by adding time delay to one speaker. As a result of the null hitting one ear of the listener, interaural level difference is manipulated, affecting localization, and a virtual sound source is perceived offset.
An audio system for a listening environment that includes a compact speaker array having at least first and second speakers. The first and second speakers are arranged symmetrically adjacent one another and centered in the listening environment in front of a listener. A signal processing unit is configured to split an incoming audio signal into right and left side signals to be played, respectively, at the first and second speakers. The signal processing unit is configured to create a null in an output of the compact speaker array and to steer the null off axis from a center of the listening environment thereby creating at least one virtual sound source that is offset from the center of the listening environment.
A method for generating a virtual center sound source in front of a listener in an interior of an automotive vehicle, the interior of the automotive vehicle includes a compact speaker array having at least first and second speakers adjacent each other and centered at a front of the interior, and a signal processing unit configured to execute instructions of a software program having a non-transitory computer-readable storage medium capable of storing instructions, the method is carried out in the signal processing unit and comprises the steps of selecting a position of the listener relative to the first and second speakers, creating a null, and steering the null to a predetermined position offset from a center of the vehicle interior relative to the selected position of the listener thereby generating a virtual sound source.
Elements and steps in the figures are illustrated for simplicity and clarity and have not necessarily been rendered according to any sequence. For example, steps that may be performed concurrently or in different order are illustrated in the figures to help to improve understanding of embodiments of the present disclosure.
While various aspects of the present disclosure are described with reference to
The invention may be carried out in an electronic device that may include one or more aspects of an exemplary audio system. The electronic device may be implemented using electronic devices that provide audio, video, voice, and or data communication. The term “device” may include a collection of devices or sub-devices that individually or jointly execute a set, or multiple sets, of instructions to perform one or more electronic functions of the speaker system. The electronic device may include memory that may include a main memory, as static memory, or a dynamic memory. The memory may include a non-transitory memory device that includes a non-transitory tangible medium upon which software is stored and is operable to store instructions executable by a processor, such as a Digital Signal Processor (DSP). A listening environment is an environment where a listener hears audio being played by an audio system. In the example described hereinafter, the listening environment is an interior of a vehicle.
A signal processor (DSP) 110, or other components, manipulate, or process sound signals sent to speakers 104, 106. The signals may be processed jointly or separately. The processor 110 may include instructions for adjusting a phase, amplitude, and/or delay of each sound signal delivered to the speakers 104, 106. The processor 110 processes an incoming audio signal (not shown) and separates the audio signal into a Mid or center signal, M, and a side signal, S. The side signal, S, may be further converted into left, L, and right, R, side signals to be played back at the speakers 104, 106.
Array processing, performed by the DSP 110, processes the incoming audio signal to create a null that may be steered to a desired location. For example, the null may be created by feeding an inverted signal (M=L+R) into one of the speakers 104, 106. The DSP 110 adds time delay to the signal to be played at one of the speakers, which steers the null to a desired location. Fine tuning audio parameters of the audio will fine tune the location from which a listener perceives a sound source. In the present example, the listener 402 is positioned in a left-side driver seat in the vehicle. Time delay is added to the signal being played at the first speaker 104 to steer the null toward an ear of a listener 402 that is closest to the middle of the vehicle. This creates a virtual center sound source 112 in front of the listener 402. The location of the listener 402 is for example purposes only. The listener 402 may be seated in a different position in the vehicle and the virtual center sound source 112 may be adjusted as needed to match the listener's position and steer the null to generate one or more virtual sound source locations.
Further, more than one virtual sound source may be generated in parallel, thereby creating a virtual sound stage in front of the listener. In one or more embodiments the null may be steered for several channel signals in parallel, thereby generating several virtual sound sources. For example, three virtual channel signals may be processed in parallel to generate a soundstage having three virtual sound sources that are perceived by the listener 402 at a virtual center 112 directly in front of the listener 402, a virtual left 118 at a far left of the listener 402, and a virtual right 116 at a far right of the listener 402. For example, in a vehicle environment where the listener 402 is in a left side driver seat, the virtual center is perceived to be directly in front of the listener, the virtual left is perceived to be at the driver's side A-pillar of the vehicle interior to the left of the listener 402, and the virtual right is perceived to be at the passenger side A-pillar of the vehicle to the right of listener 402. For purposes of example and simplicity, the method will mainly be described herein as it relates to the virtual center 112 and one skilled in the art can apply the method in parallel the virtual side signals, L, R, so that they are perceived as the virtual left 118 and the virtual right 116 sound sources of the soundstage.
Adjusting audio parameters that affect time delay, amplitude, and phase equalization, as well as cutoff frequencies will fine tune the location at which the virtual sound source is perceived. The virtual center 112 is accomplished as outlined above, and audio parameters are adjusted to improve the effect of the listener 402 perceiving the sound source directly in front of the listener 402. A virtual right sound source 116 is accomplished, in parallel, by steering the null to a left ear of the driver positioned in the left-hand driver seat and fine tuning the audio parameters to improve the effect of the listener perceiving the sound source from the right side of the listening environment. The virtual left 118 is accomplished, in parallel, by applying the null to the right ear of the listener and adjusting the audio parameters to improve the effect that the listener perceives the sound source to be coming from the left side of the listening environment.
As discussed above, the signal being fed into one of the first and second speakers 104, 106 is processed, as by array processing performed in the DSP, to create a null. The null may be created, for example, by feeding an inverted signal (M=L+R) into one of the speakers, the first and second speakers 104, 106.
The null 304 may be steered to the optimal position by adding time delay to the signal being fed into one of the speakers (in this example, the left speaker). The null 304 may be steered such that a virtual center is generated to the left of the center in the listening environment. A sound source is then perceived to be at the virtual center by steering the null so that it is offset, in this example offset left of center, in a front end of the listening environment.
Referring first to
Now referring to
The predetermined time delay, Δt, that is added to the signal being played at the second speaker may be determined in a manner that is known to those skilled in the art, and as an example, it may be determined with reference to
The distance, x sin (θ), is an extra distance for the sound from the speaker that is farther from the listener. This distance is compensated so that the sound from both speakers 104, 106 arrives at the right ear of the listener at the same time. In Equation (1), x is a distance between the first and second speakers, θ is a firing angle to the right ear of the listener, the speed of sound is 343.3 m/s.
Referring again to
For virtual left channel signal, L′, the right channel signal, R, is subtracted from the left channel signal, L.
L′=L−R (2)
For R′ the left channel signal, L, is subtracted from the right channel signal, R.
R′=R−L (3)
Time delay units 604a, 604b, delay virtual L′ and R′ channel signals by adding a predetermined time delay value, Δt. The predetermined time delay is dependent upon the distance between the speakers. The signal to be played at the right speaker 104 is the sum of R′ and L′ with a predetermined time delay. The signal to be played at the left speaker 106 is the sum of L′ and R′ with a predetermined time delay.
The left and right signals being fed into the first and second speakers 104, 106 is processed, as by array processing performed in the DSP shown in
Left Signal to Speakers:
Left Speaker (106)=+[L(t)−L(t−Δt)] (4)
Right speaker (104)=−[L(t)−L(t−Δt)] (5)
Right Signal to Speakers:
Left Speaker (106)=−[R(t)−R(t−Δt)] (6)
Right speaker (104)=+[R(t)−R(t−Δt)] (7)
A null is created 804. The null may be created using speaker array processing. In one example, creating a null 804 includes operating one of the speakers in the first and second speakers normally while inverting a signal at the other speaker. Only one of the speakers is inverted and there is no difference when applying the method to whether the left or the right speaker is inverted.
The null is steered 806 toward one ear of the listener. One way in which the null may be steered is to introduce a time delay 808 to the signal that is to be played at one of the speakers so that the null is steered a desired ear of the listener.
Audio parameters are tuned 810 in a manner that adds to the listener's perception of a location for the sound source. For example, a virtual center for a listener in a left-side driver seat is created by steering the null to the listener's right ear. However, a virtual left sound source is also created by steering the null to the listener's right ear. The audio parameters for the virtual center are adjusted in a manner that is different than the audio parameter adjustments for the virtual left so that a difference is perceived between the perception of the virtual center being directly in front of the listener and the virtual left being left of the listener. For example, audio parameters that affect the volume of the signal may be adjusted to differentiate the virtual center sound source from the virtual left sound source thereby affecting the listener's perception of the signal associated with the virtual left sound source in a manner that is different than the volume of the signal associated with the virtual center sound source.
In one or more embodiments, the method of
In the foregoing specification, the present disclosure has been described with reference to specific exemplary embodiments. The specification and figures are illustrative, rather than restrictive, and modifications are intended to be included within the scope of the present disclosure. Accordingly, the scope of the present disclosure should be determined by the claims and their legal equivalents rather than by merely the examples described.
For example, the steps recited in any method or process claims may be executed in any order, may be executed repeatedly, and are not limited to the specific order presented in the claims. Additionally, the components and/or elements recited in any apparatus claims may be assembled or otherwise operationally configured in a variety of permutations and are accordingly not limited to the specific configuration recited in the claims. Any method or process described may be carried out by executing instructions with one or more devices, such as a processor or controller, memory (including non-transitory), sensors, network interfaces, antennas, switches, actuators to name just a few examples.
Benefits, other advantages, and solutions to problems have been described above regarding embodiments; however, any benefit, advantage, solution to problem or any element that may cause any particular benefit, advantage, or solution to occur or to become more pronounced are not to be construed as critical, required, or essential features or components of any or all the claims.
The terms “comprise”, “comprises”, “comprising”, “having”, “including”, “includes” or any variation thereof, are intended to reference a non-exclusive inclusion, such that a process, method, article, composition, or apparatus that comprises a list of elements does not include only those elements recited but may also include other elements not expressly listed or inherent to such process, method, article, composition, or apparatus. Other combinations and/or modifications of the above-described structures, arrangements, applications, proportions, elements, materials, or components used in the practice of the present disclosure, in addition to those not specifically recited, may be varied, or otherwise particularly adapted to specific environments, manufacturing specifications, design parameters or other operating requirements without departing from the general principles of the same.
Priority is claimed to application Ser. No. 63/166,144 filed Mar. 25, 2021, in the United States, the disclosure of which is incorporated herein by reference in its entirety.
Number | Name | Date | Kind |
---|---|---|---|
5870484 | Greenberger | Feb 1999 | A |
20050259831 | Hutt | Nov 2005 | A1 |
20110216925 | Riggs | Sep 2011 | A1 |
20120020480 | Visser | Jan 2012 | A1 |
20130142337 | Troughton | Jun 2013 | A1 |
20160286329 | Grosche et al. | Sep 2016 | A1 |
20190391783 | Moore | Dec 2019 | A1 |
20200374631 | Hayashi | Nov 2020 | A1 |
Number | Date | Country |
---|---|---|
1596627 | Nov 2005 | EP |
Entry |
---|
Hamdan Eric C et al: “A modal analysis of mutichannel crosstalk cancellation systems and their relationship to amplitude panning”, of Journal of Sound and Vibration, Elsevier, Amsterdam, NL, vol. 490, Sep. 24, 2020 (Sep. 24, 2020), XP086330789, .ISSN: 0022-460X, DOI: 10.1016/J.JSV.2020.115743 [retrieved on Sep. 24, 2020] * Sections 2, 5.2; figure 3a *. |
Number | Date | Country | |
---|---|---|---|
20220312141 A1 | Sep 2022 | US |
Number | Date | Country | |
---|---|---|---|
63166144 | Mar 2021 | US |