Embodiments of the invention relate to the field of wired one-way processing systems for audio signals where there are two or more independent audio signals which are to be separately reproduced so as to create a sense of depth; and more specifically, to audio processing systems that create two or more processed audio signals from each of the independent audio signals by a spectral adjustment to at least one of the processed audio signals.
A portable electronic device, such as a tablet computer, may include multiple speakers to provide a stereo audio presentation to a user of the device. In a stereo audio presentation, the audio signal that represents the left channel will be directed to speakers on the left side of the device as oriented with respect to the user. Likewise, the right channel signal will be directed to speakers on the right side of the device. The device may include four or more speakers symmetrically arranged on the device with respect to both the vertical and horizontal centerlines of a display surface to be viewed by the user. This will provide a generally similar stereo audio presentation to the listener in any of the four orientations of a rectangular device, if the sound is routed appropriately for the orientation.
It is desirable to route the audio signal that represents the left channel to all the speakers on the left side of the device to increase the maximum loudness and dynamic range, and to better center the apparent center of the sound field along the vertical axis of the device with respect to the listener. However, when the same audio signal is sent to two speakers, there will be a destructive interference of the resulting sound waves from the two speakers at certain places within the sound field produced. The locations of these areas of destructive interference are dependent on the frequency of the sound wave and the distance between the speakers.
It would be desirable to provide a way to minimize the destructive interference in the sound field of a portable electronic device in which the audio signal for each channel is routed to more than one speaker.
The invention may be understood by referring to the following description and accompanying drawings that are used as examples to illustrate embodiments of the invention. The invention is not limited to the examples of the description and drawings. In the drawings, in which like reference numerals indicate similar elements:
In the following description, numerous specific details are set forth. However, it is understood that embodiments of the invention may be practiced without these specific details. In other instances, well-known circuits, structures and techniques have not been shown in detail in order not to obscure the understanding of this description.
In the following description, reference is made to the accompanying drawings, which illustrate several embodiments of the present invention. It is understood that other embodiments may be utilized, and mechanical compositional, structural, electrical, and operational changes may be made without departing from the spirit and scope of the present disclosure. The following detailed description is not to be taken in a limiting sense, and the scope of the embodiments of the present invention is defined only by the claims of the issued patent.
The terminology used herein is for the purpose of describing particular embodiments only and is not intended to be limiting of the invention. Spatially relative terms, such as “up”, “down”, “left”, “right”, “beneath”, “below”, “lower”, “above”, “upper”, and the like may be used herein for ease of description to describe one element's or feature's relationship to another element(s) or feature(s) as illustrated in the figures. It will be understood that the spatially relative terms are intended to encompass different orientations of the device in use or operation in addition to the orientation depicted in the figures. For example, if the device in the figures is turned over, elements described as “below” or “beneath” other elements or features would then be oriented “above” the other elements or features. Thus, the exemplary term “below” can encompass both an orientation of above and below. The device may be otherwise oriented (e.g., rotated 90 degrees or at other orientations) and the spatially relative descriptors used herein interpreted accordingly.
As used herein, the singular forms “a”, “an”, and “the” are intended to include the plural forms as well, unless the context indicates otherwise. It will be further understood that the terms “comprises”, “comprising”, “includes”, and “including” specify the presence of stated features, steps, operations, elements, components, items, species, and/or groups but do not preclude the presence, existence, or addition of one or more other features, steps, operations, elements, components, items, species, and/or groups thereof.
The terms “or” and “and/or” as used herein are to be interpreted as inclusive or meaning any one or any combination. Therefore, “A, B or C” or “A, B and/or C” mean “any of the following: A; B; C; A and B; A and C; B and C; A, B and C.” An exception to this definition will occur only when a combination of elements, functions, steps or acts are in some way inherently mutually exclusive.
For the purposes of this application “audio signal” will be used to describe an electrical representation of a sound. “Sound” will be used to describe a sound pressure wave in air that is emitted by a speaker to produce an audible sound for a listener. An audio signal may be sent to a speaker to produce a sound. The terms speaker and speaker are used interchangeably to describe an electrical transducer that converts an electrical input into an audible sound pressure wave that travels through the air to a listener. A speaker does not, for the purposes of this application, include an earphone where the transducer is acoustically closely coupled to the ear of the listener such that the sound pressure wave is at least somewhat confined to the ear of the listener.
More generally, a portable electronic device that embodies the invention will have four or more speaker components (or speakers), each having similar sound reproduction capabilities, spaced apart from each other but arranged symmetrically on the device such that a similar array of speakers faces the user in all four orientations of the device in which there are two vertical sides and two horizontal sides. In any given orientation the two vertical sides can be considered as a left side and a right side. There will be at least two speakers on the left side and at least two speakers on the right side. To present a stereophonic audio program the audio signals representing the left side of the program will be sent to speakers on the left side of the device based on the device orientation. Audio signals representing the right side of the program will be sent to speakers on the right side of the device based on the device orientation.
When the distance to each of the two speakers 102, 104 is equal or differs by an integer number of wavelengths, the sound pressures from each of the speakers 102, 104 will reinforce one another to produce a maximum sound pressure level. The distance to each of the two speakers 102, 104 is equal along the perpendicular bisecting plane 300 of a line between the two speakers. Being on the bisecting plane 300 may be described as being on-axis. There may be additional surfaces 310 where the distance to each of the two speakers 102, 104 differs by an integer number of wavelengths and the speakers produce a maximum sound pressure level. The location or position of these additional surfaces 310 depends on the wavelength of the sound with respect to the distance between the two speakers. The sound waves from the two speakers can be described as being in-phase for a particular frequency when the maximum sound pressure from each of the speakers coincides to produce a maximum sound pressure level.
When the distance to each of the two speakers 102, 104 differs by an integer number of wavelengths plus one-half wavelength, the sound pressures from each of the speakers 102, 104 will destructively interfere with one another to produce a minimum sound pressure level. The surfaces 320 where the maximum sound pressure 302 from one of the two speakers 102 coincides with the minimum sound pressure 314 from the other of the speakers 104 are suggested by lines with a long dash separated by two short dashes. This destructive interference 320 of the sound waves 302, 312, 304, 314 from the two speakers 102, 104 produces an undesirable effect known as “lobing” where changing frequencies in the audio signal are attenuated as the listener moves to different positions away from the ideal on-axis position, which may be described as moving off-axis. The listener may experience undesirable psychoacoustic effects because of the notches in the frequency spectrum that lobing causes.
As in
As the wavelength of the sound is increased to more than twice the distance between the speakers 102, 104, which is equivalent to lowering the frequency, there will be no place in the sound field where there is completely destructive interference. The reductions in sound pressure level as the listener moves off-axis will become more gradual as the wavelength of the sound is further increased and the frequency becomes lower. It is generally considered that the effects of lobing become negligible for frequencies having a wavelength of four times the distance between the speakers or greater.
It is desirable to use more than one speaker on one side of a stereo field to increase the maximum sound pressure levels available and hence the dynamic range of the audio system. It is also desirable to use more than one speaker on one side of a stereo field to better center the apparent origin of the audio signal between the speakers so that the audio presentation appears to originate more toward the center of the display screen 110 on the device 100. However, as described above, providing the same audio signal to two speakers gives rise to undesirable lobing because of destructive interference between the sound waves produced by the speakers.
Referring again to
The audio router 230 directs the left audio signal 202 with the high frequency portion of the left audio signal to only one speaker on the left side of the device 100. Likewise, the audio router 230 directs the right audio signal 204 with the high frequency portion of the right audio signal to only one speaker on the right side of the device 100. In this way, the high frequency portion of the audio program is limited to a single speaker on each side of the device to minimize the undesirable lobing effect. The low frequency portion of the audio program, which has a lesser contribution to lobing, is delivered to all speakers to maximize the sound pressure levels of the delivered audio program.
The cutoff frequency and roll-off rate of the low-pass filter for attenuating a portion of the audio signal may be “tuned” experimentally to produce the desired psychoacoustic effect for an audio presentation on the device. In some embodiments, a second order low-pass filter may be used to eliminate the high frequency portion of the audio signal. In other embodiments, a shelf filter may be used to attenuate the high frequency portion of the audio signal without entirely eliminating the high frequency portion.
It will be appreciated that the distance between the speakers on the left and right sides of the device 100 may change based on the orientation of the device. For example, as shown in
In one embodiment, the orientation sensor 220 may sense tilting of the device 100 to an approximately horizontal position, such as when the device is laying on a table, and the low-pass filter may be adjusted such that a sound field suitable for listening over a wide area is presented.
An audio source 200 provides a left audio signal 202 and a right audio signal 204. The left and right audio signals 202, 204 are coupled to an audio processor 210 that attenuates a high frequency portion of the left and right audio signals 202, 204 to produce processed left and right audio signals 212, 214. The cut-off frequency for the high frequency portion of the audio signals may be adjusted responsive to the device orientation. The cut-off frequency for the high frequency portion of the audio signals may be further adjusted by a spectrum analyzer portion (not shown) of the audio processor 210, responsive to the frequency spectrum of the content represented by the audio signals 202, 204. The left and right audio signals 202, 204 may also be coupled to an equalizer 740 that boosts or emphasizes the high frequency portion of the left and right audio signals 202, 204 to produce enhanced left and right audio signals 712, 714.
The processed left and right audio signals 212, 214 and the enhanced left and right audio signals 712, 714 are coupled to a delay processor that may time delay the audio signals that will be routed to speakers that are closer to the listener due to device tilting. The audio signals are provided to the audio router 230. The audio router directs the enhanced left audio signal 712 to only one speaker that is on the left side of the device in its current orientation. The audio router directs the enhanced right audio signal 714 to only one speaker that is on the right side of the device in its current orientation. The processed left and right audio signals 212, 214 are directed to one or more of the remaining speakers on the appropriate side of the device 700.
An audio source 200 provides a left audio signal 202 and a right audio signal 204. The left and right audio signals 202, 204 are each coupled to a high-pass filter 822, 832 and a low-pass filter 824, 834 to separate the audio signals into high frequency and low frequency portions. The high and low-pass filters may be matched such that the high and low frequency portions can be recombined to provide a signal that is substantially the same as the audio signal provided to the high and low-pass filters. In some embodiments (not shown) a signal from the orientation sensor 220 may be used to adjust the high and low-pass filters responsive to the device orientation similarly to the embodiment shown in
The left high frequency portion 826 of the left audio signal 202 and the right high frequency portion 836 of the right audio signal 204 are provided to the audio router 850. The audio router directs the left high frequency portion 826 to only one speaker that is on the left side of the device in its current orientation. The audio router directs the right high frequency portion 836 to only one speaker that is on the right side of the device in its current orientation. This may reduce the undesirable lobing effects as described above.
The speakers 102, 104, 106, 108 may all have similar sound reproduction capabilities. Each speaker may be relatively small and lack the capacity to move a large volume of air as needed to reproduce lower frequencies effectively. In this embodiment, the left low frequency portion 828 of the left audio signal 202 and the right low frequency portion 838 of the right audio signal 204 are combined by a bass mixer 842 to provide a single bass signal 844 that includes the left and right low frequency portions 828, 838 of the left and right audio signals 202, 204. The single bass signal 844 is routed to all speakers 102, 104, 106, 108 of the device 800.
Speaker mixers 862, 864, 866, 868 each receive the single bass signal 844 and may receive one of the high frequency portions 826, 836 as determined by the device 800 orientation. Each speaker mixer 862, 864, 866, 868 is coupled to one of the speakers 102, 104, 106, 108 to provide a combined audio signal that drives the speaker. By providing the same bass signal 844 to all of the speakers, a larger volume of air can be moved by the cooperative action of all the speakers to reproduce lower frequencies more effectively. As discussed above, lower frequencies do not produce a lobing effect even though all the speakers of the device are reproducing the same low frequency content.
The left high frequency portion 826 of the left audio signal 202 and the right high frequency portion 836 of the right audio signal 204 are provided to a decorrelation engine 950. The decorrelation engine shifts the phases of the components of the audio signals it receives. The decorrelation engine produces a decorrelated version 958 of the left high frequency portion 826 of the left audio signal 202 and a decorrelated version 956 of the right high frequency portion 836 of the right audio signal 204. The decorrelated version of the high frequency portion of the audio signal produces a sound that is aurally similar to a sound produced by the high frequency portion of the audio signal when the signals are reproduced by a speaker. However, because of the phase shifts in the decorrelated version, the decorrelated version may be played in a speaker adjacent to a speaker playing the high frequency portion with less of an undesirable lobing effect.
The decorrelation engine 950 may include an audio router to direct the left high frequency portion 952 and the decorrelated version 958 of the left high frequency portion 826 to speakers that are on the left side of the device in its current orientation as indicated by an orientation signal 224 from an orientation sensor. The audio router may direct the right high frequency portion 954 and the decorrelated version 956 of the right high frequency portion 826 to speakers that are on the right side of the device in its current orientation. If the device orientation is fixed, the decorrelation engine may direct the audio signals as necessary without using an orientation sensor. It will be appreciated that the decorrelation engine may provide additional decorrelated versions of the high frequency portion of an audio channel to allow more than two speakers to reproduce the sound for that audio channel.
It will be appreciated that the left high frequency portion 826 of the left audio signal 202 and the right high frequency portion 836 of the right audio signal 204 may be correlated to a greater or lesser degree according to the source material of the audio source 200. At one extreme the left and right channels may be of entirely different audio material with no correlation between the two channels. At the other extreme, monophonic material may be encoded so that the left and right channels are identical and completely correlated. Between these extremes the left and right channels may include some material, such as a vocal track, that is identical in both channels while other material, such as an instrumental accompaniment, differs between the channels to a greater or lesser degree. Thus the correlation between the high frequency portion of the channels can vary based on the audio source material that can, in turn, vary over time.
To reduce undesirable lobing effects from correlation between the channels, the device may include a decorrelation metric generator 948 that determines the correlation between the high frequency portions 822, 832 of the audio source channels 202, 204 and provides a channel decorrelation metric 946 to the decorrelation engine 950 responsive to the amount of decorrelation needed. This may also be viewed as a comparison or compare of the high pass filtered versions of the audio source channels 202, 204. The decorrelation engine shifts the phases of the channel signals it receives to produce intermediate channel signals responsive to the channel decorrelation metric 946. It will be appreciated that the decorrelation engine may modify one or both channels to decorrelate the signals and produce the intermediate channel signals. The decorrelation engine may then further produce a decorrelated version 958 of the left intermediate high frequency portion 826 of the left audio signal 202 and a decorrelated version 956 of the right intermediate high frequency portion 836 of the right audio signal 204. This may reduce undesirable lobing effects between the channels in addition to reduce undesirable lobing effects between multiple speakers that produce sound for the same channel. While decorrelation has been described for two channels and two speakers per channel, it will be understood that the invention may be applied to devices with differing numbers of channels and differing numbers of speakers per channel.
Speaker mixers 862, 864, 866, 868 each receive the single bass signal 844 and one of the decorrelated high frequency portions 952, 954, 956, 958. Each speaker mixer 862, 864, 866, 868 is coupled to one of the speakers 102, 104, 106, 108 to provide a combined audio signal that drives the speaker. By providing the same bass signal 844 to all of the speakers, a larger volume of air can be moved by the cooperative action of all the speakers to reproduce lower frequencies more effectively. As discussed above, lower frequencies do not produce a lobing effect even though all the speakers of the device are reproducing the same low frequency content. By providing decorrelated high frequency portions to all of the speakers, a fuller sound may be produced by the device 900 for the high frequency portions of the audio program. The undesirable lobing effects between multiple speakers that are producing the high frequency portions of the audio program may be reduced or eliminated by decorrelating the high frequency portions before sending the signals to the speakers 102, 104, 106, 108.
The decorrelation engine 1050 may pass each high frequency portion of each audio channel 1026 through a chain of n all-pass filters 1072-1, 1072-2, 1072-n and to generate the decorrelated intermediate channel signal 1052. An all-pass filter is a signal processing filter that passes all frequencies equally in gain, but changes the phase relationship among various frequencies by varying its phase shift as a function of frequency.
The all-pass filter is a linear, time-invariant, causal, digital filter with an equal number of inputs and outputs, whose transfer function in the Z-domain can be expressed as:
The all-pass filters may be configured by optimizing the following parameters:
fLP cutoff frequency of low-pass filter
nLP order of low-pass filter
fHP cutoff frequency of high-pass filter
nHP order of high-pass filter
NAP general number of all-pass filters
AP fstart general starting point in frequency spectrum of all-pass filters
AP fstop general stopping point in frequency spectrum of all-pass filters
AP fn,m frequency of all all-pass filters for n speakers and m audio channels
AP Qn,m quality factor (Q) of all all-pass filters
The all-pass filter is calculated as:
where
f0 is the center frequency of the filter
Q is the quality factor (Q) of the filter
fs is the sampling frequency
B1=1−alpha
B2=−2 cos(w0)
B3=1+alpha
A is the inverse of B:
A1=B3
A2=B2
A3=B1
A and B can be used to compute the transfer function for the all-pass filter.
The decorrelation engine 1050 may include a coefficient calculator 1080 to perform calculations of the coefficients for the all-pass filters. As suggested by the figure, the calculations may be performed using matrix mathematics. The coefficients for each of the n all-pass filters that process a single audio channel may be represented as a vector A[1] through A[m] for the m audio channels. Each all-pass filter in a channel chain may be configured with an element of the vector that is selected as distributed by a cascade circuit 1082-1, 1082-2, 1082-m.
It will be appreciated that when the number of speakers in the device is greater than the number of audio channels, it may be desirable to create decorrelated versions of some or all of the decorrelated intermediate channel signals so that aurally similar high frequency portions of the audio channel are reproduced by more than one speaker as discussed for the embodiment shown in
The decorrelation engine 1050 may receive a channel decorrelation metric signal 1046 from the decorrelation metric generator 1048 that indicates the amount of decorrelation needed between the channels. The channel decorrelation metric signal 1046 may be used in the calculation of the coefficients for the all-pass filters.
The exemplary decorrelation metric generator 1048 shown in
The channel decorrelation metric signal may be generated with other functions, such as the inverse autocorrelation function (IACF) equation:
The channel decorrelation metric signal may be any metric that conveys how unique each channel's content is relative to every other channel's content at a given moment in time. For example, in the stereo case, the “stereo-ness” of the signal would be “not at all” for mono content and “very much” for completely unrelated content in each channel. The purpose of the channel decorrelation metric is to inform the decorrelation algorithm of the coefficient calculator 1080 how much decorrelation is required at a given time.
While certain exemplary embodiments have been described and shown in the accompanying drawings, it is to be understood that such embodiments are merely illustrative of and not restrictive on the broad invention, and that this invention is not limited to the specific constructions and arrangements shown and described, since various other modifications may occur to those of ordinary skill in the art. For example, while the embodiments have been described as applied to a tablet device, they may also be applied to other devices such as a cellular telephone or a computer monitor on a pivoting stand. As another example, the speaker components may each be comprised of several speaker driver elements such as a coaxial speaker driver or woofer/tweeter pair in close proximity. The description is thus to be regarded as illustrative instead of limiting.
This is a continuation of co-pending U.S. application Ser. No. 15/256,384 filed Sep. 2, 2016, which claims the benefit of U.S. Provisional Application No. 62/215,288 filed Sep. 8, 2015.
Number | Name | Date | Kind |
---|---|---|---|
8600084 | Garrett | Dec 2013 | B1 |
9241217 | Rodgers | Jan 2016 | B2 |
9621124 | Lu | Apr 2017 | B2 |
20070098191 | Wan | May 2007 | A1 |
20080130905 | Fincham | Jun 2008 | A1 |
20110002487 | Panther | Jan 2011 | A1 |
20110194712 | Potard | Aug 2011 | A1 |
20120051567 | Castor-Perry | Mar 2012 | A1 |
20130163794 | Groves | Jun 2013 | A1 |
20140219482 | Jia | Aug 2014 | A1 |
20140233771 | Giustina | Aug 2014 | A1 |
20140233772 | Giustina | Aug 2014 | A1 |
20150023533 | Johnson | Jan 2015 | A1 |
20150078555 | Zhang | Mar 2015 | A1 |
20150181337 | Rodgers | Jun 2015 | A1 |
20150189438 | Hampiholi | Jul 2015 | A1 |
20160381487 | Harvey | Dec 2016 | A1 |
Number | Date | Country |
---|---|---|
1875656 | Dec 2006 | CN |
101742378 | Jun 2010 | CN |
101946526 | Jan 2011 | CN |
103491482 | Jan 2014 | CN |
203734829 | Jul 2014 | CN |
103581791 | Sep 2014 | CN |
Entry |
---|
Chinese Office Action with English Translation, dated Nov. 23, 2017, Chinese Application No. 201610996847.9. |
Number | Date | Country | |
---|---|---|---|
20180249278 A1 | Aug 2018 | US |
Number | Date | Country | |
---|---|---|---|
62215288 | Sep 2015 | US |
Number | Date | Country | |
---|---|---|---|
Parent | 15256384 | Sep 2016 | US |
Child | 15942287 | US |