This invention relates to processing of audio and, more particularly, to processing of audio for presentation to loudspeakers.
The spatial images of soundfields are mainly detected through interaural time differences (ITD) and interaural delay differences (ILD). For low frequency signals, i.e., below 500 Hz, synchronous binaural detection provides imaging cues. For signals above 2 kHz, where “synchronous” detection is not neurally possible, the ear switches from phase to envelope detection. Accordingly, at the higher frequencies the ILD and leading edges of the envelope provide the imaging cues. Between 500 Hz and 2 kHz, both systems contribute imaging cues, but sometimes those cues conflict.
When audio is recorded, both direct and indirect sound is captured by the microphones and converted to a composite electrical signal. To reproduce the sound, the electrical signals are converted to sound, but the prior art pays no attention to the sound field imaging cues that are contained in the signal. There is prior art, however, that applies some of the signal to a directional speaker, and applies the remaining signal to a diffusing speaker, but the ratio of the signals that are applied to the two different speakers is fixed, or manually adjustable. It is not sensitive to the signal characteristics.
An improvement in the art is realized by analyzing audio signals for perceptual soundfield imaging cues, and applying appropriate portions of the audio signal to a directional speaker and to a diffusing speaker, based on the analyzed audio signals. Specifically, the audio signal is applied to a critical band filter bank, and at the higher frequencies a detector detects leading edges in the signal's envelope. Concurrently, the audio signal is applied to a delay element followed by a reconstruction filter bank. The output of each filter in the reconstruction filter bank is applied to a one-input/two-outputs soft switch (single pole, double throw) that is controlled by one or more of the detectors. One output of the switch is applied to a directional speaker, and the other output of the switch is applied to a diffusing speaker.
x(t−t0)>α·x(t)
is satisfied, where α is greater than 1 and varies with frequency. If the equation is satisfied, the detector concludes that the signal's power in the band corresponding to the band of the filter in bank 100 to which the detector is connected has increased significantly within the (t−t0) time interval, and should be applied to a directional speaker, rather than to a diffusing speaker. Effectively, therefore, the detector monitors of change in the envelope of the input signal, and when it detects an “attack,” is calls for outputting of the sound through directional speaker 10. As soon as the “attack” subsides, the detector initiates a switch back to diffusing speaker 20.
The output of each detector designates whether a certain frequency in the audio signal that is applied to element 100 should be sent to directional speaker 10 or to diffusing speaker 20. However, the switching of a signal from one speaker to another speaker should not be done instantaneously. Because it is desired to take an interval of time to switch a signal from one speaker to another speaker, and since it is desired to complete the switch at the beginning of the audio signal interval when imaging cues are present (i.e., during the “attack”), it is necessary to introduce a delay that corresponds to the switching interval plus the processing time between the input at block 100 and the output of detectors in block 110. This delay is provided by element 12, which precedes invertible filter bank 130 that contains approximately 32 filters of equal bandwidth, covering the audio range. Filter bank 130 is fairly conventional. An illustrative embodiment of such a filter is presented in “Tight Weyl-Heisenberg Frames in l2(Z)” by Z.Cvetkovic, IEEE, 1998, pp. 1356–159. The output of each filter in filter bank 130 is applied to a switch in switch bank 140. Each switch in bank 140 is a “Hann window” switch that, when told to switch a signal s from output B (diffusive speaker 20) to output A (directional speaker 10) does so in accordance with the function
where t is time and L is the switching interval of time, and
When switching back, the relevant equations of each switch are
where L′>L.
Each switch in bank 140 is controlled by one or more of the detectors in bank 110. More specifically, each detector covers a certain frequency band (that corresponds to the band of the filter in bank 100 to which it is connected) and, correspondingly, it controls the switches in bank 140 that cover the same frequency band. Thus, at the low frequencies two or three detectors may affect a given switch in switch bank 140 (illustratively, detectors 111 and 112 affect switch 142 through OR gate 151, whereas at the high frequencies one detector may affect a number of switches in switch bank 140 (illustratively, detector 114 affecting switches 143 and 144).
The A outputs of the switches in bank 140 are combined in adder 152, and applied to amplifier 153 whose output is coupled to directional speaker 10. The B outputs of the switches in bank 140 are combined in adder 154, and applied to amplifier 155 whose output is coupled to diffusing speaker 20.
The switching of a signal in a switch of bank 140 from output port B to output port A should be done quickly, for example, in less than 10 msec, the switching back (from port A to port B) advantageously be done more slowly, for example, four times as more slowly. The delay of element 120 corresponds to the faster switching time.
In connection with the switching delay, an issue can arise where, while switching a signal slowly from the directional speaker means (port A) to the diffusing speaker means (port B), a detector connected to the switch will require a switch of the signal to port A. In accordance with the
In connection with the control of a switch by more than one of the detectors in bank 110, again the switch to port A (i.e., the fast switch) takes precedence and, therefore, all of the detector control signals that are applied to a switch are combined in an OR function, as illustrated by OR gate 151 (where a logic 1 corresponds to a directive to switch the input signal to port A).
This application claims priority from a provisional application No. 60/156,483 filed Sep. 28, 1999.
Number | Name | Date | Kind |
---|---|---|---|
4118601 | Yeap | Oct 1978 | A |
4454609 | Kates | Jun 1984 | A |
5046101 | Lovejoy | Sep 1991 | A |
5631964 | Harada et al. | May 1997 | A |
6285767 | Klayman | Sep 2001 | B1 |
Number | Date | Country | |
---|---|---|---|
60156483 | Sep 1999 | US |