This invention relates to a signal processing system and to a method of operating the signal processing system. The signal processing system is particularly suitable for use in a speech reinforcement system, for example in a vehicle.
Reinforcement of the speech of passengers via a car-loudspeaker system improves the intelligibility of this speech for other passengers in a car. In
The input of the signal processing system is connected to an input of decorrelator 6 and to a first input of a subtracter circuit 13. The output of the decorrelator 6 is connected to an input of the echo canceller 16. Inside the echo canceller 16, this input is connected to a first input of a subtracter circuit 8. The output of the subtracter circuit 8 is connected to the output of the echo canceller 16 and to a signal input of an adaptive filter 12. An output of the adaptive filter 12 is connected to an input of a further decorrelator 10 and to a second input of the subtracter circuit 13. The output of the subtracter circuit 13 is connected to a residual signal input of the adaptive filter 12. The output of the further decorrelation means 10 is connected to a second input of the subtracter circuit 8.
The output of the echo canceller is connected to an input of a power amplifier 14 whose output is connected to an input of a loudspeaker 18. The (undesired) feedback path 11 is denoted in a dash-and-dot line. In the signal amplifier system shown in
With a standard speech-reinforcement system the microphone picks up the speech of the speaking person. A processed version of this speech is reproduced by loudspeakers, which are located close to the listening person(s). To perceive this speech in noisy environments (such as a car), a reinforcement gain (from the amplifier 14) is required prior to the reproduction of the speech via the loudspeakers. However, for large reinforcement gains, the open-loop gain of the complete electro-acoustic loop will be larger than one, for certain frequencies, which will result in the audio artefact of “howling”.
In order to prevent the howling effect in the case of large reinforcement gains, an acoustic feedback suppressor system is required. This feedback suppressor system comprises an adaptive filter (AF) that estimates the feedback and subtracts it (at the point of the subtracter 8 in
In
In a vehicle it will often be the case that besides the speech-reinforcement (played by the rear loudspeaker), an audio signal is reproduced (played by both the rear and the front loudspeaker). Before amplifying the front microphone signal with the speech communication system via the rear loudspeaker, it is required to cancel the audio signal in this microphone. This is shown in the prior art of U.S. Pat. No. 6,674,865. However, the prior art of U.S. Pat. No. 6,674,865 fails when the audio signal played by the front loudspeaker is equal to or correlated with the signal played by the rear loudspeaker. The reason for this problem is caused by the fact that the audio is played on both loudspeakers while the speech for the speech communication in played on only a single loudspeaker. This will result in a non-unique path identification.
A trivial and straightforward solution to this problem is to introduce a separate adaptive filter for the audio signal cancellation, having the audio signal as a reference input. This is shown in
It is therefore an object of the invention to improve the adaptation and the tracking-speed of the feedback canceller by exploiting the audio signal.
According to a first aspect of the present invention, there is provided a signal processing system comprising a microphone, a subtractor arranged to receive an output of the microphone, an amplifier arranged to receive an output of the subtractor, a rear loudspeaker arranged to receive an output of the amplifier, a front loudspeaker arranged to receive an output of the amplifier, one or more summers interposed between the amplifier and a loudspeaker, the or each summer arranged to add an audio signal to the signal received from the amplifier, a mixing matrix arranged to receive the respective inputs of the rear and front loudspeakers and arranged to output a summation signal and a difference signal, and an adaptive filter arranged to receive the outputs of the mixing matrix, the subtractor arranged to receive an output of the adaptive filter and an output of the subtractor arranged to control the adaptive filter.
According to a second aspect of the present invention, there is provided a method of operating a signal processing system comprising; receiving, at a microphone, a signal, receiving, at a subtractor, an output of the microphone, amplifying, at an amplifier, an output of the subtractor, outputting, at a rear loudspeaker, an output of the amplifier, receiving, at a front loudspeaker, an output of the amplifier, adding an audio signal, at a summer interposed between the amplifier and a loudspeaker, to the signal received from the amplifier, receiving, at a mixing matrix, the respective inputs of the rear and front loudspeakers and outputting, from the mixing matrix, a summation signal and a difference signal, filtering, at an adaptive filter, the outputs of the mixing matrix, receiving, at the subtractor, an output of the adaptive filter, and controlling, with an output of the subtractor, the adaptive filter.
The system provides reinforcement of the speech of passengers via a car-loudspeaker system thereby improving the intelligibility of this speech perceived by other passengers in a car. The speech-reinforcement system performs a feedback cancellation in order to alleviate the well-known howling phenomenon. To estimate the feedback that needs to be cancelled, an acoustic path identification is made. In this system, the presence of audio-signals (for example, stereo-music) is exploited to improve the identification of the acoustic path required for the feedback cancellation.
Preferably, the system further comprises a post processor interposed between the subtractor and the amplifier, the post processor arranged to apply noise reduction to the signal received from the subtractor. The system can use a (spectral) post processor (PP). The most important task of this post processor is to suppress the (additive) noise components that are present in a car. If this noise is not cancelled sufficiently, the noise would be reinforced via the system and would lead to an increase of the total noise level in the car.
Another task of the post processor is to suppress feedback components that are not sufficiently cancelled by the adaptive filter. Especially during movements in the car, the adaptive filter cannot track the Wiener solution quickly enough and the post processor acts as a backup. Yet another task of the post processor is to apply a dereverberation of the signal picked up by the microphone. When the gain G (from the amplifier) is put to a high value that is much higher than the original howling-bound, the reinforced speech sounds reverberated. In order to make the speech more natural, a dereverberator is applied.
Advantageously, the system further comprises a frequency shifter interposed between the subtractor and the amplifier, the frequency shifter arranged to apply a frequency shift to the signal received from the subtractor. The frequency-shifter shifts the entire signal by 5 Hz. By means of this frequency-shifter alone, howling at a single frequency is avoided in the situation where the gain-factor G (applied by the amplifier) is increased to a level greater than would be allowed when no signal processing is carried out. With a frequency-shifter, the gain G can be increased beyond the original howling-bound. The reason for the increased howling bound is that, because of the frequency-shift, every round-trip the averaged open loop gain (over frequency) must be below one, instead of the open loop gain at each frequency.
Another advantage of using the frequency-shifter is that the desired speech signal is decorrelated from the loudspeaker signal. As a result from this frequency shift, the adaptive filter can converge to a solution that is equal to the acoustic path between the rear loudspeaker and the front microphone. Assuming that the adaptive filter coefficients w[k] start from the all-zero vector, and that there are no changes in the acoustic path, the adaptive filter coefficients converge to the Wiener solution:
Ideally, the system further comprises a variable gain attenuator interposed between the subtractor and the amplifier, the variable gain attenuator arranged to attenuate the signal received from the subtractor. The variable attenuator is controlled by the background noise present (for example in a car, if the system is used in such a vehicle). The amount of attenuation is adjusted inverse proportionally with the amount of noise (or music) that is measured (or estimated) in the car. In case a lot of noise is present (i.e. driving on the highway), the speech-reinforcement system is highly required and the variable attenuation is set to A=1. In situations with less noise, the variable attenuator will be adjusted to a lower value.
Another purpose of the variable attenuator is to limit the amount of speech reinforcement in case the output signal of the loudspeaker gets close to saturation. In this way the system is kept linear and the adaptive filter is able to continue the adaptation in a correct way.
Preferably, the system further comprises a high pass filter interposed between the microphone and the subtractor, the high pass filter arranged to filter the signal received from the microphone. As generally for lower frequencies (50-200 Hz) the vehicle noise is much more dominant compared to the passenger speech, the microphone signal is high-pass filtered (HPF) to prevent the amplification of the vehicle noise.
Embodiments of the present invention will now be described, by way of example only, with reference to the accompanying drawings, in which:
In systems such as that shown in
The signal processing system of
The subtractor 22 is also arranged to receive an output of the adaptive filter AF2 and an output of the subtractor 22 is arranged to control the adaptive filter AF2. A second subtractor 30 is interposed between the subtractor 22 and the amplifier G, and a second adaptive filter AF1 is arranged to receive the input of the amplifier G. The second subtractor 30 is arranged to receive an output of the second adaptive filter AF1 and an output of the second subtractor 30 is arranged to control the second adaptive filter AF1.
The system also comprises a post processor PP interposed between the subtractor 22 and the amplifier G, the post processor PP arranged to apply noise reduction to the signal received from the subtractor 22. A frequency shifter FS is also interposed between the subtractor 22 and the amplifier G, the frequency shifter FS arranged to apply a frequency shift to the signal received from the subtractor 22.
A variable gain attenuator A is interposed between the subtractor 22 and the amplifier G, the variable gain attenuator A arranged to attenuate the signal received from the subtractor 22. The system also comprises a high pass filter HPF interposed between the microphone 20 and the subtractor 22, the high pass filter HPF arranged to filter the signal received from the microphone 20.
Furthermore, up- and down-samplers are required because of the combined sound reinforcement and audio reproduction. Generally, the audio content has a sampling rate of 44.1 or 48 kHz, while speech signals can be processed at a lower sampling rate, like 8, 11.025 or 16 kHz. Therefore, up- and down-samplers are needed, shown by the components K, with a factor K equal to, for example, 2, 3, 4 or 6.
In the embodiment of
wi[k] are the coefficients of the i'th adaptive filter, hRF is the (truncated) acoustic path from the rear loudspeaker 24 to the front microphone 20 and hRF+hFF is the (truncated) acoustic path from both loudspeakers 24 and 26 to the front microphone 20. Although not included in equation (2), the Wiener solution also includes the characteristics of the high-pass filter (HPF) and the up- and down-samplers.
The main difference between the audio-cancellation and the speech feedback cancellation is that the audio canceller can operate mainly in so-called “single-talk” mode, while the feedback canceller always operates in so-called “double-talk” mode. Single-talk means that the microphone merely picks up the signal that needs to be cancelled, while in double-talk situations, also the desired speech signal is present. The reason that feedback cancellers are always operating in double-talk mode is that the feedback of the desired speech and the desired speech itself are always (except for attacks and releases of the speech) present at the same time.
Since in the single-talk mode, acoustic paths can be identified more quickly and more accurately compared to the double-talk mode, it is beneficial to combine the two adaptive filters in
In the first option, the audio is not reproduced in the front of the car, which is obviously undesirable. In the second option (similar to the embodiment in U.S. Pat. No. 6,674,865), it would be necessary to have different signals reproduced at the front and the rear, while generally the front and the rear loudspeaker signals will be equal. This solution is not a practical situation. The third option is shown in
The second embodiment, shown in
The front loudspeaker 26 is now arranged to receive an output of the amplifier G. By applying the reinforced speech to the front loudspeaker 26, in addition to the rear loudspeaker 24 however, there is created an additional problem. As generally the coupling between the front loudspeaker 26 and the front microphone 20 is larger than coupling between the rear loudspeaker 24 and the front microphone 20, the howling-bound is decreased drastically. In practical experiments in some vehicles (such as an Audi-A4), the front loudspeakers are very close to the feet of the front passengers. With each small foot movement, the adaptive filter AF carrying out the feedback cancellation needs to converge to a new solution and the system approaches instability. Therefore, the solution as presented in
In the special case when F=0, the filter coefficients converge to a non-unique solution. When only speech s[k] is present, the solution is equal to hRF. When only audio m[k] is present, the solution is equal to (hRF+hFF)/2.
When both s[k] and m[k] are present, neither of the two solutions presented above are obtained. The actual solution that is obtained depends on the signals s[k] and m[k] In general, no stable solution is obtained and the adaptive filter always has to adapt.
To let the audio (at least to some extent) help the speech feedback-cancellation, it is desirable to combine the loudspeaker signals and feed these combined signals to an adaptive filter in such a way that stable solutions are obtained, independent of the speech/music ratio and allowing different loudspeaker volume settings for the music and the sound reinforced speech. Taking, for example, the situation where (mono-) music is played back over all loudspeakers and the reinforced speech is only reproduced at the rear loudspeakers (scenario of
to obtain the combined signals.
In case only a (mono-) music signals is present only the sum signals contains energy and the “sum”-path is estimated by:
If w[k] has been converged and a reinforced sound signal s[k] comes in, then the difference signal will also contain energy and the “difference” path will converge to:
If hRF and hFF are independent and have equal energy (a reasonable assumption in practice), then there is the following equality:
It means that the error at startup of the “difference” path is 3 dB lower than the error in the embodiment of
The signal processing system of
To show that the system of
Σ{s2[k]}=Σ{m2[k]}, (9)
where Σ{ } denotes the ensemble-average operator. The gain of the amplifier G is set to one. Furthermore, the following were used:
h
FF=(1,0), (10)
h
RF=(0,1) (11)
where (1,0), for example, is an impulse-response with two taps (1 and 0 respectively). The three scenarios used in the simulation are listed in the table below:
For the “proposed” (the preferred embodiment according to
From
In practice, in most vehicle environments, the audio signal will be a stereo signal with left and right components. Following the same principle outlined above with respect to
with R the bit-reversal matrix:
With RL, RR, FL, FR indicating rear-left, right-right, front-left, and front-right signals respectively, this results in:
The sum-signal (RL+RR+FL+FR) contains mono-music and speech. The rear minus front signal (RL+RR−FL−FR) only contains speech (as in the mono-example before) and the left minus right signal (RL−RR+FL−FR) only contains music. The fourth signal (RL−RR−FL+FR) does not contain any signal and thus can be left out. It should be noted that the combinations with the mixing-matrix can be performed in different ways. However there are only a few combinations possible that yield a result where the output equals 0. The converged solution will converge to:
where, for example, hRLF is the (truncated) acoustic path from the rear-left loudspeaker to the front microphone.
The various embodiments of the signal processing system can be applied within car entertainment systems, where speech reinforcement is required simultaneously with regular audio and/or GSM reproduction. More generally, the system can be used in sound reinforcement systems where also other known sources are reproduced that use other loudspeaker volume settings than the ones that are used for sound reinforcement.
The method of operating the signal processing system is shown in
The next step 83 is the applying of noise reduction, at the post processor PP, to the signal received from the subtractor 22. There is then the step 84, which comprises applying a frequency shift, at a frequency shifter FS. Step 85 comprises attenuating, at a variable gain attenuator (A), the signal (of course the actual level of attenuation may be zero). The signal is then amplified, at the amplifier G, step 86.
The output of the amplifier G is sent to both loudspeakers 24 and 26. The signal that is to be output at the rear loudspeaker 24 has an attenuation factor applied, at the attenuator F, (step 87). The attenuated signal then has added (step 88) the audio signal m[k], at a summer 28 interposed between the amplifier G and the rear loudspeaker 24, to the signal s[k] received from the amplifier G. This signal is finally outputted (step 89), at the rear loudspeaker 24. Similarly the signal destined for the front loudspeaker 26 has the audio signal m[k] added (step 90) and this is then output at the loudspeaker (step 91).
These two signals that are outputted by the loudspeakers (R and F) are received at the mixing matrix D (step 92). The matrix D receives the respective inputs R, F of the rear and front loudspeakers 24, 26 and outputs, from the mixing matrix D, a summation signal R+F and a difference signal R−F. These two signals are received by the stereo adaptive filter SAF, where they are filtered, shown as step 93. The output of the adaptive filter SAF is then received, at the subtractor 22 (step 94). Control of the adaptive filter SAF, with an output of the subtractor 22 is performed. This is shown by the dotted line 95. The subtractor 22 is carrying out the feedback suppression.
Number | Date | Country | Kind |
---|---|---|---|
06123834.1 | Nov 2006 | EP | regional |
Filing Document | Filing Date | Country | Kind | 371c Date |
---|---|---|---|---|
PCT/IB2007/054541 | 11/8/2007 | WO | 00 | 5/5/2009 |