The present invention relates to a system for suppressing audio distortion, comprising:
The present invention also relates to a filter arrangement for application in the system and to a method of suppressing audio distortion.
Such a system is known from WO 97/45995. The known audio system comprises an adaptive echo cancelling filter for removing echoes emanating between a systems' loudspeaker output and a microphone. The known system has a filter arrangement coupled to the echo cancelling filter and the microphone for spectrally suppressing echo components in the microphone signal that were not removed by the echo cancelling filter. One microphone senses a desired audio signal, while the other microphones only receive interfering distortions of the desired signal. The system may have a filter arrangement coupled to the echo cancelling means and/or the microphone array for spectrally suppressing distortion in the form of additional audio noise interference.
It is a disadvantage of the known system that it can not effectively be used for also reducing reverberant distortions in a desired audio signal sensed by a microphone array.
Therefore it is an object of the present invention to provide an improved system and filter arrangement therein for also suppressing echo distortion in the form of echo tail part reverberation in an audio signal sensed by a microphone array.
Thereto in the system according to the invention the filter arrangement includes filter coefficients representing reverberation distortion in the desired audio sensed by the microphone array.
The known echo cancelling means remove only a first part of the acoustic echo from the microphone array signals. The echo emanates between the audio output and each of the microphones, and the first part thereof generally comprises direct audio or sound and first room reflections. However-second part, that is-reverberant echo components are not removed by the echo cancelling means but are in the system according to the invention at least reduced by the filter arrangement, which includes filter coefficients which comprise a measure for reverberation distortion in the desired audio sensed by the microphone array. In addition it appeared that large microphone spacings which are not always feasible in practise are not required for the second part reverberation reduction to be effective.
It is an advantage of the present invention that not only reverberant components of the acoustic echo emanating from the audio output, but also reverberant components of a desired audio signal sensed by the microphone array are effectively reduced by the system according to the present invention.
An embodiment of the system according to the invention allowing design flexibility is characterised in that the filter arrangement includes a beamformer having at least a filter and sum beamformer and/or a delay and sum beamformer.
Most often a combination of filter, sum and delay elements is also used to form the so called Generalised Sidelobe Canceller. Advantageously an additional delay element may be added to the filter arrangement for further improving the performance of the system according to the invention.
A further embodiment of the system according to the invention is characterised in that the filter arrangement is arranged to be adaptive to the reverberation distortion and/or the desired audio signal sensed by the microphone array.
In that case the filter coefficients can be updated, to include a dynamic aspect in the cancelling of varying reverberation, instead of representing a more or less fixed model of the room. Now reverberation can also be suppressed in relation to the respective varying positions and directions of the array microphones.
A still further embodiment of the system according to the invention is characterised in that the system is arranged for updating the filter coefficients in case the reverberation not cancelled by the echo cancelling means dominates the audio signal sensed by the microphone array.
Advantageously the filter coefficients of the filter arrangement are not updated when the desired audio source dominates the array sensed audio signals, thus avoiding the risk of unwanted distortion or even cancellation of the desired audio signal in the output of the filter arrangement.
Another embodiment of the system according to the invention is characterised in that the system is arranged for updating the filter coefficients during a training session.
In the alternative—not requiring such a training session—the system is characterised in that it is provided with automated filter coefficient update control means at least to be coupled to the filter arrangement.
An elaboration of the system according to the invention is characterised in that the filter arrangement has an output, and that the system comprises output echo canceller means coupled between the filter output and the audio output.
Any remaining reverberation not cancelled by the beamformer or the echo cancelling means is now cancelled by the output echo canceller means.
A further elaboration of the system according to the invention is characterised in that the automated filter coefficient update control means are further coupled to the output echo canceller means for controlling the update speed of the filter arrangement.
The output echo cancelling means apart from cancelling remaining echoes can advantageously also be used to provide a measure for any remaining reverberation level in order to compare that level with the level of other sensed sound sources in order to use the result of the comparison as a quantity for controlling the update speed of the filter arrangement.
Another further embodiment of the system according to the invention is characterised in that each microphone of the microphone array has its individual echo cancelling means.
By applying individualised echo cancelling means for each microphone of the array any separate direct echoes and reflections in the first part of any of sensed array signals are cancelled individually as much as possible, while combined remaining reverberation in the tail part is dealt with by the filter arrangement and/or the output echo cancelling means.
At present the system and filter arrangement according to the invention will be elucidated further together with their additional advantages, while reference is being made to the appended drawing, wherein similar components are being referred to by means of the same reference numerals. In the drawing:
The respective echo cancelling means gi are coupled to two microphones 3-1, 3-2 of the array 3 through schematically shown subtractors 5-1, and 5-2 each having outputs 6-1, 6-2. These subtractor outputs 6-1 and 6-2 carry respective echo cancelled signals.
The system has a filter arrangement 7, which may include a beamformer 7B, which is coupled through the subtractors 5 to the echo cancelling means gi and/or to the microphone array 3. The beamformer 7B, which is included in a generally called Generalised Sidelobe Canceller, is capable of defining and controlling an audio microphone sensitivity lob or curve. Given the in this case two beamformer input signals on the subtractor outputs 6-1, 6-2, these signals comprise the desired audio/sound/speech signal and a reverberation signal originating from the reverberating tail. The beamformer 7B is capable of discriminating the reverberation signal by deriving a primary signal z including the desired signal and a reference signal x which includes the reverberation. It does this here by filtering in filters f1, and f2, as shown, and then summing in summing device 9-1 the filters fi outputs to reveal the primary signal. This way the echo cancelled microphone signals u1 and u2 are added such that remaining direct signals and early reflections of the desired audio are coherently summed, which increases the beamformers performance. Furthermore it does this here by filtering the echo cancelled microphone signals in blocking filters b1 and b2 and then by summing in device 9-2 the filters' outputs to reveal a reverberation representing reference signal x. The reference signal x virtually contains no desired signal components. The filters bi together B, are called the blocking matrix. The filters fi and bi carry the directional, that is the desired sources dependent information. These filters may also be fixed or adaptive.
In the case as shown in
Because the filtered reverberation or reference signal on inverting input 11 is subtracted from the primary signal in summing device 9-3 its signal on the summed output S only contains the desired signal, with the reverberating tail being cancelled.
In order to adapt or update the filter coefficients, only the reverberant behaviour of the room needs to taken into account. Thereto the desired audio source is not required, as any source in the room would do that job. One possibility is to only update the filter coefficients if the reverberation on the audio output 4 dominates the array 3 sensed reverberation. Another possibility is to update the filter coefficients during a training session.
In a further embodiment the system 1 comprises output echo canceller means g3 coupled between the beamformer output S and to the audio output 4, in this case through delay means 12 providing a delay of N samples corresponding with the direct signal and the early reflections already removed by the echo cancelling means gi. If the system 1 is provided with automated beamformer coefficient update control means 13 these means will be coupled to the beamformer 7 and to the output echo canceller means g3 for controlling the update speed of the filter w. The update speed of the filter w may for example be controlled by a measurement of the reverberant echo level relative to the level of other audio or sound sources that may be present in the room. Such measurement is preferably performed in both the time and frequency domain in order to control the update speed of the filter arrangement 7 accordingly.
If the system 1 does not start up by itself, due to absence of any far end signal a loudspeaker signal could be generated, e.g. a noise sequence or some kind of start up tune.
The system explained above can for example be used in hands-free communication systems, such as hands-free speakerphones, voice controlled systems for example in home or for medical applications, congress systems, dictation system or the like.
Number | Date | Country | Kind |
---|---|---|---|
04100023.3 | Jan 2004 | EP | regional |
Filing Document | Filing Date | Country | Kind | 371c Date |
---|---|---|---|---|
PCT/IB04/52856 | 12/20/2004 | WO | 6/23/2006 |