1. Field of the Invention
The present invention relates to the field of digital signal processing. More particularly, the present invention relates to a method and apparatus for use in acoustic feedback suppression in digital audio devices such as hearing aids.
2. Background
Acoustic feedback, which is most readily perceived as high-pitched whistling or howling, is a persistent and annoying problem typical of audio devices with relatively high-gain settings, such as many types of hearing aids.
Prior art feedback cancellation approaches for acoustic feedback control either typically use the compensated speech signals (i.e., Y(n) 140 in FIG. 1), or add a white noise probe as the input signal to the adaptive filter.
Wideband feedback cancellation approaches without a noise probe are based on the architecture shown in
Feedback cancellation methods using a noise probe are dichotomized based on the control of their adaptation as being either continuous or noncontinuous.
A different feedback cancellation apparatus and method has been recently proposed, comprising a feedback canceller with a cascade of two wideband filters in the cancellation path. This method involves using linear prediction to determine Infinite Impulse Response (“IIR”) filter coefficients which model the resonant electro-acoustic feedback path. As known to those skilled in the art, linear prediction is most widely used in the coding of speech, where the IIR-filter coefficients model the resonances of the vocal tract. In this system, the IIR filter coefficients are estimated prior to normal use of the hearing aid and are used to define one of the cascaded wideband filters. The other wideband filter is a Finite Impulse Response (“FIR”) filter, and adapts during normal operation of the hearing aid.
A new subband feedback cancellation scheme is proposed, capable of providing additional stable gain without introducing audible artifacts. The subband feedback cancellation scheme employs a cascade of two narrow-band filters Ai(Z) and Bi(Z) along with a fixed delay, instead of a single filter Wi(Z) and a delay to represent the feedback path in each subband. The first filter, Ai(Z), is called the training filter, and models the static portion of the feedback path in ith subband, including microphone, receiver, ear canal resonance, and other relatively static parameters. The training filter can be implemented as a FIR filter or as an IIR filter. The second filter, Bi(Z), is called a tracking filter and is typically implemented as a FIR filter with fewer taps than the training filter. This second filter tracks the variations of the feedback path in the ith subband caused by jaw movement or objects close to the ears of the user.
Those of ordinary skill in the art will realize that the following description of the present invention is illustrative only and not in any way limiting. Other embodiments of the invention will readily suggest themselves to such skilled persons having the benefit of this disclosure.
The present invention discloses a new subband feedback cancellation scheme, capable of providing more than 10 dB of additional stable gain without introducing any audible artifacts. The present invention employs a cascade of two narrowband filters Ai(Z) and Bi(Z) along with a fixed delay instead of a single filter Wi(Z) and a delay to represent the feedback path in each subband, and where
Wi(Z)=Ai(Z)Bi(Z)i.
The first filter, Ai(Z), is called the training filter, and models the static portion of the feedback path in ith subband, including microphone, receiver, ear canal resonance, and other relatively static model parameters. The training filter can be implemented as either a FIR filter or an IIR filter, but compared with a FIR filter, an IIR filter may need fewer taps to represent the transfer function. However, the IIR adaptive filter may become unstable if its poles move outside the unit circle during the adaptation process. This instability must be prevented by limiting the filter weights during the updating process. In addition, the performance surfaces are generally nonquadratic and may have local minima. Most importantly, only a few taps are needed for an FIR filter to represent the feedback path in subbands, and thus an IIR filter does not provide any computational benefits in subbands. Therefore, due to the disadvantages of an IIR adaptive filter, the FIR adaptive filter is usually applied in subbands.
The second filter, Bi(Z), is called a tracking filter and is usually chosen to be a FIR filter with fewer taps than the training filter. It is employed to track the variations of the feedback path in the ith subband caused by jaw movement or objects close to the ears of a user. If subband variations in the feedback path mainly reflect changes in the amount of sound leakage, the tracking filter only needs one tap. Experimentation indicates that this is a good assumption.
The feedback cancellation algorithm according to embodiments of the present invention performs feedback cancellation in two stages: training and tracking. The canceller is always set to the tracking mode unless pre-defined conditions are detected. Without limitation, such conditions may include power-on, switching, training commands from an external programming station, or oscillations.
Because the hearing aid's canceller must initially be trained before it attempts to track, the tracking filter Bi(Z) is constrained to be a unit impulse while Ai(Z) is being estimated using adaptive signal processing techniques known to those skilled in that art. Training is performed by driving the receiver with a very short burst of noise. Since the probe sequence is relatively short in duration (˜300 ms), the feedback path will remain stationary. Furthermore, since the probe sequence is not derived from the microphone input, the configuration of the adaptive system is open loop, which means that the performance surface is quadratic and the coefficients of the filter will converge to their expected values quickly.
Once training is completed, the coefficients of Ai(Z) are frozen and the hearing aid's canceller switches into tracking mode. The initial condition of the tracking filter is always an impulse. No noise is injected in the tracking mode. In this mode, the system according to embodiments of the present invention operates as a normal hearing aid with the compensated sound signal sent to the receiver used as the input signal to the feedback cancellation filter cascade.
It should be noted that an output limiting block 582 is shown after the synthesis filter bank 580 in FIG. 5. Although other embodiments of the present invention may or may not include a limiter 582, if one is present, it would typically follow the synthesis filter bank if it is needed to avoid saturation nonlinearities.
The feedback path in each subband is modeled by a cascade of two filters 590 and 592. This feedback cancellation scheme works in two different modes: training and tracking. One filter is adaptively updated only in the training mode, while the other is updated only in the tracking mode. The hearing aid usually works in the tracking mode unless training is required. The switch position 594 shown in the
Techniques used to update the filter coefficients adaptively are known to those skilled in the art, and can be directly applied in updating Ai(Z) and Bi(Z) in each subband. Depending on the desired tradeoff between performance and complexity, a signed adaptive algorithm can be used for simpler implementation while more complicated adaptive algorithms, such as the well known NLMS, variable step-size LMS (VS), fast affine projection, fast Kalman filter, fast newton, frequency-domain algorithm, or the transform-domain LMS algorithms can be employed for fast convergence and/or less steady state coefficient variance.
A few techniques specifically useful for the update of the filter coefficients in a subband hearing aid are introduced herein.
First, the attenuation provided by the feedback path 588 may cause the audio output signal in any one subband to fall below the noise floor of the microphone 520 or A/D converter 530. In this case, the subband signal Xi will contain no information about the feedback path. In this subband, the acoustic feedback loop is sufficiently cancelled (the feedback path is broken) and the subband adaptive filter should be frozen. In conjunction with an averager used on a subband version of the audio output, statistics about the attenuation provided by the feedback path can be used to estimate if the subband signal Xi contains any statistically significant feedback components.
Second, the subband source signal additively interferes with the subband feedback signals necessary for identifying the subband feedback path. The ratio of the feedback distorted probe signal to the interfering subband source signal can be considered as the subband adaptive filter's signal-to-noise ratio. During times when this signal-to-noise ratio is low, the adaptive filter will tend to adapt randomly and will not converge. Due to the delays in the feedforward and feedback path, the subband adaptive filter's signal-to-noise ratio will be lowest during the onset of a word or other audio input. While the signal-to-noise ratio is low the adaptive filter should be frozen or the step-size of the update algorithm should be reduced. On the other hand, the subband adaptive filter's signal-to-noise ratio will be high during the offset of a word or other audio input. While this signal-to-noise ratio is high the adaptive filter will tend to converge and the update algorithm's step-size should be increased. In conjunction with averagers used on subband versions of the audio output and the audio input, statistics about the attenuation provided by the feedback path can be used to estimate each subband adaptive filter's signal-to-noise ratio.
Third, if the subband hearing aid implements both noise reduction and a feedback canceller which adapts on the feedback-distorted gain-compensated output sound signal then an additional adaptation control can be used. This control is recommended since noise reduction circuitry usually differentiates the subband audio signal Xi(n) into a short-term stationary and a long-term stationary component. The short-term stationary component is considered to be the desired audio signal and the long-term stationary component is deemed to be unwanted background noise. The ratio of the power in the short-term stationary as compared to the long-term stationary sound signal is called the signal-to-noise ratio of the subband audio signal. If the subband signal's statistics indicate that this signal-to-noise ratio is low then the noise reduction circuit will lower the gain in that subband. The lower gain may prevent feedback, but will also reduce the energy of the subband audio output signal. Since this audio output helps to probe the feedback path during tracking, lower gain results in poorer tracking performance. This is especially true if the subband audio input Xi(n) is largely composed of long-term stationary background noise which carries no information about the feedback path. This background noise will interfere with the feedback-distorted gain-compensated output sound signal and produce random variations in the transfer function of Bi(Z). To avoid these random variations the step-size should be reduced (probably to zero). Furthermore, when the signal-to-noise ratio of the subband audio signal is very high it is more likely to be cross-correlated with the feedback-distorted gain-compensated output sound signal. In this case adaptation of the canceller will have an unwanted bias. A decorrelating delay in the feedforward path should be large enough to continue adaptation in this case, but the update algorithm's step-size can be reduced to avoid the influence of the bias.
Fourth, the NLMS and VS algorithms are both simple variations of the LMS algorithm which increase the convergence speed of the canceller. The NLMS algorithm is derived to optimize the adaptive filter's instantaneous error reduction assuming a highly correlated probe sequence. Since for tracking the probe sequence is preferably speech and since speech is highly correlated the NLMS is known to have a practical advantage. On the other hand, the VS algorithm is based on the notion that the optimal solution is nearby when the estimates of the error surface's gradient are consistently of opposite sign. In this case the step-size is decreased. Likewise, if the gradient estimates are consistently of the same sign it is estimated that the current coefficient value is far from the optimal solution and the step size is increased. In feedback cancellation the non-stationarity of the feedback path will cause the optimal solution to change dynamically. Since they operate on different notions, and since they perfectly fit the problems associated with using the conventional LMS algorithm for feedback cancellation a combined NLMS-VS scheme is suggested. The NLMS algorithm will control the step-size on a sample-by-sample basis to adjust for the signal variance and the VS algorithm will aperiodically compensate for changes in the feedback path.
Below, the conventional LMS adaptive algorithm is employed as an example to derive updating equations. It should be very straight-forward to apply other adaptive algorithms to estimate the training filter or the tracking filter. The estimation process of the subband transfer function using the conventional LMS algorithm in two modes is described by the following equations:
Training: i=0, . . . , M−1
Ti(n)=AiH(n)Ni(n),
ei(n)=Xi(n)−Ti(n),
Ai(n+1)=Ai(n)+μe*i(n)Ni(n).
Tracking: i=0, . . . , M−1
Ti(n)=AiT(n)Ni(n),
ei(n)=Xi(n)−BiH(n)Ti(n),
Bi(n+1)=Bi(n)+μei*(n)Ti(n).
where Ai(n) is the coefficient vector of the training filter in the ith band, and Ni(n) is an input vector of the training filter in the corresponding band. The variable μ is the step size, and Bi(n) is the coefficient vector of the subband tracking filter.
To describe the static feedback path, the corresponding wideband training filter A(Z) usually requires more than 64 taps. If the analysis filter bank decomposes and down-samples the signal by a factor of 16, as in some embodiments of the present invention, the training filter in each subband only requires 4 taps and a fixed delay.
As described earlier, the signal used to update the coefficient vector Bi(n) is processed speech rather than white noise. Due to the non-flat spectrum of speech, the corresponding spread of the eigenvalues in the autocorrelation matrix of the signal tends to slow down the adaptation process.
Moreover, the subband adaptive filter's signal-to-noise ratio is usually low, and thus the correlation between the subband audio source signal and the feedback-distorted gain-compensated output sound signal is likely to be high. Also, the system in the tracking mode is recursive, and the performance surface may have local minima. These considerations dictate that the tracking filter should be as short as possible, while still providing an adequate number of degrees of freedom to model the subband variations of the feedback path.
If subband variations in the feedback path mainly reflect changes in the amount of sound leakage, the tracking filter only needs one tap. If this tap is constrained to be real, the filter simplifies nicely to an Automatic Gain Control (“AGC”) on the training filter's subband feedback estimate. Even with only a single real tap for tracking in each subband, the recursive nature of the system implies that instability is a possibility if the signal-to-noise ratio is very low, if the correlation between input and output is too high, or if the feedback path changes drastically. Moreover, even if the adaptive canceller remains stable the recursive system may exhibit local minima. To avoid instability and local minima, the coefficients of the tracking filter should be limited to a range consistent with the normal variations of the feedback path. As known to those skilled in the art, methods of limiting the tap may involve resetting or temporarily freezing the tracking filter if it goes out of bounds.
Furthermore, as illustrated in the frequency response graph of
As mentioned previously, a common problem in using a noise signal 583 as the training signal for an adaptive feedback canceller is that it must be a very low-level signal so that it is not unpleasant to the listener. However, a low-level training signal can be overwhelmed by ambient sounds so that the signal-to-noise ratio for the training signal can be very low. This can cause poor training results.
To overcome the problem of low signal-to-noise ratio for the training signal, one can take advantage of the fact that the probe sequence is periodic. First, a relatively short sequence is chosen, but one that is longer than the longest feedback component. Then, the sequence is synchronously detected after it has passed through the feedback path. Corresponding samples within the sequence are averaged. For example, the first samples from each period of the sequence are averaged together. Likewise, second samples are averaged together, and so forth. Two commutators and a set of averagers can be used by those skilled in the art to grow the desired sequence.
Averaging periods of the sequence together will increase the amplitude of the training signal and simultaneously reduce the amplitude of the ambient sounds assuming that the ambient sound is zero-mean. The averaged sequence will grow to the probe sequence distorted by the feedback path. The averaged sequence becomes the desired signal (X[n]−S[n]) of the adaptive structure. The probe sequence is filtered by the adaptive filter that grows an estimate of the feedback distortion. The configuration for training in the wideband is shown in
Additionally, if the ambient sounds are expected to fluctuate in amplitude, then the probe sequence can be averaged only during times when the level of the ambient sound is low. This can further improve the signal-to-noise ratio of the adaptive canceller.
Finally, since the feedback canceller will be used with individuals who have a hearing loss, it may be possible to inject an attenuated version of the probe sequence during the normal operation of the hearing aid. By averaging periods of the sequence together, the amplitude of zero-mean feedback-filtered speech will be reduced just like the zero-mean ambient sounds. Thus even when mixed with the normal speech output, the averaged sequence will still represent the training signal distorted by the feedback path. As suggested previously, the averaged sequence should be computed in the subbands to take advantage of the downsampling. To use the averaged subband sequence for updating of the training filter during normal operation of the hearing aid requires a third analysis filter bank and a second set of subband training filters as shown in FIG. 15.
When some pre-specified conditions are met, the coefficients of the second training filter, Ai(Z), 1540 in the ith band are copied into the first training filter, Âi(Z) 1550. When this is done, the tracking filter Bi(Z) 1560 should be reset to an impulse. The pre-specified conditions may be if the correlation coefficient between Ai(Z) 1540 and Âi(Z) 1550 falls below a threshold, if a counter triggers a scheduled update, or if feedback oscillations are detected. The first training filter in the ith band, Âi(Z) 1550, can be initially adapted as shown in
Compared with the existing feedback cancellation approaches, this invention is simpler and easier to implement. It is well-suited for use with a digital subband hearing aid. In addition, embodiments of the present invention can provide more than 10 dB of additional gain without introducing distortion or audible noise.
While embodiments and applications of this invention have been shown and described, it would be apparent to those of ordinary skill in the art having the benefit of this disclosure that many more modifications than mentioned above are possible without departing from the inventive concepts herein. The invention, therefore, is not to be restricted except in the spirit of the appended claims.
This application is a continuation of U.S. patent application Ser. No. 10/254,698, filed on Sep. 24, 2002, “SUBBAND ACOUSTIC FEEDBACK CANCELLATION IN HEARING AIDS”, now abandoned, which is a continuation of U.S. patent application Ser. No. 09/399,483, filed on Sep. 20, 1999, “SUBBAND ACOUSTIC FEEDBACK CANCELLATION IN HEARING AIDS”, now U.S. Pat. No. 6,480,610.
Number | Name | Date | Kind |
---|---|---|---|
3578913 | Ufkes | May 1971 | A |
3685009 | Gleming, Jr. | Aug 1972 | A |
3763333 | Lichowsky | Oct 1973 | A |
3928733 | Hueber | Dec 1975 | A |
4025721 | Graupe et al. | May 1977 | A |
4061875 | Freifeld et al. | Dec 1977 | A |
4122303 | Chaplin et al. | Oct 1978 | A |
4135590 | Gaulder | Jan 1979 | A |
4185168 | Graupe et al. | Jan 1980 | A |
4187472 | Yum | Feb 1980 | A |
4188667 | Graupe et al. | Feb 1980 | A |
4216430 | Amazawa et al. | Aug 1980 | A |
4238746 | McCool et al. | Dec 1980 | A |
4243935 | McCool et al. | Jan 1981 | A |
4249128 | Karbowski | Feb 1981 | A |
4326172 | Schmidt | Apr 1982 | A |
4355368 | Zeidler et al. | Oct 1982 | A |
4368459 | Sapora | Jan 1983 | A |
4494074 | Bose | Jan 1985 | A |
4548082 | Engelbretson et al. | Oct 1985 | A |
4589133 | Swinbanks | May 1986 | A |
4589137 | Miller | May 1986 | A |
4602337 | Cox | Jul 1986 | A |
4628529 | Borth et al. | Dec 1986 | A |
4630305 | Borth et al. | Dec 1986 | A |
4654871 | Chaplin et al. | Mar 1987 | A |
4658426 | Chabries et al. | Apr 1987 | A |
4718099 | Hotvet | Jan 1988 | A |
4723294 | Taguchi | Feb 1988 | A |
4759071 | Heide | Jul 1988 | A |
4783818 | Graupe et al. | Nov 1988 | A |
4802227 | Elko et al. | Jan 1989 | A |
4845757 | Wagner | Jul 1989 | A |
4878188 | Ziegler, Jr. | Oct 1989 | A |
4912767 | Chang | Mar 1990 | A |
4939685 | Feintuch | Jul 1990 | A |
4953217 | Twiney et al. | Aug 1990 | A |
4956867 | Zurek et al. | Sep 1990 | A |
4985925 | Langberg et al. | Jan 1991 | A |
5016280 | Engebretson et al. | May 1991 | A |
5027306 | Dattorro et al. | Jun 1991 | A |
5091952 | Williamson et al. | Feb 1992 | A |
5097510 | Graupe | Mar 1992 | A |
5105377 | Ziegler, Jr. | Apr 1992 | A |
5111419 | Morley, Jr. et al. | May 1992 | A |
5165017 | Eddington et al. | Nov 1992 | A |
5177755 | Johnson | Jan 1993 | A |
5225836 | Morley, Jr. et al. | Jul 1993 | A |
5233665 | Vaughn et al. | Aug 1993 | A |
5251263 | Andrea et al. | Oct 1993 | A |
5259033 | Goodings et al. | Nov 1993 | A |
5263019 | Chu | Nov 1993 | A |
5291525 | Funderburk et al. | Mar 1994 | A |
5305307 | Chu | Apr 1994 | A |
5355418 | Kelsey et al. | Oct 1994 | A |
5357251 | Morley, Jr. et al. | Oct 1994 | A |
5396560 | Arcos et al. | Mar 1995 | A |
5402496 | Soli et al. | Mar 1995 | A |
5412735 | Engebretson et al. | May 1995 | A |
5452361 | Jones | Sep 1995 | A |
5473684 | Bartlett et al. | Dec 1995 | A |
5475759 | Engebretson | Dec 1995 | A |
5500902 | Stockham, Jr. et al. | Mar 1996 | A |
5511128 | Lindemann | Apr 1996 | A |
5539831 | Harley | Jul 1996 | A |
5544250 | Urbanski | Aug 1996 | A |
5600729 | Darlington et al. | Feb 1997 | A |
5651071 | Lindemann et al. | Jul 1997 | A |
5677987 | Seki et al. | Oct 1997 | A |
5680467 | Hansen | Oct 1997 | A |
5689572 | Ohki et al. | Nov 1997 | A |
5710820 | Martin et al. | Jan 1998 | A |
5794187 | Franklin et al. | Aug 1998 | A |
5825898 | Marash | Oct 1998 | A |
5838801 | Ishige et al. | Nov 1998 | A |
5848169 | Clark, Jr. et al. | Dec 1998 | A |
5848171 | Stockham, Jr. et al. | Dec 1998 | A |
5867581 | Obara | Feb 1999 | A |
5937070 | Todter et al. | Aug 1999 | A |
6023517 | Ishige | Feb 2000 | A |
6035048 | Diethorn | Mar 2000 | A |
6044162 | Mead et al. | Mar 2000 | A |
6072884 | Kates | Jun 2000 | A |
6072885 | Stockham, Jr. et al. | Jun 2000 | A |
6118878 | Jones | Sep 2000 | A |
6163287 | Huang | Dec 2000 | A |
6173063 | Melanson | Jan 2001 | B1 |
6219427 | Kates et al. | Apr 2001 | B1 |
6278786 | McIntosh | Aug 2001 | B1 |
6396930 | Vaudrey et al. | May 2002 | B1 |
Number | Date | Country |
---|---|---|
0 064 042 | Jan 1986 | EP |
0 823 829 | Nov 1998 | EP |
0 930 801 | Dec 1998 | EP |
WO 9320668 | Oct 1993 | WO |
WO 9409604 | Apr 1994 | WO |
9635314 | Nov 1996 | WO |
9750186 | Dec 1997 | WO |
9828943 | Jul 1998 | WO |
WO 9843567 | Oct 1998 | WO |
9847227 | Oct 1998 | WO |
9847314 | Oct 1998 | WO |
9926453 | May 1999 | WO |
WO 9945741 | Sep 1999 | WO |
Number | Date | Country | |
---|---|---|---|
20040125973 A1 | Jul 2004 | US |
Number | Date | Country | |
---|---|---|---|
Parent | 10254698 | Sep 2002 | US |
Child | 10737206 | US | |
Parent | 09399483 | Sep 1999 | US |
Child | 10254698 | US |