The present application is a National Stage application of PCT/AU2009/000686 entitled “Acoustic Processing Method and Apparatus”, filed on May 29, 2009, which claims priority from Australian Provisional Patent Application No. 2008902723, filed on May 30, 2008, which are hereby incorporated by reference herein.
1. Field of the Invention
The present application relates to processing audio signals containing noise, particularly for auditory prostheses.
2. Related Art
Auditory prostheses include any acoustic or electrical auditory prostheses, such as hearing aids, bone anchored hearing aids, middle ear implants, intracochlear implants, brain stem implants, implanted acoustic prostheses or any combination of these, for example prostheses providing combined electrical and acoustic stimulation. For those prostheses having an external part and an implanted part, the external part may be continuously, intermittently or occasionally in communication with the implanted part.
Auditory prostheses require, as an input, an electrical signal corresponding to an audio signal for processing in the device. This input is most commonly provided by a microphone. For example, a conventional cochlear implant consists of an external part containing a microphone, a sound processor and a transmitter, and an internal part which contains a receiver/stimulator device and an electrode array. Sound enters the microphone, which outputs a corresponding electrical signal to the sound processor, which in turn codes the sound using one of many possible processing strategies. The coded signal is passed to the transmitter, which sends it to the implanted receiver/stimulator unit. The receiver/stimulator then sends the corresponding stimuli to the appropriate electrodes, so as to provide a percept of hearing for a user.
A significant problem for users of auditory prostheses is listening to a target signal in the presence of noise, such as when talking on the phone, or in restaurants, sports stadiums, supermarkets, and the like. One major source of this problem is that a hearing impaired person has a much smaller range of perceivable sounds than a normal hearing person. This perceivable range of sounds is known as the dynamic range.
One way existing systems combat this problem is by automatically turning down the gain of the system in the presence of noise. By turning down the gain of the system the processor can lower the noise level such that it is mapped to the lower end of the recipient's dynamic range. An example of such an algorithm is disclosed in U.S. Pat. No. 6,151,400, and reproduced in part as
The problem with such algorithms is that while they turn down the noise level, they also turn down the level of the target signal. This means that while the user may hear less noise, the target signal also becomes more difficult to hear.
In a broad form, the present invention provides a method and apparatus for rescaling the signal for an auditory prosthesis in order to optimise the dynamic range available to the user.
According to a first aspect of the present invention, there is provided a method for processing an input sound signal by an auditory prosthesis comprising: performing a frequency analysis on the input sound signal to generate a plurality of analysis signals each corresponding to a channel within the input sound signal; applying a scaling function to each of the plurality of analysis signals in response to a control signal, such that for each channel, a minimum output level of the channel is modified in response to the control signal, while a maximum output level of the channel remains substantially unchanged; and applying stimulation to a recipient using at least one of the scaled signals.
According to a second aspect, there is provided a method for processing an input signal in an auditory prosthesis to produce an output signal, said output signal having a maximum output level and a minimum output level, said method including at least the steps of: estimating the noise floor of the input signal; applying a scaling function to the input signal such that the base level of the output signal is modified in response to the noise floor estimate, without substantially changing the maximum output level of the output signal; and applying stimulation to a recipient using at least one of the output signals.
According to a third aspect, there is provided an apparatus for processing a sound signal in an auditory prosthesis to produce an output signal, said output signal having a maximum output level and a base level, comprising: a noise floor estimator for estimating the noise floor of the sound signal; and a scaling circuit for applying a scaling function to the sound signal, such that a minimum output level of the output signal is modified in response to the noise floor estimate without substantially changing a maximum output level of the output signal.
According to a fourth aspect, there is provided a method for processing an input signal in an auditory prosthesis to produce an output signal, said method comprising processing said input signal so as to produce a set of channel input signals corresponding to frequency channels; for each channel, estimating the noise floor of the channel input signal; and applying a scaling function to the channel input signal to produce a channel output signal, the channel output signal having a base level and a maximum output level such that the base level of the channel output signal is modified in response to the noise floor estimate, while the maximum output level of the channel output signal remains substantially the same; and applying stimulation to a recipient using at least one of the output signals.
An illustrative embodiment of the present invention will be described with reference to the accompanying figures, in which:
a-c shows a schematic comparison of a signal in noise in a dynamic range between, respectively, a conventionally processed signal, a signal pre-processed with an automatic sensitivity control algorithm, and a signal processed using an embodiment of the present invention;
The present invention may be implemented in a variety of ways and the embodiments illustrated are to be considered only as illustrative constructions. For example, while the description below relates to cochlear implants, the present invention is also applicable to any auditory prosthesis, such as those listed above. The present invention is concerned specifically with the audio processing part of the signal path, it will be appreciated that the remaining aspects of the auditory prosthesis may be constructed and operated in any suitable way.
Shown in
In the unprocessed condition (
The effect of this implementation of the present invention, referred to as the automatic expander, is shown in
A simplified version of part of a known cochlear implant signal path is depicted in
An example of a typical LGF input/output curve is depicted in
One approach to reducing the effect of noise is to automatically adjust the sensitivity of the signal path such that the noise falls outside the output dynamic range. The effect of shifting the sensitivity in this way is depicted by the dashed curve in
Consider a situation where the noise floor is at an input level of −30 dB and the current base level is at −36 dB. If the sensitivity is reduced by 6 dB it is as if the LGF curve has been shifted to the right by 6 dB. This has the effect of shifting the base level to B′ such that the noise is now below the base level and is therefore not presented to the recipient. This procedure is performed by the ASC algorithm.
Such an approach has the negative side effect of mapping signals above the noise floor to a lower output level than before the sensitivity adjustment. For example, the input level that previously produced a maximum output level now produces an output that is less than maximum. For the recipient this means that it is harder to hear the portion of the target signal that is above the noise.
Instead of simply shifting the curve by applying attenuation, an implementation of the present invention modifies the input/output function such that the base level now coincides with the estimated noise floor (B′). Such a modification is depicted as the dash-dot curve in
Note that this approach can also be applied in environments where the noise floor is low. In this case the base level may be reduced to allow more of the quiet sounds into the output dynamic range. The effect of increasing and decreasing the base level without modifying the maximum output level is shown graphically in
In the algorithm provided by the preferred implementation of the present invention, rather than modifying the loudness growth function directly, the algorithm pre-scales the input to the loudness growth function in response to a control signal. The purpose of the pre-scaling is to remap the input signal such that the noise floor of the environment is below/or at the base level. A pre-scaling is preferred to modifying the LGF due to the difficulty in reshaping the loudness growth function dynamically in a real-world implementation. In an alternative embodiment the loudness growth function may be modified directly rather than applying a pre-scaling. A signal path adapted to implement this aspect of the present invention is shown in
This signal path incorporates a microphone 32, sensitivity control 33, filter bank 34, channel gain 23 and loudness mapping 24, as in the prior art. However, it also incorporates a new scaling block 27 before the loudness mapping block 24 on each channel, and a noise floor estimator 28 on each channel. The purpose of scaling block 27 is to rescale the signal such that the maximum output level (M) is unchanged, but the base level (B) of the LGF is modified (B′) to follow the noise floor estimate.
For the purpose of deriving the scaling function parameters (a, b) it is helpful to define the intermediate signals (x, x′ and y) shown in
The loudness growth function is defined as follows:
Substituting in the equation of the scaling block to derive the above equation in terms of x:
At x=M we require the modified and unmodified output signals to be equal:
which reduces to:
M=aM+b
At x=B′ the output must go to zero so we have:
which reduces to:
B=aB′+b
Combining these two results and solving for a we have:
And then,
The scaling function is now defined in terms of the existing LGF parameters (B and M) and the desired base level (B′).
The other component of the proposed algorithm is the noise floor estimator. The noise floor estimator controls the modification of the base level parameter, which is then used to derive the scaling parameters (a and b). Noise floor estimation is described in U.S. Pat. No. 6,151,400, incorporated by reference herein, is well known in the prior art, and accordingly will not be described in detail here.
Briefly, in U.S. Pat. No. 6,151,400 and as shown in
According to implementations of the present invention, the noise floor estimate may be used to drive the desired base level directly as depicted in
In this configuration the noise floor estimate is passed into a control logic block, which then determines how the base level should be adjusted. By decoupling the control signal from the adjustment of the dynamic range we can allow other requirements to be included in the algorithm. For example, it would then be possible to specify a maximum and minimum allowable dynamic range. A simple flow chart of an example control logic block is shown in
If the noise floor is above an upper threshold (MAX) 41, then this means that the base level is too low and therefore allowing too much noise through the system. The control logic will respond to this by increasing the base level at block 42 (either by an additive or multiplicative factor). Eventually the base level will be increased enough that the noise floor is below the upper threshold and the algorithm can cease increasing the base level.
Similarly if the noise floor is below a second threshold (MIN), determined at block 44, then this implies that the base level is too high and cutting off some of the target signal. The control logic responds to this situation by decreasing the base level until the noise floor estimate is above the low threshold. When the noise floor estimate is between the two thresholds the control logic does not apply an adjustment to the base level, reaching end block 45.
The rate at which the base level is increased or decreased would be programmable and most the rate of increase and rate of decrease would be independent from each other. For example, the base level may be increased faster than it is decreased. This would cause the algorithm to weight cutting off noise more highly than allowing more signal through the signal path. Although this description has focused on using the estimated noise floor as the parameter used to generate the control signal various other parameters may be applicable, for example:
For example, the modulation depth can be computed in a number of ways. One technique is to form a noise floor estimate (as mentioned in the ASC patent U.S. Pat. No. 6,151,400). A signal peak estimate is then formed by changing the minimum detector to a maximum detector. These two values can then either be divided or alternatively subtracted in the log domain to determine the modulation depth.
This description has also focused on rescaling the signal with reference to the set points in the loudness growth function. However, it is also possible to use a similar rescaling method in systems that do not contain a loudness growth function. In this case other set points would need to be defined that mark important points in the dynamic range to be optimized.
This description has focused on performing the dynamic range adjustment on a number of frequency bands, in an alternative embodiment of this invention the algorithm may be implemented on the broadband audio signal. It will be appreciated that variations and additions are possible within the scope of the present invention.
Number | Date | Country | Kind |
---|---|---|---|
2008902723 | May 2008 | AU | national |
Filing Document | Filing Date | Country | Kind | 371c Date |
---|---|---|---|---|
PCT/AU2009/000686 | 5/29/2009 | WO | 00 | 2/16/2011 |
Publishing Document | Publishing Date | Country | Kind |
---|---|---|---|
WO2009/143588 | 12/3/2009 | WO | A |
Number | Name | Date | Kind |
---|---|---|---|
4630305 | Borth et al. | Dec 1986 | A |
4641344 | Kasai et al. | Feb 1987 | A |
6151400 | Seligman | Nov 2000 | A |
6757395 | Fang et al. | Jun 2004 | B1 |
7072717 | Wolf et al. | Jul 2006 | B1 |
7783063 | Pocino et al. | Aug 2010 | B2 |
20030035549 | Bizjak et al. | Feb 2003 | A1 |
Number | Date | Country |
---|---|---|
0326905 | Aug 1989 | EP |
02084866 | Oct 2002 | WO |
Entry |
---|
Extended European Search Report, European Application No. 09753352.5 mailed Feb. 15, 2013 (7 pages). |
International Search Report, for PCT/AU2009/000686, mailed Sep. 4, 2009, 2 pages. |
Written Opinion, for PCT/AU2009/000686, mailed Sep. 4, 2009, 6 pages. |
Number | Date | Country | |
---|---|---|---|
20110135129 A1 | Jun 2011 | US |