The invention generally relates to spectral enhancement systems for enhancing a spectrum of multi-frequency signals (e.g., acoustic, electromagnetic, etc.), and relates in particular to spectral enhancement systems that involve filtering and amplification.
Conventional spectral enhancement systems typically involve filtering a complex multi-frequency signal to remove signals of undesired frequency bands, and then amplifying the filtered signal in an effort to obtain a spectrally enhanced signal that is relatively background free.
In many systems, however, the background information may be difficult to filter out based on frequencies alone because the complex multi-frequency signal may include background noise that is close to the frequencies of the desired information signal. Moreover, many conventional spectral enhancement systems inadvertently amplify some background noise with the amplification of the desired information signal.
For example, a spectral enhancement system may include one or more band pass filters into which an input signal is received, as well as one or more compression and/or amplification units, the outputs of which are combined at a combiner to produce an output signal. If the frequencies of the desired signals, for example, vowel sounds in an auditory signal are either within a band filtered frequency or are surrounded by substantial noise signals in the frequency spectrum, then such a filter and amplification system may not be sufficient in certain applications.
As a particular example of a spectral enhancement system, an electronic cochlea models the traveling-wave amplifier architecture of a biological cochlea as a cascade of nonlinear-and-adaptive second-order filters with corner frequencies that decrease exponentially from approximately 10 kHz to 100 Hz. Due to the successive compounding of gains, a change in the individual filter gains of a few percent can alter the gain of the composite transfer function by many orders of magnitude. For example, (1.1)45=73 while (0.9)45=0.009. It is very difficult to accomplish such wide-dynamic-range gain control with one localized amplifier without changing the amplifier's bandwidth, temporal resolution, and power dissipation drastically. Any parameter variations in the Q's of the various cochlear filters, which can result in inhomogenities and nonrobust or unstable operation, are compensated for through gain control. Any physical biological system, such as the cochlea, must possess a feedback system to ensure that it works in a real-world environment, where parameters are not perfectly matched and controlled to high precision as in current digital implementations or simulations.
Distributed gain control and the traveling-wave phenomena are important aspects of the silicon cochlea (as disclosed in A Low-Power Wide-Dynamic Range Analog VLSI Cochlea, Sarpeshkar, R., Lyon, R. F., and Mead, C. A., Analog Integrated Circuits and Signal Processing (1998), the disclosure of which is hereby incorporated by reference) in replicating the performance of the biological cochlea. The silicon cochlea's importance to cochlear implant processing is significant for at least the following reasons.
1) An exponentially tapering filter-cascade architecture provides an extremely efficient mechanism for constructing a bank of closely spaced high-order filters as disclosed in Traveling Waves versus Bandpass Filters: The Silicon and Biological Cochlea, Sarpeshkar, R., Proceedings of the International Symposium on Recent Developments in Auditory Mechanics, World Scientific (2000), and Filter Cascades as Analogs of the Cochlea, Lyon, R. F., Neuromorphic Systems Engineering (1998), the disclosures of which are both hereby incorporated by reference. As the number of channels in implants continues to grow (e.g., 31 channel implants, 64-channel implants, 128-channel implants etc.), the advantages of filter cascades in creating a bank of high-order filters will become more and more apparent.
2) A sophisticated frequency-dependent version of the gain control algorithms presently used in implants and hearing aids may be implemented as disclosed in Comparison of Different Forms of Compression in Wearable Digital Hearing Aids, Stone, M. A., Moore, B. C. J., Alcantara, J. I., and Glasberg, B. R., J. Acoustic Society of America, (1999), the disclosure of which is hereby incorporated by reference. Thus loud sounds at one frequency do not have to result in inaudible sounds at another frequency. Also, the gain control allows important phenomena in the perception of speech in noise such as forward masking to be easily modeled. Gain control has been shown to be particularly important in the performance of speech recognition systems in reverberant and noisy environments.
3) The architecture of the cochlea is amenable to both time and place coding as described in A Low-Power Analog Front-end Module for Cochlear Implants, Wang, R. J. W, Sarpeshkar, R, Jabri, M. and Mead, C. XVI World Congress on Otorhinolaryngology (1997), the disclosure of which is hereby incorporated by reference.
4) The biological realism allows several important phenomena in biology to be naturally replicated. These include filter broadening with level, the distributed coding of loudness, the transition from place cues to time cues as level increases, redundant signal representations, the close intertwining of both filtering and compression rather than the artificial separation of filtering and compression in today's implants, compression of long-term information while preserving good sensitivity to transients, two-tone suppression, the upward spread of masking, and forward masking. Although it is quite possible that none of these effects have any importance for implant patients, given that cochlear front ends have been shown to improve speech recognition in noise it is unlikely that models closer to the biology will have no impact on implant patients. It is also likely that coding strategies that are closer to the biology will prove superior to those that are not.
5) The silicon cochlea's analog circuit techniques provide a foundation for ultra-low-power cochlear implant design.
The silicon cochlea may be implemented as a particular form of local feedforward gain control as disclosed in A Low-Power Wide-Dynamic Range Analog VLSI Cochlea discussed above. Such an implementation, however, generates input-output curves that are too compressive as compared with those in a real cochlea. Such curves are not suitable for direct use in cochlear implants. Furthermore, such curves cannot easily be programmed to implement a desired compression characteristic, an important necessity in a practical system.
There is a need therefore, for an improved spectral enhancement system that is efficient and practical.
In accordance with an embodiment, the invention provides a spectral enhancement system that includes a plurality of distributed filters, a plurality of energy distribution units, and a weighted-averaging unit. At least one of the distributed filters receives a multi-frequency input signal. Each of the plurality of energy-detection units is coupled to an output of at least one filter and provides an energy-detection output signal. The weighted-averaging unit is coupled to each of the energy-detection units and provides a weighted-averaging signal to each of the filters responsive to the energy-detection output signals from each of the energy-detection units to implement distributed gain control. In an embodiment, the energy detection units are coupled to the outputs of the filters via a plurality of differentiator units.
The following description may be further understood with reference to the accompanying drawings in which:
The drawings are shown for illustrative purposes only and are not to scale.
It has been discovered that a system may be developed to provide an efficient spectral enhancement system by employing a bank of wide-dynamic-range frequency-analysis channels. Such a system may be created using hardware circuit components (e.g., electronic, optic or pneumatic), using software, or using any other simulation routine such as the MATLAB program sold by Math Works, Inc. of Natick, Mass.
For example, in an auditory enhancement or replacement systems for humans, an electronic cochlea maps the traveling-wave architecture of the biological cochlea into a silicon chip. In both biology and electronics gain control is essential in ensuring that the architecture is robust to parameter changes, and in attaining wide dynamic range. A silicon cochlea with distributed gain control is advantageous as a front end in cochlear-implant processors to improve patient performance in noise and to implement the computationally intensive algorithms of the biological cochlea with very low power.
In accordance with an embodiment, the invention provides a computer simulation of a filter-cascade cochlear model with distributed gain control that incorporates several important features such as multi-band compression, an intertwining of filtering and compression, masking, and an ability to tradeoff the preservation of spectral contrast with the preservation of audibility. The gain control algorithm disclosed herein successfully reproduces cochlear frequency response curves, and represents an example of a class of distributed-control algorithms that could yield similar results. In distributed gain-control systems like the cochlea, each individual filter does not change its gain appreciably although the collective system does change its gain appreciably. Thus, a system may maintain its bandwidth, temporal resolution, and power dissipation to be relatively invariant with amplitude.
During operation, the cascaded resonant second-order sections 12-24 may provide low pass filter functions and have characteristic frequencies (CFj) that are exponentially tapered from the beginning of the cascade to the end of the cascade. The outputs from the resonant low pass second-order sections 12-24 are double differentiated in the (jw/CFj)2 blocks 26-36 to create CF-normalized bandpass frequency-response characteristics at each stage of the silicon cochlea. In other embodiments, the differentiation 26-36 may provide a unity differentiator function. The envelope energy in each of these stages is extracted by the envelope-detector (ED) blocks 38-48 and fed to a kernel that computes a spatially-filtered version of these energies. The kernel 50 weights local energies more strongly than energies from remote stages. The output of the kernel, Ij, is then passed through nonlinear block, NLj (52). The NL block outputs a large value for the resonant gain, Qj, if the energy is low, and a small value for Qj, if the energy is high, thus, performing gain control. The attack and release dynamics of the gain control arise from charging and discharging time constants in the envelope detector respectively, and may be tapered with the CF's of the cochlear stages. For clarity, the architecture is only shown in detail for stage j of the cascade, but every stage of the cascade has similar NLj blocks that operate on local estimates of envelope energy output by the kernel.
The weighted-averaging at any local filter is a function of the of the energy outputs of each of the other filters as well as the local energy output and may be generally represented as follows:
Ij=Fj(. . . , ej−3,ej−2,ej−1,ej,ej+1,ej+2,ej+3, . . .) (1)
A specific example of the equations that describe distributed gain control in accordance with an embodiment of the invention are disclosed in the equations below to describe the spatial weight-averaging kernel and non-linearity unit:
Spatial kernel:
NL:
Qj=Qmax for Ij≦K and
The weights of the kernel are given by wij. The parameters Qmax and Qmin determine the maximum and minimum Q settings of a cochlear stage. The value K determines the knee of the cochlear compression characteristic, and z determines the power law of the compression characteristic. A large K implies that the gain control is activated only at large intensities. A large z means that the compression characteristic obeys a small power law, and is relatively flat with intensity. The spatial extent of the kernel, Qmax, and Qmin determine whether the gain control is broadband and preserves spectral contrast (large spatial-extent kernels and small Q's) or whether it is narrowband and preserves audibility (small spatial-extent kernels and large Q's).
As shown in
Input-output curves are shown in
Those skilled in the art will appreciate that numerous variations and modifications may be made to the above embodiments without departing from the spirit and scope of the claims.
The present application claims priority to U.S. Provisional Patent Application Ser. No. 60/398,253 filed Jul. 24, 2002.
Number | Name | Date | Kind |
---|---|---|---|
4633501 | Werrbach | Dec 1986 | A |
5027410 | Williamson et al. | Jun 1991 | A |
5473759 | Slaney et al. | Dec 1995 | A |
5757932 | Lindemann et al. | May 1998 | A |
6231604 | von Ilberg | May 2001 | B1 |
6690205 | Alvandpour | Feb 2004 | B2 |
6873709 | Hou | Mar 2005 | B2 |
6990205 | Chen | Jan 2006 | B1 |
7076315 | Watts | Jul 2006 | B1 |
20050141733 | Blamey et al. | Jun 2005 | A1 |
Number | Date | Country | |
---|---|---|---|
20040136545 A1 | Jul 2004 | US |
Number | Date | Country | |
---|---|---|---|
60398253 | Jul 2002 | US |