This invention relates to sound enhancement for hearing-impaired listeners. More particularly, the invention relates to a method of, and equipment for, enhancing sound heard by hearing-impaired listeners.
A listener wearing a conventional hearing-aid demonstrates a substantial reduction in his or her sound externalisation and sound spatialisation abilities and this, in turn, significantly reduces the listener's ability to parse sounds of interest from competing background sounds. On the other hand, a non-hearing impaired listener relies on spatial hearing to separate competing sounds based on the different spatial locations between the sources of the sounds and the listener. Sound spatialisation also assists listeners to focus attention on sounds of interest.
Human spatial hearing relies on the integration of acoustic information from both ears. This acoustic information consists of the binaural difference in the intensity and time of arrival of sound between the two ears and also the monaural spectral cues that result from the location-dependent acoustic filtering of sound by the outer ear. The perception of externalised sounds (i.e., sounds that are heard as outside of the head) relies primarily on the monaural spectral cues provided by the acoustic filtering of the outer ear. Sounds without these spectral cues, but with a consistent interaural time difference cue and interaural intensity difference cue, are perceived as lateralised and inside of the head.
A hearing-impaired listener usually suffers greater hearing loss at higher frequencies. However, due to the shape and size of the outer ear, the frequency range over which the monaural spectral cues play an important role for spatial acuity is generally from about 5 kHz to 20 kHz, which is in the higher range of auditory frequencies. As a result, auditory spatialisation is significantly impaired for the hearing-impaired listener, which ultimately leads to the inability to separate information from background noise. Furthermore, it is the high frequencies above about 8 kHz that are required for accurate spatialisation of speech stimuli.
Various methods for enhancing the spatial hearing of listeners wearing hearing aids have been proposed. One of these methods for enhancing the spatial hearing of listeners wearing hearing aids involves the use of miniature, completely-in-the-canal (CIC) hearing aids to avoid interference with the acoustic filtering of the outer ear. The electronics for the CIC hearing-aids are contained within a small mould that is completely contained within the auditory canal.
Another method for enhancing the spatial hearing of listeners wearing hearing aids involves the use of open or non-occluding ear moulds that do not distort the low-frequency interaural time difference cues.
Yet another method for enhancing the spatial hearing of listeners wearing hearing aids involves adjusting the gains of the left and right hearing aids based on empirical localisation tests in an attempt to preserve the interaural intensity difference cues.
One disadvantage of all of these methods is that they do not use signal processing to enhance and provide high-frequency monaural spectral cues that vary consistently with the location of the sound in space.
Another disadvantage of all of these methods is that they do not make the very high frequency spectral cues (greater than about 8 kHz) more audible.
Terms related to this invention are defined below:
The term “speech frequency band” is the frequency range (approximately, but not exactly, 200 Hz to 4 kHz) that is empirically most important for a listener's speech perception. It may vary slightly from listener to listener and may be determined empirically and/or analytically.
The term “high-frequency band” refers to the frequency band above the speech frequency band.
The term “high frequency component” refers to a frequency component of a sound that occurs in the high frequency band.
According to a first aspect of the invention, there is provided a method of enhancing sound heard by a hearing-impaired listener, the method comprising
monitoring the sound in an environment in which the listener is located; and
manipulating the frequency of high frequency components of the sound in a high frequency band, with little, if any, distortion to components of the sound in a speech frequency band, to enhance spectral cues to aid the listener in sound externalisation and spatialisation.
The method may include manipulating the frequency of the high frequency components by a technique selected from the group comprising: compressing the components across a frequency range, shifting the high frequency components to lower frequencies and combinations of the foregoing.
The method may include
dividing the sound into a number of segments in time;
determining whether or not there are high frequency components of the sound in each of the segments; and
manipulating the frequency of the high frequency components only for segments in which there is an occurrence of high frequency energy above a predetermined threshold in the high frequency band.
Instead, the method may include
dividing the sound into a number of segments in time;
determining whether or not the sound in each segment has a harmonic structure in the high frequency band; and
manipulating the frequency of the high frequency components only for segments in which there is little, if any, harmonic structure in the high frequency band.
The method may be implemented in at least one hearing aid of the listener, the method further including configuring the hearing aid to preserve acoustic filtering of an outer ear of the listener.
Further, the method may include determining a hearing range for the listener and customising the manipulation of the high frequency components to the hearing range of the listener.
In one embodiment, the method may include manipulating the high frequency components by first transforming a sound signal to the frequency domain and, thereafter, modifying the frequency domain representation using one of a mapping and a warping technique.
In another embodiment of the invention, the method may include manipulating the high frequency components in the time-domain using at least one of a time-domain filter bank and a resampling technique to shift and/or compress the high frequency components to lower frequencies.
In the case of both embodiments, the mapping technique may include replacing frequency components in a range from f1 to f2 with frequency components in a second, lower range of f3 to f4 according to a mapping:
The method may include, when effecting the manipulation of the high frequency components, at least partially preserving a harmonic relationship between the components.
Further, the method may include manipulating the high frequency components using a logarithmic compression technique.
The method may include dividing the sound signal into a number of discrete frequency components and obtaining frequency components fi above the speech frequency band for an output signal according to a mapping:
S(fn*i+c)→S(fi),
where n is a positive integer and c is a constant integer.
Instead, the method may include dividing the sound signal into a number of discrete frequency components and obtaining frequency components fi above the speech frequency band for an output signal according to a mapping:
S(fn*i+ci)→S(fi),
where n is a positive integer and ci is adjusted for each i to select that frequency component with maximum energy out of frequency components fn*i to f(n+1)*i−1.
In yet a further embodiment the method may include performing frequency transposition of the sound signal using a Laguerre transform.
Preferably, the method includes further manipulating the frequency of the high frequency components by signal amplification. Further, the method may include applying the signal amplification so as to maintain consistent relative gain across frequency for the high frequency components.
The method may be implemented using a hearing aid in each ear of the listener, the method including applying the signal amplification so as to maintain consistent relative gain between the two ears for the high frequency band of each ear.
The method may include changing the relative amplitude of each frequency component of the sound independently before and/or after manipulation of the high frequency components.
Further, the method may include enabling the listener to discontinue manipulation of the high frequency components.
In a development of the invention, the method may include
receiving auxiliary audio signals to be rendered as virtual audio; and
incorporating the auxiliary audio signals to produce an output audio signal including a virtual audio component.
The method may include processing the auxiliary audio signals using virtual audio space techniques to create an effect for the listener that the sound originate at specific locations in a personal auditory space around the listener's head. The virtual audio space techniques are described in greater detail in PCT/AU01/00038 filed 16 Jan. 2001 and entitled “The generation of customised three dimensional sound effects for individuals”, the contents of which are incorporated herein by reference.
According to second aspect of the invention, there is provided equipment for enhancing sound heard by a hearing-impaired listener, the equipment comprising
at least one hearing aid device comprising:
The equipment may include a listener operable interface for enabling the listener to disable the auxiliary signal processing arrangement.
The equipment may include a discriminator in communication with the auxiliary signal processing arrangement, the discriminator discriminating between the frequencies of the components of the sounds and being operable to activate the auxiliary signal processing arrangement only for time windows in which there is an occurrence of high frequency energy above a predetermined threshold in the high frequency band.
The housing may be configured to minimally disrupt acoustic filtering of an outer ear of the listener.
The auxiliary signal processing arrangement may manipulate the high frequency components by at least one of compressing the high frequency components across a frequency range and shifting the high frequency to lower frequencies.
At least one of the primary signal processing arrangement and the auxiliary signal processing arrangement may be further operable to manipulate the high frequency components by signal amplification.
The auxiliary signal processing arrangement may be interposed between the primary signal processing arrangement and the sensor.
The equipment may include two hearing aid devices, one for each ear of the listener. The signal processing arrangements of each of the hearing aid devices may be operable to amplify the high frequency sound components so as to maintain consistent gain between the two ears of the listener for each high frequency band.
In a development of the invention, the equipment may include a communications receiver in communication with the primary signal processing arrangement, the receiver receiving auxiliary audio signals to be rendered as virtual audio to produce an output audio signal including a virtual audio component. Then, the primary processing arrangement may be operable to process the auxiliary audio signals using virtual audio space techniques to create an effect for the listener that the sound originates at specific locations in a personal auditory space around the listener's head.
The invention is now described by way of example with reference to the accompanying drawings in which:—
In the drawings, reference numeral 10 generally designates equipment, in accordance with an embodiment of the invention, for enhancing sound heard by a hearing-impaired listener. The equipment 10 includes a housing 12 which houses hearing-aid electronics and components.
An acoustic sensor 14 is arranged on the housing for sensing acoustic signals. A sound delivery medium 16 is carried by the housing 12 and relays sound to the eardrum of a listener's ear carrying the equipment 10.
The components of the equipment 10 include a primary signal processor 18 which perform conventional hearing aid signal processing. An auxiliary signal processor 20 is interposed between the primary signal processor 18 and the sensor 14.
The auxiliary signal processor 20 is, optionally, controlled by a discriminator 22 which determines whether or not there are components of sound having a high energy frequency above a predetermined threshold in the high frequency band. In the preferred implementation of the invention though, the auxiliary signal processor 20 is operative always to do a frequency shift operation regardless of whether or not there are any high frequency sound components present. In this way, the need to detect the presence of the high frequency components above a certain threshold and, hence, the need for the discriminator is obviated.
In addition, externally accessible switches 24 and 25 are provided to enable the listener to deactivate the auxiliary signal processor 20. These switches are, optionally, controlled by the discriminator 22 to be deactivated when no high frequency sound components are present.
In a preferred implementation of the invention, the housing 12 is in the form of a completely-in-the-canal hearing aid housing to preserve acoustic filtering of an outer ear of the listener and, in so doing, to minimise adversely influencing monaural spectral cues provided by such acoustic filtering of the outer ear.
The sensor 14 is a broadband (20 Hz to 20 kHz) microphone. The sensor 14 converts incoming soundwaves into an electronic signal for onward transmission to the components of the equipment 10.
The auxiliary signal processor 20 is tailored to an individual listener's requirements by appropriate calibration so that, prior to use, the high frequency band applicable to that listener falls in the listener's optimal high frequency range.
The auxiliary signal processor 20 is operable to manipulate the sound component in the high frequency band. More particularly, the auxiliary signal processor 20 compresses the sound components across a frequency range and/or shifts the frequencies of the sound components in the high frequency baud to lower frequencies by means of the following mapping:
A block diagram of the processing operation of the auxiliary signal processor is shown in
at step 32, the 512 most recent samples are windowed with their respective windowing coefficients. The window used is a 512 taps Cosine window;
at step 34, the windowed data are transformed to the frequency domain using a 512 point Fast Fourier Transform (FFT). The outputs of the FFT are 512 frequency bins representing signal frequencies from DC (0 Hz) to 16 kHz with complex numbers;
at step 38, those frequency bins outside the speech frequency band are frequency shifted (transposed) by a transposition block. An example of such a transposition table is illustrated in
at step 40, the output of the transposition is transformed from the frequency domain to the time domain using a 512 point Inverse Fast Fourier Transform (IFFT);
at step 44, the output of the IFFT is windowed with a 512 taps Cosine window;
The output of the windowing block 44 is combined with its output of the previous cycle (256 samples ago) in block 46 using a 50% Overlap and Add method.
The digital samples resulting from this series of operations is turned into an analogue signal using a Digital to Analogue Converter (DAC) 48.
An output from the auxiliary signal processor 20 feeds the manipulated sound components to the primary signal processor 18. The primary signal processor 18 carries out conventional hearing aid compression and amplification processing. An output from the primary signal processor 18 feeds the sound delivery medium 16, which may be a normal hearing aid receiver.
Referring now to
In this embodiment, the frequency manipulation occurs in the time domain. Consequently, instead of the use of an FFT at step 36 and its IFFT at step 40, a time domain analysis filter bank is used at step 36 prior to the transposition step 38 and a time domain synthesis filter bank is used at a step 40 after the transposition step 38.
In yet a further embodiment of the invention, the auxiliary signal processor 20 divides the sound signal into a number of discrete frequency components and obtains frequency components fi above the speech frequency band for an output signal according to a mapping:
S(fn*i+c)→S(fi),
where n is a positive integer and c is a constant integer.
Once again, those frequency components or bins outside the speech frequency band are frequency shied (transposed) by a transposition block as shown in
In still another embodiment of the invention, the auxiliary signal processor divides the sound signal into a number of discrete frequency components and obtains frequency components fi above the speech frequency band for an output signal according to a mapping:
S(fn*i+ci)→S(fi),
where n is a positive integer and ci is adjusted for each i to select that frequency component with maximum energy out of frequency components fn*i to f(n+1)*i−1. An example of a transposition table for this embodiment is shown in
In yet a further embodiment of the invention, the auxiliary signal processor effects manipulation of the high frequency components by using a Laguerre Transform at step 36 instead of a FFT and, as a result, an Inverse Laguerre Transform at step 40 as shown in
The amplification of the previously high frequency sound components by the primary signal processor 18 is performed in such a manner so as to maintain a relative gain that is consistent as possible across the frequency components of the high frequency band.
In the embodiment of the invention where a listener wears two hearing aids, one in each ear, the amplification of the previously high frequency sound components by the primary signal processor 18 is also performed in such a manner that there is a relative gain that is as consistent as possible between the two ears for each frequency component within the high frequency band.
As indicated above, the conventional acoustic filtering provided by the outer ear of the listener is preserved by using a completely-in-the-canal housing 12 for the equipment 10. In the event that the listener has one unimpaired and one hearing impaired ear the listener can use the equipment 10 in the impaired ear with the unimpaired ear operating unassisted. Instead, in the case where the listener requires two hearing aids, each hearing aid can be implemented using the equipment 10.
In a development of the invention, the equipment 10 can be provided with a communications receiver 60 (
In the case of
It is an advantage of the invention that the high frequency spectral cues that vary most with directions in space, i.e. those having frequencies above 8 kHz, are presented to a hearing impaired listener in an audible form. Because the auditory system has greater frequency resolution at the lower frequencies, the manipulation of the high frequency components to those lower frequencies assists in compensating for the hearing impaired listener's decreased frequency selectivity.
In addition, because the auditory system of the listener is capable of re-learning monaural spectral cues for sound spatialisation, the listener is able to learn to use the altered spectral cues that result from the manipulation of the high frequency components to lower frequencies. The length of time necessary to adapt to these new cues is comparable to the time normally required to become acclimatised to the wearing of conventional hearing aids.
Yet another advantage of the invention is that it restores some degree of spatial hearing to a hearing impaired listener which provides a basis for speech segregation in noisy acoustic environments. The equipment 10 enhances the segregation of multiple talkers from one another as well as from other background noises by using binaural and spectral cues related to the different locations of the sound sources. These spectral cues also give rise to a clearer perception of externalised sound sources which aids in information unmasking.
Yet a further advantage of the invention is that it provides a basis for locating the sources of a sound which aids in normal acoustic navigation.
Still another advantage of the invention is that it makes use of high frequency information provided by the fricatives and plosives of speech to aid in the spatialisation of the speech. In addition, the invention provides a means to optimise the utilisation of spatial information by the hearing-impaired listener by customising the high frequency band to the listener's optimal high frequency hearing range.
It will be appreciated by persons skilled in the art that numerous variations and/or modifications may be made to the invention as shown in the specific embodiments without departing from the spirit or scope of the invention as broadly described. The present embodiments are, therefore, to be considered in all respects as illustrative and not restrictive.
Number | Date | Country | Kind |
---|---|---|---|
2003904207 | Aug 2003 | AU | national |
Filing Document | Filing Date | Country | Kind | 371c Date |
---|---|---|---|---|
PCT/AU2004/001068 | 8/10/2004 | WO | 00 | 1/31/2006 |
Publishing Document | Publishing Date | Country | Kind |
---|---|---|---|
WO2005/015952 | 2/17/2005 | WO | A |
Number | Name | Date | Kind |
---|---|---|---|
4837832 | Fanshel | Jun 1989 | A |
5144675 | Killion et al. | Sep 1992 | A |
20030072464 | Kates | Apr 2003 | A1 |
20040175010 | Allegro et al. | Sep 2004 | A1 |
Number | Date | Country |
---|---|---|
WO 8502085 | May 1985 | WO |
WO 9914986 | Mar 1999 | WO |
WO 0126418 | Apr 2001 | WO |
WO 0154453 | Jul 2001 | WO |
WO 2005004534 | Jan 2005 | WO |
Number | Date | Country | |
---|---|---|---|
20070127748 A1 | Jun 2007 | US |