The present invention relates to a method of controlling a hearing instrument based on identifying an acoustic environment, and a corresponding hearing instrument.
It is common for state-of-the-art hearing instruments to incorporate automatic control of actuators such as noise cancellers, beam formers, and so on, to automatically adjust the hearing instrument to optimise the sound output for the wearer dependent on the acoustic environment. Such automatic control is based on classifying types of acoustic environments into broad classes, such as “clean speech”, “speech in noise”, “noise”, and “music”. This is typically achieved by processing sound information from a microphone and extracting characteristic features of the sound information, such as energy spectrums, frequency responses, signal-to-noise ratios, signal directions, and so on. Based on the result of the extraction of characteristic features, parameters of the audio signal processing unit are adjusted to optimise the wearer's hearing experience in his or her present surroundings. This optimisation can be by means of predefined programs, or adjusting individual parameters as required.
With various prior art systems, there are several limitations: the detectable classes of acoustic environments are rather broad, leading to insufficient hearing performance for some specific hearing scenarios; extra hardware is often required, increasing costs, power consumption and complexity; and many of the prior art solutions rely on real-time communication between hearing devices and sometimes also a beacon or other separate module that has to be carried by the wearer. Real-time communication uses a lot of power, leading to short battery life and frequent battery changes.
Certain acoustic environments can be problematic for prior art hearing instruments, such as:
1. Driving a Car
While driving a car, different noises occur. E.g. the machine noise of the motor varies considerably, depending on the acceleration or speed of the car. This leads to a “nervous” behaviour of prior art hearing instruments, since the Noise Canceler (NC) is activated in and out. Also the main noise has the character of low frequencies, whereas important feedback signals of the car or of the traffic have the character of higher frequencies.
While communicating to a passenger in the car, the speech does not arrive from the front. The passenger's voice typically arrives from the side or from the back. The state-of-the art reaction of the HI is activating the Beam former (BF), which decreases the speech intelligibility.
2. Quiet at Home
In quiet situations at home, e.g. the humming of the fridge or air conditioning system is amplified by prior art hearing instruments, and disturbs the wearer. Also, the rustling of newspaper or noises from the neighbour can disturb the wearer while performing a quiet activity at home, such as reading.
3. Quiet in Nature
In contrast to the “quiet at home” scenario above, most of the end-users want to listen every little event while they are in nature and observing e.g. birds. In such a situation it is advantageous to enhance soft sounds, whereas in the “quiet at home” situation it is advantageous to diminish soft sounds.
As a result, it is beneficial to distinguish between “quiet at home” and “quiet in nature”, which is not possible to perform reliably with prior art hearing instruments.
4. Watching TV
A TV broadcasts audio signals with high variety in short time. The state-of-the art classification tries to follow the audio signal changes and makes prior art hearing instrument behaviour appear “nervous”, frequently switching modes. In addition, the most important class “speech in noise” does not assist in speech intelligibility on a TV signal, since the target and the noise signal are coming from the same direction.
It is thus desirable for the TV signal to be detected as a TV signal, so that the hearing device could for instance launch a program with suitable constant actuator settings, or distinguish only between “understanding speech” and “listening to music”.
It is thus advantageous to be able to distinguish at least the above-mentioned scenarios and thereby increase the overall number of acoustic environments that can be automatically determined.
The object of the present invention is thus to overcome at least one of the above-mentioned limitations of the prior art.
In the context of the invention, by hearing instruments we understand hearing aids, which may be situated in the ear, behind the ear, or as cochlea implants, active hearing protection for loud noises such as explosions, gunfire, industrial or music noise, and also earpieces for communication devices such as two-way radios, mobile telephones etc. which may communicate by Bluetooth or any other protocol. A hearing instrument may comprise one single hearing device (e.g. a single hearing aid), two hearing devices (e.g. a pair of hearing aids either acting independently or linked in a binaural system, or a single hearing aid and an external control unit), or three or more hearing devices (e.g. a pair of hearing aids as previously, combined with an external control unit).
This is achieved by a method of controlling a hearing instrument comprising at least one hearing device, the method comprising the steps of:
wherein X and Y are functions, γXY is the complex coherence between the two functions, and asterisks denote complex conjugates of the relevant functions. For simplicity, frequency dependence has been omitted from the above equation. Using complex coherence calculated based on sound information received by the first and second transducer, many more classes of acoustic environments can be distinguished than with previous methods, particularly when used in addition to existing methods as an extra characteristic enabling refinement of the determination of the acoustic environment. Since the complex coherence is a single complex number (or a single complex number per desired frequency band if calculating in frequency bands), computation utilising it is extremely fast and simple.
In an embodiment, the first transducer is a pressure microphone and the second transducer is a particle velocity transducer, which may be of any type, both being situated in the same hearing device in an acoustically-coincident manner, i.e. no more than 10 mm, better no more than 4 mm apart, and the complex coherence is calculated based on the sound pressure measured by the pressure microphone and the particle velocity measured by the particle velocity transducer, these two transducers being situated in the same hearing device, i.e. one individual hearing device. In this case, the complex coherence is computed as (equation 2):
wherein P is the sound pressure at the pressure microphone and U is the particle velocity measured by the particle velocity transducer. Both signals are in the frequency domain. Angled brackets indicate an averaging procedure necessary for the calculation of the coherence from discrete time and finite duration signals, such as the well-known Welch's Averaged Periodogram. The time frames for the averaging would typically be between 5 ms and 300 ms long, and should be smaller than the reverberation time in the rooms to be characterised.
This has the particular advantage of giving accurate results, since the particle velocity is measured directly.
In an embodiment, the particle velocity transducer is a pressure gradient microphone, or hot wire particle velocity transducer. This gives concrete forms of the particle velocity transducer.
In an alternative embodiment, the first transducer is a first pressure microphone i.e. an omnidirectional pressure microphone, and the second transducer is a second pressure microphone, which may likewise be an omnidirectional pressure microphone. This enables utilisation of current transducer layouts.
In an embodiment, these two microphones are situated in the same hearing device, e.g. integrated in the shell of one hearing device, in which case the complex coherence is calculated using equation 2 as above, however with the following substitutions (equation 3):
wherein P is the mean pressure between the sound pressure at the first and second microphones (P1 and P2 respectively), and (equation 4a):
or (equation 4b):
wherein U is the particle velocity, P1 and P2 are the sound pressure at the first and second microphones respectively, k is the wave number, c is the speed of sound in air, ρ0 is the mass density of air, ω is the angular frequency, j is the square root of −1, and Δx is the distance between the first and second pressure microphones.
This embodiment enables the advantages of the invention to be applied to pre-existing dual-microphone hearing devices, such as hearing devices incorporating adjustable beamforming function.
In an alternative embodiment incorporating two pressure microphones, each microphone is situated in a different hearing device, i.e. one in a first hearing device (e.g. a first hearing aid) and one in a second hearing device (e.g. a second hearing aid), the combination of the first and second hearing devices forming at least part of the hearing instrument, in which case the complex coherence is calculated as (equation 5):
wherein P1 is sound pressure at the first transducer and P2 is the sound pressure at the second transducer.
This can advantageously be incorporated into existing binaural hearing instruments with a single microphone (or multiple microphones) on each individual hearing device.
In an embodiment, since information is required to be exchanged between the two hearing devices, the first and second hearing devices send and/or receive signals relating to the received sound information to/from the other hearing device, thus enabling the complex coherence between P1 and P2 as above to be calculated.
In an embodiment, data is exchanged between a first processing unit in the first hearing device and the second processing unit in the second hearing device.
In an embodiment, digitised signals corresponding to sound information received at each microphone is exchanged between each hearing device, the signals corresponding to sound information in either the time domain or the frequency domain. This provides the processing unit in each hearing device with full information.
Alternatively, digitised signals corresponding to sound information at one microphone are transmitted from the second hearing device to the first hearing device, and signals corresponding to commands for adjusting sound process parameters are transmitted from the first hearing device to the second hearing device. This enables calculation of the complex coherence (and optionally other characteristic features) in a single hearing device, the resulting commands for adjusting sound process parameters being transmitted back to the other hearing device.
Alternatively, one hearing device processes sound information for determining the complex coherence in a first frequency band, e.g. low-frequency, and the other hearing device processes sound information in a second frequency band, e.g. high-frequency. In this case, the sound information in the respective frequency ranges is transmitted to the other hearing device, and the result of the processing is transmitted back. This enables the calculation of the complex coherence to be performed without redundancy: the first hearing device thereby calculates the complex coherence in the first frequency band (e.g. low frequency), and the second hearing device calculates the complex coherence in the second frequency band (e.g. high-frequency), the two hearing devices mutually exchanging the sound information required for their respective calculations, and the results of their respective calculations.
In an embodiment, the characteristic features further comprise at least one of: signal-to-noise ratio in at least one frequency band; signal-to-noise ratio in a plurality of frequency bands; noise level in at least one frequency band; noise level in a plurality of frequency bands; direction of arrival of noise signals; direction of arrival of useful signal; signal level; frequency spectra; modulation frequencies; modulation depth; zero crossing rate; onset; center of gravity; RASTA, etc.
This enables the present invention to improve on the resolution of the classification of acoustic environments that can be distinguished.
In combination with any of the above embodiments, the complex coherence may be calculated in a single frequency band, e.g. encompassing the entire audible range of frequencies (normally considered as being 20 Hz to 20 kHz), which is simple, or, for more accuracy and resolution, the complex coherence may be calculated in a plurality of frequency bands spanning at least the same frequency range. In an embodiment, the plurality of frequency bands has a linear resolution of between 50 Hz and 250 Hz or a psychoacoustically-motivated non-linear frequency resolution, such as octave bands, Bark bands, other logarithmically arranged bands, etc. as known in the literature. Incorporating a frequency-dependence enables significantly increased discernment of various acoustic environments, as will be illustrated later.
The above-mentioned method embodiments may be combined in any non-contradictory manner.
The invention further concerns a hearing instrument comprising at least one hearing device; a least a first transducer and a second transducer; at least one processing unit (which could be multiple processing units in one or more hearing devices, arranged as convenient) operationally connected to first transducer and the second transducer; an output transducer operationally connected to an output of the least one processing unit, wherein the at least one processing unit comprises means for processing sound information received by the first transducer and the second transducer so as to extract at least one characteristic feature of the sound information; means for determining a type of acoustic environment selected from a plurality of predefined classes of acoustic environment based on the at least one extracted characteristic feature; means for adjusting sound processing parameters based on the determined type of acoustic environment, the sound processing parameters defining an input/output behavior of the at least one hearing device and controlling, for instance, active beamformers, noise cancellers, filters and other sound processing; wherein said at least one characteristic feature comprises a complex coherence calculated based on the sound information received by the first transducer and the second transducer. As above, using complex coherence calculated based on sound information received by the first and second transducer, many more classes of acoustic environments can be distinguished than with previous methods, particularly when used in addition to existing methods as an extra characteristic enabling refinement of the determination of the acoustic environment. Since the complex coherence is a single complex number (or a single complex number per desired frequency band if calculating in frequency bands), computation utilising it is extremely fast and simple.
In an embodiment, the first transducer is a pressure microphone and the second transducer is a particle velocity transducer, both being situated in the same hearing device in an acoustically-coincident manner, i.e. no more than 10 mm, better no more than 4 mm apart, the complex coherence determined being that of the complex coherence between the sound pressure measured by the pressure microphone and a particle velocity measured by the particle velocity transducer, which may be of any type.
In an embodiment, the particle velocity transducer is a pressure gradient microphone or a hot wire particle velocity transducer. These are concrete examples of such transducers.
In an embodiment, the first transducer is a first pressure microphone, i.e. an omnidirectional pressure microphone, and the second transducer is a second pressure microphone, i.e. likewise an omnidirectional pressure microphone. This enables utilisation of current transducer layouts.
In an embodiment, these two microphones are situated in the same hearing device, e.g. integrated in the shell of one hearing device, in which case the complex coherence is calculated as described above in relation to equations 2, 3, 4a and 4b. This embodiment enables the advantages of the invention to be applied to pre-existing dual-microphone hearing devices, such as hearing devices incorporating beamforming function.
In an alternative embodiment incorporating two pressure microphones, each microphone is situated in a different hearing device, i.e. one in a first hearing device (e.g. a first hearing aid) and one in a second hearing device (e.g. a second hearing aid), the combination of the first and second hearing devices forming at least part of the hearing instrument, in which case complex coherence is calculated as in equation 5 above. This can advantageously be incorporated into existing binaural hearing instruments with a single microphone (or multiple microphones) on each individual hearing device.
In an embodiment, the first and second hearing devices each comprise at least one of a transmitter, a receiver, or a transceiver, for sending and receiving signals as appropriate to and from the other hearing device as appropriate. This enables the transmission and reception of sound information, data, commands and so on between the first and second hearing devices.
In an embodiment, the signals sent between the two hearing devices relate to sound information in either the time domain or the frequency domain. This provides the processing unit in each hearing device with full information.
In an embodiment, above-mentioned signals relate to data exchanged between a first processing unit in the first hearing device and the second processing unit in the second hearing device.
In an embodiment, the second hearing device is arranged to transmit digitised signals corresponding to sound information at one microphone to the second hearing device, and the second hearing device is arranged to transmit signals corresponding to commands for adjusting sound process parameters to the first hearing device, each hearing device being arranged to receive signals transmitted by the contra-lateral (i.e. the other) hearing device. This enables calculation of the complex coherence (and optionally other characteristic features) in a single hearing device, the resulting commands for adjusting sound process parameters being transmitted back to the other hearing device.
In an embodiment, the first hearing device comprises a first processing unit for processing sound information situated in a first frequency band and the second device comprises a processing unit for processing sound information situated in a second frequency band, wherein each hearing device is arranged to transmit the sound information required by the contra-lateral device via its transmitter or transceiver, and after processing, each hearing device further being arranged to transmit the result of said processing to the contra-lateral hearing device via its transmitter or transceiver, each hearing device being further arranged to receive the signals transmitted by the contra-lateral hearing device by means of its receiver or transceiver. This enables the calculation of the complex coherence to be performed without redundancy: the first hearing device thereby calculates the complex coherence in the first frequency band (e.g. low frequency), and the second hearing device calculates the complex coherence in the second frequency band (e.g. high-frequency), the two hearing devices mutually exchanging the sound information required for their respective calculations, and the results of their respective calculations.
In an embodiment, the characteristic features further comprise at least one of: signal-to-noise ratio in at least one frequency band; signal-to-noise ratio in a plurality of frequency bands; noise level in at least one frequency band; noise level in a plurality of frequency bands; direction of arrival of noise signals; direction of arrival of useful signal; signal level; frequency spectra; modulation frequencies; modulation depth; zero crossing rate; onset; center of gravity; RASTA, etc.
This enables the present invention to improve on the resolution of the classification of acoustic environments that can be distinguished.
In an embodiment, at least one processing unit is arranged to calculate the complex coherence in a single frequency band, e.g. encompassing the entire audible range of frequencies (normally considered as being 20 Hz to 20 kHz), which is simple, or, for more accuracy and resolution, in a plurality of frequency bands spanning at least the same frequency range. In an embodiment, the plurality of frequency bands has a linear resolution of between 50 Hz and 250 Hz, or a psychoacoustically-motivated non-linear frequency resolution, such as octave bands, Bark bands, other logarithmically arranged bands, etc. as known in the literature. This latter enables significantly increased discernment of various acoustic environments, as will be illustrated later.
The above-mentioned hearing instrument claims may be combined in any manner that is not contradictory.
The invention will now be illustrated by means of example embodiments, which are not be considered as limiting, as shown in the following figures:
In the figures, like components are illustrated with like reference signs.
Although signal processing unit 4, data processing unit 5 and determination unit 6 have been illustrated and described as separate functional blocks, they may be integrated into the same processing unit and implemented either in hardware or software. Likewise, they may be divided over or combined in as many functional units as is convenient. This equally applies to all of the below embodiments.
As described above, transducer 1 is a pressure microphone, and transducer 2 may be either a second pressure microphone or a particle velocity transducer such as a pressure gradient microphone, a hot wire particle velocity transducer, or any other equivalent transducer. In each case, the complex coherence is calculated as described above. As a variation, the digitised output of one single transducer (i.e. the/one pressure microphone) may be used as an input for the signal processing unit, the output of both transducers being used for determining the complex coherence and other characteristic features.
This embodiment is particularly suited for a type of distributed processing, for which the data processing unit 5 can be utilised.
In the distributed processing embodiment, the complex coherence in one frequency range, e.g. low-frequency, is calculated in one data processing unit 5 in one individual hearing device L, R, (hereinafter “ipsi-lateral”) and the complex coherence of a second frequency range, e.g. high frequency, is calculated in the other data processing unit 5 in the other individual hearing device R, L (hereinafter “contra-lateral”). The definition of “low” and “high” frequencies is chosen for convenience, e.g. “low” frequencies may be frequencies below 4 kHz and “high” frequencies may be frequencies above 4 kHz. Alternatively the cut-off point may be 2 kHz, for instance. Only one data processing unit 5 is illustrated here, the other being merely a mirror image in terms of frequency ranges, that is to say where high and low frequencies are discussed in terms of the ipsi-lateral hearing device of
The following describes one variant of how the complex coherence is used to define the acoustic environment. (n.b.: Although the complex coherence in the following is given as γPU, the same relations hold for γPP as appropriate):
Inside of Car
Inside a car, at low frequencies (<300 Hz) the coherence is imaginary (|∠γPU|>60°, Im{γPU}>0.75) and at high frequencies (>2 kHz) it is real (|∠γPU|<30°) and close to zero (|γPU+0.2)
Quiet at Home Vs. Quiet in Nature
In a quiet situation in nature, the real value of the coherence approaches unity, (|∠γPU|<30°, Re{γPU}>0.75). In a quiet situation at home, the real value of the coherence is close to zero (|∠γPU<30°, Re{γPU}<0.2). This is true at all frequencies where the source signals have a certain signal power that is typical for a quiet situation.
TV at Home
When watching TV, the small distance to the sound source (the TV) results in a high direct-to-reverberant ratio (|∠γPU|<30°, Re{γPU}>0.75). The lower frequency above which this holds is dependent on the room size but is typically not higher than 200 Hz.
In addition to determining the acoustic environments, the complex coherence can also be used to help in determining various other useful parameters:
Number of Speakers
The sound field due to a low number of discrete sources positioned at various angles leads to a decrease of the real value of the coherence from unity but a distinction from a diffuse field can be made due to spectral/temporal orthogonality of the sources or due to different dynamics of the coherence values. Combining the coherence estimate further with the SNR estimated from classical features further helps in the distinction. For example, a low SNR and high coherence can only be achieved with a low number (<8) of sources (|∠γPU|<30°, |γPU| varies according to the number of sources). This is true at all frequencies where the source signals have sufficient signal power above the noise floor.
SNR (Signal-to-Noise Ratio) Estimation
The SNR in a mixed direct/diffuse field situation is related by a non-linear function to the real value of the coherence: (|∠γPU<30°, |γPU| varies according to SNR). This is true at all frequencies where the source signals have sufficient signal power above the noise floor.
Detection of Reverberant Environments
Reverberant environments can be detected by calculating the coherence either with (i) different FFT (Fast Fourier Transform) block sizes, i.e. time frames, (ii) PSD (Power Spectral Density) averaging with different averaging constants or (iii) PSD averaging over different number of FFT bins. In either case the transition from unity (long FFT block size or short averaging constants with respect to reverberation time) to the asymptotic direct-to-reverberant energy ratio (DRR) value (small FFT block size or long averaging constants with respect to reverberation time) depends on the reverberation time (|∠γPU<30°, |γPU| varies with the FFT block size or averaging time constant). This is true at all frequencies where the source signals have sufficient signal power above the noise floor.
Although the invention has been explained in terms of specific embodiments, these are not to be constituted as limiting the invention, which is solely defined by the appended claims, incorporating all variations falling within their scope.
Filing Document | Filing Date | Country | Kind | 371c Date |
---|---|---|---|---|
PCT/EP2012/057464 | 4/24/2012 | WO | 00 | 10/23/2014 |
Publishing Document | Publishing Date | Country | Kind |
---|---|---|---|
WO2013/159809 | 10/31/2013 | WO | A |
Number | Name | Date | Kind |
---|---|---|---|
7319769 | Allegro-Baumann | Jan 2008 | B2 |
8295497 | Roeck | Oct 2012 | B2 |
20130054231 | Jeub | Feb 2013 | A1 |
Number | Date | Country |
---|---|---|
1 465 456 | Oct 2004 | EP |
1 670 285 | Jun 2006 | EP |
2012007183 | Jan 2011 | WO |
Entry |
---|
Kuster, “Estimating the Direct-to-reverberant Energy Ration From the Coherence Between Coincident Pressure and Particle Velocity”, The Journal of the Acoustical Society of America, American Institute of Physics for the Acoustical Society of America, New York, NY, US, vol. 130, No. 6, Dec. 1, 2011, pp. 3781-3787. |
International Search Report for PCT/EP2012/057464 dated Jan. 16, 2013. |
Written Opinion for PCT/EP2012/057464 dated Jan. 16, 2013. |
Number | Date | Country | |
---|---|---|---|
20150110313 A1 | Apr 2015 | US |