This invention relates to a hearing-instrument system comprising an own-voice detector and to the method of identifying the user's own voice in a hearing-instrument system. In this context a hearing-instrument may be hearing aids, such as an in-the-ear (ITE), completely-in-canal (CIC) or behind-the-ear (BTE) hearing aids, headphones, headsets, hearing protective gear, intelligent earplugs etc.
The most common complaint about hearing aids, especially when someone starts wearing them for the first time, is that the sound of their own voice is to loud or that it sounds like they are talking into a barrel. Accordingly, there exists the need to identify the own voice of the user of a hearing aid to be able to process the users own voice in a different way than sound originating from other sound sources.
In prior art document WO 2004/077090 A1 there are described different methods for distinguishing between sound from the users mouth and sound originating from other sources. The methods described in WO 2004/077090 A1 have the drawback that the signals from two or more microphones are needed for the identification of the user's own voice.
Other known methods for identifying the user's own voice in a hearing aid, which are based on a quantity derived from a single microphone signal, are e.g. based on overall level, pitch, spectral shape, spectral comparison of auto-correlation and auto-correlation of predictor coefficients, cepstral coefficients, prosodic features or modulation metrics. It has not been demonstrated or even theoretically substantiated that these methods will perform reliable own-voice detection.
Another known method for identifying the user's own voice is based on the input from a special transducer, which picks up vibrations in the ear canal caused by vocal activity. While this method of own-voice detection is expected to be very reliable, it requires a special transducer, which is expected to be difficult to realize and costly.
The object of this invention is to provide a method of identifying the users own voice in a hearing-instrument system and a hearing-instrument system comprising an own-voice detector, which provides reliable and simple detection of the user's own voice.
The object of the invention is solved by a method according to claim 1 and by a hearing-instrument system according to claim 8. Further developments are characterized in the dependent claims.
In the method of identifying the user's own voice in a hearing-instrument system according to the invention, assessing whether the sound originates from the user's own voice or from another sound source is based on the direct-to-reverberant ratio (DtoR) between the signal energy of a direct sound part and that of a reverberant sound part of at least a part of a recorded sound. This method has the advantage that the direct-to-reverberant ratio (DtoR) allows very reliable detection of the user's own voice.
In accordance with a preferred embodiment of the invention, it is possible with this method to identify the user's own voice on the basis of the signal from one microphone as the direct-to-reverberant ratio (DtoR) is determined from the envelope of the signal energy.
From the direct-to-reverberant ratio (DtoR), it can be assessed whether the sound originates from a near-field sound source (the user's own voice) or from a far-field sound source by comparing the direct-to-reverberant ratio to an own-voice threshold value which can be determined empirically from experiments made in advance.
An even more reliable method for detecting the users own voice in a hearing-instrument system can be realized by independently determining the direct-to-reverberant ratio in a number of frequency bands and assessing whether the sound originates from the user's own voice on the basis of the direct-to-reverberant ratios of the number of frequency bands.
If assessing whether the sound originates from the user's own voice is based on a combination of the direct-to-reverberant ratio (DtoR) and another characteristic of the recorded sound, then there is the advantage that the own-voice detection will be more robust compared to the case in which detection is based only on the direct-to-reverberant ratio.
The invention will be more easily understood by the person skilled in the art from the following description of preferred embodiments in connection with the drawings. In the figures thereof:
In
The method of identifying the user's own voice in a hearing instrument system is based on the finding that the direct-to-reverberant ratio (DtoR) of a sound signal is higher if the sound originates from a near-field source—such as the user's own voice—than if the sound originates from a far-field sound source.
In a first step S1, a sound signal is recorded. In a next step S2, this recorded sound signal is partitioned into a number of frequency bands. In a third step S3, the signal energy is determined in short time intervals, e.g. 20 ms, in each frequency band to obtain the envelope of the signal energy. In a fourth step S4, usable sound events are identified in each frequency band, which allow a reliable estimation of the direct-to-reverberant ratio (DtoR). This is accomplished by examining the determined envelopes in successive segments of, for example, 700 ms. Thus, it is examined whether or not each successive segment comprises a sufficiently sharp onset (corresponding to the direct sound 1a, 2a) and an approximately exponentially decaying tail of sufficient duration (corresponding to the reverberant sound 1b, 1c, 2b, 2c). Accordingly, the identified usable sound events comprise a direct sound part and a reverberant sound part. In step S5, the sound events identified in step S4 are partitioned into direct and reverberant sound parts in each frequency band. In step S6, a direct-to-reverberant ratio (DtoR) between the signal energy of the direct sound part (1a; 1b) and that of the reverberant sound part (2a 3a; 2b, 3b) is calculated in each frequency band. Then, in a next step S7, all the individual direct-to-reverberant ratios (DtoR) of the different frequency bands are combined into a single final direct-to-reverberant ratio (combined direct-to-reverberant ratio). Therein the combined direct-to-reverberant ratio can be the average of the sub-band direct-to-reverberant ratios, for example. In step S8, this combined direct-to-reverberant ratio is compared with an own-voice threshold, wherein this own-voice threshold is determined empirically in experiments. If the combined direct-to-reverberant ratio is above the own-voice threshold then it is decided that the recorded sound signal is of the user's own voice. Otherwise it is decided that the recorded sound signal is not of the user's own voice.
If it is decided that the recorded sound signal is of the user's own voice, separate and dedicated signal processing can be activated in the hearing instrument before outputting the processed sound to the user.
In a modified embodiment, the method of identifying the user's own voice may be combined with the output of other own-voice detectors to obtain a final own-voice detector output which is more robust. The combination with other own-voice detectors can be done in such way that a flag is set for each own-voice detector assessing that the recorded sound signal is of the user's own voice. In this case, the final own-voice detector output determines that the recorded sound signal is the user's own voice if a predetermined number of flags is set. Due to the fact that the determination of the direct-to-reverberant ratio (DtoR) from the envelope of the signal energy involves a latency in the order of one second, it is preferable to combine the present invention with other faster own-voice detectors known in the prior art. In this way, the reliability of the own-voice detection based on the direct-to-reverberant ratio can be combined with the high speed of detection by other less reliable methods.
In the following, a hearing instrument system for performing the above described method is described with reference to
A hearing-instrument system 20 which can perform the above described method comprises a microphone 4, an A/D converter 5 connected to the microphone 4, a digital signal processing unit 6, the input of which is connected to the output of the A/D converter 5, a D/A converter 7, the input of which is connected to the output of the digital signal processing unit 6, and a loudspeaker 8 which is connected to the output of the D/A converter 7. The digital signal processing unit 6 includes a filter bank 9, a random access memory (RAM) 10, a read-only-memory (ROM) 11 and a central processing unit (CPU) 12.
The microphone 4 is means for recording a sound signal, the filter bank 9 is means for partitioning the recorded sound signal into a number of frequency bands and the CPU 12, the RAM 10 and the ROM 11 are means for determining the signal energy in short time intervals, for identifying usable sound events, for partitioning the sound events into direct and reverberant parts (1a, 2a, 3a; 1b, 2b, 3b), for calculating the direct-to-reverberant ratio (DtoR) in each frequency band and for combining the sub-band direct-to-reverberant ratios to a final combined direct-to-reverberant ratio as well as for comparing the combined direct-to-reverberant ratio (combined DtoR) with an own-voice threshold to decide whether or not the recorded sound signal originates from the user's own voice.
The hearing-instrument system may be hearing aids, such as an in-the-ear (ITE), completely-in-canal (CIC), behind-the-ear (BTE), or a receiver-in-the-ear (RITE) hearing aid.
Modifications from the above described preferred embodiments of the invention are possible. For example, it is described to partition a recorded sound signal into a number of frequency bands and to calculate a direct-to-reverberant ratio (DtoR) in each frequency band. However, it is also possible to realize the own voice detection of the invention in only one single broad frequency band. The before described hearing-instrument system uses digital signal processing. However, it is also possible to use analogue processing of the sound signals.
Number | Date | Country | Kind |
---|---|---|---|
07 101 796.6 | Feb 2007 | EP | regional |