The present invention is related to a method for operating a hearing device as well as to a hearing device adapted to perform the method. In particular, the present invention is directed at detecting a hearing device user's voice activity, i.e. so-called “own-voice detection”, to be used in conjunction with operating a hearing device.
A frequent complaint of users of hearing devices, especially when they start wearing them for the first time, is that the sound of their own voice is too loud or that it sounds like they are talking into a barrel. Both effects are particularly pronounced when the ear canal (commonly also referred to as the auditory canal) is sealed, e.g. by an otoplastic. Accordingly, there exists the need to identify the presence or activity of the own voice of the user of a hearing device to be able to process the user's own voice in a different way than sound originating from other sources.
Methods for own-voice detection are commonly based on quantities that can be derived from a single microphone signal measured at an ear of a user, such as for example overall level, pitch, spectral shape, spectral comparison of auto-correlation and auto-correlation of predictor coefficients, cepstral coefficients, prosodic features, or modulation metrics. However, the degree of achieving reliable own-voice detection is rather poor when using methods based on such measures.
EP 1 956 589 A1 discloses a method for identifying the user's own voice by assessing a direct-to-reverberant ratio between the signal energy of a direct sound part and that of a reverberant sound part of at least a portion of a recorded sound. It is stated that this allows a very reliable own-voice detection. However, to achieve this a rather complex signal analysis is required.
WO 2004/077090 discloses a method for detection of own voice activity in a communication system which seeks to improve detection reliability. Hereto, own-voice detection is based on a combination of a number of individual detectors, each of which may be error-prone, whereas the combined detector is asserted to be robust. A signal processing unit is utilised to receive signals from at least two microphones worn on the user's head, which are then processed so as to distinguish as well as possible between sound from the user's mouth and sounds originating from other sources. The distinction is based on the specific characteristics of the sound field produced by own voice, which are due to the fact that the microphones are in the acoustical near-field of the hearing device user's mouth and in the far-field of the other sources of sound, and that arise because the mouth is located symmetrically with respect of the user's head. The combined detector then detects the presence of own-voice when each of the individual characteristics of the signal are in respective ranges. This method too has a relatively high complexity.
Alternatively, a transducer which picks up vibrations within the ear canal caused by vocal activity of the user can be employed.
U.S. Pat. No. 6,041,129 discloses a hearing aid which uses an accelerometer or other rigid body motion sensor attached to the surface of the hearing aid at a point where it most closely comes in contact with the solid portion of the auditory canal. In this way, the accelerometer can sense directly the conductive sound waves created by the user's own voice. Such sound waves can then be either amplified or attenuated, and subsequently mixed with air-borne sound detected by the microphone depending on the user's needs.
US 2007/0009122 A1 discloses a method of own-voice detection achieved by providing a microphone in the auditory channel whose signal level is compared with that of an external microphone.
It is an object of the present invention to provide a method for operating a hearing device which performs own-voice detection in a reliable and simple manner.
Within the context of the present invention hearing devices for instance comprise hearing aids, such as in-the-ear (ITE), completely-in-canal (CIC) or behind-the-ear (BTE) hearing aids, earphones, hearing protection devices, as well as ear-level communication, noise reduction and sound enhancement devices.
The object of the invention is achieved by the method according to claim 1 and by the hearing device according to claim 18. Specific embodiments are provided in the dependent claims.
The present invention is first directed to a method for operating a hearing device comprising at least one ambient microphone, a signal processing unit, a receiver and an ear canal microphone, the method comprising the steps of:
An ear canal microphone refers to any type of sound pressure sensor, including for instance a piezo sensor or an accelerometer, intended to be located within the ear canal of the user during use of the hearing device.
A transfer function G(f) at least comprising a transfer function T(f) from a first signal port A to a second signal port B refers to a transfer function G(f) that is representative of the transfer function T(f) and could possibly comprise one or more further transfer functions T′(f), T″(f), . . . , e.g. G(f)=T(f)·T′(f)·T″(f), f being frequency, T′(f) for instance being a transfer function of a receiver, and T″(f) for instance being a transfer function of an ear canal microphone, such that the transfer function G(f) is representative of an overall transfer function Ttot(f) from a third signal port C, located “upstream” from signal port A (e.g. a receiver input), to a fourth signal port D, located “downstream” from signal port B (e.g. an ear canal microphone output).
In an embodiment of the present invention the transfer function of the first filter at least comprises a transfer function from the input of the receiver to the output of the ear canal microphone when the hearing device is turned on and being worn in an ear canal of the user, i.e. the transfer function of the first filter further includes the transfer functions of the receiver and the transfer function of the ear canal microphone.
In this way an estimate of the sound component within the ear canal originating from the receiver is taken into account and removed from the second audio signal provided by the ear canal microphone. This yields a good approximation of the own-voice signal possibly present within the ear canal based upon which own-voice activity can be discerned.
In a further embodiment of the method the step of detecting is further based on the first audio signal. In this way the ambient sound component, consisting of sound from the user's environment as well as possibly of the user's voice originating from his mouth, which enters the ear canal, e.g. via a vent of the hearing device, is taken into account. By for instance additionally removing the ambient sound component from the second audio signal provided by the ear canal microphone, an improved approximation of the own-voice signal present within the ear canal can be achieved, thus yielding an improved detection of own-voice activity.
In a further embodiment the method further comprises the step of filtering the first audio signal with a second filter having a transfer function representative of a real-ear occluded gain (REOG) transfer function, the second filter providing a filtered first audio signal. A real-ear occluded gain (REOG) transfer function is defined from the output of the ambient microphone to the output of the ear canal microphone while the hearing device is inserted in the ear canal of the user. The REOG transfer function can for example be determined by comparing the output signals of the ambient microphone and the ear canal microphone when the receiver of the hearing device is turned off or muted. By doing this an improved estimate of the ambient sound component is achieved by taking into account the way the ambient sound component is affected by for instance the vent or other direct sound paths from the outside of the ear canal past the hearing device towards the ear drum (also referred to as tympanic membrane). In this way a further improved detection of own-voice activity is achieved.
In a further embodiment of the method filtering the first audio signal is carried out in the log/dB domain, e.g. by simply subtracting a magnitude expressed in decibels (and not considering phase). Since the phase of the real-ear occluded gain (REOG) transfer function is typically not known precisely, performing only frequency-dependent amplitude weighting simplifies the filtering process.
In a further embodiment of the method the second filter is adapted online, i.e. in real-time, during operation of the hearing device, for instance by means of a least mean squares (LMS) algorithm. In this way the time-variability of the REOG transfer function due to variations of the ear canal geometry for instance caused by movements of the jaw are taken into account. Moreover, different positioning/seating of the hearing device within the ear canal as well as for instance clogging of the vent with earwax (cerumen) or debris can be taken into account in this way.
In a further embodiment of the method the transfer function of the second filter is determined based on a first measurement of the REOG transfer function, the first measurement for instance being made when the hearing device is fitted to the needs of the user.
In a further embodiment of the method the transfer function of the second filter is determined based on at least one further measurement of the real-ear occluded gain (REOG) transfer function, the at least one further measurement for instance being made when the hearing device and/or the jaw of the user is positioned differently compared to that when the first measurement was made. In this way an average REOG transfer function can be determined for the user.
In a further embodiment of the method the first filter is adapted online, i.e. in real-time, during operation of the hearing device, for instance by means of a further least mean squares (LMS) algorithm. In this way the time-variability of the sound transmission within the ear canal from the receiver to the ear canal microphone due to variations of the ear canal geometry for instance caused by movements of the jaw are taken into account. Moreover, different positioning/seating of the hearing device within the ear canal as well as for instance clogging of the vent with earwax (cerumen) or debris can be taken into account in this way.
In a further embodiment of the method the transfer function of the first filter is determined based on an initial measurement of the transfer function from the output (or input) of the receiver to the input (or output) of the ear canal microphone when the hearing device is turned on and being worn in the ear canal of the user, the initial measurement for instance being made when the hearing device is fitted to the needs of the user.
In a further embodiment of the method the transfer function of the first filter is determined based on at least one additional measurement of the transfer function from the output (or input) of the receiver to the input (or output) of the ear canal microphone when the hearing device is turned on and being worn in the ear canal of the user, the at least one additional measurement for instance being made when the hearing device and/or the jaw of the user is positioned differently compared to that when the initial measurement was made. In this way an average transfer function from the receiver to the ear canal microphone can be determined for the user.
In a further embodiment of the method the step of detecting comprises determining a first power estimate of the third audio signal.
In a further embodiment of the method the step of detecting comprises determining a second power estimate of the first audio signal or of the filtered first audio signal.
In a further embodiment of the method determining the first and/or the second power estimate comprises at least one of squaring, determining an absolute value, conversion into decibels, and low-pass filtering.
In a further embodiment of the method the step of detecting the presence of own-voice comprises one of:
In a further embodiment of the method the step of detecting the presence of own-voice is dependent on a “characteristic curve”/“discriminator function”, such as for instance a step function, a ramp function (with a lower and an upper threshold value), a sigmoid function, or a hysteresis function. In this way for instance a binary function discerning that own-voice is either “present” or “absent” can be assigned. Frequent, uncertain toggling between these two states can be prevented by introducing a hysteresis. Alternatively, a probability, e.g. a value between 0 and 1, can be assigned to the detection of own-voice. Smoothing, averaging or low-pass filtering can also be applied as part of the step of detecting in order to avoid rapid fluctuations in the output of the detection process.
In a further embodiment of the method the hearing device further comprises at least one of an active occlusion control unit, a classifier (i.e. a classification unit), a gain model, a noise canceller, a beamformer, a reverberation canceller, and a wind noise canceller, and the method further comprises the step of controlling at least one of the active occlusion control unit, the classifier, the gain model, the noise canceller, the beamformer, the reverberation canceller, and the wind noise canceller dependent on the presence of own-voice.
In a further embodiment of the method controlling the active occlusion control unit comprises turning off the active occlusion control unit when the presence of own-voice is not detected. By doing so possible artefacts introduced by the active occlusion control unit can be reduced and furthermore power can be saved by operating the active occlusion control unit only in those instances when own-voice is actually considered present.
Moreover, the present invention is further directed to a hearing device comprising:
In an embodiment of the hearing device the output of the ambient microphone is further connected to a further input of the detector, and wherein the detector is adapted to detect a presence of own-voice of the user further based on a signal provided at the further input of the detector.
In a further embodiment the hearing device further comprises a second filter having a transfer function representative of a real-ear occluded gain (REOG) transfer function, specifically a transfer function from the input of the ambient microphone to the input of the ear canal microphone when the hearing device is turned off and being worn by the user in the ear canal, wherein the output of the ambient microphone is connected to an input of the second filter and an output of the second filter is connected to the further input of the detector.
In a further embodiment of the hearing device the second filter is adapted to perform filtering in the log/dB domain.
In a further embodiment of the hearing device the second filter is adaptable online, i.e. in real-time, during operation of the hearing device, for instance by means of a least mean squares (LMS) algorithm.
In a further embodiment of the hearing device the transfer function of the second filter is based on a first measurement of the REOG transfer function, the first measurement for instance being made when the hearing device is fitted to the needs of the user.
In a further embodiment of the hearing device the transfer function of the second filter is based on at least one further measurement of the REOG transfer function, the at least one further measurement for instance being made when the hearing device and/or the jaw of the user is positioned differently compared to that when the first measurement was made.
In a further embodiment of the hearing device the first filter is adaptable online, i.e. in real-time, during operation of the hearing device, for instance by means of a further least mean squares (LMS) algorithm.
In a further embodiment of the hearing device the transfer function of the first filter is based on an initial measurement of the transfer function from the output (or input) of the receiver to the input (or output) of the ear canal microphone when the hearing device is turned on and being worn in the ear canal of the user, the initial measurement for instance being made when the hearing device is fitted to the needs of the user.
In a further embodiment of the hearing device the transfer function of the first filter is based on at least one additional measurement of the transfer function from the output (or input) of the receiver to the input (or output) of the ear canal microphone when the hearing device is turned on and being worn in the ear canal of the user, the at least one additional measurement for instance made when the hearing device and/or the jaw of the user is positioned differently compared to that when the initial measurement was made.
In a further embodiment of the hearing device the detector comprises a first power estimator adapted to determine a power estimate of the signal provided at the input of the detector.
In a further embodiment of the hearing device the detector comprises a second power estimator adapted to determine a power estimate of the signal provided at the further input of the detector.
In a further embodiment of the hearing device the first and/or the second power estimator comprises at least one of a squaring unit, an absolute value unit, a conversion into decibels unit, and a low-pass filter.
In a further embodiment of the hearing device the detector comprises at least one of:
In a further embodiment of the hearing device the detector is adapted to detect the presence of own-voice of the user dependent on a “characteristic curve” / “discriminator function”, such as for instance a step function, a ramp function, a sigmoid function, or a hysteresis function.
In a further embodiment the hearing device further comprises at least one of an active occlusion control unit, a classifier, a gain model, a noise canceller, a beamformer, a reverberation canceller, a wind noise canceller, and a controller adapted to control at least one of the active occlusion control unit, the classifier, the gain model, the noise canceller, the beamformer, the reverberation canceller, and the wind noise canceller dependent on the presence of own-voice.
In a further embodiment of the hearing device the controller is adapted to turn off the active occlusion control unit when the presence of own-voice is not detected.
It is pointed out that combinations of the above-mentioned embodiments give rise to even further, more specific embodiments according to the present invention.
The present invention is further explained below by means of non-limiting specific embodiments and with reference to the accompanying drawings. What is shown in the figures is the following:
In the figures, like reference signs refer to like parts.
Depending on the application a hearing device is intended for, either an “open” or a “closed” fitting is employed. In the former case sound is delivered to the ear drum of the user both directly, i.e. by-passing the hearing device, as well as for instance via a thin tube extending into the ear canal conveying sound that has been processed, e.g. amplified, by the hearing device. In this way it is possible to maintain the user's voice sounding natural for the user himself, however only relatively mild amplification can be applied, otherwise feedback whistling will occur. On the other hand, when high levels of amplification are required, e.g. to compensate a severe hearing loss, or a great degree of ambient sound attenuation is desired, e.g. for a hearing protection device, a closed fitting is necessary, where the ear canal is essentially sealed-off, i.e. very little direct sound reaches the ear drum. This has the disadvantage of causing the so-called “occlusion effect”, which occurs when an object blocks a person's ear canal, and the person perceives his/her own voice as “hollow” or “booming”, such as when talking into a barrel. This annoying effect can be mitigated for instance by means of active occlusion control.
As is apparent from
a) sound originating from the receiver 3 that traverses the plant 22, i.e. is filtered by the transfer function of the plant 22, represented by the signal yPlant,
b) direct sound originating from the exterior of the ear canal that by-passes the hearing device, e.g. enters the ear canal through a vent 26 or a leaky seal, represented by the signal dv, and
c) speech and body sounds OV generated by the user entering the ear canal through its cartilaginous wall (from the skull 24), giving rise to an occlusion signal dOV (=own-voice).
The sound uRec emitted by the receiver 3, which passes through the plant 22, consists of a component rMicExt picked up by the ambient microphone 1 and processed, e.g. amplified 21, by the signal processing unit 2, and of a component uAOC picked up by the ear canal microphone 4 and processed, e.g. AOC filtered 27, by the AOC unit 6. The component rMicExt picked up by the ambient microphone 1 in turn consists of ambient sound rEnv from the user's environment 20 and possibly also of speech OV of the user's own voice 23 originating from his mouth and reaching the ambient microphone 1 via an external air path 25. The direct sound dv which by-passes the hearing device is influenced by the real-ear occluded gain (REOG) transfer function.
The task of the own-voice detection (OVD) unit 5 is to detect the occlusion (own-voice) signal dOV given only measurements of the aggregate signal, i.e. the sum of all the contributions yMic=yPlant+dOV+dv.
An improved variant of this embodiment is obtained by averaging the difference signal or by determining a power estimate of the difference signal by means of the power estimator 11 (depicted in
A further improved variant is obtained by additionally providing the signal from the ambient microphone 1 to the detector 9. This signal can then be subtracted from the difference signal, the averaged difference signal or the power estimate of the difference signal.
In yet a further improved variant the signal from the ambient microphone 1 is averaged or a power estimate thereof determined by means of the further power estimator 11′ (depicted in
Two such exemplary mappings/functions are illustrated in
According to the method and hearing device of the present invention the various components yPlant, dV and dOV of the sound within the ear canal that is picked up by the ear canal microphone 4 are identified and separated from one another in a systematic manner. In particular, a model of the plant 22 is used, and furthermore the direct sound entering the ear canal via leaks in the seal of the hearing device or via vents provided in the hearing device is for instance filtered by the REOG transfer function. The output of the OVD unit 5 is then for example employed to control the activity of the AOC unit 6 or other parts of the signal processing, e.g. classifier, gain model, noise canceller, beamformer, reverberation canceller and/or wind noise canceller, carried out by the signal processing unit 2. It is thus for instance possible to decrease the power consumption of the hearing device or to reduce artefacts generated by the AOC unit 6 by only turning it on when the OVD unit 5 indicates that own-voice is determined to be present.
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/EP2013/061404 | 6/3/2013 | WO | 00 |
Publishing Document | Publishing Date | Country | Kind |
---|---|---|---|
WO2014/194932 | 12/11/2014 | WO | A |
Number | Name | Date | Kind |
---|---|---|---|
6041129 | Adelman | Mar 2000 | A |
7853031 | Hamacher | Dec 2010 | B2 |
20080107287 | Beard | May 2008 | A1 |
20090010442 | Usher | Jan 2009 | A1 |
20100260364 | Merks | Oct 2010 | A1 |
Number | Date | Country |
---|---|---|
1 956 589 | Aug 2008 | EP |
03073790 | Sep 2003 | WO |
2004021740 | Mar 2004 | WO |
2004077090 | Sep 2004 | WO |
Entry |
---|
International Search Report for PCT/EP2013/061404 dated Mar. 28, 2014. |
Written Opinion for PCT/EP2013/061404 dated Mar. 28, 2014. |
Number | Date | Country | |
---|---|---|---|
20160105751 A1 | Apr 2016 | US |