The present application claims priority to EP Patent Application No. 18210505.6, filed on Dec. 5, 2018, and entitled “PROVIDING FEEDBACK OF AN OWN VOICE LOUDNESS OF A USER OF A HEARING DEVICE,” the contents of which are hereby incorporated by reference in their entirety.
Hearing devices are generally small and complex devices. Hearing devices can include a processor, microphone, speaker, memory, housing, and other electronical and mechanical components. Some example hearing devices are Behind-The-Ear (BTE), Receiver-In-Canal (RIC), In-The-Ear (ITE), Completely-In-Canal (CIC), and Invisible-In-The-Canal (IIC) devices. A user can prefer one of these hearing devices compared to another device based on hearing loss, aesthetic preferences, lifestyle needs, and budget.
Some users of hearing devices claim to have problems to estimate the loudness of their voice, which may cause discomfort. This estimation may be difficult, while using a remote microphone or any other streaming device, since the microphone input of the hearing device may be attenuated, while in a streaming operation mode.
Hearing impaired children also tend to raise the pitch of their voice, when getting nervous and/or doubt to be understood by peers.
US 2006 0 183 964 A1 proposes to monitor level, pitch and frequency shape of a voice and to provide a feedback thereon.
DE 20 2008 012 183 U1 proposes to use the microphone of a smartphone to analyze a voice.
The reference symbols used in the drawings, and their meanings, are listed in summary form in the list of reference symbols. In principle, identical parts are provided with the same reference symbols in the figures.
The embodiments described herein help a user of a hearing device in controlling his or her voice loudness.
This is achieved by the subject-matter of the independent claims. Further exemplary embodiments are evident from the dependent claims and the following description.
A first aspect described herein relates to a method for providing feedback of an own voice loudness of a user of a hearing device. The feedback may be any indication provided to the user that his or her voice is too silent or too loud. Such an indication may be provided to the user either directly via the hearing device, for example with a specific sound, and/or via a portable device, such as a smartphone, smartwatch, tablet computer, etc.
The hearing device may be a hearing aid adapted for compensating a hearing loss of the user. The hearing device may comprise a sound processor, such as a digital signal processor, which may attenuate and/or amplify a sound signal from one or more microphones, for example frequency and/or direction dependent, to compensate the hearing loss.
According to an embodiment, the method comprises: extracting an own voice signal of the user from an audio signal acquired with a microphone of the hearing device and determining a sound level of the own voice signal. For example, the hearing device may comprise at least two microphones and/or directional audio signals may be extracted from the audio signals of the microphones. Since a position and/or distance from the source of the own voice of the user and the hearing device is constant, an own voice signal may be extracted from the audio signal. The sound level of the own voice may be calculated from the own voice signal and/or may be provided in decibel.
It should be noted, that a microphone may be any sensor, which is adapted to transform vibrations to an electrical signal. Typically the microphone may be an electret condenser microphone, a MEMS-microphone or a dynamic microphone, it can however also be realized by an acceleration sensor or a strain gauge sensor. The microphone may pick up ambient sound. The microphone may pick up body vibrations, in particular vibrations of the skull or the throat of the user during speaking.
The own voice may be the voice of the user of the hearing device.
According to an embodiment, the method further comprises: determining an acoustic situation of the user. The acoustic situation may encode acoustic characteristics of the environment of the user. The actual acoustic situation may be a value and/or context data, which indicates an acoustical environment of the user. The sound situation may include numbers and/or distances of persons around the user, type and/or shape of the room, the user is in, the actual operation mode of the hearing device, etc. The acoustic situation may be automatically determined by the hearing device, for example from the audio signal of the microphone, further audio signals and/or audio stream received from another device and/or from context data, which, for example, may be provided by other devices, which may be in data communication with the hearing device.
According to an embodiment, the method further comprises: determining at least one of a minimal threshold and a maximal threshold for the sound level of the own voice signal from the acoustic situation of the user. Either from a table or with an algorithm, a range (or at least a lower bound or an upper bound of the range) is determined from the acoustic situation. For example, the range may be stored in a table and/or may be calculated from context data of the acoustic situation.
According to an embodiment, the method further comprises: notifying the user, when the sound level is at least one of lower than the minimal threshold and higher than the maximal threshold and notifying. When the sound level of his or her voice is outside of a desired range (or at least outside a lower bound or a higher bound of the range), the user may get feedback from the hearing device, that he is too loud or too silent. The user may get an indication, whether the loudness of his voice is adequate in a specific acoustic situation or not. The hearing device may give indication, whether the user shall raise his or her voice or lower the loudness.
Such an assistance of controlling the voice level of his or her voice may enhance a comfort of the user, for example while being involved in a discussion, while being streaming another sound signal, in a telephone conversation, etc.
Furthermore, a user may be trained with a new type of hearing device. Also, children may be trained learning to use their voice.
According to an embodiment, the at least one of the minimal threshold and the maximal threshold are determined from a table of thresholds, the table storing different thresholds for a plurality of acoustic situations. For every acoustic situation, the hearing device or a hearing system comprising the hearing device may determine an identifier for an acoustic situation and may determine the range and/or one bound of the range from a table by use of the identifier. For example, the hearing system may analyze context data, such as GPS data and/or and environment noise level. The context data and the associated own voice sound level range may be stored locally in a table.
The table may be multi-dimensional, depending on different variables. If the hearing system detects context data similar to context data stored in an entry of the table, the range and/or a bound of the range of this entry may be used.
According to an embodiment, the acoustic situation is determined from a further audio signal. The further audio signal may be extracted from the audio signal acquired by the hearing device. For example, an environmental noise level also may be extracted from the audio signal acquired by the microphone of the hearing device. It also may be that the further audio signal is acquired by a further microphone, such as a microphone carried by a further person and/or a stationary microphone in the environment of the user.
The hearing device may estimate a background noise and may calculate an optimal own voice loudness range therefrom. The hearing device may gather context data to estimate the distance of a listening person and may adapt the range accordingly. A further type of context data that may be extracted from the further audio signal may be a room acoustics, such as a reverberation time.
According to an embodiment, the acoustic situation is determined from a speech characteristics of another person. A own voice signal of another person may be extracted from an audio signal acquired by the hearing device and/or by a further microphone. From this own voice signal, the speech characteristics may be determined, such as a diffuseness of speech, an instantaneously diffuseness dependent on estimated room acoustics, a diffuseness dependent on room acoustics when background noise was low, level, a direction of arrival, which may be calculated binaurally, etc.
According to an embodiment, the acoustic situation is determined from a further user voice signal, which is extracted from the further audio signal. An own voice sound level of the user may be measured at a further person, who may be wearing a microphone as part of a communication system. The user also may put a further device with a microphone to a distant location within the room to retrieve feedback. Such a device may be any remote microphone.
According to an embodiment, determining the acoustic situation is based on an operation mode of the hearing device. In different operation modes, the own voice of the user may be differently attenuated and/or amplified. For example, this may be the case, when an audio signal and/or audio stream from another source as the microphone of the hearing device is output by the hearing device to the user. The operation mode may be streaming of a further audio source, such as from a remote microphone.
Also during a telephone call, the microphone of the hearing device may be damped. The operation mode may be a telephone call operation mode, where an audio signal and/or audio stream from a telephone call may be received in the hearing device and output by the hearing device to the user.
According to an embodiment, the method further comprises: determining the acoustic situation from a user input. It may be that the user provides input to a user interface, for example of a portable device. The user input may include at least one of a number of persons, to which the user is speaking and a distance to a person, to which the user is speaking.
According to an embodiment, the method further comprises: determining a location of the user. The location may be determined with a PGS sensor of the portable device and/or with other sender receivers, such as Bluetooth and/or WiFi sender/receivers, which also may be used for determining a location of the user and/or portable device relative to another sender/receiver. The acoustic situation then may be determined from the location of the user. For example, the location may be a restaurant, a train, a workplace, etc. and the acoustic situation may be set accordingly.
It also may be that a hearing system comprising the hearing receives information of locations of persons around the user, for example from GPS data acquired by their portable devices. A minimal and/or maximal threshold for the own voice sound level then may be determined based on these locations.
According to an embodiment, the minimal threshold and/or the maximal threshold for an acoustic situation are set by user input. The range may be manually set by the user with a portable device having a user interface. In an acoustic situation, the user may choose the range of own voice sound level himself, for example by defining the minimal threshold, which may indicate the minimal loudness the user wants to talk with. Additionally or alternatively, the user may define the maximal threshold, which may indicate the maximal loudness the user wants to talk. The range between the minimal threshold and the maximal threshold may represent the targeted loudness range of the user's voice. This range may be set situation dependent with a user interface on a smartphone and/or smartwatch.
It may be that the user gets feedback from a communication partner and enters the feedback to his hearing system by pressing a predefined button on the user interface, such as “ok”, “too soft”, “rather loud”, etc.
It also may be that one or both of the thresholds are set by another person, such as a speech therapist.
According to an embodiment, the user is notified via an output device of the hearing device. The hearing device may have an output device, which may be adapted for notifying the user acoustically, tactilely (i.e. with vibrations) and/or visually. The output device of the hearing device may be the output device, which is used for outputting audio signals to the user, such as a loudspeaker or a cochlea implant.
According to an embodiment, the user is notified via a portable device carried by the user, which is in data communication with the hearing device. As already mentioned, such a device may be a smartphone and/or smartwatch, which may have actuators for acoustically, tactilely and/or visually notifying the user, such as a loudspeaker, a vibration device and/or a display.
For example, the notification may be provided by a vibrating smartwatch, smartphone, bracelet and/or other device.
A visual notification may be provided with a smartphone, which is blinking with a red screen. A visual notification also may be displayed in electronic eye-glasses. The sound level of the voice may be displayed in a graph on a smartphone in real time. It also may be that the sound level is displayed in a continuous way, by displaying the actual sound level and the thresholds.
It also may be that voice sound level of other persons are displayed, such as the sound level of a speech therapist.
According to an embodiment, the method further comprises: logging the sound level over time; and optionally visualizing a distribution of the sound level over time. The own voice sound level may be logged regularly and/or continuously. The user may have insight to a statistical distribution of his own voice sound level during specific time intervals, for example at the end of the day, at the end of the month, etc. A statistical distribution of the voice sound level during specific acoustic situations may be displayed.
Furthermore, the sound level evolving over time within a specific acoustic situation and/or over the whole day may be displayed. Also, the own voice sound level dependent on other parameters, such as a calendar entry, GPS location, acceleration sensors, day time, acoustic properties of the ambient signal, such as a background noise signal, signal-to-noise ratio, etc., may be displayed.
A speech pathologist and/or hearing care professional may have access to the logged data. Furthermore, instead of the own voice sound level, other speech parameters, such as described below, may be logged.
According to an embodiment, the method further comprises: monitoring other speech properties of the user. Not only the own voice sound level, but also other speech properties that may be extracted from the voice signal, such as a pitch of the voice, may be monitored. This may be done, like the own voice sound level is monitored as described above and below.
Such speech properties may include: a relative height of amplitudes in a 3 kHz range, breath control, articulation, speed of speaking, pauses, harrumphs, phrases, etc., emotional properties, excitement, anger, etc.
Further aspects described herein relate to a computer program for providing feedback of an own voice loudness of a user of a hearing device, which, when being executed by a processor, is adapted to carry out the steps of the method as described in the above and in the following as well as to a computer-readable medium, in which such a computer program is stored.
For example, the computer program may be executed in a processor of a hearing device, which hearing device, for example, may be carried by the person behind the ear. The computer-readable medium may be a memory of the hearing device. The computer program also may be executed by a processor of a portable device and the computer-readable medium at least partially may be a memory of the portable device. It also may be that steps of the method are performed by the hearing device and other steps of the method are performed by the portable device.
In general, a computer-readable medium may be a floppy disk, a hard disk, an USB (Universal Serial Bus) storage device, a RAM (Random Access Memory), a ROM (Read Only Memory), an EPROM (Erasable Programmable Read Only Memory) or a FLASH memory. A computer-readable medium may also be a data communication network, e.g. the Internet, which allows downloading a program code. The computer-readable medium may be a non-transitory or transitory medium.
A further aspect described herein relates to a hearing system comprising a hearing device, which is adapted for performing the method as described in the above and the below. The hearing system may further comprise a portable device and/or a portable microphone. For example, the notification of the user may be performed with the portable device, such as a smartphone, smartwatch, tablet computer, etc. With the portable microphone, a further audio signal may be generated, which may be additionally used for determining an actual acoustic situation.
It has to be understood that features of the method as described in the above and in the following may be features of the computer program, the computer-readable medium and the hearing system as described in the above and in the following, and vice versa.
These and other aspects described herein will be apparent from and elucidated with reference to the embodiments described hereinafter.
Each of the hearing devices 12 is adapted to be worn behind the ear and/or in the ear channel of a user. Also the portable device 14, which may be a smartphone, smartwatch or tablet computer, may be carried by the user. The portable device 14 may transmit data into and receive data from a data communication network 18, such as the Internet and/or a telephone communication network.
The hearing devices 12 may transmit data between them, for example for binaural audio processing and also may transmit data to the portable device 14. The hearing devices 12 also may receive data from the portable device 14, such as an audio signal 20, which may encode the audio signal of a telephone call received by the portable device 14.
The external microphone 16, which may be carried by a further person or may be placed in the environment of the user, also may generate an audio signal 22, which may be transmitted to the hearing devices 12. It has to be noted that audio streams, such as 20, 22, may be seen as digitized audio signals.
The hearing device 12 furthermore comprises a sender/receiver 32, which for example via Bluetooth, may establish data communication with another hearing device 12 and/or which may receive the audio signals 20, 22. These audio signals 20, 22 may be processed by the processor 26 and/or may be output by the output device 28.
The external microphone may comprise also a sender/receiver for data communication with the hearing devices 12 and/or the portable device 14.
In step S10, the audio signal 30 is acquired by the hearing devices 12. The audio signal 30 may be processed with the processor 26, for example for compensating a hearing loss of the user, and output by the output device 28.
Furthermore, one or both of the audio signals 20, 22 may be received in the hearing devices 12. For example, the audio signal 20 may refer to a telephone call. The audio signal 22 may refer to a talk, which is given by a person speaking into the microphone 16. Also these audio signals 20, 22 may be processed with the processor 26, for example for compensating a hearing loss of the user, and output by the output device 28.
In step S12, an own voice signal 42 of the user is extracted from the audio signal 30 acquired with the microphone 24 of the hearing device 12 and a sound level 44 of the own voice signal 42 is determined. For example, the own voice signal 42 may be extracted from the audio signal with beamformers and/or filters implemented with the processor 26, which extract the parts of the audio signal 30, which are generated near to the hearing devices 12.
Returning to
In general, the acoustic situation 48 may be encoded in a value and/or in a context data structure, which indicates sound sources, persons and/or environmental conditions influencing, how the voice of the user can be heard by other persons. In one case, the acoustic situation may be a number. In another case, the acoustic situation may be a data structure comprising a plurality of parameters.
The acoustic situation may be determined from the audio signal 30, the audio signal 20 and/or the audio signal 22.
For example, a further audio signal 22 may be extracted from the audio signal 30 acquired by the hearing device 12. This further audio signal 22 may be a further voice signal, which may encode the voice of a person, who talks to the user. Also the audio signal 22 may contain a further voice signal, which may encode the voice of a person, who carries the microphone 16 and who talks to the user. From the sound level of the further voice signal, the distance of the other person may be determined.
Thus, the sound level of one or more other persons may be a part of the acoustic situation 48 and/or may have influence on the acoustic situation 48.
The acoustic situation 48 also may be determined and/or its context data may comprise a room acoustics and/or a speech characteristics of another person. Also, these quantities may be determined from one or more of the audio signals 30, 20, 22.
It also may be that the acoustic situation 48 is based on an operation mode of the hearing device 12, which operation mode may be a parameter influencing and/or being part of context data for the acoustic situation 48. For example, when the portable device 14 is streaming an audio signal 20, the hearing devices 12 may output this audio signal 20 in a specific operation mode.
As a further example, the acoustic situation 48 may be determined based on a user input. The user may input specific parameters into the portable device, which may become part of the context data and/or influence the acoustic situation 48. For example, the user input may include at least one of a number of persons, to which the user is speaking and/or a distance to a person, to which the user is speaking.
Also a location of the user, which may be determined with a GPS sensor of the portable device 14, may be part of the context data of the acoustic situation 48 and/or the acoustic situation 48 may be determined from the location of the user.
In step S14, at least one of a minimal threshold 46a and a maximal threshold 46b for the sound level 44 of the own voice signal 42 is determined from the acoustic situation 48 and/or from the context data of the acoustic situation 48.
It may be that the thresholds 46a, 46b are determined with an algorithm from the acoustic situation 48 and/or from the context data of the acoustic situation 48. For example, it may be tried that the sound level 44 of the user is in a range within a sound level of another person and/or a noise sound level.
It also may be that a table of thresholds 46a, 46b is stored in the hearing devices 12 and/or the portable device 14. The table may comprise thresholds 46a, 46b for a plurality of acoustic situations 48. The records and/or entries of the table may be referenced with different acoustic situations 48 and/or their context data. The at least one of the minimal threshold 46a and the maximal threshold 46b may be determined from this table of thresholds.
When a specific acoustic situation has been identified it may be that the minimal threshold 46a and/or the maximal threshold 46b for the acoustic situation 48 may be set by a user input. For example, the user may change the thresholds 46a, 46b with a user interface of the portable device 14.
In step S16, it is determined, whether the sound level 44 is at least one of lower than the minimal threshold 46a and higher than the maximal threshold 46b, for example whether the sound level 44 is outside of the range defined by the two thresholds 46a, 46b.
When this is the case, the user receives a notification 50, which may be an acoustic, tactile and/or visual notification. For example, the user may be notified via the output device 28 of the hearing device 12, which may output a specific sound. The user also may be notified by the portable device 14, which may output specific sound with the loudspeaker, may vibrate with the vibration generator and/or may display an indicator for the sound level 44 on the display 34.
In step S18, the sound level 44 and/or further data, such as the acoustic situation 48 and/or the context data for the acoustic situation, may be logged over time. Later, the user and/or a voice trainer may look into the logged data, which may be visualized by the portable device 14 and/or other devices. For example, statistical distribution of the sound level 44, the acoustic situations 48 and/or the context data over time may be visualized.
While the invention has been illustrated and described in detail in the drawings and foregoing description, such illustration and description are to be considered illustrative or exemplary and not restrictive; the invention is not limited to the disclosed embodiments. Other variations to the disclosed embodiments can be understood and effected by those skilled in the art and practising the claimed invention, from a study of the drawings, the disclosure, and the appended claims. In the claims, the word “comprising” does not exclude other elements or steps, and the indefinite article “a” or “an” does not exclude a plurality. A single processor or controller or other unit may fulfill the functions of several items recited in the claims. The mere fact that certain measures are recited in mutually different dependent claims does not indicate that a combination of these measures cannot be used to advantage. Any reference signs in the claims should not be construed as limiting the scope.
In the preceding description, various exemplary embodiments have been described with reference to the accompanying drawings. It will, however, be evident that various modifications and changes may be made thereto, and additional embodiments may be implemented, without departing from the scope of the invention as set forth in the claims that follow. For example, certain features of one embodiment described herein may be combined with or substituted for features of another embodiment described herein. The description and drawings are accordingly to be regarded in an illustrative rather than a restrictive sense.
Number | Date | Country | Kind |
---|---|---|---|
18210505 | Dec 2018 | EP | regional |
Number | Name | Date | Kind |
---|---|---|---|
5426179 | Bertolini | Jun 1995 | A |
6275806 | Pertrushin | Aug 2001 | B1 |
7222075 | Petrushin | May 2007 | B2 |
7627475 | Petrushin | Dec 2009 | B2 |
7940914 | Petrushin | May 2011 | B2 |
8965770 | Petrushin | Feb 2015 | B2 |
9973861 | Rosenkranz | May 2018 | B2 |
20020194002 | Petrushin | Dec 2002 | A1 |
20030033145 | Petrushin | Feb 2003 | A1 |
20060183964 | Kehoe | Aug 2006 | A1 |
20070036365 | Rohrseitz | Feb 2007 | A1 |
20070162283 | Petrushin | Jul 2007 | A1 |
20090010456 | Goldstein | Jan 2009 | A1 |
20110178803 | Petrushin | Jul 2011 | A1 |
20140126735 | Gauger, Jr. | May 2014 | A1 |
20150271608 | Sabin | Sep 2015 | A1 |
20150289065 | Jensen | Oct 2015 | A1 |
20150310878 | Bronakowski | Oct 2015 | A1 |
20160163332 | Un | Jun 2016 | A1 |
20180054683 | Pedersen | Feb 2018 | A1 |
20190020957 | Kamkar-Parsi | Jan 2019 | A1 |
20190075406 | Petersen | Mar 2019 | A1 |
Number | Date | Country |
---|---|---|
2015355097 | Jun 2016 | AU |
107003997 | Aug 2017 | CN |
202008012183 | Apr 2009 | DE |
1222448 | Aug 2000 | EP |
2001016570 | Mar 2001 | WO |
2010019634 | Feb 2010 | WO |
2016089929 | Jun 2016 | WO |
Number | Date | Country | |
---|---|---|---|
20200186943 A1 | Jun 2020 | US |