The subject matter of the present disclosure relates to a method and apparatus to provide a user of a communication device with feedback indicating the quality of the user's voice relative to environmental noise and how characteristics of the user's voice can be changed to improve the quality.
People use voice receptive devices, such as wireless phones or voice recognition devices, in a variety of environments-some of which may have relatively high levels of environmental noise. When the device is used in a noisy environment, the person speaking may not be capable of properly gauging the volume, tone, or pitch of his voice provided to the device. If the device is a wireless phone, for example, the user's voice may be too loud for the listener on the other end of a call, or the listener may not be able to understand what is being said if the user's voice is too soft. If the device is capable of voice recognition, for example, the voice commands provided by the user to the device may not be processed properly because the user's voice is not at an optimal range in its characteristics for processing.
The subject matter of the present disclosure is directed to overcoming, or at least reducing the effects of, one or more of the problems set forth above.
Various embodiments of the disclosure are now described, by way of example only, with reference to the accompanying figures, in which:
Skilled artisans will appreciate that elements in the figures are illustrated for simplicity and clarity and have not necessarily been drawn to scale. For example, the dimensions and/or relative positioning of some of the elements in the figures may be exaggerated relative to other elements to help improve the understanding of various embodiments of the present disclosure. Also, common but well-understood elements that are useful or necessary in a commercially feasible embodiment are not often depicted in order to facilitate a less obstructed view of these various embodiments of the present disclosure. It is further appreciated that certain actions and/or steps may be described or depicted in a particular order of occurrence while those skilled in the art will understand that such specificity with respect to sequence is not actually required. It is also understood that the terms and expressions with respect to their corresponding respective areas of inquiry and study except where specific meaning have otherwise been set forth herein.
Embodiments of the disclosure relate to a feedback mechanism that informs a user of a communication device (e.g., a phone or other telephony arrangement) to adjust the volume, pitch, tone or other characteristic of his voice so as to compensate for noise in the surrounding environment. The feedback mechanism includes feedback circuitry that analyzes audio signals from the microphone and preferably from one or more additional dedicated environmental noise sensors. From the analysis, the feedback circuitry determines characteristics of the user's voice and characteristics of the environmental noise, and provides an analysis of how the user might modify his voice to best compensate for the environmental noise. This analysis results in an indication to the user, such as through a vibration, a sound, or graphical indication on the device, which tells the user whether and to what extent the user should adjust a characteristic of his voice to best overcome such environmental noise. Let us now refer to the figures to describe the disclosure in greater detail.
The device 10 has device circuitry 11, a microphone 12, a speaker 14, and a display 16. If the device 10 is a cellular telephone, for example, the device circuitry 11 can be conventional wireless phone electronics, which are not discussed in detail herein. The device circuitry 11 may provide automatic gain control using techniques known in the art for filtering out environmental noise from the received audio signal and for performing echo-cancellation. Furthermore, the device circuitry 11 may also generate a sidetone that feeds a small amount of the audio signal picked up by the microphone 12 back to the internal speaker 14 so that the person using the device 10 can hear an amplified version of his own voice. Such sidetones can be generated using techniques disclosed in U.S. Pat. No. 6,151,391, which is incorporated herein by reference.
As alluded to above, the feedback mechanism 30 informs a user of the device 10 to adjust the volume, pitch, tone or other characteristic of his voice to compensate for environmental noise. The feedback mechanism 30 includes feedback circuitry 31 that is coupled to or is part of the device circuitry 11. The feedback circuitry 31 analyzes audio signals from the microphone 12, and preferably one or more additional noise sensors 32 (explained further below). From the analysis, the feedback circuitry 31 compares characteristics of the user's voice and the characteristics of the environmental noise, and makes an assessment as to how the user can most logically tailor the characteristics of his voice in light of the environmental noise. The result of this assessment is provided to the user through at least one of a variety of indications, such as through a vibration, a sound, or graphical indication on the device. Preferably, the indication tells the user whether and to what extent the user should adjust a characteristic of his voice to best overcome problems caused by noise in the environment. For example, the indication might tell the user to speak more loudly or softly, or to adjust his pitch to higher or lower frequencies.
A process 50 for such feedback, and the feedback circuitry 31 through which the process can be implemented, are illustrated in
Next, the environmental noise is sampled (Step 70), and like the user's voice, is analyzed for its characteristics (Step 72). As shown in
The voice characteristics are then compared to the noise characteristics to determine the current quality of the user's voice (Step 80). For example, the dB level of the user's voice can be compared to the dB level of the environmental noise to ascertain the difference; if the difference between the voice and noise is high (e.g., above a certain threshold), then the voice can be considered good quality in relation to the noise. Such a comparison of the voice and noise characteristics can be accomplished via control logic 120, which functions in accordance with predetermined thresholds 122, such as the dB threshold just described as an example. Such thresholds 122 may be adjustable by the user, or may be preset as part of the feedback circuitry 31.
If the comparison to the threshold 122 indicates a good quality (Step 82), then there is no need for the feedback mechanism 30 to provide any sort of indication to the user, and the process 50 returns to sampling, etc. (Step 60). If, however, the comparison to the threshold 122 indicates a poor quality, the assistive feedback algorithm 124 is used by the control logic 120 to determine how the user could alter his voice to improve the situation vis-à-vis the environmental noise.
This algorithmic determination can involve several sub-steps. For example, the process 50 can assess the current operational mode of the device 10 (Step 90). This is useful because the current operational mode might affect the suggested feedback. For example, if the device 10 is a telephone being used in a hands-free mode, it is not logical for the assistive feedback algorithm 124 to choose a tactile means of indication to the user, such as a vibration; instead an audible indication might be best. By contrast, when the device 10 is positioned in a cradle for hands-free use in a vehicle, graphical instructions might be preferable, etc. In any event, the assistive feedback algorithm 124 takes the operation mode of the device into consideration, and selects an appropriate form of feedback based on the operational mode (Step 92).
Lastly, a feedback controller 130 generates the appropriate form of feedback (e.g., tactile, graphical, audio, or combinations thereof) to inform the user how to best adjust the characteristics of his voice (Step 94). This occurs in conjunction with a feedback controller 130 which receives the feedback instruction from the control logic 120 and activates an appropriate feedback component, such as an actuator 142 (for a tactile indication), a graphics generator 144 (for a visual indication), or a sound generator (speaker) 146 (for an audible indication), or combinations of these.
The feedback provided to the user by the feedback mechanism 30 as just described can take any different form dependent on whether tactile, graphical, or audible feedback is deemed best. If tactile feedback is chosen, such as a vibration, the feedback controller 130 can activate a vibrating ring 34 and an actuator 35 (see
To provide visual feedback to the user, the feedback controller 130 can generate graphics 36 on the display 16 of the device 10 to instruct the user as to how to adjust characteristics of his voice. For example, and as shown in
To provide audible feedback to the user, the feedback controller 130 can communicate audible instructions to the user by using the internal speaker 14 of the device 10. Alternatively, the feedback controller 130 can send the audible instruction to an interface 18 in communication with an external audio source such as a wireless headset (not shown). The interface and headset may be Bluetooth compliant, as is well known. Alternatively, the interface 18 may be a wired interface for connecting to a wired earpiece and microphone (not shown). In any event, the feedback controller 130 generates a distinct tone, buzzing, or other instructive sound or phrase which can overlay (or which can simply interrupt) the telephone conversation. This audible indication may be diminished and eventually eliminated as the user's voice meets the optimal loudness, pitch, or tone for the environmental noise in question.
As discussed briefly above, the disclosed techniques can be used in conjunction with communication devices having auxiliary components. For example,
When positioned in the cradle 210, it may not be useful for the telephone 10 itself to provide feedback to instruct the user on how to adjust his voice. Accordingly, the phone 10 can operate in conjunction with one or more of the other devices 220, 230, and 240 to provide appropriate feedback. In one example, the telephone 10 operates in conjunction with the wireless headset 230. The headset 230 receives the user's voice, while the environmental noise is received by (for example) dedicated sensors 34 on the telephone 10 or on the headset 230. Alternatively, the microphone of the headset 230 may be used for both as well. The feedback mechanism 30 processes the audio and provides audible feedback by sending generated sounds to the headset 230 to instruct the user audibly on how to adjust his voice. Alternatively, the feedback mechanism 30 can send the indications to the audio system 240 via the connection 212 or can send a graphical indication to the display 222 of the user interface module 220 to instruct the user visually on how to change his voice characteristics.
In
In yet another arrangement, the car kit 250 may not have its own feedback mechanism and must use the feedback mechanism 30 of the phone 10. In this arrangement, the microphone 272 of the car kit 250 obtains audio signals of the user's voice and environmental sound, and the junction box 260 sends the audio signals to the telephone 10 via connection 212. The feedback mechanism 30 of the telephone 10 determines the adjustment needed for the user's voice and determines what type of feedback (e.g., audible, visual, tactile) to provide based on how the phone 10 is currently being operated (as discussed earlier). To make the determination of the type of feedback to use, the feedback mechanism 30 may determine what type of device it is coupled to using standard techniques, for example, when devices pair in a Bluetooth connection. In this way, the feedback mechanism 30 knows the type of user interfaces of the other device 250.
The phone 10 then returns the appropriate feedback information to the car kit 250, which then implements the feedback. For example, the dedicated speaker 274 or the speaker 282 of the audio system 280 can provide generated sounds for audible instruction, or lights (not shown) on the user interface module 270 can provide visual instruction to the user.
The foregoing description of preferred and other embodiments is not intended to limit or restrict the scope or applicability of the inventive concepts conceived of by the Applicants. In exchange for disclosing the inventive concepts contained herein, the Applicants desire all patent rights afforded by the appended claims. Therefore, it is intended that the appended claims include all modifications and alterations to the full extent that they come within the scope of the following claims or the equivalents thereof.
Number | Name | Date | Kind |
---|---|---|---|
4829565 | Goldberg | May 1989 | A |
6151391 | Sherwood | Nov 2000 | A |
6674865 | Venkatesh | Jan 2004 | B1 |
6888935 | Day | May 2005 | B1 |
6889064 | Baratono et al. | May 2005 | B2 |
7006845 | Simon | Feb 2006 | B2 |
7023984 | Short et al. | Apr 2006 | B1 |
20040213402 | Ruetschi | Oct 2004 | A1 |
20040260547 | Cohen et al. | Dec 2004 | A1 |
20050213745 | Thomasson | Sep 2005 | A1 |
20080101557 | Boss et al. | May 2008 | A1 |
20080108306 | Yee | May 2008 | A1 |
Number | Date | Country | |
---|---|---|---|
20080162120 A1 | Jul 2008 | US |