1.2. Field of the Invention
The present invention relates generally to sound systems and in particular to voice communication systems comprising a voice communication terminal, such as a cellular or mobile telephone, and a headset. More particularly, the invention relates to a voice communication system of the described type for providing hands-free functionality.
1.3. Description of the Related Art
Headsets are often used for voice communication and listening music in mobile environments where “hands-free” functionality is required. A typical example is a Bluetooth headset connected wirelessly with a mobile phone or a media player or a radio receiver. Various examples of headsets are described in many published patent documents, among which U.S. Pat. No. 5,793,865; U.S. Pat. No. 6,459,882; U.S. Pat. No. 6,519,475; U.S. Pat. No. 6,810,987; U.S. Pat. No. 7,010,332; U.S. Pat. No. 7,072,686; US 20040198470 and EP1 503 368 may be mentioned. While listening and talking with a headset does not require hands, all other operations such as initiating a call, answering a call, rejecting a call, changing a music track, increasing/decreasing sound volume and the like typically require pressing small buttons on the headset body. This may be inconvenient in some circumstances (e.g. when both hands are occupied) or, sometimes, even dangerous (e.g. while driving a car in dense traffic). It is thus seems useful to provide a headset with hands-free control functionality.
One option to control a headset consists in using automatic speech recognition (ASR) techniques. Indeed, many users prefer to interact with a telephone device via voice commands, particularly in view of the trend in the law towards “hands-free” mobile telephony devices. Examples of ASR techniques are described, among many other publications, in U.S. Pat. No. 6,167,251; U.S. Pat. No. 7,324,942 and EP1 503 368. When employing an ASR system, the user, for example, can say “answer” to start a call, “hand-up” to end a call, “next” to move to a next song, etc. However, speech recognition may be unreliable or inconvenient in noisy conditions, for example in a car with open windows. ASR system may be also prone to false interpretation of input sounds, especially with “noises” produced by other people talking in the vicinity of the headset. Moreover, interpreting voice responses is a challenge for voice-based technology due to different inflections, tones, accents, speech patterns, volumes and many other speech variables. In addition, speech recognition requires constant monitoring of input audio signal which may be in some cases too power consuming for very low power devices such as communication headsets.
Therefore, what is desired is a voice communication and/or entertainment system (which will be termed below as “a sound system”) possessing alternative or additional capabilities for supporting hands-free communication and control and thus providing a greater convenience and comfort for the user.
The present invention will be further disclosed in the following detailed description of the preferred embodiments with reference to the drawings.
The main idea of the present invention consists in generating control data for controlling communication process between a voice communication terminal of a voice communication system and a headset linked to such terminal with the aid of at least one motion sensor integrated into or mechanically connected to the headset. The same integrated motion sensor or sensors can be used to control the functionality of a portable media player, radio receiver, portable gaming console or another gaming machine as well as other sound reproducing devices or the voice terminal additionally having capabilities of such devices. For simplicity, the detailed description of the invention will pertain to controlling hands-free voice communication. The extension of control capabilities from a voice terminal to sound reproducing devices is clear to persons skilled in the art. Such additional functionality will be mentioned in the description when relevant.
In case when the terminal is a sound reproducing device or functioning as a sound reproducing device, the communication (indicated by the double-headed-arrow 60) is a one way communication from the terminal 40 to the headset 10. The term “a sound terminal” or just “a terminal” will be applied below for indicating both the voice terminal and various sound reproducing devices (such as various media players, radio receivers, gaming consoles, etc.).
The headset 10 further comprises a headset control unit 18 (for example, in the form of a microcontroller) for controlling performance of various components of the headset 10, including, for example, such control function as switching the headset on/off, adjusting sound volume or a controlling communication link protocol. The control unit 18 is also configured for generating and sending, via the HC circuit 12, control data (indicated by arrows 96) to the sound terminal 40 in order to enable the terminal to execute one or more commands from a preselected set of commands, such as, for example, accepting or rejecting an incoming call, ending a call, playing a next music track or clip, pausing the music, switching to the next radio station, etc. At least some of the control unit's functions are conventionally activated by selectively pressing by the user one or more buttons (not shown in
The headset 12 further comprises a power source (not shown). Conventionally, the power source is a rechargeable battery. In this case the headset preferably comprises also a socket for a charger. The HC circuit 12 and the power source may be contained in an appropriate enclosure (not shown) attached, together with the earphone 16 and the microphone 14, to an appropriate head mount (not shown). The microphone 14 may be located in a microphone pickup (not shown). Alternatively, both the microphone and the loudspeaker may be located, together with the HC circuit and the power source inside a main body of the headset 10 (not shown). As an alternative, the HC circuit and the power source may be located in different parts of the head mount. As one more alternative corresponding to another conventional headset design, the headset of the present invention may be configured without the head mount, with locating the HC circuit 12 and the power source, together with the earphone, in an appropriately configured earpiece (not shown). Evidently, other mechanical configurations of the headset are also possible.
All the above-mentioned parts and components of the headset 10 may have a conventional design and functionality, well known in the art. Moreover, the specific design and characteristics of the various parts and components comprising the wireless headset 10 are not critical to practicing the present invention, therefore they will not be described or shown in more detail.
Any suitable device supporting interactive (two-way) communications with at least one external communication device and with the headset 10 can be used as the voice communication terminal 40. As an exemplary embodiment of the terminal 40, any mobile (or cell) phone capable to interact with an associated headset may be mentioned. As schematically shown in
The terminal 40 also comprises a terminal control unit 46 (for example, in the form of a microcontroller or a microprocessor) for controlling the terminal 40 and the headset 10 by generating and executing appropriate commands from a preselected set of commands and sending such commands to appropriate circuits in the terminal and/or in the headset (sending commands by the terminal control unit 46 is symbolically represented in
In one exemplary embodiment the headset 10 and the voice communication terminal 40 may be respectively configured as a Bluetooth headset and a Bluetooth mobile phone. General configuration and functioning of the terminal control units used in the terminals of the above-described type, i.e. in the mobile phones and in Bluetooth mobile phones in particular, are well known in the art. In another exemplary embodiment the headset 10 and the sound terminal 40 may be respectively configured as a Bluetooth stereo headset and a Bluetooth enabled sound reproducing device, e.g. a portable media player. General configuration and functioning of the sound terminal control units used in the terminals of the above-described type are well known in the art.
As will be clear to those skilled in the art, the mobile phone, portable media player or any other appropriate sound reproducing device used as the sound terminal 40 will have other units and/or components necessary for ensuring its functionality, such as an external or internal power source (i.e. batteries), an antenna, etc. The mobile phone acting as the terminal 40 of the voice communication system of the present invention will also comprise a plurality of buttons or keys, a microphone, a loudspeaker, etc. All such parts or components of sound reproducing devices or voice communication terminals in general and mobile phones in particular are not critical to practicing the present invention and, besides, all of them are well known in the art; therefore, for clarity, they are not shown in the drawings and will not be described here. The conventional, well-known in the art functions and operations performed by the sound system, in particular by the voice communication system comprising the voice terminal (such as the Bluetooth mobile phone) and associated headset (such as the Bluetooth headset) also will not be described, for clarity.
Now returning to the headset 10, it further contains (as shown in
Further, any particular sensor or each sensor 20 may be tuned or configured such that only one stimulus (i.e. only motion in a preselected direction or only a preselected orientation of the head, and so of the headset 10) actuates the sensor. Alternatively, the sensor may be configured to be sensitive to any of two or more different stimuli. In order to minimize a mechanical load on the headset, in some of embodiments of the inventive system it is advantageous to use a single miniature tri-axis accelerometer manufactured using MicroElectroMechanical Systems (MEMS) technology capable to provide all required data on movements of the headset equipped with such accelerometer. As a few non-limiting examples of the sensor suitable for implementing the invention, the following tri-axis MEMS accelerometers may be mentioned: MMA7450L model manufactured by Freescale Semiconductor Inc. (Austin, Tex.); the KXSC7 series manufactured by Kionix (Ithaca, N.Y.); BMA020 model manufactured by Bosch Sensortec GmbH (Reutlingen, Germany); and LIS302DL model manufactured by STMicroelectronics (Geneva, Switzerland).
The single sensor or each sensor 20 is mechanically connected to the headset 10. For example, if the headset 10 comprises the head mount, the sensors may be directly attached to the head mount in different locations thereof. Alternatively, at least one sensor may be attached to the housing of the HC circuit 12. Other versions of mechanically attaching the sensor or sensors to the headset 10 are evidently possible.
The single or each sensor 20 generates at least one type of headset signals (represented by arrows 72) characterizing at least a position, orientation and/or movement (including vibration) of the headset (or any combination of these parameters, or stimuli). In some embodiments of the headset, when a power consumption issue is not critical, each or some of the sensors 20 may be constantly functioning. However, typically, to save power, the sensors are activated only when the headset control unit 18 receives some control signal or command from the voice communication terminal 40 or directly from the user (who may press for example an appropriate button on the headset). Other ways to minimize periods when the headset signals are generated will be described below with reference to
The use of the sensor(s) 20 in performing controlling functions of the voice communication system 100 will be explained below.
The headset 10 shown in
The main difference between both embodiments consists in that in the second embodiment shown in
Basic principles of generating various headset signals according to the present invention are illustrated by
Turning the head in the median plane (around Y axis) may mean accepting a call. In other words, the headset signals generated by the appropriate sensor 20 when the user is nodding his head, will be eventually transformed into a command (issued by the terminal control unit 46 to the EC circuit 42) to accept an incoming call. Turning the head in the frontal plane (around X axis) may lead to generating by the terminal a command “redial the last dialed number”, while some other command(s) from the preselected set of commands may be generated by appropriate combination(s) of the head movement in the same or different planes.
It is straightforward to persons skilled in the art to translate the corresponding head movements into commands controlling functionality of a portable media player. For example, turning (shaking) the head in the horizontal plane (around Z axis) may be translated into stopping or pausing the music, turning the head in the median plane (around Y axis) may mean starting or resuming the music, turning the head in the frontal plane (around X axis) may lead to generating the command “next track”, etc.
Alternatively or additionally, the sensor 20 (i.e. made as an accelerometer) may detect vibrations caused by user tapping the headset 10. For example:
In addition, tapping the headset when there is no active or incoming call may serve as a clue for starting speech recognition to dial a certain number.
In some embodiments of the present invention, including those shown in
In such embodiments a signal generated by the sensor configured in the described way may indicate an attainment of a preselected threshold by a parameter (a stimulus) to which said sensor is made responsive. Possible uses of the sensor configured in this way are illustrated by
As is further illustrated in
Alternatively, the R/P circuit may react to other features of the discrete signals generated by the sensor(s), i.e. to a frequency of their generation or to time intervals between two consecutive signals of the same type. Such approach may be useful when the traced stimulus corresponds to repetitive nodding. If
The use of discrete (instead of continuous) sensor signals is advantageous also because both the HC circuit 12 in the headset 10 and the IC circuit 44 in the terminal 40 will work under a lesser load, and a lesser computational power is required in this case from the R/P circuit 30.
b illustrates a situation when there exists a possibility that at some moments (such as moments t4, t5, t6) a parameter P(t) (traced by a sensor) is due to some involuntary movements of the user's head (i.e. resulting from rolling and/or pitching of a ship on board of which the user is located) or from his voluntary movements not intended to be controlling movements (as is the case, for example, when the user bends to pick some object from a floor) may exceed an upper level threshold UL for this parameter established for intentional movements the user performs to make the control unit to generate some command(s). Evidently, generating any commands in such situation would be undesirable. Therefore, the headset signal(s) formed by the sensor(s) preferably are blocked as soon as they exceed(s) UL. In the exemplary situation illustrated by
Returning to
When one or more reference sensors 50 are used, their signals are also sent to the R/P circuit 30 (as indicated by arrows 74). In this case, the R/P circuit 30, in order to produce control data, will conjointly process the headset signal(s) received from the headset sensor(s) 20 and the reference signal(s) received from the reference sensor(s) 50.
An advantage which may be attained in some embodiments of the system of the invention owing to the use of the reference sensor(s) is illustrated by
As was indicated above, in most cases it will be advantageous to employ reference sensor(s) of the same type as the headset sensor(s). For example, if the tri-axis gyroscopes are used for this purpose, in most practical situations it will be enough to employ a single headset sensor and a single reference sensor. As shown in
In case the headset sensor 20 and the reference sensor 50 have different orientations, the R/P circuit shall perform, at the beginning of the signals processing, an appropriate transformations of the signals formed by any of the sensors so as to make sensor signals equivalent to signals formed by the sensors having the same orientation. Such transformation may be performed, for example, with the aid of reference signals produced by the gyroscopes used as the headset and reference sensors.
In some embodiments, different types of sensors may be employed. For example, instead of a single reference sensor configured as the tri-axis accelerometer, two or three single-axis accelerometers may be used. The reference sensors may be fixed in the same or in different locations. For example, at least one of such sensors may be mechanically connected to or located in the terminal 40 (evidently, if only one reference sensor is used, it may be placed in various positions depending on convenience of the user or a particular situation).
The above description, with reference to
Now exemplary embodiments of the method according to the invention will be described with references to
When the terminal 40 senses, in beginning step 502, an incoming call, it starts sending control signals to the HC circuit 12 telling it that a call is coming. Typically, to save power, no audio signal link is yet activated. The HC circuit informs the user about incoming call by playing tone (via the earphone 16) and/or by initiating a vibration of the headset 10. In some headsets the audio link may be activated to play the ring tone from the phone or to inform the user about the call, e.g. by playing text-to-speech message with the number or calling person name (these actions, being conventional and not critical for the inventive method, are not indicated in
Meanwhile the user, being informed of the incoming call, makes one of alternative decisions: to ignore, to accept or to reject the call. In the first case, he does not perform any tapping. As a result, no signals from the headset sensor will be detected in step 508 by the R/P circuit during the first time interval TI1 because any small signals not exceeding the threshold Tr will be will be blocked.
At the end of the first time interval, according to the YES branch of block 510, the R/P circuit is deactivated in step 512, and the incoming call will be considered as “missed”.
In case the user decides to accept the call, he makes a single tap on the headset before the first time interval is over. This tap is transformed by the headset sensor into a relevant headset signal exceeding the threshold Tr, so that this signal will be passed, as indicated by the YES branch of block 508, to the R/P circuit. When the R/P circuit receives the signal, a period of monitoring for the second signal is initiated in step 514, which period corresponds to the second time interval TI2, said second time interval typically being less than the first time interval TI1. When, in step 518, expiration of the second time interval TI2 will be determined, with no signal detected in step 516 during this interval, the R/P circuit generates, in step 520, control data associated with the command “Accept call” and sends said data, via the HC circuit 12 and the IC circuit 44, to the terminal control unit 46. On receiving the control data sent by the R/P circuit, the control unit generates (in step 522) the appropriate command and sends it to the EC circuit, so that the call is accepted.
In case the user decides to reject the call, he makes, in addition to the first tap, the second tap on the headset before the second time interval is over. The sensor reacts to this reference tap by generating the second signal. On receiving this second signal, the R/P circuit generates in step 524 control data associated with the command “Reject call” and sends said data, via the HC circuit 12 and the IC circuit 44, to the terminal control unit 46. On receiving the control data sent by the R/P circuit, the terminal control unit generates (in step 526) the appropriate command and sends it to the EC circuit, so the call is rejected.
It will be evident to persons skilled in the art that, according to the above-described embodiment of the inventive method, the single tap and the double tap on the headset are equivalent to pressing a specific “accept call” or “reject call” button on the headset (or, alternatively, to pressing the same button twice). However, in many situations tapping may be more convenient for the user because it does not require locating a position of a small button, especially in case when the headset comprises more than one button.
Moreover, if a completely hands-free control is preferable, such control may be easily implemented according to other embodiments of the invention using various movements of the user's head as was described above with reference to
The voice communication system 100 in this example further comprises a reference sensor 50 (i.e. a dual axis or tri-axis accelerometer of the same type as the accelerometer used with the headset) located at some other object moveable relative to the user's head (i.e. attached to the voice terminal 40) and operatively connected to the R/P circuit, which, according to the embodiment of the system 100 shown in
Initial steps (not represented in
Being activated, the R/P circuit will start in parallel to receive (in step 602) the reference signals RS from the reference sensor 50 and to monitor (in step 604) for the headset signal SI of the first type from the headset sensor 20, the signal SI corresponding to movements of the user's head in the YZ plane. In case it is determined in steps 606, 608 that no headset signal has been generated during the preselected first time interval TI1′ allowed for reaction of the user to the ringing tone or to other signal provided by the headset, the R/P circuit is deactivated in step 610, and the incoming call is “missed”.
On receiving the headset signal SI (as indicated by the YES branch of step 606), the R/P circuit performs, in step 612, a comparison of the headset signal SI and the reference signal RS. Such comparison is performed, for example, by subtracting the reference signal RS (adjusted as explained above) from the first type headset signal SI. In step 614 a check is performed, whether resulting difference signal DS exceeds a preselected differential threshold DT (a level of this threshold is also selected depending on a character and amplitudes of potential movements of the object (in this case the terminal 40) bearing the reference sensor relative the user's head bearing the headset 10).
If it is determined in step 614 that the difference signal DS is less than the differential threshold DT, this signal is ignored as non-relevant, i.e. as a signal resulting from a non-intentional movement of the head produced by some movement or acceleration of the vehicle, so that monitoring for a relevant headset signal is continued (as indicated by the NO branch of the block 614) until the expiration of the first time interval TI1′. In this case the R/P circuit is deactivated in step 610 and the incoming call is “missed”.
When the user makes an intentional nod during the first time interval TI1′, the difference signal first becomes equal to the differential threshold DT and then exceeds it. Starting from this moment, the check is conducted (in steps 616-620), whether the detected movement of the head corresponds to a nod or to some other, non-relevant movement (for example, the user may simply bend his head or his whole body). More specifically, in step 616 comparison of the headset and reference signals is resumed for the second time interval TI2′ selected such that TI2′ is substantially less than TI1′ (for example, TI2′ may correspond to ⅓-⅔ of TI1′). If during the whole time interval TI2′ the difference signal continues to exceed the threshold DT (as determined in steps 618, 620), the earlier detected movement of the user's head is again interpreted as non-intentional (non-relevant to controlling the acceptance of the incoming call), so that, as indicated by the YES branch of the block 620, the R/P circuit is deactivated and the incoming call is “missed”.
Determination (in step 618) that the difference signal DR became lower than the threshold DT before the second time interval TI2 is over means that the user is indeed making a nod. On finding that DR became less than DT, the R/P circuit generates, in step 622, control data associated with the command “Accept call” and sends said data directly to the terminal control unit 46. On receiving control data sent by the R/P circuit, the terminal control unit generates (in step 624) an appropriate command and sends it to the EC circuit, so that the call is accepted. In parallel, the terminal control unit 46 activates a two-way audio transmission between the terminal 40 and the headset 10, so that the user may talk with a caller. Simultaneously, the control unit instructs the R/P circuit to monitor (in step 626) for signals of the second type from the headset sensor 20, these signals of the second type corresponding to movements of the user's head in the XY plane. Processing the second type signals SII, together with reference signals of the same type is quite similar to the described processing of the first type signals, so it will not described in detail here. When a relevant turn of the user's head is detected by the R/P circuit in step 628, it sends, in step 630, appropriate control data to the terminal control unit 46, which unit generates, in step 632, a command to end the call, just as if a specific “end call” button on the headset was pressed, and deactivate (in step 610) the R/P circuit.
It will be evident from the above description of the presented exemplary embodiments of the invention that with the use of even a single tri-axis accelerometer (or an inclinometer, a gyroscope or another appropriate sensor capable to detect movements and/or accelerations in three orthogonal planes, a large number of different head movements may be detected individually or in various combinations. This means that a large number of commands may be generated in an easy and (if desirable) in a completely hands-free manner using the above-described inventive method and system. As was explained above, to improve reliability of differentiating relevant (intentional) head movements from accidental, non-intentional ones, one or more reference sensors may be additionally employed.
While the invention has been described with references to certain embodiments, it will be understood by those skilled in the art that various changes may be made and equivalents may be substituted without departing from the scope of the invention. In addition, many modifications may be made to adapt a particular situation or an application to the teachings of the invention without departing from its scope. Various software tools may be employed in processing the headset signals in order to reliably differentiate such signals from noise generated by the sensors due to unintended movements of the user's head. The above-described commands generated by the sound terminal were presented only as examples of a wide range of commands which may be generated by various movements of the user's head or by combinations of such movements. The Bluetooth communication link connecting the Bluetooth mobile phone with the Bluetooth headset was also mentioned as an example only and other type(s) of wired or wireless communication (i.e. an Ultra Low Power (ULP) Bluetooth or Infrared (IR) link) may be employed. Instead of the voice communication system, some other appropriate sound system (i.e. a system comprising a media player and/or another sound reproducing device) or any appropriate combination of the voice communication system and the sound reproducing device(s) may be used. Therefore, it is intended that the invention not be limited to the particular embodiments disclosed but will include all embodiments falling, within the scope of the appended claims.
This application claims benefit of U.S. Provisional Application Ser. No. 60/966,822, filling date Sep. 17, 2007, entitled “Head Mounted Voice Communication Device with Motion Control”.