This invention relates to wireless communication terminals, especially mobile telephones, and the hands-free activation of such terminals.
It is known to incorporate voice recognition software in mobile telephones to allow users to dial a caller by name. However, in order to make use of this facility, the telephone has to be operated manually because, even when in the standby mode, the audio system is not normally turned on. Instead, the receiver only is powered up to receive the paging channel to check for incoming call requests, and for reasons of power saving, the audio system remains turned off.
According to the invention, a wireless communication terminal is adapted so that it is capable of recognizing a predetermined sound in the vicinity of the terminal and its audio input system is powered on periodically when the terminal is in the standby mode and serves to activate the terminal if said predetermined sound is recognized.
Preferably, the audio input system is powered up with the paging channel, and preferably only operates during the paging channel for reasons of power saving, and then processes the received audio signal to recognize said predetermined sound if it is present. In a DSP based GSM terminal, the same DSP processor is used for the radio modem and audio processing, and therefore powering up the processor for paging will automatically make the audio processing function available and produce said audio signal if the audio input system is also powered up.
The paging channel in a mobile telephone consists of a number of paging blocks of short duration separated by an interval of 0.5 to 2.5 seconds. For example, a GSM terminal has a paging channel of four data blocks or bursts, each 4.615 ms long. Each burst has a portion allocated to radio modem processing and the remainder allocated to audio processing, which over four bursts might total 16 ms. Thus, the audio input system of a GSM terminal according to the invention has to recognize said predetermined sound over a short interval of about 16 ms, which would be difficult for a speech pattern. Preferably, therefore, the sound selected is a whistle, which has a narrow bandwidth characteristic and changes only slowly with time so that it can be more easily recognized from a short sample. Also, a whistle can be more easily distinguished from other sounds and will therefore avoid false responses.
The invention is therefore based on the fact that sound recognition is a useful function that can be switched on periodically in a mobile telephone during the standby mode, either with the paging channel or any other short duration channel such as a monitoring channel, and can then be used to recognize narrow bandwidth sounds such as a whistle, to activate the telephone. Once activated, the telephone may then be responsive to voice commands and may then support a speaker phone mode of application.
The invention will now be described by way of example with reference to the accompanying drawings in which:
A typical GSM mobile terminal, as illustrated in
When such a GSM mobile terminal is in the standby mode, the power unit 7 only powers up the radio module 1 and DSP on a low duty cycle to receive a paging radio channel to check whether an incoming call is being requested. The speaker module 5 and microphone module 6 are not powered up in the standby mode in order to save power until such time as they may be required.
The paging channel in GSM consists of four data frames or bursts, each 4.615 ms long, as shown in
Said predetermined audio input is preferably a whistle, this having a narrow bandwidth characteristic which makes it more easily recognizable from a short sample, as illustrated in
It is not necessary that the whistle is of a particular pitch or even that the pitch is held constant with time. The recognition algorithm would merely take a snapshot of the signal and look for a single narrow-band peak much higher than the surrounding signal at other frequencies.
The key feature of the whistle is that it is narrow-band at all times; it is therefore not necessary to scan for it continuously in order to detect it. The GSM paging cycle allowing 16 ms samples of speech at a maximum of 2.1 s intervals is therefore sufficient for whistle recognition.
In a simple implementation, it would be necessary for the user to keep whistling for this maximum interval of 2.1 s to ensure that at least one block of audio samples is captured. However, if it turns out that this is too long to maintain a whistle, then the whistle length could be reduced with an increase in power consumption.
A suitable whistle recognition algorithm needs to detect a narrow-band signal of unknown frequency in the presence of speech with low false alarm probability. A pre-shaping filter would be provided to remove low frequency components from the signal which would otherwise affect the recognition process.
Reasonable recognition/false alarm results have been obtained using the following algorithm:—
An alternative non-linear approach is based on the low variance of the phase increment per sample in the audio block for a whistle compared with speech.
Although the algorithm has been discussed in terms of GSM, it will be appreciated that it can be generalized for any wireless communications system. The only requirement is the capability to periodically switch on the audio hardware to sample 16 ms of audio data. All mobile phone systems should fulfill this requirement since the mobile will need to switch itself on periodically either to listen for paging signals (or their equivalent) or for network measurements, and being a phone it should have the appropriate audio capabilities. As long as this duty cycle is sufficient, the algorithm need not be modified.
In one embodiment of the invention, a mobile terminal is further adapted to include voice dialling and speaker phone operation. The user is then able to use the terminal in hands-free mode as follows:
Speaker phone operation with a mobile terminal requires a loud audio output and some form of echo control.
Number | Date | Country | Kind |
---|---|---|---|
0207732.9 | Apr 2002 | GB | national |
Filing Document | Filing Date | Country | Kind | 371c Date |
---|---|---|---|---|
PCT/GB03/01462 | 4/3/2003 | WO | 00 | 5/2/2005 |
Publishing Document | Publishing Date | Country | Kind |
---|---|---|---|
WO03/084084 | 10/9/2003 | WO | A |
Number | Name | Date | Kind |
---|---|---|---|
4903319 | Kasai et al. | Feb 1990 | A |
4933963 | Sato et al. | Jun 1990 | A |
5222121 | Shimada | Jun 1993 | A |
5842139 | Muramatsu et al. | Nov 1998 | A |
6088576 | Sone | Jul 2000 | A |
6108543 | Takahashi et al. | Aug 2000 | A |
Number | Date | Country |
---|---|---|
296 21 022 | Feb 1997 | DE |
0 498 398 | Aug 1992 | EP |
1 119 159 | Jul 2001 | EP |
08-298698 | Nov 1996 | JP |
WO 0161872 | Aug 2001 | WO |
Number | Date | Country | |
---|---|---|---|
20050221862 A1 | Oct 2005 | US |