This application claims priority to British Patent Application No. 1305436.6 filed Mar. 26, 2013, which is hereby incorporated by reference in its entirety.
The present disclosure relates to a system which allows a user to control functions of a vehicle by spoken instructions.
A system of this type is known from DE 100 38 803 A1. According to this prior art system, a speech processor on board the vehicle is adapted to recognize spoken instructions such as “open door” or “open trunk”, and to control actuators of a vehicle door or a of a trunk lid according to these instructions, provided that the speaker carries a radio transponder which proves that he is authorized to open the vehicle. In this way, a user does not have to use his hands to open the vehicle. The user can comfortably load the vehicle with goods which must be carried in both hands. However, this conventional system has a problem in that, since the speech processor is located on-board the vehicle and operates on audio data provided by vehicle-based microphones, the reliability of this system strongly depends on the level of ambient noise. In a noisy environment, a user may have to approach the vehicle so closely, in order to enable speech control that he himself obstructs the opening of the door or the trunk lid. Alternately, the user may have to shout in an embarrassing or disruptive way. Further, since the conventional system only verifies the presence of the radio transponder but has no means for identifying a speaker, there is the possibility of the system reacting to instructions spoken by unauthorized persons, e.g. if two vehicle equipped with the conventional system are parked side by side, and transponders of both vehicles are in the vicinity, both vehicles may react to an “open door” instruction spoken by one user, causing the doors to crash into each other.
Another problem of the conventional system has to do with the fact that speech recognition is a recent and rapidly developing technology. Although the computing power and storage capacity required for its execution may be present in most modern vehicles and could be used for speech recognition at no extra cost, a user who wishes to have speech control implemented in his vehicle will in some way or other have to cover the license fees for copyrighted or otherwise protected software.
It would be desirable, therefore, for a vehicle manufacturer to enable speech control of vehicle functions at a minimum cost for the user.
According to the present disclosure a system for controlling functions of a vehicle by speech is disclosed which includes a mobile terminal of a network, speech recognition means such as a computer or microprocessor configured to convert recorded speech into digital characters, and on-board control unit of the vehicle. The mobile network terminal includes a microphone for recording a user's speech, and a terminal interface for communication with the on-board control unit. The on-board control unit is connected to at least one subsystem of the vehicle and is configured to control the subsystem based on messages received from the mobile network terminal. The mobile network terminal is adapted to process a string of digital characters derived from the user's speech into a message and to transmit the message to the on-board control unit.
The present disclosure makes use of the fact that many mobile network terminals, such as smartphones or mobile PCs, come equipped with or can be programmed to provide speech recognition functionality. A primary purpose of such speech recognition means is to enable a user to input a text message to be transmitted to the network not by typing, but by simply speaking to the mobile network terminal. So, any user who possesses such a mobile network terminal has already covered the costs related to speech processing software, and the present disclosure enables the user to put a speech processed by such a mobile terminal to a further use.
Since such a mobile network terminal is not permanently installed in the vehicle but, in most cases, will be carried with the user, the microphone will be located close the user's mouth, and there is no need for the user to shout in order to be properly understood by the system, even in a noisy environment.
Further, since the system of the present disclosure may be used not only for controlling the vehicle, there is considerable opportunity for the system to be trained and to adapt to the user's voice, so that a high degree of reliability can be achieved.
The mobile network terminal may include a user interface which enables the user to choose between an first, network operating mode in which a string of digital characters derived from the user's speech is transmitted to the network, for example in the form of a text message; a second, vehicle operating mode in which such a string of digital characters is transmitted to the on-board control unit; and possibly, other operating modes. Since in the vehicle control operating mode the variety of instructions which the speech recognition means are to detect in the user's speech is considerably reduced, these instructions can be recognized with a high degree of reliability, even if only a rather simple and fast recognition algorithm is used.
On the other hand, the mobile network terminal may be adapted to judge from the information content of a string of characters derived from the user's speech whether it contains an instruction to the vehicle and should therefore be transmitted to the on-board control unit, or not. According to this embodiment, the user does not have to choose an appropriate operating mode of the mobile network terminal before being able to control the vehicle, which is clearly convenient if the need or wish to control the vehicle arises unexpectedly.
The mobile network terminal may be adapted to compare the string of digital characters derived from the user's speech with a predetermined set of instructions for controlling a subsystem of the vehicle, and to transmit the string to the on-board control unit only if it is found to match an instruction from the set. The set of instructions that can be carried out by the vehicle-based interface may vary from one vehicle to the other or even, for a given vehicle, depending on previously received instructions. If the on-board control unit is adapted to communicate the set of instructions to the mobile network terminal, the latter can recognize these instructions with high reliability using a simple speech recognition algorithm.
As pointed out above, the mobile network terminal may be a mobile telephone, and the network, hence, a mobile telephone network. Mobile telephone networks conventionally support a SMS or short message service for transmitting a character string which may be derived from a user's speech to another telephone of the network. Of course, the network may also interface the mobile network terminal to the internet. Most mobile telephone networks provide this service, depending on the conditions of contract.
The speech recognition means may be implemented locally in the mobile network terminal. This is an advantage in particular when it must be ensured that an instruction spoken by the user is processed and transmitted to the on-board control unit within a predetermined delay. Else, the speech recognition means may also be implemented in a remote terminal of the network, in which case the mobile network terminal only requires transmission of the recorded speech to the remote terminal and receipt of the string of characters derived therefrom back from the remote terminal. Since the mobile network terminal is thus relieved from the task of speech recognition, its hardware may be rather simple, and its energy consumption is reduced, enabling it to run for a long time without the need to exchange or recharge its battery.
As a security measure, the mobile network terminal may be adapted to transmit an identification key to the on-board control unit, and the on-board control unit can be adapted to compare the transmitted identification key to an expected key before reacting to a message from the mobile network terminal only if the keys match. Control of the vehicle by an unauthorized terminal can thus be prevented.
The object of the present disclosure is further achieved by a method for controlling functions of a vehicle by speech including recording a user's speech in a mobile network terminal, concerting said speech into a string of digital characters, transmitting a message including said string from the mobile network terminal to on-board control unit of said vehicle, the on-board control unit controlling at least one subsystem of the vehicle based on said string of characters.
The present disclosure may further be embodied in a computer program product including program code means which enable a computer to operate as the mobile network terminal or to carry out the method as described above.
The present disclosure may further be embodied in a computer readable data carrier or nono-transitory computer readable data medium having program instructions stored on it which enable a computer to operate as said mobile network terminal or to carry out the method.
Further features and advantages of the present disclosure will become apparent from the subsequent description of embodiments thereof referring to the appended drawings. The description and the drawings disclose features which are not mentioned in the claims. Such features may be embodied in other combinations than those specifically disclosed herein. From the fact that two or more such features are disclosed in a same sentence or in some other kind of common context it must not be concluded that they can only appear in the combination specifically disclosed; rather, any feature of such a combination may appear without the others, unless the description gives positive reason to assume that in that case the present disclosure would be inoperable.
The present disclosure hereinafter will be described in conjunction with the following drawing figures, wherein like numerals denote like elements, and:
The following detailed description is merely exemplary in nature and is not intended to limit the present disclosure or the application and uses of the present disclosure. Furthermore, there is no intention to be bound by any theory presented in the preceding background or the following detailed description.
In
Vehicle-based interface 3 and an on-board computer 11 connected to it form the on-board control unit of motor vehicle 2. Examples of subsystems of vehicle 2 that are controlled by on-board control unit shown in
A user interface 26 may be provided which enables the driver to specify to onboard computer 11 for which of the various subsystems 12, 15, etc. controlled by computer 11 voice control shall be enabled.
As is usual for a smartphone or a mobile PC, a microphone 22 and a loudspeaker 23 may be directly integrated into a common casing with CPU 4, storage means 5 and user interface 6. If the mobile network terminal 1 is worn e.g. in a clothing pocket, such an integrated microphone may have difficulties in properly recording the user's speech. Therefore, in the context of the present disclosure, it may be convenient for the user to wear a headset connected to the mobile network terminal 1, so that a microphone of the headset may be used for recording his speech.
Such a general purpose algorithm requires considerable processing power and storage capacity. Although such an algorithm and its data may be stored locally in storage means 5 and executed by the CPU 4 itself, it may be preferable to implement the algorithm in a remote speech processor 24 and to have the CPU 4 only convert the audio signal into digital data, e.g. a WAV file, which is then transferred to remote speech processor 24 via the cell phone network 10 and, eventually, the internet. The speech processor 24 detects spoken words in the audio file and returns these to the mobile network terminal 1.
Step S3 verifies whether the character string output by the speech recognition algorithm is a valid instruction which on-board computer 11 is capable of processing. An efficient and fast way to do this is by comparing the character string to a set valid instructions stored locally in memory 5 of mobile network terminal 1. Since the on-board computer 11 will know which subsystems of the vehicle are connected to it and are capable of being voice-controlled, or which of these have been allowed to be voice-controlled by the driver, and what instructions directed to these subsystems it supports, this set of instructions should preferably be uploaded from on-board computer 11 to mobile network terminal 1 prior to the start of the procedure of
A simple alternative way of verifying whether the character string is a valid instruction is to transmit the character string in a message to vehicle-based terminal 3 and to wait for a reply from the latter. If the mobile network terminal 1 receives an acknowledgment from vehicle-based terminal 3, then the string was a valid instruction and has been or is being processed by on-board computer 11, and the process returns to step 51 to wait for further audio signals. Else, if an error message is received as reply from vehicle-based terminal 3, the string was no valid instruction and could not be processed.
In that case it is forwarded to some other process running on mobile network terminal 1, e.g. in order to be made use of in step S4 as part of an SMS message which is displayed on a screen of user interface 6, and is transmitted to another terminal 25 connected to cell phone network 10 when complete. It might also be interpreted as an instruction or part of an instruction for controlling the communication of terminal 1 within the network 10, e.g. as the phone number or part of the phone number of a participant such as terminal 25, as an instruction for selecting/changing the operating mode of terminal 1, and the like.
Any message transmitted from mobile network terminal 1 to vehicle-based terminal 3 in step S3 may include key data, e.g. an IMEI number of terminal 1, which enables onboard computer 11 to verify the origin of all received messages and to ignore those which come from a terminal which is not cleared to control functions of the vehicle subsystems.
Since in this process an audio signal received by microphone 22 is compared not with the complete vocabulary of the user's language but only with a very small number of predetermined words or expressions, a quick and simple algorithm is sufficient to identify spoken instructions with a high degree of reliability.
Not all instructions supported by vehicle-based interface 3 may be applicable at any time. For instance, by a first instruction, e.g. “headlights” the user may have selected a subsystem to which a subsequent instruction will apply. In that case, as the next instruction, “on” or “off” may make sense, but “open” or “close” does not. Conversely, if a first instruction specifying a certain activity such as “open” has been identified, a subsequent instruction can be expected to identify a subsystem to which the first instruction is to apply. In case of an “open” instruction, such a subsystem might be one of the doors 13, the trunk lid 14 or the slidable roof 16, but not the lights 17, 18. Therefore, in the process of
While at least one exemplary embodiment has been presented in the foregoing detailed description, it should be appreciated that a vast number of variations exist. It should also be appreciated that the exemplary embodiment is only an example, and are not intended to limit the scope, applicability, or configuration of the present disclosure in any way. Rather, the foregoing detailed description will provide those skilled in the art with a convenient road map for implementing an exemplary embodiment, it being understood that various changes may be made in the function and arrangement of elements described in an exemplary embodiment without departing from the scope of the present disclosure as set forth in the appended claims and their legal equivalents.
Number | Date | Country | Kind |
---|---|---|---|
1305436.6 | Mar 2013 | GB | national |