The present invention relates to a speech input method and system, and more particularly to a speech input method and system for the portable device.
Nowadays, the capacity of the storage medium is getting larger and larger and the price thereof is getting lower and lower, which makes the storage medium to be more popularized in the market. The portable device available in the market, such as the MP3 player and iPod, already has a large capacity capable of storing more than 200 songs. As a result, if the user wants to search a favorite song among the great amount of songs stored therein, the only way therefore is to press the keys on the portable device and scroll the songs shown on the monitor of the portable device one by one.
Usually, there is no interface for word input on the portable device. Also, in view of compactness, portability and simple operation, it is impossible to employ an additional keyboard or dispose too many keys on the portable device. Taking the MP3 player for example, if the user wants to search a favorite song, currently the only way therefor is to press the keys on the portable device and scroll the songs shown on the monitor of the portable device one by one. Such way is very inefficient if there are too many songs stored in the storage medium of the MP3 player. Therefore, the speech input method provides a convenient way to solve the above problems.
If the speech input function is able to be combined with the portable device for searching the songs stored therein, the user can find his favorite songs easily without having to press the keys on the portable device. Besides, such a portable device with the speech input function has distinctive features over the conventional one and possesses a high additional value.
Therefore, a novel speech input method and speech input system are developed and provided in the present invention. The particular design in the present invention not only solves the problems described above, but is also easy to be implemented. Thus, the present invention has the utility for the industry.
In accordance with one aspect of the present invention, a speech input method and a relevant system for the portable device are provided. The speech input system is able to support the function of multi-lingual input. Furthermore, a proper acoustic unit can be selected by the speech input system based on existing hardware, such as the CPU and the memory.
In accordance with another aspect of the present invention, a speech input method and a relevant system for the portable device are provided. In the speech input system, the acoustic unit is separate from the search unit. It is not necessary to supply all lexicons and the database can be expanded unlimitedly.
In accordance with a further aspect of the present invention, a speech input method and system for the portable device are provided. The portable device is capable of being connected to a remote server via the wireless network to access the database of the remote server. In this way, not only the capacity of the database in the portable device can be economized, but the efficiency thereof can be enhanced.
In accordance with further another aspect of the present invention, a speech input method for a portable device is provided. The speech input method includes steps of (a) selecting a language mode and determining an acoustic unit, (b) inputting a speech by a user and comparing the speech with the acoustic unit to generate a plurality of recognition results, (c) selecting one of the recognition results for obtaining a plurality of keywords with a recognition result-to-keyword mapping table, (d) obtaining a plurality of selected results having the keywords therein from a database by using the keywords as search units, (e) repeating step (b) to step (d) so as to narrow a range of the selected results when a next speech is present, and (f) displaying the selected results in order when the next speech is absent.
Preferably, the portable device is a player.
Preferably, the acoustic unit is one selected from a group consisting of a phonetic symbol, a syllable, a word and a letter.
Preferably, the search units are keywords selected from a group consisting of syllables without a tone, syllables with a tone, words and letters corresponding to the acoustic unit.
Preferably, the acoustic unit is generated by a multi-lingual unit.
Preferably, the acoustic unit is determined by the multi-lingual unit based on the language mode.
Preferably, the recognition result-to-keyword mapping table is a syllable-to-character mapping table.
Preferably, the recognition result-to-keyword mapping table is a character-to-character mapping table.
In accordance with further another aspect of the present invention, a speech input system for a portable device is provided. The speech input device includes a multi-lingual unit for determining an acoustic unit for a language mode selected by a user, a database for storing data, and a mapping table for storing a plurality of keywords which are based on a comparison result of at least one speech inputted by the user with the acoustic unit, wherein a plurality of selected results are generated by searching the database in response to the keywords.
Preferably, the portable device is a player.
Preferably, the acoustic unit is one selected from a group consisting of a phonetic symbol, a syllable, a word and a letter.
Preferably, the data are song files.
Preferably, the mapping table is a syllable-to-character mapping table.
Preferably, the mapping table is a character-to-character mapping table.
Preferably, the selected results are song files stored in the database.
Preferably, the speech input system is further connected to a remote server via a wireless network for accessing a database of the remote server.
In accordance with further another aspect of the present invention, a speech input method for a portable device is provided. The speech input method includes steps of (a) selecting a language mode and determining an acoustic unit, (b) inputting a speech by a user and comparing the speech with the acoustic unit to generate a plurality of recognition results, (c) selecting one of the recognition results as a search unit for searching a database so as to obtain a plurality of selected results having the search unit therein, (d) repeating step (b) to step (c) so as to narrow a range of the selected results when a next voice is present, and (e) displaying the selected results in order when the next speech is absent.
Preferably, the portable device is a player.
Preferably, the acoustic unit is one selected from a group consisting of a word and a letter.
Preferably, the search unit is one selected from a group consisting of a word and a letter.
Preferably, the acoustic unit is generated by a multi-lingual unit.
Preferably, the acoustic unit is determined by the multi-lingual unit based on the language mode.
In accordance with further another aspect of the present invention, a speech input system for a portable device is provided. The speech input system includes a multi-lingual unit for determining an acoustic unit for a language mode selected by a user; and a database for storing data, wherein a plurality of selected results are generated by searching the database in response to a comparison result of at least one speech inputted by the user with the acoustic unit.
Preferably, the comparison result is a search unit for searching the database so as to generate the selected results.
Preferably, the search unit is one selected from a group consisting of syllables without a tone, syllables with a tone, words and letters corresponding to the acoustic unit.
The above objects and advantages of the present invention will become more readily apparent to those ordinarily skilled in the art after reviewing the following detailed descriptions and accompanying drawings, in which:
The present invention will now be described more specifically with reference to the following embodiments. It is to be noted that the following descriptions of preferred embodiments of this invention are presented herein for the purposes of illustration and description only; it is not intended to be exhaustive or to be limited to the precise form disclosed.
In the present invention, the acoustic unit is used for the speech input recognition. Taking English for example, the letter is applied to the acoustic unit. Whereas, the phonetic symbol and the syllable can be adopted as the acoustic unit in the Chinese system. Due to the increase of more and more new songs and singers as well as the limitation of the computing capability and the memory size for the portable device, all databases can be covered under limited hardware resources with the employment of the acoustic unit for the speech input recognition. However, the “word” can be considered as the acoustic unit if the hardware resources are sufficient.
Please refer to
The mapping table 18 is a recognition result-to-keyword mapping table, so that the keywords are acquired therefrom based on the selected recognition result for searching the selected results within the database 19. Preferably, the mapping table 18 is a syllable-to-character mapping table or a character-to-character mapping table. All song files are stored in the database 19. Referring now to
Assume that the user wants to search a Chinese song, provided that the phonetic symbols serve as the acoustic unit and the Chinese character corresponding to the syllable without a tone serves as the search unit. If the user wants to listen to (a Chinese song)” by a Chinses signer)”, the steps for searching it are as follows.
(a) The user speaks (one of the phonetic symbols for ).
(b) Among the recognition results the user selects .
(c) The user then speaks .
(d) Refer to the syllable-to-character mapping table, and the following Chinese characters are found out:
(Chinese characters)”.
(e) A list of the song files containing the above Chinese characters is displayed:
(singer-song)”.
(f) At this time, the user can press the keys on the portable device to choose the song he wants to listen to, or inputs the next speech to further narrow the range of the selected results.
For example, the user speaks (one of the phonetic symbols for
Among the recognition results the user selects
Refer to the syllable-to-character mapping table, and the following Chinese characters are found out:
(Chinese characters)”.
A list of the song files containing the above Chinese characters is displayed:
(singer-song)”.
Assume that the user wants to search a Chinese song, provided that the syllable serves as the acoustic unit and the Chinese character corresponding to the syllable with a tone serves as the search unit. If the user wants to listen to (a Chinese song)” by (a Chinese signer)”, the steps for searching it are as follows.
(a) the user speaks (the phonetic symbol for with the tone of ).
(b) Among the recognition results the user selects
(c) Refer to the syllable-to-character mapping table, and the following Chinese characters are found out:
(Chinese characters)”.
(d) A list of the song files containing the above Chinese characters is displayed:
(singer-song)”.
(e) At this time, the user can press the keys on the portable device to choose the song he wants to listen to, or inputs the next speech to further narrow the range of the selected results.
For example, the user speaks (the phonetic symbol for with the tone of
Among the recognition results the user selects
Refer to the syllable-to-character mapping table, and the following Chinese characters are found out:
(Chinese characters)”.
A list of the song files containing the above Chinese characters is displayed:
(singer-song)”.
Please refer to
Assume that the user wants to search a English song, provided that the English letter serves as the acoustic unit as well as the search unit. If the user wants to listen to “Can't Fight The Moonlight” by “LeAnn Rimes”, the steps for searching it are as follows.
(a) The user speaks “L”.
(b) Among the recognition results “l”, “a”, “r”, the user selects “l”.
(c) Refer to the character-to-character mapping table, and the following English characters are found out:
(d) A list of the song files containing “L” or “l” at the head thereof (like looking up English vocabulary with the electronic dictionary) is displayed.
(e) The user selects the song files containing “L” at the head thereof. At this time, the user can input the next speech to further narrow the range of the selected results.
Assume that the user wants to search a Chinese song, provided that the word serves as the acoustic unit as well as the search unit. If the user wants to listen to (a Chinese song)” by (a Chinese singer)”, the steps for searching it are as follows.
(a) the user speaks
(b) Among the recognition results (all are Chinese singers), the user selects
(c) Search the song files containing from the database and list the results.
(d) At this time, the user can press the keys on the portable device to choose the song he wants to listen to, or inputs the next speech to further narrow the range of the results.
Assume that the user wants to search a Japanese song, provided that the Japanese phonetic symbol serves as the acoustic unit, and the HIRAGANA or the KATAKANA serves as the search unit.
For example, the user speaks “ka”, and a plurality of recognition results could be etc. Then, the user selects and the song files with the titles containing are searched from the database. At this time, the user can press the keys on the portable device to choose the song he wants to listen to, or inputs the next speech to further narrow the range of the selected results.
In conclusion, the present invention has the following features and advantages over the prior art.
1. The speech input system and method of the present invention are able to support the function of multi-lingual input.
2. A proper acoustic unit can be selected by the speech input system of the present invention based on existing hardware, such as the CPU and the memory.
3. In the present invention, the acoustic unit is separate from the search unit. It is not necessary to supply all lexicons and the database can be expanded unlimitedly.
4. The portable device of the present invention is capable of being connected to a remote server via the wireless network to access the database of the remote server. In this way, not only the capacity of the database in the portable device can be economized, but the efficiency thereof can be enhanced.
Accordingly, the present invention can effectively solve the problems and drawbacks in the prior art, and thus it fits the demand of the industry and is industrially valuable.
While the invention has been described in terms of what is presently considered to be the most practical and preferred embodiments, it is to be understood that the invention needs not be limited to the disclosed embodiments. On the contrary, it is intended to cover various modifications and similar arrangements included within the spirit and scope of the appended claims which are to be accorded with the broadest interpretation so as to encompass all such modifications and similar structures.
Number | Date | Country | Kind |
---|---|---|---|
093141879 | Dec 2004 | TW | national |