Claims
- 1. A speech recognition system, comprising: a speech input device which receives a target speech of an analog signal; a converting device which is coupled to the speech input device and which converts said target speech of said analog signal into digital signals and which converts said digital signals into ordered speech frames; a memory device which stores a speech recognition program and phoneme strings representing each of a plurality of speech candidates, each phoneme string including ordered phonemes; a collating device which is coupled to said memory device and to said converting device and which executes said speech recognition program to collate said target speech input from said speech input device with said plurality of speech candidates; wherein a processing device executes said speech recognition program including: storing said ordered speech frames into the memory device; collating said ordered speech frames with said phoneme strings; and providing a collation result, wherein said collating includes: comparing one of said ordered speech frames with a portion of each of said phoneme strings, said portion including consecutive ones of said ordered phonemes; obtaining, based on a result of said comparing, likelihoods representing similarities between said one frame and said portion of each of said phoneme strings; computing evaluation values representing similarities between said portion of each of said phoneme strings and said target speech, based on said likelihoods and on a plurality of transition probabilities corresponding to different combinations of said portion of each of said phoneme strings; and if said evaluation value for a head phoneme of said portion is smaller than that of a last phoneme in said portion, changing phonemes to be collated for the next one of said speech frames to a new portion in said phoneme strings, wherein said new portion is said portion with the head phoneme removed therefrom and with the next phoneme in the corresponding phoneme string added to said new portion.
- 2. A speech recognition system according to claim 1, wherein said memory device includes: a ROM storing said phoneme strings representing said plurality of speech candidates and said speech recognition program, and a RAM in which said ordered speech frames are to be stored, wherein said phoneme strings representing said plurality of speech candidates and said speech recognition program stored in said ROM are transferred to the RAM in response to an initialization of said speech recognition system.
- 3. A speech recognition system according to claim 2, wherein said ROM includes: a first ROM which stores said speech recognition program, and a second ROM which stores said phoneme strings representing said plurality of speech candidates, wherein said converting device, said collating device and said first ROM are formed on one semiconductor chip.
- 4. A speech recognition system according to claim 3, wherein said collating device is a CPU.
- 5. A speech recognition system according to claim 4, wherein said system is a navigation system.
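
The collating step recited in claim 1 can be illustrated with a minimal sketch for a single speech candidate. This is not the claimed implementation: the function name `collate_windowed`, the log-domain scoring, the window width of three phonemes, the two transition probabilities (stay in a phoneme, advance to the next one) and the toy match-based likelihood are all assumptions made for illustration only.

```python
import math
from typing import Callable, List, Sequence, Tuple

NEG_INF = -math.inf


def collate_windowed(
    frames: Sequence[object],
    phonemes: Sequence[str],
    log_likelihood: Callable[[object, str], float],
    window: int = 3,                    # assumed number of phonemes in the portion
    log_stay: float = math.log(0.5),    # assumed log transition probability (stay)
    log_next: float = math.log(0.5),    # assumed log transition probability (advance)
) -> Tuple[List[float], int]:
    """Return (evaluation values of the final portion, start index of that portion)."""
    width = min(window, len(phonemes))
    start = 0
    # Every path is required to begin in the first phoneme of the string.
    scores = [0.0] + [NEG_INF] * (width - 1)
    first = True
    for frame in frames:
        new_scores = []
        for j in range(width):
            if first:
                best_prev = scores[j]   # no transition before the first frame
            else:
                stay = scores[j] + log_stay
                advance = scores[j - 1] + log_next if j > 0 else NEG_INF
                best_prev = max(stay, advance)
            # Evaluation value = best predecessor + likelihood of this frame.
            new_scores.append(best_prev + log_likelihood(frame, phonemes[start + j]))
        first = False
        scores = new_scores
        # Head phoneme scores lower than the last phoneme: drop the head and
        # append the next phoneme of the string for the next frame.
        if start + width < len(phonemes) and scores[0] < scores[-1]:
            start += 1
            scores = scores[1:] + [NEG_INF]
    return scores, start


def toy_log_likelihood(frame: object, phoneme: str) -> float:
    # Hypothetical stand-in for an acoustic model: frames are phoneme labels.
    return 0.0 if frame == phoneme else -10.0


frames = ["k", "k", "a", "a", "t", "t", "o", "o"]
scores, start = collate_windowed(frames, ["k", "a", "t", "o"], toy_log_likelihood)
print(start, scores[-1])
```

In this sketch only `window` evaluation values are kept per frame, rather than one per phoneme of the whole string, which is the reason for moving the portion forward as the head phoneme falls behind.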
Parent Case Info
This is a continuation application of U.S. Ser. No. 09/554,003, filed May 9, 2000, which is a 371 of PCT/JP97/04324, filed Nov. 27, 1997.
US Referenced Citations (3)
| Number | Name | Date | Kind |
| --- | --- | --- | --- |
| 4783803 | Baker et al. | Nov 1988 | A |
| 5983180 | Robinson | Nov 1999 | A |
| 5999902 | Scahill et al. | Dec 1999 | A |
Non-Patent Literature Citations (1)
| Entry |
| --- |
| “Fundamentals of Speech Recognition”, 1993, L. Rabiner et al., pp. 231-232. |
Continuations (1)
| | Number | Date | Country |
| --- | --- | --- | --- |
| Parent | 09/554003 | | US |
| Child | 09/625855 | | US |