Claims
- 1. A method for training a speech recognition processor to respond to speech obtained from telephone systems, comprising the steps of:
- inputting a speech data set to a speech recognition training processor, said data set having a bandwidth higher than a telephone bandwidth;
- decimating said inputted speech data set in said training processor to obtain a decimated speech data set having said telephone bandwidth;
- applying a bandpass digital filter to said decimated speech data set in said training processor, said filter characterizing transmission characteristics of telephone equipment, for obtaining a filtered speech data set;
- rescaling the amplitude of said filtered speech data set in said training processor, so that the maximum dynamic range of said filtered speech data set matches the maximum dynamic range of uncompanded telephone speech, to obtain a rescaled speech data set;
- modifying said rescaled speech data set in said training processor, with quantization noise representing companding and uncompanding a speech signal in a telephone system, to obtain a modified speech data set;
- inputting said modified speech data set into a hidden Markov model speech recognition processor to train statistical pattern matching data units;
- performing speech recognition on voice signals from a telephone system with said speech recognition processor.
- 2. The method of claim 1 wherein:
- said telephone bandwidth is any bandwidth lower than said higher bandwidth.
- 3. The method of claim 1 which further comprises:
- said bandpass digital filter has a maximally flat design algorithm.
- 4. The method of claim 1 wherein said rescaling step results in a maximum dynamic range matching a maximum dynamic range of uncompanded mu-law telephone speech.
- 5. The method of claim 1 wherein said rescaling step results in a maximum dynamic range matching a maximum dynamic range of uncompanded A-law telephone speech.
- 6. The method of claim 1 wherein said modifying step has quantization noise as mu-law noise.
- 7. The method of claim 1 wherein said modifying step has quantization noise as A-law noise.
Parent Case Info
This is a continuation of prior application Ser. No. 07/948,031, filed Sep. 21, 1992, now abandoned.
US Referenced Citations (21)
Foreign Referenced Citations (1)
Number |
Date |
Country |
215573 |
Aug 1985 |
EPX |
Non-Patent Literature Citations (2)
Entry |
Takebayashi et al., "Telephone Speech Recognition Using A Hybrid Method", ICPR (International Conference of Pattern Recognition), Dec. 1989, pp. 1232-1235. |
IEEE Article by K. F. Lee & H. W. Hon, "Large Vocabulary Speaker Independent Continuous Speech Recognition Using HMM," 1988, pp. 123-126, (CH2561-9/88/00000-0123). |
Continuations (1)
|
Number |
Date |
Country |
Parent |
948031 |
Sep 1992 |
|