Claims
- 1. An automatic speech recognition system for receiving as an input a coded bitstream speech signal and generating therefrom feature recognition parameters, said recognition system including
a spectral envelope detection arrangement for generating features derived from line spectrum pair (LSP) information from the encoded bitstream; and an excitation signal extraction arrangement for generating features derived from adaptive codebook gain and fixed codebook gain information related to voiced and unvoiced information, respectively.
- 2. The automatic speech recognition system of claim 1 wherein the bitstream is defined on a frame-by-frame basis, each frame including a first set of bits associated with spectral envelope detection and the LSP coefficients and a second set of bits associated with the excitation signal extraction and the voiced/unvoiced feature parameters.
- 3. The automatic speech recognition system of claim 2 wherein the spectral envelope detection arrangement comprises
an inverse quantizer responsive to the first set of bits in each frame within the bitstream; an interpolator coupled to the output of the inverse quantizer for providing an output at a predetermined rate; an LSP-to-CEP conversion unit, responsive to the output of the interpolator to convert the LSP coefficients into CEP coefficients; and a cepstral weighting filter for smoothing the output of said speech recognition system.
- 4. The automatic speech recognition system of claim 2 wherein the second set of bits within each frame is divided into subframes, with an adaptive gain coefficient and a fixed gain coefficient calculated for each subframe.
- 5. The automatic speech recognition system of claim 4 wherein the adaptive gain coefficient is defined by:
- 6. The automatic speech recognition system of claim 5 wherein γ is less than or equal to 0.1.
- 7. A method of performing feature extraction on a continuous bitstream of encoded speech, the method comprising the steps of:
determining the line spectrum pair (LSP) coefficients from a predetermined set of bits in each frame in the continuous bitstream; converting the coefficients into spectral envelope information; determining adaptive codebook gain coefficients and fixed codebook gain coefficients from another set of bits in each frame of said continuous bitstream; and calculating voiced and unvoiced feature information from the adaptive and fixed codebook gain coefficients.
- 8. The method as defined in claim 7 wherein determining the line spectrum pair (LSP) coefficients comprises the steps of:
performing an inverse quantization on the predetermined set of bits in each frame; and performing an LSP to CEP conversion on the inverse quantized results.
- 9. The method as defined in claim 8 wherein in performing the LSP to CEP conversion, the LSP coefficients are first converted to LPC coefficients and the LPC coefficients are converted to CEP coefficients.
- 10. The method as defined in claim 8 wherein the method comprises the additional step of weighting the CEP coefficients.
- 11. The method as defined in claim 7 wherein in determining the adaptive codebook gain coefficients, the following equation is used:
- 12. The method as defined in claim 7 wherein in determining the fixed codebook gain coefficients, the following equation is used:
- 13. The method as defined in claim 12 wherein γ is less than or equal to 0.1.
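As an illustration of the conversion chain recited in claims 8 and 9 (LSP coefficients converted first to LPC coefficients, then to CEP coefficients), a minimal sketch follows. It is not taken from the specification: the function names `poly_mul`, `lsp_to_lpc`, and `lpc_to_cep` are hypothetical, the LSP values are assumed to be already inverse-quantized and expressed as ascending frequencies in radians, the predictor order is assumed to be even, and the cepstrum is assumed to be that of the all-pole filter 1/A(z) with A(z) = 1 + a_1 z^-1 + ... + a_p z^-p. The textbook sum/difference polynomial reconstruction and the standard LPC-to-cepstrum recursion are used.

```python
import math


def poly_mul(a, b):
    """Multiply two polynomials given as coefficient lists in powers of z^-1."""
    out = [0.0] * (len(a) + len(b) - 1)
    for i, x in enumerate(a):
        for j, y in enumerate(b):
            out[i + j] += x * y
    return out


def lsp_to_lpc(lsf):
    """Rebuild A(z) from line spectral frequencies (assumed: radians, ascending, even count).

    Returns [1, a_1, ..., a_p] with A(z) = (P(z) + Q(z)) / 2.
    """
    if len(lsf) % 2 != 0:
        raise ValueError("this sketch assumes an even predictor order")
    p_poly = [1.0, 1.0]    # symmetric polynomial, fixed root at z = -1
    q_poly = [1.0, -1.0]   # antisymmetric polynomial, fixed root at z = +1
    for i, w in enumerate(lsf):
        factor = [1.0, -2.0 * math.cos(w), 1.0]  # conjugate root pair on the unit circle
        if i % 2 == 0:
            p_poly = poly_mul(p_poly, factor)    # 1st, 3rd, ... frequencies
        else:
            q_poly = poly_mul(q_poly, factor)    # 2nd, 4th, ... frequencies
    order = len(lsf)
    # The z^-(p+1) terms of P and Q cancel, so only p + 1 coefficients are kept.
    return [(p + q) / 2.0 for p, q in zip(p_poly, q_poly)][: order + 1]


def lpc_to_cep(a, n_cep):
    """Cepstral coefficients c_1..c_n_cep of 1/A(z), with a = [1, a_1, ..., a_p]."""
    p = len(a) - 1
    c = [0.0] * (n_cep + 1)
    for n in range(1, n_cep + 1):
        acc = sum(k * c[k] * a[n - k] for k in range(max(1, n - p), n))
        c[n] = (-a[n] if n <= p else 0.0) - acc / n
    return c[1:]


# Usage: a second-order example with made-up frequencies.
example_lsf = [math.acos(0.85), math.acos(0.05)]
example_lpc = lsp_to_lpc(example_lsf)
example_cep = lpc_to_cep(example_lpc, 4)
```

Any cepstral weighting of the kind recited in claim 10 would be applied to the returned coefficients; its form is not given in the claims, so it is left out of the sketch.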
CROSS-REFERENCE TO RELATED APPLICATIONS
[0001] This application claims the priority of Provisional Application No. 60/170,170, filed Dec. 10, 1999.
Provisional Applications (1)
| Number | Date | Country |
|---|---|---|
| 60170170 | Dec 1999 | US |