Claims
- 1. Method of processing speech comprising:receiving an original speech signal; using sample and hold techniques to digitize the original speech signal at a predetermined sampling rate to produce samples; analyzing the samples on a block basis by acquiring a predetermined number of the samples; providing preemphasis filtering of the block of samples; generating reflection coefficients for the block of samples; quantizing the reflection coefficients for voiced and unvoiced speech values; converting the voiced and unvoiced speech values to respective spectral coefficients; and using the spectral coefficients to compute respective log-spectral distances between the unquantized spectrum and the quantized spectrum.
- 2. The method of claim 1, further comprising the preemphasis filtering providing a z-transform function.
- 3. The method of claim 1, further comprising the quantitizing of the reflection coefficients performed by using quantizer tables, the quantizer tables corresponding to the respective voiced and unvoiced speech values, thereby resulting in quantizing the reflection coefficients for voiced speech and quantizing the reflection coefficients for unvoiced speech.
- 4. The method of claim 1, wherein the digitization of the original speech signal uses A/D circuitry along with said sample and hold techniques.
- 5. The method of claim 1, further comprising providing the quantitized reflection coefficients to a circuit for signal whitening.
- 6. The method of claim 1, further comprising the performing a predictive all-pole (LPC) analysis of the samples to generate the reflection coefficients.
- 7. The method of claim 1, comprising:determining log-spectral distances of the quantized reflection coefficients; and selecting and retaining the set of quantized reflection coefficients which produces a smaller log-spectral distance.
- 8. The method of claim 7, further comprising:encoding the retained reflection coefficient parameters for transmission; and converting the encoded retained reflection coefficient parameters to corresponding all-pole linear predictive LPC filter coefficients.
- 9. The method of claim 1, further comprising:the LPC analysis performed on speech of block length N which corresponds to N/x seconds, where x is a sampling rate; and generating a set of filter coefficients is generated for every N samples of speech or every N/x sec.
- 10. The method of claim 9, further comprising interpolating the LPC parameters on a sub-frame basis at a sub-frame rate of twice the frame rate, thereby providing a set of parameters at a rate of twice the frame rate.
- 11. The method of claim 1, wherein the digitization of the original speech signal uses sample/hold and A/D circuitry at sampling rate of 8 kHz.
- 12. The method of claim 11, further comprising:the LPC analysis performed on speech of block length N which corresponds to N/8000 seconds; and generating a set of filter coefficients is generated for every N samples of speech or every N/8000 sec.
Parent Case Info
This application is a continuation of U.S. patent application Ser. No. 10/083,237, filed Feb. 26, 2002, now U.S. Pat. No. 6,611,799 which is a continuation of U.S. patent application Ser. No. 09/805,634, filed Mar. 14, 2001, now U.S. Pat. No. 6,385,577, which is a continuation of U.S. patent application Ser. No. 09/441,743, filed Nov. 16, 1999, now U.S. Pat. No. 6,223,152, which is a continuation of U.S. patent application Ser. No. 08/950,658, filed Oct. 15, 1997, now U.S. Pat. No. 6,006,174, which is a file wrapper continuation of U.S. patent application Ser. No. 08/670,986, filed Jun. 28, 1996 now abandoned, which is a file wrapper continuation of U.S. patent application Ser. No. 08/104,174, filed Aug. 9, 1993, now abandoned, which is a continuation of U.S. patent application Ser. No. 07/592,330, filed Oct. 3, 1990, now U.S. Pat. No. 5,235,670, which applications are incorporated herein by reference.
US Referenced Citations (21)
Foreign Referenced Citations (1)
| Number |
Date |
Country |
| WO8602726 |
Jun 1986 |
WO |
Non-Patent Literature Citations (6)
| Entry |
| Proc. ICASSP '82, A New Model of LPC Excitation for Producing Natural-Sounding Speech at Low Bit Rates, B.S. Atal and J.R. Remde, pp 614-617, Apr. 1982. |
| Proc. ICASSP '84, Improving Performance of Multi-Pulse Coders at Low Bit Rates, S. Singhal and B.S. Atal, paper 1.3, Mar. 1984. |
| Proc. ICASSP '84, Efficient Computation and Encoding of the Multiple Excitation for LPC, M. Berouti et al., paper 10. Mar. 1, 1984. |
| Proc. ICASSP '86, Implementation of Multi-Pulse Coder on a Single Chip Floating-Point Signal Processor, H. Alrutz, paper 44. Apr. 3, 1986. |
| Digital Telephony, John Bellamy, pp 153-154, 1991. |
| Veeneman et al., Computationally Efficient Stochastic Coding of Speech, 1990, IEEE 40th Vehicular Technology Conference, May 1990, pp. 331-335. |
Continuations (7)
|
Number |
Date |
Country |
| Parent |
10/083237 |
Feb 2002 |
US |
| Child |
10/446314 |
|
US |
| Parent |
09/805634 |
Mar 2001 |
US |
| Child |
10/083237 |
|
US |
| Parent |
09/441743 |
Nov 1999 |
US |
| Child |
09/805634 |
|
US |
| Parent |
08/950658 |
Oct 1997 |
US |
| Child |
09/441743 |
|
US |
| Parent |
08/670986 |
Jun 1996 |
US |
| Child |
08/950658 |
|
US |
| Parent |
08/104174 |
Aug 1993 |
US |
| Child |
08/670986 |
|
US |
| Parent |
07/592330 |
Oct 1990 |
US |
| Child |
08/104174 |
|
US |