Claims
- 1. A method for encoding speech, comprising the steps of:
- sampling an original speech signal;
- producing spectral coefficients from said samples;
- interpolating the spectral coefficients; and
- subjecting interpolated spectral coefficients to pitch analysis to obtain a spectral residual signal.
- 2. A method for encoding speech as in claim 1, wherein said samples are pre-emphasized before spectral coefficients are produced.
- 3. A method for encoding speech as in claim 1, wherein the samples are perceptually weighted before producing said spectral coefficients.
- 4. An apparatus for encoding speech, comprising:
- means for sampling an original speech signal;
- means for producing spectral coefficients from said samples;
- means for interpolating the spectral coefficients; and
- means for performing a pitch analysis of the interpolated spectral coefficients to obtain a spectral residual signal.
- 5. An apparatus for encoding speech as in claim 4, further comprising means for perceptually weighting said samples before producing spectral coefficients.
- 6. An improved method for encoding a digitized speech signal comprising the steps of:
- a) defining a filter with coefficients based upon selected interpolated parameters of the digitized speech signal;
- b) perceptually weighting said digitized speech signal;
- c) selectively pulsing said filter to create a synthetic speech signal which is an approximation of said perceptually weighted digitized speech signal;
- d) comparing said synthetic speech signal to said perceptually weighted digitized speech signal to determine the difference between the two signals;
- e) selectively pulsing the filter to create a correction signal which approximates said difference; and
- f) combining said correction signal with said synthetic speech signal to provide a modified synthetic speech signal which is a better approximation of said perceptually weighted digitized speech signal.
- 7. The method according to claim 6 wherein steps d, e and f are repeated with respect to said modified speech signal to provide increasingly better approximations of said perceptually weighted digitized speech signal.
- 8. The method according to claim 6 wherein steps d, e and f are performed four times so that an approximated synthetic speech signal defined by five selected pulses is produced such that said interpolated filter parameters and the parameters of said five pulses can be transmitted to a receiving station whereat said approximated speech signal can be reproduced at said receiving station.
- 9. The method of claim 6 wherein the selection of each successive pulse does not impact the selection of the previous pulses.
- 10. The method of claim 6 wherein said defining step further includes:
- quantizing said coefficients using a quantizer table based upon voiced speech to produce voiced coefficients;
- quantizing said coefficients using a quantizer table based upon unvoiced speech to produce unvoiced coefficients;
- comparing said voiced and unvoiced coefficients to determine which coefficients have the smallest error;
- retaining said coefficients having the smallest error; and
- interpolating said coefficients having the smallest error.
- 11. The method of claim 10 further including converting said voiced and unvoiced coefficients to spectral coefficients prior to said comparing step.
- 12. The method of claim 11 wherein said comparing step comprises computing the log-spectral distance between said coefficients and said quantized voiced and unvoiced coefficients.
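The iterative pulse selection recited in claims 6 to 9 can be illustrated with a minimal sketch. It is not the patented implementation: it assumes the input is already perceptually weighted, that the filter is all-pole and represented by its truncated impulse response, and that each pulse is chosen greedily by maximizing normalized cross-correlation (a common multipulse criterion that the claims do not spell out). The names `impulse_response` and `multipulse_search` are hypothetical.

```python
def impulse_response(a, n):
    """First n samples of the impulse response of 1/A(z),
    where A(z) = 1 + a[0] z^-1 + a[1] z^-2 + ... (assumed convention)."""
    h = []
    for i in range(n):
        acc = 1.0 if i == 0 else 0.0
        for k, ak in enumerate(a, start=1):
            if i - k >= 0:
                acc -= ak * h[i - k]
        h.append(acc)
    return h


def multipulse_search(target, h, num_pulses=5):
    """Greedy pulse search over one frame.

    target: perceptually weighted speech frame (assumed precomputed).
    h: filter impulse response, e.g. from impulse_response().
    Returns the (position, amplitude) pulses and the synthetic frame.
    """
    n = len(target)
    synthetic = [0.0] * n
    pulses = []
    for _ in range(num_pulses):
        # Step d: difference between the target and the current synthetic
        # signal. On the first pass the synthetic signal is zero, so this
        # pass plays the role of step c.
        diff = [t - s for t, s in zip(target, synthetic)]
        best = None
        for pos in range(n):
            span = min(len(h), n - pos)
            corr = sum(diff[pos + j] * h[j] for j in range(span))
            energy = sum(h[j] * h[j] for j in range(span))
            if energy == 0.0:
                continue
            reduction = corr * corr / energy  # error removed by this pulse
            if best is None or reduction > best[0]:
                best = (reduction, pos, corr / energy)
        if best is None:
            break
        _, pos, amp = best
        pulses.append((pos, amp))
        # Steps e and f: the new pulse's filter response is the correction
        # signal, added to the synthetic signal. Earlier pulses are left
        # unchanged, in the spirit of claim 9.
        for j in range(min(len(h), n - pos)):
            synthetic[pos + j] += amp * h[j]
    return pulses, synthetic
```

With `num_pulses=5` the loop corresponds to step c followed by four passes of steps d, e and f, as in claim 8; the returned pulse positions and amplitudes, together with the filter parameters, are what would be transmitted.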
Parent Case Info
This application is a continuation of Application Ser. No. 08/670,986, filed Jun. 28, 1996, now abandoned, which is a continuation of Application Ser. No. 08/104,174, filed Aug. 9, 1993, now abandoned, which is a continuation of Application Ser. No. 07/592,330, filed Oct. 3, 1990, now U.S. Pat. No. 5,235,670.
US Referenced Citations (7)
Foreign Referenced Citations (1)
| Number | Date | Country |
| WO8602726 | Jun 1986 | WOX |
Non-Patent Literature Citations (5)
B.S. Atal and J.R. Remde, "A New Model of LPC Excitation for Producing Natural-Sounding Speech at Low Bit Rates," Proc. ICASSP '82, pp. 614-617, Apr. 1982.
S. Singhal and B.S. Atal, "Improving Performance of Multi-Pulse Coders at Low Bit Rates," Proc. ICASSP '84, paper 1.3, Mar. 1984.
M. Berouti et al., "Efficient Computation and Encoding of the Multipulse Excitation for LPC," Proc. ICASSP '84, paper 10.1, Mar. 1984.
H. Alrutz, "Implementation of a Multi-Pulse Coder on a Single Chip Floating-Point Signal Processor," Proc. ICASSP '86, paper 44.3, Apr. 1986.
J. Bellamy, Digital Telephony, John Wiley & Sons, Inc., NY, 1991, pp. 153-154.
Continuations (3)
| | Number | Date | Country |
| Parent | 670986 | Jun 1996 | |
| Parent | 104174 | Aug 1993 | |
| Parent | 592330 | Oct 1990 | |