Claims
- 1. A method of performing pitch analysis for use in encoding speech, the method comprising:sampling a speech signal; spectrally whitening the sampled speech signal to produce a spectral residual signal; collecting samples of the spectral residual signal and autocorrelating the collected samples; determining maximum values of the correlated result; determining gain values based at least in part on the maximum values of the correlated result; and quantizing the gain values using a codebook to produce a codebook index and an associated frame delay, the codebook index and the frame delay representing a pitch of the speech signal and facilitate encoding the speech signal as a representation of the original speech signal.
- 2. The method of claim 1 further comprising pre-emphasizing the sampled speech signal prior to the spectral whitening.
- 3. The method of claim 2 wherein the pre-emphasizing takes a z-transform of the sampled speech signal.
- 4. The method of claim 1 wherein the spectral whitening uses an inverse linear predictive all-pole filter to produce the spectral residual signal.
- 5. The method of claim 1 wherein the collected samples are collected in a block of N samples and the block is appended to K prior samples to form a segment and the autocorrelating is performed on the segment.
- 6. The method of claim 1 wherein the maximum values are two maximum values.
- 7. The method of claim 1 wherein the gain values are 3-tap gain terms.
- 8. The method of claim 7 wherein the 3-tap gain terms are determined using Choleski matrix decomposition.
- 9. The method of claim 1 wherein the code book is a 32 word vector code book.
- 10. An apparatus for analyzing pitch to encode a speech signal, the apparatus comprising:a spectral whitening block having an input which receives digital speech signal samples of an original speech signal and outputs spectral residual signal samples; a pitch analysis block coupled to the spectral whitening block to collect spectral residual signal samples, autocorrelate the collected samples and output gain values based at least in part on maximum values of the correlated result; and a quantizer block coupled to said pitch analysis block using a codebook to produce a codebook index and an associated frame delay, the codebook index and the frame delay are outputted as quantized gain values representing a pitch of the speech signal, the quantized values facilitate encoding the speech signal as a representation of the original speech signal.
- 11. The apparatus of claim 10 further comprising a pre-emphasis block coupled to the input of the spectral whitening block to pre-emphasize the sampled speech signal.
- 12. The apparatus of claim 11 further comprising a sample and hold block coupled to an analog to digital converter to produce the speech signal samples.
- 13. The apparatus of claim 10 further comprising a bit packing block coupled to the quantizing block to combine the quantized values with other parameters of the encoded speech signal.
- 14. The apparatus of claim 13 further comprising a synthesizer/post filter block coupled to the bit packing block and having an input for receiving the combined result.
- 15. The apparatus of claim 10 wherein the spectral whitening block having an additional input for receiving linear predictive all-pole filter parameters and the spectral whitening block uses the linear predictive all-pole filter parameters to produce the spectral residual signal.
- 16. An apparatus for analyzing pitch to encode a speech signal, the apparatus comprising:means for sampling a speech signal; means for spectrally whitening the sampled speech signal to produce a spectral residual signal; means for collecting samples of the spectral residual signal and autocorrelating the collected samples; means for determining maximum values of the correlated result; means for determining gain values at least in part on the maximum values of the correlated result; and means for quantizing the gain values using a codebook to produce a codebook index and an associated frame delay, the codebook index and the frame delay representing a pitch of the speech signal and facilitate encoding the speech signal as a representation of the original speech signal.
- 17. The apparatus of claim 16 wherein the means for spectral whitening uses an inverse linear predictive all-pole filter to produce the spectral residual signal.
REFERENCES TO RELATED APPLICATIONS
This application is a continuation of application Ser. No. 08/950,658, filed Oct. 15, 1997, now U.S. Pat. No. 6,006,174, which is a continuation of application Ser. No. 08/670,986, filed Jun. 28, 1996, now abandoned, which is a continuation of application Ser. No. 08/104,174, filed Aug. 9, 1993, now abandoned, which is a continuation of application Ser. No. 07/592,330, filed Oct. 3, 1990, which issued on Aug. 10, 1993 as U.S. Pat. No. 5,235,670.
US Referenced Citations (15)
Foreign Referenced Citations (1)
| Number |
Date |
Country |
| WO8602726 |
Jun 1986 |
WO |
Non-Patent Literature Citations (6)
| Entry |
| Veeneman et al., “Computationally efficient stochastic coding of speech,” 1990 IEEE 40th Vehicular Technology Conference, May 1990, pp. 331 to 335.* |
| Proc. ICASSP '82, A New Model of LPC Excitation for Producing Natural-Sounding Speech at Low Bit Rates, B.S. Atal and J.R. Remde, pp. 614-617, Apr., 1982. |
| Proc. ICASSP '84, Improving Performance of Multi-Pulse Coders at Low Bit Rates, S. Singhal and B.S. Atal, paper 1.3, Mar. 1984. |
| Proc. ICASSP '84, Efficient Computation and Encoding of the Multiple Excitation for LPC, M. Berouti et al., paper 10.1, Mar., 1984. |
| Proc. ICASSP '86, Implementation of Multi-Pulse Coder on a Single Chip Floating-Point Signal Processor, H. Alrutz, paper 44.3, Apr., 1986. |
| Digital Telephony, John Bellamy, pp. 153-154, 1991. |
Continuations (4)
|
Number |
Date |
Country |
| Parent |
08/950658 |
Oct 1997 |
US |
| Child |
09/441743 |
|
US |
| Parent |
08/670986 |
Jun 1996 |
US |
| Child |
08/950658 |
|
US |
| Parent |
08/104174 |
Aug 1993 |
US |
| Child |
08/670986 |
|
US |
| Parent |
07/592330 |
Oct 1990 |
US |
| Child |
08/104174 |
|
US |