Claims
- 1. A method for transcoding a bit stream encoded according to a linear predictive coding (LPC) standard to a bit stream encoded according to a mixed-excitation linear prediction (MELP) standard, comprising:
decoding a bit stream into a first set of vocoder parameters compatible with the LPC standard; transforming the first set of vocoder parameters into a second set of vocoder parameters compatible with the MELP standard without converting the first set of vocoder parameters to an analog or digital waveform representation; and encoding the second set of vocoder parameters into a bit stream compatible with the MELP vocoder standard.
- 2. The method of claim 1, wherein the transforming includes converting LPC spectrum, voicing, pitch, and gain parameters to MELP spectrum, voicing, pitch, and gain parameters.
- 3. The method of claim 2, wherein the converting of LPC spectrum parameters to MELP spectrum parameters includes:
removing preemphasis effects from LPC spectral coefficients.
- 4. The method of claim 3, wherein the removing of preemphasis effects comprises:
converting LPC reflection coefficients (RC) to their equivalent normalized autocorrelation coefficients (R); storing the autocorrelation coefficients (R) symmetrically in a first array; and convolving the first array with a second array storing symmetrical autocorrelation coefficients (HH) of a deemphasis filter, thereby producing a deemphasized set of autocorrelation coefficients (R′).
- 5. The method of claim 4, wherein the converting of the LPC reflection coefficients (RC) to their equivalent normalized autocorrelation coefficients (R) is carried out to approximately 50 time lags, and the reflection coefficients (RCs) having an index above the number of LPC spectrum parameters are set to zero.
- 6. The method of claim 4, wherein the deemphasis filter is the inverse of a preemphasis filter applied by the LPC standard.
- 7. The method of claim 4, wherein the convolving of the first array with the second array is performed according to the following equation:
- 8. The method of claim 4, further comprising:
converting the deemphasized autocorrelation coefficients R′ to deemphasized reflection coefficients (RC′) and deemphasized predictor filter coefficients (A′).
- 9. The method of claim 4, further comprising:
checking the stability of a synthesis filter formed by the deemphasized reflection coefficients (RC′), and if the synthesis filter is unstable, using a minimum order stable model instead of a model represented by the entire set of deemphasized reflection coefficients.
- 10. The method of claim 2, further comprising:
converting LPC reflection coefficients (RC) to their equivalent predictor filter coefficients (A); and storing the predictor filter coefficients for later use in formant enhancement.
- 11. The method of claim 10, further comprising:
performing formant enhancement of the deemphasized predictor filter coefficients either in the correlation domain or in the frequency domain to produce formant enhanced deemphasized predictor filter coefficients (A″).
- 12. The method of claim 11, wherein formant enhancement is performed in the correlation domain and the formant enhancement process utilizes both the predictor filter coefficients A and deemphasized predictor filter coefficients A′.
- 13. The method of claim 11, wherein the formant enhanced predictor filter coefficients (A″) are converted to MELP line spectrum frequencies (LSFs).
- 14. The method of claim 13, further comprising:
adaptively smoothing the MELP LSFs.
- 15. The method of claim 2, further comprising generating a plurality of MELP harmonic amplitudes.
- 16. The method of claim 2, wherein the converting of LPC voicing parameters to MELP voicing parameters is performed by one of three methods corresponding to three different conditions:
(a) mid-frame onset (the first LPC half-frame voicing bit is unvoiced and the second half-frame voicing bit is voiced); (b) fully voiced (both LPC half-frame voicing bits are voiced); and (c) fully un-voiced and mid-frame unvoiced transition (either both LPC half-frame voicing bits are unvoiced or the first LPC half frame is voiced and the second LPC half-frame voicing bit is unvoiced).
- 17. The method of claim 16, wherein when (a) mid-frame onset occurs, the top two MELP bandpass voicing strengths are set to be zero (unvoiced) to reduce perceptual buzziness in the output speech.
- 18. The method of claim 2, wherein the converting of LPC pitch parameters to MELP pitch parameters comprises:
taking a logarithm of the LPC pitch parameters.
- 19. The method of claim 2, wherein the converting of LPC gain parameters to MELP gain parameters comprises:
scaling the LPC gain parameters to account for preemphasis removal performed on the LPC spectrum parameters.
- 20. The method of claim 19, further comprising calculating an LPC gain ratio for each frame of LPC parametric data decoded, wherein the LPC gain ratio is the ratio of an LPC synthesis filter gain after preemphasis removal to LPC synthesis filter gain before preemphasis removal, multiplied by a scaling factor.
- 21. The method of claim 19, further including
assigning a first MELP half-frame gain to be equal to the logarithmic average of the LPC gain parameter from the previous frame and the LPC gain parameter from the current frame; and assigning a second MELP half-frame gain to be equal to the LPC gain parameter for the current frame.
- 22. The method of claim 2, further comprising:
encoding MELP LSFs, pitch parameters, voicing and jitter parameters, and gain parameters according to a MELP quantization standard algorithm.
- 23. A method of converting LPC parameters to MELP parameters, comprising:
receiving LPC parameters; and transforming the LPC parameters to MELP parameters without converting to a decompressed form.
- 24. A transcoder for transcoding a bit stream encoded according to a linear predictive coding (LPC) standard to a bit stream encoded according to a mixed-excitation linear prediction (MELP) standard, comprising:
a decoder decoding a bit stream into a first set of vocoder parameters compatible with the LPC standard; a conversion unit transforming the first set of vocoder parameters into a second set of vocoder parameters compatible with the MELP standard without converting the first set of vocoder parameters to an analog or digital waveform representation; and an encoder encoding the second set of vocoder parameters into a bit stream compatible with the MELP vocoder standard.
CROSS-REFERENCE TO RELATED APPLICATIONS
[0001] This application is a continuation-in-part (CIP) of U.S. patent application Ser. No. 09/822,503 filed Apr. 2, 2001 (“Compressed Domain Universal Transcoder”), the disclosure of which is incorporated herein in its entirety.
Continuation in Parts (1)
|
Number |
Date |
Country |
Parent |
09822503 |
Apr 2001 |
US |
Child |
09983513 |
Oct 2001 |
US |