Claims
- 1. A speech encoding system using an analysis by synthesis approach on a speech signal having varying characteristics, the speech encoding system comprising:an encoder processing circuit that selectively applies a first or a second encoding scheme upon identification of varying characteristics of the speech signal; where the varying characteristics are utilized to classify the speech signal as having one of active voice content and inactive voice content; the first encoding scheme utilizes a first analysis-by-synthesis speech coding approach on a speech signal classified as active voice content; and the second encoding scheme utilizes a second analysis-by-synthesis speech coding approach on a speech signal classified as inactive voice content, the inactive voice content comprising background noise.
- 2. The speech encoding system of claim 1, wherein the varying characteristics of the speech signal comprises pitch characteristics.
- 3. The speech encoding system of claim 1, wherein the varying characteristics of the speech signal comprises periodicity characteristics.
- 4. The speech encoding system of claim 1, wherein the varying characteristics of the speech signal comprises intensity characteristics.
- 5. The speech encoding system of claim 1, wherein the encoder processing circuit selectively applies one of the first and the second encoding scheme at one of a plurality of bit rates.
- 6. A speech encoding system for processing a speech signal having varying characteristics, the speech encoding system comprising:an encoder processing circuit that selectively applies a first or a second analysis-by-synthesis encoding scheme based upon at least one of the varying characteristics of the speech signal; the encoder processing circuit applies the first analysis-by-synthesis encoding scheme following identification of an active voice frame of the speech signal; and the encoder processing circuit applies the second analysis-by-synthesis encoding scheme following identification of an inactive voice frame of the speech signal, the inactive voice frame comprising background noise.
- 7. The speech encoding system of claim 6, wherein the second encoding scheme selects a random excitation sequence to encode the speech signal.
- 8. The speech encoding system of claim 6, wherein the encoder processing circuit selectively applies one of the first and the second encoding scheme at one of a plurality of bit rates.
- 9. The speech encoding system of claim 6, wherein the second encoding scheme identifies an energy level.
- 10. The speech encoding system of claim 6, wherein the second encoding scheme identifies a spectral information.
- 11. The speech encoding system of claim 1, wherein the first encoding scheme selects operation in one of a long term predictor (LTP) mode and a pitch preprocessing (PP) mode.
- 12. The speech encoding system of claim 1, wherein the second encoding scheme selects a random excitation sequence after considering an energy level and spectral information of the speech signal.
- 13. The speech encoding system of claim 1, wherein a speech signal classified as inactive voice comprises silence.
- 14. The speech encoding system of claim 1, wherein a speech signal classified as inactive voice comprises background noise.
- 15. The speech encoding system of claim 6, wherein the first encoding scheme selects operation in one of a long term predictor (LTP) mode and a pitch preprocessing (PP) mode.
- 16. A method of encoding a speech signal comprising:classifying the speech signal as having one of active voice content and inactive voice content, the inactive voice content comprising background noise; applying a first encoding scheme comprising analysis-by-synthesis when the speech signal is classified as having active voice content; and applying a second encoding scheme comprising analysis-by-synthesis when the speech signal is classified as having inactive voice content.
- 17. The method of claim 16, further comprising identifying an energy level and spectral information of the speech signal when the second encoding scheme is applied.
- 18. The method of claim 17, further comprising performing encoding with a selected random excitation sequence after identifying the energy level and the spectral information.
- 19. The method of claim 16, further comprising applying one of the first encoding scheme and the second encoding scheme at one of a plurality of bit rates.
- 20. The method of claim 16, further comprising encoding a first frame of the speech signal with the first encoding scheme at a bit rate and encoding a second frame of the speech signal with the second encoding scheme at the same bit rate.
COMPACT DISC APPENDIX
A compact disc appendix containing Appendix A created on Feb. 7, 2003, and having 654,440 bytes; Appendix B created on Feb. 7, 2003, and having 201,066 bytes; and Appendix C created on Feb. 7, 2003, and having 905,455 bytes, is hereby included and incorporated herein by reference in its entirety and made part of the present U.S. Patent Application for all purposes.
The present invention is based on U.S. Provisional Application Ser. No. 60/097,569, filed Aug. 24, 1998.
The following applications are hereby incorporated herein by reference in their entirety and made part of the present application:
1) U.S. Provisional Application Ser. No. 60/097,569, entitled “Adaptive Rate Speech Codec,” filed Aug. 24, 1998;
2) U.S. patent application Ser. No. 09/154,675, entitled “Speech Encoder Using Continuous Warping In Long Term Preprocessing,” filed Sep. 18, 1998;
3) U.S. patent application Ser. No. 09/156,814, entitled “Completed Fixed Codebook For Speech Encoder,” filed Sep. 18, 1998;
4) U.S. patent application Ser. No. 09/156,649, entitled “Comb Codebook Structure,” filed Sep. 18, 1998;
5) U.S. patent application Ser. No. 09/156,648, entitled “Low Complexity Random Codebook Structure,” filed Sep. 18, 1998;
6) U.S. patent application Ser. No. 09/156,650, entitled “Speech Encoder Using Gain Normalization That Combines Open And Closed Loop Gains,” filed Sep. 18, 1998;
7) U.S. patent application Ser. No. 09/154,654, entitled “Pitch Determination Using Speech Classification And Prior Pitch Estimation,” filed Sep. 18, 1998;
8) U.S. patent application Ser. No. 09/154,657, entitled “Speech Encoder Using A Classifier For Smoothing Noise Coding,” filed Sep. 18, 1998;
9) U.S. patent application Ser. No. 09/156,826, entitled “Adaptive Tilt Compensation For Synthesized Speech Residual,” filed Sep. 18, 1998;
10) U.S. patent application Ser. No. 09/154,662, entitled “Speech Classification And Parameter Weighting Used In Codebook Search,” filed Sep. 18, 1998;
11) U.S. patent application Ser. No. 09/154,653, entitled “Synchronized Encoder-Decoder Frame Concealment Using Speech Coding Parameters,” filed Sep. 18, 1998;
12) U.S. patent application Ser. No. 09/154,663, entitled “Adaptive Gain Reduction To Produce Fixed Codebook Target Signal,” filed Sep. 18, 1998;
13) U.S. patent application Ser. No. 09/154,660, entitled “Speech Encoder Adaptively Applying Pitch Long-Term Prediction and Pitch Preprocessing With Continuous Warping,” filed Sep. 18, 1998.
US Referenced Citations (11)
Non-Patent Literature Citations (2)
Entry |
C.B. Southcott, D. Freeman, G. Cosier, D. Sereno, A. Van der Krogt, A. Gilloire, and H.J. Braun, “Voice Control of the Pan-European Digital Mobile Radio System,” Proceedings of the Global Telecommunications Conference and Exhibition (Globecom), US, New York, IEEE 1989, pp. 1070-1074. |
Erdal Paksoy, Krishnaswamy Srinivasan, and Allen Gersho, “Variable Bit-Rate CELP Coding of Speech with Phonetic Classification,” European Transactions on Telecommunications and Related Technologies, IT, AEI, Milano, vol. 5, Sep.-Oct. 1994, pp. 57/591-67-601. |
Provisional Applications (1)
|
Number |
Date |
Country |
|
60/097569 |
Aug 1998 |
US |