Claims
- 1. A hybrid speech encoding method, comprising the steps of:
(a) classifying frames of speech signals as voiced, unvoiced, or transitory; (b) using harmonic coding to compress frames associated with at least one of said classes; and (c) coding frames classified as transitory using a coding technique selected from the group consisting of waveform coding, analysis-by-synthesis coding, codebook excited linear prediction analysis-by-synthesis coding, and multipulse analysis-by-synthesis coding.
- 2. A method as recited in claim 1, further comprising the steps of:
(a) time aligning a harmonically coded frame in a decoder when the preceding frame has been waveform encoded for pairs of adjacent frames comprising a waveform encoded frame followed by a harmonic coded frame; and (b) time aligning the frame in an encoder to be waveform encoded when the subsequent frame is to be harmonically encoded for pairs of adjacent frames comprising a waveform encoded frame followed by a harmonically coded frame.
- 3. A method for encoding speech in an encoder for communication to a decoder for reproduction thereof, said speech comprising a plurality of frames of speech, said method comprising the steps of:
(a) classifying each frame of speech into three or more classes wherein one or more of said classes is transitory in character; (b) representing the speech in a frame of speech associated with at least one of said classes with a harmonic model; (c) computing parameter values of said harmonic model where said parameter values are characteristic of the frame; and (d) quantizing said parameters for communication to said decoder.
- 4. A method as recited in claim 3, wherein one or more of said transitory classes is encoded using a coding technique selected from the group consisting of waveform-matching coding, analysis-by-synthesis coding, codebook excited linear prediction analysis-by-synthesis coding, and multipulse analysis-by-synthesis coding.
- 5. A method as recited in claim 4, further comprising the step of time aligning the reproduced speech across the boundary between two successive frames of speech where one frame of speech is waveform coded and the other frame of speech is harmonically coded.
- 6. A method as recited in claim 5, further comprising the step of time-aligning a harmonically coded frame of speech in the decoder when the preceding frame of speech has been waveform encoded for pairs of adjacent frames of speech comprising a waveform encoded frame of speech followed by a harmonically coded frame of speech.
- 7. A method as recited in claim 5, further comprising the step of time-aligning the frame in the encoder to be waveform-encoded when the subsequent frame is to be harmonically coded for pairs of adjacent frames comprising a waveform encoded frame followed by a harmonically coded frame.
- 8. A hybrid method of encoding speech in an encoder for transmission to a decoder for reproduction thereof, comprising the steps of:
(a) classifying frames of the speech signal into steady state voiced, stationary unvoiced, or transitory speech segments; (b) coding a frame with harmonic coding if the frame is classified as steady state voiced speech; (c) coding a frame with “noise-like” coding if the frame is classified as stationary unvoiced speech; and (d) coding a frame classified as transitory using a coding technique selected from the group consisting of waveform coding, analysis-by-synthesis coding, codebook excited linear prediction analysis-by-synthesis coding, and multipulse analysis-by-synthesis coding.
- 9. A method as recited in claim 8, further comprising the step of time aligning the reproduced speech across the boundary between two successive frames of speech where one frame of speech is waveform coded and the other frame of speech is harmonically coded.
- 10. A method as recited in claim 9, further comprising the step of time-aligning a harmonically coded frame of speech in the decoder when the preceding frame of speech has been waveform encoded for pairs of adjacent frames of speech comprising a waveform encoded frame of speech followed by a harmonically coded frame of speech.
- 11. A method as recited in claim 9, further comprising the step of time-aligning the frame in the encoder to be waveform-encoded when the subsequent frame is to be harmonically coded for pairs of adjacent frames comprising a waveform encoded frame followed by a harmonically coded frame.
- 12. A method for encoding a speech signal for communication to a decoder for reproduction, comprising the steps of:
(a) computing a plurality of pitch candidates using time-domain or frequency domain estimation; (b) computing a candidate synthetic signal for each pitch value; (c) modifying the original signal to match each of the synthetic signal candidates; (d) computing a weight which represents the degree of matching achieved in step (d); and (e) selecting the final pitch value by making use of the weights computed in step (d).
- 13. A hybrid speech encoder, comprising:
(a) means for classifying frames of speech signals as voiced, unvoiced, or transitory; (b) means for harmonically coding frames associated with at least one of said classes; and (c) means for coding frames classified as transitory using a coding technique selected from the group consisting of waveform coding, analysis-by-synthesis coding, codebook excited linear prediction analysis-by-synthesis coding, and multipulse analysis-by-synthesis coding.
- 14. A hybrid speech encoder as recited in claim 13, further comprising:
(a) means for time aligning a harmonically coded frame in a decoder when the preceding frame has been waveform encoded for pairs of adjacent frames comprising a waveform encoded frame followed by a harmonic coded frame; and (b) means for time aligning the frame in an encoder to be waveform encoded when the subsequent frame is to be harmonically encoded for pairs of adjacent frames comprising a waveform encoded frame followed by a harmonically coded frame.
- 15. A hybrid speech encoder for communication to a decoder for reproduction of speech, said speech comprising a plurality of frames of speech, said encoder comprising:
(a) means for classifying each frame of speech into three or more classes wherein one or more of said classes is transitory in character; (b) means for representing the speech in a frame of speech associated with at least one of said classes with a harmonic model; (c) means for computing parameter values of said harmonic model where said parameter values are characteristic of the frame; and (d) means for quantizing said parameters for communication to said decoder.
- 16. A hybrid speech encoder as recited in claim 15, wherein one or more of said transitory classes is encoded using a coding technique selected from the group consisting of waveform-matching coding, analysis-by-synthesis coding, codebook excited linear prediction analysis-by-synthesis coding, and multipulse analysis-by-synthesis coding.
- 17. A hybrid speech encoder as recited in claim 16, further comprising means for time aligning the reproduced speech across the boundary between two successive frames of speech where one frame of speech is waveform coded and the other frame of speech is harmonically coded.
- 18. A hybrid speech encoder as recited in claim 17, further comprising means for time-aligning a harmonically coded frame of speech in the decoder when the preceding frame of speech has been waveform encoded for pairs of adjacent frames of speech comprising a waveform encoded frame of speech followed by a harmonically coded frame of speech.
- 19. A hybrid speech encoder as recited in claim 17, further comprising means for time-aligning the frame in the encoder to be waveform-encoded when the subsequent frame is to be harmonically coded for pairs of adjacent frames comprising a waveform encoded frame followed by a harmonically coded frame.
- 20. An apparatus for encoding speech for transmission to a decoder for reproduction thereof, comprising:
(a) means for classifying frames of the speech signal as steady state voiced, stationary unvoiced, or transitory speech; (b) means for coding a frame with harmonic coding if the frame is classified as steady state voiced speech; (c) means for coding a frame with “noise-like” coding if the frame is classified as stationary unvoiced speech; and (d) means for coding a frame classified as transitory using a coding technique selected from the group consisting of waveform coding, analysis-by-synthesis coding, codebook excited linear prediction analysis-by-synthesis coding, and multipulse analysis-by-synthesis coding.
- 21. An apparatus as recited in claim 20, further comprising means for time aligning the reproduced speech across the boundary between two successive frames of speech where one frame of speech is waveform coded and the other frame of speech is harmonically coded.
- 22. An apparatus as recited in claim 21, further comprising means for time-aligning a harmonically coded frame of speech in the decoder when the preceding frame of speech has been waveform encoded for pairs of adjacent frames of speech comprising a waveform encoded frame of speech followed by a harmonically coded frame of speech.
- 23. An apparatus as recited in claim 21, further comprising means for time-aligning the frame in the encoder to be waveform-encoded when the subsequent frame is to be harmonically coded for pairs of adjacent frames comprising a waveform encoded frame followed by a harmonically coded frame.
- 24. An apparatus for encoding a speech signal for communication to a decoder for reproduction, comprising:
(a) means for computing a plurality of pitch candidates using time-domain or frequency domain estimation; (b) means for computing a candidate synthetic signal for each pitch value; (c) means for modifying the original signal to match each of the synthetic signal candidates; (d) means for computing a weight which represents the degree of matching achieved by said signal modifying means; and (e) means for selecting the final pitch value by making use of said weights.
- 25. A hybrid speech encoder, comprising:
(a) a speech classifier, said speech classifier classifying frames of speech signals as voiced, unvoiced, or transitory; (b) a harmonic encoder, said harmonic encoder harmonically coding frames associated with at least one of said classes; and (c) a transitory encoder, said transitory encoder coding frames classified as transitory using a coding technique selected from the group consisting of waveform coding, analysis-by-synthesis coding, codebook excited linear prediction analysis-by-synthesis coding, and multipulse analysis-by-synthesis coding.
- 26. A hybrid speech encoder as recited in claim 25, further comprising:
(a) a first phase synchronizer, said first phase synchronizer time aligning a harmonically coded frame in a decoder when the preceding frame has been waveform encoded for pairs of adjacent frames comprising a waveform encoded frame followed by a harmonic coded frame; and (b) a second phase synchronizer, said second phase synchronizer time aligning the frame in an encoder to be waveform encoded when the subsequent frame is to be harmonically encoded for pairs of adjacent frames comprising a waveform encoded frame followed by a harmonically coded frame.
- 27. A hybrid speech encoder, comprising:
(a) a speech classifier, said speech classifier classifying frames of speech signals as voiced, unvoiced, or transitory; (b) a encoder for voiced signals; (c) an encoder for unvoiced signals; and (d) an encoder for transitory signals; (e) wherein at one of said encoders comprises a harmonic encoder, and wherein at least one of said encoders comprises an encoder selected from the group consisting of a waveform encoder, an analysis-by-synthesis encoder, a codebook excited linear prediction analysis-by-synthesis encoder, and a multipulse analysis-by-synthesis encoder.
- 28. A hybrid speech encoder as recited in claim 27, further comprising:
(a) a first phase synchronizer, said first phase synchronizer time aligning a harmonically coded frame in a decoder when the preceding frame has been waveform encoded for pairs of adjacent frames comprising a waveform encoded frame followed by a harmonic coded frame; and (b) a second phase synchronizer, said second phase synchronizer time aligning the frame in an encoder to be waveform encoded when the subsequent frame is to be harmonically encoded for pairs of adjacent frames comprising a waveform encoded frame followed by a harmonically coded frame.
- 29. A hybrid speech decoder, comprising:
(a) a harmonic decoder, said harmonic decoder harmonically decoding frames of speech associated with at least one of the classes of speech comprising voiced, unvoiced and transitory speech; and (b) a transitory decoder, said transitory decoder decoding frames of speech classified as transitory using a decoding technique selected from the group consisting of waveform decoding, analysis-by-synthesis decoding, codebook excited linear prediction analysis-by-synthesis decoding, and multipulse analysis-by-synthesis decoding.
- 30. A hybrid speech decoder, comprising:
(a) a decoder for speech signals classified as voiced signals; (c) a decoder for speech signals classified as unvoiced signals; and (d) a decoder for speech signals classified as transitory signals; (e) wherein at one of said decoders comprises a harmonic decoder, and wherein at least one of said decoders comprises a decoder selected from the group consisting of a waveform decoder, an analysis-by-synthesis decoder, a codebook excited linear prediction analysis-by-synthesis decoder, and a multipulse analysis-by-synthesis decoder.
- 31. A hybrid speech compression system, comprising:
(a) a speech classifier, said speech classifier classifying frames of speech signals as voiced, unvoiced, or transitory; (b) a harmonic encoder, said harmonic encoder harmonically coding frames associated with at least one of said classes; (c) a transitory encoder, said transitory encoder coding frames classified as transitory using a coding technique selected from the group consisting of waveform coding, analysis-by-synthesis coding, codebook excited linear prediction analysis-by-synthesis coding, and multipulse analysis-by-synthesis coding; (d) a harmonic decoder; and (e) a transitory decoder, said transitory decoder decoding frames of speech classified as transitory using a decoding technique selected from the group consisting of waveform decoding, analysis-by-synthesis decoding, codebook excited linear prediction analysis-by-synthesis decoding, and multipulse analysis-by-synthesis decoding.
- 32. A hybrid speech compression system as recited in claim 31, further comprising:
(a) a first phase synchronizer, said first phase synchronizer time aligning a harmonically coded frame in a decoder when the preceding frame has been waveform encoded for pairs of adjacent frames comprising a waveform encoded frame followed by a harmonic coded frame; and (b) a second phase synchronizer, said second phase synchronizer time aligning the frame in an encoder to be waveform encoded when the subsequent frame is to be harmonically encoded for pairs of adjacent frames comprising a waveform encoded frame followed by a harmonically coded frame.
- 33. A hybrid speech compression system, comprising:
(a) a speech classifier, said speech classifier classifying frames of speech signals as voiced, unvoiced, or transitory; (b) an encoder for voiced signals; (c) an encoder for unvoiced signals; (d) an encoder for transitory signals; (e) wherein at one of said encoders comprises a harmonic encoder, and wherein at least one of said encoders comprises an encoder selected from the group consisting of a waveform encoder, an analysis-by-synthesis encoder, a codebook excited linear prediction analysis-by-synthesis encoder, and a multipulse analysis-by-synthesis encoder; (f) a decoder for speech signals classified as voiced signals; (g) a decoder for speech signals classified as unvoiced signals; and (h) a decoder for speech signals classified as transitory signals; (i) wherein at one of said decoders comprises a harmonic decoder, and wherein at least one of said decoders comprises a decoder selected from the group consisting of a waveform decoder, an analysis-by-synthesis decoder, a codebook excited linear prediction analysis-by-synthesis decoder, and a multipulse analysis-by-synthesis decoder.
- 34. A hybrid speech compression system as recited in claim 33, further comprising:
(a) a first phase synchronizer, said first phase synchronizer time aligning a harmonically coded frame in a decoder when the preceding frame has been waveform encoded for pairs of adjacent frames comprising a waveform encoded frame followed by a harmonic coded frame; and (b) a second phase synchronizer, said second phase synchronizer time aligning the frame in an encoder to be waveform encoded when the subsequent frame is to be harmonically encoded for pairs of adjacent frames comprising a waveform encoded frame followed by a harmonically coded frame.
CROSS-REFERENCE TO RELATED APPLICATIONS
[0001] This application claims priority from U.S. provisional application serial No. 60/057,415 filed on Aug. 29, 1997, which is incorporated herein by reference.
Provisional Applications (1)
|
Number |
Date |
Country |
|
60057415 |
Aug 1997 |
US |
Continuations (1)
|
Number |
Date |
Country |
Parent |
09143265 |
Aug 1998 |
US |
Child |
09777424 |
Feb 2001 |
US |