Claims
- 1. A speech encoding method comprising: dividing an input speech signal into a plurality of frames each having a predetermined length; dividing a speech signal of each of the frames into a plurality of subframes; obtaining a predictive pitch period of a subframe in a to-be-encoded current frame by using pitch periods of at least two frames of the current frame and past and future frames with respect to the current frame; obtaining a pitch period of a subframe in the current frame by using the predictive pitch period; preparing a relative pitch pattern codebook storing a plurality of relative pitch patterns representing fluctuations in pitch periods of a plurality of subframes; and expressing a change in pitch period of plural subframes with one relative pitch pattern selected from said relative pitch pattern codebook.
- 2. A method according to claim 1, further comprising encoding the pitch period of the subframe in the current frame.
- 3. A method according to claim 1, further comprising preparing a pitch filter for suppressing or emphasizing a pitch period component of an input speech signal, and determining a transfer function for said pitch filter by using the pitch period of the subframe in the current frame.
- 4. A method according to claim 1, wherein obtaining the pitch period of the frame comprises adaptively deciding a pitch period analysis position for each frame.
- 5. A method according to claim 4, wherein deciding the pitch period analysis position includes deciding based on one of magnitude of a power of the speech signal, a predictive error signal and a short-term power of a predictive error signal obtained through a low-pass filter.
- 6. A method according to claim 1, further comprising selecting a manner of obtaining a pitch period of a subframe in the current frame in accordance with continuance of pitch periods.
- 7. A method according to claim 1, wherein the relative pitch pattern codebook stores a plurality of relative pitch patterns with high appearance frequencies as vectors to be matched with the pitch periods of the subframes as vectors to express the pitch periods of the subframes by optimal relative pitch patterns.
- 8. A speech encoding apparatus comprising: a division unit configured to divide an input speech signal into a plurality of frames each having a predetermined length and to divide a speech signal of each of the frames into a plurality of subframes; a prediction unit configured to obtain a predictive pitch period of a subframe in a to-be-encoded current frame by using pitch periods of at least two of the current frame and past and future frames with respect to the current frame; a pitch period unit configured to obtain a pitch period of a subframe in the current frame by using the predictive pitch period; a relative pitch pattern codebook which stores a plurality of relative pitch patterns representing fluctuations in pitch periods of a plurality of subframes; and a pitch period change unit configured to express a change in pitch period of subframes with one relative pitch pattern selected from the relative pitch pattern codebook.
- 9. An apparatus according to claim 8, further comprising an encoder which encodes the pitch period of the subframe in the current frame.
- 10. An apparatus according to claim 8, further comprising a pitch filter to suppress or emphasize a pitch period component of an input speech signal, and a determination unit configured to determine a transfer function for the pitch filter by using the pitch period of the subframe in the current frame.
- 11. An apparatus according to claim 8, wherein the pitch period unit comprises a unit configured to adaptively decide a pitch period analysis position for each frame.
- 12. An apparatus according to claim 8, further comprising a selector which selects a manner of obtaining a pitch period of a subframe in the current frame in accordance with continuance of pitch periods.
- 13. An apparatus according to claim 8, wherein the relative pitch pattern codebook stores a plurality of relative pitch patterns with high appearance frequencies as vectors to be matched with the pitch periods of the subframe as vectors to express the pitch periods of the subframes by optimal relative pitch patterns.
- 14. A speech encoding program stored in a computer readable medium comprising: means for instructing a computer to divide an input speech signal into a plurality of frames each having a predetermined length; means for instructing the computer to divide a speech signal of each of the frames into a plurality of subframes; means for instructing the computer to obtain a predictive pitch period of a subframe in a to-be-encoded current frame by using pitch periods of at least two frames of the current frame and past and future frames with respect to the current frame; means for instructing the computer to obtain a pitch period of a subframe in the current frame by using the predictive pitch period; means for instructing the computer to prepare a relative pitch pattern codebook storing a plurality of relative pitch patterns representing fluctuations in pitch periods of a plurality of subframes; and means for instructing the computer to express a change in pitch period of subframes with one relative pitch pattern selected from said relative pitch pattern codebook.
- 15. A program according to claim 14, further comprising means for instructing the computer to encode the pitch period of the subframe in the current frame.
- 16. A program according to claim 14, further comprising means for instructing the computer to suppress or emphasize a pitch period component of an input speech signal, and determine a transfer function for said pitch filter by using the pitch period of the subframe in the current frame.
- 17. A program according to claim 14, wherein means for instructing the computer to obtain the pitch period of the frame comprises means for instructing the computer to adaptively decide a pitch period analysis position for each frame.
- 18. A program according to claim 17, wherein means for instructing the computer to decide the pitch period analysis position includes means for instructing the computer to decide based on one of magnitude of a power of the speech signal, a predictive error signal and a short-term power of a predictive error signal obtained through a low-pass filter.
- 19. A program according to claim 14, further comprising means for instructing the computer to select a manner of obtaining a pitch period of a subframe in the current frame in accordance with continuance of pitch periods.
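The steps recited in claims 1, 8 and 14 (predicting a subframe pitch period from neighbouring frames, then expressing the subframe-to-subframe change with one entry of a relative pitch pattern codebook) can be illustrated with a minimal sketch. The claims do not state how the prediction is computed or how a pattern is selected, so everything specific here is an assumption made for illustration: linear interpolation between a past-frame and a future-frame pitch period, patterns stored as offsets from the predictive pitch period, a squared-error search, and the hypothetical names `predict_subframe_pitch`, `select_relative_pattern` and `CODEBOOK` with invented codebook values.

```python
from typing import List, Sequence, Tuple


def predict_subframe_pitch(prev_pitch: float, next_pitch: float,
                           n_subframes: int) -> List[float]:
    """Predictive pitch period per subframe.

    Assumption: linear interpolation between the pitch period of a past
    frame and that of a future frame; the claims only require that at
    least two frame pitch periods are used.
    """
    step = next_pitch - prev_pitch
    return [prev_pitch + step * (i + 1) / n_subframes
            for i in range(n_subframes)]


def select_relative_pattern(pitches: Sequence[float],
                            predicted: Sequence[float],
                            codebook: Sequence[Sequence[float]]
                            ) -> Tuple[int, float]:
    """Pick the codebook pattern closest to the observed fluctuation.

    Assumption: a pattern is a vector of per-subframe offsets from the
    predictive pitch period, matched by minimum squared error.
    Returns (pattern index, squared error).
    """
    target = [p - q for p, q in zip(pitches, predicted)]
    best_index, best_error = -1, float("inf")
    for index, pattern in enumerate(codebook):
        error = sum((t - c) ** 2 for t, c in zip(target, pattern))
        if error < best_error:
            best_index, best_error = index, error
    return best_index, best_error


# Hypothetical four-entry codebook for four subframes (values invented).
CODEBOOK = [
    (0.0, 0.0, 0.0, 0.0),
    (-1.0, -1.0, 0.0, 0.0),
    (1.0, 1.0, 0.0, 0.0),
    (-2.0, -1.0, 1.0, 2.0),
]

predicted = predict_subframe_pitch(40.0, 48.0, 4)
measured = [43.0, 45.0, 46.0, 48.0]  # per-subframe pitch periods, in samples
index, error = select_relative_pattern(measured, predicted, CODEBOOK)
print(predicted, index, error)
```

In this sketch only the selected index would need to be transmitted for the four subframes, which is the bit-saving idea behind expressing the change in pitch period with a single pattern.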
Priority Claims (2)
Number | Date | Country | Kind
9-010326 | Jan 1997 | JP |
9-155552 | Jun 1997 | JP |
Parent Case Info
Divisional of prior application Ser. No. 09/012,792 filed Jan. 23, 1998, now U.S. Pat. No. 6,202,046.
US Referenced Citations (12)
Foreign Referenced Citations (13)
Number | Date | Country
59-170894 | Sep 1984 | JP
61-25200 | Feb 1986 | JP
62-238599 | Oct 1987 | JP
1-502779 | Sep 1989 | JP
6-27996 | Feb 1994 | JP
6-161494 | Jun 1994 | JP
7-181991 | Jul 1995 | JP
8-202398 | Aug 1996 | JP
8-305398 | Nov 1996 | JP
8-328588 | Dec 1996 | JP
9-6397 | Jan 1997 | JP
9-34499 | Feb 1997 | JP
9-504124 | Apr 1997 | JP
Non-Patent Literature Citations (2)
Entry
Mei Yong, et al., “Efficient Encoding of the Long-Term Predictor in Vector Excitation Coders”, Advances in Speech Coding, Kluwer Academic Publishers, (1991), pp. 329-338.
D.K. Freeman, et al., “The Voice Activity Detector for the Pan-European Digital Cellular Mobile Telephone Service”, IEEE, Proc. ICASSP89, (1989), pp. 369-372.