Claims
- 1. A speech synthesis method for use in a text-to-speech system, said method comprising:outputting a plurality of input speech segments Si and a plurality of training speech elements Ti; generating a plurality of synthesized speech segments Gij, where i and j are positive integers, by changing at least one of a pitch period and a time duration of each of the input speech segments Sj to be equal to at least one of a pitch period and time duration of each of the training speech segments Ti; evaluating a distortion eij of each of the synthesized speech segments Gij based on a distance between each of the synthesized speech segments Gij and each of the training speech segments Ti; selecting a plurality of synthesis units Dk indicating a minimum evaluation from the input speech segments Sj according to the distortion eij; and generating a synthesis speech by selecting predetermined synthesis units from the synthesis units Dk on input text information and connecting the predetermined synthesis units to one another to generate the synthesis speech.
- 2. The speech synthesis method according to claim 1, wherein said synthesized speech segment generation step includes a step of spectrum-shaping the synthesized speech segments, andsaid synthesis speech generation step includes a step of spectrum-shaping the synthesis speech to generate a final synthesis speech.
- 3. The speech synthesis method according to claim 1, wherein said synthesis unit selection step includes a step of storing, as said synthesis units, speech source signals and information on combinations of coefficients of a synthesis filter for receiving said speech source signals and generating a synthesis speech signal.
- 4. The speech synthesis method according to claim 3, wherein the synthesis unit selection step includes a step of quantizing the speech source signals and the coefficients of the synthesis filter, anda step of storing, as the synthesis units, the quantized speech source signals and information on combinations of the coefficients of the synthesis filter.
- 5. The speech synthesis method according to claim 1, wherein the synthesis unit selection step includes a step of combining speech source signals and filter coefficients of a synthesis filter for receiving the speech source signals to generate a synthesis speech signal, anda step of setting at least one of the number of the speech source signals and the number of the filter coefficients of the synthesis filter to be less than the total number of speech synthesis units.
- 6. A speech synthesis apparatus for use in a text-to-speech system, said apparatus comprising:an output device configured to output a plurality of input speech segments Sj and a plurality of training speech elements Ti; a generator configured to generate a plurality of synthesized speech segments Gij; where i and j are positive integers, by changing at least one of a pitch period and a time duration of each of the input speech segments Sj so as to be equal to at least one of a pitch period and a time duration of each of the training speech segments Ti; an evaluation unit configured to evaluate a distortion eij of each of the synthesized speech segments Gij based on a distance between each of the synthesized speech segments Gij and each of the training speech segments Ti; a synthesis unit selection section configured to select a plurality of synthesis units Dk indicating a minimum evaluation from the input speech segments Si based on the distortion eij; and a speech synthesis section configured to generate a synthesis speech by selecting predetermined synthesis units from the synthesis units Dk based on input text information and connecting the predetermined synthesis units to one another to generate the synthesis speech.
- 7. The speech synthesis apparatus according to claim 6, wherein said synthesis unit selection section includes a shaping section which spectrum shapes the synthesis speech segments and a selector section which selects a plurality of synthesis units from said second speech segments on the basis of the distance between said spectrum-shaped synthesis speech segments and said first speech segments, andsaid synthesis speech generation section includes a shaping section which spectrum shapes the synthesis speech to generate a final synthesis speech.
- 8. The speech synthesis apparatus according to clam 6, wherein said synthesis unit selection section includes a storage section which stores, as said synthesis units, speech source signals and information on combinations of coefficients of a synthesis filter for receiving said speech source signals and generates a synthesis speech signal.
- 9. The speech synthesis apparatus according to claim 8, wherein the synthesis unit selection section includes a quantization section which quantizes the speech source signals and the coefficients of the synthesis filter, and stores, as the synthesis units, the quantized speech source signals and information on combinations of the coefficients of the synthesis filter.
- 10. The speech synthesis apparatus according to claim 6, wherein the synthesis unit selection section includes a storage section which stores, as the synthesis units, speech source signals and information on combinations of coefficients of a synthesis filter configured to receive the speech source signals and generate a synthesis speech signal, andat least one of the number of the speech source signals stored as the synthesis units and the number of the coefficients of the synthesis filter stored as the synthesis units being less than the total number of speech synthesis units.
Priority Claims (5)
| Number |
Date |
Country |
Kind |
| 7-315431 |
Dec 1995 |
JP |
|
| 8-054714 |
Mar 1996 |
JP |
|
| 8-068785 |
Mar 1996 |
JP |
|
| 8-077393 |
Mar 1996 |
JP |
|
| 8-250150 |
Sep 1996 |
JP |
|
Parent Case Info
This application is a continuation of 08/758,772 filed Dec. 3, 1996, now U.S. Pat. No. 6,240,384 issued May 29, 2001.
US Referenced Citations (16)
Foreign Referenced Citations (4)
| Number |
Date |
Country |
| 58-88798 |
May 1983 |
JP |
| 1-304499 |
Dec 1989 |
JP |
| 6-175675 |
Jun 1994 |
JP |
| 07-152787 |
Jun 1995 |
JP |
Continuations (1)
|
Number |
Date |
Country |
| Parent |
08/758772 |
Dec 1996 |
US |
| Child |
09/722047 |
|
US |