Claims
- 1. A method of selecting acoustic units from an acoustic unit database for synthesizing speech, a concatenation cost being a measure of the mismatch between an acoustic unit sequential pair, the method comprising:
selecting one or more acoustic units from the acoustic unit database; determining whether a concatenation cost of an acoustic unit sequential pair resides in a concatenation cost database; extracting the concatenation cost of the acoustic unit sequential pair from the concatenation cost database if the concatenation cost database contains the concatenation cost of the acoustic unit sequential pair; and determining a value to the concatenation cost of the acoustic unit sequential pair if the concatenation cost database does not contain the concatenation cost of the acoustic unit sequential pair.
- 2. The method according to claim 1, further comprising synthesizing the one or more acoustic units to produce synthetic speech.
- 3. The method according to claim 1, wherein forming the concatenation cost database uses a training set of data.
- 4. The method according to claim 1, wherein forming the concatenation cost database is based on of at least one concatenation cost.
- 5. The method according to claim 1, wherein selecting at least one acoustic unit from the acoustic unit database further uses at least one target cost of an acoustic unit, the target cost being a measure of the mismatch between the acoustic unit and a phoneme.
- 6. The method according to claim 1, wherein determining a value for the concatenation cost of the acoustic unit sequential pair includes assigning a default value.
- 7. The method according to claim 1, wherein determining a value of the concatenation cost of the acoustic unit sequential pair includes computing the concatenation cost of the acoustic unit sequential pair.
- 8. The method according to claim 1, wherein the default concatenation cost value is large enough to eliminate selection of an acoustic unit sequential pair under any reasonable pruning, but does not disallow the acoustic unit sequential pair selection entirely.
- 9. The method according to claim 1, wherein selecting at least one acoustic unit from the acoustic unit database further uses a hash table.
- 10. The method according to claim 1, further comprising:
forming a concatenation cost database, wherein the concatenation cost database comprises a selected subset of concatenation costs of possible acoustic unit sequential pairs of the acoustic unit database.
- 11. An apparatus for selecting acoustic units, comprising:
an acoustic unit database containing at least two acoustic units; a concatenation cost database containing concatenation costs of acoustic unit sequential pairs, a concatenation cost being a measure of the mismatch between an acoustic unit sequential pair, wherein the concatenation cost database comprises a selected subset of concatenation costs of all possible acoustic unit sequential pairs of the acoustic unit database; and a selecting device that selects acoustic units using the concatenation cost database, wherein the selecting device includes
a first determining portion that determines whether a concatenation cost of an acoustic unit sequential pair resides in the concatenation cost database; an extracting portion that extracts the concatenation cost of the acoustic unit sequential pair from the concatenation cost database if the concatenation cost database contains the concatenation cost of the acoustic unit sequential pair; and a second determining portion that determines a value to the concatenation cost of the acoustic unit sequential pair if the concatenation cost database does not contain the concatenation cost of the acoustic unit sequential pair.
- 12. The apparatus of claim 11, further comprising a synthesizer that synthesizes acoustic units to form synthetic speech.
- 13. The apparatus of claim 11, wherein the concatenation cost database is formed using a training set of data.
- 14. The apparatus of claim 11, the concatenation cost database is formed based on a value of at least one concatenation cost.
- 15. The apparatus of claim 11, wherein the selecting device further uses a target cost of an acoustic unit, the target cost being a measure of the mismatch between the acoustic unit and a phoneme specification.
- 16. The apparatus of claim 11, wherein the second determining portion is assignment portion that assigns a default value to the concatenation cost of the acoustic unit sequential pair.
- 17. The apparatus of claim 16, wherein the default value is large enough to eliminate selection of an acoustic unit sequential pair under any reasonable pruning, but does not disallow the acoustic unit sequential pair selection entirely.
- 18. The apparatus of claim 11, wherein the second determining portion is a computing portion that computes the concatenation cost of the acoustic unit sequential pair.
- 19. The apparatus of claim 11, wherein the selecting device further uses a hash table.
- 20. A method of forming a computer readable medium containing a concatenation cost database, a concatenation cost being a measure of the mismatch between an acoustic unit sequential pair, the method comprising;
synthesizing a body of speech using a training data set and an acoustic unit database to produce a plurality of synthesized acoustic unit sequential pairs; calculating a concatenation cost for at least one synthesized acoustic unit sequential pair of the plurality of synthesized acoustic unit sequential pairs; storing at least one concatenation cost of the calculated concatenation cost in the concatenation cost database; and determining the concatenation cost for at least one synthesized acoustic unit sequential pair if the calculated concatenation cost is not found in the concatenation cost database.
Parent Case Info
[0001] This nonprovisional application claims the benefit of U.S. provisional application No. 60/131,948 entitled “Rapid Unit Selection From a Large Speech Corpus For Concatenative Speech” filed on Apr. 30, 1999. The Applicants of the provisional application are Mark C. Beutnagel, Mehryar Mohri and Michael Dennis Riley (Attorney Docket No. 1999-0135). The above provisional application is hereby incorporated by reference including all references cited therein.
Provisional Applications (1)
|
Number |
Date |
Country |
|
60131948 |
Apr 1999 |
US |
Continuations (1)
|
Number |
Date |
Country |
Parent |
09557146 |
Apr 2000 |
US |
Child |
10359171 |
Feb 2003 |
US |