Claims
- 1. A method of modeling the sounds produced by speaking at least first and second portions of speech, said method comprising the steps of:
- uttering at least the first portion of speech N times in a time interval having a series of successive subintervals, where N is an integer greater than or equal to one;
- measuring the value of at least one feature of the utterance of the first portion of speech during each of the series of successive subintervals to produce a series of feature vector signals representing the feature values;
- estimating the expected number of occurrences of the first portion of speech in the time interval as a combination of the values for each subinterval of a first model function of the measured value of the feature of the utterance of the first portion of speech, said first model function having at least a first parameter having an initial value;
- estimating the expected number of occurrences of the second portion of speech in the time interval as a combination of the values for each subinterval of a second model function of the measured value of the feature of the utterance of the first portion of speech, said second model function having at least a second parameter having an initial value;
- estimating the probability of exactly N occurrences of the first portion of speech in the time interval given the estimated expected number of occurrences of the first portion of speech;
- estimating the probability of exactly zero occurrences of the second portion of speech in the time interval given the estimated expected number of occurrences of the second portion of speech;
- calculating revised values of the first and second parameters to improve the value of an objective function comprising a combination of at least the estimated probability of exactly N occurrences of the first portion of speech and the estimated probability of exactly zero occurrences of the second portion of speech;
- modeling the first portion of speech with the first model function with the revised value of the first parameter; and
- modeling the second portion of speech with the second model function with the revised value of the second parameter.
- 2. A method as claimed in claim 1, characterized in that:
- each portion of speech is a word; and
- the revised values of the first and second parameters are calculated to substantially optimize the value of the objective function.
- 3. A method as claimed in claim 2, characterized in that:
- N is equal to one;
- the values of the first model function are combined by arithmetic averaging to estimate the expected number of occurrences of the first portion of speech; and
- the values of the second model function are combined by arithmetic averaging to estimate the expected number of occurrences of the second portion of speech.
- 4. A method as claimed in claim 2, characterized in that the model function of a word W.sub.i has the form
- m.sub.i (t)=e.sup.-d.sbsp.i.spsp.2.sup.(t)+.beta..sbsp.i,
- where ##EQU7## q is the number of acoustic features of the utterance being measured, and .alpha..sub.j,i, .beta., and .mu..sub.j,i are parameters of the model functions.
- 5. A method as claimed in claim 4, characterized in that the probability of exactly n.sub.i occurrences of a word W.sub.i in a time interval T having subintervals .DELTA.t given the estimated expected number of occurrences of the word is estimated from a function of the form ##EQU8##
- 6. A method as claimed in claim 2, characterized in that the objective function comprises the product of at least the estimated probability of exactly N occurrences of the first portion of speech and the estimated probability of exactly zero occurrences of the second portion of speech.
- 7. A method as claimed in claim 2, characterized in that the objective function comprises the sum of the logarithms of at least the estimated probability of exactly N occurrences of the first portion of speech and the estimated probability of exactly zero occurrences of the second portion of speech.
- 8. A method of modeling the sounds produced by speaking at least a first portion of speech, said method comprising the steps of:
- uttering at least the first portion of speech N times in a time interval having a series of successive subintervals, where N is an integer greater than or equal to one;
- measuring the value of at least one feature of the utterance of the first portion of speech during each of the series of successive subintervals to produce a series of feature vector signals representing the feature values;
- estimating the expected number of occurrences of the first portion of speech in the time interval as a combination of the values for each subinterval of a first model function of the measured value of the feature of the utterance of the first portion of speech, said first model function having at least a first parameter having an initial value;
- estimating the probability of exactly N occurrences of the first portion of speech in the time interval given the estimated expected number of occurrences of the first portion of speech;
- calculating a revised value of the first parameter to improve the value of an objective function comprising at least the estimated probability of exactly N occurrences of the first portion of speech; and
- modeling the first portion of speech with the first model function with the revised value of the first parameter.
Parent Case Info
This is a continuation of application Ser. No. 318,042, filed Mar. 2, 1989, now abandoned.
US Referenced Citations (3)
Number |
Name |
Date |
Kind |
4363102 |
Holmgren et al. |
Dec 1982 |
|
4559604 |
Ichikawa et al. |
Dec 1985 |
|
4759068 |
Bahl et al. |
Jul 1988 |
|
Non-Patent Literature Citations (1)
Entry |
E04KDF-NAG Fortran Library Routine Document (pp. 1-18). |
Continuations (1)
|
Number |
Date |
Country |
Parent |
318042 |
Mar 1989 |
|