Claims
- 1. A method of speech recognition, comprising the steps of:
- predetermining a series of parameter vectors a.sub.1.sup.(k), a.sub.2.sup.(k), . . . , a.sub.J.sup.(k) representative of standard patterns of speeches of preset words (k) respectively, where the letter k denotes a preset-word identifying number variable as k=1, 2, . . . , K, the letter J denotes a predetermined natural number common for all the preset words and the letter K denotes a predetermined natural number representing a vocabulary size;
- deriving parameter vectors x.sub.i representative of data of respective frames of an input signal, wherein the adscript i denotes a frame number;
- calculating similarities d.sub.j.sup.(k) (i) between the parameter vectors a.sub.j.sup.(k) and the parameter vectors x.sub.i by using a formula:
- d.sub.j.sup.(k) (i)=a.sub.j.sup.(k) .multidot.x.sub.i,
- where j=1, 2, . . . , J;
- calculating parameters R.sub.j.sup.(k)(i) for the respective preset words by referring to the following recurrence formulas:
- R.sub.1.sup.(k)(i) =d.sub.1.sup.(k) (i)
- R.sub.j.sup.(k)(i) =d.sub.j.sup.(k) (i)+max (R.sub.j-1.sup.(k)(i-h),R.sub.j-1.sup.(k)(i-h-1),R.sub.j-1.sup.(k)(i-h-2), . . . ,R.sub.j-1.sup.(k)(i-h-m))
- where j=2, 3, . . . , J; the characters h and m denote preset positive integers; and the character max denotes an operator of selecting a maximal member of terms in related parentheses;
- deriving a maximum value R.sub.J.sup.(k)(i) for each k and each i; and
- determining one of the preset words as a recognition result, the determined word corresponding to the maximum value R.sub.J.sup.(k)(i).
- 2. The method of claim 1 further comprising the step of varying the integers h and m in accordance with a variation in the value j during the step of calculating the parameters R.sub.j.sup.(k)(i).
- 3. The method of claim 1 further comprising the step of varying the integers h and m in accordance with the preset words during the step of calculating the parameters R.sub.j.sup.(k)(i).
- 4. A method of speech recognition, comprising the steps of:
- predetermining a series of parameter vectors a.sub.1.sup.(k), a.sub.2.sup.(k), . . . , a.sub.J.sup.(k) representative of standard patterns of speeches of preset words (k) respectively, where the letter k denotes a preset-word identifying number variable as k=1, 2, . . . , K, the letter J denotes a predetermined natural number common for all the preset words and the letter K denotes a predetermined natural number representing a vocabulary size;
- deriving parameter vectors x.sub.i representative of data of respective frames of an input signal, wherein the adscript i denotes a frame number;
- calculating similarities d.sub.j.sup.(k) (i) between the parameter vectors a.sub.j.sup.(k) and the parameter vectors x.sub.i by using a formula:
- d.sub.j.sup.(k) (i)=a.sub.j.sup.(k) .multidot.x.sub.i,
- where j=1, 2, . . . , J;
- calculating parameters R.sub.j.sup.(k)(i) for the respective preset words by referring to the following recurrence formulas:
- R.sub.1.sup.(k)(i) =d.sub.1.sup.(k) (i)
- R.sub.j.sup.(k)(i) =d.sub.j.sup.(k) (i)+max (R.sub.j-1.sup.(k)(i-1),R.sub.j-1.sup.(k)(i-2),R.sub.j-1.sup.(k)(i-3), . . . ,R.sub.j-1.sup.(k)(i-4))
- where j=2, 3, . . . , J; and the character max denotes an operator of selecting a maximal member of terms in related parentheses;
- deriving a maximum value R.sub.J.sup.(k)(i) for each k and each i; and
- determining one of the preset words as a recognition result, the determined word corresponding to the maximum value R.sub.J.sup.(k)(i).
- 5. A method of speech recognition, comprising the steps of:
- predetermining a series of parameter vectors a.sub.1.sup.(k), a.sub.2.sup.(k), . . . , a.sub.J.sup.(k) representative of standard patterns of speeches of preset words (k) respectively, where the letter k denotes a preset-word identifying number variable as k=1, 2, . . . , K, the letter J denotes a predetermined natural number common for all the preset words and the letter K denotes a predetermined natural number representing a vocabulary size;
- deriving parameter vectors x.sub.i representative of data of respective frames of an input signal, wherein the adscript i denotes a frame number;
- calculating similarities d.sub.j.sup.(k) (i) between the parameter vectors a.sub.j.sup.(k) and the parameter vectors x.sub.i by using a formula:
- d.sub.j.sup.(k) (i)=a.sub.j.sup.(k) .multidot.x.sub.i,
- where j=1, 2, . . . , J;
- calculating parameters R.sub.j.sup.(k)(i) for the respective preset words by referring to the following recurrence formulas:
- R.sub.1.sup.(k)(i) =d.sub.1.sup.(k) (i)
- R.sub.j.sup.(k)(i) =d.sub.j.sup.(k) (i)+opt(R.sub.j-1.sup.(k)(i-h),R.sub.j-1.sup.(k)(i-h-1),R.sub.j-1.sup.(k)(i-h-2), . . . ,R.sub.j-1.sup.(k)(i-h-m))
- where j=2, 3, . . . , J; the characters h and m denote preset positive integers; and the character opt denotes an operator of selecting an optimal member of terms in related parentheses;
- deriving an optimal value R.sub.J.sup.(k)(i) for each k and each i; and
- determining one of the preset words as a recognition result, the determined word corresponding to the optimal value R.sub.J.sup.(k)(i).
- 6. The method of claim 5 further comprising the step of varying the integers h and m in accordance with a variation in the value j during the step of calculating the parameters R.sub.j.sup.(k)(i).
- 7. The method of claim 5 further comprising the step of varying the integers h and m in accordance with the preset words during the step of calculating the parameters R.sub.j.sup.(k)(i).
- 8. A method of speech recognition using word spotting, comprising the steps of:
- predetermining a series of parameter vectors a.sub.1.sup.(k), a.sub.2.sup.(k), . . . , a.sub.J.sup.(k) representative of standard patterns of speeches of preset words (k) respectively, where the letter k denotes a preset-word identifying number and is variable in the range k=1, 2, . . . , K, the letter J denotes a predetermined natural number corresponding to a common length of standard pattern for each of the preset words and the letter K denotes a predetermined natural number representing a vocabulary size;
- deriving parameter vectors x.sub.i representative of data of respective frames of an input signal, wherein the adscript i denotes a frame number;
- calculating similarities d.sub.j.sup.(k) (i) between the parameter vectors a.sub.j.sup.(k) and the parameter vectors x.sub.i by using a formula:
- d.sub.j.sup.(k) (i)=a.sub.j.sup.(k) .multidot.x.sub.i,
- where j=1, 2, . . . , J; and
- a matching process based on non-linear expansion and contraction, including:
- calculating parameters R.sub.j.sup.(k)(i) for the respective preset words by referring to the following recurrence formulas:
- R.sub.1.sup.(k)(i) =d.sub.1.sup.(k) (i)
- R.sub.j.sup.(k)(i) =d.sub.j.sup.(k) (i)+max (R.sub.j-1.sup.(k)(i-h),R.sub.j-1.sup.(k)(i-h-1),R.sub.j-1.sup.(k)(i-h-2), . . . ,R.sub.j-1.sup.(k)(i-h-m))
- where j=2, 3, . . . , J; the characters h and m denote preset positive integers; and the character max denotes an operator of selecting a maximal member of terms in related parentheses;
- deriving a maximum value of R.sub.J.sup.(k) (i) for each k and each i; and
- determining one of the preset words as a recognition result, the determined word corresponding to the maximum value of R.sub.J.sup.(k)(i).
- 9. The method of claim 8 wherein said non-linear expansion and contraction comprises the step of varying the integers h and m in accordance with a variation in the value j during the step of calculating the parameters R.sub.j.sup.(k)(i).
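The recurrence recited in claims 1 and 8 can be illustrated with a minimal Python sketch. It is not the patented implementation. The function names (`dot`, `word_scores`, `recognize`), the dictionary layout of the vocabulary, and the default values of h and m are hypothetical. The sketch also assumes that frames before the start of the input are unavailable, so any R value whose window contains no valid frame is set to negative infinity; the claims do not specify this boundary treatment.

```python
from typing import Dict, List, Sequence, Tuple

NEG_INF = float("-inf")


def dot(a: Sequence[float], x: Sequence[float]) -> float:
    """Similarity d_j(i) = a_j . x_i (inner product of two vectors)."""
    return sum(p * q for p, q in zip(a, x))


def word_scores(templates: Sequence[Sequence[float]],
                frames: Sequence[Sequence[float]],
                h: int = 1, m: int = 2) -> List[float]:
    """Return R_J(i) for every frame i, for one preset word.

    templates holds the J standard-pattern vectors a_1..a_J of the word.
    Assumption: windows reaching before frame 0 are simply truncated.
    """
    # R_1(i) = d_1(i)
    prev = [dot(templates[0], x) for x in frames]
    for j in range(1, len(templates)):
        cur = [NEG_INF] * len(frames)
        for i in range(len(frames)):
            # Candidates R_{j-1}(i-h), R_{j-1}(i-h-1), ..., R_{j-1}(i-h-m)
            window = [prev[t] for t in range(max(i - h - m, 0), i - h + 1)]
            if window:
                cur[i] = dot(templates[j], frames[i]) + max(window)
        prev = cur
    return prev


def recognize(vocabulary: Dict[str, Sequence[Sequence[float]]],
              frames: Sequence[Sequence[float]],
              h: int = 1, m: int = 2) -> Tuple[str, int, float]:
    """Pick the word k (and end frame i) with the maximum R_J^(k)(i)."""
    best: Tuple[str, int, float] = ("", -1, NEG_INF)
    for word, templates in vocabulary.items():
        for i, score in enumerate(word_scores(templates, frames, h, m)):
            if score > best[2]:
                best = (word, i, score)
    return best


if __name__ == "__main__":
    # Toy data: two words, J = 2, two-dimensional parameter vectors.
    vocab = {"yes": [[1, 0], [0, 1]], "no": [[0, 1], [1, 0]]}
    frames = [[1, 0], [1, 0], [0, 1], [0, 1]]
    print(recognize(vocab, frames, h=1, m=1))  # ('yes', 2, 2)
```

In this sketch, the fixed window of claim 4 (terms i-1 through i-4) corresponds to calling the functions with h=1 and m=3. Varying h and m with j or with the word, as in claims 2, 3, 6, 7 and 9, would require passing per-stage or per-word values instead of the single pair used here.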
Priority Claims (4)
Number | Date | Country | Kind
62-59407 | Mar 1987 | JPX |
62-59413 | Mar 1987 | JPX |
62-68436 | Mar 1987 | JPX |
62-69344 | Mar 1987 | JPX |
CROSS-REFERENCES TO RELATED APPLICATIONS
This is a continuation-in-part of U.S. patent application Ser. No. 503,080, filed on Apr. 2, 1990, now abandoned, which is a continuation of U.S. patent application Ser. No. 167,794, filed on Mar. 14, 1988, now abandoned.
US Referenced Citations (1)
Number | Name | Date | Kind
3816722 | Sakoe | Jun 1974 |
Non-Patent Literature Citations (1)
Entry |
Kimura, et al., "A Telephone Speech Recognition System etc.," Proc. of ICASSP, Dallas, 1987, pp. 1175-1178. |
Continuations (1)
| Number | Date | Country
Parent | 167794 | Mar 1988 |
Continuation in Parts (1)
| Number | Date | Country
Parent | 503080 | Apr 1990 |