Claims
- 1. An arrangement for modifying a codebook of vector quantized speech feature signals for speaker verification or adaptation to the voice of a particular speaker comprising:
- means for storing a set of vector quantized feature signals q(i), i=1, 2, . . . , K, where K is the number of vector quantized feature signals in said set, and a set of occupancy signals n(i) each corresponding to one of said vector quantized feature signal q(i);
- means for receiving an input pattern;
- means for analyzing said input pattern to generate a set of input feature vector signals v(t), t=1,2, . . . , M, where M is the number of vector-quantized feature signals in an input pattern;
- means responsive to said set of input feature signals and said set of vector quantized feature signals for classifying each input feature signal v(t) as one of said set if vector quantized feature signals q(i);
- means responsive to the classification of each input feature signal as one of said vector quantized feature signals for generating an occupancy signal m(i) corresponding to the count of input feature signals classified as each one of said vector quantized feature signals q(i); and
- means for modifying said set if vector quantized feature vector signals, q(i), to produce a modified set of vector quantized feature vector signals, q'(i), based on said classified input feature vector signals, said input feature occupancy signals, said vector quantized feature signals, and said occupancy signals of said vector quantized feature signals
- 2. An arrangement for modifying a codebook of vector quantized feature signals according to claim 1 wherein said input pattern comprises an unidentified input pattern.
- 3. An arrangement for modifying a codebook of vector quantized feature signals according to claim 2 wherein said vector quantized feature signal modifying means comprises:
- means responsive to each of said vector quantized feature signal q(i) and to each corresponding one of said vector quantized feature occupancy signals n(i) for forming a weighted vector quantized feature signal;
- means responsive to each of said input feature signals v(t) classified as one of vector quantized feature signals q(i) and a corresponding one of occupancy signals m(i) for forming a weighted input feature vector signal; and
- means responsive to said weighted vector quantized feature signal; and said weighted input feature vector signal for generating one of said modified set of vector quantized feature signals q'(i).
- 4. An arrangement for modifying a codebook of vector quantized feature vector signals according to claim 2, wherein said modifying means comprises means for forming a signal corresponding to
- q'(i)=[n(i)q(i)+p(i)]/[n(i)+m(i)],
- for each value of i, where each n(i)q(i) is a weighted codebook vector quantized feature signal, each n(i) is occupancy signal for each said codebook vector quantized signal q(i), each p(i) is a weighted input feature signal classified as the corresponding one of the vector quantized feature signals q(i), and each m(i) is the occupancy signal for said input feature signal.
- 5. An arrangement for modifying a codebook of vector quantized feature vector signals according to claim 4 further comprising:
- means responsive to said vector quantized feature occupancy signal and said input feature signal occupancy signal for modifying said vector quantized feature occupancy signal.
- 6. An arrangement for modifying a codebook of vector quantized feature vector signals according to claim 5 wherein said vector quantized feature occupancy signal modifying means comprises:
- means for forming signals corresponding to
- n'(i)=n(i)+m(i)-M.sub.tot /K
- where each n'(i) is the modified vector quantized feature occupancy signal, each n(i) is the feature occupancy signal of the codebook ith feature vector signal, each m(i) is the occupancy signal for a corresponding one of the input feature vector signals classified as the ith codebook feature vector signal, M.sub.tot is the total number of input feature vector signals, and K is the total number of the codebook quantized feature vector signals.
- 7. An arrangement for modifying a codebook of vector quantized feature vector signals according to claim 2 wherein said modifying means comprises forming a signal corresponding to
- q'(i)=(.lambda.)q(i)+(1-.lambda.)p(i)/m(i)
- for each value of i, where each p(i) is the weighted input feature signal classified as the corresponding one of the vector quantized feature signals q(i) and each m(i) is the occupancy signal for said input feature signal, m(i)<<n(i), and .lambda. is a relative weighting factor which attributes substantially greater significance to n(i) than to m(i).
- 8. An arrangement for modifying a codebook of quantized feature vector signals according to claim 7 further comprising:
- means for modifying said vector quantized feature occupancy signals responsive to said vector quantized feature occupancy signals and said input feature signal occupancy signals.
- 9. An arrangement for modifying a codebook of vector quantized feature signals according to claim 8 wherein said vector quantized feature occupancy signal modifying means comprises:
- means for forming signals corresponding to
- n'(i)=.lambda.n(i)+(1-.lambda.)m(i)
- for each value of is.
- 10. An arrangement for modifying a codebook of vector quantized feature signals according to claim 1, 3, 4, 5, 6, 7, 8 or 2 wherein each feature signal is a speech feature signal and each input pattern is a speech signal.
- 11. In a signal processing arrangement having a stored codebook of a set of vector quantized speech feature signals q(i) i=1, 2, . . . , K, where K is the number of vector quantized feature signals in said set, and occupancy signals n(i) each corresponding to one of said vector quantized feature signals, a method for modifying the codebook of vector quantized feature signals for speaker verification or adaptation to the voice of a particular speaker comprising the steps of:
- receiving an input pattern;
- analyzing said input pattern to generate a set of input feature vector signals v(t), t=1, 2, . . . , M, where M is the number of vector-quantized feature signals in an input pattern;
- classifying each input feature vector signal v(t) as one of said set of vector quantized feature signals q(i) responsive to said input feature signals and said set of vector quantized feature signals;
- generating an occupancy signal m(i) corresponding to the count input feature vector signals classified as respective ones of said vector quantized feature signals q(i) responsive to the classifying step; and
- modifying said set of vector quantized feature vector signals, q(i), based on said classified input feature vector signals, said input feature occupancy signals, said vector quantized feature signals and said occupancy signals of said vector quantized feature signals.
- 12. In a signal processing arrangement having a stored codebook of vector quantized feature signals q(i) i=1, 2, . . . , K and a set of occupancy signals n(i) each corresponding to one of said vector quantized feature signals, a method for modifying the codebook of vector quantized feature signals according to claim 11 wherein said input pattern comprises an unidentified input pattern.
- 13. In a processing arrangement having a stored codebook of vector quantized feature signals q(i)i=1, 2, . . . , K and a set of occupancy signals n(i) each corresponding to one of said vector quantized feature signals, a method for modifying the codebook of vector quantized feature signals according to claim 12 wherein said vector quantized feature signal modifying step comprises:
- forming a weighted vector quantized feature signal responsive to each of said vector quantized feature signals q(i) and each of said vector quantized feature occupancy signals n(i);
- forming a weighted input feature signal responsive to each of said input feature signals n(t) classified as vector quantized feature signals q(i); and
- generating a set of modified vector quantized signals q'(i) responsive to said weighted vector quantized feature signal and said weighted input feature signal for each of said vector quantized feature signals q(i).
- 14. In a signal processing arrangement having a stored codebook of a set of vector quantized feature signals q(i) i=1, 2, . . . , K and a set of occupancy signals n(i) each corresponding to one of said vector quantized feature signals, a method for modifying the codebook of vector quantized feature signals according to claim 12 wherein said generating step comprises means for forming a signal corresponding to
- q'(i)=[n(i )q(i)+p(i)]/[n(i)+m(i)],
- for each value of i,
- where n(i)q(i) is the set of weighted codebook vector quantized feature signals, p(i) is the set of weighted input feature signals each classified as one of the vector quantized feature signals q(i), and each m(i) is the occupancy signal for said input feature signal.
- 15. In a signal processing arrangement having a stored codebook of a set of vector quantized feature signals q(i) i=1, 2, . . . , K and a set of occupancy signals n(i) each corresponding to one of said vector quantized feature signals, a method for modifying the codebook of vector quantized feature signals according to claim 14 further comprising the step of:
- modifying each of said vector quantized feature occupancy signals responsive to each of said vector quantized feature occupancy signals and each of said input feature signal occupancy signals.
- 16. In a signal processing arrangement having a stored codebook of a set of vector quantized feature signals q(i) i=1, 2. . . , K and a set of occupancy signals n(i) each corresponding to one of said vector quantized feature signals, a method for modifying the codebook of vector quantized feature signals according to claim 15 wherein said vector quantized feature occupancy signal modifying step comprises:
- forming a set of signals corresponding to
- n'(i)=n(i)+m(i)-M.sub.tot /K
- where n'(i) is the the set of modified vector quantized feature occupancy signals, n(i) is the feature occupancy signals of the codebook ith feature vector signals, m(i) is the set of occupancy signals for input pattern signals classified as respective ith codebook feature, M.sub.tot is the total number of input feature signals, and K is the total number of the codebook quantized feature signals.
- 17. In a signal processing arrangement having a stored codebook of a set of vector quantized feature signals q(i) i=1, 2, . . . , K and a set of occupancy signals n(i) corresponding each to one of said vector quantized feature signals, a method for modifying the codebook of vector quantized feature signals according to claim 12 wherein said generating comprises forming a signal corresponding to
- q'(i)=(.lambda.)q(i)+(1-.lambda.)p(i)/m(i)
- for each value of i, where each p(i) is the weighted input feature signal classified as one of the vector quantized feature signals q(i), each m(i) is the occupancy signal for said input feature signal, m(i)<<n(i), and .lambda. a relative weighting factor which attributes substantially greater significance to n(i) than to m(i).
- 18. In a signal processing arrangement having a stored codebook of a set of vector quantized feature signals q(i) i=1, 2, . . . , K and a set of occupancy signals n(i) each corresponding to one of said vector quantized feature signals, a method for modifying the codebook of vector quantized feature signals according to claim 17 further comprising the step of:
- modifying each said vector quantized feature occupancy signal responsive to each said vector quantized feature occupancy signal and each said input feature signal occupancy signal.
- 19. In a signal processing arrangement having a stored codebook of a set of vector quantized feature signals q(i) i=1, 2, . . . , K and a set of occupancy signals n(i) each corresponding to one of said vector quantized feature signals, a method for modifying the codebook of vector quantized feature signals according to claim 18 wherein said vector quantized feature occupancy signal modifying step comprises:
- forming a set of signals corresponding to
- n'(i)=.lambda.n(i)+(1-.lambda.)m(i)
- where n'(i) is the set of modified vector quantized feature occupancy signals, n(i) is the set of feature occupancy signals of the codebook ith feature signals, m(i) is the set of occupancy signals for input feature signals classified as the ith codebook feature signals, and .lambda. is n(i)/(n(i)+m(i)), for each value of i.
- 20. An arrangement for modifying a codebook of vector quantized feature signals according to claims 11, 13, 14, 15, 16, 17, 18, 19 or 12 wherein each feature signal is a speech feature signal and each input pattern is a speech signal.
- 21. In a signal processing arrangement having a stored codebook of a set of vector quantized feature signals q(i) i=1, 2, . . . K, where K is the number of vector quantized feature signals in said set, and occupancy signals n(i) each corresponding to one of said vector quantized feature signals, a method for modifying the codebook of vector quantized feature signals, the method comprising the steps of:
- receiving an input pattern;
- analyzing said input pattern to generate a set of input feature vector signals v(t), t=1, 2, . . . , M, where M is the number of vector-quantized feature signals in an input pattern;
- classifying each input feature vector signal v(t) as one of said set of vector quantized feature signals q(i) responsive to said input feature signals and said set of vector quantized feature signals;
- generating an occupancy signal m(i) corresponding to the count input feature vector signals classified as respective ones of said vector quantized feature signals q(i) responsive to the classifying step; and
- modifying said set of vector quantized feature vector signals, q(i), based on said classified input feature vector signals, said input feature occupancy signals, said vector quantized feature signals and said occupancy signals of said vector quantized feature signals.
Parent Case Info
This application is a continuation of application Ser. No. 08/008,269, filed on Jan. 25, 1993, now abandoned, which is a continuation of application Ser. No. 07/900,801, filed on Dec. 27, 1990, now abandoned, which is a continuation of application Ser. No. 07/462,847 filed on Jan. 3, 1990, now abandoned, which was a continuation of application Ser. No. 06/845,501 filed on Mar. 28, 1986, now abandoned.
US Referenced Citations (9)
Non-Patent Literature Citations (7)
Entry |
Gray, "Vector Quantization", IEEE ASSP Magazine, vol. 1, No. 2, Apr. 1984, pp. 4-29. |
Gersho et al., "Vector Quantization: A Pattern-Matching Technique For Speech Coding", IEEE Comm. Magazine, Dec. 1983, pp. 15-21. |
Juang et al., "Distortion Performance of Vector Quantization LPC Voice Coding", IEEE Trans. ASSP, vol. ASSP-30, No. 2, Apr. 1982, pp. 294-303. |
Casey et al., "Advances in Pattern Recognition", Scientific American, Apr. 1971, pp. 56-71. |
Proceedings of the IEEE, vol. 73, No. 11, Nov. 1985, "Vector Quantization in Speech Coding", pp. 1551-1588. |
ICASSP 83, Boston, "An 800 BPS Adaptive Vector Quantization Vocoder Using A Perceptual Distance Measure", D. B. Paul, pp. 73-76. |
Patent No. SHO 60[1985]-60696, issue in Japan on Sep. 13, 1983 to Y. Sato, N. Fujimoto. |
Continuations (4)
|
Number |
Date |
Country |
Parent |
8269 |
Jan 1993 |
|
Parent |
900801 |
Dec 1990 |
|
Parent |
462847 |
Jan 1990 |
|
Parent |
845501 |
Mar 1986 |
|