Claims
- 1. A method for developing a measure of non-stationarity of an input speech signal comprising the steps of:dividing said input signal into intervals; evaluating a measure of variability of a selected attribute of said input signal in each of said intervals; from said measure of variability, developing an analog measure of non-stationarity of said input signal for every one of said intervals.
- 2. The method of claim 1 where said intervals are uniform, with a length that is on the order of 30 msec.
- 3. The method of claim 1 where said step of developing an analog measure of non-stationarity of said input signal for each of said intervals develops a measure that is bounded by 0 and 1.
- 4. The method of claim 1 where said step of evaluating a measure of variability considers a time-domain characteristic of said input signal.
- 5. The method of claim 1 where said step of evaluating a measure of variability evaluates the RMS value of each interval of said input signal, En, in accordance with the relationship En=1N+1∑m=-N/2N/2 x2(n+m),where x represents a sample of said input signal in said interval, and N+1 is the number of such samples in said interval,developing a measure of non-stationarity of said input signal by evaluating the quotient &LeftBracketingBar;En-En-1&RightBracketingBar;En+En-1 each of said intervals.
- 6. The method of claim 1 where said step of evaluating a measure of variability considers a frequency-domain characteristic of said input signal.
- 7. The method of claim 1 where said step of evaluating a measure of variability evaluates 21+ⅇ-β1s(n)-1,where β1 is a preselected constant and s(n) is a spectral transition rate in interval n of a selected number of spectral lines of said input signal.
- 8. The method of claim 7 where said s(n) signal is developed in accordance with the relationship s(n)=∑i=1P (ci(n))2,where ci(n)=∑m=-MM myi(n+m)∑m=-MM m2,and yi is the ith spectral line.
- 9. The method of claim 1 where said step of evaluating a measure of variability considers a time domain and a frequency-domain characteristic of said input signal.
- 10. The method of claim 9 where said step of evaluating a measure of variability evaluates 21+ⅇ-β2s(n)-α Cn1-1,where β2 is a preselected constant, α is another preselected constant, s(n) is a spectral transition rate in interval n of a selected number of spectral lines of said input signal, and Cn1=&LeftBracketingBar;En-En-1&RightBracketingBar;En+En-1where En is the RMS value of said input signal within a time interval n, and En−1 is the RMS value of the speech signal within a time interval (n−1).
- 11. A method for modifying a speech signal comprising the steps of:dividing said speech signal into uniform time intervals, for every interval, computing an analog stationarity measure, ƒ(n), that is related to energy of said signal within said interval, and modifying said signal within said interval by a factor that is based on said measure.
- 12. The method of claim 11 where said measure has a range that approximately spans the interval 0 to 1.
- 13. The method of claim 11 where f(n)=&LeftBracketingBar;En-En-1&RightBracketingBar;En+En-1,En is the a root mean squared value of the speech signal within time interval n, and En−1 is a root mean squared value of the speech signal within time interval (n−1).
- 14. The method of claim 13 where En=1N+1∑m=-N/2N/2 x2(n+m),where x(n) is the speech signal over an interval of N+1 samples.
- 15. The method of claim 11 where said time intervals do not overlap.
- 16. The method of claim 11 where said time intervals overlap by a preselected amount.
- 17. The method of claim 11 where said measure is related to a root mean square measure of said signal in said interval.
- 18. The method of claim 11 where said factor, β, is β=1+[1−ƒ(n)]b, where b is a preselected constant.
- 19. The method of claim 11 where said modifying is time scaling of said signal in said time interval.
- 20. A method for modifying a speech signal comprising the steps of:dividing said signal into time intervals, for every interval, n, computing an analog stationarity measure, f(n), that is related to spectral parameters of said signal within said interval, and modifying said signal within said interval by a scaling factor that is based on said measure.
- 21. The method of claim 20 where said modifying is time scaling of said signal in said time interval.
- 22. The method of claim 20 where said spectral parameters measure corresponds to spectral feature transition rate.
- 23. The method of claim 20 where said spectral parameters measure is related to s(n)=∑i=1P ci(n)2,where ci(n)=∑m=-MM myi(n+m)∑m=-MM m2,yi is an ith spectral parameter about a time window [n−M, n+M].
- 24. The method of claim 23 where said scaling factor is 21+ⅇ-β1s(n)-1,where β1 is a preselected weight factor.
- 25. The method of claim 23 where said scaling factor is 21+ⅇ-β2s(n)-α Cn1-1,where β2 and α are preselected constants, Cn1=&LeftBracketingBar;En-En-1&RightBracketingBar;En+En-1,En is the a root mean squared value of the speech signal within time interval n, and En−1 is a root mean squared value of the speech signal within time interval (n−1).
RELATED APPLICATION
This application is related to an application, filed on Aug. 18, 1999, as application Ser. No. 09/376455, now U.S. Pat. No. 6,324,501, titled “Signal Dependent Speech Modifications”.
US Referenced Citations (8)
Number |
Name |
Date |
Kind |
4720862 |
Nakata et al. |
Jan 1988 |
A |
4802224 |
Shiraki et al. |
Jan 1989 |
A |
5596676 |
Swaminathan et al. |
Jan 1997 |
A |
5734789 |
Swaminathan et al. |
Mar 1998 |
A |
5799276 |
Komissarchik et al. |
Aug 1998 |
A |
5926788 |
Nishiguchi |
Jul 1999 |
A |
6101463 |
Lee et al. |
Aug 2000 |
A |
6240381 |
Newson |
May 2001 |
B1 |
Non-Patent Literature Citations (2)
Entry |
Nandasena, “Spectral Stability Based Event Localizing Temporal Decomposition”, Proceedings of IEEE Int. Conf. Acoust., Speech, Signal Processing, vol. 2, pp. 957-960, 1998. |
Verhelst et al, “An Overlap-add Technique Based on Waverform Similarity (WSOLA) for High Quality Time-Scale Modification of Speech”, Proc. IEEE ICASSP-93, pp. 554-557, 1993. |