Claims
- 1. For use in a speech processing system having means for computing a plurality of temporal speech parameters including short-term parameters having time trajectories, a method for alleviating harmful effects of distortions of speech, the method comprising:
- performing a non-linear operation on a function of the short-term parameters of speech, the function being substantially linear for small values of the parameters and substantially logarithmic for large values of the parameters; and
- filtering data representing time trajectories of the short-term parameters of speech in a particular spectral domain to obtain a filtered spectrum and to minimize distortions due to convolutive noise and additive noise in speech.
- 2. The method of claim 1 wherein the particular spectral domain is an intermediate domain, between a time domain and a logarithmic power spectral domain, in which convolutive noise and additive noise in speech are transformed to error that is substantially additive in the filtered spectrum.
- 3. The method of claim 1 wherein the short-term parameters of speech are spectral parameters.
- 4. The method of claim 3 wherein the spectral parameters are parameters of an auditory spectrum.
- 5. The method of claim 1 wherein the step of filtering includes the step of bandpass filtering to simultaneously smooth the data and remove any influences due to slow variations in the parameters.
- 6. The method of claim 1 wherein the non-linear operation is an operation described by:
- y=ln(1+Jx),
- wherein x represents a critical-band spectrum and J represents a constant over a period of time during which a noise level remains relatively constant.
- 7. The method of claim 1 further comprising taking an inverse non-linear transformation of the filtered spectrum.
- 8. The method of claim 7 wherein the inverse non-linear transformation is an inexact transformation which ensures that after the inverse transformation, all spectral values remain non-negative, the inexact transformation described by: ##EQU10## wherein y represents the result of the non-linear operation performed on the function of the short-term parameters of speech.
- 9. The method of claim 8 further comprising the step of
- approximating the filtered spectrum by a spectrum of an autoregressive model using an autocorrelation method of linear predictive analysis.
- 10. For use in a speech processing system having means for computing a plurality of temporal speech parameters including short-term parameters having time trajectories, the system being useful for alleviating harmful effects of steady-state distortions of speech, the system comprising:
- means for performing a non-linear operation on a function of the short-term parameters of speech, the function being substantially linear for small values of an amplitude and substantially logarithmic for large values of the amplitude; and
- means for filtering the time trajectories of the short-term parameters of speech in a particular spectral domain to obtain a temporal pattern in which distortions due to convolutive noise and additive noise in speech are minimized.
- 11. The system of claim 10 wherein the particular spectral domain is an intermediate domain, between a time domain and a logarithmic power spectral domain, in which convolutive noise and additive noise in speech are transformed to error that is substantially additive in the filtered spectrum.
- 12. The system of claim 10 wherein the short-term parameters are spectral parameters.
- 13. The system of claim 12 wherein the spectral parameters are parameters of an auditory spectrum.
- 14. The system of claim 10 wherein the means for filtering is a bandpass filter for simultaneously smoothing the data and removing the influence of slow variations in the parameters.
- 15. The system of claim 10 wherein the means for performing a non-linear operation includes
- means for performing an operation described by:
- y=ln(1+Jx),
- wherein x represents a critical-band spectrum and J represents a constant over a period of time during which a noise level remains relatively constant.
- 16. The system of claim 10 further comprising means for taking an inverse non-linear transformation of the filtered spectrum.
- 17. The system of claim 16 wherein the means for taking an inverse non-linear transformation includes means for taking an inexact transformation described by: ##EQU11## wherein y represents the result of the non-linear operation performed on the function of the short-term parameters of speech.
- 18. The system of claim 10 further comprising means for approximating the filtered spectrum by a spectrum of an autoregressive model using an autocorrelation method of linear predictive analysis.
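The processing chain recited in claims 1, 5, 6 and 7 can be illustrated with a minimal sketch: compress each critical-band value with y=ln(1+Jx), bandpass filter the time trajectory of the compressed values, then map back to the spectral domain. Several details here are assumptions and are not taken from the claims: the function names, the filter coefficients (a slope-like FIR numerator followed by a leaky integrator, chosen only to give zero gain for constant inputs), and the form of the inexact inverse, whose equation is not reproduced in the text above. The sketch assumes exp(y)/J, which is always positive.

```python
import math


def compress(x, J):
    """Non-linear operation y = ln(1 + J*x).

    Approximately linear (y ~ J*x) when J*x is small and
    approximately logarithmic (y ~ ln(J*x)) when J*x is large.
    """
    return math.log1p(J * x)


def exact_inverse(y, J):
    """Exact inverse of compress(); can go negative when y < 0."""
    return math.expm1(y) / J


def inexact_inverse(y, J):
    """ASSUMED inexact inverse exp(y)/J (hypothetical; the claimed
    equation is not reproduced in the source). Always positive."""
    return math.exp(y) / J


def bandpass(trajectory, num=(0.2, 0.1, 0.0, -0.1, -0.2), pole=0.98):
    """Illustrative bandpass filter for one time trajectory.

    The numerator taps sum to zero, so constant (slowly varying)
    components are suppressed; the single pole smooths the result.
    Coefficients are assumptions, not values from the claims.
    """
    out = []
    prev = 0.0
    for n in range(len(trajectory)):
        # FIR part: weighted sum of the current and four past samples.
        fir = sum(c * trajectory[n - k] for k, c in enumerate(num) if n - k >= 0)
        # IIR part: leaky integration of the FIR output.
        prev = fir + pole * prev
        out.append(prev)
    return out


def process_band(band_values, J):
    """Compress, filter, and invert the trajectory of one critical band."""
    compressed = [compress(x, J) for x in band_values]
    filtered = bandpass(compressed)
    return [inexact_inverse(y, J) for y in filtered]
```

In this sketch the filtered values can be negative, which is why the exact inverse is not used in `process_band`: `exact_inverse` would then return a negative spectral value, whereas the assumed inexact form stays positive.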
CROSS-REFERENCE TO RELATED APPLICATION
This application is a continuation-in-part of U.S. patent application Ser. No. 747,181, filed Aug. 19, 1991, now U.S. Pat. No. 5,450,522, titled "Auditory Model For Parametrization of Speech", which is hereby expressly incorporated by reference in its entirety.