Claims
- 1. A speech enhancement system, comprising:
a noise adaptation module receiving noisy speech, the noisy speech being characterized by spectral coefficients spanning a plurality of frequency bins and containing an original noise, the noise adaptation module segmenting the noisy speech into noise-only frames and signal-containing frames, and the noise adaptation module determining a noise estimate and a probability of signal absence in each frequency bin; a signal-to-noise ratio estimator coupled to the noise adaptation module, the signal-to-noise ratio estimator determining a first signal-to-noise ratio and a second signal-to-noise ratio based on the noise estimate; and a core estimator coupled to the signal-to-noise ratio estimator and receiving the noisy speech, the core estimator applying to the spectral coefficients of the noisy speech a first set of gains in the frequency domain without discarding the noise-only frames to produce speech that contains a residual noise, wherein the first set of gains is determined based, at least in part, on the second signal-to-noise ratio and a level of aggression, and wherein the core estimator is operative to maintain the spectral density of the spectral coefficients of the residual noise below a proportion of the spectral density of the spectral coefficients of the original noise.
- 2. The system of claim 1, wherein:
each one of the first set of gains is also based on the probability of signal absence in each frequency bin.
- 3. The system of claim 1, wherein:
the system modifies the spectral amplitude of the noisy speech without affecting the phase of the noisy speech.
- 4. The system of claim 1, wherein:
during a noise-only frame, a constant gain is applied to the noise in order to avoid noise structuring.
- 5. The system of claim 1, wherein:
the core estimator applies to the spectral coefficients of the noisy speech one of the first set of gains for each frequency bin.
- 6. The system of claim 1, further comprising:
a soft decision module coupled to the signal-to-noise ratio estimator and to the core estimator, the soft decision module applying a second set of gains to the spectral coefficients of the speech that contains a residual noise.
- 7. The system of claim 6, wherein:
the soft decision module determines the second set of gains based on the first signal-to-noise ratio, the second signal-to-noise ratio and the probability of signal absence in each frequency bin.
- 8. A method for enhancing speech, comprising the steps of:
receiving noisy speech, wherein the noisy speech is characterized by spectral coefficients spanning a plurality of frequency bins and contains an original noise; segmenting the speech into noise-only frames and signal-containing frames; determining a noise estimate and a probability of signal absence in each frequency bin; determining a first signal-to-noise ratio and a second signal-to-noise ratio based on the noise estimate; determining a first set of gains based, at least in part, on the second signal-to-noise ratio and a level of aggression; and applying the first set of gains to the spectral coefficients of the noisy speech without discarding the noise-only frames to produce speech that contains a residual amount of noise, such that the spectral density of the spectral coefficients of the residual noise is maintained below a proportion of the spectral density of the spectral coefficients of the original noise.
- 9. The method of claim 8, wherein:
the first set of gains is also based on the probability of signal absence in each frequency bin.
- 10. The method of claim 8, further comprising the step of:
modifying the spectral coefficients of the noisy speech without affecting the phase of the noisy speech.
- 11. The method of claim 8, further comprising the step of:
during a noise-only frame, applying a constant gain to the noise.
- 12. The method of claim 8, wherein:
one of the first set of gains is applied to the spectral coefficients of the noisy speech for each frequency bin.
- 13. The method of claim 8, further comprising the step of:
applying a second set of gains to the spectral coefficients of the speech that contains a residual noise.
- 14. The method of claim 13, further comprising the step of:
determining the second set of gains based on the first signal-to-noise ratio, the second signal-to-noise ratio and the probability of signal absence in each frequency bin.
CROSS-REFERENCE TO RELATED APPLICATIONS
[0001] This application claims the priority benefit of provisional U.S. application Ser. No. 60/071,051, filed Jan. 9, 1998.
Provisional Applications (1)
|
Number |
Date |
Country |
|
60071051 |
Jan 1998 |
US |