Claims
- 1. A method for recovering an audio signal produced by a desired source from an audio channel in which audio signals from a plurality of different sources are combined, comprising the steps of:
(a) processing the audio channel with a joint acoustic modulation frequency algorithm to separate audio signals from the plurality of different sources into distinguishable components; (b) masking each distinguishable component corresponding to any source that is not desired in the audio channel, such that the distinguishable component corresponding to the desired source remains unmasked; and (c) processing the distinguishable component that is unmasked with an inverse joint acoustic modulation frequency algorithm, to recover the audio signal produced by the desired source.
- 2. The method of claim 1, wherein the step of processing the audio channel with the joint acoustic modulation frequency algorithm comprises the steps of:
(a) applying a base acoustic transform to the audio channel; and (b) applying a second modulation transform to a result from applying the base acoustic transform.
- 3. The method of claim 2, wherein the step of processing the distinguishable component that is unmasked with an inverse joint acoustic modulation frequency algorithm comprises the steps of:
(a) applying an inverse second modulation transform to the distinguishable component that is unmasked; and (b) applying an inverse base acoustic transform to a result of the inverse second modulation transform.
- 4. The method of claim 2, wherein the base acoustic transform separates the audio channel into a magnitude spectrogram and a phase spectrogram.
- 5. The method of claim 4, wherein the second modulation transform converts the magnitude spectrogram and the phase spectrogram into a magnitude joint frequency plane and a phase joint frequency plane.
- 6. The method of claim 5, wherein the step of masking each distinguishable component corresponding to any source that is not desired comprises the steps of:
(a) providing a magnitude mask and a phase mask for each distinguishable component corresponding to any source that is not desired; (b) using each magnitude mask, performing a point-by-point operation on the magnitude joint frequency plane, thereby producing a modified magnitude joint frequency plane; and (c) using each phase mask, performing a point-by-point operation on the phase joint frequency plane, thereby producing a modified phase joint frequency plane.
- 7. The method of claim 5, wherein the step of masking each distinguishable component corresponding to any source that is not desired comprises the steps of:
(a) providing a magnitude mask and a phase mask for each distinguishable component corresponding to any source that is not desired; (b) using each magnitude mask, performing a point-by-point multiplication on the magnitude joint frequency plane, thereby producing a modified magnitude joint frequency plane; and (c) using each phase mask, performing a point-by-point addition on phase joint frequency plane, thereby producing a modified phase joint frequency plane.
- 8. The method of claim 6, wherein the step of processing the distinguishable component that is unmasked with an inverse joint acoustic modulation frequency algorithm comprises the steps of:
(a) performing an inverse second modulation transform on the modified magnitude joint frequency plane, thereby producing a magnitude spectrogram; (b) performing an inverse second modulation transform on the modified phase joint frequency plane, thereby producing a phase spectrogram; and (c) performing an inverse base acoustic transform on the magnitude spectrogram and the phase spectrogram, to recover the audio signal produced by the desired source.
- 9. The method of claim 3, wherein the steps of applying a base acoustic transform, applying a second modulation transform, applying an inverse second modulation transform, and applying an inverse base acoustic transform are executed by a computing device.
- 10. The method of claim 1, further comprising the step of automatically selecting each distinguishable component corresponding to any source that is not desired.
- 11. The method of claim 1, further comprising the step of enabling a user to listen to the audio signal that was recovered, to determine if additional processing is desired.
- 12. The method of claim 2, further comprising the steps of:
(a) displaying the distinguishable components; and (b) enabling a user to select the distinguishable component that corresponds to the audio signal from the desired source.
- 13. The method of claim 1, wherein before the step of processing the audio channel with the joint acoustic modulation frequency algorithm, further comprising the step of separating the audio channel into a plurality of different analysis windows, such that each portion of the audio channel in an analysis window has relatively constant spectral characteristics.
- 14. The method of claim 13, wherein the plurality of different analysis windows are selected such that vocalic and fricative sounds are not present in the same analysis window.
- 15. The method of claim 1, wherein steps (a)-(c) are implemented as a preprocessor in an automated speech recognition system, so that the audio signal produced by the desired source is recovered for automated speech recognition.
- 16. The method of claim 1, wherein steps (a)-(c) are implemented as a preprocessor in a hearing aid, so that the audio signal produced by the desired source is recovered for amplification.
- 17. A memory medium storing machine instructions for carrying out the steps of claim 1.
- 18. A system for recovering an audio signal produced by a desired source from an audio channel in which audio signals from a plurality of different sources are combined, comprising:
(a) a memory in which are stored a plurality of machine instructions defining a single channel audio separation program; and (b) a processor that is coupled to the memory, to access the machine instructions, said processor executing said machine instructions and thereby implementing a plurality of functions, including:
(i) processing the audio channel with a joint acoustic modulation frequency algorithm to separate audio signals from the plurality of different sources into distinguishable components; (ii) masking each distinguishable component corresponding to any source that is not desired in the audio channel, such that the distinguishable component corresponding to the desired source remains unmasked; and (iii) processing the distinguishable component that is unmasked with an inverse joint acoustic modulation frequency algorithm, to recover the audio signal produced by the desired source.
- 19. The system of claim 18, wherein the machine instructions further cause said processor to:
(a) apply a base acoustic transform to the audio channel; and (b) apply a second modulation transform to a result from applying the base acoustic transform.
- 20. The system of claim 19, wherein the machine instructions further cause the processor to:
(a) apply an inverse second modulation transform to the distinguishable component that is unmasked; and (b) apply an inverse base acoustic transform to a result of the inverse second modulation transform.
- 21. The system of claim 18, further comprising:
(a) a display operatively coupled to the processor and configured to display the distinguishable components; and (b) a user input device operatively coupled to the processor and configured to enable a user to select from the display the distinguishable component that corresponds to the audio signal from the desired source.
- 22. The system of claim 18, further comprising:
(a) a microphone configured to provide the audio channel in response to an ambient audio environment that includes a plurality of different sources, the microphone being coupled to said processor such that the processor receives the audio channel produced by the microphone; (b) an amplifier coupled with the processor, such that the amplifier receives the audio signal conveying the desired source from the processor, the amplifier being configured to amplify the audio signal conveying the desired source; and (c) an output transducer coupled with the amplifier such that the output transducer receives the amplified audio signal corresponding to the desired source.
- 23. The system of claim 22, further comprising a housing substantially enclosing said microphone, said processor, said amplifier, and said output transducer, the housing being configured to be disposed in at least one of:
(a) behind an ear of a user; (b) within an ear of a user; and (c) within an ear canal of a user.
- 24. A method for employing a joint acoustic modulation frequency algorithm to separate individual audio signals from different sources that have been combined into a combined audio signal, into distinguishable signals, comprising the steps of:
(a) applying a base acoustic transform to the combined audio signal to separate the combined audio signal into a magnitude spectrogram and a phase spectrogram; (b) applying a second modulation transform to the magnitude spectrogram and the phase spectrogram, generating a magnitude joint frequency plane and a phase joint frequency plane, such that the individual audio signals from different sources are separated into the distinguishable signals.
- 25. The method of claim 24, further comprising the steps of:
(a) masking each distinguishable component that is not desired, such that at least one distinguishable component remains unmasked; (b) applying an inverse second modulation transform to the at least one unmasked distinguishable component; and (c) applying an inverse base acoustic transform to a result of the inverse second modulation transform, producing an audio signal that includes only those audio signals from each different source that is desired.
- 26. The method of claim 25, wherein the step of masking each distinguishable component that is not desired comprises the steps of:
(a) providing a magnitude mask and a phase mask for each distinguishable component that is not desired; (b) using each magnitude mask provided, performing a point by point multiplication on the magnitude joint frequency plane, thereby producing a modified magnitude joint frequency plane; and (c) using each phase mask provided, performing a point-by-point addition on the phase joint frequency plane, thereby producing a modified phase joint frequency plane.
- 27. The method of claim 26, wherein the step of applying the inverse second modulation transform comprises the steps of:
(a) applying the inverse second modulation transform to the modified magnitude joint frequency plane, producing a magnitude spectrogram; and (b) applying the inverse second modulation transform to the modified phase joint frequency plane, producing a phase spectrogram.
- 28. The method of claim 27, wherein the step of applying the inverse base acoustic transform comprises the step of applying the inverse base acoustic transform to the magnitude spectrogram and the phase spectrogram, producing the audio signals from each different source that is desired.
- 29. A memory medium storing machine instructions for carrying out the steps of claim 24.
RELATED APPLICATIONS
[0001] This application is based on a prior copending provisional application Serial No. 60/369,432, filed on Apr. 2, 2002, the benefit of the filing date of which is hereby claimed under 35 U.S.C. §119(e).
Provisional Applications (1)
|
Number |
Date |
Country |
|
60369432 |
Apr 2002 |
US |