1. Technical Field
The present disclosure relates to the field of noise control in audio signals. In particular, to a system and method for mitigating microphone hiss.
2. Related Art
There are many types of microphones each designed for different applications. Microphones may be selected for an application based on factors including cost, size and the dynamic range over a specified frequency range. For example, a microphone utilized in a professional recording application may have a large dynamic range close to the range of human perception from 0 Hz to 16 kHz. The microphone utilized in a mobile phone may have an acceptable dynamic range for speech content from 0 Hz to 5 kHz. The microphones utilized in the mobile phone may be selected for cost reasons but often the dynamic range over a give frequency range may be limited because of the physical size of the microphone.
Mobile phones, and many other computing devices, utilize small microphones because of the small physical size of the computing devices. Physically small microphones may have limitations in the dynamic range over a given frequency range. The limited dynamic range may be audible even in the presence of a completely silent environment. The limited dynamic range of a small microphone at higher frequencies will be perceived as hiss noise.
The system may be better understood with reference to the following drawings and description. The components in the figures are not necessarily to scale, emphasis instead being placed upon illustrating the principles of the disclosure. Moreover, in the figures, like referenced numerals designate corresponding parts throughout the different views.
Other systems, methods, features and advantages will be, or will become, apparent to one with skill in the art upon examination of the following figures and detailed description. It is intended that all such additional systems, methods, features and advantages be included with this description and be protected by the following claims.
A system and method for mitigating microphone hiss may obtain a frequency spectrum characteristic for a sound input transducer such as, for example, a microphone. A microphone that has limited dynamic range with may create microphone hiss in an output signal generated by the microphone. The microphone hiss may prevent a reproduction of a sound field, represented in an output signal of the microphone, from being perceived as a natural environment. The microphone frequency spectrum characteristics (a.k.a. microphone frequency spectrum) may be obtained using static measurements or calculated dynamically. A virtual noise floor may be calculated responsive to the microphone frequency spectrum and a desired noise floor. Gain coefficients may be calculated responsive to the output signal of the microphone (a.k.a. the microphone signal). The gain coefficients may be calculated to mitigate undesirable signal content including background noise and echoes. The calculated gain coefficients may be modified responsive to the virtual noise floor. The modified gain coefficients may allow a reproduction of the sound field to be perceived as a natural environment.
There are many types of microphones 102 each designed for different applications. The microphone 102 may be chosen for an application based on cost, size or dynamic range over a specified frequency range (a.k.a. frequency response). For example, a microphone 102 utilized in a professional recording application may have a large dynamic range close to the range of human perception from 0 Hz to 16 kHz. The microphone 102 utilized in a mobile phone may have an acceptable dynamic range for speech content from 0 Hz to 5 kHz. The microphones 102 utilized in the mobile phone may be selected for cost reasons but often the dynamic range over a give frequency range may be limited because of the physical size of the microphone. Mobile phones, and many other computing devices, utilize small microphones because of the small physical size of the computing devices. Physically small microphones may have limitations of the dynamic range over a given frequency range. A common limitation may be limited dynamic range. The limited dynamic range may be audible even in the presence of a completely silent environment. The limited dynamic range may be heard as microphone hiss in the output signal of the microphone.
Processing that may mitigate undesirable content received in the sound field may include a gain coefficient calculator 104. The gain coefficient calculator 104 may comprise one or more of a noise suppressor 106 and an echo canceller 108. The echo canceller 108, noise suppressor 106 and other audio processing processes may calculate one or more gain coefficients, or suppression gains. The echo canceller 108 and the noise suppressor 106 may each calculate one or more gain coefficients. Each respective gain coefficient may be applied individually or a composite gain coefficient may be applied to process the sound field using a gain coefficient applier 110.
The echo canceller 108 mitigates echoes caused by signal feedback between two or more communication devices. Signal feedback occurs when an audio transducer on a first communication device reproduces the signal received from a second communication device and subsequently the microphones on the first communication device recapture the reproduced signal. The recaptured signal may be transmitted to the second communication device where the recaptured signal may be perceived as an echo of the previously transmitted signal. The echo canceller 108 may detect when a portion of the received signal has been recaptured and attempt to suppress the recaptured portion of the signal. Many different types of echo cancellers 108 may mitigate echoes by calculating one or more gain coefficients that, when applied to the signals received by the microphone 102, suppress the echoes. In one example implementation, the echo canceller 108 may calculate gain coefficients using a coherence calculation between near and far signals disclosed in U.S. Pat. No. 8,036,879, which is incorporated herein by reference, except that in the event of any inconsistent disclosure or definition from the present specification, the disclosure or definition herein shall be deemed to prevail. Alternatively or in addition, the echo canceller 108 may use other mechanisms.
Background noise is another type of undesirable signal content that may be mitigated by processing the microphone signal 120. Many different types of noise suppressor 106 techniques may mitigate background noise. An exemplary noise suppressor 106 is a recursive Wiener filter. The Wiener suppression gain Gi,k, or gain coefficient, is defined as
Where S{circumflex over (N)}Rpriori
S{circumflex over (N)}Rpriori
Where S{circumflex over (N)}Rpost
Where |{circumflex over (N)}i,k| is a background noise estimate. In one example implementation, the background noise estimate, or signal values, may be calculated using the background noise estimation techniques disclosed in U.S. Pat. No. 7,844,453, which is incorporated herein by reference, except that in the event of any inconsistent disclosure or definition from the present specification, the disclosure or definition herein shall be deemed to prevail. In other implementations, alternative background noise estimation techniques may be used, such as, for example, a noise power estimation technique based on minimum statistics. Alternatively or in addition, the noise suppressor 106 may use other mechanisms.
Noise suppression techniques including the exemplary Wiener filter described above may mitigate some microphone hiss. Mitigating microphone hiss using standard noise suppression may reduce the audible hiss created by the microphone 102 although the resulting output 112 may not be perceived as a natural environment. Natural environments, for example a room response, may have a noise floor that changes at a rate of 1/frequency, or −6 dB per octave. A noise floor that changes at a rate of approximately −5 dB to −7 dB per octave may be perceived as a natural environment. The noise suppressor 106 may reduce the amount of hiss created by the microphone 102 but the result may not be perceived as a natural environment. The noise suppressor 106 may generate gain coefficients that result in an output 112 that is perceived as a natural environment when the generated gain coefficients are modified to simulate a natural environment. For example, gain coefficients that result in an output 112 where the hiss noise changes at a rate of approximately −6 dB per octave may be perceived as a natural environment.
A microphone spectrum obtainer 114 may obtain the microphone frequency spectrum 402. The microphone spectrum obtainer 114 may obtain the microphone frequency spectrum 402 utilizing an offline measurement tool. Alternatively or in addition, the microphone spectrum obtainer 114 may obtain the microphone frequency spectrum 402 dynamically by analyzing the microphone signal 120 over time and frequency. Obtaining a dynamic frequency spectrum 402 may include utilizing the background noise estimate.
A virtual noise floor calculator 116 may calculate a virtual noise floor 404 that simulates (e.g. approximates) a natural environment. The microphone hiss may distort the audible environment received by the microphone 102 when reproduced in an output 112. The output 112 when reproduced with the virtual noise floor 404 may be perceived as a more natural environment. The virtual noise floor 404 may be calculated responsive to the microphone frequency spectrum 402. For example, the virtual noise floor calculator 116 may calculate the virtual noise floor 404 starting from a frequency beyond which microphone hiss may become audible when compared to a natural environment. The virtual noise floor calculator 116 may update the virtual noise floor 404 as the microphone frequency spectrum 402 changes responsive to the microphone spectrum obtainer 114. The virtual noise floor 404 may be responsive to detected signal content. For example if no signal content is detected above 10 kHz, comfort noise may be added to the level of the virtual noise floor 404.
A gain coefficient modifier 118 may modify the gain coefficients calculated by the gain coefficient calculator 104. The gain coefficient modifier 118 may modify the gain coefficients responsive to the virtual noise floor 404 calculated by the virtual noise floor calculator 116. The gain coefficient modifier 118 may modify the gain coefficients by increasing or decreasing the value of the gain coefficients. The gain coefficients from the example Wiener filter described above may be modified by limiting the gain coefficients to track the virtual noise floor 404. The gain coefficient modifier 118 may use the following equation to modify the gain coefficients where η(k) represents the virtual noise floor 404:
Ĝi,k=max(η(k),Gi,k). (4)
Equation 4 limits the gain coefficients calculated by the Wiener filter from suppressing noise below the virtual noise floor 404. An alternative gain coefficient modifier 118 may filter, or smooth, the gain coefficients over time and/or frequency. The gain coefficients may be modified resulting in the output 112 being below the virtual noise floor 404 responsive to a different limiting method and/or filtering. Gain coefficients that result in the output 112 being substantially lower than the virtual noise floor 404 may be perceived as a different type of natural environment including for example a large room as compared to a small room. Typically, the gain coefficient modifier 118 allows more noise removal with increasing frequency. For example, 20 dB (decibel) of noise removal between 8 to 10 kHz and 30 dB of noise removal between 15 to 20 kHz.
Gain coefficients calculated with the echo canceller 108 may be modified by the gain coefficient modifier 118 in accordance with equation 4. In an alternative example, additional noise may be added to the signal after the echo canceller 108 to obtain an output 112 that is approximately the same energy level as the virtual noise floor 404. The additional noise may be comfort noise with substantially the same energy level as the virtual noise floor. The comfort noise may be added for any gain coefficient calculator 104.
A subband filter may process the signal from the microphone 102 to extract frequency information. The subband filtering may be accomplished by various methods, such as a Fast Fourier Transform (FFT), critical filter bank, octave filter band, or one-third octave filter bank. Alternatively, the subband analysis may include a time-based filter bank. The time-based filter bank may be composed of a bank of overlapping bandpass filters, where the center frequencies have non-linear spacing such as octave, 3rd octave, bark, mel, or other spacing techniques. The one or more gain coefficients may be calculated for each frequency bin or band of the subband filter.
The processor 202 may comprise a single processor or multiple processors that may be disposed on a single chip, on multiple devices or distributed over more that one system. The processor 202 may be hardware that executes computer executable instructions or computer code embodied in the memory 204 or in other memory to perform one or more features of the system. The processor 202 may include a general purpose processor, a central processing unit (CPU), a graphics processing unit (GPU), an application specific integrated circuit (ASIC), a digital signal processor (DSP), a field programmable gate array (FPGA), a digital circuit, an analog circuit, a microcontroller, any other type of processor, or any combination thereof.
The memory 204 may comprise a device for storing and retrieving data, processor executable instructions, or any combination thereof. The memory 204 may include non-volatile and/or volatile memory, such as a random access memory (RAM), a read-only memory (ROM), an erasable programmable read-only memory (EPROM), or a flash memory. The memory 204 may comprise a single device or multiple devices that may be disposed on one or more dedicated memory devices or on a processor or other similar device. Alternatively or in addition, the memory 204 may include an optical, magnetic (hard-drive) or any other form of data storage device.
The memory 204 may store computer code, such as the gain coefficient calculator module 104, the noise suppressor module 106, the echo canceller module 108, the gain coefficient applier module 110, the microphone spectrum obtainer module 114, the virtual noise floor calculator 116 and the gain coefficient modifier module 118 as described herein. The computer code may include instructions executable with the processor 202. The computer code may be written in any computer language, such as C, C++, assembly language, channel program code, and/or any combination of computer languages. The memory 204 may store information in data structures including, for example, gain coefficients and state variables.
The I/O interface 206 may be used to connect devices such as, for example, the microphone 102, and to other components of the system 200.
All of the disclosure, regardless of the particular implementation described, is exemplary in nature, rather than limiting. The system 100 and 200 may include more, fewer, or different components than illustrated in
The functions, acts or tasks illustrated in the figures or described may be executed in spectrum to one or more sets of logic or instructions stored in or on computer readable media. The functions, acts or tasks are independent of the particular type of instructions set, storage media, processor or processing strategy and may be performed by software, hardware, integrated circuits, firmware, micro code and the like, operating alone or in combination. Likewise, processing strategies may include multiprocessing, multitasking, parallel processing, distributed processing, and/or any other type of processing. In one embodiment, the instructions are stored on a removable media device for reading by local or remote systems. In other embodiments, the logic or instructions are stored in a remote location for transfer through a computer network or over telephone lines. In yet other embodiments, the logic or instructions may be stored within a given computer such as, for example, a CPU.
While various embodiments of the system and method for mitigating microphone hiss have been described, it will be apparent to those of ordinary skill in the art that many more embodiments and implementations are possible within the scope of the present invention. Accordingly, the invention is not to be restricted except in light of the attached claims and their equivalents.
Number | Name | Date | Kind |
---|---|---|---|
5839101 | Vahatalo et al. | Nov 1998 | A |
5937377 | Hardiman et al. | Aug 1999 | A |
20020152066 | Piket | Oct 2002 | A1 |
20050278171 | Suppappola et al. | Dec 2005 | A1 |
20070156399 | Matsuo | Jul 2007 | A1 |
20070223733 | Shuttleworth et al. | Sep 2007 | A1 |
20100202631 | Short | Aug 2010 | A1 |
20100318353 | Bizjak | Dec 2010 | A1 |
Number | Date | Country |
---|---|---|
1 739 657 | Jan 2007 | EP |
WO 2011159858 | Dec 2011 | WO |
Entry |
---|
Dreiseitel, Pia et al., “Speech Enhancement for Mobile Telephony Based on Non-Uniformly Spaced Frequency Resolution,” EUSIPCO 1998, IEEE, Sep. 8, 1998, pp. 1-4. |
Office Action Examination Report for corresponding European Application No. 13153113.9, dated Aug. 19, 2015, 6 pages. |
Number | Date | Country | |
---|---|---|---|
20140211955 A1 | Jul 2014 | US |