The present invention relates to signal processing technology, and, more particularly, to methods, electronic devices, and computer program products for detecting noise in a signal.
Wind noise may be picked up by microphones used in such devices as mobile terminals and hearing aids, for example, and may be a source of interference for a desired audio signal. Electronic devices may incorporate adaptively directional microphones to reduce the effect of wind noise. More specifically, an electronic device may adjust the directivity pattern created by its microphones based on whether the electronic device is operating in a wind mode or no-wind mode.
Conventionally, as described in U.S. Patent Application Publication U.S. 2002/0037088 by Dickel et al., the disclosure of which is hereby incorporated herein by reference, a windy condition is detected by analyzing the output signals of at least two microphones. In more detail, one output signal is subtracted from the other to remove a common component of the two signals. The result of the subtraction is averaged and compared with a threshold value. If the threshold value is exceeded, then the device switches to a wind mode due to the wind being generally spatially uncorrelated. Unfortunately, the foregoing approach to detecting wind noise generally is more effective when turbulence near the microphones is large. For some wind angles, however, the turbulence may be relatively small. Other noise sources that are uncorrelated or inverse correlated to each other in space may generate a false windy condition.
According to some embodiments of the present invention, noise, such as wind noise, for example, may be detected in an electronic device by generating first and second microphone signals. Cross correlation coefficients are determined for the first and the second microphone signals and autocorrelation coefficients are determined for the first and second microphone signals, respectively. A determination may be made whether at least one of the microphone signals includes a noise component based on the cross correlation coefficients, the first autocorrelation coefficients, and the second autocorrelation coefficients. Embodiments of the present invention use the properties that wind noise is relatively uncorrelated in space, but relatively correlated in time. These two properties may be represented by cross correlation coefficients for multiple microphone signals and autocorrelation coefficients for the individual microphone signals, respectively. By combining both a cross correlation coefficient analysis and an autocorrelation coefficient analysis, embodiments of the present invention may detect wind noise with generally improved reliability because the determination is less sensitive to the physics of the microphones.
In other embodiments of the present invention, the directivity pattern created by the first and the second microphones is adjusted based on whether a noise component is detected.
In still other embodiments of the present invention, the cross correlation coefficients are summed to generate a spatial correlation sum, the first autocorrelation coefficients are summed to generate a first autocorrelation sum, and the second autocorrelation coefficients are summed to generate a second autocorrelation sum. The first autocorrelation sum and the spatial correlation sum are multiplied together to generate a first correlation product and the second autocorrelation sum and the spatial correlation sum are multiplied together to generate a second correlation product. A determination is made whether at least one of the microphone signals includes a noise component based on the first and second correlation products.
In further embodiments of the present invention, the first and second correlation products are compared with a threshold value and a determination is made whether at least one of the microphone signals includes a noise component based on at least one of the comparisons.
In still further embodiments of the present invention, the cross correlation coefficients may be scaled and filtered before the spatial correlation sum is generated. The first and second autocorrelation coefficients may be inverted, scaled, and filtered before generating the first and second autocorrelation sums.
Although described above primarily with respect to method aspects of the present invention, it will be understood that the present invention may be embodied as methods, electronic devices, and/or computer program products.
Other features of the present invention will be more readily understood from the following detailed description of specific embodiments thereof when read in conjunction with the accompanying drawings, in which:
While the invention is susceptible to various modifications and alternative forms, specific embodiments thereof are shown by way of example in the drawings and will herein be described in detail. It should be understood, however, that there is no intent to limit the invention to the particular forms disclosed, but on the contrary, the invention is to cover all modifications, equivalents, and alternatives falling within the spirit and scope of the invention as defined by the claims. Like reference numbers signify like elements throughout the description of the figures. It should be further understood that the terms “comprises” and/or “comprising” when used in this specification is taken to specify the presence of stated features, integers, steps, operations, elements, and/or components, but does not preclude the presence or addition of one or more other features, integers, steps, operations, elements, components, and/or groups thereof.
The present invention may be embodied as methods, electronic devices, and/or computer program products. Accordingly, the present invention may be embodied in hardware and/or in software (including firmware, resident software, micro-code, etc.). Furthermore, the present invention may take the form of a computer program product on a computer-usable or computer-readable storage medium having computer-usable or computer-readable program code embodied in the medium for use by or in connection with an instruction execution system. In the context of this document, a computer-usable or computer-readable medium may be any medium that can contain, store, communicate, propagate, or transport the program for use by or in connection with the instruction execution system, apparatus, or device.
The computer-usable or computer-readable medium may be, for example but not limited to, an electronic, magnetic, optical, electromagnetic, infrared, or semiconductor system, apparatus, device, or propagation medium. More specific examples (a nonexhaustive list) of the computer-readable medium would include the following: an electrical connection having one or more wires, a portable computer diskette, a random access memory (RAM), a read-only memory (ROM), an erasable programmable read-only memory (EPROM or Flash memory), an optical fiber, and a compact disc read-only memory (CD-ROM). Note that the computer-usable or computer-readable medium could even be paper or another suitable medium upon which the program is printed, as the program can be electronically captured, via, for instance, optical scanning of the paper or other medium, then compiled, interpreted, or otherwise processed in a suitable manner, if necessary, and then stored in a computer memory.
The present invention is described herein in the context of detecting wind noise in a mobile terminal. It will be understood, however, that the present invention may be embodied in other types of electronic devices that incorporate multiple microphones, such as, for example automobile speech recognition systems, hearing aids, etc. Moreover, as used herein, the term “mobile terminal” may include a satellite or cellular radiotelephone with or without a multi-line display; a Personal Communications System (PCS) terminal that may combine a cellular radiotelephone with data processing, facsimile and data communications capabilities; a PDA that can include a radiotelephone, pager, Internet/intranet access, Web browser, organizer, calendar and/or a global positioning system (GPS) receiver; and a conventional laptop and/or palmtop receiver or other appliance that includes a radiotelephone transceiver. Mobile terminals may also be referred to as “pervasive computing” devices.
It should be further understood that the present invention is not limited to detecting wind noise. In general, the present invention may be used to detect noise that is relatively uncorrelated in space, but relatively correlated in time.
Referring now to
The processor 140 communicates with the memory 135 via an address/data bus. The processor 140 may be, for example, a commercially available or custom microprocessor. The memory 135 is representative of the one or more memory devices containing the software and data used to determine achievable data rate estimates, which may be communicated to a radio packet data system access network for use in allocating bandwidth in the radio packet data system, in accordance with embodiments of the present invention. The memory 135 may include, but is not limited to, the following types of devices: cache, ROM, PROM, EPROM, EEPROM, flash, SRAM, and DRAM.
As shown in
As shown in
Although
Computer program code for carrying out operations of the wind detection program module 165 and/or the signal processor 160 discussed above may be written in a high-level programming language, such as C or C++, for development convenience. In addition, computer program code for carrying out operations of the present invention may also be written in other programming languages, such as, but not limited to, interpreted languages. Some modules or routines may be written in assembly language or even micro-code to enhance performance and/or memory usage. It will be further appreciated that the functionality of any or all of the program and/or processing modules may also be implemented using discrete hardware components, one or more application specific integrated circuits (ASICs), or a programmed digital signal processor or microcontroller.
Referring now to
The foregoing components of the signal processor 200 will now be described in more detail. The delay chain 205 is responsive to a first microphone signal and generates delayed samples of the first microphone signal (microphone 1 signal), which are provided to the correlation unit 210 along with the original first microphone signal (microphone 1 signal). In particular embodiments, the delay chain 205 may weight the samples, such that newer samples are weighted greater than the older samples. The correlation unit 210 also receives the output signal from a second microphone (microphone 2 signal). If the first microphone signal (microphone 1 signal) is given by s1 and the second microphone signal (microphone 2 signal) is given by s2, and the number of delay elements is N, then the correlation unit 210 generates cross correlation coefficients R12( ) at lag k according to Equation 1 set forth hereafter:
The scaling unit 215 scales the cross correlation coefficients, which are representative of the spatial correlation of the two microphone signals, and the scaled cross correlation coefficients are provided to the low pass filter 220 for smoothing. The low pass filter 220 may be embodied as an autoregressive filter in which the output is based on a weighted sum of previous input values. The summation unit 225 generates a spatial correlation sum by adding the coefficients output from the low pass filter 220.
Similarly, the delay chain 230 is responsive to a second microphone signal (microphone 2 signal) and generates delayed samples of the second microphone signal (microphone 2 signal), which are provided to the autocorrelation unit 235 along with the original second microphone signal (microphone 2 signal). In particular embodiments, the delay chain 230 may weight the samples, such that newer samples are weighted greater than the older samples. If the second microphone signal (microphone 2 signal) is given by s2 and the number of delay elements is N, then the autocorrelation unit 235 generates autocorrelation coefficients R22( ) at lag k according to Equation 2 set forth hereafter:
An inversion unit 240 inverts the autocorrelation coefficients. This is because wind noise is relatively uncorrelated spatially, but relatively correlated in time. Thus, to combine the results from the cross correlation and autocorrelation signal analyses and compare the result with a single threshold, the autocorrelation coefficients are inverted so that a higher correlation will result in lower autocorrelation coefficient values output from the inversion unit 240. The scaling unit 245 scales the autocorrelation coefficients, which are representative of the correlation in time of the second microphone signal, and the scaled autocorrelation coefficients are provided to the low pass filter 250 for smoothing. The low pass filter 250 may be embodied as an autoregressive filter in which the output is based on a weighted sum of previous input values. The summation unit 225 generates an autocorrelation sum by adding the coefficients output from the low pass filter 250.
A multiplication unit 260 is coupled to the two summation units 225 and 255 and multiplies the spatial correlation sum output by the summation unit 225 by the autocorrelation sum output by the summation unit 255 to generate a correlation product. A comparator 265 is coupled to the multiplication unit 260 and compares the correlation product output by the multiplication unit 260 with a threshold value. The result of this comparison is provided to a processor, such as the processor 140 of
For purposes of illustration,
Although
The present invention is described hereinafter with reference to flowchart and/or block diagram illustrations of methods, electronic devices, and computer program products in accordance with some embodiments of the invention. These flowchart and/or block diagrams further illustrate exemplary operations of the mobile terminal and signal processor architectures of
These computer program instructions may also be stored in a computer usable or computer-readable memory that may direct a computer or other programmable data processing apparatus to function in a particular manner, such that the instructions stored in the computer usable or computer-readable memory produce an article of manufacture including instructions that implement the function specified in the flowchart and/or block diagram block or blocks.
The computer program instructions may also be loaded onto a computer or other programmable data processing apparatus to cause a series of operational steps to be performed on the computer or other programmable apparatus to produce a computer implemented process such that the instructions that execute on the computer or other programmable apparatus provide steps for implementing the functions specified in the flowchart and/or block diagram block or blocks.
Referring now to
The cross correlation coefficients are summed at block 320 to generate a spatial correlation sum and the autocorrelation coefficients are summed at block 325 to generate respective autocorrelation sums for the first and second microphone signals. At block 330, the autocorrelation sum for the first microphone signal and the spatial correlation sum are multiplied together to generate a first correlation product and the autocorrelation sum for the second microphone signal and the spatial correlation sum are multiplied together to generate a second correlation product.
A determination is then made at block 340 whether either of the first or second correlation products exceeds a threshold value. The wind detection module 170 (see
If either the first or the second correlation products exceeds the threshold value, then the wind detection module 170 may determine that no wind noise has been detected in the sound waves received by the microphones 105 and 110 at block 345. If, however, neither of the first and second correlation products exceeds the threshold, then the wind detection module 170 may determine that wind noise has been detected in the sound waves received by the microphones 105 and 110 at block 350. In particular embodiments, hysteresis may be used such that wind noise is not detected unless the first and second correlation products exceed the threshold by a predetermined value. Similarly, absence of wind may not be detected after a windy environment has been detected unless the threshold exceeds the first and second correlation products by a predetermined value. The wind detection module 170 may then adjust the directivity pattern created by the microphones 105 and 110 according to whether the mobile terminal 100 is determined to be in a windy environment or in a no-wind environment. Additional signal processing may also be performed to reduce the effects of wind noise if the mobile terminal 100 is determined to be in a windy environment, in accordance with some embodiments of the present invention
In other embodiments of the present invention, the wind detection module 170 may determine the presence of wind noise in received sound waves if at least one of the correlation products fails to exceed the threshold, rather than requiring both correlation products to fail to exceed the threshold. In further embodiments, when more than two microphones are used, the wind detection module 170 may determine wind noise is detected if a particular percentage of correlation products fail to exceed the threshold.
Advantageously, embodiments of the present invention use the properties that wind noise is relatively uncorrelated in space, but relatively correlated in time. These two properties may be represented by cross correlation coefficients for multiple microphone signals and autocorrelation coefficients for the individual microphone signals, respectively. Recall that, in addition to wind noise, other noise signals may also be uncorrelated or inverse correlated in space. By combining both a cross correlation coefficient analysis and an autocorrelation coefficient analysis, embodiments of the present invention may detect wind noise with generally improved reliability because the determination is less sensitive to the physics of the microphones.
The flowchart of
Many variations and modifications can be made to the preferred embodiments without substantially departing from the principles of the present invention. All such variations and modifications are intended to be included herein within the scope of the present invention, as set forth in the following claims.
This application claims the benefit of Provisional Application Nos. 60/395,889 and 60/395,888, filed Jul. 15, 2002, the disclosures of which are hereby incorporated herein by reference.
Number | Name | Date | Kind |
---|---|---|---|
5347586 | Hill et al. | Sep 1994 | A |
5400409 | Linhard | Mar 1995 | A |
6219427 | Kates et al. | Apr 2001 | B1 |
6675114 | Ando et al. | Jan 2004 | B1 |
20020337088 | Dickel et al. | Mar 2002 | |
20020041696 | Jensen | Apr 2002 | A1 |
Number | Date | Country |
---|---|---|
10045197 | Mar 2002 | DE |
1 196 009 | Apr 2002 | EP |
WO 9960822 | Nov 1999 | WO |
WO 0359010 | Jul 2003 | WO |
Number | Date | Country | |
---|---|---|---|
20040008850 A1 | Jan 2004 | US |
Number | Date | Country | |
---|---|---|---|
60395889 | Jul 2002 | US | |
60395888 | Jul 2002 | US |