This application claims the benefit under 35 USC §119(a) of Korean Patent Application No. 10-2011-0027178, filed on Mar. 25, 2011, in the Korean Intellectual Property Office, the entire disclosure of which is incorporated herein by reference for all purposes.
1. Field
The following description relates to a processor and method for estimating spectrum density of diffused noise.
2. Description of the Related Art
Typical methods for removing noises from audio apparatuses include valley detection, histograms, and the like. However, a portable audio apparatus has a limited battery capacity. Accordingly, a limited amount of algorithm calculations may improve battery life.
In one general aspect, there is provided a processor including at least two sound receiving units configured to receive sounds, a correlation estimating unit configured to estimate a correlation between diffused noises included in the sounds received by the at least two sound receiving units, and a spectrum density estimating unit configured to estimate a spectrum density with respect to the diffused noises based on the estimated correlation.
The spectrum density estimating unit may be configured to estimate the spectrum density in which a low frequency band is compensated.
The spectrum density estimating unit may comprise an eigenvalue estimating unit configured to estimate an eigenvalue of a covariance matrix based on the sounds received by the at least two sound receiving units, and a low frequency band compensating unit configured to compensate a low frequency band of spectrum density with respect to the diffused noises based on the estimated eigenvalue and the estimated correlation.
The covariance matrix may comprise elements including the estimated correlation multiplied by the power spectrum density of the diffused noises.
The correlation estimating unit may estimate the correlation between diffused noises included in the sounds received by the at least two sound receiving units, such that a higher weight is applied to a low frequency band of the diffused noises in comparison to a high frequency band of the diffused noises.
The correlation estimating unit may be configured to estimate the correlation using a sinc function according to a frequency.
The correlation estimating unit may be configured to estimate the correlation using sinc functions according to the frequency and a distance between the at least two sound receiving units.
The processor may further comprise a noise removing unit configured to remove the diffused noises included in the sounds received by the at least two sound receiving units using the estimated spectrum density.
In another aspect, there is provided a sound reproducing device including a processor configured to receive sounds using at least two sound receiving units, to estimate a spectrum density with respect to diffused noises included in the received sounds in consideration of correlation between the diffused noises, and to remove the diffuses noises included in the received sounds based on the estimated spectrum density, an amplifying unit configured to amplify the sounds from which the diffused noises are removed, and an output unit configured to output the amplified sounds.
The estimated spectrum density may be a spectrum density in which a low frequency band is compensated.
The sound reproducing device may be a binaural hearing aid.
In another aspect, there is provided a method for estimating spectrum density of diffused noises using a device that has at least two sound receiving units, the method including receiving sounds by the at least two sound receiving units, estimating a correlation between diffused noises included in the sounds received by the two sound receiving units, and estimating a spectrum density with respect to the diffused noises based on the estimated correlation.
The estimating of the spectrum density may comprise estimating the spectrum density in which a low frequency band is compensated.
The estimating of the spectrum density with respect to diffused noises may comprise estimating an eigenvalue of a covariance matrix using the sounds received by the at least two sound receiving units, and estimating the spectrum density in which a low frequency band is compensated based on the estimated eigenvalue and the correlation between the diffused noises.
The covariance method may comprise elements including the estimated correlation multiplied by power spectrum density of the diffused noises.
The estimating of the correlation may comprise estimating the correlation between the diffused noises included in the sounds received by the at least two sound receiving units, such that a higher weight is applied to a low frequency band of the diffused noises in comparison to a high frequency band of the diffused noises.
The estimating of the correlation may comprise estimating the correlation using a sinc function according to a frequency.
The estimating of the correlation may comprise estimating the correlation using sinc functions according to the frequency and a distance between the at least two sound receiving units.
The method may further comprise removing the diffused noises included in the sounds received by the at least two sound receiving units using the estimated spectrum density.
In another aspect, there is provided a computer-readable storage medium having stored therein program instructions to cause a computer to execute a method for estimating spectrum density of diffused noises using a device that has at least two sound receiving units, the method including receiving sounds using the at least two sound receiving units, estimating a correlation between diffused noises included in the sounds received by the two sound receiving units, and estimating a spectrum density with respect to the diffused noises based on the estimated correlation.
Other features and aspects may be apparent from the following detailed description, the drawings, and the claims.
Throughout the drawings and the detailed description, unless otherwise described, the same drawing reference numerals will be understood to refer to the same elements, features, and structures. The relative size and depiction of these elements may be exaggerated for clarity, illustration, and convenience.
The following detailed description is provided to assist the reader in gaining a comprehensive understanding of the methods, apparatuses, and/or systems described herein. Accordingly, various changes, modifications, and equivalents of the methods, apparatuses, and/or systems described herein will be suggested to those of ordinary skill in the art. Also, descriptions of well-known functions and constructions may be omitted for increased clarity and conciseness.
Referring to
For example, the processor 100 shown in
The processor 100 may be used to estimate a spectrum density with respect to diffused noise based on sound received from the surroundings. The processor 100 may be included in devices including a sound reproduction device, a sound outputting device, a repeater, a telephone, a communication device, a sound detector, binaural hearing aids, and the like. For example, the processor may be included in a terminal, a mobile phone, a computer, a sensor, a hearing aid, and the like.
The at least two sound receiving units 110 receive sound from the surroundings. In the example of
The sound receiving units 110 may be microphones that receive sound from the surroundings and convert the sound to electric signals. As another example, the sound receiving units 110 may include any of various devices which detect and receive sound from the surroundings.
In an example in which the processor 100 is included in a binaural hearing aid, the first sound receiving unit 112 and the second sound receiving unit 114 may be worn in correspondence to the right ear and the left ear of a user, respectively.
The correlation estimating unit 120 may estimate correlation between diffused noises that are included in sounds received by the sound receiving units 110. For example, a first diffused noise may be included in a first sound received by the first sound receiving unit 112 and a second diffused noise may be included in a second sound received by the second sound receiving unit 114. The correlation estimating unit 120 may estimate a correlation between the first diffused noise and the second diffused noise.
For example, the same sound may be included in the first sound and the second sound received by the first sound receiving unit 112 and the second sound receiving unit 114. Furthermore, diffused noises included in the sound may become the first diffused noise and the second diffused noise received by the first sound receiving unit 112 and the second sound receiving unit 114, respectively.
The diffused noise may include white noises. For example, the term diffused noise may refer to noise that is non-directional, has a same magnitude in all directions, and has a random phase. As an example, diffused noise may include bubble noises, reverberations, and the like from inside a room (e.g., an office, a café, etc.).
Magnitudes of diffused noises included in sounds received by the first sound receiving unit 112 and the second sound receiving unit 114 may be approximately same as each other, and correlation between the diffused noises may be low. Here, the correlation may be coherence.
In various examples, the diffused noises included in the sounds received by the first sound receiving unit 112 and the second sound receiving unit 114 may have higher correlation in a low frequency band in comparison to a high frequency band. Here, the low frequency band may refer to a frequency band below or equal to about 500 Hz. However, the examples herein are not limited thereto.
There may be a correlation between diffused noises in a low frequency band. Accordingly, the correlation estimating unit 120 may estimate a correlation between diffused noises included in sounds received by the sound receiving units 110. For example, correlation estimating unit 120 may estimate correlation between diffused noises included in sounds received by the sound receiving units 110, such that higher weight is applied to a low frequency band in comparison to a high frequency band. As another example, the correlation estimating unit 120 may estimate correlation using a sinc function according to frequency. Furthermore, the correlation estimating unit 120 may estimate correlation using sinc functions according to frequency and a distance between the sound receiving units 110.
In detail, the correlation estimating unit 120 may estimate correlation between diffused noises by using a sinc function which employs at least one of a distance between the first sound receiving unit 112 and the second sound receiving unit 114 and a frequency as a variable.
However, usage of a sinc function to estimate correlation between diffused noises is merely an example, and the correlation estimating unit 120 may estimate correlation between diffused noises using any of various methods for applying a higher weight to a low frequency band of the diffused noise and applying a lower weight to a high frequency band of the diffused noises.
The spectrum density estimating unit 130 may estimate spectrum density of the diffused noises using the correlation estimated by the correlation estimating unit 120. For example, spectrum density may be power spectrum density (PSD). As another example, the spectrum density may further include energy spectrum density (ESD), and the like.
For example, the spectrum density estimating unit 130 may estimate a spectrum density in which a low frequency band is compensated. As described above, diffused noises included in sounds received by the first sound receiving unit 112 and the second sound receiving unit 114 may have a higher correlation in a low frequency band as compared to a high frequency band, and thus the spectrum density estimating unit 130 may estimate a spectrum density in which a low frequency band is compensated.
Furthermore, the spectrum density estimating unit 130 may estimate a covariance matrix using sounds received by the sound receiving units 110, estimate an eigenvalue of the estimated covariance matrix, and estimate a spectrum density in which a low frequency band is compensated by using correlation between the diffused noises and the estimated eigenvalue.
As described herein, the spectrum density estimating unit 130 does not underestimate spectrum density of diffused noises in a low frequency band, and thus, spectrum density may be estimated accurately with a small amount of calculations.
The processor 100 shown in
The at least two sound receiving units 110 receive sounds from the surrounding. For example, the sound receiving units 110 may perform a Fourier Transformation or a Fast Fourier Transformation to convert the received sounds to frequency bands.
The correlation estimating unit 120 may estimate correlation between diffused noises that are included in the sounds received by the sound receiving units 110. Hereinafter, an example in which the processor 100 is included in a binaural hearing aid is described. However, the processor 100 is not limited thereto.
The first sound receiving unit 112 and the second sound receiving unit 114 may be attached on the left ear and the right ear of a user, respectively. For example, correlation between diffused noises received by the first sound receiving unit 112 and the second sound receiving unit 114 may be expressed as shown in Equation 1 below.
In Equation 1, Ψ denotes correlation between diffused noises received by the first sound receiving unit 112 and the second sound receiving unit 114, ΓNN denotes power spectrum density of the diffused noises, ΓNNL denotes power spectrum density of the diffused noises received by the first sound receiving unit 112, ΓNNR denotes power spectrum density of the diffused noises received by the second sound receiving unit 114, and ΓNNLR denotes power spectrum density of the diffused noises received by the first sound receiving unit 112 and the diffused noises received by the second sound receiving unit 114. In this example, ΓNNLR may indicate an average with respect to the diffused noises received by the first sound receiving unit 112 multiplied by the diffused noises received by the second sound receiving unit 114. However, the examples herein are not limited thereto.
In this example, correlation Ψ between diffused noises received by the first sound receiving unit 112 and the second sound receiving unit 114 may be coherence function between the left channel corresponding to the first sound receiving unit 112 and the right channel corresponding to the second sound receiving unit 114. Accordingly, correlation Ψ between diffused noises may be defined as a ratio of power spectrum density ΓNN of the diffused noises to power spectrum density ΓNNLR of the diffused noises received by the first sound receiving unit 112 and the diffused noises received by the second sound receiving unit 114.
As described herein, diffused noises included in sounds received by the first sound receiving unit 112 and the second sound receiving unit 114 may have a higher correlation in a low frequency band as compared to a high frequency band. Therefore, values of power spectrum density ΓNNLR of the diffused noises received by the first sound receiving unit 112 and the diffused noises received by the second sound receiving unit 114 may become closer to zero (0) as frequency increases from a low frequency band to a high frequency band. Accordingly, the correlation estimating unit 120 may estimate correlation, such that higher weight is applied to a low frequency band of diffused noises included in sounds received by the sound receiving units 110 as compared to a high frequency band of diffused noises.
For example, the correlation estimating unit 120 may estimate correlation using a sinc function according to frequency and a distance between the sound receiving units 110. Estimated correlation between diffused noises may be defined as shown in Equation 2 below.
In Equation 2, Ψ denotes correlation, f denotes frequency, dLR denotes a distance between the sound receiving units 110, and c denotes the speed of sound.
The correlation estimating unit 120 may estimate correlation between diffused noises using sinc functions according to frequency and a distance between the first sound receiving unit 112 and the second sound receiving unit 114.
The spectrum density estimating unit 130 may estimate spectrum density with respect to the diffused noises by using the correlation with respect to the diffused noises. For example, spectrum density that is estimated by the spectrum density estimating unit 130 may be defined as shown in Equation 3 below.
In Equation 3, ΓNN denotes power spectrum density of diffused noises, λ denotes an eigenvalue of a covariance matrix with respect to sounds received by the sound receiving units, and Ψ denotes correlation between the diffused noises.
As another example, the spectrum density estimating unit 130 may estimate spectrum density with respect to diffused noises by using correlation estimated by the correlation estimating unit 120 and an eigenvalue of a covariance matrix with respect to sounds received by the sound receiving units 110. For example, the eigenvalue estimating unit 132 may estimate an eigenvalue of a covariance matrix by using sounds received by the sound receiving units 110. The low frequency band compensating unit 134 may compensate a low frequency band of spectrum density with respect to diffused noises by using the eigenvalue estimated by the eigenvalue estimating unit 132 and the correlation estimated by the correlation estimating unit 120.
The eigenvalue estimating unit 132 may estimate a covariance matrix as shown in Equation 4 below by using sounds received by the sound receiving units 110.
In Equation 4, Rx denotes a covariance matrix, aR denotes a right head related transfer function (HRTF) between a sound signal and a user, aL denotes a left HRTF between the sound signal and the user, ΓSS denotes power spectrum density of a sound signal, ΓNN denotes power density spectrum of diffused noises, and Ψ denotes correlation between the diffused noises.
A sound signal may be an input signal input to each of the sound receiving units 110 other than diffused noises. However, the examples herein are not limited thereto.
In Equation 4, a covariance matrix Rx of sounds received by the sound receiving units 110 has elements including ΨΓNN. In other words, the eigenvalue estimating unit 132 further considers ΨΓNN for estimating a cross correlation function with respect to signals received by the first sound receiving unit 112 and the second sound receiving unit 114. Therefore, the eigenvalue estimating unit 132 may estimate a covariance matrix in consideration of correlation between diffused noises.
Furthermore, the eigenvalue estimating unit 132 may estimate an eigenvalue of a covariance matrix as shown in Equation 5 below.
In Equation 5, λ1,2 denote eigenvalues of the covariance matrix, aR denotes a right HRTF between the sound signal and the user, aL denotes a left HRTF between the sound signal and the user, ΓSS denotes power spectrum density of the sound signal, ΓNN denotes power spectrum density of diffused noises, and Ψ denotes correlation between diffused noises.
The eigenvalue estimating unit 132 may estimate the smaller one of the eigenvalues λ1 and λ2 of a covariance matrix estimated according to Equation 5 as an eigenvalue of the covariance matrix. In this example, the low frequency band compensating unit 134 compensates a low frequency band of spectrum density with respect to diffused noises by using the eigenvalue estimated by the eigenvalue estimating unit 132 and the correlation estimated by the correlation estimating unit 120.
For example, the low frequency band compensating unit 134 may estimate a power spectrum density with respect to diffused noises using the eigenvalue estimated by the eigenvalue estimating unit 132 and the correlation estimated by the correlation estimating unit 120, as shown in Equation 3. Accordingly, the estimated spectrum density may be power spectrum density in which a low frequency band is compensated.
As described herein, the spectrum density estimating unit 130 may estimate spectrum density with respect to diffused noises in consideration of correlation between the diffused noises, and thus, accuracy of spectrum density estimation may be improved.
The noise removing unit 140 may remove diffused noises that are included in sounds received by the sound receiving units 110 using the spectrum density estimated by the spectrum density estimating unit 130. For example, the noise removing unit 140 may be a filter for removing diffused noises from sounds received by the sound receiving units 110 using the spectrum density of the diffused noises. However, the examples herein are not limited thereto.
Accordingly, the processor 100 may receive sounds received by the sound receiving units 110 and estimate spectrum density of diffused noises, and thus an amount of calculation performed by the processor 100 is relatively small. Furthermore, because the processor 100 may estimate spectrum density of diffused noises in consideration of correlation between the diffused noises, accuracy of the estimation may be improved, and more particularly, accuracy of estimation with respect to a low frequency band may be significantly improved.
As indicated by the graph 31, diffused noises may have high correlation in a low frequency band (e.g., a band of low frequencies up to 500 Hz) and may have low correlation in a high frequency band. Therefore, the correlation estimating unit 120 may estimate correlation using sinc functions according to frequency and a distance between the sound receiving units 110, as indicated by the graph 31.
The sound reproducing device 200 may be binaural hearing aids. However, the examples herein are not limited thereto. Furthermore, the processor 100 shown in
The processor 100 receives sounds from the surroundings using at least two sound receiving units, estimates spectrum density with respect to diffused noises included in the received sounds in consideration of correlation between the diffused noises, and removes the diffused noises included in the received sounds using the estimated spectrum density. For example, the estimated spectrum density may be a spectrum density in which a low frequency band is compensated.
The processor 100 may transmit signals L and R to the amplifying unit 210. The signal L and AR may be generated by removing diffused noises from signals received by sound receiving units arranged at the left ear and the right ear of a user.
The amplifying unit 210 may amplify sounds from which diffused noises have been removed by the processor 100. For example, the amplifying unit 210 may transmit amplified signals L′ and R′ to the output unit 220. The amplified signals L′ and R′ may be generated by adjusting amplification gains according to frequencies.
The output unit 220 may output sounds amplified by the amplifying unit 210. For example, the output unit 220 may output signals L″ and R″ which are generated by converting the amplified signals L′ and R′ to time domains. For example, the output unit 220 may include a conversion processor for converting signals from the frequency domain to the time domain and a speaker for outputting the converted signals.
For example, a user may hear sounds that are generated by removing diffused noises and that are amplified without the diffused noises, by wearing the sound reproducing device 200.
In 501, sounds are received from the surroundings. For example, the sounds may be received by at least two sound receiving units 110 which may include microphones.
In 502, a correlation between diffused noises included in the sounds received in 501 is estimated.
In 503, spectrum density with respect to the diffused noises is estimated using the correlation estimated in 502. For example, the estimated spectrum density may be a spectrum density in which a low frequency band is compensated.
As described herein, the processor 100 may accurately estimate spectrum density with respect to diffused noises by using a simple algorithm. Accordingly, spectrum density with respect to diffused noises may be accurately estimated with a reduced amount of calculations.
Program instructions to perform a method described herein, or one or more operations thereof, may be recorded, stored, or fixed in one or more computer-readable storage media. The program instructions may be implemented by a computer. For example, the computer may cause a processor to execute the program instructions. The media may include, alone or in combination with the program instructions, data files, data structures, and the like. Examples of computer-readable storage media include magnetic media, such as hard disks, floppy disks, and magnetic tape; optical media such as CD ROM disks and DVDs; magneto-optical media, such as optical disks; and hardware devices that are specially configured to store and perform program instructions, such as read-only memory (ROM), random access memory (RAM), flash memory, and the like. Examples of program instructions include machine code, such as produced by a compiler, and files containing higher level code that may be executed by the computer using an interpreter. The program instructions, that is, software, may be distributed over network coupled computer systems so that the software is stored and executed in a distributed fashion. For example, the software and data may be stored by one or more computer readable storage mediums. Also, functional programs, codes, and code segments for accomplishing the example embodiments disclosed herein can be easily construed by programmers skilled in the art to which the embodiments pertain based on and using the flow diagrams and block diagrams of the figures and their corresponding descriptions as provided herein. Also, the described unit to perform an operation or a method may be hardware, software, or some combination of hardware and software. For example, the unit may be a software package running on a computer or the computer on which that software is running.
As a non-exhaustive illustration only, a terminal/device/unit described herein may refer to mobile devices such as a cellular phone, a personal digital assistant (PDA), a digital camera, a portable game console, and an MP3 player, a portable/personal multimedia player (PMP), a handheld e-book, a portable laptop PC, a global positioning system (GPS) navigation, a tablet, a sensor, and devices such as a desktop PC, a high definition television (HDTV), an optical disc player, a setup box, a home appliance, and the like that are capable of wireless communication or network communication consistent with that which is disclosed herein.
A computing system or a computer may include a microprocessor that is electrically connected with a bus, a user interface, and a memory controller. It may further include a flash memory device. The flash memory device may store N-bit data via the memory controller. The N-bit data is processed or will be processed by the microprocessor and N may be 1 or an integer greater than 1. Where the computing system or computer is a mobile apparatus, a battery may be additionally provided to supply operation voltage of the computing system or computer. It will be apparent to those of ordinary skill in the art that the computing system or computer may further include an application chipset, a camera image processor (CIS), a mobile Dynamic Random Access Memory (DRAM), and the like. The memory controller and the flash memory device may constitute a solid state drive/disk (SSD) that uses a non-volatile memory to store data.
A number of examples have been described above. Nevertheless, it will be understood that various modifications may be made. For example, suitable results may be achieved if the described techniques are performed in a different order and/or if components in a described system, architecture, device, or circuit are combined in a different manner and/or replaced or supplemented by other components or their equivalents. Accordingly, other implementations are within the scope of the following claims.
Number | Date | Country | Kind |
---|---|---|---|
10-2011-0027178 | Mar 2011 | KR | national |
Number | Name | Date | Kind |
---|---|---|---|
20030147538 | Elko | Aug 2003 | A1 |
20080130914 | Cho | Jun 2008 | A1 |
20090012783 | Klein | Jan 2009 | A1 |
20100208921 | Fischer | Aug 2010 | A1 |
Number | Date | Country |
---|---|---|
2010-200350 | Sep 2010 | JP |
10-2010-0053890 | May 2010 | KR |
10-2010-0120567 | Nov 2010 | KR |
Entry |
---|
Arslan, Levent, et al. “New methods for adaptive noise suppression.” 1995 International Conference on. vol. 1. IEEE (1995): 812-815. |
Hirsch, H. G., et al. “Noise estimation techniques for robust speech recognition.” 1995 International Conference on. vol. 1. IEEE (1995): 153-156. |
Martin, Rainer. “Noise power spectral density estimation based on optimal smoothing and minimum statistics.” Speech and Audio Processing, IEEE Transactions on vol. 9 No. 5 (2001): 504-512. |
McCowan, Iain A., et al., “Microphone array post-filter based on noise field coherence.” Speech and Audio Processing, IEEE Transactions vol. 11 No. 6 (2003): 709-716. |
Abutalebi, Hamid Reza, et al. “A hybrid subband adaptive system for speech enhancement in diffuse noise fields.” Signal Processing Letters, IEEE vol. 11 No. 1 (2004): 1-4. |
Number | Date | Country | |
---|---|---|---|
20120243695 A1 | Sep 2012 | US |