1. Technical Field of the Invention
The present invention relates to a technology for suppressing noise components in an audio signal.
2. Description of the Related Art
A technology for suppressing noise components in a sound mixture of target sound components and noise components has been suggested. For example, Japanese Patent Application Publication No. 2007-248534 describes a technology for subtracting a spectrum of noise components estimated through independent component analysis from a spectrum of an audio signal in which target sound components have been emphasized through a delay sum type beamformer.
However, in the technology for suppressing noise components in the frequency domain as in Japanese Patent Application Publication No. 2007-248534, components remaining in the time axis and the frequency axis after suppression of noise components are perceived as artificial and harsh musical noise by the listener. Reducing the extent of subtraction of noise components decreases musical noise but has a problem in that noise components cannot be sufficiently suppressed (i.e., the SN ratio is low after noise component suppression).
In view of these circumstances, it is an object of the invention to achieve both reduction in musical noise and effective suppression of noise components.
In order to solve the problem, according to the invention, an apparatus is provided for suppressing noise components from audio signals of a plurality of channels generated by a plurality of sound collecting devices, the inventive apparatus comprising: a noise extraction part that extracts a noise component from an audio signal of each of the plurality of channels; a stationary noise estimation part that estimates stationary noise included in the noise component; a first noise suppression part that removes a spectrum of the stationary noise from a spectrum of the audio signal of each of the plurality of channels to an extent determined according to a subtraction factor; a non-stationary noise estimation part that estimates a spectrum of non-stationary noise by subtracting the spectrum of the stationary noise from the spectrum of the noise component of each of the plurality of channels; a factor setting part that generates a filtering factor for emphasizing a target sound component contained in the audio signal from the spectrum of the non-stationary noise; a second noise suppression part that performs a filtering process using the filtering factor on the audio signals of the plurality of channels after processing of the first noise suppression part; an index calculation part that calculates a kurtosis change index representing an extent of change of kurtosis in a frequence distribution of magnitude of each of the audio signals between the kurtosis observed when processing of the first noise suppression part is performed and the kurtosis observed when processing of the second noise suppression part is performed; and a factor adjustment part that variably controls the subtraction factor according to the kurtosis change index.
In this embodiment, it is possible to effectively suppress noise components while suppressing musical noise caused by the processing of the first noise suppression part since the subtraction factor used for the processing of the first noise suppression part is variably controlled according to the kurtosis change index representing the extent of change of the kurtosis in the frequence distribution of the magnitude of each of the audio signals from the kurtosis observed when the processing of the first noise suppression part is performed to the kurtosis observed when the processing of the second noise suppression part is performed.
In a preferred embodiment of the invention, the factor adjustment part controls the subtraction factor such that the kurtosis change index approaches a predetermined value. In this embodiment, it is possible to effectively suppress noise components while suppressing musical noise caused by the processing of the first noise suppression part to a desired extent according to the predetermined value.
The noise suppression apparatus according to the invention may not only be implemented by hardware (electronic circuitry) such as a Digital Signal Processor (DSP) dedicated to noise suppression but may also be implemented through cooperation of a general arithmetic processing unit such as a Central Processing Unit (CPU) with a program. The program according to the invention is executable by the computer to perform: a noise extraction process of extracting a noise component from an audio signal of each of a plurality of channels generated by a plurality of sound collecting devices; a stationary noise estimation process of estimating stationary noise included in the noise component; a first noise suppression process of removing a spectrum of the stationary noise from a spectrum of the audio signal of each of the plurality of channels to an extent determined according to a subtraction factor; a non-stationary noise estimation process of estimating a spectrum of non-stationary noise by subtracting the spectrum of the stationary noise from the spectrum of the noise component of each of the plurality of channels; a factor setting process of generating a filtering factor for emphasizing a target sound component contained in the audio signal from the spectrum of the non-stationary noise; a second noise suppression process of performing a filtering process using the filtering factor on the audio signals of the plurality of channels after the first noise suppression process is performed; an index calculation process of calculating a kurtosis change index representing an extent of change of kurtosis in a frequence distribution of magnitude of each of the audio signals between the kurtosis observed when the first noise suppression process is performed and the kurtosis observed when the second noise suppression process is performed; and a factor adjustment process of variably controlling the subtraction factor according to the kurtosis change index.
The program achieves the same operations and advantages as those of the noise suppression apparatus according to each embodiment of the invention. The program of the invention may be provided to a user through a computer machine readable recording medium storing the program and then installed on a computer and may also be provided from a server apparatus to a user through distribution over a communication network and then installed on a computer.
A sound mixture of target sound components and noise components from surroundings arrives at the sound collecting devices 12[1] to 12[J]. The target sound components are components of a target sound (vocal or musical sound) to be received. The target sound components arrive at the sound collecting devices 12[1] to 12[J] from a direction at a known angle ξ with respect to normal to the plane PL. For example, in the case where the noise suppression apparatus 100 is installed in an electronic device (for example, a portable phone) to which voice of the user is input, voice arriving at the electronic device from a direction (ξ=0°) corresponding to the front side of the body of the electronic device corresponds to the target sound components.
On the other hand, the noise components are components other than the target sound components and may include stationary noise (i.e., constant noise) and non-stationary noise (i.e., fluctuating noise). The stationary noise is components which undergo little or no temporal change in acoustic characteristics (for example, sound pressure). For example, the stationary noise corresponds to operating noise of air-conditioning equipment or noise in crowds. On the other hand, the non-stationary noise is instantaneous components that undergo a temporal change in acoustic characteristics from moment to moment. For example, the non-stationary noise corresponds to vocal sound (speech sound) or musical sound other than the target sound components.
The noise suppression apparatus 100 generates an audio signal VOUT of the time domain by performing a process for suppressing noise components (stationary noise and non-stationary noise) on audio signals V[1] to V[J]. The audio signal VOUT generated by the noise suppression apparatus 100 is provided to a sound emitting device 14 (for example, a speaker or headphones) and the sound emitting device 14 reproduces the audio signal VOUT as physical sound. An A/D converter for converting the audio signals V[1] to V[J] into digital signals, a D/A converter for converting the audio signal VOUT into an analog signal, or the like are not illustrated for the sake of convenience.
The noise suppression apparatus 100 is implemented as an arithmetic processing device that performs a plurality of functions (such as functions of a frequency analyzer 22, a noise extractor 24, a stationary noise estimator 26, a first noise suppressor 32, a non-stationary noise estimator 34, a filtering processor 40, a waveform synthesizer 52, and a suppression controller 60) by executing a program stored in a storage device (not shown). However, it is also possible to employ a configuration in which an electronic circuit (DSP) dedicated to noise suppression implements each component of
For each of the channels of the audio signals V[1] to V[J], the frequency analyzer 22 generates a spectrum (power spectrum) X[j] (X[1] to X[J]) for each of the frames into which the audio signal V[j] is divided along the time axis. The spectrum X[j] is a series of respective magnitudes (power) of a predetermined number of frequencies discretely set along the frequency axis. Any known technology (for example, short-time Fourier transform) may be used to generate the spectrum X[j].
The noise extractor 24 extracts a noise component from the audio signal V[j] of each channel at each frame. Specifically, the noise extractor 24 generates noise component spectrum (power spectrum) N[j] (N[1] to N[J]) in each frame. In a noise section of the audio signal V[j] in which target sound components are not present, the spectrum X[j] matches the noise component spectrum N[j]. Therefore, the noise extractor 24 divides the audio signal V[j], which is a time series of the spectrum X[j], into target sound sections and noise sections along the time axis and specifies the spectrum X[j] of each frame in the noise section as a noise component spectrum N[j]. Any known voice activity detection (VAD) technology may be used to divide the audio signal V[j] into target sound sections and noise sections.
The stationary noise estimator 26 estimates stationary noise included in the noise component of each channel extracted by the noise extractor 24. The stationary noise is a temporally stationary component among the noise components as described above. Here, the stationary noise estimator 26 generates a stationary noise spectrum (power spectrum) Nw[j] (Nw[1] to Nw[J]) by averaging (specifically, time-averaging) the noise component' spectrums N[j] generated by the noise extractor 24 over a plurality of frames in the noise section. Averaging the spectrum N[j] removes non-stationary noise from the spectrum Nw[j]. The stationary noise spectrum Nw[j] is sequentially updated in each noise section. That is, a spectrum Nw[j] estimated in a noise section immediately previous to a target sound section is maintained during the noise sound section.
For each channel, the first noise suppressor 32 suppresses stationary noise included in the audio signal V[j] in the frequency domain. As shown in
That is, the subtractor SA[j] calculates the spectrum Y[j] by subtracting the product of the stationary noise spectrum Nw[j] and a subtraction factor α from the spectrum X[j] for frequencies in which the spectrum X[j] of the audio signal V[j] is equal to or higher than a threshold Th1 as shown in Equation (1a). On the other hand, the subtractor SA[j] calculates the spectrum Y[j] by multiplying the spectrum X[j] by a flooring factor β for frequencies in which the spectrum X[j] of the audio signal V[j] is less than the threshold Th1 as shown in Equation (1b). For example, the threshold Th1 is set to the product of the subtraction factor α and the spectrum Nw[j]. As can be seen from the Equations (1a) and (1b), the subtraction factor α serves as a numerical value determining the extent of suppression of noise components (stationary noise). That is, the effect of suppression of stationary noise (i.e., the performance of noise suppression) increases as the subtraction factor α increases.
The non-stationary noise estimator 34 estimates a non-stationary noise spectrum (power spectrum) Nd[j] (Nd[1] to Nd[J]) included in the audio signal V[j] of each channel in each frame. As shown in
The noise components are a mixture of stationary noise and non-stationary noise. Therefore, the subtractor SB[j] corresponding to the jth channel generates a non-stationary noise spectrum Nd[j] (Nd[1] to Nd[J]) in each frame in the noise section by subtracting the stationary noise spectrum Nw[j] from the spectrum N[j] of each frame in the noise section specified by the noise extractor 24 (through spectrum subtraction) in the frequency domain. In each frame in the target sound section, a spectrum Nd[j] of the last frame of an immediately previous noise section is continuously output from the subtractor SB[j].
Non-stationary noise in each frame in the target sound section is not directly extracted from the target sound section as described above. However, for example, when the target sound components are voice of one person, noise sections and target sound sections alternate at sufficiently small time intervals, compared to the speed of change of non-stationary noise. Accordingly, the accuracy of noise suppression is not excessively reduced even though the spectrum Nd[j] extracted from each frame in the noise section is used as the spectrum Nd[j] of the non-stationary noise in the target sound section.
The following Equations (2a) and (2b) are applied when the subtractor SB[j] calculates the spectrum Nd[j].
That is, the subtractor SB[j] calculates the spectrum Nd[j] by subtracting the product of the stationary noise spectrum Nw[j] and a factor δ from the noise component spectrum N[j] for frequencies in which the noise component spectrum N[j] is equal to or higher than a threshold Th2 (for example, the product of the spectrum Nw[j] and a factor δ) as shown in Equation (2a). On the other hand, the spectrum Nd[j] of non-stationary noise is set to a predetermined value ε for frequencies in which the noise component spectrum N[j] is less than the threshold Th2 as shown in Equation (2b). For example, the predetermined value ε is set to the product of the noise component spectrum N[j] and a predetermined factor.
Since target sound components, stationary noise, and non-stationary noise are mixed in the audio signal V[j], the spectrum Y[j] after suppression of stationary noise by the first noise suppressor 32 includes the target sound components and the non-stationary noise. For each frame, the filtering processor 40 sequentially generates a spectrum (power spectrum) Z of an audio signal VOUT in which the target sound components have been emphasized (i.e., the non-stationary noise has been suppressed) from the spectrums Y[1] to Y[J] after suppression of stationary noise. The waveform synthesizer 52 converts the spectrum Z of each frame generated by the filtering processor 40 into a time-domain signal through inverse Fourier transform and connects, on the time axis, the converted signals of adjacent frames to generate an audio signal VOUT. The phase spectrum of any of the audio signals V[1] to V[J] is used to generate the audio signal VOUT.
As shown in
The factor setter 44 generates the filtering factor W to be applied to the process of the second noise suppressor 42. Specifically, the factor setter 44 generates the filtering factor W for emphasizing the target sound components through an adaptive beamformer using the non-stationary noise spectrums Nd generated by the non-stationary noise estimator 34. For example, a minimum variance distortionless response (MVDR) is preferably employed as the adaptive beamformer, which determines the filtering factor W so as to minimize the magnitude of noise components (non-stationary noise) arriving from the direction of the angle ξ while maintaining the magnitude of target sound components arriving from the direction.
Specifically, the factor setter 44 calculates a filtering factor W(fq) of each frequency (fq) (q=1, 2, . . . ) according to the following Equation (3). The filtering factor W(fq) is generated, for example, sequentially in each frame.
The symbol RNN(fq) in Equation (3) is a covariance matrix of the respective magnitudes of the component of the frequency fq in the spectrums Nd[1] to Nd[J]. That is, the covariance matrix RNN(fq) is defined according to the following. Equation (4) using a vector vN(fq) (=[Nd[1](fq), Nd[2](fq), . . . , Nd[J](fq)]T) whose elements are the magnitudes Nd[1](fq) to Nd[j](fq) at the frequency (fq) in the spectrums Nd[1] to Nd[J], where T denotes transposition.
R
NN(fq)=E[vN(fq)vN(fq)H] (4)
The symbol H in Equations (3) and (4) denotes Hermitian transposition of the matrix. The symbol “E[ ]” in Equation (4) denotes an average (expectation) or sum over a predetermined number of frames including the current frame (for example, the current frame and a predetermined number of previous frames). The predetermined value ε of Equation (2b) is preferably set to a number other than zero so that an inverse matrix of the covariance matrix RNN(fq) used for calculation of the filtering factor W(fq) of Equation (3) exists.
The symbol dξ(fq) of Equation (3) is a steering vector (direction control vector) of J rows and 1 column representing the differences of times when sound waves (plane waves) of the frequency (fq) arrive at the sound collecting devices 12[1] to 12[J] from the direction of the angle ξ. The factor setter 44 generates the steering vector dξ(fq) of Equation (3) according to the known target sound component arrival angle ξ. When the angle ξ is unknown, the factor setter 44 generates the steering vector dξ (fq) after estimating the target sound component angle ξ. Any known technology such as a MUSIC method or an ESPRIT method may be employed to estimate the angle ξ. The invention also preferably employs a beam-forming method in which beams are formed in a plurality of directions in the directional array process (delay sum array process) and the direction of a beam in which the volume of the audio signals V[1] to V[J] is maximized is specified as the angle ξ. The spectrum Z in which the target sound components have been emphasized is sequentially generated for each frame by applying the filtering factor W(fq) generated in the above procedure to the directional array process performed by the second noise suppressor 42.
However, the spectrum subtraction process, which the first noise suppressor 32 performs to subtract the spectrum Nw[j] from the spectrum X[j] of the audio signal V[j] in the frequency domain, generates high-magnitude components (acnodes) that are distributed over the time axis and the frequency axis, causing musical noise which is artificial and harsh. Generation of musical noise through the spectrum subtraction is described in detail below.
When kurtosis is introduced as a measure of the shape of the frequence distribution of the magnitude (the extent of inclination thereof), the kurtosis KB of the frequence distribution FB of the signal magnitude after spectrum subtraction is greater than the kurtosis KA of the frequence distribution FA of the signal magnitude before spectrum subtraction (KB>KA). Taking into consideration the fact that kurtosis is a measure of Gaussianity, it is understood that non-Gaussianity of the frequence distribution increases as stationary noise which has high Gaussianity in the frequence distribution of the magnitude among the audio signal V[j] is suppressed by the first noise suppressor 32. Since musical noise has high non-Gaussianity (i.e., has high frequence in magnitudes near zero), musical noise tends to develop as the kurtosis increases through spectrum subtraction.
Accordingly, the extent of change of kurtosis of the frequence distribution of signal magnitude, which will hereinafter be referred to as a “kurtosis change index KR38 serves as a quantitative index of the extent of musical noise due to spectrum subtraction. In the following, the kurtosis change index KR is exemplified by the ratio of the kurtosis KB after spectrum subtraction to the kurtosis KA before spectrum subtraction (i.e., KR=KB/KA). As is understood from the following definitions, musical noise becomes apparent or remarkable as the kurtosis change index KR increases (i.e., as the change of the kurtosis increases).
The kurtosis change indices KR of
Since the directional array process (delay sum array process) of the filtering processor 40 of
Similar to
As is understood from
In addition, as is understood from
Taking into consideration the above tendency, the suppression controller 60 of
Kurtosis κ is a high-order statistical quantity calculated from an nth-order moment μn according to the following Equation (5). For further details, reference is made to co-pending U.S. patent application Ser. No. 12/499,734. The contents of the co-pending application are incorporated herein by reference.
The frequence distribution (probability density function) of M samples of magnitudes x1 to xM is approximated by a function Ga(x; k,θ) in the following Equation (6).
The factor C of Equation (6) is defined as follows using a gamma function Γ(k).
The frequence distribution (probability density function) P(x) in an equation defining the 2nd-order moment μ2 is replaced with the function Ga(x; k,θ) of Equation (6) to derive the following Equation (7).
Similar to the derivation of the 2nd-order moment μ2, the frequence distribution (probability density function) P(x) in an equation defining the 4th-order moment μ4 is replaced with the function Ga(x; k,θ) of Equation (6) to derive the following Equation (8).
Then, the 2nd-order moment μ2 of Equation (7) and the 4th-order moment μ4 of Equation (8) are substituted into Equation (5) to derive the following Equation (9) which defines the kurtosis κ.
The index calculator 62 of
The factor adjuster 64 of
The factor adjuster 64 then determines at step S5 whether or not the kurtosis change index KR calculated at step S4 has exceeded the target value K0. When the kurtosis change index KR is less than the target value K0, the factor adjuster 64 calculates the sum of the current subtraction factor α and a predetermined value Δα as an updated subtraction factor α at step S6. At step S2 subsequent to step S6, spectrum subtraction using the updated subtraction factor α is performed on the next frame (i.e., the m+1th frame). That is, the first noise suppressor 32 subtracts the spectrum Nw[j] of stationary noise from each spectrum X[j] of the m+1th frame according to the updated subtraction factor α.
The update of the subtraction factor α (step S6), the spectrum subtraction using the updated subtraction factor α (step S2), the directional array process after spectrum subtraction (step S3), and the calculation of the kurtosis change index KR (step S4) are sequentially repeated as described above. Accordingly, the subtraction factor α sequentially increases by the predetermined value Δα in each frame so that the kurtosis change index KR sequentially approaches the target value K0. The procedure of
In this embodiment, the factor adjuster 64 variably controls the subtraction factor α so that musical noise caused by spectrum subtraction of the first noise suppressor 32 is suppressed to the extent according to the target value K0 (i.e., so that the kurtosis change index KR approaches the target value K0). In the case where the noise components include a lot of dispersive noise (i.e., the directionality index D is small), the subtraction factor α is automatically adjusted to a high value since the kurtosis change index KR hardly increases (i.e., musical noise hardly occurs) even when the subtraction factor α has been increased as described above with reference to
On the other hand, in the case where the noise components include a lot of directional noise (i.e., the directionality index D is high), the subtraction factor α is automatically adjusted to a low value since the kurtosis change index KR easily increases (i.e., musical noise easily occurs) as the subtraction factor α increases as described above with reference to
For example, let us assume that a mobile phone including the noise suppression apparatus 100 is used in a space such as a station yard or an exhibition hall. Operating noise of air-conditioning equipment arrives at the mobile phone as dispersive noise. A radiated sound from a sound source located distant from the mobile phone (for example, walking sound or vocal sound of another user or sound from a broadcast speaker) also arrives at the mobile phone as dispersive noise through reflection from walls or a floor in the space. On the other hand, vocal sound or walking sound of another user located near the mobile phone intermittently arrives at the mobile phone as directional noise. That is, the space such as a station yard or an exhibition hall is a typical environment in which directional noise and dispersive noise alternate in a short time interval. In such an environment, the noise suppression apparatus 100 of
<Modifications>
Various modifications can be made to each of the above embodiments. The following are specific examples of such modifications. It is also possible to arbitrarily select and combine two or more of the following modifications.
(1) Modification 1
As well as the MVDR, any known adaptive beamformer may be used to calculate the filtering factor W. For example, the invention preferably uses an SNR optimization beamformer which determines the filtering factor W so as to maximize the SN ratio of the audio signal VOUT after the directional array process. Specifically, the factor setter 44 calculates an eigenvector, whose eigenvalue is maximized in an eigenvalue problem represented as the following Equation (10), as the filtering factor W(fq).
β·SNN(fq)K(fq)=SXX(fq)K(fq) (10)
The symbol SXX(fq) of Equation (10) represents a covariance matrix of the magnitude of the component of the frequency fq in target sound components and the symbol SNN(fq) of Equation (10) represents a covariance matrix of the magnitude of the component of the frequency fq in noise components. The covariance matrix SXX(fq) of the target sound components is calculated using the same method as that of Equation (4) from the magnitude of the frequency (fq) in each of the spectrums X[1] to X[J] of a target sound section detected by the noise extractor 24. For example, the covariance matrix RNN(fq) calculated using Equation (4) from the spectrums Nd[1] to Nd[J] of non-stationary noise is applied as the covariance matrix SNN(fq) of Equation (10). In the case where the SNR optimization beamformer is used, there is an advantage in that there is no need to specify the direction (i.e., the angle ξ) of the target sound components.
(2) Modification 2
Although the method in which the subtraction factor α is sequentially updated in each frame (i.e., the subtraction factor α gradually approaches an optimal value over a plurality of frames) is described as an example with reference to
Although, in the above embodiment, the subtraction factor α is controlled so that the kurtosis change index KR approaches the target value K0 while actually performing spectrum subtraction through the first noise suppressor 32 and the filtering process (directional array process) through the second noise suppressor 42, it is also possible to analytically calculate the subtraction factor α so that the kurtosis change index KR approaches the target value K0 (i.e., to calculate the subtraction factor α without actual operation of the first noise suppressor 32 or the second noise suppressor 42). Specifically, an iterative equation, which expresses a relationship between the magnitude (second-order statistical quantity) of noise components remaining in a spectrum Z calculated through spectrum subtraction using the subtraction factor α and a filtering process using the filtering factor W and a kurtosis change index KR (fourth-order statistical quantity) after the spectrum subtraction and the filtering process, is defined and a subtraction factor α which maximizes the magnitude of the noise components of the spectrum Z is calculated under a condition that the kurtosis change index KR is maintained at the target value K0, which may be considered “optimization of a second-order statistical quantity under a fourth-order statistical constraint”.
(3) Modification 3
Although the spectrum Nd[j] of non-stationary noise estimated from the noise section is employed as a spectrum Nd[j] of non-stationary noise in the target sound section in the above embodiment, the invention may also employ a configuration in which the spectrum Nd[j] of non-stationary noise in the target sound section is specified directly from each frame in the target sound section. For example, the invention employs a configuration in which the noise extractor 24 of
The noise extractor 24B of
The noise extractor 24C of
In both the configurations of
Since the non-stationary noise spectrums Nd[1] to Nd[J−1] are extracted directly from each frame of the target sound section, the configurations of
(4) Modification 4
The definition of the kurtosis change index KR is not limited to the above example (i.e., the ratio between the kurtosis KX and the kurtosis KZ). For example, the invention also preferably employs a configuration in which the difference between the kurtosis KX and the kurtosis KZ is calculated as the kurtosis change index KR (i.e., KR=KZ−KX) or a configuration in which a value of a predetermined function whose variables are the kurtosis KX and the kurtosis KZ is calculated as the kurtosis change index KR (for example, a configuration in which a logarithmic value of the ratio between the kurtosis KX and the kurtosis KZ or the difference between the kurtosis KX and the kurtosis KZ is used as the kurtosis change index KR). Although the kurtosis KX is calculated from the audio signals V[1] to V[J] in the above embodiments, the invention also employs a configuration in which the kurtosis KX is calculated from only one audio signal V[j] selected from the audio signals V[1] to V[J] of the J channels.
Although the above embodiments have been described with reference to an example in which the kurtosis change index KR increases as the kurtosis KZ increases, relative to the kurtosis KX, the invention also employs a configuration in which the kurtosis change index KR is defined such that the kurtosis change index KR decreases as the kurtosis KZ increases, relative to the kurtosis KX. As is understood from the above examples, the kurtosis change index KR serves as a measure of the amount of change of the kurtosis of the frequence distribution of the signal magnitude from the first kurtosis observed when the processing of the first noise suppressor 32 is performed to the second kurtosis observed when the processing of the second noise suppressor 42 is performed, and the method of calculation of the kurtosis change index KR (definition thereof) is arbitrary.
(5) Modification 5
Although the processes from the process of the frequency analyzer 22 to the process of the waveform synthesizer 52 are performed in the frequency domain, the processes other than the spectrum subtraction by the first noise suppressor 32 may be appropriately changed to signal processes of the time domain. For example, the invention employs a configuration in which the index calculator 62 calculates the kurtosis KX from each magnitude of the audio signal VOUT of the time domain or a configuration in which the index calculator 62 calculates the kurtosis KZ from each magnitude of the audio signal VOUT of the time domain. The processes of the noise extractor 24 or the stationary noise estimator 26 may also be performed in the time domain.
(6) Modification 6
Although the stationary noise spectrum Nw[j] is generated for each channel of the audio signal V[j] in each of the above embodiments, the invention may also employ a configuration in which a common spectrum Nw (for example, the average of the spectrums Nw[1] to Nw[J] of
Number | Date | Country | Kind |
---|---|---|---|
2009-121192 | May 2009 | JP | national |