The present invention relates to a technique of detecting a change in a signal.
In the above technical field, patent literature 1 discloses a technique of evaluating the continuity of a phase component in the time direction and smoothing an amplitude component for each frequency (paragraphs 0135 to 0138). Patent literature 2 describes detecting an abrupt frequency change by measuring a fluctuation in a phase in the time direction. Patent literature 3 describes, in paragraph 0024, that “a phase change in the complex vector of I and Q signals on a complex plane caused by superimposition of impulsive noise is always monitored, thereby reliably detecting the impulsive noise under a strong field environment”. The phase change is a change in the time direction. Patent literature 4 describes, in paragraph 0031, that “a phase linearizer 25 corrects a hop in a phase signal θ input from a polar coordinate converter 24 by linearization and outputs a resultant phase signal θ′ to a phase detector 26”. In addition, patent literature 4 has a description of a phase gradient detector in paragraph 0051, and also describes, in paragraph 0040, that “
However, the techniques of patent literatures 1 and 4 out of the above-described related arts do not detect an abrupt change in an input signal. In addition, the technique of patent literature 2 detects an abrupt change in a “frequency”, and the technique of patent literature 3 detects impulsive noise using a time-rate change in the phase of an AM signal. Patent literature 5 discloses a technique of detecting an impulsive sound using only an amplitude, which is poor in robustness. That is, the techniques described in these literatures cannot effectively detect an abrupt change in a signal.
The present invention enables to provide a technique of solving the above-described problems.
One aspect of the present invention provides a signal processing apparatus comprising:
a converter that converts an input signal into a phase component and an amplitude component in a frequency domain;
a linearity calculator that calculates a linearity of the phase component in the frequency domain; and
a determiner that determines presence of an abrupt change in the input signal based on the linearity calculated by the linearity calculator.
Another aspect of the present invention provides a signal processing method comprising:
converting an input signal into a phase component and an amplitude component in a frequency domain;
calculating a linearity of the phase component in the frequency domain; and
determining presence of an abrupt change in the input signal based on the calculated linearity.
Still other aspect of the present invention provides a signal processing program for causing a computer to execute a method comprising:
converting an input signal into a phase component and an amplitude component in a frequency domain;
calculating a linearity of the phase component in the frequency domain; and
determining presence of an abrupt change in the input signal based on the calculated linearity.
According to the present invention, it is possible to effectively detect an abrupt change in a signal.
Preferred embodiments of the present invention will now be described in detail with reference to the drawings. It should be noted that the relative arrangement of the components, the numerical expressions and numerical values set forth in these embodiments do not limit the scope of the present invention unless it is specifically stated otherwise. Note that “speech signal” in the following explanation indicates a direct electrical change that occurs in accordance with the influence of speech or another sound. The speech signal transmits speech or another sound and is not limited to speech.
A signal processing apparatus 100 according to the first embodiment of the present invention will be described with reference to
As shown in
With the above-described arrangement, an abrupt change in the input signal can accurately be detected based on the degree of the linear change in the phase component in the frequency domain.
A noise suppression apparatus according to the second embodiment of the present invention will be described with reference to
The noise suppressor 205 estimates noise using the noisy signal amplitude spectrum 230 supplied from the converter 201, thereby generating an estimated noise spectrum. In addition, the noise suppressor 205 suppresses the noise using the generated estimated noise spectrum and the noisy signal amplitude spectrum 230 supplied from the converter 201, and transmits an enhanced signal amplitude spectrum as a noise suppression result to an amplitude controller 203. The noise suppressor 205 also receives a determination result from an abrupt change determiner 209, and executes noise suppression in accordance with the presence/absence of an abrupt change in the signal.
The phase controller 202 rotates (shifts) the noisy signal phase spectrum 220 supplied from the converter 201, and supplies it to an inverter 204 as an enhanced signal phase spectrum 240. The phase controller 202 also transmits the phase rotation amount (shift amount) to the amplitude controller 203. The amplitude controller 203 receives the phase rotation amount (shift amount) from the phase controller 202, calculates an amplitude correction amount, corrects the enhanced signal amplitude spectrum in each frequency using the amplitude correction amount, and supplies a corrected amplitude spectrum 250 to the inverter 204. The inverter 204 performs inversion by compositing the enhanced signal phase spectrum 240 supplied from the phase controller 202 and the corrected amplitude spectrum supplied from the amplitude controller 203, and supplies the resultant signal to an output terminal 207 as an enhanced signal.
The linearity calculator 208 calculates the linearity in the frequency domain using the phase spectrum 220 supplied from the converter 201. The abrupt change determiner 209 determines the presence/absence of an abrupt signal change based on the linearity calculated by the linearity calculator 208.
<<Arrangement of Converter>>
y
n(t)=w(t))yn(t) (1)
Two successive frames may partially be overlaid (overlapped) and windowed. Assume that the overlap length is 50% the frame length. For t=0, 1, . . . , K/2−1, the windowing unit 302 outputs the left-hand sides of
A symmetric window function is used for a real signal. The window function is designed to make the input signal and the output signal match with each other except a calculation error when the output of the converter 201 is directly supplied to the inverter 204. This means w(t)+w(t+K/2)=1.
The description will be continued below assuming an example in which windowing is performed for two successive frames that overlap 50%. As w(t), the windowing unit can use, for example, a Hanning window given by
Various window functions such as a Hamming window and a triangle window are also known. The windowed output is supplied to the Fourier transformer 303 and transformed into a noisy signal spectrum Yn(k). The noisy signal spectrum Yn(k) is separated into the phase and the amplitude. A noisy signal phase spectrum arg Yn(k) is supplied to the phase controller 202 and the linearity calculator 208, whereas a noisy signal amplitude spectrum |Yn(k)| is supplied to the noise suppressor 205. As already described, a power spectrum may be used in place of the amplitude spectrum.
<<Arrangement of Inverter>>
n(k)=|
Inverse Fourier transform is performed for the obtained enhanced signal. The signal is supplied to the windowing unit 402 as a series of time domain sample values xn(t) (t=0, 1, . . . , K−1) in which one frame includes K samples, and multiplied by the window function w(t). A signal obtained by windowing an nth frame input signal xn(t) (t=0, 1, . . . , K/2−1) by w(t) is given by the left-hand side of
n(t)=w(t)xn(t) (5)
Two successive frames may partially be overlaid (overlapped) and windowed. Assume that the overlap length is 50% the frame length. For t=0, 1, . . . , K/2−1, the windowing unit 402 outputs the left-hand sides of
and transmits them to the frame composition unit 403.
The frame composition unit 403 extracts the outputs of two adjacent frames from the windowing unit 402 on the basis of K/2 samples, overlays them, and obtains an output signal (left-hand sides of equation (7)) for t=0, 1, . . . , K−1 by
{circumflex over (x)}
n(t)=
An obtained enhanced signal 260 is transmitted from the frame composition unit 403 to the output terminal 207.
Note that the conversion in the converter and the inverter in
The noise suppressor 205 may perform actual suppression after a plurality of frequency components obtained by the converter 201 are integrated. At this time, high sound quality can be achieved by integrating more frequency components from the low frequency range where the discrimination capability of hearing characteristics is high to the high frequency range with a poorer capability. When noise suppression is executed after integrating a plurality of frequency components, the number of frequency components to which noise suppression is applied decreases, and the whole calculation amount can be decreased.
<<Arrangement of Noise Suppressor>>
The noise suppressor 205 estimates noise using the noisy signal amplitude spectrum supplied from the converter 201 and generate an estimated noise spectrum. The noise suppressor 205 then obtains a suppression coefficient using the noisy signal amplitude spectrum from the converter 201 and the generated estimated noise spectrum, multiplies the noisy signal amplitude spectrum by the suppression coefficient, and supplies the resultant spectrum to the amplitude controller 203 as an enhanced signal amplitude spectrum. Upon receiving an abrupt change determination result (information representing whether an abrupt change in the signal exists) from the abrupt change determiner 209 and determining that an abrupt change has occurred, the noise suppressor 205 supplies a smaller one of the noisy signal amplitude spectrum and the estimated noise spectrum to the amplitude controller 203 as an enhanced signal amplitude spectrum.
To estimate noise, various estimation methods are used, as described in non-patent literature 2.
For example, non-patent literature 1 discloses a method of obtaining, as an estimated noise spectrum, the average value of noisy signal amplitude spectra of frames in which no target sound is generated. In this method, it is necessary to detect generation of the target sound. A section where the target sound is generated can be determined by the power of the enhanced signal.
As an ideal operation state, the enhanced signal is the target sound other than noise. In addition, the level of the target sound or noise does not largely change between adjacent frames. For these reasons, the enhanced signal level of an immediately preceding frame is used as an index to determine a noise section. If the enhanced signal level of the immediately preceding frame is equal to or smaller than a predetermined value, the current frame is determined as a noise section. A noise spectrum can be estimated by averaging the noisy signal amplitude spectra of frames determined as a noise section.
Non-patent literature 1 also discloses a method of obtaining, as an estimated noise spectrum, the average value of noisy signal amplitude spectra in the early stage in which supply of them has started. In this case, it is necessary to meet a condition that the target sound is not included immediately after the start of estimation. If the condition is met, the noisy signal amplitude spectrum in the early stage of estimation can be obtained as the estimated noise spectrum.
Non-patent literature 2 discloses a method of obtaining an estimated noise spectrum from the minimum value of the statistical noisy signal amplitude spectrum. In this method, the minimum value of the noisy signal amplitude spectrum within a predetermined time is statistically held, and a noise spectrum is estimated from the minimum value. The minimum value of the noisy signal amplitude spectrum is similar to the shape of a noise spectrum and can therefore be used as the estimated value of the noise spectrum shape. However, the minimum value is smaller than the original noise level. Hence, a spectrum obtained by appropriately amplifying the minimum value is used as an estimated noise spectrum.
The noise suppressor 205 can perform various kinds of suppression. Typical examples are the SS (Spectrum Subtraction) method and an MMSE STSA (Minimum Mean-Square Error Short-Time Spectral Amplitude Estimator) method. In the SS method, the estimated noise spectrum is subtracted from the noisy signal amplitude spectrum supplied from the converter 201. In the MMSE STSA method, a suppression coefficient is calculated using the noisy signal amplitude spectrum supplied from the converter 201 and the generated estimated noise spectrum, and the noisy signal amplitude spectrum is multiplied by the suppression coefficient. The suppression coefficient is decided so as to minimize the mean square power of the enhanced signal.
<<Arrangement of Phase Controller and Amplitude Controller>>
The rotation amount generator 502 generates the rotation amount of the noisy signal phase spectrum for a frequency component determined to “have an abrupt change in the signal” by the abrupt change determiner 209, and supplies the rotation amount to the phase rotator 501 and the correction amount calculator 503. Upon receiving the rotation amount supplied from the rotation amount generator 502, the phase rotator 501 rotates (shifts) the noisy signal phase spectrum 220 supplied from the converter 201 by the supplied rotation amount, and supplies the rotated spectrum to the inverter 204 as the enhanced signal phase spectrum 240.
The correction amount calculator 503 decides the correction coefficient of the amplitude based on the rotation amount supplied from the rotation amount generator 502, and supplies the correction coefficient to the amplitude corrector 504.
The rotation amount generator 502 generates the rotation amount by, for example, a random number. When the noisy signal phase spectrum is rotated for each frequency by a random number, the shape of the noisy signal phase spectrum 220 changes. With the change in the shape, the feature of noise such as an impulsive sound can be weakened.
Examples of the random number are a uniform random number whose occurrence probability is uniform and a normal random number whose occurrence probability exhibits a normal distribution. A rotation amount generation method using a uniform random number will be described first. A uniform random number can be generated by a linear congruential method or the like. For example, uniform random numbers generated by the linear congruential method are uniformly distributed within the range of 0 to (2̂M)−1, where M is an arbitrary integer, and ̂ represents a power. Phase rotation amounts φ need to be distributed within the range of 0 to 2π. To do this, the generated uniform random numbers are converted. The conversion is performed by
where R is the uniform random number, and Rmax is the maximum value capable of being generated by the uniform random number. When a uniform random number is generated by the above-described linear congruential method, Rmax=(2̂M)−1.
To simplify the calculation, the value R may directly be decided as the rotation amount. As the rotation amount, 2π represents just one revolution. A case where the phase is rotated by 2π is equivalent to a case where the phase is not rotated. Hence, a rotation amount 2π+α is equivalent to a rotation amount α. A case where a uniform random number is generated by the linear congruential method has been explained here. Even in a case where a uniform random number is generated by another method, the rotation amount φ is obtained by equation (8). When and how many times random number generation is to be performed may be decided in accordance with the determination result of the abrupt change determiner 209.
The phase rotator 501 receives the rotation amount from the rotation amount generator 502 and rotates the noisy signal phase spectrum. If the noisy signal phase spectrum is expressed as an angle, it can be rotated by adding the value of the rotation amount φ to the angle. If the noisy signal phase spectrum is expressed as the normal vector of a complex number, it can be rotated by obtaining the normal vector of the rotation amount φ and multiplying the noisy signal phase spectrum by the normal vector.
The normal vector of the rotation amount φ can be obtained by
Φ=cos(φ)+j sin(φ) (9)
In equation (9), Φ is the rotation vector, and j represents sqrt(−1). Note that sqrt is the square root.
A correction coefficient calculation method by the correction amount calculator 503 will be described. First, a decrease in the output level caused by phase rotation will be described first with reference to
A signal in a case where phase rotation is not performed will be described with reference to
The signal divided into frames is windowed by the windowing unit 302. The third signal from above, which is separated by a dotted line, is the signal after windowing. In
Next, the Fourier transformer 303 transforms the signal into a signal in a frequency domain. The signal in the frequency domain is not illustrated in
An enhanced signal output from the inverse Fourier transformer 401 of the inverter 204 undergoes windowing again.
A signal in a case where phase rotation is performed will be described with reference to
Lowering of the output signal level caused by phase rotation can also be explained in vector composition in a frequency domain by replacing addition in the time domain with addition in the frequency domain.
m=n+L/2 (10)
holds.
In addition, the relationship between x1 and x2 is represented by
x
2
[m]=x
1
[n+L/2] (11).
The formula of transform from a time domain signal to a frequency domain signal and that of inverse transform will be described. By Fourier transform of a time domain signal x[n], a frequency domain signal X[k] is expressed as
where k is the discrete frequency, and L is the frame length.
When the frequency domain signal X[n] is returned to the time domain signal x[n] by inverse transform, the time domain signal x[n] is expressed as
When the time domain signals x1[n] and x2[m] are transformed into frequency domain signals X1[k] and X2[k] based on this equation, they are expressed as
When the frequency domain signals X1[k] and X2[k] are returned to the time domain signals x1[n] and x2[m] by inverse transform, respectively, they are expressed, based on equation (13), as
The inverter transforms each frequency domain signal into a time domain signal by Fourier transform. After that, the frame composition unit adds the enhanced speech of the preceding frame and that of the current frame which overlap. For example, when the overlap ratio is 50% as in the illustrated example, the adjacent frames are added in the section of the discrete time m=L/2 to L−1. Consider the addition section m=L/2 to L−1.
When equations (16) and (17) are substituted into time domain addition, the addition is expressed as
When equations (14) and (15) are further substituted into the frequency domain signals X1[k] and X2[k] in equation (18), the addition is expressed as
When equations (19) is expanded, the addition is expressed as
Consider the sum operation included in each term of equation (20). When an arbitrary integer g is introduced,
holds.
The inverse Fourier transformation of a delta function δ[g] is given by
The delta function δ[g] is represented by
Based on equation (22), expression (21) can be rewritten as
From the relation of equation (24), equation (20) is represented by
Hence, equation (20) changes to
Consider a case where phase rotation is performed for the frequency domain signal X2[k]. At this time, a time domain signal as shown in
When the phase spectrum of X2[k] is rotated by φ[k], inverse transform is represented by
When this is substituted into equation (18),
holds.
When this is expanded,
holds.
Assume that the overlap ratio is 50%, and consider n=L/2 to L−1 of the overlap section. In the overlap section, equation (11) can be expanded to
Here,
1+efφ[k] (31)
parenthesized in each term represents vector composition, and can be drawn as in
The absolute value of equation (31) is obtained as
Hence, the condition to maximize the absolute value of equation (31) is φ[k]=0, and the value is 2. That is, when phase rotation is performed, the magnitude of the output signal becomes small, as is apparent. The correction amount calculator 503 decides the amplitude correction amount of the enhanced signal amplitude spectrum so as to correct the decrease amount of the output signal level.
A method of calculating a correction amount will be described here in detail assuming that the phase rotation amount is decided by a uniform random number. To simplify the problem, focus is placed on the variation in the magnitude caused by phase rotation, and each frequency component is assumed to have been normalized to a unit vector.
A case where phase rotation is not performed will be considered first. The composite vector in a case where the phase does not change between successive frames is represented by S shown in
On the other hand, when phase rotation is performed by a uniform random number, the phase differences φ between successive frames are uniformly distributed within the range of −π to +π. The composite vector in a case where the phase changes between successive frames is represented by a vector S′ shown in
An expected value E(|S′|̂2) is obtained as
E(|S′|2)=E(2+2 cos φ)=E(2)+E(2 cos φ) (35)
Since the differences φ are uniformly distributed from −π to +π, we obtain
E(2 cos(φ))=0 (36)
For this reason, the expected value E(|S′|̂2) is given by
E(|S′|2)=2 (37)
Based on equation (33), the expected value E(|S′|̂2) in a case where phase rotation is not performed is given by
When the ratio of equation (37) to equation (38) is calculated,
holds.
That is, when the phase is rotated by a uniform random number, the power average value of the output signal decreases to ½ as compared to the input. The amplitude corrector 504 performs correction of the amplitude value. Hence, the correction amount calculator 503 obtains sqrt(2) as the correction coefficient and transmits it to the amplitude corrector 504.
Rotation amount generation by a uniform random number has been exampled above. The correction coefficient can also uniquely be obtained using a normal random number if its variance and average value are determined. Correction coefficient derivation using a normal random number will be described below.
When a normal random number is used, the occurrence probability of φ is decided by a normal distribution. Hence, to obtain a power expected value in a case where phase rotation is executed using a normal random number, weighting needs to be performed based on the occurrence probability of φ.
More specifically, a weight function f(φ) based on the occurrence probability of φ is introduced. By the weight function f(φ), cos(φ) is weighted. The weighted value is further normalized by the integrated value of the weight function f(φ), thereby obtaining the power expected value.
By introducing the weight function f(φ) and its integrated vale into equation (35) representing the output power expected value for a uniform random number, an output power expected value E(S″̂2) in a case where phase rotation is performed using an normal random number can be expressed as
Since the weight function f(φ) can be expressed as a normal distribution,
holds, where σ is the variance, and μ is the average value.
For example, in a standard normal distribution in which the average value μ=0, and the variance σ=1,
holds. When this is substituted into equation (40), we obtain
By numerical calculation of the second term of the right-hand side of equation (43),
E(|S″|2)=2{1+0.609}=3.218 (44)
holds. Hence, the ratio to E(|Ŝ2|) in a case where phase rotation is not performed is given by
E(|S″|2)/E(|S|2)=3.218/4=0.805 (45)
In a case where the phase is rotated by a normal random number of a standard normal distribution, the correction amount calculator 503 obtains sqrt(1/0.805) as the correction coefficient and transmits it to the amplitude corrector 504. Amplitude correction is performed for a frequency that has undergone the phase rotation. Hence, the correction coefficient of a frequency that has not undergone the phase rotation is set to 1.0. Only the correction coefficient of the frequency that has undergone the phase rotation uses the value derived above.
As described above, in the amplitude controller 203, the amplitude correction coefficient is calculated using the phase rotation amount transmitted from the phase controller 202. The enhanced signal amplitude spectrum supplied from the noise suppressor 205 is multiplied by the correction coefficient and supplied to the inverter 204. This can eliminate lowering of the output level when the enhanced signal phase spectrum is obtained by rotating the noisy signal phase spectrum.
<<Arrangement of Linearity Calculator and Abrupt Change Determiner>>
The flatness measure calculator 1202 checks the flatness measure (variation) of the phase change amount Δp(k)=p(k)−p(k−1) obtained by the change amount calculator 1201 along the frequency axis. A difference Δ2p(k)=Δp(k)−Δp(k−1) of the phase change amount of the adjacent frequency is obtained as a flatness measure 1220. If the phase change amount is flat, the difference is 0. The differential value of the phase may be obtained as the phase change amount, and the differential value of the phase change amount may be obtained as the flatness measure 1220. In this case, if the quadratic differential value of the phase is close to 0 (equal to or smaller than a predetermined value), the phase change amount can be determined as flat.
The change amount calculator 1201 calculates the change amount using the phase difference between adjacent frequencies. However, the present invention is not limited to this. The linearity may be determined by differentiation of the frequency of the phase. The smaller the variation between a plurality of differential results of a plurality of frequencies is, the higher the linearity is. A local linearity can be determined using a local differential result. The flatness measure can be used as the index of the variation.
If the absolute value of the calculated flatness measure is equal to or smaller than a predetermined value, the abrupt change determiner 209 determines that the frequency corresponding to the flatness measure includes an impulsive sound. The abrupt change determiner 209 also compares the number of frequencies determined to include an impulsive sound with a predetermined threshold, and outputs impulsive sound present (1) or impulsive sound absent (0) as a determination result 1230 of the current frame.
The phase is known to linearly change at a portion where the signal abruptly changes. It is therefore possible to determine the presence of an abrupt change in the signal by obtaining the linearity of the phase and determining the flatness measure in the above-described way. In a frame in which an abrupt signal change such as an impulsive sound exists, the abrupt change can be removed by rotating the phase spectrum. Hence, a high-quality enhanced signal can be obtained.
The noise suppression apparatus 200 includes a processor 1410, a ROM (Read Only Memory) 1420, a RAM (Random Access Memory) 1440, a storage 1450, an input/output interface 1460, an operation unit 1461, an input unit 1462, and an output unit 1463. The noise suppression apparatus 200 may include a camera 1464. The processor 1410 is a central processing unit and executes various programs, thereby controlling the overall noise suppression apparatus 200.
The ROM 1420 stores various parameters as well as a boot program to be executed first by the processor 1410. The RAM 1440 includes an area to store an input signal 210, the phase component 220, the amplitude component 230, the enhanced signal 260, the phase change amount 1210, the flatness measure 1220, the determination result 1230, and the like as well as a program load area (not shown). The storage 1450 stores a noise suppression program 1451. The noise suppression program 1451 includes a conversion module, a phase control module, an amplitude control module, an inversion module, a noise suppression module, a linearity calculation module, and an abrupt change determination module. When the processor 1410 executes the modules included in the noise suppression program 1451, the functions of the converter 201, the phase controller 202, the amplitude controller 203, the inverter 204, the noise suppressor 205, the linearity calculator 208, and the abrupt change determiner 209 shown in
Enhanced speech that is the output of the noise suppression program 1451 executed by the processor 1410 is output from the output unit 1463 via the input/output interface 1460. This can suppress, for example, the operation sound of the operation unit 1461 input from the input unit 1462. Also possible is an application method of, for example, detecting impulsive sound inclusion in the input signal input from the input unit 1462 and starting shooting by the camera 1464.
With the above-described processing, an impulsive sound can more correctly be detected, and the impulsive sound can appropriately be removed as needed.
While the present invention has been described with reference to exemplary embodiments, it is to be understood that the invention is not limited to the disclosed exemplary embodiments. The arrangement and details of the present invention can variously be modified without departing from the spirit and scope thereof, as will be understood by those skilled in the art. The present invention also incorporates a system or apparatus that combines different features included in the embodiments in any form.
The present invention is applicable to a system including a plurality of devices or a single apparatus. The present invention is also applicable even when a signal processing program for implementing the functions of the embodiments is supplied to the system or apparatus directly or from a remote site. Hence, the present invention also incorporates the program installed in a computer to implement the functions of the present invention by the computer, a medium storing the program, and a WWW (World Wide Web) server that causes a user to download the program.
Note that in the above embodiments, the characteristic arrangements of a signal processing apparatus, a signal processing method, and a signal processing program to be described below are shown (the arrangements are not limited to those to be described below).
(Supplementary Note 1)
There is provided a signal processing apparatus comprising:
a converter that converts an input signal into a phase component and an amplitude component in a frequency domain;
a linearity calculator that calculates a linearity of the phase component in the frequency domain; and
a determiner that determines presence of an abrupt change in the input signal based on the linearity calculated by the linearity calculator.
(Supplementary Note 2)
There is provided the signal processing apparatus according to supplementary note 1, wherein the linearity calculator calculates the linearity based on whether a change in the phase component in the frequency domain falls within a predetermined range.
(Supplementary Note 3)
There is provided the signal processing apparatus according to supplementary note 1 or 2, wherein the linearity calculator calculates a flatness measure of a differential value of the phase component in the frequency domain, and
if the flatness measure of the differential value is high, the determiner determines that the abrupt change in the input signal exists.
(Supplementary Note 4)
There is provided the signal processing apparatus according to supplementary note 1, 2, or 3, wherein the linearity calculator calculates, for each frequency, a phase component difference as a difference between phase components at a frequency and an adjacent frequency, and
calculates the linearity based on a difference between the phase component differences.
(Supplementary Note 5)
There is provided the signal processing apparatus according to supplementary note 4, wherein the linearity calculator compares the difference between the phase component differences with a first threshold for each frequency, and
counts, for each frame, the number of frequency components with a difference not greater than the first threshold as the linearity, and
if the linearity is not less than a second threshold, the determiner determines that the abrupt change exists in the input signal.
(Supplementary Note 6)
There is provided a signal processing method comprising:
converting an input signal into a phase component and an amplitude component in a frequency domain;
calculating a linearity of the phase component in the frequency domain; and
determining presence of an abrupt change in the input signal based on the calculated linearity.
(Supplementary Note 7)
There is provided the signal processing method according to supplementary note 6, wherein the linearity is calculated based on whether a change in the phase component in the frequency domain falls within a predetermined range.
(Supplementary Note 8)
There is provided the signal processing method according to supplementary note 6 or 7, wherein the linearity is calculated by calculating a flatness measure of a differential value of the phase component in the frequency domain, and
if the flatness measure of the differential value is high, the abrupt change in the input signal is determined to exist.
(Supplementary Note 9)
There is provided the signal processing method according to supplementary note 6, 7, or 8, wherein the linearity is calculated based on a difference between phase component differences calculated for each frequency as a difference between the phase component and a phase component of an adjacent frequency.
(Supplementary Note 10)
There is provided the signal processing method according to supplementary note 9, wherein the linearity is calculated as a count value obtained by comparing the difference between the phase component differences with a first threshold for each frequency and counting, for each frame, the number of frequency components for which the difference is determined to be not more than the first threshold, and
the abrupt change in the input signal is determined to exist if the count value is not less than a second threshold.
(Supplementary Note 11)
There is provided a signal processing program for causing a computer to execute steps comprising:
converting an input signal into a phase component and an amplitude component in a frequency domain;
calculating a linearity of the phase component in the frequency domain; and
determining presence of an abrupt change in the input signal based on the calculated linearity.
(Supplementary Note 12)
There is provided the signal processing program according to supplementary note 11, wherein the linearity is calculated based on whether a change in the phase component in the frequency domain falls within a predetermined range.
(Supplementary Note 13)
There is provided the signal processing program according to supplementary note 11 or 12, wherein the linearity is calculated by calculating a flatness measure of a differential value of the phase component in the frequency domain, and
if the flatness measure of the differential value is high, the abrupt change in the input signal is determined to exist.
(Supplementary Note 14)
There is provided the signal processing program according to supplementary note 11, 12, or 13, wherein the linearity is calculated based on a difference between phase component differences calculated for each frequency as a difference between the phase component and a phase component of an adjacent frequency.
(Supplementary Note 15)
There is provided the signal processing program according to supplementary note 14, wherein the linearity is calculated as a count value obtained by comparing the difference between the phase component differences with a first threshold for each frequency and counting, for each frame, the number of frequency components for which the difference is determined to be not more than the first threshold, and
the abrupt change in the input signal is determined to exist if the count value is not less than a second threshold.
This application claims the benefit of Japanese Patent Application No. 2013-042447, filed on Mar. 5, 2013, which is hereby incorporated by reference in its entirety.
Number | Date | Country | Kind |
---|---|---|---|
2013-042447 | Mar 2013 | JP | national |
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/JP2014/054633 | 2/26/2014 | WO | 00 |