The present invention relates to a howling detection device and method. More particularly, the present invention relates to a howling detection device and method capable of detecting a risk of a howling occurrence, in a sound-intensifying system for mixing and intensifying a plurality of sound signals, for each of the plurality of sound signals.
Conventionally, in a sound-intensifying system for intensifying a sound signal collected by a microphone, a howling suppression device, for detecting an occurrence of howling and suppressing the howling, has been developed. As a conventional howling suppression device, a howling suppression device using an application filter or a notch filter is well-known (see patent document 1 and patent document 2, for example).
Hereinafter, with reference to
In
The howling suppressing section 94 performs a signal processing on the sound signal mixed by the sound mixing section 93 so as to suppress howling. Thereafter, the sound signal on which the signal processing has been performed is amplified as necessary so as to be outputted by the speaker 95. Note that the howling suppressing section 94 corresponds to a howling suppression device for suppressing the howling. As described above, in this example, the sound-intensifying system adopts howling suppression methods disclosed in patent document 1 and patent document 2. Thus, an application filter or a notch filter is used as the howling suppressing section 94.
Alternately, the notch filter may be used as the howling suppressing section 94.
With reference to
[Formula 1]
X(ω)=S(ω)+R(ω)*Y(ω) (1)
Note that R(ω) may include, in addition to the spatial transfer characteristic, a characteristic of the microphone 91, a characteristic of the speaker 95, an amplification characteristic of a sound signal amplified as necessary between an output of the howling suppressing section 94 and the speaker 95, and the like. In the howling suppressing section 94, a process, in which a sound signal M(ω)*X(ω) adjusted by the sound characteristic adjusting section 92 subtracts the transfer characteristic Hhat(ω) multiplied by the sound signal Y(ω) outputted from the howling suppressing section 94, is performed, thereby obtaining formula (2).
[Formula 2]
Y(ω)=M(ω)*X(ω)−Hhat(ω)*Y(ω) (2)
When formula (1) and formula (2) are deformed, formula (3) is obtained.
In formula (3), a second term thereof is pertinent to the howling occurrence. Therefore, the ideal transfer characteristic Hhat(ω) is a transfer characteristic which satisfies formula (4).
[Formula 4]
Hhat(ω)≈M(ω)*R(ω) (4)
When the transfer characteristic Hhat(ω) satisfies formula (4), the second term of formula (3) will be substantially zero. Thus, the howling suppressing section 94 can suppress the howling occurrence.
Next, with reference to
In formula (5), a second term thereof is pertinent to the howling occurrence. Therefore, the ideal transfer characteristic Hhat(ω) to be estimated is a transfer characteristic which satisfies formula (6).
As shown in formula (6), a spatial transfer characteristic R(ω) of each of the plurality of sound signals is a unique value. Also, the spatial transfer characteristic R(ω) is a value which changes depending on a position of a microphone. That is, in order to appropriately estimate the ideal transfer characteristic, the spatial transfer characteristic R(ω) of each of the plurality of sound signals needs to be taken into consideration. In the conventional art, however, the transfer characteristic is estimated based on an output signal outputted from the howling suppressing section 94. That is, the output signal outputted from the howling suppressing section 94 is a signal generated based on the plurality of sound signals mixed with each other, and not a signal generated by taking account of the transfer characteristic R(ω) of each of the plurality of microphones. Therefore, in the conventional art, there has been a problem in that the transfer characteristic cannot be estimated at a speed corresponding to a change in the spatial transfer characteristic R(ω), whereby the howling occurrence cannot be appropriately suppressed.
Furthermore, as shown in formula (6), the ideal transfer characteristic Hhat(t) to be estimated is a value determined based on M(ω) and R(ω) of each of the plurality of microphones. That is, when M(ω) changes, the ideal transfer characteristic Hhat(ω) accordingly changes. In the application filter 941, the transfer characteristic is estimated, while being converged, based on the output signal outputted from the howling suppressing section 94. Therefore, if a rapid change occurs in M(ω), and then a rapid change accordingly occurs in the ideal transfer characteristic Hhat(ω), the transfer characteristic cannot be estimated at a speed corresponding to the changes, whereby it has been difficult to appropriately suppress the howling occurrence.
In the case where the plurality of microphones are provided, as described above, values, M(ω) and R(ω) are more easily changed than in the case where one microphone is provided. Therefore, the specific frequency f at which howling occurs is also to be more easily changed. Thus, in the case where the notch filter is used as the howling suppressing section 94, a frequency at which the notch filter attenuates cannot be set in accordance with the specific frequency f having been changed, whereby it has been difficult to appropriately suppress the howling occurrence.
As described above, in a sound-intensifying system for mixing and intensifying a plurality of sound signals, there has been a problem in that a howling occurrence cannot be appropriately suppressed unless a risk (changes in M(ω), R(ω), etc., for example) of a howling occurrence for each of the plurality of sound signals is taken into consideration.
Furthermore, when a user is warned of the howling occurrence in the conventional art, well-known is a method in which a power difference, between a frequency band and its adjacent frequency band, of a power spectrum of an inputted sound signal is always monitored, thereby detecting the howling occurrence so as to warn the user thereof. However, in a sound-intensifying system for mixing and intensifying a plurality of sound signals, the howling occurrence is detected based on a power spectrum of a mixed sound signal. Therefore, in the conventional art, among the plurality of sound signals inputted, any of the sound signals which has caused howling or which has a risk of a howling occurrence cannot be specified so as to issue a warning.
Therefore, an object of the present invention is to detect a risk of a howling occurrence, in a sound-intensifying system for mixing and intensifying a plurality of sound signals, for each of the plurality of sound signals. Furthermore, another object of the present invention is to estimate an optimal transfer characteristic based on information regarding the detected risk, thereby performing a robust suppression of the howling occurrence in accordance with the transfer characteristic rapidly changed by the sound characteristic adjusting section. Still furthermore, another object of the present invention is to provide a method for specifying, from among the plurality of sound signals inputted, any of the sound signals which has caused howling or which has the risk of the howling occurrence, so as to issue a warning.
A first aspect of the present invention is directed to a howling detection device for detecting a dominance ratio, which indicates a risk of howling to be occurred when a mixed signal obtained by a sound mixing section for mixing a plurality of sound signals respectively collected by a plurality of microphones is outputted by a speaker, for each of the sound signals, the howling detection device comprises: a level detecting section for respectively detecting levels of the plurality of sound signals; a word ending detecting section for comparing, in a same time domain, the mixed signal with a signal regarding a sound to be outputted by the speaker as a noise reference signal, and detecting a time period, as a word ending section, during which the mixed signal is inputted after the noise reference signal falls; and a dominance ratio calculating section for extracting only a level of the word ending section from each of the levels of the plurality of sound signals, the levels detected by the level detecting section, and calculating, as a dominance ratio, a ratio of the extracted level of each of the sound signals to a sum of extracted levels of the plurality of sound signals.
In a second aspect of the present invention based on the first aspect, the howling detection device further comprises a howling suppressing section for subtracting from the mixed signal a signal having a same component as a signal included in the word ending section, based on a transfer characteristic calculated by using the dominance ratio, and outputting the obtained signal to the speaker.
In a third aspect of the present invention based on the second aspect, the howling suppressing section sets a function used for estimating the mixed signal excluding the signal having the same component as the signal included in the word ending section, updates the sum of the levels of the plurality of sound signals in accordance with the dominance ratio, and calculates the transfer characteristic by multiplying the function by a change rate of an updated sum of the levels of the plurality of sound signals to the sum of the levels of the plurality of sound signals.
In a fourth aspect of the present invention based on the third aspect, the howling suppressing section updates the sum of the levels of the plurality of sound signals by updating at least one of the levels of the sound signals, which indicates a relatively high dominance ratio.
In a fifth aspect of the present invention based on the third aspect, the howling suppressing section updates the sum of the levels of the plurality of sound signals by updating only one of the levels of the sound signals, which indicates the highest dominance ratio.
In a sixth aspect of the present invention based on the first aspect, the howling detection device further comprises a howling warning section for specifying at least one of the sound signals, which indicates a relatively high dominance ratio calculated by the dominance ratio calculating section, and notifying a user of the at least one of the sound signals.
In a seventh aspect of the present invention based on the first aspect, a howling warning section for specifying one of the sound signals, which indicates the highest dominant ratio calculated by the dominance ratio calculating section, and notifying a user of the one of the sound signals.
In an eighth aspect of the present invention based on the first aspect, the level detecting section detects the levels, of the plurality of sound signals, each of which is represented using a power spectrum.
A ninth aspect of the present invention is directed to a howling detection device for detecting a dominance ratio, which indicates a risk of howling to be occurred when a mixed signal obtained by a sound mixing section for mixing a plurality of sound signals respectively collected by a plurality of microphones is outputted by a speaker, for each of the sound signals, the howling detection device comprises: a level detecting section for respectively detecting levels of the plurality of sound signals; a howling occurrence detecting section for calculating a power spectrum of the mixed signal, and detecting a howling occurrence based on a change in the power spectrum; and a dominance ratio calculating section for extracting only a level of the word ending section from each of the levels of the plurality of sound signals, the levels detected by the level detecting section, and calculating, as a dominance ratio, a ratio of the extracted level of each of the sound signals to a sum of extracted levels of the plurality of sound signals.
In a tenth aspect of the present invention based on the ninth aspect, the howling detection device further comprises: a word ending detecting section for comparing, in a same time domain, the mixed signal with a sound signal to be outputted by the speaker as a noise reference signal, and detecting a time period, as a word ending section, during which the mixed signal is inputted after the noise reference signal falls; and a howling suppressing section for subtracting from the mixed signal a signal having a same component as a signal included in the word ending section, based on a transfer characteristic calculated by using the dominance ratio, and outputting the obtained signal to the speaker.
In an eleventh aspect of the present invention based on the tenth aspect, the howling suppressing section sets, when the word ending section is detected, a function used for estimating the mixed signal excluding the signal having the same component as the signal included in the word ending section, updates the sum of the levels of the plurality of sound signals in accordance with the dominance ratio, and calculates, when the howling occurrence is detected, the transfer characteristic by multiplying the function by a change rate of an updated sum of the levels of the plurality of sound signals to the sum of the levels of the plurality of sound signals.
In a twelfth aspect of the present invention based on the eleventh aspect, the howling suppressing section updates the sum of the levels of the plurality of sound signals by updating at least one of the levels of the sound signals, which indicates a relatively high dominance ratio.
In a thirteenth aspect of the present invention based on the eleventh aspect, the howling suppressing section updates the sum of the levels of the plurality of sound signals by updating only one of the levels of the sound signals, which indicates the highest dominance ratio.
In a fourteenth aspect of the present invention based on the ninth aspect, the howling detection device further comprises a howling warning section for specifying at least one of the sound signals, which indicates a relatively high dominance ratio calculated by the dominance ratio calculating section, and notifying a user of the at least one of the sound signals.
In a fifteenth aspect of the present invention based on the ninth aspect, the howling detection device further comprises a howling warning section for specifying one of the sound signals, which indicates the highest dominant ratio calculated by the dominance ratio calculating section, and notifying a user of the one of the sound signals.
In a sixteenth aspect of the present invention based on the ninth aspect, the level detecting section detects the levels, of the plurality of sound signals, each of which is represented using a power spectrum.
A seventeenth aspect of the present invention is directed to a howling detection method for detecting a dominance ratio, which indicates a risk of howling to be occurred when a mixed signal obtained by a sound mixing section for mixing a plurality of sound signals respectively collected by a plurality of microphones is outputted by a speaker, for each of the sound signals, the howling detection method comprises: a level detecting step for respectively detecting levels of the plurality of sound signals; a word ending detecting step for comparing, in a same time domain, the mixed signal with a signal regarding a sound to be intensified as a noise reference signal, and detecting a time period, as a word ending section, during which the mixed signal is inputted after the noise reference signal falls; and a dominance ratio calculating step for extracting only a level of the word ending section from each of the levels of the plurality of sound signals, the levels detected by the level detecting section, and calculating, as a dominance ratio, a ratio of the extracted level of each of the sound signals to a sum of extracted levels of the plurality of sound signals.
An eighteenth aspect of the present invention is directed to a howling detection method for detecting a dominance ratio, which indicates a risk of howling to be occurred when a mixed signal obtained by a sound mixing section for mixing a plurality of sound signals respectively collected by a plurality of microphones is outputted by a speaker, for each of the sound signals, the howling detection method comprises: a level detecting step for respectively detecting levels of the plurality of sound signals; a howling occurrence detecting step for calculating a power spectrum of the mixed signal, and detecting a howling occurrence based on a change in the power spectrum; and a dominance ratio calculating step for extracting only a level of the word ending section from each of the levels of the plurality of sound signals, the levels detected by the level detecting section, and calculating, as a dominance ratio, a ratio of the extracted level of each of the sound signals to a sum of extracted levels of the plurality of sound signals.
According to the aforementioned first aspect, the word ending section includes only a signal component which causes the howling occurrence, and the dominance ratio is calculated by using the level of the word ending section, thereby making it possible to detect the risk indicating a sound signal which is likely to cause a howling occurrence among the plurality of sound signals. Furthermore, the dominance ratio is calculated based on the level of each of the sound signals before being mixed by the sound mixing section. Therefore, according to the first aspect, before the plurality of sound signals are mixed by the sound mixing section, even if changes in frequency characteristics and/or gain characteristics of a plurality of the sound signals occur, for example, the risk can be detected in accordance with the changes.
According to the aforementioned second aspect, the transfer characteristic is calculated by using the dominance ratio, thereby making it possible to perform a howling suppression in accordance with the risk indicating a sound signal which is likely to cause the howling occurrence among the plurality of sound signals. Furthermore, the transfer characteristic is calculated by using the dominance ratio. Thus, before the plurality of sound signals are mixed by the sound mixing section, even if changes in frequency characteristics and/or gain characteristics of a plurality of the sound signals occur, and rapid changes in the transfer characteristics of the sound signals accordingly occur, for example, a robust howling suppression can be performed in accordance with the changes.
According to the aforementioned third aspect, the transfer characteristic is calculated based on the change rate, of the sum of the levels of the sound signals, which corresponds to the dominance ratio, thereby making it possible to realize the robust howling suppression while taking account of risks indicating a plurality of the sound signals which are likely to cause the howling occurrence.
According to the aforementioned fourth aspect, the transfer characteristic is calculated so as to correspond to the at least one of the plurality of sound signals which has a relatively high risk of the howling occurrence, thereby making it possible to realize a high-efficiency howling suppression.
According to the aforementioned fifth aspect, the transfer characteristic is calculated so as to correspond to one of the plurality of sound signals which has the highest risk of the howling occurrence, thereby making it possible to realize a high-efficiency howling suppression. For example, because it is rare that levels of a plurality of sound signals are simultaneously changed when the user performs a mixing operation, the robust howling suppression can be performed even if the transfer characteristic is calculated only in accordance with the highest dominance ratio.
According to the aforementioned sixth aspect, the at least one of the sound signals, which has a relatively high dominance ratio, is specified, thereby making it possible to notify the user of the at least one of the plurality of sound signals which has a relatively high risk of a howling occurrence. Furthermore, even when the user performs a mixing operation on a plurality of sound signals to be collected, for example, he or she can perform the operation by referring to the risk for each of the sound signals so as to prevent a howling occurrence.
According to the aforementioned seventh aspect, one of the sound signals, which has the highest dominance ratio, is specified, thereby making it possible to notify the user of the one of the plurality of sound signals which has the highest risk of a howling occurrence. Furthermore, even when the user performs a mixing operation on a plurality of sound signals to be collected, he or she can perform the operation by referring to the risk for each of the sound signals so as to prevent a howling occurrence.
According to the aforementioned eighth aspect, the level of each of the plurality of sound signals is represented using the power spectrum, thereby making it possible to detect the risk of the howling occurrence for each frequency band.
According to the aforementioned ninth aspect, when howling occurs, it is possible to detect the risk indicating a sound signal which is likely to cause the howling occurrence among the plurality of sound signals. Furthermore, the dominance ratio is calculated based on the levels of the sound signals before being mixed by the sound mixing section. Therefore, according to the present invention, before the sound signals are mixed by the sound mixing section, even if changes in frequency characteristics and/or gain characteristics of a plurality of the sound signals occur, and changes in the transfer characteristics of the sound signals accordingly occur, for example, the risk can be detected in accordance with the changes.
According to the aforementioned tenth aspect, the transfer characteristic is calculated by using the dominance ratio, thereby making it possible to perform a howling suppression in accordance with the risk indicating a sound signal which is likely to cause the howling occurrence among the plurality of sound signals. Furthermore, the transfer characteristic is calculated by using the dominance ratio. Thus, before the plurality of sound signals are mixed by the sound mixing section, even if rapid changes in frequency characteristics and/or gain characteristics of a plurality of the sound signals occur, and changes in the transfer characteristics of the sound signals accordingly occur, for example, a robust howling suppression can be performed in accordance with the changes.
According to the aforementioned eleventh aspect, the transfer characteristic is calculated based on the change rate, of the sum of the levels of the sound signals, which corresponds to the dominance ratio, thereby making it possible to realize, before the word ending section is detected, the robust howling suppression while taking account of risks indicating a plurality of sound signals which are likely to cause the howling occurrence.
According to the aforementioned twelfth aspect, the transfer characteristic is calculated so as to correspond to any of the plurality of sound signals, which has a relatively high risk of the howling occurrence, thereby making it possible to realize a high-efficiency howling suppression.
According to the aforementioned thirteenth aspect, the transfer characteristic is calculated so as to correspond to one of the plurality of sound signals which has the highest risk of the howling occurrence, thereby making it possible to realize a high-efficiency howling suppression. For example, because it is rare that levels of a plurality of sound signals are simultaneously changed when the user performs a mixing operation, a robust howling suppression can be performed even if the transfer characteristic is calculated only in accordance with the highest dominance ratio.
According to the aforementioned fourteenth aspect, when howling occurs, it is possible to notify the user of any of the plurality of sound signals which has a relatively high risk of a howling occurrence. Furthermore, even when the user performs a mixing operation on a plurality of sound signals to be collected, he or she can perform the operation by referring to the risk for each of the sound signals so as to prevent a howling occurrence.
According to the aforementioned fifteenth aspect, when howling occurs, it is possible to notify the user of one of the plurality of sound signals which has the highest risk of a howling occurrence. Furthermore, even when the user performs a mixing operation on a plurality of sound signals to be collected, he or she can perform the operation by referring to the risk for each of the sound signals so as to prevent a howling occurrence.
According to the aforementioned sixteenth aspect, the level of each of the plurality of sound signals is represented using the power spectrum, thereby making it possible to detect the risk of the howling occurrence for each frequency band.
With reference to
The sound signals X1(t) and X2(t) are inputted to the sound characteristic adjusting section 12. The sound characteristic adjusting section 12 adjusts a frequency and gain characteristic of each of the sound signals. Note that the sound signal X1(t) adjusted by the sound characteristic adjusting section 12 is denoted by Xm1(t). Similarly, the sound signal X2 adjusted by the sound characteristic adjusting section 12 is denoted by Xm1(t). The sound signals Xm1(t) and Xm2(t) adjusted by the sound characteristic adjusting section 12 are outputted to the level detecting section 14 and the sound mixing section 13. The sound signals Xm1(t) and Xm2(t) inputted to the sound mixing section 13 are mixed by the sound mixing section 13. The mixed sound signal is denoted by Xm(t). Thereafter, the sound signal Xm(t) mixed by the sound mixing section 13 is outputted to the word ending detecting section 15 and the howling suppressing section 17. Note that the sound characteristic adjusting section 12 and the sound mixing section 13 correspond to a commercially available mixer shown in
The level detecting section 14 detects a level of each of the sound signals Xm1(t) and Xm2(t) outputted from the sound characteristic adjusting section 12. As a specific detection method, for example, a power spectrum is calculated at a predetermined time interval, thereby detecting a level of each of the sound signals for each frequency band. All information regarding the level, for each frequency band, detected by the level detecting section 14 at the predetermined time interval is outputted to the dominance ratio calculating section 16.
Based on the sound signal Xm(t) inputted from the sound mixing section 13 and a noise reference signal Y(t), the word ending detecting section 15 detects a delay section, as a word ending, which is a time difference between a sound section of the noise reference signal Y(t) and a sound section of the sound signal Xm(t). Note that the noise reference signal Y(t) is a signal regarding a sound to be outputted by a speaker. For example, the noise reference signal Y(t) is a sound signal obtained immediately before being outputted by the speaker 18. In this case, the noise reference signal Y(t) obtained immediately before being inputted to the speaker 18 is inputted to the howling suppressing section 17. Alternately, the noise reference signal Y(t) may be a sound signal in which a sound outputted in a close proximity of the speaker 18 is collected and generated by another microphone or the like. In this case, the howling suppressing section 17 is connected to the said another microphone, and a sound signal outputted from the said another microphone is inputted to the howling suppressing section 17 as the noise reference signal Y(t).
With reference to
Based on the level of each of the sound signals outputted from the level detecting section 14 and the word ending detected by the word ending detecting section 15, the dominance ratio calculating section 16 calculates the dominance ratio of each of the plurality of sound signals having been inputted (Xm1(t) and Xm2(t) in
Among the levels calculated by the level detecting section 14, the level of a power spectrum included in the word ending section is denoted by a loop gain G. Also, a loop gain of the sound signal Xm1(t) is denoted by G1(ω), and a loop gain of the sound signal Xm2(t) is denoted by G2(ω). Similarly, a sound signal inputted from the nth (n is a natural number) microphone, the sound signal in which the frequency and gain characteristic thereof is adjusted by the sound characteristic adjusting section 12 is denoted by Xmn(t). In this case, a loop gain Gn(ω) of the sound signal Xmn(t) is represented by formula 7.
[Formula 7]
Gn(ω)=Mn(ω)*Xn(ω) (7)
Thereafter, the dominance ratio calculating section 16 extracts the loop gain G indicating the level of the word ending section from each of the levels of the sound signals, and calculates, as a dominance ratio of each of the sound signals, for example, a ratio of the loop gain of each of the sound signals to a sum of the loop gains of all sound signals. For example, in
As described above, in the word ending section including only the signal component propagated through space, the dominance ratio calculating section 16 calculates a dominance ratio of each of the sound signals, thereby detecting any of the sound signals which has a higher dominance ratio. Note that the signal component propagated through space is a signal component which causes a howling occurrence. Therefore, the dominance ratio calculating section 16 can detect, before howling occurs, whether a sound transmitted through R1(ω) shown in
If the howling detection device is structured such that a calculated dominance ratio is learned and updated by a predetermined method each time the word ending is detected, a dominance ratio can be sequentially changed in accordance with a positional change of a microphone. Note that a time at which the dominance ratio is learned is not limited to a time at which the word ending is detected. The time at which the dominance ratio is learned may be adjusted as necessary, taking account of an estimated sequence and accuracy.
The howling suppressing section 17 performs a signal processing on the sound signal Xm(t) mixed by the sound mixing section 13 so as to suppress howling. The sound signal on which the signal processing has been performed is amplified as necessary so as to be outputted by the speaker 18. Hereinafter, with reference to
In
Based on the sound signal Xm(ω) and the noise reference signal Y(ω), the transfer characteristic calculating section 173 firstly estimates a power spectrum ratio Hr(ω) only in the word ending section detected by the word ending detecting section 15. The power spectrum ratio Hr(ω) is represented by formula (8).
Note that ε indicates an average. Thereafter, the transfer characteristic calculating section 173 calculates a transfer characteristic Hsup(ω) shown in formula (9) based on the power spectrum ratio Hr(ω) estimated by formula (8).
As described above, in the present invention, Hsup(ω) is a function used for estimating the sound signal Xm(t) excluding a signal having the same signal component as a signal included in the word ending section.
Next, the transfer characteristic calculating section 173 multiplies Hsup(ω) calculated by formula (9) by a change rate of the sum of the loop gains, the change rate obtained based on the loop gain and dominance ratio, of each of the sound signals, calculated by the dominance ratio calculating section 16, thereby calculating Hsup(ω). Hereinafter, a calculation method of Hsup(ω) will be described.
It is assumed that a user performs a mixing operation in the sound characteristic adjusting section 12 and the sound mixing section 13, and changes the frequency and gain characteristic of each of the sound signals X1(t) and X2(t). In accordance with the operation, the frequency and gain characteristic M1(ω) of the sound signal Xm1(t) and the frequency and gain characteristic M2(ω) of the sound signal Xm2(t) change. In this case, as shown in formula 7, the loop gains G1(ω) and G2(ω) accordingly change. Here, between the dominance ratios calculated, before the mixing operation, by the dominance ratio calculating section 16, it is assumed that the dominance ratio of the loop gain G1(ω) is higher than that of the loop gain G2(ω). Also, the loop gain G1(ω) calculated, after the mixing operation, by the dominance ratio calculating section 16 is denoted by a loop gain G1new(ω), and the loop gain G1(ω) calculated, before the mixing operation, by the dominance ratio calculating section 16 is denoted by a loop gain G1old(ω). Similarly, the loop gain G2(ω) calculated, after the mixing operation, by the dominance ratio calculating section 16 is denoted by a loop gain G2new(ω), and the loop gain G2(ω) calculated, before the mixing operation, by the dominance ratio calculating section 16 is denoted by a loop gain G2old(ω).
In this case, the sum of the loop gains calculated, before the mixing operation, by the dominance ratio calculating section 16 is represented by G1old(ω)+G2old(ω). In contrast, the sum of the loop gains calculated, after the mixing operation, by the dominance ratio calculating section 16 is a sum obtained by taking account of only the loop gain having the highest dominance ratio among the dominance ratios calculated before the mixing operation. Specifically, in the above example, the dominance ratio of the loop gain G1(ω) is higher than that of the loop gain G2(ω). Thus, the sum of the loop gains calculated, after the mixing operation, by the dominance ratio calculating section 16 is represented by G1new(ω)+G2old(ω). In this case, the change rate Lr(ω) of the sum of the loop gains is represented by formula 10.
As described above, based on the loop gain and dominance ratio, of each of the sound signals, calculated by the dominance ratio calculating section 16, the change rate Lr(ω) of the sum of the loop gains is obtained. That is, in the change rate Lr(ω) of the sum of the loop gains, it is estimated that the sum of the loop gains (G1(ω)old+G2(ω)old) is changed to the sum of the loop gains (G1(ω)new+G2(ω)old) in accordance with a change in the loop gain G1(ω) having the highest dominance ratio. Note that in the above description, the sum of the loop gains is reflected only by the loop gain having the highest dominance ratio. This is on the grounds that it is rare that gains of two or more sound signals are simultaneously changed when the user performs the mixing operation, thereby making it possible to perform a robust howling suppression even if the change rate Lr(ω) is changed only in accordance with the loop gain having the highest dominance ratio. As described above, the sum of the loop gains is reflected by the loop gain having the highest dominance ratio, thereby making it possible to perform an effective and robust howling suppression, while taking account of only the sound signal having a high risk of a howling occurrence even if the plurality of sound signals are inputted.
The transfer characteristic calculating section 173 multiplies the change rate, shown in formula (10), of the sum of the loop gains, by the transfer characteristic Hsup(ω) calculated by formula (9), thereby calculating a transfer characteristic Hsup_new(ω) corresponding to the change rate of the sum of the loop gains. Note that the transfer characteristic Hsup(ω) is denoted by Hsup_old(ω), and the transfer characteristic corresponding to the change rate of the sum of the loop gains is denoted by Hsup_new(ω). In this case, the transfer characteristic Hsup_new(ω) corresponding to the change rate of the sum of the loop gains is represented by formula (11).
[Formula 11]
Hsup
As described above, in the present invention, the transfer characteristic Hsup_new(ω) corresponding to the change rate of the sum of the loop gains is a transfer characteristic obtained by multiplying Hsup(ω)_old, which is an estimated function, by the change rate of the sum of the loop gains.
Hsup_new(ω) updated by formula (11) is converted into a time domain by the inverse fourier transforming section 174. Hsup_new(ω) having been converted into the time domain is denoted by a filter coefficient Hsup_new(t). The convolution section 175 convolutes the filter coefficient Hsup_new(t) with the sound signal Xm(t) inputted from the sound mixing section 13, thereby subtracting from the sound signal Xm(t) the signal having only the same signal component as the signal included in the word ending section detected by the word ending detecting section 15. Note that Hsup(ω) is calculated (formula (9)) and updated (formula (11)) when the word ending is detected by the word ending detecting section 15. Alternatively, Hsup(ω) calculated (formula (9)) and updated (formula (11)) may be learned by a predetermined method each time the word ending is detected, for example.
As described above, according to the present embodiment, the dominance ratio calculating section 16 calculates the loop gain and dominance ratio of each of the sound signals, thereby calculating the transfer characteristic by using the change rate, of the sum of the loop gains, which is obtained based on the dominance ratio. Furthermore, because the dominance ratio is calculated based on an output signal outputted from the sound characteristic adjusting section 12, the dominance ratio is a value changed in accordance with the frequency characteristic and gain characteristic adjusted by the sound characteristic adjusting section 12. Thus, in the sound-intensifying system for mixing and intensifying the plurality of sound signals, the transfer characteristic, which is used for a howling suppression, is calculated based on the dominance ratio, there by making it possible to perform a robust howling suppression, even when the transfer characteristic is rapidly changed by the sound characteristic adjusting section 12. That is, the robust howling suppression can be realized even when the user performs the mixing operation and M(ω) is rapidly changed in accordance with the operation.
In the aforementioned description, the sum of the loop gains is estimated based on the loop gain, changed in accordance with time, which has the highest dominance ratio among the dominance ratios calculated, before the mixing operation, by the dominance ratio calculating section 16. However, the present invention is not limited thereto. For example, the sum of the loop gains may be reflected by a plurality of loop gains having relatively high dominance ratios. For example, it is assumed that three microphones are provided, and loop gains of the microphones are denoted by G1(ω), G2(ω) and G3(ω), respectively. In addition, it is also assumed that a dominance ratio of the loop gain G1(ω) and a dominance ratio of the loop gain G2(ω) are higher than that of the loop gain G3(ω) before the mixing operation. A sum of the loop gains (G1(ω)+G2(ω)+G3(ω)) may be reflected by the loop gains G1(ω) and G2(ω). In this case, the change rate Lr(ω) of the sum of the loop gains is represented by formula 12.
Furthermore, the transfer characteristic calculating section 173 may use the dominance ratios calculated by the dominance ratio calculating section 16 so as to reflect the loop gains of the sound signals, respectively, thereby obtaining the change rate of the sum of the loop gains. Alternatively, the transfer characteristic calculating section 173 may calculate the transfer characteristic, used for howling suppression, based on the dominance ratios by a method other than that using the change rate of the sum of the loop gains.
In the above description, two sound signals are inputted to the sound-intensifying system 1. However, the present invention is not limited thereto. For example, the sound-intensifying system 1 may have three or more microphones and three or more sound signals may be inputted to the sound-intensifying system 1. Furthermore, in the above description, a detailed subtraction configuration of the howling suppressing section 17 is shown in
In the above description, the level detecting section 14 may analyze a frequency of each of the sound signals, thereby calculating the level of each of the sound signals using the power spectrum. However, the present invention is not limited thereto. For example, the level detecting section 14 may calculate power of each of the sound signals at a predetermined time interval based on a scalar value. In this case, the dominance ratio calculating section 16 calculates the dominance ratio of each of the sound signals based on the scalar value. Also, the change rate Lr(ω) of the sum of the loop gains is represented based on the scalar value.
With reference to
In
The howling occurrence detecting section 21 calculates a power spectrum Xm(ω) of the sound signal Xm(t) mixed by the sound mixing section 13, thereby detecting a howling occurrence. For example, it is assumed that howling occurs at a specific frequency f. In this case, the power spectrum X(ω) of the sound signal Xm(t) changes, as shown in
Based on the level of each of the sound signals outputted from the level detecting section 14 and the information detected by the howling occurrence detecting section 21, the dominance ratio calculating section 16 calculates a dominance ratio of each of the plurality of sound signals having been inputted (Xm1(t) and Xm2(t) in
The howling suppressing section 22 performs a signal processing on the sound signal Xm(t) mixed by the sound mixing section 13 so as to suppress howling. Thereafter, the sound signal on which the signal processing has been performed is amplified as necessary so as to be outputted by the speaker 18. Hereinafter, with reference to
In
The word ending detecting section 176 has the same function as the word ending detecting section 15 described above. Based on the sound signal Xm(t) inputted from the sound mixing section 13 and the noise reference signal Y(t), the word ending detecting section 176 detects a delay section, as a word ending, which is a time difference between a sound section of the noise reference signal Y(t) and a sound section of the sound signal Xm(t). Similarity to the aforementioned first embodiment, the noise reference signal Y(t) is a sound signal obtained immediately before being outputted by the speaker 18, for example. In
Based on the sound signal Xm(ω) and the noise reference signal Y(ω), the transfer characteristic calculating section 173 firstly estimates a power spectrum ratio Hr(ω), shown in formula 8, only in the word ending section detected by the word ending detecting section 176. Thereafter, the transfer characteristic calculating section 173 calculates a transfer characteristic Hsup(ω) shown in formula (9) based on the power spectrum ratio Hr(ω) estimated in formula 8. Next, the transfer characteristic calculating section 173 multiplies Hsup(ω), calculated by formula (9), by a change rate of the sum of the loop gains, the change rate obtained based on the loop gain and dominance ratio, of each of the sound signals, calculated by the dominance ratio calculating section 16, thereby calculating a transfer characteristic Hsup(ω)_new corresponding to the change rate. Then, the transfer characteristic Hsup_new(ω), calculated by formula (II), corresponding to the change rate is converted into a time domain by the inverse fourier transforming section 174. The convolution section 175 convolutes a filter coefficient Hsup_new(t) having been converted into the time domain with the sound signal Xm(t) inputted from the sound mixing section 13, thereby subtracting from the sound signal Xm(t) a signal having only the same signal component as a signal included in the word ending section detected by the word ending detecting section 176. In this case, the transfer characteristic Hsup(ω)_new corresponding to the change rate is calculated based on a change rate, of a sum of the loop gains, which is obtained by any of the loop gains which causes the initial occurrence of howling. Therefore, it becomes possible to suppress howling while taking account of any sound signal which currently causes the initial occurrence of the howling and a frequency component of the sound signal.
In the present embodiment, Hsup(ω) is calculated (formula (9)) when the word ending detecting section 176 detects the word ending. Hsup(ω) corresponding to the change rate, of the sum of the loop gains, which is obtained based on the dominance ratio is updated (formula (11)) when the howling occurrence detecting section 21 detects the initial occurrence of howling. Alternatively, Hsup(ω) calculated by formula 9 may be learned by a predetermined method each time the word ending is detected, for example. Hsup(ω) calculated by formula 11 may be learned by a predetermined method each time the initial occurrence of howling is detected, for example.
As described above, according to the present embodiment, the dominance ratio calculating section 16 calculates the loop gain and dominance ratio of each of the sound signals at the time of the initial occurrence of howling. Thereafter, the transfer characteristic is calculated so as to correspond to the change rate, of the sum of the loop gains, which is obtained based on the dominance ratio. Furthermore, because the dominance ratio is calculated based on an output signal outputted from the sound characteristic adjusting section 12, the dominance ratio is a value changed in accordance with the frequency characteristic and gain characteristic adjusted by the sound characteristic adjusting section 12. Thus, in the a sound-intensifying system for mixing and intensifying the plurality of sound signals, the transfer characteristic, which is used for a howling suppression, is calculated based on the dominance ratio, there by making it possible to perform a robust howling suppression, even when howling occurs due to the sound characteristic adjusting section 12 which rapidly changes the transfer characteristic. Specifically, even when M(ω) is rapidly changed in accordance with the mixing operation performed by the user, and howling is almost likely to occur, a robust howling suppression can be realized. As a result, it becomes possible to prevent the howling from occurring.
With reference to
In
Alternatively, as shown in
As described above, in the present embodiment, the howling warning section 31 warns, in accordance with the dominance ratio calculated by the dominance ratio calculating section 16, the user of any of the sound signals which has the risk of the howling occurrence or any of the sound signals which currently causes the initial occurrence of howling. Thus, even if a plurality of sound signals are inputted, it becomes possible to allow the user to perform a mixing operation for each of the sound signals so as to prevent howling from occurring.
Among the respective elements described in the first to third embodiments above, at least a portion of the elements can be realized by an integrated circuit. Hereinafter, a detailed example will be described for each of the embodiments. The level detecting section 14, the word ending detecting section 15, the dominance ratio calculating section 16 and the howling suppressing section 17, which are all described in the first embodiment above, can be realized by an integrated circuit, for example, in which sound signals outputted from the sound characteristic adjusting section 12 (Xm1(t) and Xm2(t) in
A howling detection device and method according to the present invention is applicable to a sound-intensifying system, a PA device having a sound mixing function, and the like, which mix and intensify a plurality of sound signals, and which are capable of detecting a risk of a howling occurrence for each of the sound signals by calculating a dominance ratio.
Number | Date | Country | Kind |
---|---|---|---|
2004-177859 | Jun 2004 | JP | national |
Filing Document | Filing Date | Country | Kind | 371c Date |
---|---|---|---|---|
PCT/JP2005/010959 | 6/15/2005 | WO | 00 | 10/31/2006 |
Publishing Document | Publishing Date | Country | Kind |
---|---|---|---|
WO2005/125273 | 12/29/2005 | WO | A |
Number | Name | Date | Kind |
---|---|---|---|
7133529 | Ura | Nov 2006 | B2 |
7190800 | Terada et al. | Mar 2007 | B2 |
7760888 | Kanamori et al. | Jul 2010 | B2 |
Number | Date | Country |
---|---|---|
04-273642 | Sep 1992 | JP |
04-277998 | Oct 1992 | JP |
08-033091 | Feb 1996 | JP |
08-223274 | Aug 1996 | JP |
2002-223182 | Aug 2002 | JP |
2002-237769 | Aug 2002 | JP |
2003-250193 | Sep 2003 | JP |
2003-284183 | Oct 2003 | JP |
Number | Date | Country | |
---|---|---|---|
20080021703 A1 | Jan 2008 | US |