The present invention relates generally to acoustic feedback cancellation technology. More particularly, the present invention relates to a low complexity howling suppression method and system for portable karaoke system.
Karaoke has become more and more popular as an interactive entertainment activity in East Asia. Anyone can sing in a karaoke club using a karaoke machine with their friends. Recently, people are not just satisfied with singing in a club but also sing everywhere with such as a portable all-in-one karaoke machine. Hence, many portable karaoke products have entered the market. However, the sound quality of most of these products are not as good as expected. Many of them suffer from the “acoustic howling” or have not enough sound level due to this howling issue.
In actuality, the howling is a common problem in a karaoke or other sound reinforcement system. A karaoke system usually comprises at least one microphone and a loudspeaker. When a sound signal is picked up by the microphone, and subsequently amplified and fed to the loudspeaker, the loudspeaker sound is usually taken back by the microphone through a direct acoustic path or some reflection paths. The acoustic coupling between the loudspeaker and the microphone results in a closed signal loop, where the acoustic feedback in the form of an unwanted howling can occur. In this case, it either causes the howling problem and even damages the loudspeaker, or limits the performance of the sound reinforcement, since it limits the amount of amplification that can be applied if the karaoke system is required to be stable.
Therefore, different measures have been taken to prevent this problem. There are four major categories of howling suppression methods. The first one is the frequency shifting method, which can alter the microphone input signal in every loop with a frequency shift of several Hz. It will become more helpful when the shifting frequency is bigger, for example 20 Hz, but the side effect on the sound quality is too severe to be considered acceptable. So we usually cannot alter too much and have to compromise on the suppression performance. The second one is the Notch filter based Feedback Suppression (NFS), where notch filters are used to suppress problematic frequencies at which howling has been detected. Notch filters are stop band filters with a very narrow stop band, which can significantly reduce the gain in that particular frequency band and thus suppresses those frequencies from the microphone input signal. However, the NFS method usually includes a detection phase and a suppression phase in the sense that howling sound needs to be detected first, and thus is often heard before it is suppressed. The third one is beamforming method, which uses the microphone array or loudspeaker array to modify the directivities, to reduce the direct sound propagation between the microphones and loudspeakers. But this method requires additional hardware and more computation, and thus the number of microphones and loudspeakers are often limited. The fourth one is the Acoustic Echo Cancellation (AEC) method, which use adaptive filters to approximate the transfer function between the microphones and the loudspeakers, and filter the output signal from the loudspeaker in order to cancel the approximated feedback signal picked up by the microphone. If the transfer function is perfectly approximated by the adaptive filters, no howling comes out. Usually AEC method can perform well thanks to the adaptive mechanism, but it takes high computation and power consumption, as well as the long system processing time, which is not suitable in the portable karaoke system with certain play time and latency requirement.
However, most of these measures are focusing on the regular big karaoke system, in which the distance between the loudspeaker and the microphone is usually much farther than that in the portable all-in-one karaoke system. On the other hand, users would probably like to sing as loud as possible even in such a small karaoke machine. Therefore, if we want to exceed user's expectation in the karaoke product, we need to develop a better technique to suppress the howling.
The present invention overcomes some of the above drawbacks by providing a low complexity howling suppression system for portable karaoke system. The howling suppression system comprising at least one microphone for capturing an input signal which comprises a source signal and an acoustic feedback which is propagated through environment; an electro-acoustic path for compressing and equalizing the input signal, and then feeding to an output signal after amplified; a loudspeaker for playback the output signal. The howling suppression system further comprises at least one infinite impulse response (IIR) filters for estimating the acoustic feedback, and thereby to cancel out the acoustic feedback from the input signal.
The present invention further provides a low complexity howling suppression method for portable karaoke system. The howling suppression method comprises the steps of capturing an input signal which comprises a source signal and an acoustic feedback via at least one microphone, compressing and equalizing the input signal, in an electro-acoustic path, and then feeding an output signal to a loudspeaker after amplified, and playback the output signal with a loudspeaker, and the output signal is propagated through environment. The howling suppression method further comprises the step of estimating the acoustic feedback via at least one infinite impulse response (IIR) filters, and thereby to cancel out the acoustic feedback from the input signal.
The present invention may be better understood from reading the following description of non-limiting embodiments, with reference to the attached drawings. In the figures, like reference numeral designates corresponding parts, wherein below:
The detailed description of one or more embodiments of the present invention is disclosed hereinafter; however, it is understood that the disclosed embodiments are merely exemplary of the invention that may be embodied in various and alternative forms. The figures are not necessarily to scale; some features may be exaggerated or minimized to show details of particular components. Therefore, specific structural and function details disclosed herein are not to be interpreted as limiting, but merely as a representative basis for teaching one skilled in the art to variously employ the present invention.
When the sound signal is picked up by the microphone 110 and subsequently amplified and fed to the loudspeaker 140, the audio sound playback by the loudspeaker 140 can be taken back by the microphone 110 through direct or reflection paths. The acoustic coupling between the loudspeaker 140 and the microphone 110 results in the output signal Y(z) propagated through the environmental transfer function F(z) to form an acoustic feedback, and then also picked up into the microphone 110, and thus the input signal X(z) as shown in
Therefore, with all the signals and transfer functions denoted in the frequency domain, the process in
Y(z)=X(z)·G(z)·K (1)
X(z)=Y(z)·F(z)+S(z) (2)
Regarding the above equation (2), it is conceivable that the input signal X(z) into the karaoke system may further include an Audio Stream, such as accompaniment music of a song, input by way of Bluetooth or AUX interface, for example, which has been omitted herein.
And then we can compute the total transfer function H(z) from the source signal S(z) to the output signal Y(z):
where the term F(z)·G(z)·K refers to the loop response of the system, and the magnitude and phase responses of it denote the loop gain and loop phase, respectively. Therefore, the howling will occur when the system becomes unstable, which is summarized in the Nyquist stability criterion,
F(z)·G(z)·K≥1, ∠F(z)G(z)=n2π (4)
In the past several decades, many methods on howling suppression have been discussed. Certainly, the first precaution to be considered is optimizing the whole karaoke system, such as the directivities of the loudspeaker and the microphone, the distance between them, the overall gain of the system and the amplitude of some potentially problematic frequencies. However, the optimization is usually limited, especially in the portable all-in-one karaoke system, because its form factor and sound performance usually have certain requirements—as small size but as high sound level as possible. For these limited scenarios, the processes must be automated or other measures needs to be taken to avoid howling feedback. Therefore, the present invention provides a portable all-in-one karaoke system with low complexity howling suppression method.
To better suppress the howling sound in the portable all-in-one karaoke machine, and reduce the processing computation, the power consumption, and the system latency, the present invention provides a low-complexity howling suppression method for portable karaoke system, in which filters are used to cancel unwanted components from the microphone signal.
X(z)−Xest(z)=S(z)+F(z)Y(z)−Fest(z)Y(z)≈S(z) (5)
Using an adaptive filter to estimate the environmental transfer function can achieve good effect of suppressing howling. In practice, however, the howling suppression with the adaptive filter estimating the environmental transfer function still exists some problems. Firstly, the latency may not meet the requirement of this small karaoke machine. Since the adaptive algorithm might need long processing time, while the loudspeaker is very close to the microphone, so that the sound propagation time is probably smaller than the processing time, and thus the algorithm becomes ineffective. Secondly, the adaptive algorithm might also consume high power and the battery will drain quickly, which is the obvious defect of a portable device. Thirdly, the adaptive algorithm sometimes does not converge smoothly, resulting in the significant difference of the adaptive filters, which will constantly affect the timbre of the user's singing. Furthermore, there is high correlation between the loudspeaker and microphone in the karaoke system, which renders this structure to perform poorly in this scenario.
Therefore, in the example system, several second-order Infinite Impulse Response (IIR) filters 260′ are used to model the transfer function F(z), as shown in
Besides, the decorrelation 215 can be further introduced into the example system 200 to reduce the correlation between the loudspeaker and microphone signals. In the model as shown in
x
shift(t)=x(t)cos(2πΔft)−{circumflex over (x)}(t)sin(2πΔft) (6)
where Δf is the shifted frequency, and {circumflex over (x)}(t) is the Hilbert transform of the original signal x(t).
The full advantage of the frequency shifting can be taken into the present method. In this proposed model with several IIR filters modeling the environmental transfer function, there is no need the adaptive processing due to the latency problem and high-power consumption. Moreover, since the acoustic feedback problem occurs because the output signal of the loudspeaker is returned to the input microphone through the part of acoustic coupling in the air, the frequency shift is used to decorrelate the reference signal and the error signal, which can be used to mitigate biased filter estimation.
In Step 310, the input signal is provided into the portable karaoke system, which comprises the source signal such as the user's singing voice, and the acoustic feedback. This part of the input signal captured by the at least one microphone; Moreover, the input signal further comprises the audio stream such as the accompanying music of songs, this part of the input signal may be upload to the karaoke system in a wired or wireless way, such as by Bluetooth or via the AUX interface.
In Step 315, the decorrelation is introduced into the provided system to decorrelate the input signal. In this step, the loudspeaker signal of the loudspeaker is decorrelated from the input signal by frequency shifting the input signal.
In Step 320, then the frequency shifted input signal is processed in the electro-acoustic path, including compression and equalization, and then amplified by the gain factor K, to get the output signal.
In Step 340, the output signal is fed to the loudspeaker for playback and propagated through the environment.
In this step, the output signal playback from the loudspeaker, after propagated in the environment, is taken back by the microphone through a direct or some reflection paths as acoustic feedback, the acoustic feedback enters the microphone as the other part of the input signal received by the at least one microphone, as mentioned above in Step 310.
Next, in Step 360, in order to cancel out unwanted components from the microphone signal, several IIR filters are used to model the environmental transfer function. This step comprises measuring and estimating the environmental transfer function offline, and approximates the function by such as multi-band IIR filters. The resulting estimated signal is approximately equal to the acoustic feedback part of the input signal entering the microphone in step 350. Therefore, by subtracting the estimated acoustic feedback from the input signals captured by the microphone, the acoustic feedback that may produce howling in the system can be cancelled out.
In a way of example, two microphones with different directivities as a microphone array 410 are used to form a cardioid directivity pattern as shown in
The beamforming output Xbeam(z) is written as:
X
beam(Z)=ΣkWk(z)·Xk(Z) (7)
where Wk(z) and Xk(z) are the kth beamforming filter and the kth microphone input signal, respectively.
Moreover, In the actual portable karaoke products, the microphones 410 can be wrapped with the sound-absorbing cotton 480 to further reduce the sound energy from the loudspeaker 440. We can see that the passive radiator 470 is alternatively used in this example machine, as shown schematically in
For a portable karaoke machine, we always want to have a long play time but still expect a good sound quality, such as high volume and less howling problem. In this invention, the provided low complexity howling suppression method adopts the IIR filter structure to reduce the power consumption and the system latency. To further suppress the howling, nonlinear algorithms, for example frequency shifting, are also combined with the microphone beamforming method.
The low complexity howling suppression method and system provided in the present invention are suitable for those applications in which a system containing both loudspeaker and microphone provided that their relative positions are fixed, and the loudspeaker plays the input signal of the microphone in real time. The example applications comprise such as but not limited to portable karaoke machines, integrated speakers, and conference systems, etc.
As used in this application, an element or step recited in the singular and proceeded with the word “a” or “an” should be understood as not excluding plural of said elements or steps, unless such exclusion is stated. Furthermore, references to “one embodiment” or “one example” of the present disclosure are not intended to be interpreted as excluding the existence of additional embodiments that also incorporate the recited features. The terms “first,” “second,” and “third,” etc. are used merely as labels, and are not intended to impose numerical requirements or a particular positional order on their objects.
While exemplary embodiments are described above, it is not intended that these embodiments describe all possible forms of the invention. Rather, the words used in the specification are words of description rather than limitation, and it is understood that various changes may be made without departing from the spirit and scope of the invention. Additionally, the features of various implementing embodiments may be combined to form further embodiments of the invention.
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/CN2020/111657 | 8/27/2020 | WO |