This application claims priority under 35 U.S.C. § 119 or 365 to Norwegian Application No. 20052109, filed Apr. 29, 2005. The entire teachings of the above application are incorporated herein by reference.
Conventional conference systems are usually equipped with a sensitive non-directive microphone to capture speech from a plurality of participants. The wide coverage area may compromise with noise protection, as any shield or casing will reduce the audio capturing characteristics. As conference microphones also usually are movable, other electronic components may be exposed to external noise.
In particular, the increased use of Global System for Mobile Communications (GSM) mobile phones has lead to an increasing problem with disturbing noise in video and telephone conferences. This noise is introduced into the conferencing system as a result of interference with the audio capturing components caused by radio transmission from the GSM mobile phones. The acoustic components in a videoconferencing system consist of one or more microphones capturing the near-end audio, one or more loudspeakers presenting the far-end audio and a general signal processing unit (codec). When the GSM mobile phones induce interference noise to the audio system, the noise will be received as a very annoying and disturbing noise at the far-end side and the speech intelligibility will be severely degraded.
The GSM networks make use of the TDMA (Time Division Multiple Access) technique to be able to squeeze more calls onto one channel by dividing a calling channel into a few “discontinuous” pieces. TDMA has 8 time slots (i.e. transmitting for one eighth of the time) and the length of each time slot is 0.57 ms (⅛*1/217). Thus, a GSM mobile phone in transmitting mode emits short duration radio-frequency pulses at a rate of 217 Hz.
a shows a GSM induced interference signal combined with normal background noise. The time intervals between each negative spike are 4.6 ms (1/(217 Hz)) and the intervals between the negative and the positive spikes are 0.57 ms. The negative spikes are related to the start of the TDMA time slots and the positive spikes are related to the end of the time slots. The induced interference signal contains the 217 Hz fundamental and a large number of harmonics that overlap the frequency range of speech, and therefore severely degrade speech intelligibility.
The GSM mobile phones radio-frequency pulses in a number of situations, some of which are listed below.
The present invention relates to an audio communication method and device for detecting cell phone induced noise in electronic communication equipment.
There is a need for a system and method that minimizes the problems described above.
In particular, the present invention discloses a method for detecting cell phone induced noise of a captured signal in a telecommunication equipment, including the steps of Fourier transforming the captured signal to a Fourier transformed signal, executing a logarithmic function on the Fourier transform signal to a logarithmic Fourier transform signal, Fourier transforming the logarithmic Fourier transform signal to a cepstrum signal, and deciding whether one or more amplitudes associated with one or more samples in the cepstrum signal are above one or more corresponding threshold(s).
A noise detector corresponding to this method is also disclosed.
The foregoing will be apparent from the following more particular description of example embodiments of the invention, as illustrated in the accompanying drawings in which like reference characters refer to the same parts throughout the different views. The drawings are not necessarily to scale, emphasis instead being placed upon illustrating embodiments of the present invention.
a shows a GSM impulse train of TDMA pulses.
b show the GSM impulse train when AC coupled in an encoder.
a, 4b and 4c respectively shows a time-signal, FFT and Cepstrum of a speech signal.
a, 5b and 5c respectively shows a time-signal, FFT and Cepstrum of a GSM induced background noise signal.
a, 6b and 6c respectively shows a time-signal, FFT and Cepstrum of a GSM induced speech signal.
In the following, the present invention will be discussed by describing preferred embodiments, and by referring to the accompanying drawings. However, even if the specific embodiments are described in connection with video conferencing and stereo sound, people skilled in the art will realize other applications and modifications within the scope of the invention as defined in the enclosed independent claims.
The present invention discloses a method allowing an audio processing unit to detect the situations mentioned in the background section when the audio is disrupted by the GSM induced noise. When GSM noise is detected by means of the present invention (see
According to the present invention, an analyzing technique called cepstrum is utilized to detect GSM induced noise. A cepstrum (pronounced “kepstrum”) is the result of taking the Fourier transform of the logarithmic magnitude spectrum of a signal. The cepstrum was for the first time defined in Tukey, J. W., B. P. Bogert and M. J. R. Healy: “The quefrency analysis of time series for echoes: cepstrum, pseudo-autocovariance, cross-cepstrum, and saphe-cracking”. Proceedings of the Symposium on Time Series Analysis (M. Rosenblatt, Ed) Chapter 15, 209-243. New York: Wiley.
A simplified definition of cepstrum of a signal is the Fourier Transform (FT) of the logarithm of the FT of the signal. This can mathematically be expressed as follows:
cepstrum of signal=FT(log(FT(the signal)))
and algorithmically:
signal→FT→log→FT→cepstrum
In terms of cepstrum analysis, “FT” is used to indicate the Fourier transform function, rather than “FFT”, since the Fast Fourier Transform is not specifically required.
The term “cepstrum” is an anagram of “spectrum”, formed by reversing the first four letters. Similar anagrams used in the cepstrum terminology are “quefrency” corresponding to frequency, and “gamnitude”, corresponding to magnitude.
As indicated above, the cepstrum is the spectrum of a spectrum, and has certain properties that make it useful in many types of signal analysis. One of its more powerful attributes is the fact that any periodicities, or repeated patterns, in a spectrum will be sensed as one or two specific components in the cepstrum. If a spectrum contains several sets of sidebands or harmonic series, they can be confusing because of overlap, but in the cepstrum, they will be separated in a way similar to the way the spectrum separates repetitive time patterns in the waveform. In simplified terms, an pulse train in a time signal is represented with periodicity in the corresponding Fourier Transform, that again is represented by well-defined peaks in the cepstrum.
The present invention utilizes the fact that the GSM induced interference signal contains the fundamental and a lot of harmonics of the 217 Hz, which gives a periodic frequency spectrum (see
The cepstrum analysis will detect the characteristic periodicity in the frequency spectrum by giving a high “gamnitude” value at the quefrency index given by fs/2*1/217, where fs=sampling frequency. In the
To make the detection even more secure, the GSM noise detector may additionally look at the Q(2*111) and also the neighbor quefrency lines and switch in the eliminator filter if e.g.:
(Q(q)>threshold1) AND (Q(2*q)>threshold2)
(Q(q)>threshold1) AND (Q(2*q)>threshold2) AND
(Q(q±n)<threshold1), where n=[2 . . . 10].
Q(q)>2*max(Q(q±n)), where n=[2 . . . 10]
(Q(q)>2*max(Q(q±n))) AND (Q(2*q)>2*max(Q(2*q±n))), where n=[2 . . . 10]
In example 1 and 2 the decision is based on absolute thresholds and in example 3 and 4 the decision is based on thresholds relative to the maximum “gamnitudes” of the “quefrencies” not being monitored.
if(Q(111)>threshold)
normal processing
The present invention will make it possible in software to detect situations where the analogue audio system is disrupted by GSM mobile phones.
A noise detector according to the present invention could be installed at the near-end side of a conference before loading the audio signal on the near-end loudspeaker for removing noise originating from near-end equipment, but it could also be installed at the far-end side of a conference before loading the audio signal on the far-end loudspeaker for removing noise originating from near-end equipment. The advantage of the latter is that it allows for GSM noise detection and GSM noise removal even if the noise originates from installations not provided with the GSM noise detector/eliminator.
When noise is detected, several ways of eliminating or attenuating the noise could be initiated. One example is to mute the signal exposed to the noise. Another example is to filter the noise from the signal before transmitting the signal forward. Note that the present invention is not restricted to noise from a GSM phone. The present invention could be used in all other cell phone noise due to TDMA or similar systems.
While this invention has been particularly shown and described with references to preferred embodiments thereof, it will be understood by those skilled in the art that various changes in form and details may be made therein without departing from the scope of the invention encompassed by the appended claims.
Number | Date | Country | Kind |
---|---|---|---|
20052109 | Apr 2005 | NO | national |
Number | Name | Date | Kind |
---|---|---|---|
5355431 | Kane et al. | Oct 1994 | A |
6415253 | Johnson | Jul 2002 | B1 |
7010119 | Marton et al. | Mar 2006 | B2 |
7120580 | Rao Gadde et al. | Oct 2006 | B2 |
7155387 | Globerson | Dec 2006 | B2 |
20020123308 | Feltstrom | Sep 2002 | A1 |
20030158732 | Pi et al. | Aug 2003 | A1 |
Number | Date | Country |
---|---|---|
WO 0038180 | Jun 2000 | WO |
Number | Date | Country | |
---|---|---|---|
20060259300 A1 | Nov 2006 | US |