The invention relates generally to the field of signal analysis, and more particularly, to a system and method for detecting the frequency, amplitude and/or phase of one or more tones comprised within an input signal.
The discrete Fourier transform (DFT) is a popular tool for analyzing signals. However, before an input signal is transformed, it is quite often windowed with a windowing function. (It is noted that the action of capturing of a finite-length sequence of samples of the input signal automatically implies a rectangular windowing.) The transform Y of the windowed input signal will typically exhibit multiple scaled and shifted versions of transform function W, i.e., the transform of the window function. Each sinusoidal component of the input signal expresses itself as a pair of such shifted versions, one version shifted up to the frequency fj of the sinusoidal component, and the other shifted down to frequency −fj. The positive frequency version is referred to herein as a positive frequency image, and the negative frequency version is referred to herein as a negative frequency image. When a sinusoidal component frequency fj is small compared to the sample rate, the positive frequency image and the negative frequency image for the sinusoidal component may overlap in frequency space. Similarly, when a sinusoidal component frequency fj is close to one-half the sample rate, the positive frequency image and the negative frequency image for the sinusoidal component may overlap. Furthermore, when two sinusoidal components have frequencies that are close together, their positive images and negative images may overlap.
Prior art techniques for tone estimation quite often focus on identifying the peaks in the magnitude spectrum |Y|. The peaks roughly determine the frequency of the corresponding tones. However, because of the cross-interaction of the images from other tones, or the negative frequency image from the same tone, the peak of a positive frequency image may be perturbed away from a purely scaled and frequency-shifted version of the template function W. Thus, parameter estimation techniques which compute parameters for a given tone based only on transform array values (i.e. DFT values) in the vicinity of a corresponding image peak may not produce accurate results. Therefore, there exists a substantial need for a system and method which could estimate tone parameters from the transform array with increased accuracy.
The present invention comprises various embodiments of a system and a method for estimating signal parameters (e.g. frequency, amplitude and/or phase) of one or more sinusoidal tones present in an input signal. More particularly, one embodiment of the invention comprises a system and method for estimating parameters for a single tone based on a transform Y of the input signal. The input signal may be windowed with a window function w(n) and transformed into the frequency domain. The tone in the input signal expresses itself in the frequency domain as an additive combination of two spectra, one centered at the tone frequency and the other at the negative of the tone frequency. These two spectra are referred to herein as the positive frequency image and the negative frequency image respectively. The continuous-frequency transform W of the window function and the positive and negative frequency images have identically-shaped magnitude envelopes. Thus, a peak in the magnitude spectrum of the transform Y gives an initial estimate for the frequency and amplitude of the tone. Furthermore, the phase angle of the transform values in the neighborhood of the peak gives an estimate for the phase of the tone. The initial frequency, amplitude and phase estimates may be used to compensate for the effect of a negative frequency image on the transform array in the frequency domain, especially in the neighborhood of the peak frequency. In other words, estimate values of the negative frequency image may be subtracted from the complex coefficients of the transform array in the neighborhood of the peak frequency. The resulting difference values may be used to compute improved estimates for the tone frequency, amplitude and phase.
In one embodiment, a system may be configured to estimate signal parameters for one or more tones present in an input signal. The system may comprise an input for receiving an input signal, a memory, a processor and an output device, such as a display. The memory may store a software program which is executable by the processor. In response to execution of the software program, the processor is operable to perform the following operations.
In step (1) above, the processor may window the input signal, and compute a discrete Fourier transform of the windowed input signal. The discrete Fourier transform may be implemented by a fast algorithm such as the FFT.
The input signal may comprise one or more sinusoidal tones x1, x2, . . . , xL occurring at frequencies f1, f2, . . . , fL respectively. Each tone xi expresses itself in the transform array as an additive combination of a positive frequency image of the form
(Ai/2)exp(jθi)W(f−fi)
and a negative frequency image of the form
(Ai/2)exp(−jθi)W(f+fi),
where variable f denotes frequency, and W(f) is a continuous-frequency expression for the transform of the window function w(n). Thus, the transform array comprises an additive combination of positive frequency images and negative frequency images corresponding to the one or more tones. Because the positive and negative frequency images may overlap with each other (especially when the tone frequencies are near zero, near one-half the sample rate, or near to each other), the frequency locations of magnitude peaks in the transform array may provide only a rough approximation to the tone frequencies fi. In other words, the observability of a given image may be adversely affected by the other positive and negative frequency images which overlap with the given image.
The processor may identify the frequency locations of one or more magnitude peaks in the magnitude spectrum |Y(k)| of the transform array. In particular, the processor may search for magnitude peaks which exceed a magnitude threshold in a positive-frequency region of the transform array. A bin index value kmax may be determined for each of the threshold-exceeding magnitude peaks. The bin index value kmax for each magnitude peak defines the bin index at which the corresponding magnitude peak is maximized. It is noted that the index k of the transform array is referred to herein as the bin index.
The processor may compute a frequency estimate, an amplitude estimate and a phase estimate for each of the one or more tones based on a corresponding one of the magnitude peaks. The frequency estimate and amplitude estimate for a given tone are determined from the magnitude values of the corresponding magnitude peak under the assumption that the magnitude peak is a shifted and scaled version of the window transform magnitude |W|. The center frequency of the magnitude peak determines the frequency estimate, and the size of the magnitude peak relative the window magnitude |W| determines amplitude estimate. The phase estimate for a given tone is determined based on one or more the phase angles of the transform array coefficient (which are complex numbers) in the neighborhood of the corresponding magnitude peak.
The frequency, amplitude and phase estimates for the one or more tones are used to estimate the positive and negative frequency images, and to subtract out the cross-interaction between images. More particularly, the processor may correct the transform array parameters to correct the transform array values in the neighborhood of each peak frequency location. For a given tone, the processor may correct the transform array values around the corresponding peak frequency location by subtracting estimated values of any aliasing images. Aliasing images may include the positive and negative frequency images of tones other than the given tone, and the negative frequency image of the given tone.
After correcting the transform array values, the processor may recomputed the tone frequencies, amplitudes and phases based on the corrected transform array values. Because the corrected transform array values more closely approximate the positive frequency images that the original transform array values, the recomputed parameter estimates may be more accurate.
In one embodiment, the steps of correcting the transform array values and recomputing the parameter estimate may be performed repeatedly. When a termination criteria is achieved, the repetition may be terminated and final estimates for the signal parameters (e.g. tone frequencies, amplitudes and phases) may be transmitted to an output device (e.g. display screen).
In one embodiment, the tone frequencies, amplitudes and/or phases may be used to decode analog and/or digital signal information contained within the signal. For example, the method may be used to more accurately identify the tones present in the input signal. Thus, the final estimates for tone frequencies, amplitudes and/or phases may be used to recover encoded analog and/or digital signals.
A better understanding of the present invention can be obtained when the following detailed description of the preferred embodiment is considered in conjunction with the following drawings, in which:
While the invention is susceptible to various modifications and alternative forms, specific embodiments thereof are shown by way of example in the drawings and will herein be described in detail. It should be understood, however, that the drawings and detailed description thereto are not intended to limit the invention to the particular form disclosed, but on the contrary, the intention is to cover all modifications, equivalents and alternatives falling within the spirit and scope of the present invention as defined by the appended claims.
As shown in
Signal reception device SRD receives an input signal from the transmission medium or device 110 and converts the input signal into a form suitable for presentation to tone detection system 120. The input signal may be electrical or non-electrical in nature. Signal reception device SRD may include analog-to-digital conversion hardware to digitize the input signal. Alternatively, analog-to-digital conversion hardware may be comprised within tone detection system 120.
In one embodiment, signal reception device SRD may comprise a measurement device such as a microphone, an accelerometer, a spatial displacement sensor, a strain gauge, a pressure sensor, a temperature sensor (e.g., a thermocouple), a radiation sensor, an optical sensor, etc, or any combination thereof. In another embodiment, signal reception device SRD may represent an array of transducers or measurement devices of one or more types. SRD may thus be any of various transducers or sensors for receiving a signal.
Tone detection system 120 may couple to signal reception device SRD. Tone detection system 120 may be configured for detecting the frequency, amplitude and/or phase of one or more tones in the input signal. Tone detection system 120 may comprise a processor or central processing unit 140, memory 146, user input device(s) UID and a display device DD as shown in FIG. 1B. CPU 140 may be realized by any of a variety of computational devices such as a general purpose processor, a digital signal processor, a parallel processor, dedicated digital and/or analog circuitry, programmable gate array logic (e.g., an FPGA), etc., or any combination thereof. Memory 146 may comprise any of a variety of memory devices such as random access memory (RAM) and/or read-only memory (ROM), as described further below. Tone detection system 120 may also include specialized data acquisition and/or signal conditioning hardware, interface hardware, etc., or any combination thereof.
Tone detection system 120 may comprise any of various devices, such as a programmable computer system, a computer-based system such as a VXI-based system, a PXI-based system, a GPIB-based system, a computer-based data acquisition system, or a dedicated test instrument, such as a dynamic signal analyzer, an oscilloscope or any other signal acquisition and/or analysis device.
Tone detection system 120 may operate on samples of the input signal X generated by signal reception device SRD, and thus, may identify the frequency, phase and/or amplitude of one or more tones in the input signal. The frequency, phase and/or amplitude of the one or more tones may be presented to a user through the display device DD or some other output device, and/or may be stored to memory for future use.
User input device(s) UID may comprise a keyboard, a pointing device such as a mouse or trackball, a touch pad (such as those used in modem laptop computers for cursor control), a touch sensitive display screen, etc., or other input devices. In one embodiment, user input device(s) UID may include use of a graphical control panel configured with various control icons such as buttons, knobs, sliders, switches, indicators, etc., or any combination thereof. A user provides input to tone detection system 120 through user input device(s). Tone detection system 120 may manage a graphical user interface through display device DD and user input device(s) UID.
As shown, signal reception device SRD is configured and/or coupled to acquire signals from the transmission medium 110. The input signals acquired by signal reception device SRD may be optionally conditioned by the signal conditioning system 108 as shown in FIG. 2A. The conditioned input signals may then be provided to DAQ device 104 as shown. Signal conditioning system 108 may connect to DAQ device 104 via one or more cables.
Signal conditioning system 108 may comprise an external chassis 122 housing one or more signal conditioning modules 124 and optionally terminal blocks 126. Signal conditioning system 108 may be used to perform signal conditioning on field signals such as the signals generated by signal reception device SRD. As used herein, the term “signal conditioning” may include one or more of amplifying, linearizing, limiting, isolating, filtering, switching and/or multiplexing field signals (e.g. transducer excitation), among other signal processing functions. Signal conditioning system 108 may advantageously reduce the level of noise in the signals transmitted to DAQ device 104. DAQ device 104 may receive conditioned signals from signal conditioning system 108 as shown in FIG. 2A. Alternatively, DAQ device 104 may directly receive the input signal from signal reception device SRD as shown in FIG. 2B. DAQ device 104 may operate to perform analog to digital (A/D) conversion and provides the resultant digital signals to computer 102 for processing.
Computer system 102 may include various standard components, including a processor or central processing unit (CPU) 140, system memory 146, non-volatile memory, one or more buses, and a power supply. DAQ device 104 may be a specialized system for acquiring digital and/or analog signals from external devices. Thus, DAQ device 104 may include analog to digital (A/D) conversion circuitry and/or digital to analog (D/A) conversion circuitry. Examples of the DAQ device 104 include “E series” DAQ boards from National Instruments Corporation. DAQ device 104 may also comprise a computer-based instrument board, such as an oscilloscope, a digital multimeter (DMM), a dynamic signal analyzer, an arbitrary waveform generator, etc.
In one embodiment, computer 102 may comprise input/output (I/O) slots into which DAQ device 104 may be coupled. In another embodiment, computer 102 may comprise a VXI (VME Extensions for Instrumentation) chassis and bus, a GPIB (General Purpose Interface Bus) interface card, a serial port or parallel port by which DAQ device 104 may be coupled to the computer 102.
Tone detection system 120, e.g., computer system 102, preferably includes at least one memory medium on which computer programs according to the present invention may be stored. The term “memory medium” is intended to include various types of memory or storage, including an installation medium, e.g., a CD-ROM, or floppy disks 104, a computer system memory or random access memory such as DRAM, SRAM, EDO RAM, Rambus RAM, EPROM, EEPROM etc., or a non-volatile memory such as a magnetic media, e.g., a hard drive, or optical storage. The memory medium may comprise other types of memory as well, or combinations thereof. In addition, the memory medium may be located in a first computer in which the programs are executed, or may be located in a second different computer which connects to the first computer over a network. In the latter instance, the second computer may provide the program instructions to the first computer for execution. Also, the computer system 102 may take various forms, including a personal computer system, mainframe computer system, workstation, network appliance, Internet appliance, personal digital assistant (PDA), television system, dedicated test or measurement instrument or other device. In general, the term “computer system” can be broadly defined to encompass any system having a processor which executes instructions from a memory medium.
The memory medium preferably stores a software program according to one embodiment of the present invention for detecting one or more tones in the input signal. More particularly, the software program may be operable to analyze the input signal to determine the frequency, phase and amplitude of one or more tones in the input signal.
The software program may be implemented in any of various ways, including procedure-based techniques, component-based techniques, object-oriented techniques, or neural net based learning techniques, among others. For example, the software program may be implemented using ActiveX controls, C++ objects, Java objects, Microsoft Foundation Classes (MFC), or other technologies or methodologies, as desired. A processor, such as the host CPU, executing code and data from the memory medium, or a programmable device configured according to a net list, may comprise embodiments of a means for determining the frequency, phase and amplitude of the one or more tones embedded in the input signal according to the methods described below.
Various embodiments further include receiving, storing, and/or transmitting instructions and/or data implemented according to the present invention upon a carrier medium. Suitable carrier media include a memory medium as described above, as well as signals such as electrical, electromagnetic, or digital signals, conveyed via a communication medium such as networks and/or a wireless link.
FIGS. 3A&B—Aliasing Compensation Flowchart
In step 210, the CPU 140 may receive samples x(n) of the input signal provided by signal reception device SRD, and may multiply the input samples by a known window function w(n) to generate a windowed input signal y(n)=w(n)*x(n) as suggested by FIG. 4. It is noted that the input signal samples may be received from a storage device (e.g. disk, CD-ROM) having been previously recorded/captured from signal reception device SRD. Alternatively, the input signal samples may be simulated samples generated by a simulator (e.g. a CPU executing simulation code). The present invention contemplates a wide variety of possible sources for the input signal samples x(n).
The input signal is assumed to comprise a single sinusoidal tone in the presence of noise. Thus, the input signal may be modeled by the expression.
where θ is the phase of the sinusoidal tone, A is the amplitude of the sinusoidal tone, ω0=2πf0 is the frequency of the sinusoidal tone, and n is a discrete time index.
The window function w(n) may have any of a variety of forms. For example, the window function may be a rectangular window, a triangular window, a raised cosine window, a Hanning window, etc.
In step 220, CPU 140 may perform a discrete Fourier transform (DFT) on the windowed input signal y(n) to generate a transform array Y(k), where k is a frequency bin index which may range from 0 to N−1, or any interval of length N, where N is a positive integer. The transform array Y(k) may be modeled by the transform of the sinusoidal tone, i.e.
Y(k)=(A/2)exp(jθ)W(f−f0)+(A/2)exp(−jθ) W(f+f0),
where W(f) represents the Fourier transform of the window w(n). It is noted that the relationship between frequency f and frequency bin number k is given by
f=fS*(k/N),
where fS is the sample rate. The magnitude of the window transform W(f) typically has even symmetry and attains a maximum at f=0. Thus, the function W(f−f0) attains a maximum magnitude at frequency f=f0, and the function W(f+f0) attains a maximum magnitude at frequency f=−f0. The first term in the expression above, i.e.
P(f)=(A/2)exp(jθ)W(f−f0)
is referred to herein as the “positive-frequency image” since its center frequency occurs at the positive frequency f0. The second term in the expression above, i.e.
N(f)=(A/2)exp(−jθ)W(f+f0)
is referred to herein as the “negative-frequency image” since its center frequency occurs at the negative frequency −f0. Thus, the transform array Y(k) includes a positive-frequency image and negative-frequency image which combine additively (in the sense of complex addition). The input signal may also include noise and/or other spurious tones. However, these are assumed to be insignificant for the embodiments described in connection with
If tone frequency f0 stays away from zero or fS/2, and/or, the sample size N is sufficiently large, the overlap between the positive and negative frequency images may be small, and thus, their individual identities may be apparent in the transform array Y(k). The magnitude function |Y(k)| will thus exhibit two peaks which correspond to the positive and negative frequency images. The frequency locations of one of these peaks (i.e. the peak that occurs in the range of positive frequencies) may be used as an estimate for the tone frequency f0.
Conversely, if the tone frequency is close to zero or fS/2, and/or, the sample size N is sufficiently small, the positive-frequency image and negative frequency image may overlap significantly. Thus, their individual identities may not be apparent in the transform array Y(k). In other words, transform array Y(k) restricted to positive frequencies may be a poor approximation to the positive frequency image. Thus, the frequency location at which the magnitude function |Y(k)| attains a maximum, when considered over positive frequencies, is only a crude initial approximation to the tone frequency f0.
In step 230, CPU 140 may scan the DFT magnitude values |Y(k)| over the range of positive frequency bins to determine the bin index k which achieves the maximum magnitude. In other words, CPU 140 may select kmax as the integer bin index value k in the range from 0 to N/2 which maximizes the magnitude of Y(k). In addition, CPU 140 may perform a comparison of |Y(kmax−1)| and |Y(kmax+1)| to determine whether the second largest magnitude occurs at (kmax−1) or (kmax+1). Let k2 denote the location of this second largest magnitude. Let α=|Y(kmax)|, and let β=|Y(k2).
It is noted that the maximum of magnitude function |Y(k)| considered as a function of continuous frequency typically does not occur at the integer value kmax, although it should occur somewhere in the interval between kmax and k2.
In step 240, CPU 140 may compute estimates {circumflex over (f)}0 and Â0 for the tone frequency f0 and the tone amplitude A respectively based on the magnitude values |Y(k)| in the neighborhood of the maximizing index kmax and an assumed functional form for the window transform W(k).
For example, in the case where the window function w(n) used in step 210 is a rectangular window, the window transform W(k) may be approximated by the expression W(k)=sin(πk)/(πk). Thus, the frequency estimate {circumflex over (f)}0 and real amplitude estimate Â0 may be computed according to the relations
The plus solution for Δk is chosen if k2=kmax+1, and the minus solution for Δk is chosen if k2=kmax−1.
In the case where the window function w(n) used in step 210 is a Hanning window, the window transform W may be approximated by the expression W(k)=sin(πk)/[(πk)*(1−k2)]. Accordingly, the frequency estimate {circumflex over (f)}0 and real amplitude estimate Â0 may be computed according to the relations
Note that the plus solution for Δk may be chosen if k2=kmax+1, and the minus solution for Δk may be chosen if k2=kmax−1.
A variety of window functions are contemplated. For some window functions w(n), it may be difficult to obtain a simple formula for the window transform W(k). In these cases, values of the transform function W may be numerically approximated and used to compute the frequency and real amplitude estimates.
In step 245, CPU 140 may compute an estimate {circumflex over (θ)}0 for the tone phase using the phase angle of one or more of the complex values Y(k) in a neighborhood of kmax. In one embodiment, the phase of transform value Y(kmax) defines the phase estimate {circumflex over (θ)}0, i.e.
{circumflex over (θ)}0=angle(Y(kmax)),
where angle(z) denotes the principle angle of the complex number z.
In a second embodiment of step 245, CPU 140 may interpolate the phase of Y(k) between kmax and k2 to determine the phase estimate. For example, CPU 140 may perform a linear interpolation based on the phase of Y(kmax), the phase of Y(k2), and the value Δk.
In other embodiments of step 245, CPU 140 may determine the phase estimate {circumflex over (θ)}0 according to either of the expressions:
{circumflex over (θ)}0=angle(Y(floor(k0))) or
{circumflex over (θ)}0=angle(Y(ceil(k0))),
where floor(x) denotes rounding towards minus infinity, and ceil(x) denotes rounding towards plus infinity.
As noted above, the transform array Y(k) is an additive combination of the positive frequency image and the negative frequency image, i.e. Y(k)=P(k)+N(k). Because the positive and negative frequency images may overlap (around DC and/or around Nyquist depending on the value of the tone frequency f0), the peaks appearing in the transform array Y(k) may be interpreted as disturbed versions of the corresponding images. However, given the estimates for tone frequency, amplitude and phase computed in steps 240 and 245, it is possible to compute the DC-aliasing and Nyquist-aliasing contributions of the negative frequency image on the transform array Y(k) in the neighborhood of kmax. By subtracting these aliasing contributions from the transform array Y(k), a better approximation to the positive frequency image may be obtained.
In step 250, CPU 140 may use the phase estimate {circumflex over (θ)}0, the amplitude estimate Â0, and the frequency estimate {circumflex over (k)}0 to compute the “DC-aliasing” contribution of the negative frequency image at frequency bins k in the neighborhood of kmax. For example, CPU 140 may compute estimated values {circumflex over (N)}dc(k) of the negative frequency image according to the expression
for bins k=floor({circumflex over (k)}0)+i−1, where i equals 0, 1, 2 and 3, and where floor(x) is the function which rounds x towards minus infinity. (It is noted this neighborhood of kmax comprising four bins and starting at floor(k0)−1 represents one of many possible choices.) In step 255, CPU 140 may use the phase estimate {circumflex over (θ)}0, the amplitude estimate Â0, and the frequency estimate {circumflex over (k)}0 to compute the “Nyquist-aliasing” contribution of the negative frequency image at the frequency bins k in the neighborhood of kmax. For example, CPU 140 may compute estimated values {circumflex over (N)}Nyq(k) of the negative frequency image according to the expression
for bins k=floor({circumflex over (k)}0)+i−1, where i equals 0, 1, 2 and 3.
In step 260, CPU 140 may compute estimated values {circumflex over (P)}(k) for the positive frequency image according to the expression
{circumflex over (P)}(k)=Y(k)−{circumflex over (N)}dc(k)−{circumflex over (N)}Nyq(k),
for the bin index values k in the neighborhood of kmax.
It is noted that the bin location kmax of the maximum magnitude for the function {circumflex over (P)}(k) may not be the same as for transform array Y(k) as suggested by FIG. 7. Thus, the parameter kmax may be updated, i.e. set equal to the integer bin index k at which |{circumflex over (P)}(k)| is maximized as indicated in step 265,and α may be set equal to |{circumflex over (P)}(kmax)|. Similarly, parameter k2 may be set equal to the integer bin index k where |{circumflex over (P)}(k)| attains a second-highest value, and β may be set equal to |{circumflex over (P)}(k2)|.
In step 270, CPU 140 may compute a second estimate {circumflex over (k)}0(2) for the tone frequency and a second estimate Â0(2) for the real tone amplitude based on the complex difference values {circumflex over (P)}(k) generated in step 260. CPU 140 may use any of the methods described above in step 240 to determine these second estimates. Because the complex difference values {circumflex over (P)}(k) more closely approximate the positive frequency image than the transform values Y(k) in the neighborhood of kmax, the second estimates may be more accurate than the first estimates. In other words, since the effects of the negative frequency image have been substantially reduced or removed, the new estimates computed in step 270 may be more accurate.
In step 275, CPU 140 may compute an improved estimate {circumflex over (θ)}0(2) for the tone phase based on the phase angle of one or more of the complex numbers {circumflex over (P)}(k) in the neighborhood of the updated kmax. Any of the methods used to compute the phase estimate of step 245 may be used here to compute the improved phase estimate with the provision that {circumflex over (P)}(k) substitutes for Y(k).
In one embodiment, steps 250 through 275 may be iterated as many times as desired, or as many times as necessary to obtain convergence of the frequency, amplitude and/or phase estimates. In each iteration of steps 250 and 255, the negative frequency image may be approximated in terms of the most recent estimates for the tone frequency, amplitude and phase. For example, in a second iteration of step 250, the DC-aliasing contribution of the negative frequency image may be approximated by the expression
After step 275, or after multiple iterations of step 250 through 275, CPU 140 may output the final frequency estimate, real amplitude estimate and phase estimate to a user through display device DD or some other output device. Alternatively, these estimates may be stored in a memory for later use by some other signal processing device, or another software application running on CPU 140.
The embodiments described above may generate estimates for the tone frequency, amplitude and/or phase even when the positive and negative images overlap significantly. For example, the tone frequency may be close to DC or one-half the sample rate, and/or, the size N of the DFT may be small.
Hanning Window
In steps 250 and 255 described above, a phase estimate {circumflex over (θ)}0 is used to compute respectively DC-aliasing and Nyquist-aliasing contributions of the negative frequency image to bins in the neighborhood of kmax. In the Hanning window embodiment, the phase estimate may be handled in different ways depending on whether aliasing compensation is being performed about DC or about Nyquist. Namely, for DC aliasing compensation, CPU 140 computes phase value φ0 according to the expression
{circumflex over (φ)}0=π+angle(Y(kf)),
where kf=floor({circumflex over (k)}0) and {circumflex over (k)}0=kmax+Δk , and the DC aliasing contribution of the negative frequency image according to the expression
for bins k=floor({circumflex over (k)}0)+i−1, where i equals 0, 1, 2 and 3, and where |x| denotes the absolute value of x.
The form of the above expression for the phase estimate arises from the fact that the phase of Y(k) makes a jump of π radians between kmax and kmax±1 when the window function is a Hanning window.
{circumflex over (φ)}0=angle(Y(kf)),
i.e. without adding 180 degrees, and computes the Nyquist-aliasing contribution of the negative frequency image according to the expression
for bins k=floor({circumflex over (k)}0)+i−1, where i equals 0, 1, 2 and 3. See the source code appendix for a realization of the Hanning window embodiment of the aliasing compensation method written in LabView™.
Detection of Multiple Tones
In certain situations, the input signal may include multiple tones having different frequencies.
In step 310, CPU 140 may receive an input signal x(n), and may apply a window w(n) to the input signal x(n) to generate a windowed input signal y(n)=x(n)*w(n). The input signal x(n) may originate from transmission medium 110, and may be presented to tone detection system 120 through signal reception device SRD. However, the present invention contemplates a wide variety of source for the input signal samples x(n). For example, the input signal samples x(n) be may read from a memory medium (e.g. CD-ROM, magnetic disk, etc.) having been previously recorded/captured from transmission medium 110. Also, the input signal sample x(n) may be simulated samples generated by a simulator (i.e. a processor executing in response to simulation code).
In step 320, CPU 140 may compute the DFT of the windowed input signal y(n) to obtain a transform array Y(k).
The input signal may be modeled by the expression
where xi(n) represents the ith tone of L tones in the input signal. The tone Xi is assumed to have the form
where parameter ωi=2πfi is the frequency of the tone xi, parameter Ai is the real amplitude of the tone x1, and parameter θi is the phase of the tone xi. The input signal may also include noise and/or other spurious tones.
The transform of the ith windowed tone yi(n)=xi(n)*w(n) may be modeled as the sum of a positive frequency image
Pi(f)=(A1/2)exp(jθi)W(f−fi),
and a negative frequency image
Ni(f)=(Ai/2)exp(−jθi)W(f+fi),
where W is a continuous-frequency expression corresponding to the transform of window w(n). (The positive frequency image has a magnitude envelope which is centered at tone frequency fi. The negative frequency image has an identically-shaped magnitude envelope which is centered at frequency −fi.) Thus, transform array Y(k) may be modeled by a summation of positive and negative frequency images
If the tone frequencies maintain a sufficient mutual separation from one another, are sufficiently far from zero and fS/2, and the sample set size N is sufficiently large, the frequency support regions of the positive and negative frequency images may be essentially non-overlapping or minimally overlapping. Thus, each peak in the magnitude spectrum |Y(k)| may closely approximate one of the positive or negative frequency images, and the frequency location of the magnitude peak may accurately approximate the corresponding tone frequency fi. (Recall, the positive frequency images are centered on the tone frequencies).
Conversely, if any of the tone frequencies get too close together, too close to zero or fS/2, or N is sufficiently small, the positive and negative frequency images may significantly overlap, and thus, a peak in the magnitude spectrum |Y(k)| may only poorly approximate its corresponding positive (or negative) frequency image, and the center frequency of the magnitude peak may be perturbed away from the corresponding tone frequency f1.
In step 330, CPU 140 may scan the magnitude spectrum |Y(k)| to determine the frequency location of magnitude peaks occurring over the range of positive frequencies as suggested by FIG. 11. In other words, CPU 140 may search for integer bin values mi which correspond to local maxima of the magnitude spectrum when considered over integer bin values in the range from 0 to N/2. Let αi equal the maximal magnitude value for each peak, i.e. αi=|Y(mi)|. The local maxima may be subjected to a minimum magnitude test so that low-level noise peaks and signal side-lobes may be rejected.
In addition, CPU 140 may perform a comparison of the magnitudes |Y(mi+1)| and |Y(mi−1)| for each peak location mi to determine whether the second largest magnitude for the corresponding magnitude peak occurs at k=mi+1 or k=m1−1. Let pi denote the location of this second largest magnitude. Let βi represent this second largest magnitude value, i.e. βi=|Y(pi)|.
In one embodiment, CPU 140 may identify positive-frequency magnitude peaks which satisfy a magnitude threshold relative to the largest magnitude peak. For example, CPU 140 may select positive frequency magnitude peaks that are more than X decibels below the largest positive-frequency magnitude peak, where X is a user selectable value.
In step 350, CPU 140 may compute for each tone xi, i=1, 2, 3, . . . , L, an estimate {circumflex over (f)}i for the tone frequency fi and an estimate Âi for the tone amplitude. These estimates may be computed based on the transform magnitude values |Y(k)| in a neighborhood of corresponding positive-frequency peak location mi, and an assumed functional form for the continuous-frequency spectrum W.
In one embodiment, the window function w(n) is a rectangular window. Thus, the continuous-frequency spectrum W may be assumed to have the form W(k)=sin(πk)/(πk). In this case, the frequency estimate {circumflex over (f)}i and amplitude estimate Âi for tone xi may be computed according to the relations
The plus solution for Δki may be chosen if pi=mi+1, and the minus solution for Δki may be chosen if pi=mi−1.
In a second embodiment, the window function w(n) is a Hanning window. Thus, the continuous-frequency spectrum W may be assumed to have the form W(k)=sin(πk)/[(πk)*(1−k2)]. In this case, the frequency estimate {circumflex over (f)}i and amplitude estimate Âi may be computed according to the relations
The plus solution for Δki may be chosen if pi=mi+1, and the minus solution for Δki may be chosen if pi=mi−1.
A variety of window functions are contemplated. For some window functions w(n), it may be difficult to specify a simple formula for the spectrum W. In these cases, the values of W(k) may be numerically approximated and used to compute the frequency and amplitude estimates.
In step 355, CPU 140 may compute, for each tone xi, an estimate {circumflex over (θ)}i of the tone phase θ1 using the phase of one or more the transform array values Y(k) in the neighborhood of positive-frequency peak location mi. Any of the methods discussed above in the single tone embodiments may be used for the phase estimation of step 355.
Given the estimates for tone frequency {circumflex over (k)}i, tone amplitude Âi and tone phase {circumflex over (θ)}i, the corresponding positive frequency image Pi may be approximated by an expression such as
and the corresponding negative frequency image Ni may be approximated by expressions such as
In step 360, for each value of the index j running from 1 to L (i.e. the number tones), CPU 140 may compute the contributions of the other aliasing images on the transform array values Y(k) in the neighborhood of positive-frequency peak location mj. More specifically, for each value of the index j, CPU 140 may use the image approximations given above to compute a complex sum
for bins k in the neighborhood of positive-frequency peak location mj. In other words, the complex sum D(k) may include the estimated values at bin k of each positive frequency image other than Pj, and the estimated values at bin k of all negative frequency images.
In step 370, for each value of index j running from 1 to L, CPU 140 may subtract the sum D(k) from the corresponding DFT value Y(k) at each bin index value k in the neighborhood of positive-frequency peak location mj. The resulting difference values S(k)=Y(k)−D(k) comprise an improved approximation to the positive frequency image peak Pj.
In step 375, CPU 140 may update the integer peak locations mj based on the magnitude of the difference values S(k). Because of the subtraction operation of step 370, the magnitude peaks in the difference function S(k) may be shifted in frequency with respect to the corresponding peaks Uj in transform Y(k). For each j in the range 1 to L, CPU 140 may examine the magnitude values |S(k)| in the neighborhood of peak location mj (i.e. the original peak location mj computed above in step 330) to determine the integer bin index value of the new maximum magnitude. This bin index value becomes the updated value of peak location mj. The parameter αj may be updated as the new maximal magnitude, i.e. the magnitude of S(k) at new peak location mj. Similarly, CPU 140 may update the second-to-max peak locations pj and their corresponding magnitudes βi.
In step 380, for each value of the index j running from 1 to L, CPU 140 may compute a second estimate {circumflex over (f)}j(2) for the tone frequency fj and a second estimate Âj(2) for the tone amplitude Aj based on the magnitudes of the complex difference values S(k)=Y(k)−D(k) in the neighborhood of updated peak location mj. CPU 140 may use the same (or similar) methods as those described above in step 350 to determine the second estimates. Because the complex difference S(k) values more closely approximate the positive frequency image peak Pj, these second estimates may be more accurate than the first estimates. In other words, since the effects of the other negative and/or positive frequency images have been substantially reduced or removed, the new estimates computed in step 380 may be more accurate.
In step 385, for each value of index j running from 1 to L, CPU 140 may compute a second phase estimate {circumflex over (θ)}j(2) for tone phase ηj based on the phase angle of one or more of the differences S(k) in the neighborhood of updated peak location mj. Any of the methods discussed above in the single tone embodiments may be used for the phase estimation here.
In one embodiment, steps 360 through 385 may be iterated as many times as desired, or as many times as necessary to obtain convergence of the frequency, amplitude and/or phase estimates. In each iteration of step 360, the positive and negative frequency images that contribute to the sums D(k) may approximated in terms of the most recent estimates for the tone frequencies, amplitudes, and phases.
After step 385, or after multiple iterations of step 360 through 385, CPU 140 may output final estimates for the real amplitude, phase and frequency of each tone Tj as indicated in step 390. These final estimates for the multiple tones may be presented to the user on display device DD or through some other output device(s). Alternatively, these estimates for the various tones may be stored in a memory for later use by some other signal processing device, or another software application running on CPU 140.
The embodiments described above may generate estimates for the tone frequencies, amplitudes and/or phases even when the positive and negative frequency images of the tones overlap significantly. For example, the tone frequencies may be close to DC, close to one-half the sample rate, and/or close to each other. Overlap may also be due to spectral leakage when the size N of the DFT is small.
Applications
Embodiments of the present invention may be used in various applications. In general, embodiments of the present invention may be used in any system where it is desired to detect sinusoidal tones present in a signal, e.g., where it is desired to detect the precise frequency, amplitude and/or phase of the tones present in the signal. For example, an embodiment of the present invention may be used in a DTMF (Dual Tone Multi-Frequency) system for detecting tones present in a signal, such as a signal generated by a keypad of a telephone. Embodiments of the present invention are also contemplated for use in applications involving sonar, radar (e.g. Doppler radar), frequency-shift keying applications, mechanical systems analysis, etc. For example, the reflections generated by multiple moving objects in response to a radar pulse have distinct frequencies dependent on their radial velocities with respect to the radar station. Thus, the frequencies of the reflections are usable for tracking the multiple moving objects. In another example, a mechanical system excited with a physical stimulus (e.g. an impulse) may manifest vibrations at one or more frequencies. The frequency, amplitude and/or phase of these vibrations may provide information to a system analyst about the nature of flaws in the mechanical system. Embodiments of the present invention may be used in a wide variety of applications, i.e. in any application where it is desirable to identify one or more tones present in an input signal. The above-mentioned applications are merely representative examples.
Although the system and method of the present invention is described in connection with several embodiments, it is not intended to be limited to the specific forms set forth herein, but on the contrary, it is intended to cover such alternatives, modifications, and equivalents, as can be reasonably included within the spirit and scope of the invention as defined by the appended claims.
Number | Name | Date | Kind |
---|---|---|---|
4698769 | McPherson et al. | Oct 1987 | A |
4841827 | Uchiyama | Jun 1989 | A |
5018428 | Uchiyama et al. | May 1991 | A |
5165051 | Kumar | Nov 1992 | A |
5412152 | Kageyama et al. | May 1995 | A |
5436403 | Usa | Jul 1995 | A |
5808225 | Corwin et al. | Sep 1998 | A |
6122657 | Hoffman, Jr. et al. | Sep 2000 | A |
6128370 | Barazesh et al. | Oct 2000 | A |
6195675 | Wang et al. | Feb 2001 | B1 |
6229889 | Cannon et al. | May 2001 | B1 |
6473732 | Chen | Oct 2002 | B1 |
6665622 | Chappell et al. | Dec 2003 | B1 |
6718217 | Shinohara et al. | Apr 2004 | B1 |
Number | Date | Country | |
---|---|---|---|
20020120354 A1 | Aug 2002 | US |