The invention relates to the field of communications, and in particular to the field of digital communications and computing/estimating the time delay of a signal transmitted over a communication channel.
As known, bothersome echoes occur in communication systems, such as telephone systems, that operate over long distances or in systems that employ long processing delays, such as digital cellular systems. The echoes are the result of electric leakage in the four-to-two/two-to-four wire hybrid circuit, due to an impedance mismatch in the hybrid circuit between the local loop wire and the balance network. To reduce the echoes, communication systems typically include one or more echo cancellers.
Echo cancellers typically include an adaptive filter that generates an estimate of the echo and subtracts the estimate from the return signal. Like any adaptive discrete time filter, the tap weights of the filter are adjusted based upon the difference between the estimate of the echo signal and the return signal. The adaptive filter employs an adaptive control algorithm to adjust the tap weights in order to drive the value of the difference signal to zero or a minimum value.
A problem with prior art echo cancellers is that they are required to handle echo tail lengths of up to 128 milliseconds. However, in order to meet this requirement the adaptive filter would have to have 1024 taps. Of course providing such a filter having such a large numbers of taps leads to processing complexity. Specifically, if the adaptive filter is implemented in a processor, implementing a filter having 1024 taps requires a lot of processing. Similarly, if the adaptive filter is implemented in an application specific integrated circuit (ASIC), implementing a filter having 1024 taps requires a lot of gates.
There is a need for a time delay estimator that maybe used by an echo canceller that includes a computationally efficient adaptive filter, including a time delay estimator.
According to one aspect of the invention, there is provided a system for time delay estimation in a discrete time processing system. The system includes a cross correlator that performs cross correlation on a first signal and a second signal, and provides a cross correlated output signals indicative thereof. A lag smoother receives the cross correlated output signals, and provides lag smoothed output signals indicative thereof. A select logic module selects a pre-defined number of signal values from a respective set indicative of the lag smoothed output signals to compute the time delay estimation associated with the first and second signals.
These and other objects, features and advantages of the present invention will become more apparent in light of the following detailed description of preferred embodiments thereof, as illustrated in the accompanying drawings.
The transmitted signal x[k] and its associated received signal y[k] may be voice signals (e.g., sampled at 8 kHz). However, one of ordinary skill in the art will recognize of course that the time delay estimator 200 of the present invention is not limited to this context. The invention may use other types of signals, such as signals used in communication networks, radar systems, or the like.
The time delay estimator 200 receives the transmitted signal x[k] and its associated received signal y[k]. Since voice signals have most of their energy in the lower half region of their bandwidth, they can be downconverted to 4 kHz (i.e., decimated by a factor of two). The time delay estimator 200 includes a first decimator 207, and a second decimator 208 that receives the signals x[k] and y[k], respectively. The decimators 207, 208 may downconvert the input signals by a factor of two. However, it is contemplated that other decimation factors may also be used. Therefore, each block of input data to the decimators 207, 208 includes, for example, 1024 samples, while the resultant decimated output signals each include 512 samples per block. The invention may also use signals x[k] and y[k] that have more or less samples per block.
The decimators 207, 208 provide a first decimated output signal on a line 210 and a second decimated output signal on a line 212, respectively. The decimated output signals are input to a cross correlator 202. The cross correlator 202 may also perform mathematical operations on the downconverted outputs, such as averaging. The cross correlator 202 provides on output signal on a line 214, wherein the output signal on the line 214 includes signal values (e.g., peak values of the signal on the line 214) that are indicative of the time delay between the discrete signals x[k] and y[k]. Since voice signals are non-stationary, it would be difficult to use solely the cross correlation to determine time delay as in the prior art. The time delay estimator 200 provides the capacity to determine time delay for discrete signals that are non-stationary.
where n is a sample index value, λ is a forgetting factor value, and τ is a lag index value. The sample index is associated with the number of data values in the downconverted signal. In this case, the sample index value n is 512 since, in this embodiment, each of the downconverted signals on lines 210, 212 (
The cross correlation may also be performed on a block-by-block basis, where each block includes signal values over a certain amount of time. The cross correlator 202 computation performed on a block-by-block basis can be expressed as:
where k is a block index value, B is a block length value, λ is a forgetting factor value, and τ is the lag index value. In one embodiment, the block length may be 5 ms. This sort of cross correlation gives a spatial dimension in analyzing the two signals x[k] and y[k]. Since the cross correlator 202 also preferably performs averaging on its input values, it can be shown that the recursive relationship is equivalent to relation EQ. 2 as follows:
for performing block-by-block cross correlation.
The cross correlation provides output average values that may either be positive or negative. However, the cross correlation output may also contain effects of time delay that are immersed within the signals x[k] and y[k]. The cross correlation output does not provide an accurate description of time delay effects, because the correlated signals are voice signals that exhibit non-stationary properties. As a result, further analysis is needed to compute the time delay estimate.
Referring again to
Referring still to
The lag smoother may compute the output values as follows:
where L is a sliding window size, P is the size of the window overlap, and r is indicative of the number of sets produced by the lag smoother 204. In this embodiment, the value r spans from 1-32, where each value is uniquely associated with one of the lag smoother 204 outputs as shown in
The outputs from the lag smoother 204 are input to a time smoother/filter 205, which performs temporal averaging. For example, each of the input signals on a block may be input to an associated single pole filter (e.g., an IIR filter). These filters reduce the variance of the lag smoother outputs. The time smoother 205 is an optional but preferable element of the time delay estimator 200, since it further reduces the variance, thus providing a better estimation of the time delay.
Sr[k]=(1−α)Sr[k−1]+α{tilde over (R)}r[k] EQ. 5
where α is the effective memory length of the each of the plurality of filters (50, 52 and 54), {tilde over (R)}r[k] is the set associated with the output of the lag smoother 204, and r corresponds to the indices of the output from the lag smoother 204. In one embodiment, the value of r ranges between 1 and 32. The time smoother 205 also maintains state information in computing its smoothed output.
The time smoother 205 may also perform other functions that aid the select logic module 206 in selecting the associated peak values, if the inputs to the select logic module 206 are from the time smoother 205. For example, if the discrete signals x[k] and y[k] contain low signal values on lines 210 and 212 or solely signal noise, these signals will produce outputs across the cross correlator 202, the lag smoother 204, and the time smoother 205 that are null or insignificant. Therefore, it would be necessary for the select logic module 206 to not replace its previous peak values with new peak values that are null or insignificant. The time smoother 205 notifies a confidence module 62 which in turn notifies the select logic module 206 to maintain its previous state information in such circumstances. Another situation is when the outputs to the time smoother 205 are relatively similar same values, thus there is low confidence on these outputs in producing a reliable time delay estimate because there is very little spread between the outputs in the time smoother 205. In this instance, the time smoother 205 notifies the confidence module 62 that also notifies the select logic module 206 to not change its state by selecting the peak values from its output. This reduces the likelihood of unreliable outputs by the select logic module 206. The confidence module 62 notifies the select logic module 206 preferably using boolean values to signify whether it should change its state, which will be discussed hereinafter.
The time smoother 205 may use an intensity module 60 and the confidence module 62 to determine whether the inputs to the time smoother 205 are reliable enough for the select logic module 206 to select peak values. The intensity module 60 measures whether the inputs to the cross correlator 202 are at an intensity level that is sufficient for selecting peak values at the select logic module 206. The intensity module 60 notifies the time smoother 205 of the measured intensity levels via a signal line 209. The confidence module 62 receives on line 211 from the time smoother 205 an indication whether the intensity levels are at a level appropriate for continued processing of the inputs on the lines 210 and 212. The confidence module 62 provides a boolean signal on line 220 to the select logic module 206 of its determination regarding the reliability that inputs to the time smoother 205 will provide accurate time delay estimates. In general, the partitioning of the task of determining whether the intensity is sufficient to provide confidence that accurate time delay information can be calculated by either time smoother 205 or confidence module 62, et cetera. In any event, a preferred embodiment is a discrete time system and the processing steps are associated with executable software/firmware routines. However, the system of the present invention may also be implemented in an ASIC or general-purpose processor. A boolean signal value of “0” on the line 220 signifies that the confidence that the inputs to the time smoother 205 will produce reliable time delay estimates is low. Similarly, a boolean value of “1” signifies that the confidence that the inputs to the time smoother 205 will produce reliable time delay estimates is high (i.e., not low).
For example, if the intensity module 60 and time smoother 205 determine that the intensity levels of the input signals on the lines 210 and 212 are not appropriate for producing time delay estimates, the time smoother 205 provides an indication to the confidence module 62 of that result. The confidence module 62 outputs a boolean signal value of “0” to the select logic module 206, thus notifying the selector logic module 206 not to change its state. Otherwise, the outputs of the cross correlator 202 are allowed to proceed on to the lag smoother 204. At that point, the time smoother 205 analyzes its inputs-to determine whether these values are sufficient for further processing by the select logic module 206. If not, the time smoother 205 notifies the confidence module 62 of this result via line 211, where the confidence module 62 sends a boolean signal of “0” on line 220 to the select logic module 206. This boolean signal value notifies the select logic module 206 not to change its state. Otherwise, if the input to the time smoother 205 is determined to be appropriate for further processing, the time smoother 205 notifies the confidence module 62. The confidence module 62 proceeds to send a boolean signal value of “1” to the select logic module 206, thus allowing the select logic module 206 to change its state by selecting peak values from its inputs.
The outputs of both the lag smoother 204 and the time smoother 205 aid in determining time delay estimation of non-stationary signals, such as voice signals. The outputs of the lag smoother 204 and time smoother 205 provide the peak values used for determining time delay estimates of the non-stationary signals. The major difference between the two outputs is that the time smoother 205 aims in the reduction of the variance of the lag smoother outputs to provide a more reliable output to determine the time delay estimates.
The select logic module 206 can either receive the outputs from the time smoother 205 or the lag smoother 204, as shown in
Although the present invention has been shown and described with respect to several preferred embodiments thereof, various changes, omissions and additions to the form and detail thereof, may be made therein, without departing from the spirit and scope of the invention.
This application claims priority from U.S. Provisional application designated Ser. No. 60/382,717 filed May 23, 2002 and entitled Time Delay Estimator, which is hereby incorporated by reference.
Number | Name | Date | Kind |
---|---|---|---|
4007330 | Winters | Feb 1977 | A |
4173759 | Bakhru | Nov 1979 | A |
4191853 | Piesinger | Mar 1980 | A |
4862450 | Guidoux | Aug 1989 | A |
5764690 | Blanchard et al. | Jun 1998 | A |
5787118 | Ueda | Jul 1998 | A |
5805584 | Kingston et al. | Sep 1998 | A |
6067295 | Bahai et al. | May 2000 | A |
Number | Date | Country | |
---|---|---|---|
20040037432 A1 | Feb 2004 | US |
Number | Date | Country | |
---|---|---|---|
60382717 | May 2002 | US |