1. Field of the Invention
The present invention relates to systems and methods for communicating information, and in particular to a system and method for estimating the impulse response of a communication channel using short synchronization codes.
2. Description of the Related Art
In packet-based communication systems, spreading codes are used for packet detection and synchronization purposes. Correlation techniques are used to identify and synchronize to its timing. In many instances, the spreading code sequence can be in the order of 1000 chips or more. Since the receiver must correlate through all possible delays, this process can result in unacceptable delays.
To ameliorate this problem, a short spreading code with good aperiodic autocorrelation can be used for packet detection and synchronization purposes. One example is the IEEE 802.11 Wireless Local Area Network (WLAN) system, which uses a length 11 Barker code as a spreading sequence for the preamble and the header of a packet. The short length of the spreading sequence makes it easy for receivers to quickly detect the presence of a packet in the communication channel and to synchronize to its timing.
In the case of a linear channel, for the purpose of receiver design, it is often desirable to estimate the impulse response of the communication channel. In the context of the WLAN, a multi-path linear channel is often utilized, and such communication channels require equalization for effective reception. Given an estimate of the impulse response of the communication channel, we can directly calculate equalizer coefficients through matrix computations, as opposed to the conventional adaptive algorithms. This is described in “Digital Communications,” by John G. Proakis, 4th edition, Aug. 15, 2000, which reference is hereby incorporated by reference herein. This allows equalizer coefficients to be computed in a digital signal processor (DSP) instead of in more expensive and less adaptable dedicated hardware implementing the adaptation algorithms.
Unfortunately, because the spreading code used is short (e.g. on the order of 11 symbols) a straightforward correlation using the spreading code will produce a distorted estimate. What is needed is a simple, computationally efficient technique that can be used to compute substantially undistorted communication channel impulse response estimates, even when the received signal was chipped with a short spreading code. The present invention satisfies that need.
To address the requirements described above, the present invention discloses a method and apparatus for estimating a communication channel impulse response h(t). The method comprises the steps of generating com(t)=co(t+mNTc) for m=0, 1, Λ, M by correlating a received signal r(t) with a spreading sequence Si of length N, wherein the received signal r(t) comprises a chip sequence c, applied to a communication channel characterizable by an impulse response h(t), and wherein the chip sequence cj is generated from a data sequence di spread by the spreading sequence Si; generating an estimated communication channel impulse response ĥM(t) as a combination of com(t) and dm for m=0, 1, Λ, M; and filtering the first estimated communication channel impulse response ĥM(t) to generate the estimated communication channel impulse response h(t) with a filter f selected at least in part according to the spreading sequence Si. The apparatus comprises a correlator for generating com(t)=co(t+mNTc) for m=0, 1, Λ, M by correlating a received signal r(t) with a spreading sequence Si of length N, wherein the received signal r(t) comprises a chip sequence Cj applied to a communication channel characterizable by an impulse response h(t), and wherein the chip sequence Cj is generated from a data sequence di spread by the spreading sequence Si; an estimator for generating an estimated communication channel impulse response ĥM(t) as a combination of com(t) and dm for m=0, 1, Λ, M; and a filter f selected at least in part according to the spreading sequence Si, the filter for filtering the first estimated communication channel impulse response ĥM(t) to generate the estimated communication channel impulse response ĥ(t).
The foregoing permits the impulse response ĥ(t) of the communication channel to be accurately estimated, even with short chip codes. Non-intuitively, in the case of a time-limited channel impulse response, the present invention yields an estimate that can be made perfect in the limit of high signal-to-noise ratio (SNR).
Referring now to the drawings in which like reference numbers represent corresponding parts throughout:
In the following description, reference is made to the accompanying drawings which form a part hereof, and which is shown, by way of illustration, several embodiments of the present invention. It is understood that other embodiments may be utilized and structural changes may be made without departing from the scope of the present invention.
cj=ciN+n=di•Sn, 0≦n≦N−1 Eq. (1)
This spread chip sequence Cj 106 is transmitted through a linear transmission channel 108 having a combined channel impulse response h(t). The transmitted signal is received by a receiver 112. The received waveform r(t) 114 is:
where n(t) 121 is an additive noise component.
This formulation does not explicitly impose a causality requirement on h(t) 108. If explicit causality is desired, this can be accomplished by setting h(t)=0, t<0. For simplicity purposes, all the data and code sequences in the following discussion are assumed to be real, though the channel impulse response h(t) 108 and the additive noise component n(t) 121 could be complex in their baseband representations. Complex sequences could be easily accommodated if needed, but they are not common for synchronization purposes.
The receiver 112 receives the transmitted signal, and correlates the received signal r(t) 114 with the known spreading sequence Si 104 to identify the data as intended to be received by the receiver 112. Once the received signal r(t) 114 is received, the preamble can be examined to determine the address of the data and whether further processing is necessary.
Such systems also use the received signal to estimate the input response of the communication channel 108. This information is used to improve later detection and reception of signals from the transmitter 110. In circumstances where the spreading sequence Si 104 is relatively short, the data packet 128 must be detected quickly, and there is less data available to estimate the response of the communication channel 108. Conventional Detection and Synchronization
For detection and synchronization purposes, the search for the spreading code is conventionally performed by correlating the received signal r(t) 114 with the spreading sequence. This is accomplished by the correlator 116. Although this correlation is typically done after sampling in the time domain, for notational simplicity, we do not perform the time domain discretization. The correlator 116 output co(t) 118 is given by:
where D(l) is the correlation between the chip sequence and the spreading sequence and we will refer to it as the chip correlation.
For notational simplicity, we have introduced a (negative) group delay (ITc) in calculating the correlator output 118. The correlator 116 output is given by the convolution of the chip correlation D(l) with the sampled communication channel impulse response h(t−lTc) plus a noise component n (t). Upon further examination:
=dm•A(n)+dm+1•A(N−n), Eq. (10)
l=mN+n,0≦n≦N Eq. (11)
where A(n) is a two-sided aperiodic autocorrelation of the spreading sequence defined as:
A(n) is a property of the code sequence that is known by the correlator 116 apriori.
For detection and synchronization purposes, the spreading sequence Si 104 is designed to have minimum values of A(k) when k≠0. However, for small (e.g. on the order of 10) values of N (short spreading codes), even the smallest side lobe magnitude is not negligible compared to the in-phase autocorrelation.
Barker sequences, when they exist, give the best aperiodic autocorrelation. For an 11 chip Barker sequence, Si =1, −1, 1, 1, −1, 1, 1, 1, −1, −1, −1, the autocorrelation becomes A(i)=11, 0, −1, 0, −1, 0, −1, 0, −1, 0, −1 for 0≦i<11. Note that even for Barker codes, because the spreading sequence Si 104 is of limited length, the autocorrelation A(i) includes significant side lobes.
The correlator 116 output 118 can be rewritten as:
where the following is defined as the convolution of the spreading sequence aperiodic autocorrelation A(i) and the sampled channel impulse response h(t−iTc) as follows:
This is an estimate of the combined communication channel 108 impulse response ĥ(t) at the output of the code correlator 116.
The above equations can be more succinctly written using a convolutional notation. Defining a convolution of two infinite sequences Ai and Bi as
By defining an operator
that converts any sequence O to a time domain function using the Dirac delta function:
Using the above notations and further, by adopting the following definitions:
u(iN)=di(data) Eq. (22A)
u(iN+n)=0,0<n<N Eq. (22B)
the foregoing equations (1), (2), (3), (6), (12), (18), (16), (17) can be rewritten as:
c=u{circle over (×)}S Eq. (1′)
Determining a Communication Channel Impulse Response Estimate
For simplicity of notation, in the remaining discussion, we assume that data symbols are binary. The results however, can be generally applied to non-binary data.
Because the correlator 116 has access to the same code sequence Si 104 that was used to generate the spread chip sequence cj 106 before transmission, the correlator 116 can correlate the received signal r(t) 114 with the code sequence Si 104. However, aliasing can occur with short code sequences Si 104, because time delays may cause the correlator 116 to correlate different portions of adjacent code sequences. Conventionally, these aliasing effects are reduced by integrating or summing over multiple (e.g. M) code periods, as discussed below.
As described in Eqs. (13)-(17), based on the correlator 116 output 118 we can form an estimate of the channel impulse response over one code period Tc:
where d0 is a value of the data at time t=0.
This is a rough approximation to ĥ(t), corrupted by aliased copies of ĥ(t) spaced at multiples of NTc, away from the desired copy. These aliasing and the additive noise terms can be reduced through further summation over M code periods:
The foregoing indicates that through output 122 of estimator 120, by removing the data modulation through correlation with the data sequence, we obtain an estimate ĥ of the channel impulse response plus terms defined by the autocorrelation of the data sequence, which vanish when summed over infinite terms.
If DM(l) is defined as:
Therefore, in the limit of infinite summation (as M approaches infinity), we obtain an estimate that is equal to the true channel impulse response h(t) convolved with the aperiodic autocorrelation of the spreading sequence Si 104.
As the foregoing demonstrates, we can not obtain the true channel impulse response h(t) with simple integration. The best we have is smeared by the autocorrelation of the spreading sequence Si 104. In cases where the spreading sequence Si 104 is long, the autocorrelation approaches a delta function, and the side lobes disappear. However, when the spreading sequence Si 104 is short, the sidelobes of the autocorrelation are not negligible and will cause significant distortion to the estimate of the communication channel impulse response h(t).
Improved Channel Estimates for Short Spreading Sequences
As is demonstrated below, the present invention improves the communication channel impulse response estimate by filtering the first estimated communication channel impulse response ĥM(t) to generate the estimated communication channel impulse response h(t) with a filter f selected at least in part according to the spreading sequence Si. In particular, when the time span of the communication channel 108 is limited, a zero-forcing deconvolution can be used to improve the estimate.
Referring to
In block 210, an estimated communications channel impulse response ĥM(t) is generated by the estimator 120 as a combination of com(t) and dm for m=0, 1, K, M This can be accomplished, for example using the relationship described in Eq. (24) above.
Finally, in block 212, the first estimated communication channel response ĥM(t) is filtered with a filter f selected at least in part according to the spreading sequence Si 104. In one embodiment, the filter is a finite impulse response (FIR) filter f 302 designed with the following constraints:
Af=A{circle over (×)}f Eq. (29)
Af(0)=1,Af(n)=0,0<|n|≦L Eq. (30)
wherein A{circle over (×)}f is the convolution of the autocorrelation of the spreading sequence Si 104 and the filter, and Af is the autocorrelation of the spreading sequence Si 104 after filtering.
When the estimate of the communication channel impulse response is filtered with this filter, we obtain:
Using this technique, the effects of the side lobes (aliased versions of the autocorrelation of the spreading sequence Si 104) are eliminated between L and −L. The side lobes are not completely removed (since the filter passes components greater than L and less than −L) but the result near the origin (n=0) is of primary interest, and the effect of the side lobes can be significantly reduced in this region.
If the time span (duration of the impulse response) of the communications channel is less than LTc, i.e.
∃t1<t2,t2−t1<LTc, ∀t<t1∪t>t2:h(t)≈0 Eq. (32)
(that is, there exists a time t2 greater than t1 defining a time interval t2−t1 less than LTc, and for all time outside of the interval t2−t1, h(t) is close to zero),
Then, the filtered estimate hf (or, in the earlier notation, hf (t)) is composed of an exact copy of h (h(t)), plus some aliased versions of it in non-overlapping locations. So in this case h is resolvable from hf.
Such a filter with length 2L+1 can be designed with the simple zero-forcing criteria:
wherein f(i) is the impulse response of the filter f 302 such that Af (n) is a convolution of A(n) and f(i), Af (n)=1 for n=0 and Af (n)=0 for 0<|n|≦L, and
and wherein N is a length of the chip sequence Si 104. L can be chosen such that the product LTc (the chip period Tc is known) is approximately equal to the time span (e.g. the approximate duration of the impulse response) of the channel 108.
Note that the value A(n−i) is well defined . . . it is a property of the spreading sequence Si 104, which is known apriori.
As usual, the matrix structure of the linear equations is Toeplitz. By the design requirement of the spreading sequence Si, the matrix should be well conditioned. The filter coefficients can be computed offline given the spreading sequence and desired window width L.
While the foregoing has been described in respect to non recursive filters, other filters, such as recursive filters may also be used. A recursive filter, for example, may provide perfect filtering of the sidelobes, but the result may not be the quell conditioned matrix, hence the solution may be more difficult to determine. In fact, any filter of length 2L+1 can be defined.
It has been shown that given ĥ and with filtering, it is possible to recover the true channel impulse response for a time limited channel. However, in the foregoing discussion, ĥ was obtained through integration over multiple spreading sequence periods. The number of periods we need to integrate over can be large especially if 2L≧N, since we rely on the autocorrelation of the data to suppress the aliased copies of ĥ.
In one embodiment of the present invention, supercodes, such as Walsh-like supercodes, are used to drastically reduce the amount of the integration required. This technique is especially useful in systems having sufficient a signal-to-noise ratio (SNR).
Consider a pair of length 2 Walsh codes w0 {+1, +1} and w1 {+1, −1}. These codes can be used to form a data sequence:
. . . +,+,+,−,−,− . . .
Any length 2-symbol length segment from this sequence can be described as either w0 or −w0, except for a single w1 in the center. If this sequence is now correlated with w1, the resulting correlation will be characterized by a single peak in the center and zeros elsewhere (except near the boundaries). Negatives of the two codes may be taken (e.g. w0={−1,−1} and w1={−1,+1}and/or their roles may be swapped (e.g. w1={+1, +1} and w0={+1,−1} with the same result. The three additional patterns thus obtained and their correlator patterns are listed below:
Since the following results are equivalent for all of the above patterns when the additive noise is uncorrelated at sampling points, we limit our discussion to the first data sequence (i.e. . . . +,+,+,−,−,− . . . ). In this case,
di=+1,∀l1<i≦0 Eq. (34)
di=−1,∀l2≧i>0 Eq. (35)
If the condition that −l1N>(2N+L)Il2N>(2N+L) can be satisfied, ĥ can be reconstructed free of aliasing interference, and by deconvolution (aforementioned filtering technique), h can be reconstructed as well.
From the foregoing, it can be determined that a small supercode imposed on a portion of the data sequence can provide an alias free estimate of the communication channel impulse response when the channel response is time-limited. The only source of distortion from this estimate comes from the additive noise, which can be suppressed by the spreading gain times a factor of 2 (to account for the supercode). When the noise is low, such an approach is preferable over long integrations.
For moderate values of L, such code sequences can be easily embedded within a longer preamble to packet data, probably with multiple copies, without adversely affecting the spectrum properties of the transmission. In addition, when the signal to noise ratio (SNR) is low, traditional integration as outlined in the first half of this section can still be carried out on such a preamble to obtain a higher processing gain against the additive noise.
In block 502, a data sequence di 102 is generated. The data sequenced di 102 includes one or more data packets 128, each data packet having a preamble 124 including a constrained portion Cdi 602. The preamble 124, can be, for example, in the form of a pseudorandom code.
The constrained portion Cdi 602 is associated with at least two codes, w0 and w1. The codes w0 and w1 are selected such that the correlation Acode(k) of the constrained portion Cdi 602 and at least one of the codes w0 and w1, is characterized by a maximum value at k=0, and they value less than the maximum value at k≠0.
Ideally, the correlation Acode(k) of the constrained portion Cdi 602 is an impulse, with Acode(k) equal to one at k=0, and equal at all other values for k. However, because such correlation characteristics are typically not realizable, codes w0 and w1 can be chosen to approximate this ideal. For example, codes w0 and w1 can be chosen such that the correlation Acode(k) of the constrained portion Cdi 602 and at least one of the codes w0 and w1, is such that Acode(k)=1 at k=0 and Acode(k)≈0 for substantially all k ≠0. Or, codes w0 and w1 can be chosen such that the correlation Acode(k) of the constrained portion Cdi 602 and at least one of the codes w0 and w1, is such that Acode(k)=0 for 0<|k|≦J wherein J is selected to minimize the correlation of the constrained portion Cdi with the one of the codes w0, w1 for substantially all k≠0.
In one embodiment, the constrained portion Cdi 602 comprises the pair of length two Walsh codes in the first sequence described above. Other embodiments are envisioned in which the codes are of another length (other than length two), or are codes other than a Walsh code.
In block 504, a chip sequence cj 106 is generated. The chip sequence cj 106 is generated by applying a spreading sequence Si 104 of length N and having a chip period Tc to the data sequence di 102.
This spread chip sequence cj 106 is transmitted through a linear transmission channel 108 having a combined channel impulse response h(t). The transmitted signal is received by a receiver 112.
In block 506, the receiver 112 receives the transmitted signal, and correlates the received signal r(t) 114 with the known spreading sequence Si 104 to identify the data as intended to be received by the receiver 112. This is accomplished by generating com(t)=co(t+mNTc) for m=0, 1, Λ, M, using techniques analogous to those which were described above.
In block 508, an estimated communication channel impulse response ĥM(t) is generated as a combination of the correlation com(t) and the data sequence dm for m=0, 1, Λ, M.
In one embodiment, the codes w0 and w1 are two symbol-long Walsh codes, and ĥM(t) computed as
with M=2. In this case, ĥM(t) equals
Hence, where the data has been constrained with a symbol such as a Walsh super code, an improved estimate of the communications channel impulse response can be obtained by taking two consecutive values of the correlation of the received data and the spreading sequence and multiplying each result by the data sequence. In the example of Walsh codes w0={−1, −1} and w1 ={−1, +1} applied to the sequence . . . +,+,+,−,−,− . . . , and w1 applied at the receiver, the result is that one of the values of co(t) is multiplied by a one, and the other is multiplied by a minus one. Hence, the output will produce essentially no response until the transition between the two Walsh codes occurs, at which time a clean, alias-free copy of the communications channel impulse response will be produced.
A length 2 supercode for improved alias suppression has been described. When the SNR is low and longer integration period is desirable, it would appear attractive to generalize the code to longer lengths. Counterintuitively, this is not possible. This result is shown below, by presenting a definition of such codes and showing that no such codes with length larger than 2 exist for binary data sequences.
An infinite sequence A forms an impulsive correlation pair with a length L finite sequence B if A satisfies the following equations:
A(i)=B(i), ∀0≦i<L
By contradiction, it can be shown that for binary sequences, such a pair does not exist for L>2. Supposing such sequences exist, it is apparent that L must be even. Considering two such cases (L=4k and L=4k+2)
In the first case, L=4k, consider the first constraint:
Since there are 4k summands in the equation taking values from {+1, −1} half of them or 2k terms must be positive, and the other half negative. The product of all the summands must therefore be 1.
Similar arguments can be used to show that:
A(i)=B(L+i),−L<i<0 Eq. (41)
But this implies that:
which contradicts the assumption that the cross-correlation is zero everywhere except at the origin. Hence, by contradiction, we have shown that for binary sequences, such a pair does not exist for L>2.
A similar argument can be applied for the second case, L=4k+2, except that the product of all the summands in each equations must be −1, since now we must have 2k+1 negative terms. This leads to:
A(i)=(−1)iB(L+i),−L<i<0 Eq. (43)
When k>0,
Summing the two equations together we have:
However, this result is clearly impossible since there are an odd number of terms on the left. By contradiction it is therefore shown that it is impossible to satisfy the constraints when L>2 for binary sequences.
The foregoing has demonstrated that distortions due to this spreading sequence design can be removed from the estimate of the communications channel impulse response. Attention is now turned to the remaining distortion caused by the additive noise. n(t) 121. Assuming that the noise source is white and stationary and is filtered by a receiver filter for bandwidth matching, its distortion measure can be defined as follows:
The ensemble expectation of Eq. (46) can be taken over n(t), whose autocorrelation can be determined by the front end receive filter, and is assumed to be known).
When the noise n(t) is white, we have:
The processor system 1102 comprises a processor 1104 and a memory 1106, such as random access memory (RAM). Generally, the processor system 1102 operates under control of an operating system 1108 stored in the memory 1106. Under control of the operating system 1108, the processor system 1102 accepts input data and commands and provides output data. Typically, the instructions for performing such operations are also embodied in an application program 1110, which is also stored in the memory 1106. The processor system 1102 may be embodied in a microprocessor, a desktop computer, or any similar processing device.
Instructions implementing the operating system 1108, the application program 1110, and the compiler 1112 may be tangibly embodied in a computer-readable medium, e.g., data storage device 1124, which could include one or more fixed or removable data storage devices, such as a zip drive, floppy disc drive, hard drive, CD-ROM drive, tape drive, etc. Further, the operating system 1108 and the application program 1110 are comprised of instructions which, when read and executed by the computer 1102, causes the computer 1102 to perform the steps necessary to implement and/or use the present invention. Application program 1110 and/or operating instructions may also be tangibly embodied in memory 1106 and/or data communications devices 1130, thereby making an application program product or article of manufacture according to the invention. As such, the terms “article of manufacture,” “program storage device” and “computer program product” as used herein are intended to encompass a computer program accessible from any computer readable device or media.
Those skilled in the art will recognize many modifications may be made to this configuration without departing from the scope of the present invention. For example, those skilled in the art will recognize that any combination of the above components, or any number of different components, peripherals, and other devices, may be used with the present invention. For example, an application-specific integrated circuit (ASIC) or a Field-Programmable Gate Array (FPGA) can be used to implement selected functions, including the correlator 116, and filtering functions can be performed by a general-purpose processor, as described above.
This concludes the description of the preferred embodiments of the present invention. The foregoing description of the preferred embodiment of the invention has been presented for the purposes of illustration and description. It is not intended to be exhaustive or to limit the invention to the precise form disclosed. Many modifications and variations are possible in light of the above teaching. It is intended that the scope of the invention be limited not by this detailed description, but rather by the claims appended hereto. The above specification, examples and data provide a complete description of the manufacture and use of the composition of the invention. Since many embodiments of the invention can be made without departing from the spirit and scope of the invention, the invention resides in the claims hereinafter appended.
This application is related to the following co-pending and commonly assigned patent application(s), all of which applications are incorporated by reference herein: application Ser. No. ______, entitled “METHOD AND APPARATUS FOR REMOVING CODE ALIASES WHEN USING SHORT SYNCHRONIZATION CODES,” filed on same date herewith, by Haitao Zhang, attorney's docket number 020305.