The present invention relates to a channel estimator and, more particularly, to a channel estimator that estimates the response of a channel through which signals are received.
For communication systems utilizing a periodically transmitted training sequence, least-squares algorithms or correlation algorithms have been the most widely used alternatives for the estimation of the channel. The estimation of the channel may then be used for various purposes such as initializing the values of the taps of an equalizer at a receiver.
Both algorithms use a stored copy of the known transmitted training sequence at the receiver. The properties and length of the training sequence generally depend on the particular standard specifications of the communication system. In addition, a training sequence is also used in a communication system for synchronization to indicate the beginning and/or end of a transmitted data frame or packet.
In general, a communication system 10 that utilizes an equalizer adjusted by use of the training signal is shown in
where IkεA≡{α1, . . . , αM}⊂C1 is the transmitted training sequence, which is a discrete M-ary sequence taking values on the generally complex M-ary alphabet A, which also constitutes a two dimensional modulation constellation. The function q(t) as shown in a block 16 represents the pulse shaping performed by a transmitter filter of finite support [−Tq/2, Tq/2].
The overall complex pulse shape is denoted by p(t) and is given by the following equation:
p(t)=q(t)*q*(−t) (2)
where q*(−t) shown in a block 18 is the receiver matched filter finite impulse response. Although it is not required, it is assumed, for the sake of simplifying the notation, that the span, Tq, of the transmitter and receiver filters is an integer multiple of the symbol period T; that is, Tq=NqT, NqεZ+.
The physical channel between the transmitter and the receiver is denoted by c(t,τ) and is shown by a block 20, and throughout the discussion below the concatenation of p(t) and the physical channel c(t,τ) will be denoted by h(t) and is defined by the following equation:
h(t)=q(t)*c(t,τ)*q*(−t)=p(t)*c(t,τ) (3)
where h(t) is generally referred to as the channel response. The physical channel c(t,τ) between the transmitter and the receiver is generally described by the following impulse response:
which describes a time-varying channel, where {ck(τ)}⊂C1, −K≦k≦L, and t,τεR. However, it is assumed that the time-variation of the channel is slow enough that c(t,τ)=c(t) can be assumed to be a fixed (static) inter-symbol interference (ISI) channel throughout the training period; that is, it is assumed that ck(τ)=ck, which in turn implies that c(t) is defined by the following equation:
for 0≦t≦NT, where N is the number of training symbols.
The summation indices K and L, where K is the number of anti-causal multi-path delays and L is the number of causal multi-path delays, are dependent on the channel and are more fully explained below in the discussion of correlation based channel estimation and subsequently. The multi-path delays τk are not assumed to be at integer multiples of the sampling period T. Indeed, an accurate and robust way of recovering the pulse shape back into the concatenated channel estimate when the multi-path delays are not exactly at the sampling instants is disclosed herein.
Equations (3) and (5) can be combined according to the following equation:
Because both p(t) and c(t) are complex functions, the overall channel impulse response h(t) is also complex. Therefore, p(t), c(t), and h(t) can be written as functions of their real and imaginary parts as given by the following equations:
p(t)=pI(t)+jpQ(t) (7)
c(t)=cI(t)+jcQ(t) (8)
h(t)=hI(t)+jhQ(t) (9)
Substituting equations (7)–(9) into equation (6) produces the following equation:
The output matched filter 18 is denoted y(t) and is given by the following equation:
where {tilde over (ν)}(t) designates the complex noise process after the matched filter. As in the case of equations (7)–(9), y(t) can be written in terms of its real and imaginary parts as y(t)=yI(t)+jyQ(t). Sampling the output of the matched filter 18 at the symbol rate using a sampler 22 produces the discrete-time representation of the overall communication system according to the following equation:
The transmitted symbols are composed of frames (or packets) of length N′, where the first N symbols are the training symbols ak. Within a frame of length N′, the symbols may be denoted by the following equation:
where the distinction of the first N symbols is made to indicate that they are the known training symbols, and where it is possible that the training symbols ak belong to a subset of the M-ary constellation alphabet A; that is {akεÃ⊂A≡{α1, . . . , αM}}. In fact, for the 8-VSB digital television system, the signal alphabet is A≡{±1,±3,±5,±7}, while the training sequence can only take binary values within the subset Ã≡{−5,+5}.
h=[h[−Na],h[−Na+1], . . . ,h[−1],h[0],h[1], . . . , h[Nc−1],h[Nc]]T (14)
where Na denotes the number of anti-causal taps of the channel, NC denotes the number of causal taps of the channel, and Na+NC+1 is the total memory of the channel.
Based on equation (12) and assuming that N≧Na+Nc+1 where N again is the number of training symbols, the output of the matched filter 18 corresponding to the known training symbols can be written as follows:
which can be compactly written according to the following vector equation:
y=Ah+ũ (15)
where y comprises values extracted by a sample extractor 32 from the received signal and is defined according to the following equation:
y=[y[Nc],y[Nc+1], . . . ,y[N−1−Na]]T (16)
where A is based on the training signal and is defined according to the following equation:
and where {tilde over (ν)} is defined according to the following equation:
{tilde over (υ)}=[{tilde over (υ)}[Nc],{tilde over (υ)}[Nc+1], . . . , {tilde over (υ)}[N−1−Na]]T.
As long as the matrix A is a tall matrix and of full column rank such that N≧2(Na+Nc)+1 and rank {A}=Na+Nc+1, then the least-squares solution which minimizes the objective function given by the following equation
JLS(h)=||y−Ah||2 (18)
exists and is unique and is given by the following equation:
ĥLS=(AHA)−1AHy (19)
where the superscript H denotes Hermitian transpose, and where ĥLS is the least-squares estimation of the channel between the transmitter and the receiver. Thus, in accordance with equation (19), a matrix multiplier 34 multiplies the matrix y by a matrix A+, which is (AHA)−1AH and which is stored in a memory 36. Thus, the matrix multiplier produces the channel estimate ĥLS according to equation (19).
One of the problems associated with the least-squares based channel estimation is the uncertainty associated with the actual channel spread, i.e., the spread between the outermost multi-paths. That is, this uncertainty arises because the channel memory and the first and last channel taps are not actually known by the receiver. Therefore, in order to set up equations (16)–(19), some sort of preprocessing is needed, such as correlating the stored training symbols with the received signal, or some pre-assumed starting and ending points for the channel must be used such that the total channel spread is not too long that would otherwise result in the A matrix being a wide matrix rather than a tall one.
Even if the starting and ending points of the actual channel spread are known by the receiver, least-squares based channel estimators still have problems that may be insurmountable. For example, least-squares channel estimators cannot properly estimate the channel from the training sequence if the channel spread is too long for the training sequence to effectively support the channel.
That is, if N<2(Na+NC)+1, such that the number of training symbols is insufficient to support, or create, an over-determined channel convolution matrix A, then there are two cases to consider.
First, if the full A matrix as given by equation (17) is used, the least-squares estimate of equation (19) no longer exists. That is, the matrix AHA is singular.
Second, if the receiver does not attempt to construct the full A matrix and instead constructs an alternative thinner matrix à with N≧2(Ña+Ñc)+1 such that Ña<Na and/or Ñc<Nc, then the inverse of ÃHà will exist but the new least-squares solution given by the following equation:
{tilde over (h)}LS=(ÃHÃ)−1ÃHy
will not be a very desirable estimate. That is, the resulting channel estimate no longer resembles the actual channel, especially if the actual channel spread is much longer that the training sequence can support.
An example is illustrated in
As indicated above, it is also known to apply auto-correlation in estimating the channel. The sampled received sequence at the output of the matched filter is denoted by y[n] and is given in equation (12), where the pre-cursor symbols are at positions n≧0 and post-cursor symbols are at positions n<0. The known training sequence is denoted by s[n]=an for n=0, . . . , N−1. The auto-correlation functions rs[m] and rsy[m] are defined in accordance with the following equations:
where {s[k]εÃ}.
It is assumed that, in order to be able to use correlation based channel estimation, the training sequence must belong to a certain class of sequences and thereby possess certain desirable properties. An exemplary class of sequences is maximal length pseudo-noise (PN) sequences. A PN-sequence of length n is denoted herein as PNn. In general, the periodic auto-correlation of a binary valued ({+A, −A}) PNn sequence is given by the following equation:
More specifically, based on the auto-correlation property of equation (22), the channel can be estimated by cross-correlating s[n] (the training sequence known and stored in the receiver) with the received sequence y[n]. The initial pre-cursor impulse response estimate {tilde over (h)}a[n] is determined from the cross-correlation of the training sequence s[n] stored in the receiver and the actual received symbols at lags from 0 to −N+1 with respect to the start of the received training sequence. That is, {tilde over (h)}a[n] is given by the following equation:
Similarly, the initial post-cursor response estimate ĥc[n] is determined from the cross-correlation of the training sequence stored in the receiver and the actual received symbols at lags from 1 to N−1 with respect to the start of the received training sequence. That is, ĥc[n] is given by the following equation:
where rs[0] in equations (23) and (24) is defined according to the following equation:
Equations (23) and (24) imply that the correlations should be extended on either side by the length of the training sequence in order to obtain the longest possible channel estimate within the limits of the transmission standard, with the assumption that N≧K and N≧L.
With respect to the channel estimation procedure given by equations (23) and (24), the overall initial estimated channel transfer function {tilde over (H)}(z) is given by the Z-transform of the concatenated channel estimates according to the following equation:
where the concatenated estimated channel impulse {tilde over (h)}[n] is given by the following equation:
Equation (25) can be rewritten in accordance with the following equation:
where à is a (N+Na+NC) by (Na+NC+1) Toeplitz matrix with a first column [a0, a1, . . . , aN−1, 0, . . . , 0]T and a first row [a0, 0, . . . , 0]T, and where {tilde over (y)}=[y[−Na], . . . , y[N+NC−1]]T. In order to get rid of the side lobes of the aperiodic autocorrelation, the autocorrelation of the training sequence is determined according to the following equation:
and then Raa can be inverted and applied to {tilde over (H)}u(z) according to the following equation:
{tilde over (H)}c(z)=Raa−1{tilde over (H)}u(z) (29)
However, the channel estimate {tilde over (H)}c(z) obtained by equation (29) has the contributions due to unknown symbols prior to and after the training sequence, as well as the additive channel noise; only the side lobes due to the aperiodic correlation is removed.
As discussed below, the correlations of equations (23) and (24) will clearly yield peaks at {DkaεZ+, for k=−K, . . . , −1, 0, and at {DkcεZ+, for k=1, . . . , L. These delays Dka and Dkc are the sampling instants closest to the locations of the actual physical channel multi-path delays τk for k=−K, . . . , −1, 0, 1, . . . , L within a symbol interval.
As shown in
for n=−N+1, . . . , −1, 0, 1, . . . , N−1, where the subscript c denotes that the corresponding function has been cleansed of side lobes. As a result, the estimated transfer function Ĥc(z) after thresholding can, in general, be written according to the following equation:
where αI={tilde over (α)}I/{tilde over (α)}0 for 1≦i<L, and βk={tilde over (β)}k/{tilde over (α)}0 for 1≦k≦K. The notation of equation (31) is adopted because, after thresholding, there are usually very few dominant taps. Accordingly, the channel after correlation and thresholding is sparse. Moreover, the peaks in the channel estimate no longer have the pulse shapes of the actual channel response.
In equation (31), the coefficients βk correspond to the post-cursor (causal) part of the correlation, and the coefficients α1 correspond to the pre-cursor (anti-causal) part of the correlation. The delays Dka are the anti-causal delays, and the delays Dlc are the causal delays. As a convention, it is assumed that 1≦D1c<D2c< . . . <DLc and similarly that 1≦D1a<D2a< . . . <DKa. Also, by the construction of the channel estimates in equations (23) and (24), DKa≦N−1 and DLc≦N−1.
At this point, the causal part Ĥcc and the anti-causal part Ĥac of the transfer function of equation (31) can be defined in accordance with the following equations:
Ĥac(z)={tilde over (α)}0(βKzDa
Ĥcc(z)={tilde over (α)}0(α1z−D
such that Ĥc(z)=Ĥac(z)+Ĥcc(z). Again the first subscript denotes whether the part is anti-causal or causal, and the second subscript denotes that the part has been cleansed of the side lobes. The partition of equations (32) and (33) enable the development of the appropriate feed-forward filter initializations for a Decision Feedback Equalizer (DFE).
Unfortunately, if the PN training sequence is finite and the standard linear correlation is used, the auto-correlation values corresponding to the non-zero lags will not be constant and will not be as low as −A2. As a simple illustration, a sequence composed of six PN511 appended back to back may be considered so that the sequence is given by the following equation:
y=[PN511, PN511, PN511, PN511, PN511, PN511] (34)
Then, rxy[m], with x=[PN511]T, will be given as shown in
The training sequence that is used for digital television is given by the following equation:
{tilde over (s)}=[ss,PN511,PN63,±PN63,PN63]T
where ss=[+5−5−5+5] represents the four segment sync symbols at the beginning of the frame sync segment. There are also reserved frame bits and data bits right before and after the training sequence {tilde over (s)}.
In summary, the correlations of the received signal with the stored training sequence will be noisy because the PN sequences are finite in length so that they will not achieve the desired low correlation values for non-zero lags, and because the span of the cross-correlation includes the known training sequence as well as random data symbols and reserved symbols.
Because of these reasons, the initial estimate of the channel that is acquired from the correlation of the matched filter output with the stored training sequence will not yield a clean and accurate channel estimate. Accordingly, even if the channel tap estimates {αk,βk} are scaled properly, they will not be very close to the actual channel tap values {ckp(0)} due to the reasons given above as well as due to the fact that there may also be tap values in the tails of the adjacent multi-paths. Hence, both the amplitudes and phases of the estimated tap values may be incorrect.
Moreover, after the thresholding takes place, the most likely result is that the thresholded correlation will include only the peaks of the channel taps, which implies that the tails of the raised cosine pulse shape that are under the noisy correlation output may be eliminated entirely. This elimination of the tails may lead to critical performance loss in the initialization of the equalizer 12 and a slower convergence onto the correct tap values for the equalizer 12 after initialization.
The present invention mitigates the problems of the least-squares channel estimators and the correlation channel estimators providing a channel estimator that is a blend of a least-squares channel estimator and a correlation channel estimator.
In accordance with one aspect of the present invention, a method for determining a channel response comprises the following: determining a matrix Γ based on a correlation of a received signal and a known training sequence; extracting a vector y from a received signal; and, estimating the channel response from a least-squares solution based on the matrix Γ, the vector y, and a matrix A formed from the elements of the known training sequence.
In accordance with another aspect of the present invention, a method for determining a channel response comprises the following: determining a matrix Γ based on a correlation of a received signal and a known training sequence; extracting a vector y from a received signal; determining an unknown coefficient vector γLS according to the following equation:
γLS=(ΓHAHAΓ)−1ΓhAHy
wherein the unknown coefficient vector γLS comprises a least-squares solution of the following equation:
y=AΓγ+ν
and wherein A comprises a matrix based on the training sequence; and, determining a channel response according to the following equation:
{tilde over (h)}new=ΓγLS.
In accordance with still another aspect of the present invention, a method for determining a channel response comprises the following: correlating a received signal with a training sequence; constructing a matrix Γ by approximating a desired pulse shape for each peak of the correlation; constructing a matrix A based on the training sequence; solving a least-squares solution of the following equation in order to derive an unknown coefficient vector γ:
y=AΓγ
wherein y is a vector representing a received signal; and, determining the channel response from the unknown coefficient vector γ and the matrix Γ.
These and other features and advantages will become more apparent from a detailed consideration of the invention when taken in conjunction with the drawings in which:
a)–3(e) illustrate an exemplary simulation useful in explaining problems associated with the least-squares channel estimator of
Thus, the blended estimation procedure illustrated by
where the zeros represent the spaces between the peaks. The structure and notation of equations (31) and (35) are important to the derivation of the algorithm below.
As indicated above, the pulse shape p(t) is convolved by the physical channel c(t) according to equation (6). As also indicated above, this pulse shape is lost due to the correlation and thresholding process. Therefore, it is desirable to recover this pulse shape for each of the multi-paths. Accordingly, the shifted and scaled copies of the pulse shape p(t) (shifted by τk and scaled by ck) may be approximated by a linear combination of three pulse shape functions shifted by half a symbol interval (T/2). This linear combination is given by the following equation:
where {γl(k),K≦k≦L}l=−11⊂C1. The last part of equation (36) contains an approximation of the pulse shape of the transmitter/receiver filter combination. By making this approximation, the tails of the complex pulse shape p(t) can be efficiently recovered even though these tails are buried under the noisy output of the correlation processing and are lost when uniform thresholding is applied.
To arrive at this approximation, three vectors pk are introduced for k=−1, 0, +1, each containing T spaced samples of the complex pulse shape p(t) shifted by kT/2 (for k=−1, 0, +1) according to the following three equations:
By concatenating these vectors, a (2Nq+1)×3 matrix P is defined according to the following equation:
P=[p−1,p0,p1] (40)
This vector represents shifted and sampled values of a raised cosine pulse.
A matrix Γ whose columns are composed of the shifted vectors pk, where the shifts represent the relative delays of the multi-paths. This matrix Γ is given by the following equation:
where Γ is of dimension (DKa+DLc+2Nq+1)by 3(K+L+1) dependent on the channel spread, and 0mxn denotes an m by n zero matrix such as a DKa−DK−1a by 3 matrix of zeros. The received signal vector y and the training sequence vector A are defined by equations (16) and (17), respectively. Because of the assumption that q(t) spans Nq symbol durations, q[n] has Nq+1 sample points and p[n] in turn has 2Nq+1 samples. Therefore, Na=DKa+Nq and Nc=DLc+Nq.
An unknown vector γ is defined in accordance with the following equation:
γ=[γ(−K), . . . ,γ(0), . . . ,γ(L)]T (42)
where γ(k)=[γ−1(k),γ0(k),γ1(k) for −K≦k≦L. The unknown vector γ has coefficients {γ−1−K where n=−1, 0, 1, and where k=−K, . . . , 0, . . . , L and has a length of 3(K+L+1). Similarly to equation (15), the vector y representing the output of the receiver's matched filter may be given by the following equation:
y=AΓγ+ν (43)
where ν is the observed noise vector. Using a least-squares solution, the unknown coefficient vector γ can be estimated in accordance with the following equation:
{circumflex over (γ)}LS=(ΓHAHAΓ)−1ΓHAHy=(AΓ)⋄y (44)
Once the least-squares estimated vector γLS is obtained, the new channel estimate ĥnew, where the pulse tails are recovered, can simply be obtained in accordance with the following equation:
ĥnew=Γ{circumflex over (γ)}LS (45)
The multiplier 64 multiplies {circumflex over (γ)}LS from the matrix multiplier 54 by Γ according to equation (45) in order to produce ĥnew. Equations (44) and (45) may be solved using either a singular value decomposition (SVD) method, by a conjugate gradient (CG) method, or by any other suitable method.
Once the new estimated channel vector is determined per equation (45), the reconstructed vector at the output of the receiver matched filter can be defined, based on ĥnew according to the following equation:
{tilde over (y)}=Aĥnew (46)
{tilde over (y)}=AΓ{circumflex over (γ)}LS (47)
{tilde over (y)}=AΓ(ΓHAHAΓ)−1ΓHAHy (48)
The variance of the noise samples at the output of the receiver's matched filter, σ{circumflex over (v)}2, can be estimated based on the following equation:
which arises naturally. The length of the vector Y is N−DKa−DLc.
The tap weight initializer 66 may then initialize the tap weights of the taps of the equalizer 12 using the channel estimate ĥnew and the noise variance {circumflex over (σ)}{circumflex over (v)}2 in a manner that is known in the art.
When the present invention is used to initialize the taps weights of the taps of an equalizer, the noise variance given by equation (49) and the new channel response given by equation (45) can be combined in any known way so as to produce the initial tap weights for the taps of the equalizer. Alternatively, instead of combining the noise variance given by equation (49) and the new channel response given by equation (45), a small non-zero value noise variance may be assumed and this assumed value may be combined with the new channel response given by equation (45) so as to produce the initial tap weights for the taps of the equalizer.
Modifications of the present invention will occur to those practicing in the art of the present invention. For example, the present invention as described above relies on the use of a training signal. However, other signals, such as data signals, can be used to implement the present invention. Also, although the invention is specifically shown for use in initializing the taps of an equalizer, the channel estimate produced in accordance with the present invention can be used for other purposes.
Accordingly, the description of the present invention is to be construed as illustrative only and is for the purpose of teaching those skilled in the art the best mode of carrying out the invention. The details may be varied substantially without departing from the spirit of the invention, and the exclusive use of all modifications which are within the scope of the appended claims is reserved.
The present application claims the benefit of Provisional Application Ser. No. 60/336,415 filed on Oct. 24, 2001.
Number | Name | Date | Kind |
---|---|---|---|
5903610 | Skold et al. | May 1999 | A |
6470225 | Yutkowitz | Oct 2002 | B1 |
20030058787 | Vandenameele-Lepla | Mar 2003 | A1 |
20040209573 | Happonen et al. | Oct 2004 | A1 |
Number | Date | Country |
---|---|---|
0 755 141 | Jan 1997 | EP |
WO 9838772 | Sep 1998 | WO |
Number | Date | Country | |
---|---|---|---|
20030099309 A1 | May 2003 | US |
Number | Date | Country | |
---|---|---|---|
60336415 | Oct 2001 | US |