This invention relates to a method at the receiver of estimating the frequency response of multipath channels which is particularly but not exclusively directed to DOCSIS 3.1 uplink channels.
While the method is particularly applicable to the above, it can also be used in any digital communication system, including but not limited to single-carrier QAM and multi-carrier OFDM/OFDMA systems.
The demand for data services has steadily increased, putting continuous pressure on data service providers to increase the data throughput of their networks. CATV networks are governed by a set of DOCSIS standards that place hard limits on bandwidth and data rates. The latest version of the DOCSIS standard, DOCSIS 3.1, was released in October 2013. DOCSIS 3.1 increases the bandwidth and data throughput available in CATV networks to as much as 10 Gbps downstream and 1 Gbps upstream.
DOCSIS 3.1 is markedly different from prior versions of the standard in that Orthogonal Frequency Division Multiple Access (OFDMA) is used in both the upstream and downstream directions. By effectively modulating signals on narrow-band carriers, OFDM can mitigate inter-symbol interference while simplifying the structure of the channel equalizer.
While OFDMA systems have been studied for many years, the DOCSIS 3.1 standard is unique in that it combines OFDMA with very high-order modulation schemes (up to 4096-QAM). Furthermore, cable plants generate a number of channel impairments, some of which differ from typical OFDMA systems discussed in the literature.
In order to effectively demodulate a spectrally efficient signal, it is necessary to employ coherent demodulation which involves estimation and tracking of the multipath channel. To aid in channel estimation, DOCSIS 3.1 specifies a pilot-based wide-band probing mode, where the sub-carriers of an OFDM symbol are dedicated to channel estimation.
There are several channel estimation techniques that have been studied for pilot-based estimation. The simplest one, which is Least Squares (LS) estimation, as set out in Document 1 below, does not require any channel state information (CSI). LS estimators work with samples in the frequency domain and are relatively low in complexity. However, they suffer from relatively high mean-square error, which is proportional to the power of additive white Gaussian noise (AWGN).
A better technique, which also performs estimation in the frequency domain, is linear minimum mean-square error (LMMSE) estimation, as set out in Document 2 below. This technique yields much better performance than the LS estimator, especially under low signal-to-noise ratio (SNR) scenarios. The major drawback of the LMMSE estimator is that it requires knowledge of the channel auto-correlation matrix and the noise variance, which are usually unknown at the receiver. The computational complexity of the LMMSE estimator is also very high as it requires a matrix inversion. Many have attempted to reduce the complexity of the LMMSE estimator (as set out in Documents 3 and 4 below) at the expense of a small sacrifice in estimation accuracy.
Another very good approach uses discrete Fourier transform (DFT) based channel estimation. The DFT-based method first employs an LS estimator to obtain the channel's frequency response (CFR). Then the discrete-time channel impulse response (CIR) is obtained by performing an inverse discrete Fourier transform (IDFT) on the CFR. Since the energy of the CIR is typically concentrated in a few taps having short delays, the algorithm's performance can be improved if a few taps whose power is significantly higher than noise are preserved while the rest are forced to zero (as set out in Document 5 below). This operation is commonly referred to as denoising. After denoising, the CIR is transformed back to the frequency domain to obtain the estimated CFR. Consequently, the DFT approach helps to remove the noise power from the LS-estimated CFR. In general, DFT-based methods have moderate complexity thanks to Fast Fourier Transform (FFT) algorithms and perform much better than the LS estimator at low SNRs (as set out in Document 6 below).
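The DFT-based denoising steps described above can be sketched as follows. This is a minimal illustration only: the function names are hypothetical, the transforms are written as direct sums rather than FFTs, and the rule of keeping a fixed number of strongest taps is an assumption standing in for whatever tap-selection rule a given estimator uses.

```python
import cmath

def idft(X):
    """Inverse DFT as a direct O(N^2) sum (adequate for a small illustration)."""
    N = len(X)
    return [sum(X[k] * cmath.exp(2j * cmath.pi * k * n / N) for k in range(N)) / N
            for n in range(N)]

def dft(x):
    """Forward DFT as a direct O(N^2) sum."""
    N = len(x)
    return [sum(x[n] * cmath.exp(-2j * cmath.pi * k * n / N) for n in range(N))
            for k in range(N)]

def dft_denoise(Y, X, num_taps):
    """LS estimate -> IDFT -> keep the num_taps strongest taps -> DFT."""
    h_ls = [y / x for y, x in zip(Y, X)]   # LS estimate of the CFR
    cir = idft(h_ls)                       # noisy discrete-time CIR
    order = sorted(range(len(cir)), key=lambda n: abs(cir[n]), reverse=True)
    keep = set(order[:num_taps])           # assumed selection rule
    denoised = [c if n in keep else 0j for n, c in enumerate(cir)]
    return dft(denoised)                   # denoised CFR estimate
```

When the retained taps are the true channel taps, the noise carried by the discarded taps is removed, so the denoised estimate has lower error than the LS estimate it started from.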
However, with the DFT method, performance degradation can occur due to leakage between samples in the discrete-time CIR. There are two sources of leakage. The first is leakage by multipath components that have non sample-spaced delays. In the case of non sample-spaced delays, the energy from a single multipath component is spread over multiple sample-spaced taps in the discrete time CIR. When the noise-only taps are eliminated, portions of the leakage energy are also removed and thus the estimation will show an error floor. The second type of leakage emerges if not all sub-carriers are used for channel estimation. In particular, in a typical OFDM system, the sub-carriers at both ends of the spectrum are left null to form guard bands. Not using the end sub-carriers degrades the performance of DFT-based techniques as this is equivalent to placing a rectangular window in the frequency domain which translates to convolution with a sinc-like function in the time domain. This causes the energy of the CIR to spread out in time. Denoising cuts off the tails of the sinc-like functions causing ripples around the edge sub-carriers when the denoised CIR is converted back to the frequency domain (as set out in Document 7 below). This phenomenon is often referred to as an “edge effect” or “border effect” and results in estimation errors not being equally distributed over all sub-carriers. To date studies that effectively address the two leakage issues of the DFT-based techniques have not been found.
The usefulness of the standard channel estimation techniques discussed above is somewhat limited in DOCSIS 3.1 systems, as upstream wideband probing has a subcarrier skipping option.
In subcarrier skipping mode, multiple upstream users transmit wideband probing signals on different subcarriers of the same OFDM symbol. Each user transmits on a different set of subcarriers that are spaced K sub-carriers apart, where K is the number of simultaneous users. The use of subcarrier skipping with K simultaneous users allows a K-fold increase in the efficiency of the wideband probing process as compared to a single user probing scheme. However, it places additional computational burden on the receiver, which must generate an estimate of the entire channel for each user despite receiving pilots on only every K'th subcarrier.
The following documents provide further information on this subject:
All publications, patents, and patent applications mentioned in this specification are herein incorporated by reference to the same extent as if each individual publication, patent, or patent application was specifically and individually indicated to be incorporated by reference. The disclosures in the above documents can be considered for further details of any matters not fully discussed herein.
According to the invention there is provided, in a digital communication system comprising:
Preferably the initial rough approximation of the multipath channel's impulse response is upsampled to provide better resolution.
Preferably the peak detection is enhanced by performing log-domain interpolation.
Preferably the method further is arranged where the transmitted signal is an OFDM/OFDMA signal having the characteristics that the test sequence consists of a set of pilot subcarriers which are inserted into a single OFDM/OFDMA symbol, the pilot subcarriers are equally spaced with a constant subcarrier skipping factor of K where the pilot subcarriers can have guard-band at both sides of the spectrum wherein the signal processing algorithm has the following characteristics:
Preferably the initial rough approximation of the channel's impulse response is calculated by using an (NU/K)-point discrete Fourier Transform, as defined in eq. (17) hereinafter, where U is a suitably chosen upsampling factor.
Preferably the peak detection is enhanced by performing log-domain interpolation as defined in eq. (38) hereinafter.
One embodiment of the invention will now be described in conjunction with the accompanying drawings in which:
In the drawings like characters of reference indicate corresponding parts in the different figures.
Consider the baseband-equivalent OFDMA system shown in
S(m)=S(0)+mK, m=0,1, . . . ,M−1, (1)
where S(0) is the “start” sub-carrier and K is the sub-carrier “skipping” factor.
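The pilot indexing of (1) can be illustrated with a short sketch. The function name and the example values are hypothetical; in particular, giving each of the K users a start sub-carrier one higher than the previous user is an assumption made only to show that the resulting pilot sets interleave without overlapping.

```python
def pilot_indices(start, skip, count):
    """S(m) = S(0) + m*K for m = 0, 1, ..., M-1."""
    return [start + m * skip for m in range(count)]

# Hypothetical example: K = 4 users share one probing symbol,
# each starting on a different sub-carrier.
K, M = 4, 5
users = [pilot_indices(10 + user, K, M) for user in range(K)]
```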
The OFDMA transmitter employs an IDFT module of size N for modulation. The standard IDFT/DFT is not used here, but rather the transform pair specified in DOCSIS 3.1, where the subcarrier indexing is shifted by −N/2 sub-carriers. Using the DOCSIS 3.1 IDFT, the transmitted time-domain samples are written as
where n=0, 1, . . . , N−1 denotes the sample index. To avoid inter-symbol interference (ISI), a cyclic prefix (CP) consisting of NCP samples is prefixed to the OFDMA symbol. After performing parallel to serial (P/S) conversion, the time-domain samples are serially passed through a DAC clocked at sampling rate Fs and filtered with an image rejection filter to generate the continuous-time signal. Assuming ideal digital to analog (D/A) conversion, the continuous-time signal can be expressed as
where Ts=1/Fs is the sampling period and Tg is the guard interval in seconds. Tg is the duration of the cyclic prefix, which is NCPTs. It is obvious that after the cyclic prefix is inserted xa(t)=xa(t+NTs), ∀t∈[0,Tg]. In general, the validity of (3) depends on how well the up-conversion is performed.
A channel in a coaxial cable distribution network consists of many paths created by impedance mismatches among terminals and ports of devices that make up the network. Each path is characterized by a gain factor αi and an associated delay εi normalized to sampling period Ts. Without loss of generality, ε0 is taken to be 0 and εi is the delay of the i-th path relative to the delay of the 0-th path. The impulse response of the baseband-equivalent of the multipath channel is given by
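The discrete-time part of the transmitter described above, namely the shifted IDFT followed by cyclic prefix insertion, can be sketched as follows. This is an illustration under stated assumptions rather than the standard's definition: the function names are hypothetical, the 1/sqrt(N) scaling is assumed, and the transform is written as a direct sum over the pilot sub-carriers only.

```python
import cmath
import math

def shifted_idft(pilots, indices, N):
    """Time-domain samples for pilots placed on the given sub-carriers.

    The sub-carrier index is shifted by -N/2; the 1/sqrt(N) scaling is an
    assumption of this sketch.
    """
    return [
        sum(X * cmath.exp(2j * cmath.pi * n * (s - N / 2) / N)
            for X, s in zip(pilots, indices)) / math.sqrt(N)
        for n in range(N)
    ]

def add_cyclic_prefix(samples, n_cp):
    """Copy the last n_cp samples of the symbol to its front."""
    return samples[len(samples) - n_cp:] + samples

N, n_cp = 16, 4
indices = [2 + 3 * m for m in range(4)]   # S(m) = S(0) + m*K with S(0)=2, K=3
symbol = add_cyclic_prefix(shifted_idft([1 + 0j] * 4, indices, N), n_cp)
```

The first n_cp samples of `symbol` repeat its last n_cp samples, which is the discrete-time counterpart of the periodicity xa(t)=xa(t+NTs) over the guard interval.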
where δ is the Dirac delta function. Furthermore, the parameter L is the number of paths in the multipath channel. The channel's delay spread in seconds is εmaxTs, where
which is the delay of the longest multipath component relative to the first.
The continuous time signal received at the receiver is the convolution of the transmitted signal and the impulse response of the multipath channel. That is
where w(t) is a zero-mean AWGN noise process and τ0 is the timing offset (normalized to sampling period Ts) introduced by error in detecting the start time of the received OFDMA symbol. There are many coarse timing estimation techniques, as set out in Document 9 above, that can detect the start time of the received OFDMA frame. With coarse timing, the detection error can be a few samples.
Assume a well designed system where the length of the cyclic prefix is greater than the channel's delay spread, i.e., Tg>εmaxTs, as illustrated in
0≦τ0≦NCP−εmax, (6)
where τ0 is the error in coarse timing in samples. By defining τi=εi+τ0, i=1, 2, . . . , L−1, the timing error can be incorporated into the base-band channel to get the more realistic impulse response given by:
Then (5) simplifies to
The continuous time signal is band-limited and digitally sampled at the receiver with the sampling rate Fs. After coarse timing detection is performed, the cyclic prefix is removed. The discrete-time samples after cyclic prefix removal are given by
where w[n] is w(t) sampled at t=nTs+Tg after it has been band-limited. w[n] is complex white Gaussian noise with zero mean and variance σw². To recover the data, an N-point DFT block transforms the time-domain sequence back to the frequency-domain:
is complex Gaussian noise with zero mean and variance σw². Then
where H[m] is the multipath channel's frequency response at sub-carrier S(m), given as
With the input/output model of (12), the signal-to-noise ratio (SNR) of the received signal is defined as
The task of channel estimation is to obtain the frequency response of the entire channel, which is ideally given as
from known values of X[m] and observed values of Y[m]. Since F[k] is a function of the 2L unknown parameters {αi,τi}, i=0, 1, . . . , L−1, an estimate of F[k] is obtained from estimates of these 2L parameters. Conventional methods estimate M values of H[m] and then interpolate between them to get the entire frequency response, F[k]. The invention presents a novel iterative algorithm to obtain the 2L values of {αi,τi}. As long as 2L<<M, it will be shown that estimating {αi,τi} directly provides a better estimate of F[k].
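The dependence of the frequency response on the 2L path parameters can be made concrete with a short sketch. The expression used here is an assumption derived from the delayed-path model and the −N/2 sub-carrier shift described earlier, namely that a path with gain αi and delay τi (in samples) contributes αi·exp(−j2πτi(k−N/2)/N) at sub-carrier k. The function name is hypothetical.

```python
import cmath

def channel_response(alphas, taus, N):
    """F[k], k = 0..N-1, from path gains and delays (delays in samples).

    Assumed form: sum_i alpha_i * exp(-j*2*pi*tau_i*(k - N/2)/N).
    """
    return [
        sum(a * cmath.exp(-2j * cmath.pi * t * (k - N / 2) / N)
            for a, t in zip(alphas, taus))
        for k in range(N)
    ]

# Main path plus one weaker echo with a non sample-spaced delay.
F = channel_response([1.0, 0.158], [0.0, 5.3], 2048)
```

Two complex gains and two delays are enough here to generate all 2048 values of F[k], which is why estimating the path parameters directly can be attractive when 2L is much smaller than M.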
The iterative algorithm assumes a multipath channel that has a finite number of paths and is designed to estimate the channel parameters, which are time delays τi and amplitudes αi, i=0, 1, . . . , L−1, of the paths. The estimated parameters, denoted as {circumflex over (τ)}i and {circumflex over (α)}i can be used to obtain the frequency response of the channel with the following equation:
The transform has a length of NU/K, where U is an upsampling factor which controls the resolution of the resulting time domain vector q[u], given as
Note that U is not necessarily an integer, but rather a number that is chosen to make NU/K an integer. One suggestion is to make NU/K a power of two so that the complexity of (17) can be reduced through the use of a Fast Fourier Transform algorithm. (17) can be simplified as follows:
where ρ[u] is the AWGN noise component given by
which has zero mean and
Since M≦N/K, the complexity of (17) is equivalent to an
The signal component of (17) can be expressed in a more meaningful form as
where b[u] is represented as a summation of several channel path kernel functions, γ(•), that are delayed by τi and scaled in amplitude by αi. Each γ(•) function represents a path in the channel.
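The upsampled time-domain vector q[u] can be illustrated with the sketch below. The exact expression of (17) is not reproduced here, so the form used is an assumption: the LS estimates Y[m]/X[m] are correlated against exp(j2π(S(m)−N/2)u/(NU)) over u=0, 1, . . . , NU/K−1 and normalized by M. The function name is hypothetical and the transform is a direct sum rather than an FFT.

```python
import cmath

def upsampled_cir(h_ls, indices, N, U, K):
    """Rough upsampled impulse response q[u] for u = 0..NU/K - 1.

    h_ls holds the LS estimates Y[m]/X[m] on sub-carriers `indices`.
    U must be chosen so that N*U/K is an integer.
    """
    length = round(N * U / K)
    M = len(indices)
    return [
        sum(h * cmath.exp(2j * cmath.pi * (s - N / 2) * u / (N * U))
            for h, s in zip(h_ls, indices)) / M
        for u in range(length)
    ]

# Noiseless single path with unit gain and a delay of 3 samples.
N, U, K = 64, 2, 1
indices = list(range(8, 56))
h_ls = [cmath.exp(-2j * cmath.pi * 3 * (s - N / 2) / N) for s in indices]
q = upsampled_cir(h_ls, indices, N, U, K)
peak = max(range(len(q)), key=lambda u: abs(q[u]))
```

With this form a single path of delay τ produces a kernel centred at u=τU, so `peak / U` recovers the 3-sample delay in the example.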
The shape of the γ(•) function is clearer when it is expressed as
where Δ=2S(0)−N+(M−1)K and psinc(x, M) is the Dirichlet or periodic sinc function defined as
The γ(x) function has zero-crossings at integer multiples of N/(MK), and therefore the width of the main lobe is 2N/(MK), which is inversely proportional to KM, as illustrated in
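The path kernel can be evaluated numerically with the sketch below. It assumes the usual Dirichlet definition psinc(x, M)=sin(Mx/2)/(M·sin(x/2)) and the closed form γ(x)=exp(jπΔx/N)·psinc(2πKx/N, M), which is consistent with the stated value of Δ and with zero-crossings at integer multiples of N/(MK). The function names are hypothetical.

```python
import cmath
import math

def psinc(x, M):
    """Dirichlet (periodic sinc) function: sin(M*x/2) / (M*sin(x/2))."""
    den = math.sin(x / 2)
    if abs(den) < 1e-12:
        # Limit at multiples of 2*pi, obtained with L'Hopital's rule.
        return math.cos(M * x / 2) / math.cos(x / 2)
    return math.sin(M * x / 2) / (M * den)

def path_kernel(x, N, M, K, start):
    """Assumed closed form of the path kernel gamma(x)."""
    delta = 2 * start - N + (M - 1) * K
    return cmath.exp(1j * math.pi * delta * x / N) * psinc(2 * math.pi * K * x / N, M)

N, M, K, start = 2048, 1900, 1, 74
first_zero = N / (M * K)                       # first zero-crossing of gamma
at_zero = abs(path_kernel(first_zero, N, M, K, start))
at_peak = abs(path_kernel(0.0, N, M, K, start))
```

The kernel has unit magnitude at x=0 and vanishes at x=N/(MK), so increasing M or K narrows the main lobe.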
The iterative channel estimation technique centers on peak detection (105) of q[u]. Without loss of generality, the path indices are defined based on path strength such that
|α0|≧|α1|≧ . . . ≧|αL-1|. (23)
Provided U is chosen large enough for
where ⌊•⌉ indicates rounding to the nearest integer, then from (21), (23) and (24) it follows that
This indicates α0γ(⌊τ0U⌉/U−τ0) is the dominant magnitude contributor to b[⌊τ0U⌉]. The estimates of αi and τi (107) can be found iteratively starting with a rough approximation of the parameters of the first path, α0 and τ0 as follows:
where the superscript [1] indicates that the value was found on the first iteration. Rough estimates of the parameters of the second path can then be generated by subtracting from q[u] the estimated contribution of the first path, {circumflex over (α)}0[1]γ(u/U−{circumflex over (τ)}0[1]) (108), given by:
Similarly, rough estimates for αi, τi, i=2, 3, . . . , L−1, are found using
After a set of rough estimates are obtained, the estimates for α0 and τ0 can be improved by removing from q[u] the estimated contributions from paths 1 to L−1. The improved estimates of α0 and τ0 in this 2nd iteration are given by
In a similar manner, the better estimates of α0 and τ0 can be used to produce better estimates of α1 and τ1. The estimates can be continually improved in this iterative fashion. Specifically, the impulse response of path i on iteration v is approximated by (109)
and the improved estimates of αi and τi are given by
After several iterations the estimates reach, or at least nearly reach, steady state values, which are denoted as {circumflex over (α)}i and {circumflex over (τ)}i. These steady state values are processed by a Fourier transform (110) as described in (16) to obtain the estimated frequency response of the channel (111).
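The iteration described above (peak detection, subtraction of the other paths' estimated contributions, and repetition) can be sketched compactly. This is a simplified illustration and not the full method: all names are hypothetical, the kernel and the upsampled vector are evaluated as direct sums under the assumed −N/2 sub-carrier shift, delays are quantized to the grid u/U, and no interpolation or thresholding is applied.

```python
import cmath

def kernel(x, N, indices):
    """Assumed path kernel gamma(x), evaluated as a direct sum over the pilots."""
    return sum(cmath.exp(2j * cmath.pi * (s - N / 2) * x / N)
               for s in indices) / len(indices)

def ice(h_ls, indices, N, U, K, num_paths, iterations):
    """Iteratively estimate path gains and delays from LS pilot estimates."""
    length = N * U // K
    # Rough upsampled impulse response q[u].
    q = [sum(h * cmath.exp(2j * cmath.pi * (s - N / 2) * u / (N * U))
             for h, s in zip(h_ls, indices)) / len(indices)
         for u in range(length)]
    alpha = [0j] * num_paths
    tau = [0.0] * num_paths
    for _ in range(iterations):
        for i in range(num_paths):
            # Remove the current estimates of every other path from q[u].
            resid = list(q)
            for j in range(num_paths):
                if j != i and alpha[j] != 0:
                    for u in range(length):
                        resid[u] -= alpha[j] * kernel(u / U - tau[j], N, indices)
            # Peak detection on what remains gives path i (gamma(0) = 1).
            u_hat = max(range(length), key=lambda u: abs(resid[u]))
            tau[i] = u_hat / U
            alpha[i] = resid[u_hat]
    return alpha, tau

# Noiseless two-path example with delays that lie on the upsampled grid.
N, U, K = 64, 2, 1
indices = list(range(8, 56))
true_alpha, true_tau = [1.0, 0.3], [2.0, 7.0]
h_ls = [sum(a * cmath.exp(-2j * cmath.pi * t * (s - N / 2) / N)
            for a, t in zip(true_alpha, true_tau)) for s in indices]
alpha_hat, tau_hat = ice(h_ls, indices, N, U, K, num_paths=2, iterations=5)
```

On the first pass only the paths already estimated are subtracted, which mirrors the rough estimates of the first iteration. Later passes subtract all other paths using their latest estimates, and in this example the gain errors caused by leakage between the two kernels shrink with every pass.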
The accuracy of the aforementioned approximation is significantly affected by the error in {circumflex over (τ)}i. The proposed approach ideally estimates {circumflex over (τ)}i=⌊τiU⌉/U, where ⌊•⌉ denotes rounding to the nearest integer. Thus the estimation error in the worst case is 0.5/U, i.e., |{circumflex over (τ)}i−τi|≦0.5/U. A larger U reduces the error at the cost of increasing the estimator's complexity. Furthermore, the error in {circumflex over (τ)}i has the corresponding effect of diminishing the magnitude of {circumflex over (α)}i by up to a factor of |γ(0.5/U)|. For the case of M=N, the reduction can be as much as −4 dB, −0.9 dB and −0.2 dB for U=1, U=2 and U=4, respectively.
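The quoted worst-case reductions can be checked numerically. The sketch below assumes that the kernel magnitude is |sin(πMKx/N)/(M·sin(πKx/N))| and evaluates it at x=0.5/U for M=N; the function name and the choice N=2048 are illustrative.

```python
import math

def rounding_loss_db(U, N=2048, M=2048, K=1):
    """Worst-case magnitude loss 20*log10(|gamma(0.5/U)|) in dB."""
    x = 0.5 / U
    mag = abs(math.sin(math.pi * M * K * x / N)
              / (M * math.sin(math.pi * K * x / N)))
    return 20 * math.log10(mag)

losses = {U: round(rounding_loss_db(U), 2) for U in (1, 2, 4)}
```

Under these assumptions the losses come out at roughly −3.9 dB, −0.9 dB and −0.2 dB for U=1, 2 and 4, in line with the figures stated above.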
The estimate for τi used in (31) is rather crude. The accuracy of this estimator, which simply rounds the argument of (31) to the sample nearest to the peak, can be improved by interpolating between the two samples nearest the peak, thus eliminating the rounding error, as shown in
There are several ways of interpolating between two samples. The method used here is to find the parameters
As shown in
The solution for
where θi∈[0, 1]. Since sin(x)>0 ∀x∈(0, π), equation (34) can be simplified to
θi can be found as
θi=Γ−1(κ), 0≦θi≦1. (36)
Solving (36) in real-time is possible, but very costly due to the complexity of Γ−1(•). Moreover, the precision of the computation must be very high when θi is close to 0 or 1, i.e., when the denominator of (35) approaches zero.
Fortunately, (36) can be modified to yield a hardware friendly form. Since κ=eln(κ), θi can be expressed as the function of ln(κ) defined as Γlog−1(ln(κ))=Γ−1(eln(κ))=Γ−1(κ). The simplicity of the logarithmic form is illustrated in
θi can then be approximated by
θi≈β ln(κ)+0.5=β(ln(|qi[v][ûi]|)−ln(|qi[v][ûi+1]|))+0.5, (38)
for U≧2. Note that (38) does not require a division operation, and is therefore significantly more hardware friendly than (36).
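The log-domain interpolation of (38) can be sketched as follows. Several points are assumptions of this sketch: θi is taken as the fractional position of the true peak between samples ûi and ûi+1, the kernel magnitude is taken as |sin(πMKx/N)/(M·sin(πKx/N))|, and the constant β is obtained here by a least-squares fit of θ−0.5 against ln(κ), which is a stand-in for the document's own definition of β. All function names are hypothetical.

```python
import math

def kernel_mag(x, N, M, K):
    """Assumed kernel magnitude |gamma(x)| for x away from multiples of N/K."""
    return abs(math.sin(math.pi * M * K * x / N)
               / (M * math.sin(math.pi * K * x / N)))

def fit_beta(N, M, K, U, points=99):
    """Least-squares slope of (theta - 0.5) versus ln(kappa); illustrative only."""
    num = den = 0.0
    for p in range(1, points + 1):
        theta = p / (points + 1)
        ln_kappa = math.log(kernel_mag(theta / U, N, M, K)
                            / kernel_mag((1 - theta) / U, N, M, K))
        num += (theta - 0.5) * ln_kappa
        den += ln_kappa * ln_kappa
    return num / den

def theta_log(mag0, mag1, beta):
    """Eq. (38): theta ~ beta*(ln|q[u]| - ln|q[u+1]|) + 0.5, clipped to [0, 1]."""
    theta = beta * (math.log(mag0) - math.log(mag1)) + 0.5
    return min(1.0, max(0.0, theta))

N, M, K, U = 2048, 2048, 1, 2
beta = fit_beta(N, M, K, U)
true_theta = 0.3
estimate = theta_log(kernel_mag(true_theta / U, N, M, K),
                     kernel_mag((1 - true_theta) / U, N, M, K), beta)
```

The interpolation uses only two logarithms, one subtraction and one multiplication by a constant. With θi defined this way β is negative, because a larger magnitude at sample ûi means the peak lies closer to ûi.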
There are two ways to find the echo strength
Although (39) is computationally simple, it amplifies the noise and ISI present in qi[v][ûi]. In the worst-case scenario, this results in the noise and ISI being increased by a factor of |γ(1/U)−1|. This factor decreases as the upsampling factor U increases. For the system parameters shown in
which is computationally more expensive than (39), but it prevents the noise and interference amplification effect. Consequently, for U≦2, it is advisable to use (40) in order to avoid severe performance degradation due to noise amplification. For U>2, the amplification effect is minimal, so the more computationally efficient (39) is preferred.
The ICE technique does not require any channel information, except for an initial estimate of the number of channel paths, denoted as {circumflex over (L)}, which must be determined before performing the iterative channel estimation. In practice, the parameter L in (20) should be replaced by {circumflex over (L)} so that the ICE algorithm estimates the 2{circumflex over (L)} channel parameters {αi,τi}, i=0, 1, . . . , {circumflex over (L)}−1, instead of 2L. It is therefore reasonable to expect the best performance when {circumflex over (L)}=L.
In some cases, such as CATV networks, the plant is maintained to limit the number of dominant echo paths. For example, networks that use DOCSIS 3.0 equipment are restricted to L≦4 while networks that use DOCSIS 3.1 equipment are restricted to L≦2. Therefore it is reasonable to fix parameter {circumflex over (L)}=2 in equipment used in DOCSIS 3.1 upstream transmission.
Although the ICE technique was initially designed for DOCSIS 3.1 systems, it applies to general OFDMA systems, where the parameter L is not so constrained and the initial estimate {circumflex over (L)} affects the channel estimation performance. In particular, with the proposed ICE technique, if the number of paths in the channel is under-detected, i.e. {circumflex over (L)}<L, there will be performance degradation as the model is unable to compensate for the least significant channel paths. If the number of paths is over-detected, i.e. {circumflex over (L)}>L, the ICE technique would interpret noise samples as channel paths. Since the power of noise is much less than the power of an echo, e.g. |αL−1|²>>ση², performance degradation due to over-detection is generally less than the degradation caused by under-detection. Therefore, it is better to err on the side of over-detection.
Moreover, the performance degradation caused by over-detection can be mitigated as the significant power difference between echoes and noise can be exploited to suppress the over-detected paths. In particular, a threshold can be employed to differentiate the channel paths from the noise. The thresholding process replaces (39) and (40) with:
where λT is the threshold level. With any threshold level decision, there is always some probability of a false alarm where a noise sample is declared as an echo. The threshold can be set to obtain a false alarm probability of Pe, using
Simulation results show that the estimation performance is not particularly sensitive to threshold level λT. A reasonable threshold is obtained by setting the false alarm probability to Pe=10⁻³.
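The relationship between a threshold and a false alarm probability can be illustrated with a standard result. The sketch assumes that a noise-only sample is circularly-symmetric complex Gaussian with total variance σ², so that its magnitude exceeds λ with probability exp(−λ²/σ²). The document's own threshold expression is not reproduced here and may differ; the function name is hypothetical.

```python
import math

def detection_threshold(noise_std, p_false_alarm):
    """Threshold on |sample| giving the requested false alarm probability.

    Assumes circularly-symmetric complex Gaussian noise whose total
    variance is noise_std**2, so P(|noise| > t) = exp(-t**2 / noise_std**2).
    """
    return noise_std * math.sqrt(-math.log(p_false_alarm))

threshold = detection_threshold(1.0, 1e-3)
```

Under this assumption a false alarm probability of 10⁻³ places the threshold at about 2.6 noise standard deviations, and because the dependence on Pe is only through the square root of a logarithm, moderate changes in Pe move the threshold very little.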
The iterative channel detection procedure is summarized below:
The performance of the proposed channel estimation algorithm is investigated. At first, the single echo channel model as set out in Document 8 above is considered, which fixes {circumflex over (L)}=L=2. Coarse timing error τ0 is modeled as a random variable that is uniformly distributed between 0 and 10, i.e. τ0˜U(0,10). The echo delay in seconds, i.e. ε1Ts, is uniformly distributed between 0 and 0.5 μs. The power of the micro-reflection is −16 dBc relative to the main path, which is the worst case specified in DOCSIS 3.1. The sampling rate of the system is Fs=102.4 MHz. The signal is generated using an N=2048 point IFFT, and has M=1900 pilots indexed by S(m)=m+74, m=0, 1, . . . , 1899 (no sub-carrier skipping), which leaves 74 unused carriers as guard bands at both ends of the spectrum.
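The simulation parameters above imply a few derived quantities that are easy to verify. The variable names in the sketch are illustrative; the values are those stated in the text.

```python
N, M, start = 2048, 1900, 74        # IFFT size, pilot count, first pilot index
fs_hz = 102.4e6                     # sampling rate Fs

lower_guard = start                 # unused sub-carriers below the pilots
upper_guard = N - (start + M)       # unused sub-carriers above the pilots

# A micro-reflection at -16 dBc corresponds to this amplitude ratio.
echo_amplitude = 10 ** (-16 / 20)

# Maximum echo delay of 0.5 microseconds expressed in sample periods.
max_echo_delay_samples = 0.5e-6 * fs_hz
```

The guard bands come out at 74 sub-carriers on each side, the echo amplitude is about 0.158 of the main path, and the echo delay ranges up to 51.2 sample periods.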
Performance of the ICE technique is illustrated for three up-sampling factors: U=1, U=2 and U=4. The ICE method clearly outperforms both the LS and DFT techniques, even with U=1. However, with U=1, the ICE method hits an error floor of 4×10⁻⁵ for SNRs above 20 dB that cannot be reduced by increasing the number of iterations.
When increasing the up-sampling factor to U=2, it can be seen that the error floor depends on the number of iterations. As can be seen in
It can be seen that the proposed algorithm outperforms the conventional methods, especially in the low-SNR region, where the ICE estimator is 30 dB better than the LS. In addition, the performance of the ICE asymptotically approaches that of the LMMSE, but does not need a priori knowledge of the auto-covariance of the channel. Furthermore, the ICE method requires only a single OFDMA symbol to achieve this level of performance.
It is notable that when the echo delay is less than a sample period, the estimation error is very high compared to the error caused by a larger echo delay. Specifically, when ε1<1, the performance of the ICE technique is limited to around 10⁻⁴ regardless of SNR level. That observation indicates that the error floor of the ICE method shown in
Since τ0 is the delay of the main path, it is also the timing error. Therefore the proposed technique not only estimates the channel's frequency response, but also detects the timing error.
Conventional OFDMA timing detection techniques, as set out in Document 9 above, are all timing-metric based estimation techniques, which limit detection resolution to a sample period. Therefore, the timing offset variance of these conventional techniques is inherently greater than 1/12, which is easily outperformed by the proposed algorithm. The initial ICE algorithm has a timing offset that is uniformly distributed between −0.5/U and 0.5/U. Therefore, the timing variance of the initial ICE algorithm would asymptotically approach U⁻²/12, which is also plotted in
As shown
is inversely proportional to M, so it increases by 6 dB when the number of pilot sub-carriers is reduced from 1900 to 475. Therefore, experimental evidence suggests that the performance of the ICE technique scales well with the number of pilot sub-carriers.
Finally,
The detection performance behaves differently in the high SNR region, i.e. when SNR≧30 dB. In particular, the best performance is observed when {circumflex over (L)} is given the exact number of channel paths, e.g. when {circumflex over (L)}=7. As expected, when {circumflex over (L)}<7, under-detection significantly reduces channel estimation performance. However, only negligible performance degradations are observed with over-detection, e.g. when {circumflex over (L)}>7, due to the threshold that mitigates the possibility of misidentifying noise samples as channel paths.
Since various modifications can be made in my invention as herein above described, and many apparently widely different embodiments of same made within the spirit and scope of the claims without departure from such spirit and scope, it is intended that all matter contained in the accompanying specification shall be interpreted as illustrative only and not in a limiting sense.