1. Field of the Invention
The present invention relates to impulse noise estimation and removal in orthogonal frequency division multiplexing (OFDM) transmissions, and particularly to transmissions in power line communications and digital subscriber line (DSL) transmissions.
2. Description of the Related Art
Signal transmissions, such as those delivered via power line communications and digital subscriber line (DSL) transmission, must cope with intersymbol interference (ISI) distortion, additive white Gaussian noise (AWGN) and impulse noise. In telecommunication, intersymbol interference is a form of distortion of a signal in which one symbol interferes with subsequent symbols. This is an unwanted phenomenon, as the previous symbols have similar effect as noise, thus making the communication less reliable. ISI is often caused by multipath propagation and the inherent non-linear frequency response of a channel. ISI arises due to imperfections in the overall frequency response of the system. The presence of ISI in the system, however, introduces errors in the decision device at the receiver output. Therefore, in the design of the transmitting and receiving filters, the objective is to minimize the effects of ISI, and thereby deliver the digital data to its destination with the smallest error rate possible. Common techniques to fight against intersymbol interference include adaptive equalization and error correcting codes.
In communications, the additive white Gaussian noise channel model is one in which the information is given a single impairment, i.e., a linear addition of wideband or white noise with a constant spectral density (typically expressed as Watts per Hertz of bandwidth) and a Gaussian distribution of noise samples. The model does not account for the phenomena of fading, frequency selectivity, interference, nonlinearity or dispersion. However, it produces simple and tractable mathematical models that are useful for gaining insight into the underlying behavior of a system before these other phenomena are considered.
Wideband Gaussian noise comes from many natural sources, such as the thermal vibrations of atoms in antennas (referred to as thermal noise or Johnson-Nyquist noise), shot noise, black body radiation from the Earth and other warm objects, and from celestial sources such as the Sun. As an example of noise in signal transmissions, ADSL/VDSL over short distances operates at an extremely high signal-to-noise ratio (SNR) with very high spectral efficiencies (quadrature amplitude modulation, or “QAM”, constellations of up to 215 points can be used), and their main limiting factor is impulse noise and cross-talk, rather than AWGN. As will be described in greater detail below, impulse noise estimation and cancellation at the receiver is of particular interest and importance.
Impulsive noise is considered one of the biggest challenges in DSL technology and OFDM transmission in general. While impulsive noise is often attributed to switching electronic equipment, there is no general consensus as to the proper modeling of impulsive noise. There are various ways to deal with errors that take place at the physical layer in DSL. Forward error correction is one common way of counteracting, or accounting for, such errors. Specifically, a superframe in DSL implements an inner convolutional code with an interleaver and a Reed-Solomon outer code. The interleaver spreads the impulse noise errors around the signal, allowing the code redundancy to better deal with these errors.
Alternatively, other common techniques attempt to detect the presence of impulses and their locations, and use this information to enhance the performance of forward error correction (FEC). One common method detects the presence of an impulse using a thresholding scheme, and then erases the whole OFDM block in order not to exceed the error correction capability of the channel coding. Most standardized approaches try to detect or forecast the location of the errors. With this knowledge, one can theoretically detect twice as many errors as when the location of the errors is unknown. When the physical layer is not able to deal with erasures through forecasting and FEC, the physical layer tags the uncertain discrete multitone (DMT) symbols and sends them to higher layers.
Pre-coding techniques and frequency algebraic interpolation techniques inspired by Reed-Solomon coding and decoding over the complex numbers have been proposed to cope with this problem. Specifically, the presence of impulse noise within a few samples creates certain syndromes in a sequence of pilots or null frequencies, which can be used to detect the location of impulse noise, estimate it, and cancel the noise. The problems with such techniques are that they require a certain structure of the null frequencies or pilots, they are guaranteed to detect only a limited number of impulse noise samples, and they can be very sensitive to background noise (to the extent that some intermediate step is needed to ensure that the algorithm does not malfunction).
In
y
k=Σl=oLhlxk−l+zk+ek (1)
where xk and yk denote the channel input and output, h=(h0, . . . hL) is the impulse response of the channel, zk represents AWGN and is independent and identically distributed (i.i.d.) drawn from a zero mean normal distribution with variance N0 (the noise), or ˜(0, N0), and ek is an impulsive noise process, which, for purposes of this analysis, is assumed to be Bernoulli-Gaussian, i.e., ek=λkgk, where λk are i.i.d. Bernoulli random variables, with P(λk=1)=p, and gk are i.i.d. Gaussian random variables ˜CN(0, I0). The channel SNR is defined as Ex/N0, and the impulse to noise ratio (INR) is defined as I0/N0.
y=Hx+e+z (2)
where y and x are the time-domain OFDM receive and transmit signal blocks (after CP removal) and z˜CN(0,N0l). The vector e is an impulse noise process and, specifically, e is a random vector with support e) (a set of the non-zero components) uniformly distributed over all (sm) possible supports of cardinality s<<m, and i.i.d. non-zero components ˜CN(0, I0).
Due to the presence of the cyclic prefix, H is a circulant matrix describing the cyclic convolution of the channel impulse response with the block x. Letting F denote a unitary discrete Fourier transform (DFT) matrix with (k, l) elements
with k,lε{0, . . . , n-1}, the time domain signal is related to the frequency domain signal by:
and, furthermore, given a circulant convolution matrix H,
H=F
HDF (4)
where D=diag({hacek over (h)}) and {hacek over (h)}=√{square root over (nFH)} is the DFT of the channel impulse response (whose coefficients are found, by construction, on the first column of H).
Demodulation amounts to computing the DFT, as given in equation set (5) below:
where
are the DFT coefficients of the channel impulse response, and {hacek over (z)}=Fz has the same distribution of z. Without impulsive noise, it is well known that equation set (5) reduces to a set of m parallel Gaussian channels: {hacek over (y)}i=Hi{hacek over (x)}i+{hacek over (z)}i, for i=1, . . . , m.
In the presence of the impulsive noise, the performance of a standard OFDM demodulator may dramatically degrade since even a single impulse in an OFDM block may cause significant degradation to the whole block. This is because {hacek over (e)}=Fe can have a large variance per component, and, thus, affects more or less evenly all symbols of the block.
Thus, a method of estimating and removing noise in OFDM systems solving the aforementioned problems is desired.
The present invention relates to impulse noise estimation and removal in orthogonal frequency division multiplexing (OFDM) transmissions, and particularly to transmissions in power line communications and digital subscriber line (DSL) transmissions. Any suitable types of transmitters, receivers or transceivers may be utilized. The method includes the steps of: (a) modulating data to be transmitted by a transmitter; (b) performing an inverse fast Fourier transform on the modulated data; (c) inserting a cyclic prefix into the transformed, modulated data; (d) transmitting the transformed, modulated data as a set of OFDM symbols via a channel; (e) receiving the set of OFDM symbols on a receiver; (f) removing the cyclic prefix from the set of OFDM symbols; (g) performing a fast Fourier transform on the set of OFDM symbols; (h) estimating impulse noise in the transformed set of OFDM symbols; (i) canceling the impulse noise in the transformed set of OFDM symbols based upon the estimated impulse noise to produce a set of impulse noise-free data; (j) estimating the channel; and (k) demodulating and detecting the data transmitted based upon the estimated channel and the set of impulse noise-free data.
The channel may be estimated using blind, semi-blind or pilot-based methods. The impulse noise may be estimated by insertion of free carriers either randomly across the entire available frequency spectrum, or only within the guard bands thereof. Preferably, the impulse noise is estimated by first estimating the coarse support of the impulses, then refining the support of the impulses, and then estimating the amplitudes of the impulses.
Coarse support is preferably performed by a Compressive Sensing (CS) method, which may utilize the Candes-Randall-Tao SOCP Estimator algorithm, the Candes-Randall-Tao LP Estimator algorithm, or the Tropp I1-penalty Estimator algorithm. Refinement of the support may be performed by using a maximum likelihood (ML) method or a maximum a-posteriori probability (MAP) method. Estimation of the amplitudes may be calculated via the least squares (LS) method or the mean square error (MMSE) method.
These and other features of the present invention will become readily apparent upon further review of the following specification and drawings.
Similar reference characters denote corresponding features consistently throughout the attached drawings.
The resultant signal is delivered as one more channels C to be received by receiver R. As will be described in detail below, the cyclic prefix is removed at 108 and a fast Fourier transform is performed at 110. The actual impulse noise estimation and removal occurs at 112, and at 114, the signal is demodulated and the data is extracted and delivered to the user. It should be understood that the calculations necessary to perform these steps may be performed using any suitable type of processor, such as a programmable logic controller, for example, or specialized circuit modules integrated into receiver R. Transmitter T may be any suitable type of transmitter and receiver R may be any suitable type of receiver. Alternatively, transmitter T and receiver R may be integrated into a single transceiver unit.
As will be described in detail below, the channel may be estimated using blind, semi-blind or pilot-based methods. The impulse noise may be estimated by insertion of free carriers either randomly across the entire available frequency spectrum, or only within the guard bands thereof. Preferably, the impulse noise is estimated by first estimating the coarse support of the impulses, then refining the support of the impulses, and then estimating the amplitudes of the impulses.
Coarse support is preferably performed by a Compressive Sensing (CS) method, which may utilize the Candes-Randall-Tao SOCP Estimator algorithm, the Candes-Randall-Tao LP Estimator algorithm, or the Tropp I1-penalty Estimator algorithm. Refinement of the support may be performed by using a maximum likelihood (ML) method or a maximum a-posteriori probability (MAP) method. Estimation of the amplitudes may be calculated via the least squares (LS) method or the mean square error (MMSE) method. In the OFDM frequency domain channel model given below as equation (6), where ê denotes the resultant estimate of the impulse noise e produced by the compressive sampling algorithm. The signal actually fed to the receiver is given by:
{hacek over (y)}=D{hacek over (x)}+F(e−ê)+{hacek over (z)}. (6)
In the following, Ω⊂Zn denotes the set of frequencies that are not used to send modulation symbols. These frequencies are used to estimate the impulse noise vector e at the receiver R. The inventive method uses the null carriers that are available on the transmission spectrum to detect, estimate, and cancel impulsive noise. The time domain signal is constructed as:
x=F
H
S
x
{hacek over (d)} (7)
where d is the frequency-domain data symbol vector having dimension k≦n, and where Sx is an n×k selection matrix containing only one element equal to 1 per column and having m=n−k zero rows. The columns of Sx index the subcarriers that are used for data transmission in the OFDM system. The remaining subcarriers are either not used, or are used for transmitting known pilot symbols in the frequency domain. The known pilot symbols in the frequency domain are not used in the following analysis, since they are used for channel estimation, and these can be easily subtracted from the received signal at the receiver R. Thus, the subcarriers not indexed by columns of Sx are not used. In the following, S denotes the matrix having a single element equal to 1 per column, and that spans the orthogonal complement of the columns of Sx.
The frequency domain vector is given by:
{hacek over (y)}=Fy=DS
x
{hacek over (d)}+Fe+{hacek over (z)} (8)
where {hacek over (z)} has the same statistics as z, since F is unitary. The estimation of e is derived from projection into the orthogonal complement of the signal subspace. This is given by:
y′=S
T
{hacek over (y)}=S
T
Fe+z′ (9)
where z′ is an i.i.d. Gaussian vector with variance N0 per component, and having a length m. For future usage, the m×n projection matrix obtained by a row selection of F (according to S) is denoted by Ψ=STF. The observation vector y′ is a projection of the n-dimensional impulse noise onto a basis of dimension n−m<n corrupted by the AWGN z′.
Since n<m, there is an underdetermined system of linear equations for finding e which cannot be solved by standard linear estimation. Thus, the essential step in estimating e consists of finding its support . It is important to note that e is a sparse vector. This support estimate will then be used to estimate the amplitudes of the impulses.
The Candes-Randall-Tao SOCP estimator is formulated for the real numbers. Using the notation given above, the Candes-Randall-Tao SOCP estimator is given by the following:
minimize ∥{tilde over (e)}∥1, subject to ∥y′−Ψ{tilde over (e)}∥2≦ε (10)
for some small enough selected estimation factor ε.
The Candes-Randall-Tao LP estimator (also known as the Dantzig selector) is used for real vectors, and is given by the following:
minimize ∥{tilde over (e)}∥1, subject to ∥w−Ψ{tilde over (e)}∥∞≦λ (11)
for some small enough selected estimation factor λ.
The Tropp I1-penalty estimator considers a third, non-equivalent way of performing sparse approximation/estimation:
minimize ½∥y′−Ψ{tilde over (e)}∥22γ∥{tilde over (e)}∥1 (12)
where the parameters ε, λ and γ are related to the AWGN variance N0. One or a combination of the three above algorithms are used to estimate the support e).
Up to an insignificant proportionality factor, the joint probability (density) P(y) (i.e., the MAP matrix) can be maximized as:
where the covariance matrix of y′, given normalized by the noise variance, is given as:
where Ψ=STF=[ψ1, . . . , ψn], and where Ψ( denotes the submatrix formed by the columns {ψj: jε}, indexed by the support This relies on the fact that, under the support hypothesis = the observation y′ is conditionally Gaussian with covariance N0Σ( It should be noted that I0/N0 is the impulse-to-noise ratio (INR).
An optimal MAP support detector should test each hypothesis and find the one that maximizes the MAP metric given above. Even by limiting to a subset of most probable supports (i.e., of weight at most rmax for some reasonable value of rmax>np), this scheme is prohibitively complex. However, the following augmented CS scheme may be used effectively. A CS algorithm is used in order to find a set of candidate positions. Letting ê denote the estimated impulse vector from the CS algorithm, then its components are sorted in decreasing order of magnitude, and the candidate supports are considered as: =0 (i.e., no impulses); containing a single 1 in the position of the largest element of ê; containing two 1's in the position of the two largest elements of ê . . . and so on, until a maximum number rmax>np of ones is reached. The support is selected as the one that maximizes the MAP metric among the above set of candidates.
As an alternative, e may be treated as unknown and arbitrary, with support Thus, given the support the only known regarding y′ is that it is formed by a vector in the subspace spanned by the columns of Ψ( plus a white Gaussian noise vector z′. The conditional density of y′ given is proportional to the negative exponential of the projection of y′ on the orthogonal complement of the span of the columns of Ψ( It follows that the corresponding MAP metric of the hypothesis =becomes:
where P(J)⊥=I−Ψ(J)(Ψ(J)HΨ(J))−1Ψ(J)H is the orthogonal projector onto the orthogonal complement of the subspace spanned by the columns of Ψ
As shown in
which provides the corresponding LS estimate of e as êls=(φeHφe)−1φeHy.
The MMSE estimates are given by
It is to be understood that the present invention is not limited to the embodiments described above, but encompasses any and all embodiments within the scope of the following claims.