The present document relates to wireless communication, and more particularly, to signal reception schemes used in wireless communication.
Due to an explosive growth in the number of wireless user devices and the amount of wireless data that these devices can generate or consume, current wireless communication networks are fast running out of bandwidth to accommodate such a high growth in data traffic and provide high quality of service to users.
Various efforts are underway in the telecommunication industry to come up with next generation of wireless technologies that can keep up with the demand on performance of wireless devices and networks.
This document discloses techniques that can be used to implement receivers that receive orthogonal time frequency space (OTFS) modulated symbols and recover information bits from these symbols based, in part, on lattice reduction techniques for OTFS modulation.
In one example aspect, a method for wireless communication implementable at a receiver device is disclosed. The method includes receiving a signal comprising information bits modulated using an orthogonal time frequency space (OTFS) modulation scheme, where each delay-Doppler bin in the signal is modulated using a quadrature amplitude modulation (QAM) mapping, and estimating the information bits based on an inverse of a single error covariance matrix of the signal, where the single error covariance matrix is representative of an estimation error for all delay-Doppler bins in the signal.
In another example aspect, a wireless communication apparatus that implements the above-described method is disclosed.
In yet another example aspect, the method may be embodied as processor-executable code and may be stored on a non-transitory computer-readable program medium.
These, and other, features are described in this document.
Drawings described herein are used to provide a further understanding and constitute a part of this application. Example embodiments and illustrations thereof are used to explain the technology rather than limiting its scope.
To make the purposes, technical solutions and advantages of this disclosure more apparent, various embodiments are described in detail below with reference to the drawings. Unless otherwise noted, embodiments and features in embodiments of the present document may be combined with each other.
Section headings are used in the present document to improve readability of the description and do not in any way limit the discussion or the embodiments to the respective sections only.
One of the fundamental attributes of a Multiple Input Multiple Output (MIMO) system is the data transmission rate increase it offers for a given bandwidth. This is achieved by sending several data streams in parallel. Data streams can be coded into parallel stream of QAM symbols and can be transmitted simultaneously from different transmit antennas that are kept appropriately separated. By receiving these signals on a group of receive antenna, these vector of QAM symbols can be detected correctly and thus the transmitted bit streams can be recovered correctly. The capacity increase is proportional to either a) the number of transmit antenna or b) the number of receive antenna, whichever is minimum.
At the receiver, the received QAM vector is typically jointly detected. Optimal QAM vector detection for MIMO channels is extremely computationally intensive. To the point that for reasonable antenna configurations (for example, 4×4 antennas transmitting QAM 64), highly sub-optimal QAM detectors are currently used in practice.
In orthogonal time frequency space (OTFS) modulation, due to the delay-Doppler domain organization of transmitted symbols, all transmitted QAMs vectors experience the same channel. Therefore, after equalization, all QAM vectors will have the same error covariance matrix, denoted by Ree. This property is mathematically different from traditional orthogonal frequency division multiplexing (OFDM) based transmissions such as LTE systems because each QAM vector in the traditional OFDM schemes experiences a different channel and therefore the residual error matrix for each QAM vector would be typically different.
Embodiments of the disclosed technology exploit this property of OTFS modulation at receivers to apply a single pre-processing algorithm to Ree. After pre-processing the Ree, a computationally inexpensive QAM detection algorithm (for example, a Babai detector) can be effectively used for all transmitted QAM vectors. The pre-processing algorithm includes performing lattice reduction, as is disclosed in the present document.
Consider a MIMO channel. Assume that QAM signals transmitted on the MIMO channel have unit energy and are represented by:
y=Hx+w (1.1)
Table 1 below, explains some of the variables used in the sequel.
A minimum mean square estimate (MMSE) equalizer finds the most likely transmitted vector in the mean square error sense. This algorithm is typically implemented under the assumption that the noise in the system can be modelled as Additive White Gaussian Noise (AWGN).
If MMSE receive equalizer is denoted by C, then it can be shown that:
C=(σw−2H*H+I)−1H*σw−2 (1.2)
Applying the MMSE receive equalizer to the received vector gives a vector of soft estimates, denoted as xs, of the transmit vector:
xs=Cy (1.3)
A simple symbol detection mechanism can be implemented using a commonly known slicer. This is achieved by mapping the soft-estimate to the nearest QAM constellation points. The vector of soft-estimates must be close to the transmitted QAM vector to ensure a true reception of the transmitted QAM vector.
Let e denote the difference between the soft estimate and the true transmitted vector:
e=x−x
s (1.4)
Denote covariance of e by Ree. Then the theory of MMSE gives:
R
ee=(σw−2H*H+I)−1 (1.5)
In certain scenarios, no receiver can perform better than receivers that perform Maximum Likelihood (ML) based symbol detection. ML receivers, however, is computationally intractable, hence implementation of the same is very hard. Embodiments of the disclosed technology implement computationally efficient near-ML receiver for OTFS modulated signals.
The QAM estimation problem can be formulated as follows. The receiver needs to find the most likely transmitted vector of QAMs:
Here, the term QAML
For the channel model described in (1.1), the probability p({tilde over (x)}|y) can be well approximated by a Gaussian density function with mean xs and covariance Ree.
Using the probability approximation, the following simplification can be performed:
The search space QAML
|QAM
Lt|=16.77e6 (1.9)
It may be possible to use heuristics to speed up the search. However, these “short cuts” may not provide good results (or may not speed up calculations), particularly for ill conditioned channels. In conventional MIMO OFDM (e.g., LTE), a frequency block is divided into subcarriers. QAM vectors are transmitted across each of the subcarriers. Each transmitted QAM vector experiences a different channel. To recover the QAM vectors, a receiver may perform a separate QAM detection for each subcarrier. The computation complexity, above described, thus gets further boosted by the number of data sub-carriers in the system. This further forces the receiver to use suboptimal detection algorithms.
In MIMO OTFS the information to be transmitted is specified in the delay-Doppler domain. Namely, for each delay Doppler bin a vector of QAMs is transmitted.
Table 2 summarizes notation used with respect to MIMO OTFS signals.
For simplicity, QAM vectors are assumed to have unit average energy. A QAM vector assigned to the delay-Doppler bin (τ,v) is denoted by x(τ,v).
A MIMO OTFS system can also be described by (1.1). Here, y denotes the received signal in the delay-Doppler domain, H denotes the channel in delay Doppler domain and w denotes the noise in the delay-Doppler domain. In a typical equalization structure in OTFS, feed-forward equalizer (FFE) is applied in the time-frequency domain and a 2D noise-predicting DFE is applied in the hybrid delay-time domain.
e
in(τ, v)=(τ, v)−xin(τ, v). (1.10)
for all delay-Doppler bins (τ,v).
Due to spreading in OTFS the covariance matrix of the input error is constant across delay and Doppler frame. That is
[ein(τ,v)e*in(τ,v)]=[ein(τ′,v′)e*in(τ′,v′)] (1.11)
for all pairs of transmission delay-Doppler bins (τ, v) and (τ′,v′).
This means a single matrix defines or represents the error covariance for delay Doppler bins of a given transmission frame. Therefore, there exists an Lt×Lt matrix denoted Rein such that
[ein(τ,v)e*in(τ,v)]=Reein (1.12)
for all delay-Doppler bins (τ,v), (Reein)−1 can be obtained as a by-product of the DFE computation. As described earlier, the detection problem can be approximated with a quadratic minimization problem.
for all delay-Doppler transmission bins (τ,v).
If the matrix (Reein)−1 is perfectly conditioned, an ordinary slicer that slices each QAM symbol (of the QAM vector) co-ordinate by co-ordinate along the standard lattice of xin(τ,v) is optimal. In other words, if condition number is close to one, then an ordinary slicer can be a near optimal detector.
In some embodiments, the condition of the matrix (Reein)−1 can be improved by performing a scheme called lattice reduction. Section 2 and Section 3 provide a theoretical framework and some implementation examples of lattice reduction algorithms.
As depicted in
v∈ΛL
For a unimodular matrix U, the pre-conditioned lattice detection problem is
Here, q′=U−1q.
There exist algorithms to find a unimodular matrix U that makes (Ree−1) U well-conditioned. In some embodiments, a Lenstra-Lenstra-Lovász (LLL) lattice reduction algorithm may be used, and which is further detailed in Sections 2 and 3. Using these results, an OTFS QAM vector detection scheme may be implemented as follows.
First, a lattice reduction algorithm may be implemented to find a unimodular matrix U which makes (Ree−1)U well-conditioned.
Next, for each delay-Doppler bin the U pre-conditioned detection problem is solved using a computationally inexpensive algorithm (e.g., a Babai detector as described in Section 2).
Then, the result is multiplied by U to get a near ML estimate of the transmitted QAM vector.
As observed earlier, in OTFS all transmitted QAM vectors experience the same channel. Therefore, after MMSE equalization all QAM vectors of a given frame have the same error covariance matrix, (denoted Ree). This implies that a single pre-processing algorithm, such as lattice reduction, to Ree−1 for a given OTFS frame. This brings significant computational advantages compared to OFDM. After pre-processing any of several well-known QAM detection algorithms can be effectively used for all transmitted QAM vectors.
One of skill in the art would appreciate that this technique cannot be used for standard OFDM systems where each transmitted QAM vector experiences a different channel and hence after equalization has a different residual error matrix.
However, it may be possible to reduce computational complexity in traditional OFDM receivers using some techniques described herein. For example, as described, in the case of OFDM systems, the number of error covariance matrices to pre-process will be equal to the number of data sub-carriers in the system. As was noted, the complexity of approximate ML detection in such systems is very high. However, some simplifications can be done to ease this computational burden. For example, Ree from sub-carriers that have similar channel characteristics can be averaged to get a single average Ree. This entity can be taken as the representative Ree for all those sub-carriers. This mechanism can bring down the computational load for OFDM systems, with possibly some degradation in the over-all performance.
Gaussian Integers. We denote by i the ring of Gaussian integers, i.e., an element z∈i is a complex number of the form:
z=a+bi, (2.1)
where a,b∈. Gaussian integers constitute the complex generalization of the plain ring of integers ⊂. All the constructions in this manuscript will be derived over (i) with the understanding that they admit a simple counterpart over (). In particular, when considering a diagrammatic depiction we will restrict to ()
Euclidean metric. Let R∈N×N be a positive definite Hermitian matrix. The matrix R defines an Euclidean metric on N given by:
R(x,y)=xHRy, (2.2)
for every x,y∈N. We denote by ∥−∥R the norm associated with R, i.e.,:
∥x∥
R=√{square root over (R(x,x),)} (2.3)
for every x∈N. Finally, we denote by RN the standard Euclidean metric on N given by the unit matrix IN, i.e.,:
R
N(x,y)=xHy, (2.4)
We denote the norm associated with RN simply by ∥−∥. Finally, given a point x∈N, we denote by Br(x:R) the ball of radius r>0 around x with respect to the metric R, i.e.,:
B
r(x:R)={y∈N:∥y−x∥R<r}. (2.5)
Based Hermitian lattices. A based Hermitian lattice (BHL for short) of rank N is a pair =(G,R) where R is an Euclidean metric on N and G is an invertible matrix G∈N×N. The matrix G defines a full rank Gaussian lattice Λ⊂N given by:
Λ={ξ[1]g1+ξ[2]g2+. . . +ξ[N]gN:ξ[k]∈i}, (2.6)
where gk denotes the kth column of G, i.e.,:
Alternatively, the full rank Gaussian lattice can be expressed as:
Λ=G(iN), (2.8)
that is, every element λ∈Λ is uniquely represented as λ=Gξ for some ξ∈iN. We refer to the matrix G as a basis of the lattice Λ and to the integral vector ξ as the coordinates of λ with respect to the basis G. The standard example of a rank N based Hermitian lattice is N=(I,RN). To summarize, a based Hermitian lattice is a lattice equipped with a basis and residing in a complex vector space equipped with an Euclidean metric. Based Hermitian lattices are objects in a category (in contrast to being points of a set). As consequence they admit multitude of isomorphic representations which we refer to as realizations. To transform from one realization to another one applies two basic operations: change of basis and change of coordinates.
Change of basis. A change of basis of is a based Hermitian lattice ′=(G′,R) where G′=G∘T with T being an N×N matrix of determinant 1 with coefficients in the Gaussian ring i. The matrix T is called change of basis transition matrix. One can verify that the determinant condition ensures that the inverse matrix T−1 has also integral coefficients. We give two examples change of basis of 2=(I2,R2). The first is called unipotent change of basis (see
where a∈i. The second change of basis is called a flip (see
In fact, one can show that every unimodular integral matrix T, det T=1, is a finite composition of flips and unipotent transition matrices. This will become important when we discuss algorithms for computing reduced bases.
Change of coordinates. A change of coordinates of is a based Hermitian lattice ′=(G′,R′) where G′=A∘G and R′=A−HRA−1 with A being an N×N invertible matrix. The matrix A is called change of coordinates transition matrix. The matrix A defines an isometry between the Euclidean metrics R and R′, that is:
R′(Ax,Ay)=R(x,y), (2.11)
for every x,y∈N. There is a distinguished realization, called the upper triangular realization, where the basis matrix is upper triangular and the metric R=IN.
The corresponding transition matrix is given by A=UG−1, where U is the upper triangular factor in the Cholesky decomposition:
GHRG=UHU. (2.12)
Furthermore, it may be verified that
LLL reduced basis. A based Hermitian matrix =(G,R) is called LLL reduced (LLL stands for Lenstra-Lenstra-Lovász) if, roughly speaking, the vectors of the basis G are orthogonal with respect to the metric R, i.e., R(gi,gj)=0, for every i≠j.
Dimension two. We denote by P1 the orthogonal projection on the one dimensional subspace V1=g1 and respectively by P1⊥=I−P1 the orthogonal projection on the complement subspace. The formula of P1 is given by:
for every x∈2.
Definition 1.1. We say that is LLL reduced if it satisfies the following two conditions:
(1) Size reduction condition:
|ReR(g1,g2)|≤½R(g1,g1)
|ImR(g1,g2)|≤½R(g1,g1) (2.15)
(2) Well ordering condition:
R(g1,g2)≤R(g2,g2) (2.16)
The main statement in the two-dimensional theory is summarized in the following theorem:
Theorem 1.2 (Reduction theorem). If g1,g2 is LLL reduced basis then:
(1) The vector g1 satisfies:
√{square root over (R(g1,g1))}≤c√{square root over (R(λshort,λshort),)}
and λshort denotes the shortest non-zero vector in Λ.
(2) The vector g2 satisfies:
√R(P1⊥g2,g2)≤√R(P1⊥λ,λ),
In words, Theorem 1.2 asserts that the first basis vector g1 is no longer than a scalar multiple of the shortest non-zero vector in the lattice where the scalar is universal (does not depend on the lattice) and that the second basis vector is the shortest non-zero vector mod g1.
General dimension. Assuming is of rank N, we denote by Vn the subspace spanned by the first n basis vectors, i.e., Vn=(g1,g2, . . . gn). We take the convention V0={0}. The collection of subspaces Vn, n=0, . . . N form a complete flag:
0=V0⊂V1⊂ . . . ⊂VN=N. (2.17)
We denote by Pn the orthogonal projection (with respect to R) on the subspace Vn. Respectively, we denote by Pn⊥=I−Pn the orthogonal projection on the orthogonal complement subspace.
Definition 1.3. We say that (G,R) is LLL reduced if it satisfies the following two conditions:
(1) Size reduction condition:
|ReR(Pn−1⊥gn,gm)|≤½R(Pn−1⊥gn,gn)
|ImR(Pn−1⊥gn,gm)|≤½R(Pn−1⊥gn,gn).
(2) Well ordering condition:
R(Pn−1⊥gn,gm)≤R(Pn−1⊥gn+1,gn+1).
When G is upper triangular and R=I, the LLL conditions take a particularly simple form. The size reduction condition takes the form:
|Reg
nm|≤½gnn
|Img
nm|≤½gnn.
for every n=1, . . . N−1 and m>n. The ordering condition takes the form:
|g
nn|2≤|gn,n+1|2+|gn+1,n+1|2.
In this section, a delay Doppler convolution channel model is assumed:
y=h*x+w, (2.18)
where x[τ,v]∈N, y[τ,v]∈M, h[τ,v]∈M×N and w is the noise term. We assume that the input variables x[τ,v] are independent random variables, taking values in a QAM constellation set Ω⊂N, #Ω=2NQ.
Our ultimate goal is to calculate the finite a-posteriori probability distribution. Since this is formidable problem we approximate the prior probability distribution of x[τ,v] by a circular symmetric Gaussian distribution (
where e⊥{circumflex over (x)}s and ⊥x. Furthermore, the parameters of the above equation related through:
The MMSE variable {circumflex over (x)}s establishes a sufficient statistics for x, moreover the a-posteriori probability distribution is given by:
Roughly speaking, we wish to find a small subset of Ω that faithfully represents (2.22). We refer to this problem as the soft MAP detection problem. The expression in the exponent of (2.22) suggests to define the solution in geometric terms using the Euclidean metric R=Re−1. To this end, we introduce the following notion:
Definition 2.1. An Ω-ball of size K around the point {circumflex over (x)}s where K∈≥1 is the finite set:
B
K({circumflex over (x)}s,R,Ω)=Br(K)({circumflex over (x)}s,R)∩Ω, (2.23)
where r(K) the maximum radius such that |Br({circumflex over (x)}s,R)∩Ω|≤K.
In plain language, (2.23) is the ball of maximum radius around {circumflex over (x)}s that contains at most K constellation points. In particular, when K=1 the ball consists of one element {circumflex over (x)}h called the MAP detector, defined by:
{circumflex over (x)}
s=arg min{∥{circumflex over (x)}s−x∥R:x∈Ω}. (2.24)
The soft MAP detection problem is formulated as the problem of calculating the Ω-ball of size K for various choices of the parameter K. Note that (2.23) gives a precise mathematical definition of an optimal list of size K that approximates the a-posteriori probability distribution. In some embodiments, as the size of the list gets larger, the corresponding approximation gets better.
Extrinsic a-posteriori bit probabilities (LLR computation). Let L:Ω→≥0 denote the total MIMO likelihood function defined through the scaling equation (2.20) as:
L(x)∞exp(−∥{circumflex over (x)}s−(Ax+(1−A)
for every x∈Ω. Let bin denote the ith bit of the nth MIMO symbol where n=1, . . . N and i=1, . . . Q. Let Pin:{0,1}→≥0 denote the prior probability distribution on the (i,n) bit. One can verify that the (exact) extrinsic a-posteriori probability on the (i,n) bit is given by:
where m:{0,1}QN→Ω is the total mapping rule, converting between bits and constellation vectors.
Typically, in formula (2.26), the summation over all elements b∈{0,1}QN−1 is intractable for large values of N or/and Q. Instead, using the list we can define the approximated likelihood function:
Using this approximation, define the approximated extrinsic probability distribution by:
The summation in (2.28) is over a much smaller set of size≤K. In order for (2.28) to be well posed, the set BK({circumflex over (x)}s,R,Ω) should satisfy a mild technical requirement that for every (i,n) and bit value b∈{0,1} there exists a vector b∈{0,1}QN such that:
bin=b
m(b)∈BK({circumflex over (x)}s,R,Ω). (2.29)
In some embodiments, computing (2.23) may be difficult since the search space Ω grows exponentially with the MIMO dimension N and the constellation order Q. This difficulty increases when the Euclidean metric R is highly correlated. One example of reducing complexity of the search problem is to embed Ω inside a bigger set that is better behaved (and is generally referred to as problem relaxation).
One way to do that is to embed Ω as subset of an infinite (shifted) lattice. Let Λ=iN denote the standard Gaussian lattice of rank N. Let v0=(1+i)/2·1 where 1=(1, 1, . . . 1). One can easily verify that when Q is even. we have:
Ω⊂Λ+v0. (2.30)
We equip Λ with the based Hermitian lattice structure =(G,R) where G=IN. We define {circumflex over (x)}={circumflex over (x)}s−v0. We consider discrete balls centered at the point {circumflex over (x)} parametrized either by their radius or their size: given a positive real number r>0, we define the -ball of radius r around {circumflex over (x)} as the set:
B
r({circumflex over (x)},)={ξ∈iN:∥{circumflex over (x)}−G(ξ)∥R<r}. (2.31)
In words, the set (2.31) consists of all lattice points inside a ball or radius r around the point {circumflex over (x)} or, more precisely, the coordinates of these points with respect to the standard basis.
Given a positive integer K∈N≥1, we define the -ball of size K around {circumflex over (x)} as the set:
B
K({circumflex over (x)},)=Br(K)({circumflex over (x)},), (2.32)
where r(K) the maximum radius such that |Br({circumflex over (x)},)|≤K, i.e.,:
r(K)=max{r:|Br({circumflex over (x)},)|≤K}. (2.33)
In words, the set (2.32) is the -ball of maximum radius that contains at most K points. In some embodiments, the relation BK({circumflex over (x)}s,R,Ω)=BK({circumflex over (x)},)+v0 is available. The solution of the soft lattice detection problem may be defined to be the ball BK({circumflex over (x)},) for various choices of the parameter K. One benefit, amongst others, in replacing (2.23) by (2.32) is that based Hermitian lattices (unlike constellation sets) admit a multitude of isomorphic realizations (or images). This advantageously enables a convenient realization for to be chosen for computing (2.32), and this flexibility is one reason the lattice relaxation approach is employed.
In some embodiments, an approximate solution of the soft lattice detection problem may be implemented efficiently. For example, the construction may use an LLL reduced upper triangular realization of the based Hermitian lattice , which is denoted as LR.
That is, LR=(U,IN) where U=(unm) is an upper triangular matrix size reduced and well ordered. We further assume that we are given a change of basis T and a change of coordinates A such that U=A∘G∘T and IN=A−H∘R∘A−1. We construct an approximation of the ball BK(ŷ,LR) where ŷ=A{circumflex over (x)} and than derive an approximation of the ball BK({circumflex over (x)},) through the relation:
B
K({circumflex over (x)},)=T(BK(ŷ,LR)). (2.34)
We introduce a one parametric family of probability distributions on iN. For any positive real number a>0 we define a probability distribution Pa:iN→≥0 expressed as multiplication of one dimensional conditional probability distributions, i.e.,:
for every ξ=(ξ[1],ξ[2] . . . ξ[N])∈iN. The nth conditional probability is given by:
We note that sn(ξ[n+1:N]) is merely the normalization factor to ensure that the total probability is equal 1. Moreover, the complex number ŷn has geometric meaning: to explain it, we denote by Pn the orthogonal projection on the subspace Vn—spanned by the basis vectors u1, . . . un which by the upper triangular structure coincides with the standard coordinate n dimensional subspace Vn={x:x[n+1:N]=0}. We have:
To get some intuition for the construction it is worthwhile to consider the limiting case when a→∞. One can see, that in the limit the nth conditional probability collapses to a deterministic delta supported at the element ξbabai[n]=┌yn/unn┘ where ┌z┘∈i denotes rounding to the the closest Gaussian integer to z. The vector ξbabai=(ξbabai[1], . . . ξbabai[N]) is called the Babai lattice point and it can be interpreted as the solution of a successive cancellation detector. Fixing the value of the parameter K, we define the set:
B
K(a)={ξ:Pa(ξ)≥1/K}. (2.39)
In words, the set BK(a) consists of all points ξ∈iN whose probability is ≥1/K. Evidently, the number of elements in BK(a) is ≤K since the total probability is equal 1. The upshot is that BK(a) is a good approximation of the ball BK(ŷ,LR) if the parameter a is chosen appropriately. The main technical statement is that the set BK(a) contains a LR-ball of certain radius r which of course depends on the value of a and K. The precise statement is summarized in the following lemma:
Lemma 4.1 (Technical Lemma). We have Br(ŷ,ℒLR)⊂BK(a) where r=r(a,K) is given by:
with a=ln(ρ)/minm|umm|2.
Granting the validity of Lemma 4.1, it is natural to ask, for a given value of K, what is the optimal value of the parameter ρ that maximizes the radius r(a,K).
Denoting by ƒ(ρ) the expression on the right hand side of (2.40) and equating the derivative ƒ′(ρ0)=0, it may be verified that ρ0 and K are related through:
K=(ep0)4N/p
Substituting ρ0 back in ƒ we get the following formula for the optimal radius:
To put things in order, it is convenient to introduce an additional parameter: for a given radius r>0, we define:
To explain the meaning of this parameter we note that the expression ½ minm|umm| is the Babai radius, that is if ∥ŷ−Uξ★∥≤½ minm|umm| then ξbabai=ξ★ where ξ★ is the closest lattice point to ŷ. Granting this fact, G(r) can be interpreted as the radial gain of r over the Babai radius. For example if G(r)=2, we say r has 3 dB gain over the Babai radius. In practice, it is natural to fix the gain G and to ask for the value of the parameter K such that 4·r(K)2/minm|umm|2=G.
Using the expression in (2.42), and the definition in (2.43), we can derive:
This implies that ρ0=16N/G. Hence, using (2.41), we get:
K=(16Ne/G)G/4. (2.44)
From formula (2.44), the size parameter K as a function of the radial gain can be seen to grow as 0(NG/4) with the MIMO order, and thus, to realize larger gains, larger list sizes may be employed. For example, to get a radial gain of 3 dB, we use K=√{square root over (8eN)} and ρ0=8N.
In this section we describe a deformation of the standard sphere decoder that is specifically adapted to computing the set BK(a). For the rest of this section we fix the value of the parameters K and a. Next we introduce some basic terminology related to the underlying combinatories of sphere decoding.
For every number n=1, . . . N+1, we denote the nth level by Ln and it consists of elements ξ∈iN such that ξ[m]=0 for every m<n. In particular, the level LN+1 consists of one element which is the zero vector, called the root of the tree; on the other extreme, the level L1 is equal iN and its elements are called leaves. The edges connect vertices in Ln+1 with vertices in Ln: given a pair of elements ξn+1∈Ln+1 and ξn∈Ln, the ordered pair (ξn+1,ξn) is an edge if ξn[m]=ξn+1[m] for every m≥n+1.
Formula (2.36) may be used to introduce weights on the edges. For example, given an edge en=(ξn+1,ξn), its weight may be defined as:
It may be observed that a leaf ξ∈L1 belongs to the set BK(a) if and only if:
Here eN,eN−1, . . . ,e1 are the edges along the unique branch leading from the root to the leaf ξ. Note that substituting a=1 in (2.45) and omitting the affine term ln sn(ξn+1), we get the weights used by the conventional sphere decoder. In this regard, the new weights may be interpreted as a deformation of the sphere decoder structure. In some embodiments, and based on this weighted tree structure, the algorithm for computing the set BK(a) proceeds as a depth first search algorithm along the vertices of the weighted tree.
Reduction algorithm—invariant form. The algorithm accepts as input a basis matrix G and produces as output an LLL reduced basis matrix G′ such that G′=G∘T, where T∈SL2(i). Specifically, the algorithm constructs a sequence:
G=G0,G1, . . . GK=G′, (2.47)
such that Gk+1=Gk∘Tk where Tk∈SL2(i). Consequently, we have T=T0∘T1∘ . . . ∘TK−1.
The sequence (2.47) is constructed via alternating between the two basic transformations.
(1) Size reduction transformation. Reduce the size of the second basis vector with respect to the first basis vector according to the rule g2k+1=g2k−a·g1k where:
(2) Flipping transformation. Interchange the basis vectors according to the rule g1k+1=−g2k and g2k+1=g1k. This transformation is realized by the unimodular matrix:
Reduction algorithm—upper triangular form. We describe a version of the LLL reduction algorithm which is adapted to the upper-triangular realization. The algorithm accepts as input an upper-triangular basis matrix U and produces as output an LLL reduced basis matrix U′ such that U′=A∘U∘T where T∈SL2(i) and A∈U(2). Specifically, the algorithm constructs a sequence of upper triangular matrices:
U=U0,U1, . . . UK=U′, (2.48)
such that Uk+1=Ak∘Gk∘Tk where Tk∈SL2(i) and Ak∈U(2) (Unitary matrix). Consequently, we have T=T0∘T1∘ . . . ∘TK−1 and A=AK−1∘ . . . ∘A2∘A1.
In what follows, we use the notation uijk for the (i,j) coordinate of the matrix Uk and denote by unk=Uk(:,n) the nth vector of the kth basis.
The sequence of generators in (2.48) is constructed by alternating between the following two types of transformations.
(1) Size reduction transformation. Reduce the size of the second basis vector with respect to the first basis vector according to the rule u2k+1=u2k−au1k where:
(2) Flipping transformation. The basis vectors are flipped according to the rule g2k+1=−u1k and g1k+1=u2k. This change of basis is realized by the unimodular matrix:
Convergence of the algorithm. The convergence of the reduction algorithm can be proved by energy considerations. We define the energy functional:
E(G)=R(g1,g1)2R(P1⊥g2,g2).
Considering the sequence in (2.47), we have the following theorem:
Theorem 6.1. The following inequality always hold:
E(Gk+1)≤E(Gk),
moreover, when Gk is flipped we have E(Gk+1)≤αE (Gk) for some α<1.
Reduction algorithm—invariant form. The invariant LLL reduction algorithm accepts as input a basis matrix G and produces as output an LLL reduced basis matrix G′ such that G″=G∘T where T∈SLN(i). Specifically, the algorithm constructs a sequence:
G=G0,G1, . . . GK=G′, (2.49)
such that Gk+1=Gk∘Tk, Tk∈SLN(i). Consequently, we have T=T0∘T1∘ . . . ∘TK−1.
The sequence (2.49) is constructed using two types of change of basis transformations.
(1) Size reduction transformation. Let m>n∈[1,N]. Reduce the size of the jth basis vector with respect to the ith basis vector according to the rule gmk+1=gmk−agnk where:
(2) Flipping transformation. Let n∈[1, N−1]. Interchange the nth and the n+1th basis vectors according to the rule gnk+1=−gn+1k and gn+1k+1=gnk. This transformation is realized by the unimodular matrix:
Remark. The algorithm applies the size reduction step only if the size reduction condition is not satisfied and applies the flipping transformation always after applying size reduction transformation and only if the well ordering condition is not satisfied.
Reduction algorithm—upper triangular form. The upper triangular version of the LLL reduction algorithm accepts as input an upper triangular basis matrix U and produces as output an LLL reduced upper triangular basis matrix U′ such that U′=A∘U∘T where T∈SLN(i) and A∈U(N). In more details, the algorithm constructs a sequence of upper triangular matrices:
U0=U,U1, . . . UK=U′, (2.50)
such that Uk+1=Ak∘Uk∘Tk where Tk∈SLN(i) and Ak∈U(N). Consequently, T=T0∘T2∘ . . . ∘TK−1 and let A=A0∘A2∘ . . . ∘AK−1. In what follows, we use the notation uijk for the (i,j) coordinate of the matrix Uk and denote by unk=Uk(:,n) the nth vector of the kth basis.
The sequence (2.50) is constructed using two types of transformations, each being a combination of a change of basis with a change of realization.
(1) Size reduction transformation. For m>n, reduce the size of the mth basis vector um with respect to the nth basis vector un according to the rule umk+1=umk−aunk where:
Uk+1=Uk∘Tk,
(2) Flipping transformation. For n=1, . . . N−1, interchange between the nth and the n+1th basis vectors according to the rule gnk+1=−un+1k and gn+1k+1=unk followed by a change or realization to return to upper triangular form. The change or basis matrix is given by:
Typical constellations like QAM vectors can be naturally viewed as truncation of translated lattices in N. This observation suggests the idea of relaxing the problem of finding the closest constellation point (aka CCP problem) to the problem of finding the closest corresponding lattice point (aka CLP problem) ignoring the truncation boundary conditions. One of the advantages of working with lattices over working with constellation sets is that lattices admit various bases and realizations. This freedom enables to device lower complexity detection algorithms. Two examples of such algorithms are the Heuristic Babai detector and the exact sphere detector. Both algorithms use a canonical weighted tree structure which is associated with the lattice.
Hermitian vector spaces. Euclidean geometry over the complex numbers study Hermitian vector spaces.
Definition An Hermitian vector space is a pair (V,R) where V is a vector space over () and R:V×V→ an Hermitian product.
We follow the convention that the first variable is skew linear, that is:
R(λv,λ′v′)=
for every v,v′∈V and λ,λ′∈. Hermitian products transform under linear maps.
Given an Hermitian vector space (V,R) and a linear map A:U→V, one defines the pullback Hermitian product A*(R):U×U→ by the rule:
A*(R)(u,u′)=R(Au,Au′),
for every u,u′∈U. We note the relation with standard matrix definitions. When V=N the form R can be represented by an Hermitian matrix R so that:
R(x,y)=xHRy,
for every x,y∈N where xH denotes the row vector obtained as the complex conjugate of a column vector x. Moreover, if in addition U=M and A is an N×M matrix then:
A*(R)=AHRA,
where AH stands for the conjugate transpose of A. To conclude, one should view Hermitian vector spaces as the coordinate free counterpart of Hermitian matrices. In the sequel we use the notation:
R(v)=R(v,v),
for every v∈V. We also use the notation R0:N×N→ for the standard Hermitian product:
R
0(x,y)=xHy,
for every x,y∈N.
Duality. Hermitian vector spaces admit duals. Given an Hermitian vector space (V,R) there exists a canonical Hermitian form on the dual vector space V*. To define it, let {circumflex over (R)}:V→V* denote the canonical skew linear map sending a vector v∈V to the linear functional {circumflex over (R)}(v)∈V* defined by:
{circumflex over (R)}(v)(v′)=R(v,v′),
for every v′∈V. The map {circumflex over (R)} is a bijection since R is non-degenerate. The dual Hermitian product Rd:V*×V*→ is characterized by the pullback equation:
{circumflex over (R)}*(RD)=R⇔Rd={circumflex over (R)}−1*(R) (3.1)
Note that Rd is skew linear in the second variable. Finally, we note the relation with standard matrix definitions. Assume V=N. Let us denote by R0 the standard inner product on N. Assume that the Hermitian product R is represented by an Hermitian matrix:
R(x,y)=xHRy,
for every x,y∈N. Under these assumptions one can show that:
{circumflex over (R)}*
0(Rd)(x,y)=xHR−1y, (3.2)
for every x,y∈N. In words, the pullback of Rd under the map {circumflex over (R)}0 is represented by the inverse of the matrix R.
Remark. There is a nice interpretation of R and Rd in the framework of probability theory. Under this interpretation, the form R is the information matrix of a Gaussian random variable X∈V such that:
The dual form Rd is the covariance matrix of X. One can show that:
for every pair of linear functionals α,β∈V*. In words, Rd(α,β) is the covariance between any pair of scalar measurements of the random variable X. Note that in standard texts both R and Rd are given by Hermitian matrices satisfying Rd=R−1. This convention is not canonical and relies on the existence of another Hermitian form R0 on V used to identify V and V*.
Orthogonal projection. Probably the most important notion in Euclidean geometry is that of orthogonal projection. Let (V,R) be a finite dimensional Hermitian vector space. Let U⊂V be a linear subspace. The orthogonal complement of U is denoted by U⊥ and is defined as:
U
⊥
={v∈V:R(v,u)=0 for every u∈U}.
Every vector v∈V can be written in a unique manner as a sum:
v=P
U(v)+PU
where PU(v)∈U and PU⊥(v)∈U⊥. The rule vPU(v) gives a linear map PU:V→V called the orthogonal projection on U. Similarly, the rule vPU
P
U
+P
U
=Id
PU∘PU=PU,
PU
PU
In words, PU and PU
for every v∈V. In words, PU(v) is the vector in U that is the closest to v with respect to the Euclidean metric R.
Remark. The notion of orthogonality and hence the definition of orthogonal projection depends very strongly on the Hermitian structure R. Note that even when V=N and U={(x1, . . . xK,0, . . . 0):xi∈} is the standard coordinate subspace the orthogonal complement U⊥ might look very different from {(0, . . . 0,xK+1, . . . xN):xi∈} unless the Hermitian product is the standard product:
R
0(x,y)=xHy,
for every x,y∈N.
Schur reduction. Let (V,R) be a finite dimensional Hermitian vector space. Let U⊂V be a linear subspace. We define a degenerate Hermitian product RU:V×V→ by:
R
U(v,v′)R(PUv,v′). (3.3)
The Hermitian product RU is called the Schur reduction of R to the subspace U. One can verify that RU(u,v)=0 for every u∈U⊥ and v∈V thus RU reduces to a non-degenerate Hermitian product on the quotient vector space V/U⊥. Similarly, the Schur reduction with respect to U⊥ is given by:
The Hermitian product RU
We note the relation to standard matrix formulation. Assume V=N and U={(x1, . . . xK,0, . . . 0):xi∈}. Assume R is represented by an N×N Hermitian matrix R of the form:
where A, B and D are matrices of dimensions K×K, K×N X−K and N−K×N−K respectively. We assume A is invertible. The Schur reductions of RU and RU
Based Hermitian lattices. Let i=⊕i be the ring of Gaussian integers where i=√{square root over (−1)}. The main object of study in these notes is full rank lattices lying inside Hermitian vector spaces. Such objects lie at the intersection of lattice theory and Euclidean geometry. In fact, we will require in addition a choice of a particular basis of the lattice giving rise to the notion of a based Hermitian lattice.
Definition. A based Hermitian lattice over i is a triple (V,G,R) where:
A based Hermitian lattice (V,G,R) defines in particular a lattice Λ=G (iN). Elements of Λ are i linear combinations of the vectors λi=G (ei) where ei is the standard ith basis vector of N, that is:
Λ={a[1]λ1+ . . . +a[N]λN:a[k]∈i}.
The vectors λ1, . . . λn∈V form a basis of Λ. The map G is called the generator of Λ. The Hermitian vector space (V,R) is called the realization space of Λ.
Remark. A based Hermitian lattice is a full rank lattice in an Hermitian vector space equipped with a specific choice of basis.
Two basic operations on based Hermitian lattices include:
1. Change of basis. Changing the basis of a based Hermitian lattice amounts to changing the generator matrix G. Let SLN(i) denote the group of Gaussian unimodular N×N matrices, that is, the set consisting of N×N matrices of determinant 1 with coefficients in the ring i equipped with the operation of matrix multiplication (it can be shown that the inverse of a unimodular matrix is also unimodular).
Definition. A change of basis of a based Hermitian lattice (V,G,R) is a based Hermitian lattice of the form (V,G′,R) where G′=G∘T for some T∈SLN(i).
2. Change of realization. Changing the realization of a based Hermitian lattice amounts to changing the realization space (V,R).
Definition. A change of realization of a based Hermitian lattice (V,G,R) is a based Hermitian lattice (V′,G′,R′) whew G′=A∘G and R=A*(R′) for some isomorphism A:VV′.
Note that if V=V′=N then the forms R and R′ can be represented by Hermitian matrices and the pullback equation R−A*(R′) translates to the conjugation condition:
R=AHR′A.
Triangular realizations. Let (V,G,R) be a based Hermitian lattice. There exists two special realizations. An upper triangular realization (N,U,R0) where U is an upper triangular matrix. The generator U is defined by the upper Cholesky decomposition:
G*(R)=UHU.
Similarly, there exists a lower triangular realization (N,L,R0) where L is a lower triangular matrix. The generator L is defined by the lower Cholesky decomposition:
G′*(R)=LHL.
We refer to an upper triangular realization of an Hermitian lattice as upper triangular Hermitian lattice, and a lower triangular realization of an Hermitian lattice is referred to as a lower triangular Hermitian lattice.
Standard filtrations. Let (V,G,R) be a based Hermitian lattice with basis λi=G(ei),i=1, . . . N. We define two ascending filtrations of V. The lower filtration:
0=V0⊂V1⊂ . . . ⊂VN=V,
where Vn the subspace of V spanned by the basis vectors λ1, . . . λn. The upper filtration:
0=V0⊂V1⊂ . . . ⊂VN=V,
where Vn is the subspace of V spanned by the basis vectors λN−n+1, . . . λN. In what follows, we use the notations [R]n and [R]n for the Schur complements of R with respect to Vn and Vn respectively.
Duality. Based Hermitian lattices admit duals. Let (V,G,R) be a based Hermitian lattice with basis λ1=G(ei),i=1, . . . N. Let λ*i∈V*,i=1, . . . N denote the dual basis. Recall that λ*i is defined by;
for every j∈[1,N]. The dual based Hermitian lattice is the triple (V*,Gd,Rd) where the dual generator is defined by:
G
d(ei)=λ*i,
for every i∈[1,N]. Note that the lattice Λ*=Gd(iN) is dual to Λ in the conventional sense, namely, Λ*={α∈V*:α(λ)∈ for every λ∈Λ}.
VBLAST reduced basis. Let (V,G,R) be a based Hermitian lattice. Let λ1, . . . λN∈V be the corresponding basis where λi=G(ei). The VBLAST reduction condition is expressed in terms of the dual lattice.
Definition. We say that λ1, . . . λN is VBLAST reduced basis of Λ if it satisfies the following condition:
for n∈[0,N−1].
In words. λ1, . . . λN is VBLAST reduced if the basis vectors are well ordered so that λ*N−n has the minimum dual norm taken mod λ*N−n+1, . . . λ*N.
Remark. There is a nice interpretation of the VBLAST well ordering condition in the framework of Wiener theory of decision feedback equalization. Under this interpretation, the last among the first N−n symbols has the maximum SNR after substracting the interference from symbols N−n+1, . . . N. This implies that the VBLAST ordering is optimal with respect to error propagation effect.
VBLAST reduction algorithm. The VBLAST reduction algorithm accepts as input a based Hermitian lattice (V,G,R) and produces as output a based Hermitian lattice (V,G′,R) such that:
(1) G′=G∘T where T∈SLN(i) -S a permutation matrix.
(2) λ′i=G′(ei),i=1, . . . N is a VBLAST reduced basis of Λ=G(iN).
The VBLAST change of basis permutation π:[1,N]→[1,N] is defined according to the following recursive formula. At step n∈[0,N−1].
Define the value π(N−n) by:
Consequently, the change of basis transformation is the matrix T obtained by permuting the columns of the identity according to π.
LLL (Lenstra-Lenstra-Lovász) reduced basis. One can view the LLL reduction condition as kind of a generalization of the VBLAST reduction condition explained in the previous subsection. Let (V,G,R) be a based Hermitian lattice. Let λ1, . . . λN∈V be the corresponding basis where λi=G(ei).
Definition. We say that λ1, . . . λN is an LLL reduced basis of Λ if it satisfies the following two conditions:
(1) Size reduction condition:
|Re[R]i−1(λi,λj)|≤½[R]i−1(λi,λi),
|Im[R]i−1(λi,λj)|≤½[R]i−1(λi,λi),
(2) Well ordering condition:
[R]i−1(λi,λi)≤[R]i−1(λi+1,λi+1),
Note that the size reduction condition implies the inequality:
for every i=1, . . . N−1 and j>i. When λ1, . . . λN is LLL reduced basis we say that the generator matrix G is LLL reduced.
Reduction algorithm. We describe two versions of the LLL reduction algorithm. The first version is independent of the specific realization of the lattice and the second version is defined in terms of upper triangular realizations.
Invariant form. The invariant LLL reduction algorithm accepts as input a based Hermitian lattice (V,G,R) and produces as output a (change of basis) based Hermitian lattice (V,G′,R) such that:
(1) G′=G∘T where T∈SLN(i).
(2) λ′i=G′(ei),i=1, . . . N is an LLL reduced basis of Λ=G′(iN)=G(iN).
In more details, the algorithm constructs a sequence of generator matrices:
G=G0,G1, . . . GK=G′, (3.5)
such that Gk+1=Gk∘Tk,Tk∈SLN(i). Let T=T0∘T2∘ . . . ∘TN−1. The LLL reduced generator GK=G∘T.
The sequence (3.5) is constructed using two types of change of basis transformations.
Size reduction transformation. Let j>i∈[1,N]. Reduce the size of the jth basis vector with respect to the ith basis vector according to the rule λjk+1=λjk−aλik where:
Here [-] stands for the closest Gaussian integer. This transformation is realized by the unimodular matrix:
i.e., the unit matrix with additional non-zero entry Tk(i,j)=−a.
Flipping transformation. Let i∈[1,N−1]. Interchange the ith and the i+1th basis vectors according to the rule λik+1=−λi+1k and λi+1k+1=λik. This transformation is realized by the unimodular matrix:
i.e., the unit matrix except that
Remark. The algorithm applies the size reduction step only if the size reduction condition is not satisfied and applies the flipping transformation always after applying size reduction transformation if and only if the well ordering condition is not satisfied.
Upper triangular form. The upper triangular version of the LLL reduction algorithm accepts as input an upper triangular based Hermitian lattice (N,U,R0) and produces as an output an upper triangular based Hermitian lattice (N,U′,R0) such that:
(1) U′=A∘U∘T where T∈SLN(i) and A∈UN.
(2) λ′i=U′(ei),i=1, . . . N is an LLL reduced basis of Λ′=U′(iN).
In some embodiments, the algorithm constructs a sequence of upper triangular matrices:
U0=U,U1, . . . UK=U′, (3.6)
such that Uk+1=Ak∘Uk∘Tk where Tk∈SLN(i) and AK∈UN. Let T=T0∘T2∘ . . . ∘TN−1 and let A=A0∘A2∘ . . . ∘AN−1. The reduced upper triangular realization is given by UK=A∘U0∘T. In what follows we use the notation:
The sequence (3.6) is constructed using two types of transformations, each being a combination of a change of basis with a change of realization.
Size reduction transformation. Let j>i∈[1,N]. Reduce the size of the jth basis vector with respect to the ith basis vector according to the rule λjk+1=λjk−aλik where:
This is realized by the unimodular main:
i.e., the unit matrix with additional non-zero entry Tk(i,j)=−a. To conclude:
Uk+1=Uk∘Tk.
Note that the upper triangular size reduction transformation has the same form as its invariant counterpart since Uk∘Tk is already in upper triangular form.
Flipping transformation. Let i∈[1,N−1]. Interchange between the ith and the i+1th basis vectors according to the rule λik+1=−λi+1k and λi+1k+1=λik followed by a change of realization to return to upper triangular form. The change of basis matrix is given by:
The change of realization matrix is defined as follows. Let Gk+1=Uk∘Tk. One can verify that Gk+1 is almost upper triangular except that L=Gk+1(i:i+1,i:i+1) is lower triangular. Let L=QR be the Q−R decomposition of L where Q is unitary and R upper triangular. We define:
i.e., Ak is the unit matrix except that Ak(i:i+1,i:i+1)=Q. To conclude:
Uk+1=Ak∘Uk∘Tk.
Alternatively, we can define Uk+1=Gk except that Uk+1(i:i+1,i:i+1)=U where U is the upper triangular factor in the Cholesky factorization L*L=U*U.
The LLL reduction theory in dimension two. For pedagogical reasons it is worthwhile to explain the general theory in the special case when dim V=2. The set-up includes a based Hermitian lattice (V,G,R) where dim V=2. The vectors λ1=G(e1) and λ2=G(e2) form a basis of the corresponding rank 2 lattice Λ=G(i2).
Definition. We say that λ1,λ2 is an LLL reduced basis of Λ if it satisfies the following two conditions:
(1) Size reduction condition:
|ReR(λ1,λ2)|≤½R(λ1,λ1).
|ImR(λ1,λ2)|≤½R(λ1,λ1).
(2) Well ordering condition:
R(λ1,λ1)≤R(λ1,λ1).
Note that the size reduction condition implies the inequality:
The nice thing about LLL reduced bases is that they consists of short vectors which are nearly orthogonal to one another. The quantitative meaning of this statement is the content of the following theorem.
Theorem. (Reduction theorem). If λ1,λ2 is LLL reduced basis then:
(1) The vector λ1 satisfies:
√{square root over (R(λ1,λ1))}≤c√{square root over (R(λshort,λshort),)}
and λshort denotes the shortest non-zero vector in Λ.
(2) The vector λ2 satisfies:
In words, the Theorem asserts that the first basis vector λ1 is no longer than a scalar multiple of the shortest non-zero vector in the lattice where the scalar is universal (does not depend on the lattice) and that the second basis vector is the shortest non-zero vector mod λ1.
Reduction algorithm—invariant form. The algorithm accepts as input a based Hermitian lattice (V,G,R) and produces as output a (change of basis) based Hermitian lattice (V,G′,R) such that:
(1) G′=G∘T, where T∈SL2(i).
(2) λ′i=G′(ei),i=1.2 is an LLL reduced basis of Λ=G(i2).
In some embodiments, the algorithm constructs a sequence of generator matrices:
G=G0,G1, . . . GK=G′. (3.7)
Herein. Gk+1=Gk∘Tk where Tk∈SL2(i). The sequence (3.7) is constructed by alternating between the two following basic transformations.
Size reduction transformation. Reduce the size of the second basis vector with respect to the first basis vector according to the rule λ2k+1=λ2k−aλ1k where:
This transformation is realized by the unimodular matrix:
Flipping transformation. Interchange the basis vectors according to the rule λ1k+1=−λ2k and λ2k+1=λ1k. This transformation is realized by the unimodular matrix:
Reduction algorithm—upper triangular form. The algorithm accepts as input an upper triangular based Hermitian lattice (2U,R0) and produces as output an upper triangular based Hermitian lattice (2,U′,R0) such that:
(1) U′=A∘U∘T where T∈SL2(i) and A∈U2.
(2) λ′i=U′(ei),i=1,2 is an LLL reduced basis of Λ′=U′(i2).
In some embodiments, the algorithm constructs a sequence of upper triangular generator matrices:
U=U0,U1, . . . Uk=U′, (3.8)
such that Uk+1=Ak∘Gk∘Tk where Tk∈SL2(i) and Ak∈U2. In what follows, we use the notation:
The sequence of generators in (3.8) is constructed by alternating between the following two types of transformations.
Size reduction transformation. Reduce the size of the second basis vector with respect to the first basis vector according to the rule λ2k+1=λ2k−aλ1k where:
This transformation is realized by the unimodular matrix:
Flipping transformation. The basis vectors are flipped according to the rule λ2k+1=−λ1k and λ1k+1=λ2k. This change of basis is realized by the unimodular matrix:
The resulting matrix Gk+1=Uk∘Tk is lower triangular and is transformed back to upper triangular form by change of realization Ak=Q, where Q is the Unitary factor in the QR decomposition of Gk+1. Alternatively. Uk+1=U—the upper triangular multiplier in the Cholesky decomposition Gk+1*Gk+1=U*U.
Convergence of the algorithm. The convergence of the reduction algorithm can be proved by energy considerations. We define the energy functional:
E(G)=R(λ1,λ1)2[R]1(λ2,λ2).
Considering the sequence in (3.7), we have the following theorem.
Theorem. The following inequality always hold:
E(Gk+1)≤E(Gk).
moreover, when Gk is flipped we have E(Gk+1)≤αE(Gk) for some α<1.
Closest lattice point (CLP) problem. The CLP problem emerges as a relaxation of a hard finite quadratic minimization problem called the closest constellation point (CCP) problem. We consider the following general set up:
(1) Let (V,R) be an Hermitian vector space of dimension N.
(2) Let G:N→V be the generator of a lattice Λ=G(iN).
(3) Let {circumflex over (x)}∈V be a point in V called the soft estimation.
(4) Let Ω=(Λ+v0)∩B where B is some bounded set and v0∈V is some translation vector. The set Ω is called the set of constellation points.
The CCP problem is defined by:
Let {circumflex over (x)}t={circumflex over (x)}−v0. The CLP problem is the following relaxation of (3.9):
Consequently, the derived hard estimation of {circumflex over (x)} is given by {circumflex over (x)}h=G(a*)+v0. Interestingly, the price in performance due to lattice relaxation is negligible.
Remark. Note that the standard QAM constellation sets can be viewed as a truncation of translated Gaussian lattices.
Boosting CLP via lattice reduction. The main advantage in considering the lattice relaxation of the CCP problem is that lattices in contrast to constellation sets admit various bases and realizations which can be used to reduce the complexity of the CLP minimization problem. In particular, let (N,U,R0) be an LLL reduced upper triangular realization of (V,G,R). Recall that
U=A∘G∘T,
where:
(1) U is upper triangular.
(2) T∈SLN(i).
(3) A:VN is an isomorphism satisfying A*(R0)=R.
(4) λi=U(ei),i=1, . . . N is an LLL reduced basis of the lattice Λ=U(iN).
Note, that once T is known (for example after applying the invariant LLL reduction algorithm) the matrix U can be obtained via the Cholesky decomposition:
(G∘T)*(R)=UHU.
Let {circumflex over (z)}=A({circumflex over (x)}t). The CLP problem in the reduced realization is defined by:
The hard decision {circumflex over (x)}h is derived from the “reduced” minimum a* by the rule {circumflex over (x)}h=G∘T(a*)+v0. We refer to the CLP problem in the reduced basis as LR CLP.
Lattice detection problems. Herein, CLP relaxations of various detection problems originating from communication theory are described.
Lattice detection problem for a single tap channel. First consider the simpler case of a single tap channel model:
y=h(x)+w, (3.11)
where:
(1) h:V→M is the channel transformation.
(2) x∈Ω is the transmit vector assumed to belong to the constellation set.
(3) y∈M is the received vector.
(4) w∈M is the noise term assumed to be a Gaussian random vector w∈(0,Rww) of mean zero and covariance matrix Rww.
In order to define the CLP problem we assume a Gaussian prior x∈(0,Rxx). Typically, Rxx=P·Id for some positive power P. Under these assumptions, the conditional probability of x given y takes the form:
(x|y)∞e−R({circumflex over (x)}−x,{circumflex over (x)}−x),
where R=Ree−1=(h*Rww−1h+Rxx−1) and {circumflex over (x)}=Cy=Reeh*Rww−1y. In words, {circumflex over (x)} is the MMSE estimator of x in terms of y and Ree is the covariance matrix of the residual error e=x−{circumflex over (x)}. The detection problem is defined by:
Consequently, the CLP relaxation is defined by:
where {circumflex over (x)}t={circumflex over (x)}−v0.
Lattice detection problem for a multi-tap channel. Now consider the dispersive channel model:
y=h(x)+w, (3.12)
where:
(1) h is the sequence of channel taps, h[k]∈Hom(V,M),k∈.
(2) x is the transmit sequence, x[n]∈Ω,n ∈.
(3) y is the received sequence, y[n]∈N,n∈.
(4) w is the noise sequence assumed to be uncorrelated in time with w[n]∈(0,Rww)
We further assume that the channel is causal, that is, h[k]=0 for every k<0 and finite, that is, h[k]=0 for every k>v.
The term h(x) in (3.12) stands for the convolution: h(x)=Σk=0vh[k](x[n−k]). In order to define the CLP problem we assume an uncorrelated Gaussian prior on x where x[n]∈(0,Rxx). Under these assumptions, we define {circumflex over (x)} to be the decision feedback Wiener estimate of x, given by:
{circumflex over (x)}=f(y)+b(x). (3.13)
(1) f=f[−μ], . . . f[0]∈Hom(M,V),μ∈≥1 is the Wiener forward filter.
(2) b=b[1], . . . b[v]∈Hom(V,V) is the strict Wiener feedback filter.
For example, (3.13) is the sum of two convolution terms:
It can be shown that the error sequence e[n]=x[n]−{circumflex over (x)}[n] is uncorrelated in time with e[n]∈(0,Ree). Taking the Hermitian form R=Ree−1, the detection problem at time n is defined by:
Consequently, the CLP relaxation at time n is defined by:
where {circumflex over (x)}t[n]={circumflex over (x)}[n]−v0.
Computation of DF Weiner filters and the residual covariance matrix. The first step is to translate the channel model in (3.12) into a matrix form. To this end, define the column vectors:
where ny=μ+1,nx=μ+v+1. We define the matrix:
Under these conventions, the channel model translates to the matrix equation:
Y=HX+W. (3.14)
Furthermore, define the Weiner inverse channel by:
X=CY+E, (3.15)
where C∈Mat(nxN,nyM) is the Wiener estimator of X from Y and E is the residual error which, by definition, is uncorrelated with Y. We have the following formulas:
Consider REE as an nx×nx block matrix. Let REE=LDU be the block LDU decomposition where U is block upper triangular with IM on every block of the diagonal. D is block diagonal and L=UH.
Apply L−1 to both sides of the inverse channel equation (3.15) to get:
L
−1
X=L
−1
CY+L
−1
E. (3.16)
The equation (3.16) exhibits the property that the new noise term L−1E is uncorrelated in time with R{tilde over (E)}{tilde over (E)}=D. Write L−1=I−B. Note that B is strictly block lower triangular. Substituting in (3.16), we get
X=L
−1
CY+BX+L
−1
E. (3.17)
Taking the 0th block coordinate of both sides of Equation (3.17), we get:
x[0]=e0TL−1CY+e0TBX+e0TL−1E.
We have:
(1) f[k]=e0TL−1C[−k], for k=−μ, . . . 0.
(2) b[k]=e0TB[−k], for k=1, . . . v.
(3) Ree=D0,0.
Lattice detection problem for a multi-tap D channel. Let Nf∈≥1. Let /Nf denote the ring of integers modulo Nf. An element l∈/Nf is called a frequency slot. We consider the dispersive 2D channel model:
y=h(x)+w, (3.18)
where:
(1) h[k,l]∈Hom(V,M),k∈,l∈/Nf, called the 2D channel.
(2) x[n,l]∈Ω,n∈,l∈/Nf, called the transmit sequence.
(3) y[n,l]∈,n∈,l∈/Nf, called the receive sequence.
(4) w[n,l]∈,n∈,l∈/Nf, called the noise sequence.
The noise sequence is assumed to be uncorrelated in time and in frequency with w[n,l]∈(0,Rww). We further assume that the channel is causal and finite in time, that is, h[k,l]=0 for every k<0 and k>v.
The term h(x) in (3.18) stands for 2D convolution:
where the equation l1+l2=l under the second sum is taken modulo Nf. Hence, the first convolution is linear and the second convolution is cyclic. We assume an uncorrelated Gaussian prior on x where x[n,l]∈(0,Rxx). Under these assumptions, we define {circumflex over (x)} to be the 2D decision feedback Wiener estimate of x, given by:
{circumflex over (x)}=f(y)+b(x), (3.19)
where:
(1) f=f[−μ,l], . . . f [0,l]∈Hom(M,V),μ∈≥1 is the 2D Wiener forward filter.
(2) b=b[1,l], . . . b[v,l]∈Hom(V,V) is the strict 2D Wiener feedback filter.
More explicitly, (3.19) is the sum of two convolution terms:
It can be shown that the error sequence e[n,l]=x[n,l]−{circumflex over (x)}[n,l] is uncorrelated in time and Toeplitz in frequency with e[n,−]∈(0,Ree),
In the definition of the CCP problem we ignore the correlations between different frequencies, by taking R=r11−1. The detection problem at time n and frequency l is defined by:
Consequently, the CLP relaxation at time n and frequency l is defined by:
where {circumflex over (x)}t[n,l]={circumflex over (x)}[n,l]−v0.
Remark. Ignoring the cross correlations between different frequency slots in the definition of the CCP problem leads to a sub-optimal relaxation of the joint detection problem of x[n,−]∈ΩN
Computation of 2D Wiener filters via Fourier domain representation. Herein, reducing the computation of 2D Wiener filters f[k,l] and b[k,l] and the residual covariance matrix to a 1D computation is described. To this end, the (normalized) DFT along the frequency dimension is applied to the 2D channel model in (3.18). This yields a channel model of the form:
Y=H(X)+W, (3.20)
where:
(1) H[k,l]∈DFTl(h[n,l]),k∈,l∈/Nf.
(2) X[n,l]=DFTl(x[n,l]),n∈,l∈/Nf.
(3) Y[n,l]=DFTl(y[n,l]),n∈,l∈/Nf.
(4) W[n,l]=DFTl(w[n,l]),n∈,l∈/Nf.
We refer to (3.20) as the Fourier domain representation of the 2D channel model (3.18).
Since the DFT is a Unitary transformation, the Fourier noise term remains uncorrelated in time and frequency with W[n,l]∈(0,Rww). Most importantly, the DFT converts cyclic convolution to multiplication, hence, the Fourier 2D channel model is composed of Nf non-interacting 1D channels, namely:
for every l=0, . . . Nf−1. Let {circumflex over (X)}=F(Y)+B(X) denote the DF Wiener estimate of X in terms of Y where F[k,l]∈Hom(M,V) and B[k,l]∈Hom(V,V) are the forward and backward Wiener filters respectively. One can show that the residual error E=X−{circumflex over (X)} is uncorrelated in time and frequency with E[n,l]∈(0,REE(l)). l=0, . . . Nf−1. We have:
In this section, we describe various algorithms for finding the CLP. The algorithms fall into two classes: exact algorithm which find the true minimum and heuristic algorithm which find an approximation of the minimum. The trade-off is complexity. All algorithms incorporate a fundamental weighted tree structure representation of the lattice where leaves of the tree are in 1-1 correspondence with vectors in the lattice. This enables to localize the search problem enabling a branch and bound searching strategies. The different between the various algorithms is the strategy applied to search through the tree.
Set-up. We assume the following set-up:
(1) Let (V,G,R) be a based Hermitian lattice.
(2) Let Λ=G(iN) be the associated lattice with basis λi=G(ei),i=1, . . . N.
(3) Let {circumflex over (x)}∈V be the soft estimation.
The CLP problem is defined by:
In words: find the point in the lattice Λ that is the closest to {circumflex over (x)} with respect to the Euclidean metric R.
The lattice tree. There exists a canonical weighted tree structure associated with (V,G,R) and {circumflex over (x)}, called the lattice tree. Formally, the lattice tree is a triple =(V,∈,w) where:
(1) V is the set of vertices.
(2) ∈ is the set of edges.
(3) w:∈→≥0 is a weight function on the edges.
Topology of the lattice tree. The topology of the lattice tree is defined in terms of the standard lattice iN. The set of vertices is a disjoint union of N+1 layers:
where Vk={a∈iN:a[k]=0,k=1, . . . N−K} is the set of vertices at layer k. Consequently, the set of edges is a disjoint union N layers:
where ∈k⊂Vk×Vk+1 is the set of edges connecting layer k with layer k+1. An edge in ∈k is an ordered pairs (a,b) such that b[l]=a[l] for every l≥N−K+1.
Note that is a uniform infinite tree of depth N. The root of is the zero vector 0∈iN. The zero layer V0={0}. All the branches are of length N and are in one to one correspondence with vectors in iN. Under this correspondence a vector a∈iN is associated with the branch:
where ak∈Vk and ek=(ak,ak+1)∈∈k. The vertex ak is defined by:
Finally, we introduce the following notation. Given a vertex a∈Vk, we denote by out (a)⊂∈k the edges emanating from a and if k≥1, we denote by in(a)∈∈k−1 the unique edge that points to a.
The weight structure on the lattice tree. The weight function w is defined in terms of the based Hermitian lattice (V,G,R) and the soft estimation {circumflex over (x)}. Our approach is to first describe a vertex weight function μ:V→≥0 and than derive w as a localized version of μ. The precise relation between w and μ is:
for every ak∈Vk where:
is the unique branch connecting the root with ak. Let us fix a vertex ak∈Vk.
Definition. The weight μ(ak) is defined by:
In words, μ(ak) is the square distance between the vector {circumflex over (x)} and the vector G(ak) measured mod λ1, . . . λN−k. In order to define w we introduce the notation:
{circumflex over (x)}
k
=P
V
({circumflex over (x)}−G(ak)). (3.22)
Let ek=(ak,ak+1)∈∈k.
Definition. The weight w(ek) is defined by:
The relation between the vertex and edge weights is formalized in the following proposition.
Proposition. We have:
μ(ak+1)=μ(ak)+w(ek).
Explicit formulas for the weight function. In order to write the weight values in explicit form we assume that (V,G,R) is an upper triangular realization, that is:
(1) V=N.
(2) G=U is upper triangular matrix.
(3) R=-,- is the standard Hermitian product on N.
In this realization:
V
k={(x1, . . . xN):xi=0,i≥k+1},
V
k
⊥={(x1, . . . xN):xi=0,i≤k},
for every k=0, . . . N. The orthogonal projections PV
Given a vertex ak∈Vk, the weight μ(ak) is given by:
where d={circumflex over (x)}−U(ak). Given an edge ek=(ak,ak+1)∈∈k, the weight w(ek) is given by:
where {circumflex over (x)}k=PV
Remark. In some embodiments, working in an upper triangular realization is beneficial for calculation reasons as the weight formulas can be explicitly expressed in terms of the coefficient of the generator U by a simple recursive formula.
Babai detector. The Babai detector is the simplest heuristic detector. The output of the Babai detector is the branch a*∈iN.
defined recursively as:
for every k=0, . . . N−1. In words, the Babai detector is a greedy algorithm that picks the branch that at every layer extends to the next layer via the edge of minimal weight.
Remark. In some embodiments, the Babai lattice detector is equivalent to decision feedback MMSE. However, the performance of the Babai detector after applying LLL basis reduction is significantly superior to the standard DF MMSE.
Sphere detector. The sphere detector is an exact detector. The sphere detector travel through the lattice tree according to a depth first search strategy where the edges in out (ak) are chosen in ascending order according to their weight. The algorithm incorporates a threshold parameter T, initialized in the beginning to T=∞. The algorithm follow the following two rules:
(1) When reaching a leaf aN∈VN update T←μ(aN).
(2) Branch from ak∈Vk to ak+1∈Vk+1 only if μ(ak+1)<T.
The method 900 includes, at 920, estimating the information bits by inverting and pre-processing a single error covariance matrix representative of estimation error for all delay-Doppler bins. In other words, estimating the information bits is based on an inverse of a single error covariance matrix of the signal.
As described in the present document, in some implementations, the method 900 may further include computing a unimodular matrix U, and improving numerical conditioning (or equivalently, decreasing the condition number) of the single error covariance matrix by multiplying with U. While the matrix U could be calculated using several methods, e.g., a brute force method, in some embodiments a lattice reduction algorithm (including the exemplary implementations described in Sections 2 and 3) may be used to reduce numerical complexity. In some embodiments, the lattice reduction algorithm may include a size reduction transformation followed by a flipping transformation. In an example, the size reduction transformation may be based on a first unimodular matrix and the flipping transformation may be based on a second (different) unimodular matrix.
The method 900 may also include lattice detection. Further, this may include conversion of the above detected lattice to standard lattice (QAM lattice) and estimating information bits by performing symbol to bit de-mapping. In some embodiments, the method 900 includes, for each delay-Doppler bin, performing a Babai detection (lattice detection) on the output of the lattice reduction algorithm. In some embodiments, the method 900 includes, for each delay-Doppler bin, performing a sphere detection (lattice detection) on the output of the lattice reduction algorithm. In some embodiments, and more generally, the method 900 includes, for each delay-Doppler bin, performing a closest lattice point (CLP) detection on the output of the lattice reduction algorithm. In some embodiments, the lattice reduction algorithm may be implemented using the LLL algorithm, a Block Korkine Zolotarev (BKZ) algorithm, a random sampling reduction (RSR) algorithm or a primal dual reduction (PDR) algorithm.
The method 900 may also include first determining that the inverse of the single error covariance matrix is numerically well-conditioned (or equivalently, having a condition number close to unity), and then performing a slicing operation on the QAM symbols in each of the delay-Doppler bins.
It will be appreciated that the present document discloses, among other features, techniques that allow for embodiments that provide near-maximum likelihood performance in OFTS systems. Ideally, maximum likelihood algorithm may produce theoretically best results. However, a Babai detector or a sphere decoder may be used by practical systems to achieve performance nearly as good as ML receivers. As discussed herein, lattice reduction may be used as a pre-processing step for the implementation of a Babai detector or a sphere decoder. In particular, for OTFS modulated signals, implementation of lattice reduction can be made computationally bearable (compared to OFDM signals) by reducing the complexity of implementation by having to invert a single Ree matrix. Furthermore, conditioning of the matrix also becomes a numerically easier task because only one matrix needs to be processed, rather than multiple matrices in case of OFDM systems.
The disclosed and other embodiments, modules and the functional operations described in this document can be implemented in digital electronic circuitry, or in computer software, firmware, or hardware, including the structures disclosed in this document and their structural equivalents, or in combinations of one or more of them. The disclosed and other embodiments can be implemented as one or more computer program products, i.e., one or more modules of computer program instructions encoded on a computer readable medium for execution by, or to control the operation of, data processing apparatus. The computer readable medium can be a machine-readable storage device, a machine-readable storage substrate, a memory device, a composition of matter effecting a machine-readable propagated signal, or a combination of one or more them. The term “data processing apparatus” encompasses all apparatus, devices, and machines for processing data, including by way of example a programmable processor, a computer, or multiple processors or computers. The apparatus can include, in addition to hardware, code that creates an execution environment for the computer program in question, e.g., code that constitutes processor firmware, a protocol stack, a database management system, an operating system, or a combination of one or more of them. A propagated signal is an artificially generated signal, e.g., a machine-generated electrical, optical, or electromagnetic signal, that is generated to encode information for transmission to suitable receiver apparatus.
A computer program (also known as a program, software, software application, script, or code) can be written in any form of programming language, including compiled or interpreted languages, and it can be deployed in any form, including as a standalone program or as a module, component, subroutine, or other unit suitable for use in a computing environment. A computer program does not necessarily correspond to a file in a file system. A program can be stored in a portion of a file that holds other programs or data (e.g., one or more scripts stored in a markup language document), in a single file dedicated to the program in question, or in multiple coordinated files (e.g., files that store one or more modules, sub programs, or portions of code). A computer program can be deployed to be executed on one computer or on multiple computers that are located at one site or distributed across multiple sites and interconnected by a communication network.
The processes and logic flows described in this document can be performed by one or more programmable processors executing one or more computer programs to perform functions by operating on input data and generating output. The processes and logic flows can also be performed by, and apparatus can also be implemented as, special purpose logic circuitry, e.g., an FPGA (field programmable gate array) or an ASIC (application specific integrated circuit).
Processors suitable for the execution of a computer program include, by way of example, both general and special purpose microprocessors, and any one or more processors of any kind of digital computer. Generally, a processor will receive instructions and data from a read only memory or a random access memory or both. The essential elements of a computer are a processor for performing instructions and one or more memory devices for storing instructions and data. Generally, a computer will also include, or be operatively coupled to receive data from or transfer data to, or both, one or more mass storage devices for storing data, e.g., magnetic, magneto optical disks, or optical disks. However, a computer need not have such devices. Computer readable media suitable for storing computer program instructions and data include all forms of non-volatile memory, media and memory devices, including by way of example semiconductor memory devices, e.g., EPROM, EEPROM, and flash memory devices; magnetic disks, e.g., internal hard disks or removable disks; magneto optical disks; and CD ROM and DVD-ROM disks. The processor and the memory can be supplemented by, or incorporated in, special purpose logic circuitry.
While this patent document contains many specifics, these should not be construed as limitations on the scope of an invention that is claimed or of what may be claimed, but rather as descriptions of features specific to particular embodiments. Certain features that are described in this document in the context of separate embodiments can also be implemented in combination in a single embodiment. Conversely, various features that are described in the context of a single embodiment can also be implemented in multiple embodiments separately or in any suitable sub-combination. Moreover, although features may be described above as acting in certain combinations and even initially claimed as such, one or more features from a claimed combination can in some cases be excised from the combination, and the claimed combination may be directed to a sub-combination or a variation of a sub-combination. Similarly, while operations are depicted in the drawings in a particular order, this should not be understood as requiring that such operations be performed in the particular order shown or in sequential order, or that all illustrated operations be performed, to achieve desirable results.
Only a few examples and implementations are disclosed. Variations, modifications, and enhancements to the described examples and implementations and other implementations can be made based on what is disclosed.
This patent document claims priority to and benefits of U.S. Provisional Patent Application No. entitled “LATTICE REDUCTION IN OTFS DECISION FEEDBACK EQUALIZER” and filed on 6 Sep. 2017. The entire content of the aforementioned patent application is incorporated by reference as part of the disclosure of this patent document.
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/US2018/049759 | 9/6/2018 | WO | 00 |
Number | Date | Country | |
---|---|---|---|
62554982 | Sep 2017 | US |