This invention generally relates to methods, apparatus and computer program code for decoding signals, and more particularly to trellis-based decoding using a variant of a BCJR procedure.
Broadly speaking a trellis diagram comprises a sequence of sets of states indexed by time, the states representing states of a data encoder and transitions between the states representing encoded data. A trellis decoder generally (but not necessarily) provides information on the probability of data symbols (transitions) or of individual trellis states since such soft information can be used as a priori information for iteratively improving a decoding process. The BCJR procedure (named after the inventors) provides trellis decoding assuming Markov data (where the evolution of a state depends on the current state but not its history).
In most wireless applications a channel codec is employed to improve the performance of the receiver. Therefore, for most receivers, especially iterative receivers, a soft output detector/decoder/equaliser is required to provide the soft input to the channel decoder. For an encoder with memory, the BCJR algorithm (L. Bahl, J. Cocke, F. Jelinek and J. Raviv, “Optimal decoding of linear codes for minimizing symbol error rate,” IEEE Trans. Information Theory, vol. 20, pp. 284-287, March 1974) is the optimal trellis-based decoder/detector. Here, the encoder with memory can be the channel encoder or the dispersive channel. Since the optimal BCJR algorithm suffers from a high computational cost for a large trellis, which occurs, for example, for broadband systems and MIMO systems, various reduced state BCJR algorithms have been proposed, such as the M-BCJR, T-BCJR and pre-filtered BCJR algorithms. However, these methods suffer from performance degradation due to loss of information when the trellis states are reduced. Also, the performance of these methods is dependent on the memory and energy distribution of the ‘encoder’.
The BCJR is an efficient and optimal algorithm for maximum a posteriori state estimation under the assumption of a hidden Markov chain with a discrete state space. It solves a trellis representation of the system by a forward recursion through each trellis stage (successive trellis stages corresponding to successive time instants) followed by a backward recursion through the trellis stages. This is described in more detail later. However in many applications there is a large number of possible states at each trellis stage.
Since the algorithm has a substantial computational cost for a large trellis reduced complexity variations are used for practical implementations. One such scheme is the M-BCJR algorithm (V. Franz and J. B. Anderson, “Concatenated decoding with a reduced-search BCJR algorithm,” IEEE Journal on Selected Areas in Commun., vol. 16, pp. 186-195, February 1998), which computes the recursions in the BCJR algorithm for only M states (a subset of the full set of trellis states) at every trellis stage. The M states are selected based on a metric of the BCJR algorithm, more particularly at a time t selecting the M states with the largest forward recursion parameter αt. This M-BCJR algorithm is able to perform well when the memory of the system is low (i.e. the channel impulse response is short), but performs poorly when the memory increases; the authors conclude that this approach is “not successful”. In an alternative approach described by Franz and Anderson in the same paper (T-BCJR) only states with the forward recursion parameter αt greater than a threshold are selected, but the authors note that this approach has the disadvantage that the computation can vary widely, which can present difficulties in certain applications, particularly in a hardware implementation with a maximum cycle rate.
The inventors have recognised that the M-BCJR algorithm selects its active states based on the filtered distribution of states up to that time instance and therefore makes a premature choice of the states selection.
According to a first aspect of the present invention there is therefore provided a trellis decoder for decoding symbols of a received signal, said symbol comprising a sequence defined by transitions between states of a trellis, the trellis decoder comprising: means for performing a forward recursion through said trellis to determine a first set of probabilities (α(Xt)) each defining a probability of a said trellis state (Xt) responsive to one or more current or earlier received signal values (y1:t); means for providing a second set of (β(Xt)) probabilities each defining a probability of one or more received signal values (yt+1:T) given an earlier said trellis state (Xt); means to decode said received signal to determine probabilities (p(bt|y1:T)) for symbols of a corresponding transmitted sequence of symbols from said first and second sets of probabilities; and wherein said decoder further comprises: means to select states for determining said first set of probabilities responsive to one or more received signal values later than a said trellis state defining a probability in said first set.
In some preferred embodiments the means for providing the second set of probabilities (β(Xt)) comprises means for performing a backwards recursion through the trellis to determine the probabilities. However in other embodiments this backwards recursion may be omitted and instead, for example, fixed probabilities for β(Xt) employed to implement, say, an improved type of soft output Viterbi. In this latter case the second set of probabilities (β(Xt)) may be provided by, say, a stored number, and all the probabilities β(Xt) may be set to the same value. In both cases the selection of states is more particularly responsive to one or more current or earlier received signal values y(1:t) in addition to the one or more received signal values later than the relevant trellis state.
Embodiments of the trellis decoder provide a reduced state variation of the BCJR procedure with improved performance in which a set of active states is selected at each time instance (trellis stage) in the forward recursion using received signals in the immediate future of the relevant time instant (trellis stage). Preferably the procedure looks over a predetermined time interval (a fixed lag), which may be selected, for example, responsive to the channel memory. In some preferred embodiments a smoothed distribution of states and/or state transitions is assumed at each time instance, which facilitates the use of a computationally simple Gaussian approximation, as described in more detail later. In embodiments this procedure is robust with respect to the memory or energy distribution of the underlying system and provides a performance approaching that of the conventional, optimal procedure but with only a fraction of the complexity.
Thus, broadly speaking, at each discrete time the procedure estimates a fixed lag smoothed distribution of the states (and state transitions) with a lag greater than zero. Then an active state (and state transition) selection is performed using these fixed lag smoothed distributions. Thereafter the metrics of the BCJR algorithm are computed only for the selected, active states. The backward recursion of the algorithm is similar to conventional BCJR but, again, is only carried out over the selected active states. In this way embodiments of the invention use only a fraction of the total states, thus reducing the complexity of the trellis based decoding whilst retaining a near-optimal performance. This is achieved by the use of future information obtained through the fixed lag smoothed distribution of states and state transitions and in this way the information loss through premature state selection is reduced.
Thus preferably the means to select states employs a Gaussian approximation to determine a likelihood of a trellis state from the future received signal values (in effect, treating the transmitted symbols as Gaussian random variables to reduce complexity, although in reality, this is not necessary always true). The selection of states (at a trellis stage) may be made by selecting transitions from previous trellis states (at a previous trellis stage). A transition defines a transmitted symbol and a further selection of state transitions may be made using the probability of a particular transmitted symbol given the previous trellis state and received signal values from a current time forward over the fixed time lag interval. Preferably states are further selected (at a current trellis stage) responsive to a probability of a particular state given the one or more later received signal values, and more particularly given a sequence of received signal values from an initially processed value through a current value to a future value a fixed time lag interval ahead of the current value.
In particular in embodiments states are selected based upon p(Xt|y1:t+L′), that is based on a probability distribution of the states given all the received values from a first time instance to time t+L′. Preferably a further selection of state transitions is made from a set of selected states of time (t−1) by selecting from p(bt|Xt−1,yt:t+L′), that is a probability distribution of the state transitions from a particular selected state Xt−1 given all the future received values at times t to t+L′.
The selection of states may be made dependent upon one or more log probability metrics dependent upon the one or more received signal values later than a time instant defined by a current trellis stage since this can reduce computational complexity.
In some embodiments the trellis decoding is applied iteratively, using a string of transmitted symbol probabilities given the received signal string as a priori information for another iteration of the decoding, with the aim of producing more accurate probabilities. As the skilled person will be aware, generally the output of a trellis decoder comprises soft (probability) information rather than hard information, although in embodiments the decoder may be configured to provide a (most likely) estimated transmitted symbol sequence output.
As previously mentioned, states at a current time are preferably selected responsive to a probability of the state given one or more later signal values, or more particularly given a sequence of signal values from an initial time to a future time ahead of the current time by a fixed time lag interval. This probability may be determined from a probability of a sequence of received signal values from a current time to a future time a fixed time lag interval ahead of the current time, given a previous (t−1) state. In preferred embodiments this latter probability is assumed to have a normal distribution, which simplifies the computational procedure. The multivariate normal distribution may be described by a mean vector and a covariance matrix. Where there is a non-uniform prior distribution of the symbols (for example because the decoding procedure is being applied iteratively or, for example because information is available from some other source such as a channel decoder), the covariance matrix becomes time dependent. The determination of the relevant probability (or equivalently a probability metric) requires a matrix inversion and thus for applications of this type the decoder preferably also includes means for extrapolating a value for the inverted covariance matrix at a particular time interval from its value at a previous time, this again reducing computational complexity.
More particularly a probability distribution p(yt:t+L′|Xt−1) (see Equations 2 and 9a below) may be approximated by a multivariate normal distribution with a covariance matrix Σt, with means for extrapolating a value of an inverse of said matrix from a value at one said trellis state to a value at a later said trellis state. The probability of the transmitted symbols given a particular selected state of earlier time instance and future received signals may evaluated by considering how the distribution factorises across the symbols transmitted, that is approximating p(bt|Xt−1,yt:t+L′) by the product of p(btj|Xt−1,yt:t+L′) for the index j running through all the unknown transmitted symbols. Here j labels, for example, the jth component of a sequence of symbols (such as space-time symbols) transmitted together. The probability distribution p(btj|Xt−1,yt:t+L′) may be approximated by a multivariate normal distribution with a covariance matrix Σt,j, with means for extrapolating a value of an inverse of the matrix from the value the inverse of the matrix Σt.
In some preferred embodiments of the invention a received signal value comprises a vector of signals received over a MIMO (Multiple Input Multiple Output) channel, or in general a vector of signals in a receiver having multiple observations pertaining to a single time instance. This vector may define, for example, a space-time symbol in a spatially multiplex MIMO communications system such as a system in which multiple transmit antennas/data streams are transmitted by a single user/transmitter. In other embodiments the decoder may be employed for multi-user detection, for example where multiple users each with a single transmit antenna transmit at the same time (in this case the symbols transmitted over the different channels, for example from users with different spreading sequences, are then uncorrelated).
In still other embodiments the decoder may be employed for decoding transmitted signals with other diversities such as CDMA, in which signals from multiple users, optionally employing multiple antennas, differentiated by their unique spreading sequence codes, are received at a receiver. This is because channel effects can reduce orthogonality of the received signals so that additional processing may be necessary to differentiate the signals from the different users. Embodiments of the decoder may also be employed in OFDM (Orthogonal Frequency Division Multiplexed) MIMO or other communication systems.
Embodiments of a trellis decoder as described above may be employed for trellis-based decoding of a space-time code and/or a channel code, and/or equalisation. In some preferred embodiments joint equalisation and decoding of a channel coded and/or space-time coded system is performed. In general embodiments of the above described decoder provide reduced complexity decoding for any “encoder” which can be represented by a trellis diagram. Particular applications of embodiments of the decoder include a turbo decoder for example of the type required by UMTS (Universal Mobile Telecommunications System) or some wireless LAN (Local area Network) standards, and equalises, for example of the type required by the EDGE (Enhanced Data for Global Evolution) standard.
In a related aspect the invention provides a method of decoding symbols of a received signal, said symbols comprising a sequence defined by transitions between states of a trellis, each said trellis state being associated with a time, the method comprising: performing a forward recursion through said trellis to determine a first set of probabilities each defining a probability of a said trellis state responsive to prior values of said received signal; obtaining a second set of probabilities each defining a probability of one or more received signal values given a prior said trellis state; and decoding said received signal to determine probabilities for symbols of a corresponding transmitted sequence of symbols from said first and second sets of probabilities; the method further comprising: selecting states for determining said first set of probabilities responsive to one or more received signal values later than a said trellis state defining a probability in said first set.
In some preferred embodiments obtaining the second set of probabilities comprises performing a backwards recursion through the trellis to determine the probabilities.
Thus broadly speaking in embodiments, as described above in the first set of probabilities each probability is associated with a trellis state and the states for processing are based upon received signal values from future times (each trellis state being associated with a respective time), that is times later than that of the relevant state.
In a further related aspect the invention provides a method of decoding a trellis coded signal using a BCJR-type procedure in which probabilities of transmitted symbols are determined by forwards and backwards iteration through a trellis having successive sets of states associated with successive received signal time intervals, a set of states defining a trellis stage, the method comprising reducing the number of states processed at a said trellis stage (Xt) by selecting states for evaluation for determining said probabilities based upon future received signal values (yt:t+L′) at said stage.
Preferably, as previously described, the selecting of states comprises selecting transitions from a previously selected set of states in a previous trellis stage, preferably followed by further selection from among the states selected by selecting transitions. This further selection is preferably responsive to a probability of a state (in a current trellis stage) conditioned on the (one or more) future received signal values (and preferably also on a current received signal value). Preferably a normal distribution is assumed for the future received sequence. A determined probability in embodiments comprises a log probability; in some preferred embodiments the selecting of transitions and/or states comprises evaluating one or more log probability metrics.
The invention further provides a decoder configured to implement the above described methods, and in particular including means for implementing the features of the methods.
Thus the invention also provides a decoder for decoding a trellis coded signal using a BCJR type procedure in which probabilities of transmitted symbols are determined by forwards and backwards iteration through a trellis having successive sets of states associated with successive received signal time intervals, a set of states defining a trellis stage, the decoder including a system to reduce the number of states processed at a said trellis stage (Xt) by selecting states for evaluation for determining said probabilities based upon future received signal values (yt:t+L′) at said stage, more particularly as well as the received values so far (y1:t−1).
The invention further provides a trellis decoder for decoding a trellis coded signal using a BCJR type procedure in which probabilities of transmitted symbols are determined by forwards and backwards iteration through a trellis having successive sets of states associated with successive received signal time intervals, a set of states defining a trellis stage, the decoder comprising: a received signal input for said trellis coded signal; an output for providing decoded signal data; program memory for storing processor control code; and a processor coupled to said received signal input, to said output and to said program memory for loading and implementing said code, said code comprising code to: reduce the number of states processed at a said trellis stage (Xt) by selecting states for evaluation for determining said probabilities based upon future received signal values (yt:t+L′) at said stage, more particularly as well as the received values so far (y1:t−1)
The invention further provides a trellis decoder for decoding symbols of a received signal, said symbols comprising a sequence defined by transitions between states of a trellis each said trellis state being associated with a time, the decoder comprising: a received signal input for said trellis coded signal; an output for providing decoded signal data; program memory for storing processor control code; and a processor coupled to said received signal input, to said output and to said program memory for loading and implementing said code, said code comprising code to: perform a forward recursion through said trellis to determine a first set of probabilities each defining a probability of a said trellis state responsive to prior values of said received signal; obtain a second set of probabilities each defining a probability of one or more received signal values given a prior said trellis state; and decode said received signal to determine probabilities for symbols of a corresponding transmitted sequence of symbols from said first and second sets of probabilities; the code further comprising code to: select states for determining said first set of probabilities responsive to one or more received signal values later than a said trellis state defining a probability in said first set.
The code to obtain the second set of probabilities may comprise code to perform a backwards recursion through said trellis to determine the probabilities, or may simply comprise code to assign a value, for example a fixed value, to each of said probabilities in said second set.
The invention further provides a receiver incorporating a decoder as described above.
The invention further provides processor control code to implement the above-described decoders and methods; and a trellis data structure storing BCJR trellis data for trellis states comprising substantially only those states which are selected by a decoder/method according to an embodiment of the invention. This code (and data) may be provided on a data carrier (sometimes referred to as a computer program product) such as a disk, CD- or DVD-ROM, programmed memory such as read-only memory (Firmware), or on a data carrier such as optical or electrical signal carrier.
For many applications embodiments of the above-described decoders, and decoders configured to function according to the above-described methods, will be implemented on a DSP (Digital Signal Processor), ASIC (Application Specific Integrated Circuit) or FPGA (Field Programmable Gate Array). Thus code (and data) to implement embodiments of the invention may comprise code in a conventional programming language such as C or, for example, code such as microcode. However code to implement embodiments of the invention may alternatively comprise code for setting up or controlling an ASIC or FPGA, or code for a hardware description language such as Verilog (Trade Mark), VHDL (Very high speed integrated circuit Hardware Description Language) or SystemC. As the skilled person will appreciate such code and/or data may be distributed between a plurality of coupled components.
As the skilled person will recognise, embodiments of the above described decoders and methods may be implemented in software or in hardware, in a combination of the two.
These and other aspects of the present invention will now be further described, by way of example only, with reference to the accompanying figures in which:
a to 2c show, respectively, a portion of equalizer trellis according to an embodiment of the present invention, and first and second parts of a flow diagram of a procedure according to an embodiment of the present invention;
We first describe the BCJR procedure; for additional background information reference may be made to Bahl, Cocke, Jelinek and Raviv, and to Franz and Anderson (ibid, hereby incorporated by reference).
Let us consider a state sequence, X0:T that forms a Markov chain and that produces an output sequence y1:T, with a joint distribution that factorizes generally as
with p(X0) being the initial state probability density. In other words at each time instant t we also have the conditional independence p(yt|X1:t)=p(yt|Xt−1,Xt). Situations with yt being independent of Xt−1 as well are thus a special case of this formulation. Here, the notation At:t+B refers to the sequence of states or symbols from time t to t+B, i.e At:t+B={At, At+1, . . . , At+B}. Let each state Xt come from the set SX with cardinality |SX|. When the sequence y1:T is observed, a linear time algorithm for the evaluation of the a posteriori probabilities p(Xt|y1:T) and p(Xt−1,Xt|y1:T) for all Xt−1,XtεSX and tε{1,. . . ,T} is the BCJR algorithm [1]. This algorithm involves a forward recursion through the trellis formed by the possibilities of the state sequence X0:T, in which the computation γ(Xt−1,Xt)=p(Xt,yt|Xt−1) and the recursive calculations of the metric
are made, and a backward recursion in which the recursive calculations of the metric
are made. Thereafter the A Posteriori probabilities of the states and state transitions can be evaluated by p(Xt|yt:T)∝α(Xt)β(Xt) and p(Xt−1,Xt|y1:T)∝α(Xt−1) γ(Xt−1,Xt)β(Xt) for all Xt−1,XtεSX and tε{1, . . . ,T}. Although the complexity of this algorithm is linear in time T, it is also linear on |SX| which can be millions in some practical problems such as the equalization of MIMO systems. Thus there are many practical applications in which the algorithm cannot be implemented cost effectively.
The inventors have recognised that a reduced state version of this procedure can still perform well, and this is now described.
The procedure selects M active states at each time instance in the forward recursion, but unlike the M-BCJR algorithm mentioned above, which selects the M active states which have the largest metric α(Xt), we use a fixed lag L′ smoothed distribution of states p(Xt|y1:t+L′) at a particular time instance t for selecting states at that time instance, with L′>0. Thereby we consider the future received (signal) values in making the active state selection for a particular time instance. The lag L′ can be chosen to account for the memory of the system (for example responsive to the average duration of the channel impulse response) and can be a compromise between performance and complexity.
To reduce the complexity of estimating this fixed lag smoothed density we use a Gaussian approximation technique. This technique has previously been described in Shoumin Liu and Zhi Tian, “Near-optimum soft decision equalization for frequency selective MIMO channels,” IEEE Trans. Signal Processing, vol. 52, pp. 721-733, March 2004, in particular at Section III-A of this paper, the contents of which are hereby incorporated by reference in their entirety.
We next present the implementation of these ideas for the equalization of a spatially multiplexed system. This will be described with specific reference to a MIMO system having a plurality m of transmit antennas and a plurality n of receive antennas, however the skilled person will recognise that the techniques described are general and can be applied to other decoding problems where the data for decoding can be represented by a trellis, no matter what the transmitted and received signals.
To facilitate understanding of the later discussion it is helpful to illustrate a typical example of a MIMO communications system.
Thus
The encoded transmitted signals propagate through MIMO channel 112 to receive antennas 114, which provide a plurality of inputs to a space-time (and/or frequency) decoder 116, which has the task of removing the effect of the encoder 108 and the MIMO channel 112. The output of the decoder 116 comprises a plurality of signal streams, one for each transmit antenna, each carrying so-called soft or likelihood data on the probability of a transmitted symbol having a particular value. This is (here) provided to a channel de-interleaver 118 which reverses the effect of channel interleaver 106, and then to a channel decoder 120, conventionally for example a Viterbi decoder, which decodes the convolutional code. The channel decoder 120 may be a SIHO (soft-in hard-out) decoder that is receiving symbol (or bit) likelihood data and providing data on which a hard decision has been made. The output of channel decoder 120 is provided to a data sink 122, for further processing of the data in any desired manner. In some systems turbo decoding may be employed, where a soft output from a SISO (soft-in soft-out) channel decoder 120 is provided to a channel interleaver 124 (corresponding to interleaver 106), which in turn provides soft (likelihood) data to decoder 116 for iterative space-time (and/or frequency) and channel decoding.
Embodiments of a decoder as described herein may be employed to implement one or both (optionally combined or jointly) of ST decoder 116 and channel 120, or may be employed for other decoding tasks.
We now describe in detail an embodiment of a procedure according to the present invention.
Let us consider an m transmit antenna n receive antenna spatial multiplexing system, where the channel memory of each transmit antenna to receive antenna path is L. Let the fading coefficient of the dth channel tap from jth transmit antenna to the ith receive antenna be denoted by hi,j(d) for dε[0,L], jε[1,m] and iε[1,n].
Let us also denote the signal received at the ith receive antenna at the tth time instance by yti, and the antenna symbol transmitted by the jth transmit antenna at the tth time instance by btj. Let us assume that the symbol bij is chosen from the set B={a1,a2, . . . ,aN} with cardinality N. The received signal yti is determined by the convolution of the channel impulse response and a sequence of symbols transmitted up to L+1 time instants and an additive white Gaussian noise wti:
The equalizer's task in this scenario is to consider the received signal yti and detect the transmitted symbols btj.
Here btj is a real or complex number, as is yti, and h(d) is an (n×m) matrix with the i th row and j th column component given by hi,j(d).
For a MIMO channel an estimate of the matrix h can be obtained in a conventional manner, for example, using a training sequence transmitted from each transmit antenna in turn (to avoid interference problems), each time listening on all the receive antennas to characterise the channels from that transmit antenna to the receive antennas. This need not constitute a significant overhead since training may only be performed at intervals, say every 0.1 seconds for a slowly changing indoor channel, and data rates may be high in between training. Alternatively orthogonal sequences may be transmitted simultaneously from all the transmit antennas (although this increases the complexity of the training as interference problems can then arise).
Now, denoting vector transpose operation by (•)⇑, a state of the decoder (equalizer) at time t can be defined as Xt=(btl, . . . ,btm, . . . ,bt−L+1l, . . . ,bt−L+1m)⇑ and the received vector at time t can be taken as yt=(ytl, . . . ,ytn)⇑. The symbols transmitted by all antennas at time t are collectively termed (in this example) a “space-time” symbol given by vector bt=(bt1,bt2, . . . ,btm)⇑; in other embodiments the vector bt can define multiuser, CDMA, OFDM or other transmitted signals.
Since X0:T forms a Markov chain and
the BCJR algorithm can be invoked to evaluate the a posteriori probabilities of the state transitions, which can be mapped to the a posteriori probabilities of the set of antenna symbols transmitted at each time instance. The algorithm we describe can, however, be implemented at a much-reduced practical complexity as explained below.
To account for any prior information about the transmitted symbols, let the prior probabilities of the N different possibilities of each antenna symbol be given by the vector (pt,j1,pt,j2, . . . ,pt,jN) for tε[1,T] and jε[1,m].
The algorithm we describe has a forward recursion and a backward recursion through the trellis similar to the BCJR algorithm. At each time instance in the forward recursion, the algorithm performs two functions: active state selection and the computation and storage of α(Xt) for the active states of time t and γ(Xt−1,Xt) for state transitions from the active states at times t−1 to t. Let the set of active states selected at time t be represented by Ωt={Xt(1),Xt(2), . . . ,Xt(M)} and the states that are contending to be active states be represented by Ξt, as shown in
We next describe active state selection at time t:
The general idea is to estimate the fix lag smoothed density of the states for the states in Ξt. The following derivations present one simple method of estimating these densities at a low complexity.
We can first make the observation that there are up to MNm states in Ξt corresponding to the state transitions emanating from the M active states of time t−1 (since there are Nm branches emanating from each active state). One method of estimating the fix lag smoothed density of the states in Ξt is by,
Here Z is the normalization constant of the estimated density of p(Xt|y1:t+L′). We can also make the further approximation of assuming that the antenna symbols of time t are independent after the observation of yt:t+L′ and given a particular active state of time t−1:
We will estimate the probabilities p(btj|Xt−1,yt:t+L′) and p(yt:t+L′|Xt−1) using a Gaussian approximation.
We first consider p(yt:t+L′|Xt−1).
Observing that:
yt:t+L′=Ab+w (4)
where the noise vector and the transmitted symbol vector are given by w=((wt1,wt2, . . . ,wt+L′1, . . . ,wt−L′n)⇑, b=(bt−L1,bt−L2, . . . ,bt+L′1, . . . ,bt+L′m)⇑ respectively, and the form of A can be identified from (1). Thus, A is a n(L′+1)×m(L′+L+1) Toeplitz matrix given by:
where the (n×m) matrix h(d) has ith row and jth column component given by hi,j(d).
Now in the matrix multiplication Ab of (4), there is a column vector of length n(L′+1) associated with each antenna symbol bsl for sε[t−L, . . . ,t+L′] and lε[1, . . . ,m]. Let us denote this column vector of A by hsl (ie. a symbol bsl has a column vector hsl associated with it containing information relating to the channel). Now, (4) can be written as:
A particular state Xt−1 renders the bsl known for sε[t−L, t−1] and lε[1,m]. Due to the uncertainty of other antenna symbols, the true distribution of yt:t+L′ given Xt−1 is a mixture of normal distributions. We will approximate this true distribution with a single Gaussian distribution that has the same moments of the true distribution. That is we can say that p(yt:t+L′|Xt−i) has a multivariate normal distribution with a mean vector {tilde over (μ)}t and a covariance matrix {tilde over (Σ)}t; which can be evaluated by moment matching as:
where Σ is the covariance matrix of w and (•)H denotes conjugate transpose operation ({tilde over (μ)}t is a vector of length n(L′+1) and {tilde over (Σ)}t is a n(L′+1)×n(L′+1) matrix).
The a priori mean and variance of the unknown antenna symbols can be evaluated by:
Here ps,ik is probability of the symbol transmitted by the lth antenna at time index s,bsl for sε[t,t+L′] and lε[1,m] being the symbol αk for kε[1,N]. Equal symbol probabilities may be assumed unless other information is available, for example from a previous iteration of the decoding procedure and/or from a channel decoder.
Therefore for the particular received sequence yt:t+L′, the probability p(yt:t+L′|Xt−1) can be estimated as follows:
We next consider p(btj|Xt−1,yt:t+L′).
Thus, similarly for the estimation of p(btj|Xt−1,yt:t+L′), we will try to fit a single normal distribution to the distribution of btj given Xt−1 and yt:t+L′, such that it is moment matched to the true discrete distribution. Thereby rewriting Equation (4) as
and again noting that particular state Xt−1 renders the bsl known for sε[t−L,t−1] and lε[1,m], we can assume that p(htjbtj|Xt−1,yt:t+L′) is having a normal distribution with the corresponding mean {circumflex over (μ)}t,j and covariance {circumflex over (Σ)}t,j given by:
Thus for known htj and for any particular btj, p(btj|Xt−1,yt:t+L′) can be estimated:
Now, for a given state Xt−1, the state transition to time instance t is determined by the transmitted space-time symbol bt. Here as a further method of reducing the system complexity the algorithm selects S transitions from each of the M states at time t−1 using the estimation of the density p(bt|Xt−1,yt:t+L′) given by (3) (and (9b)), selecting the most likely transitions ie. those with the largest probabilities. Therefore Ξt has up to MS states. We evaluate p(Xt|y1:t+L′) for these states (equation (2)) and select M distinct states at time t that have the largest fixed lag smoothed probability and include them in Ωt.
We now describe the computation of α(Xt) and γ(Xt−1, Xt). For the selected states in Ωt, α(Xt) is computed by
where γ(Xt−1,Xt) is computed as
γ(Xt−1,Xt)=p(yt|Xt−1,Xt)p(Xt|Xt−1) (11a)
This in our case can be computed as:
Where p(bt) is the prior probability of the space-time symbol bt which is transmitted from the state transition relevant to (Xt−1,Xt). In the absence of better information the space-time symbols may be considered equally likely.
In the backward recursion of the algorithm, β(Xt) is computed for each state in Ωt using the relation:
Taking ζ(bt) to denote the set of state transitions that results in the transmission of the particular space-time symbol bt, the A Posteriori probability of the space-time symbols at time instant t are computed by
Again, Z′ is the normalization constant of the estimated density of p(bt|yl:T). The a posteriori probability of each antenna symbol can be computed from these computed posterior probabilities of the space-time symbols as follows. Let us say the a posteriori probability density of the symbols transmitted by the antenna j at time instant t is given by the vector (qt,j1,qt,j2, . . . ,qt,jN), where qt,jk is the posterior probability of the particular antenna symbol being αk, for kε[1,N]. Then qt,jk for every tε{1, . . . ,T}, jε{1, . . . ,m} and kε{1, . . . ,N} can be calculated by summing space-time symbol probabilities over all the space-time symbols that result in the transmission of the antenna symbol αk from antenna j.
Where bitwise probability values are desired the probability of a particular bit having a particular binary value may be obtained by summing symbol probabilities over all the symbols in which that bit has that particular binary value.
We next describe pseudo code for implementing an embodiment of the invention for the equalization of a spatially multiplexed system. Reference may also be made to
We next describe how the computational complexity of the procedure may be reduced by using the Jacobian logarithm to represent the probabilities in logarithmic form.
The evaluation of probabilities α(Xt), β(Xt), γ(Xt−1,Xt) and p(bt|y1:T) can be represented in the logarithmic form to invoke the Jacobian logarithmic approximation,
in order to reduce the computational complexity. Here, the Jacobian logarithmic function could employ the look up table LUT(•) to approximate ln(1+exp(−|a−b|)).
Therefore the forward, backward and transition probabilities are modified from Equation (10)-(13) to:
Here Z″ is the normalization constant such that p(bt|y1:T) is a proper density. The selection of active states and state transitions according to Steps 5 and 6 in the above pseudo-code invoke Equations (9a) and (9b) that employs exponential function, which is costly for implementation. This can be avoided as explained below.
The selection of active states Ξt involves selecting S active state transition with largest p(bt|Xt−1,yt:t+L′) Here, referring to Equation (3) and (9b), the probability p(bt|Xt−1,yt:t+L′) can be represented as:
Therefore, the active state transition selection can be invoked by choosing the state transition with S lowest metrics
Here, the computation of the metrics Φ is less costly than the full evaluation of p(bt|Xt−1,yt:t+L′) in Equation (16).
Similarly the active state selection is based on
where referring to Equation (9a), the logarithmic term corresponding to p(yt:t+L′|Xt−1) is given by Ψ(yt:t+L′,Xt−1)=(yt:t+L′−{tilde over (μ)}t)H({tilde over (Σ)}t)−1(yt:t+L′−{tilde over (μ)}t).
Now taking the log probabilities,
where Z″ is the normalization constant such that p(Xt|y1:t+L′) is a proper density. Here, the computational simplifications due to the Jacobian logarithm approximation can be used to evaluate (18). An alternative further reduced complexity but less accurate method of active state selection is to select M distinct states that have the largest metric Θ(Xt;Xt−1,bt,yt+l:L′).
We next describe how the computational complexity of the procedure may be additionally or alternatively reduced by using the matrix inversion lemma.
The computation of the inversions of the matrices {circumflex over (Σ)}t,j and {tilde over (Σ)}t for each transmit antenna and for each time instant can be a computationally intensive task. One method of avoiding computing these inversions at each time instance in the case of implementations with uniform prior distributions for the space-time symbols and when the total state space is very large (eg in MIMO systems) is to increase the number of active states (M) instead of going into an iterative implementation. Inversions of the matrices {circumflex over (Σ)}t,j and {tilde over (Σ)}t become time instance dependent only in the presence of a non-uniform prior distribution on the symbols as, for example could result from Step 14 of the above pseudo-code procedure.
When these matrices inversions are necessary for each time instance, their computational complexity can be reduced using the following technique.
We can write
Now, without directly calculating the inverse ({tilde over (Σ)}t+1)−1, we can sequentially calculate the effect on the inverse of {tilde over (Σ)}t by the addition of the components in (19). For example let,
Of course it is {tilde over (Σ)}t,2m={tilde over (Σ)}t+1 whose inverse we are interested in. Now say we have the inverse of {tilde over (Σ)}t,k for some kε[0,2m−1]. Then ({tilde over (Σ)}t,k+1)−1=({tilde over (Σ)}t,k+uk+1(uk+1)Hλk+1)−1 can be computed as
From which method we can end up with ({tilde over (Σ)}t+1)−1.
The recursive calculations of {circumflex over (Σ)}t,j follow similarly. In fact, the inverse of {circumflex over (Σ)}t,j can be computed more efficiently using the inverse of {tilde over (Σ)}t, by observing that
{circumflex over (Σ)}t,j={tilde over (Σ)}t−htj(htj)HVar(btj) (21)
From which we can get:
The BCJR algorithm includes forward and backward recursion through the trellis to determine the probabilities for the estimated symbols. To further reduce the computational complexity, the backward recursion of the algorithm can be ignored and thus the contribution of the backward recursion metrics β(Xt) in the computation of the a posteriori probability of the space-time symbols in Equation (13) is not included. Therefore this reduced complexity version is similar to the soft output Viterbi algorithm which finds the best survivor path in the trellis.
Receiver 300 comprises one or more receive antennas 302a, b (of which two are shown in the illustrated embodiment) each coupled to a respective rf front end 304a, b, and thence to a respective analogue-to-digital converter 306a,b and to a digital signal processor (DSP) 308. DSP 308 will typically include one or more processors 308a (for example, for a parallel implementation) and some working memory 308b. The DSP 308 has a data output 310 and an address, data and control bus 312 to couple the DSP to permanent program memory 314 such as flash RAM or ROM. Permanent program memory 314 stores code and optionally data structures or data structure definitions for DSP 308.
In the illustrated example program memory 314 includes trellis-based BCJR-type (space time) decoder code 314a comprising state selection code, forward recursion parameter computation code, backward recursion parameter computation code, and transmit symbol probability estimation code to, when running on DSP 308, implement corresponding functions as described above. Program memory 314 also includes MIMO channel estimation code 314b to provide a MIMO channel estimate H, and optionally (depending upon the type of encoding applied) de-interleaver code 314c, interleaver code 314d, and channel decoder code 314e. Examples of suitable code 314e are well known to those skilled in the art. In other embodiments, as illustrated with reference to
Optionally the code in permanent program memory 314 may be provided on a carrier such as an optical or electrical signal carrier or, as illustrated in
The data output 310 from DSP 308 is provided to further data processing elements of receiver 300 (not shown in
The receiver front-end will generally be implemented in hardware whilst the receiver processing will usually be implemented at least partially in software, although one or more ASICs and/or FPGAs may also be employed. The skilled person will recognise that all the functions of the receiver could be performed in hardware and that the exact point at which the signal is digitised in a software radio will generally depend upon a cost/complexity/power consumption trade-off.
a and 8b show graphs of (uncoded) bit error rate (BER,
The graphs compare a full BCJR procedure (curve 800), M-BCJR according to the prior art (Franz and Anderson) with four (curve 802) and sixteen (curve 804) active states per trellis stage, and an embodiment of the algorithm as described above with M=4 and L′=3L (curve 806); S was set to the maximum value of 4. It can be seen that the procedure we describe remarkably achieves near optimal performance (without consideration of the channel decoder) by considering only 4 active states, whereas the optimal equalizer has a total of 256 states and the prior art M-BCJR with 16 active states performs less well.
a to 9d show graphs of uncoded and coded bit error rate (BER,
The optimal BCJR equalizers for the respective 3, 5 and 10-tap channels have a total of 64, 4096 and over 134 million states. As optimal equalization is computationally impossible we compared the performance of an embodiment of the above described procedure with a conventional soft output MMSE (minimum mean square error) equalization technique (M. S. Yee, M. Sandell, and Y. Sun, “Comparison study of single-carrier and multi-carrier modulation using iterative based receiver for MIMO system,” in Proc. IEEE VTC Spring 04, Milan, Italy, May 2004). Thus the graphs compare the performance of conventional soft output MMSE equalization (curves 900) with the performance of an embodiment of the algorithm as described above with M=8 (for the 3-tap channel; curves 902), M=16 (for the 5-tap channel; curves 904), and M=64 (for the 10-tap channel; curves 906); S was set to the maximum value of 8. It can be seen that embodiments of the above procedure achieve better performance than the linear MMSE scheme by considering only 8, 16 and 64 active states in the respective cases.
Thus it can be seen that for these examples, of equalization in a communication system, the systems we describe are able show relative invariance of performance when the energy distribution of the channel changes. Moreover the number of active states required for good performance shows little increase with the increase in channel memory, even though the total number of states of the trellis increases exponentially.
We have illustrated the equalization of spatial multiplexed systems, and the single Gaussian approximations used for the low complexity estimation of the fixed lag smoothed distributions of the states and state transitions can be seen to be working well for the BPSK, (QPSK) and 8PSK modulations with up to two transmit antennas. Similar techniques may also be employed in the frequency domain, for example in the context of MIMO decoding by serial-to-parallel converting and then performing a fast Fourier transform prior to MIMO decoding, applying an inverse Fourier transform and parallel-to-serial conversion after MIMO decoding.
Applications of embodiments of the invention include MIMO and non-MIMO receivers, for any type of signal which is susceptible to trellis-based decoding and include applications in base stations, access points, mobile terminals, wireless networks including PANs (personal area networks) and LANs (local area networks) and applications in third and fourth generation mobile phone networks.
Broadly speaking embodiments of the invention facilitate cheaper receivers and/or improved performance, for example increased data rates without correspondingly increased complexity and cost. Embodiments of the invention may also potentially find application in other non-radio communication systems, for example a disk drive with multiple read heads and/or multiple data recording layers (which act like one or more transmitters).
No doubt many other effective alternatives will occur to the skilled person. It will be understood that the invention is not limited to the described embodiments and encompasses modifications apparent to those skilled in the art lying within the scope of the claims appended hereto.
Number | Date | Country | Kind |
---|---|---|---|
0500351.2 | Jan 2005 | GB | national |
Number | Name | Date | Kind |
---|---|---|---|
6597743 | Khayrallah et al. | Jul 2003 | B1 |
6901548 | Hattori et al. | May 2005 | B2 |
6947506 | Mills | Sep 2005 | B2 |
7010051 | Murayama et al. | Mar 2006 | B2 |
7190743 | Learned | Mar 2007 | B2 |
7236536 | Hochwald et al. | Jun 2007 | B2 |
7245666 | Gardner et al. | Jul 2007 | B1 |
7248651 | Love et al. | Jul 2007 | B2 |
7248849 | Ariyavisitakul et al. | Jul 2007 | B1 |
7308047 | Sadowsky | Dec 2007 | B2 |
7321644 | Love et al. | Jan 2008 | B2 |
7321646 | Arslan et al. | Jan 2008 | B2 |
Number | Date | Country |
---|---|---|
WO 03075468 | Sep 2003 | WO |
Number | Date | Country | |
---|---|---|---|
20060156207 A1 | Jul 2006 | US |