This application claims priority based on French Patent Application No. 02 04616, entitled “Method And Device For Detection Of Data Transmitted By Spectrum Spreading” by Dominique Noguet and Mathieu Bouvier des Noes, which was filed on Apr. 12, 2002, and which was not published in English.”
1. Technical Field and Prior Art
This invention relates to a method for detection of data transmitted by spectrum spreading by direct sequence and a device for using the method.
The field of application of the invention is digital transmissions by Direct Sequence Spread Spectrum (DSSS). DSSS digital transmissions are used in Code Division Multiple Access (CDMA) systems.
In a CDMA system, users share the same frequency band. Discrimination between users is possible due to the assignment of different spreading codes to each user.
A particular user's receiver receives not only the signal intended for this user but also signals addressed to other users. Interference caused by received signals addressed to other users, commonly called Multiple Access Interference (MAI), then needs to be minimized. Moreover, it is also necessary to reduce interference inherent to the transmission channel.
The signal received at the input to a digital receiver is represented by the complex vector e such that:
e=A.d+n (1)
where:
The receiver calculates a vector {circumflex over (d)} which is the estimate of the vector d based on knowledge of the vector e and the matrix A calculated or evaluated in advance. The vector {circumflex over (d)} must also be as close as possible to d. This is done by attempting to cancel out distortions and disturbances introduced by the transmission sequence between the transmitter and the receiver. This operation to cancel out distortions and disturbances is called equalization. If equalization is perfect, {circumflex over (d)}=d.
One essential difficulty lies in the fact that the receiver does not have sufficient information to solve equation (1). The solution of equation (1) is then based on the use of optimization criteria in order to obtain perfect equalization.
Various techniques are known for solving equation (1). One of these techniques is the joint detection technique that consists of estimating the entire vector d, in other words all data emitted by all users. In this case, the estimated vector {circumflex over (d)} is calculated by an equation of the following type:
{circumflex over (d)}=L−1.AH.e (2)
in which:
Different examples of joint detection can be mentioned. A first example relates to detection by which the matrix L is written AA.A and that is intended to cancel out MAI interference (ZF equalization, where ZF stands for “Zero Forcing”). A second example relates to detection by which the matrix L is written AH.A+σ2I, where σ2 is the interference variance and I is the unit matrix (MMSE—Minimum Mean Square Error—equalization). In this case, the distance between the estimated vector {circumflex over (d)} and the vector d is evaluated without distinguishing the effect of interference n from the effect of MAI interference.
In general, to solve equation (2), the receiver begins by calculating the vector z=AH.e starting from the matrix A that it knows and the vector of received data e. The equation (2) may then be expressed in the following form:
z=L.{circumflex over (d)} (3)
One conventional technique for solving the equation (3) consists of inverting the matrix L and multiplying the 2 terms of equation (2) at the right by the matrix L−1, the inverse of the matrix L. However, this technique must be avoided if the matrix L is large, and if the time available to solve the equation (3) is very short. For example, this is the case for TDD (Time Division Duplex) mode for the UMTS (Universal Mobile Telecommunication System) system. Equation (3) then has to be solved in less than 0.666 ms for a matrix L with a size of about 500×500 complex coefficients. The conventional technique by inversion of the matrix L mentioned above is incapable of solving equation (3).
Vector architectures had been proposed due to the complexity of the problems that arise (see “A vector multiprocessor for real-time multi-user detection in spread spectrum communication”, IEEE International Conference on Application Specific Systems, Architectures and Processors, Boston, July 2000). However, these architectures were only designed to solve small linear systems (32×32 matrix). A direct transposition of this type of architecture to detection systems in the field of the invention, for which the size may for example reach 500×500 (see above) is unthinkable because it would also involve extremely complex hardware systems, for example processors.
Another technique that requires fewer calculations, consists of making use of the properties of the L matrix to do Cholesky decomposition given by the formula L=UH.U where U is an upper triangular matrix with size N×N for which the elements denoted uij are determined from elements of the matrix L denoted lij.
The equation (3) is then solved in two steps. The first step is a Cholesky decomposition of the matrix L to obtain the matrix U. The result is then:
Z=L.{circumflex over (d)}=UH.U.{circumflex over (d)} (4)
A second step consists of solving the equation:
Z=UH.Y (5)
in order to obtain the matrix y from the known UH and z matrices, and then the equation:
Z=U.{circumflex over (d)} (6)
which is used to obtain {circumflex over (d)} from the U and y matrices calculated previously.
These operations are programmed on a conventional computer, or possibly a signal processor ((DSP). However, the performances of these computers are not sufficient to achieve real time performances for large matrices mentioned above.
The invention does not have the disadvantages mentioned above.
The invention relates to a device for joint detection of data transmitted by direct sequence spread spectrum (DSSS), the device comprising means of using a Cholesky decomposition to calculate a matrix U such that:
L=UH×U,
Where L is an equalization matrix and U is an upper triangular matrix. The means for using a Cholesky decomposition comprise:
where
li,i is the diagonal coefficient of the row rank i of matrix L and uk,i is the coefficient of the row rank k and column rank i of the matrix U, and
the means for calculating the non-diagonal coefficients including:
According to a first embodiment of the invention, the means of loading coefficients u*k,i on the return inputs from MAC cells consist of a set of N multiplexers (M1, M2, . . . , MN) controlled by a first command state, each multiplexer having a first input, a second input and an output, the output from a rank p multiplexer (p=1, 2, . . . , N) being connected to the return input of the same rank MAC cell, the first input of the rank p multiplexer being related to a coefficient u*k,i different from one multiplexer to the next, the second input of the rank p multiplexer being connected to the return output from the rank p+1 MAC cell, the second input to the rank N multiplexer being connected to the return output of the diagonal cell, the first command state setting up a direct electrical connection between the first input and the output of each multiplexer.
According to a second embodiment of the invention, the return output from the rank p MAC cell (p=1, 2, . . . , N) is connected to the return input of the rank p−1 MAC cell, the means of loading coefficients u*k,i on the return inputs of MAC cells consist of a multiplexer controlled by a first command state, the multiplexer having a first input, a second input and an output, the output from the multiplexer being connected to the return input of the rank N MAC cell, the first multiplexer input being connected in sequence to one of the u*k,i coefficients and the second multiplexer input being connected to the return output from the diagonal cell, the first command state setting up a direct electrical connection between the first multiplexer input and the multiplexer output.
According to another characteristic of the device according to the invention, MAC cells, the diagonal cell and all multiplexers controlled by a second command state form a network to calculate the estimated vector {circumflex over (d)} that corresponds to the vector d of data emitted by users.
The invention also relates to a method for joint detection of data transmitted by direct sequence spread spectrum (DSSS), the method comprising a Cholesky decomposition step to calculate a matrix U such that:
L=UH×U,
where L is an equalization matrix and U is an upper triangular matrix. The Cholesky decomposition step includes:
where li,i is the diagonal coefficient of row rank i of the matrix L and uk,i is the coefficient of the row rank k and column rank i of matrix U, and
the step for calculating the non-diagonal coefficients including
and
According to the first embodiment of the invention, the coefficients u*k,i are loaded in parallel on the return inputs of MAC cells.
According to the second embodiment of the invention, the u*k,i coefficients are loaded in sequence on MAC cell return inputs, starting from the rank N MAC cell.
According to another characteristic of the invention, the method comprises a step to calculate the estimated vector {circumflex over (d)} corresponding to the data vector d of emitted by users. The calculation step for the estimated vector {circumflex over (d)} includes:
to solve the equation z=UH.y, where y=U.{circumflex over (d)}, and
to solve the equation y=U.{circumflex over (d)}.
Advantageously, the joint detection method according to the invention uses Cholesky decomposition operations and operations to solve a triangular system using the same circuits, thus reducing the physical complexity of the device.
Control means are used to control sequential loading of the different circuits (MAC cells, diagonal cell, multiplexers) by appropriate coefficients.
In the above equations, the uii, uij, a, ad, b, accu coefficients are usually complex numbers. However, note that the invention also relates to the case in which these coefficients are real numbers.
Furthermore, note that the matrix U is a band matrix. Therefore, the coefficients ui,i and ui,j mentioned above are only calculated for coefficients uk,i and uk,j that are not zero. In the rest of the description, the coefficients uk,i and uk,j that are not zero will preferably be denoted ukp,i and ukp,j where p=1, 2, . . . , N.
Other characteristics and advantages of the invention will become clear after reading a preferred embodiment with reference to the attached figures among which
The same references denote the same elements in all the figures.
This first type of cell, called a MAC (“Multiplication ACcumulation”) cell is provided with three inputs and two outputs.
The cell comprises two delay operators 1, 2, a multiplier 3 and an adder 4. At time t, a coefficient a is applied on the coefficient loading input, data x(t) is input into the delay operator 1 and an accu(t) data is input into the adder 4. The multiplier 3 multiplies the coefficient a and the data x(t). The adder 4 adds the data accu(t) and the data ax(t) output by the multiplier 3. The data accu(t)+ax(t) that is input into the delay operator 2 at time t is output at time t+1 and the data x(t) that is input into the delay operator 1 at time t is output at time t+1.
This second type of cell called a diagonal cell is also provided with three inputs and two outputs.
The detection device according to the first embodiment of the invention comprises a circuit 5 for use of equation (1), a set of N cells MAC, MAC1, MAC2, . . . MACN, a diagonal cell DIAG and a set of N multiplexers M1, M2, . . . , MN. For the purposes of calculating equation (1), the circuit 5 includes for example a squaring operator, an adder, a subtractor and a square root calculation operator.
The accumulation input of the rank p MAC cell (p=1, 2, . . . , N) is connected to the accumulation output from the rank p−1 MAC cell and the accumulation output from the rank N MAC cell is connected to the accumulation input of the diagonal cell. The return inputs and outputs of the MAC cells are connected through multiplexers. Each multiplexer has two inputs and one output. The return input of the Rank p MAC cell (p=1, 2, . . . , N−1) is then connected to the rank p multiplexer output for which one of the two inputs is connected to the return output from the rank p+1 MAC cell. The return input of the rank N MAC cell is connected to the output from the rank N multiplexer for which one of the two inputs is connected to the return output from the diagonal cell.
The detection device according to the second embodiment of the invention comprises a circuit 5 for use of equation (1), a set of N cells MAC, MAC1, MAC2, . . . MACN, a diagonal cell DIAG and a multiplexer MN, each with two inputs and one output.
The MAC cell accumulation inputs/outputs are connected to each other as shown in
As mentioned above, the Cholesky decomposition consists of calculating the coefficients of ui,j of a matrix U such that:
L=UH×U
The matrix U is an upper triangular matrix. The diagonal coefficient of row i is written:
and the non diagonal coefficients of row i are written:
According to the invention, the coefficients of the matrix U are calculated row by row, from the first row to the last row, the diagonal coefficient of a row being calculated before the non-diagonal coefficients of the row. As a non-limitative example, for a 3×3 matrix U, the coefficients can be calculated as follows in sequence:
The diagonal coefficients ui,i are calculated using the circuit 5 that uses equation (1).
The principle used to calculate non-diagonal coefficients ui,j of the U matrix will now be described with reference to
During the calculation of the diagonal element ui,j, the coefficients u*k,i will be applied to the return inputs of the MAC cells. The return stream is fixed during the calculation of all coefficients ui,j in the same row since these values u*k,i are necessary for the calculation of all these coefficients.
There are two possible embodiments for loading the cefficients u*k,i.
Multiplexers controlled by a first command state enable parallel loading of u*k,i values on the return inputs of MAC cells during the calculation of non-diagonal coefficients of the matrix U.
According to the second loading method, the u*k,i values are loaded one after the other. The multiplexer MN controlled by a first command state is a means of applying u*k,i values on the return input of the rank N MAC cell.
According to this second loading method, the circuit 5 for calculating the diagonal elements may be connected to the multiplexer MN as shown in
is then calculated while loading the return stream registers. Once this loading has been terminated, the stream is fixed to calculate non-diagonal elements.
More generally, the order in which the coefficients u*ki are applied to the different return inputs of the MAC cells is not really important. The products u*kiukj may be calculated in any order before being summated. The only condition that has to be satisfied for loading the coefficients u*ki on the return inputs of the MAC cells is that the same index i is kept until the quantity
has been calculated.
One necessary condition for the calculation of the coefficient ui,j is to apply all coefficients uk,j, where j can be all possible values, to the coefficient input of the MAC cell to which the coefficient u*ki is applied on its return input.
The calculated diagonal and non-diagonal coefficients of the matrix U will be stored as they are determined. This is why the joint detection device according to the invention is provided with storage circuits (not shown in the figures).
One non-limitative example calculation of the coefficient ui,j of the matrix U with a size more than 4 will now be given with the description for
This example corresponds to an embodiment of the invention according to the coefficient loading mode shown in
The detection device comprises three MAC cells (MAC, (MAC1, MAC2, MAC3) and a diagonal cell (DIAG). It is assumed that the coefficients u*14, u*24, u*34 are applied to the corresponding return inputs of the MAC1, MAC2, MAC3 cells.
At time t=1, the coefficient u15 is applied to the coefficient input of the MAC1 cell. The accumulation output of this cell will then change to u*14u15 at time t=2.
At time t=2, the coefficient u25 is applied to the coefficient input of cell MAC2. The accumulation output of this cell will then change to u*24.u25+accu where accu is the value applied to the accumulation input of cell MAC2 which is equal to u*14u15. The accumulation output of the MAC2 cell is therefore equal to u*14u15+u*24u25.
At time t=3, u35 is applied to the coefficient input of cell MAC3. The accumulation output of MAC3 then changes to the value p1=u*34.u35+u*24u25+u*14.u15. Therefore the diagonal cell will have the value p1 set on its accumulation input. By applying u44 and l45 respectively onto the accumulation input and the return input of the diagonal cell, the result output from the diagonal cell changes to
which is the value that is to be calculated.
The accumulation of the terms of p1 moves towards the right of the figure (the direction of the accumulated data). The MAC cells remaining on the left are then no longer used. It is then advantageously possible to reassign these cells to the calculation of the product terms to be used by other elements in the next current row.
As a non-limitative example, the following table describes how the coefficients ukj are assigned to MAC cells to calculate the u45, u46, u47 values in the previous example.
Once the matrix U has been determined, the method according to the invention comprises the following steps in sequence:
to solve the equation z=UH.y, where y=U.{circumflex over (d)}, and
to solve the equation y=u.{circumflex over (d)}.
For the first and second steps in solving the triangular linear system mentioned above, the multiplexers are set such that the return output from the rank p MAC cell (p=1, 2, . . . , N) are directly connected to the return input of the rank p−1 MAC cell, the return output from the diagonal cell being directly connected to the return input of the rank N MAC cell.
The diagonal cell and the MAC cells then form a network like that described, for example, in the document entitled “Introduction to VLSI systems” (Kung, H. T. and Leiserson, C. E., chapter Systolic arrays for VLSI (chap. 8.3), Addison-Wesley, 1980).
In a manner known in itself, this type of network can be used to solve an equation of the type Ax=b where A is a known triangular matrix, b is a known vector and x is an unknown vector to be determined.
Each component of the solution vector x is calculation according to the following recurrence:
When an element xi(k) initialized to 0 circulates in the network, it accumulates the products calculated in each MAC cell. The final value is calculated in the diagonal cell. It is then reinjected into the network to circulate unchanged in the reverse direction in the network, thus enabling the following elements xj(k) (j>i) to accumulate the products aj,ixi. Several elements xi(k) are thus calculated in parallel as in a pipeline.
The MAC cell inputs and outputs between which elements xj(k) calculated by successive accumulations circulate, are the accumulation input and accumulation output respectively of MAC cells. Similarly, the MAC cell input and output between which elements xj(k) reinjected from the diagonal cell circulate, are the return input and return output of MAC cells respectively, and the inputs of MAC cells and of the diagonal cell on which the coefficients ai,k are applied are the coefficient loading inputs.
The coefficients of matrix A are loaded into the network diagonal by diagonal. Consequently, if A is a band matrix with width Lg, the number of MAC cells in the network may be reduced to Lg−1. Thus, the size of the network is not related to the band width of matrix A, rather than the size of the system.
For the first step in solving the triangular linear system, the coefficients of the transposed matrix of the conjugated matrix of U are applied to the coefficient inputs of the MAC cells and of the diagonal cell (see
In order to apply the invention to mobile phones, the method according to the invention may for example be capable of processing data transmitted in the FDD (Frequency Division Duplex) and TDD (Time Division Duplex) communication modes. Data may advantageously be processed in real time.
Number | Date | Country | Kind |
---|---|---|---|
02 04616 | Apr 2002 | FR | national |
Number | Name | Date | Kind |
---|---|---|---|
6307851 | Jung et al. | Oct 2001 | B1 |
7079607 | Brunel | Jul 2006 | B2 |
Number | Date | Country |
---|---|---|
9965160 | Dec 1999 | WO |
Number | Date | Country | |
---|---|---|---|
20040052303 A1 | Mar 2004 | US |