1. Field of the Invention
The present invention relates generally to multiple access digital communication systems. More specifically, the invention relates to a multiuser detector system and method for the simultaneous reception of data from multiple users having different spreading factors.
2. Description of the Related Art
A multiple-access communication system allows a plurality of users to access the same communication medium to transmit or receive information. The media may comprise, for example, a network cable in a local area network or lan, a copper wire in the classic telephone system, or an air interface for wireless communication.
A prior art multiple access communication system is shown in
An example CDMA prior art communication system is shown in
As shown, a single subchannel of a given bandwidth is mixed with a unique spreading code which repeats a predetermined pattern generated by a wide bandwidth, pseudo-noise (pn) sequence generator. These unique user spreading codes are typically pseudo-orthogonal to one another such that the cross-correlation between the spreading codes is close to zero. A data signal is modulated with the pn sequence producing a digital spread spectrum signal. A carrier signal is then modulated with the digital spread spectrum signal and transmitted in dependence upon the transmission medium. A receiver demodulates the transmission extracting the digital spread spectrum signal. The transmitted data is reproduced after correlation with the matching pn sequence. When the spreading codes are orthogonal to one another, the received signal can be correlated with a particular user signal related to the particular spreading code such that only the desired user signal related to the particular spreading code is enhanced while the other signals for all other users are not enhanced.
Each value of the spreading code is known as a chip and has a chip rate that is the same or faster than the data rate. The ratio between the chip rate and the subchannel data rate is the spreading factor.
To extend the possible range of values of the data signal, a symbol is used to represent more than two binary values. Ternary and quaternary symbols take on three and four values respectively. The concept of a symbol allows for a greater degree of information since the bit content of each symbol dictates a unique pulse shape. Depending upon the number of symbols used, an equal number of unique pulse or wave shapes exist. The information at the source is converted into symbols which are modulated and transmitted through the subchannel for demodulation at the destination.
The spreading codes in a CDMA system are chosen to minimize interference between a desired subchannel and all other subchannels. Therefore, the standard approach to demodulating the desired subchannel has been to treat all other subchannels as interference, similar to interference that manifests itself in the communication medium. Receivers designed for this process are single-user, matched filter and RAKE receivers.
Since different subchannels do interfere with each other somewhat, another approach is to demodulate all subchannels at a receiver. The receiver can listen to all of the users transmitting at once by running a decoding algorithm for each of them in parallel. This ideology is known as multiuser detection. Multiuser detection can provide a significant performance improvement over single-user receivers.
Referring to
Optimal multiuser detectors are computationally intensive devices performing numerous complex mathematic operations and are therefore difficult to implement economically. To minimize expense, suboptimal multiuser detectors such as linear detectors have been developed requiring less computational complexity as a compromise approximating the performance of optimal detectors. Linear detectors include decorrelators, minimum mean square error or MMSE detectors, and zero-forcing block linear equalizers or ZF-BLEs.
A system block diagram of a prior art linear multiuser detector for synchronous or asynchronous CDMA communication is shown in
To effect parallel detection of K subchannel users in a physical system, linear multiuser detector methods are executed as fixed gate arrays, microprocessors, digital signal processors or DSPs and the like. Fixed logic systems allow for greater system speed while microprocessor driven systems offer programming flexibility. Either implementation that is responsible for the multiuser detection performs a sequence of mathematic operations. To describe the functions, the following variables typically define the structure and operation of a linear multiuser detector:
The signal received at the receiver includes all user signals r(k) plus noise. Therefore, we can define the received data vector r as follows:
The vector n in Equation 2 represents noise introduced by the communication channel.
A system transmission response matrix for user k is typically denoted as A(k). The ith-row, nth-column element is denoted as Ai,n(k) and is defined as:
Each column of the matrix A(k) corresponds to a matched filter response for a particular symbol sent by user k during the period of interest. Referring back to
where each vector bn(k) has a dimension of
Q(k)+W−1, Equation 5
and is offset from the top of the matrix An(k) by
Q(k)(n−1). Equation 6
Since the spreading codes are not periodic over symbol times; bi(k)≠bj(k) for I≠j. The elements of a vector which may be non-zero values are referred to as the support of the vector. Therefore, bn(k) is the support of An(k).
Once a system transmission matrix for each user is created, a total system transmission response matrix, denoted as A is created by concatenating the system transmission matrices for all the users as shown below:
A=[A(1), . . . , A(k), . . . , A(K)]. Equation 7
In accordance with prior art modulation techniques, the elements of h(k) can be complex numbers. It then follows that the non-zero elements of A can be complex numbers.
An example total system transmission response matrix A for a hypothetical prior art multiuser detector assembled in accordance with Equations 4, 5, 6 and 7 is
for two (k=2) users, A(1) and A(2), having sixteen chips in a data block (Nc=16), a channel impulse response length of four (W=4) and a spreading factor for the first user of two (Q(1)=2) and a spreading factor for the second user of four (Q(2)=4). In the resultant total system transmission response matrix A, bn,i(k) denotes the ith element of the combined system and channel response for the nth symbol of the kth user.
The received data r is processed using the total system transmission response matrix A which represents a bank of matched filter responses to create a vector of matched-filter outputs which is denoted as y. The matched filtering operation is defined as
y=AHr. Equation 9
The matrix AH represents the Hermitian (or complex) transpose of the matrix A. The Hermitian transpose is defined as AijH=Āji where the over-bar denotes the operation of taking a conjugate of a complex number. The matched filter outputs are then multiplied by the inverse of an objective matrix O. The objective matrix O represents the processing which differentiates each type of linear receiver model. It is derived from the system transmission matrix A.
The zero-forcing block linear equalizer (ZF-BLE) receiver is a linear receiver with an objective matrix specified as O=AHA. The minimum mean square error block linear equalizer (MMSE-BLE) receiver is a linear receiver with an objective matrix specified as O=AHA+σ2I where σ2 is the variance of the noise present on each of the symbols of the received data vector r and the matrix I is known as an identity matrix. An identity matrix is square and symmetric with is on its main diagonal and zeros everywhere else. The size of the identity matrix is chosen so as to make the addition operation valid according to the rules of linear algebra.
For a decorrelator (decorrelating receiver), matrix A is simplified by ignoring the channel responses h(k), considering only the spreading codes and their cross-correlation (interference) properties. A cross-correlation matrix, commonly referred to as R, is generally constructed for decorrelator type receivers. This matrix can be constructed by assuming that W=1 and hi(k)=1 in the definition of A above (i.e. the channel response of every subchannel is an impulse). Then the cross correlation matrix R is the objective matrix O as defined for the ZF-BLE receiver. A decorrelator often serves as a sub-process of a more complex multiuser detection receiver. Once the objective matrix is created, the multiuser detector will invert the matrix, denoted as O1.
The inverse of the objective matrix is then multiplied by the matched filter output vector y to produce estimates of the data vector d where d(estimate)=O−1y. The inversion of the objective matrix O is a complex, computationally intensive process. The number of operations required to perform this process increase as the cube of the size of the matrix O. For most asynchronous CDMA receivers, the size of O is very large which makes the process of inversion impracticable.
To overcome this limitation, and to make the system physically realizable, a numerical method due to Cholesky is used. Cholesky decomposition can significantly reduce the computational complexity of inverting the matrix O if the matrix is banded.
A banded matrix is a square matrix that contains non-zero values only on several diagonals away from the main diagonal. The number of non-zero diagonals adjacent to the main diagonal that have at least one non-zero element is referred to as bandwidth. Thus, a symmetric matrix M is said to be banded with bandwidth p if
m
ij=0 for all j>I+p, Equation 10
where mij is an element of M, with I being the row index and j the column index. For a banded matrix with size denoted as n and bandwidth denoted as p, Cholesky decomposition can reduce the required numeric operations of inverting the objective matrix O from varying as the cube of the size of the matrix, n3, to varying as the size of the matrix times the square of the bandwidth, np2.
As discussed above, the objective matrix for a ZF-BLE receiver is O=AHA. To illustrate the numeric complexity, the objective matrix O for the total system response matrix A shown in Equation 6 is
where zeros denote all elements that by mathematical operation yield zero and with x's representing non-zero values. If the non-zero elements of the ith row and jth column of the total system response matrix A do not have the same vector index, then the corresponding element of objective matrix O with row index I and column index j will be 0. The bandwidth of O (Equation 11) is equal to 9 since there are non-zero elements as far as nine columns away from the main diagonal.
The objective matrix O as it is constructed in the prior art receiver shown in
The process which computes the column re-arrangement of matrix A performs the re-arrangement without any additional information. The re-arrangement reduces the operational complexity when inverting the matrix. Once the detection procedure is complete, a user data vector d is computed, a reversed re-arrangement process is performed descrambling vector d back to its original form for further processing.
In a typical asynchronous CDMA system, the bandwidth of a re-arranged objective matrix is at least ten times less than its original size. Therefore, a savings of at least a factor of 100 in processing time is achieved when Cholesky decomposition is performed on an objective matrix based upon a re-arranged total system response matrix. However, the prior art has not addressed a re-arrangement method for when different spreading factors are in use between active users.
Accordingly, there exists a need to determine a method to reduce the number of inversion steps when different spreading factors are in use.
The present invention relates to a multiuser detector method and receiver that detects and decodes synchronous or asynchronous CDMA subchannels having different spreading factors with reduced computational complexity. The multiuser receiver of the present invention is preferably compatible with ZF-BLE, MMSE, decorrelating detectors and the like using Cholesky decomposition to minimize numeric operations. The receiver and method arranges the columns of system transmission response matrices representing the response characteristics of individual users into a well-banded total system transmission response matrix which represents a plurality of matched-filter responses for a given block of received data. The invention when used in conjunction with Cholesky decomposition reduces the number of required mathematic operations prior to parallel matched filtering.
The embodiments will be described with reference to the drawing figures where like numerals represent like elements throughout.
Shown in
To obtain user data d(k) for a specific user from the combined user data r, the user data must be filtered using a matched filter 25 or the like. One knowledgeable in this art recognizes that a matched filter 25 requires a response characteristic which is the complex conjugate of the combination of the spread pulse shape and the user's subchannel impulse response to produce an output with a level representative of the signal prior to transmission. Signals input to the filter 25 which do not match with a given response characteristic produce a lower output.
Each individual k subchannel impulse response estimate h(k) is input to a first memory 27 where it is combined with the same user's spreading code 29 (Equation 3) creating a system transmission response estimate matrix A(k) for that user. An arrangement processor 33 of the present invention 17 performs a re-ordering of all matrix An(k) columns. The arrangement method 99 requires that each subchannel system transmission response matrix A(k) have the column structure defined by Equation 4 which is typical of linear receivers. If the system transmission response matrices A(k) are not of the form defined by Equation 4, the arrangement processor 33 first re-arranges the columns to the structure defined by Equation 4. The present invention 17 does not require that all system transmission response matrices A(k) be concatenated into a total system transmission response matrix A as defined by Equation 7.
The arrangement processor 33 examines each system transmission response matrix A(1), A(2), A(3), . . . A(k) column for the number of zero-value elements from the support of each vector bn(k) (Equation 4) defining top o(k)Tn and bottom offsets o(k)Bn as shown in
If two columns are encountered where one has a greater top offset and a greater bottom offset than another, if the difference between top offsets is greater than the difference between bottom offsets, the column with the lower top offset is assigned the lower index ni. If the difference between bottom offsets is greater than the difference between top offsets, the column with the greater bottom offset is assigned the lower index ni. If the differences between top and bottom offsets are equal, either of the two columns can be assigned the lower index ni.
The arrangement processor 33 assembles a total system transmission response matrix A′ in the order of the assigned column indices ni. The column indices ni are retained in memory 33 for use during the descrambling process 45. As an example, using the total system response matrices A(1) and A(2) described and shown in Equation 8, the arrangement method 99 of the present invention 17 produces the total system transmission response matrix A shown below
The arrangement method 99 indexed the eight columns (1-8) of system transmission response matrix A(1) and the four columns (9-12) of system transmission response matrix A(2) in an order of 1, 9, 2, 3, 10, 4, 5, 11, 6, 7, 12, 8 to create a well-banded total system transmission response matrix A (Equation 12).
The arrangement method 99 embodiment described above involves an examination of each system transmission response matrix A(1), A(2), A(3), . . . A(k) comparing each column with every other column for top o(k)Tn and bottom o(k)Bn offsets. Given the special structure of each system transmission response matrix A(k), namely, that the columns of each matrix are arranged in order of increasing top offsets and decreasing bottom offsets as you progress from left to right (reference Equation 8, matrices A(1) and A(2)), an alternative method 199 can be performed without having to examine each system transmission response matrix A(k) directly.
The alternative method 199 is shown in
An exemplary system illustrating the performance of the present invention 199 contains seven users having four different spreading factors Q(k) assigned as follows:
A respective spreading factor group g comprises at least one system transmission response matrix A(k), where each matrix A(k) is arbitrarily indexed from 1 to L(g). Each spreading factor group g is indexed according to increasing spreading factor magnitude.
Within each spreading factor group, the columns of the associated system transmission response matrices A(k) are assembled into common spreading factor group transmission response matrices AG(g), where g=1, 2, 3, . . . G (step 205). As shown in
All matrices in a spreading factor group g have the same number of columns due to the same spreading factor. Therefore, the assembled spreading factor group transmission response matrices AG(g) will have L(g) times the number of columns in one associated system transmission response matrices A(k).
To assemble a total system transmission response matrix A′ accommodating variable spreading factors, the spreading factor group transmission response matrix AG(g) having the lowest spreading factor is copied sequentially (step 207) into memory 33a, beginning with the first column, i.e., column one of AG(g), to the first allocated column of A′. The spreading factor group transmission response matrix AG(g) having the lowest spreading factor has the maximum number of columns. All other spreading factor group transmission response matrix columns will be inserted into this base matrix A′.
If the system spreading factors are even integer multiples of each other (step 209), the processor 33 assembles the total system transmission matrix A′ (step 211) by considering the remaining spreading factor group transmission matrices AG(g) in any order (step 209). For each spreading factor group transmission matrix AG(g), the processor 33 derives a column placement reference index m,
where Q(g) denotes the spreading factor associated with the spreading factor group transmission matrix AG(g) under consideration, Q(1) denotes the lowest spreading factor among all groups and n is the column of the spreading factor group transmission response matrix AG(g) under consideration where n=1, 2, 3, . . . N (step 211).
To use the column placement index m, a reference location in A′ is derived (step 215) using the total number of system transmission response matrices L(1) that constitute the spreading factor group matrix having the lowest spreading factor,
m×L(1) Equation 14
The processor 33 derives a column set from the spreading factor group transmission response matrix AG(g) under consideration (step 217) using the number of system transmission response matrices that belong to the spreading factor group currently under consideration,
L(g)×(n−1)+1 through L(g)×n. Equation 15
The processor 33 copies the column set defined by Equation 15 from AG(g) and inserts it (step 219) into the base matrix A′ after the column of AG(1) which has the reference location defined by Equation 14 as shown in
When the system spreading factors are not even integer multiples of each other, the right side expression of Equation 13 does not yield an integer. In this case, the processor 33 will round the result of Equation 13 to the nearest integer above or the nearest integer below the value (step 213). The rounding direction has negligible effect on overall system performance. The order in which the rest of the group system transmission matrices AG(g) are considered may have some effect on the system performance. A priori knowledge of the spreading factors can be used to choose an optimum order in advance.
Using the arrangement techniques described above, and for the case when spreading factors are even integer multiples of each other, a matrix bandwidth B can be achieved which can be shown to be bounded as:
Equation 16 predicts that the bandwidth of the total system transmission response matrix of Equation 11 will be between 3 and 6. An examination of Equation 12 reveals that the bandwidth after either arrangement method 99, 199 of the present invention 17 is 4.
The improvement the present invention 17 provides is further appreciated as the number of transmitted symbols increase. If a system transmitted 16,000 chips (800 symbols for a first user and 400 symbols for a second user), the bandwidth of the matrix AHA would be approximately 800. Using the arrangement method 99 to produce a total system response matrix A, the bandwidth of A′HA′ remains four since bandwidth (Equation 16) is independent of the number of transmitted symbols. After all of the elements of objective matrix O are derived, the inverse 41 is performed. Since the complexity of inverting a matrix is proportional to the square of its bandwidth, the present invention 17 provides a reduction of computational complexity by a factor of approximately (800/4)2=2002=40,000.
The total system transmission response matrix A′ provides the response characteristics to the matched-filter 25. Each column of the system response matrix A′ is a vector which represents the response characteristics of a particular symbol. The received data vector r is input to the matched-filter 25 where it is matched with every response characteristic from the total system transmission response matrix A′ to produce a matched filter output vector y. Each element of output vector y corresponds to a preliminary estimate of a particular symbol transmitted by a given user. The output vector y from the matched-filter 25 is loaded into a multiplier 43 with the inverted objective matrix O. Both the matched-filter 25 output vector y and the inverted objective matrix O are multiplied yielding a user data vector d. The user data vector d contains all of the data transmitted from all users during the discreet time block. Since the objective matrix O and the matched filter 25 output are based on the total system response matrix A′, the user data vector d must be de-scrambled. The de-scrambling process 149 is the inverse of the arrangement methods 99, 199.
A descrambler 45 re-arranges each element of the user data vector d based upon the column re-assignments performed while undergoing either arrangement method 99, 199. The elements of the data vector d are in the same order dictated by the total transmission response matrix A, 1, 9, 2, 3, 10, 4, 5, 11, 6, 7, 12, 8, transposed vertically. The descramber 45 allocates a memory space having the same dimension and places each vector element in sequential order, 1-12. After the user data vector d is descrambled 149, the user data is output 23 for further processing.
While the present invention has been described in terms of the preferred embodiment, other variations which are within the scope of the invention as outlined in the claims below will be apparent to those skilled in the art.
This application is a continuation of U.S. patent application Ser. No. 10/100,997, filed Mar. 19, 2002, which is a continuation of international application No. PCT/US00/02621, filed Feb. 2, 2000, which claims priority to U.S. Provisional Patent Application No. 60/154,985, filed Sep. 21, 1999, all of which are incorporated by reference.
Number | Date | Country | |
---|---|---|---|
60154985 | Sep 1999 | US |
Number | Date | Country | |
---|---|---|---|
Parent | 10100997 | Mar 2002 | US |
Child | 11588454 | US | |
Parent | PCT/US00/02621 | Feb 2000 | US |
Child | 10100997 | US |