Efficient joint detection

Information

  • Patent Grant
  • 7643584
  • Patent Number
    7,643,584
  • Date Filed
    Friday, January 23, 2009
  • Date Issued
    Tuesday, January 5, 2010
Abstract
Data signals are received over a shared spectrum in a code division multiple access communication format as a combined signal. The combined signal is sampled as a plurality of received vector versions. A plurality of system matrices and an associated covariance matrix are produced using codes and estimated impulse responses of the data signals. Each system matrix corresponds to a received vector version. The system and covariance matrices are extended and approximated as block circulant matrices. A diagonal matrix of each of the extended and approximated system and covariance matrices is determined by a prime factor algorithm-fast Fourier transform (PFA-FFT) without division of the matrix. The received vector versions are extended. A product of the diagonal matrices and the extended received vector versions is taken. An inverse block discrete Fourier transform is performed by a PFA-FFT on a result of the product to produce the estimated data.
Description
BACKGROUND


FIG. 1 is an illustration of a wireless communication system 10. The communication system 10 has base stations 12₁ to 12₅ (12) which communicate with user equipments (UEs) 14₁ to 14₃ (14). Each base station 12 has an associated operational area, where it communicates with UEs 14 in its operational area.


In some communication systems, such as frequency division duplex using code division multiple access (FDD/CDMA) and time division duplex using code division multiple access (TDD/CDMA), multiple communications are sent over the same frequency spectrum. These communications are differentiated by their channelization codes. To more efficiently use the frequency spectrum, TDD/CDMA communication systems use repeating frames divided into timeslots for communication. A communication sent in such a system will have one or multiple associated codes and timeslots assigned to it.


Since multiple communications may be sent in the same frequency spectrum and at the same time, a receiver in such a system must distinguish between the multiple communications. One approach to detecting such signals is multiuser detection (MUD). In MUD, signals associated with all the UEs 14 are detected simultaneously. For TDD/CDMA systems, one of the popular MUD techniques is a joint detection technique using a block linear equalizer (BLE-JD). Techniques for implementing BLE-JD include using a Cholesky or an approximate Cholesky decomposition. These approaches have high complexity. The high complexity leads to increased power consumption, which at the UE 14 results in reduced battery life.


Accordingly, it is desirable to have computationally efficient approaches to detecting received data.


SUMMARY

K data signals, or bursts, are transmitted over a shared spectrum in a code division multiple access communication format. A combined signal is received and sampled over the shared spectrum, as a plurality of received vector versions. The combined signal includes the K transmitted data signals. A plurality of system matrices and an associated covariance matrix are produced using codes and estimated impulse responses of the K data signals. Each system matrix corresponds to a received vector version. The system and covariance matrices are extended and approximated as block circulant matrices. A diagonal matrix of each of the extended and approximated system and covariance matrices is determined by a prime factor algorithm-fast Fourier transform (PFA-FFT) without division of the matrices. The received vector versions are extended. A product of the diagonal matrices and the extended received vector versions is taken. An inverse block discrete Fourier transform is performed by a PFA-FFT on a result of the product to produce the estimated data of the K data signals.





BRIEF DESCRIPTION OF THE DRAWING(S)


FIG. 1 is a wireless communication system.



FIG. 2 is a simplified transmitter and an efficient joint detection receiver.



FIG. 3 is an illustration of a communication burst.



FIGS. 4a and 4b are a flow chart of a preferred embodiment for efficient joint detection.



FIG. 5 is an illustration of a data burst indicating extended processing areas.



FIG. 6 is a block diagram of a preferred implementation of efficient joint detection.



FIG. 7 is a simplified receiver having multiple antennas.



FIG. 8 is a simplified receiver sampling the received signal using fractional sampling.



FIG. 9 is a simplified receiver having multiple antennas and using fractional sampling.



FIG. 10 is a block diagram of a preferred implementation of efficient joint detection for fractional sampling or receive diversity.





DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENT(S)


FIG. 2 illustrates a simplified transmitter 26 and receiver 28 using efficient joint detection in a TDD/CDMA communication system, although efficient joint detection is applicable to other systems, such as FDD/CDMA. In a typical system, a transmitter 26 is in each UE 14 and multiple transmitting circuits 26 sending multiple communications are in each base station 12. The joint detection receiver 28 may be at a base station 12, UEs 14 or both.


The transmitter 26 sends data over a wireless radio channel 30. A data generator 32 in the transmitter 26 generates data to be communicated to the receiver 28. A modulation/spreading/training sequence insertion device 34 spreads the data with the appropriate code(s) and makes the spread reference data time-multiplexed with a midamble training sequence in the appropriate assigned time slot, producing a communication burst or bursts.


A typical communication burst 16 has a midamble 20, a guard period 18 and two data fields 22, 24, as shown in FIG. 3. The midamble 20 separates the two data fields 22, 24 and the guard period 18 separates the communication bursts to allow for the difference in arrival times of bursts transmitted from different transmitters 26. The two data fields 22, 24 contain the communication burst's data.


The communication burst(s) are modulated by a modulator 36 to radio frequency (RF). An antenna 38 radiates the RF signal through the wireless radio channel 30 to an antenna 40 of the receiver 28. The type of modulation used for the transmitted communication can be any of those known to those skilled in the art, such as quadrature phase shift keying (QPSK) or M-ary quadrature amplitude modulation (QAM).


The antenna 40 of the receiver 28 receives various radio frequency signals. The received signals are demodulated by a demodulator 42 to produce a baseband signal. The baseband signal is sampled by a sampling device 43, such as one or multiple analog to digital converters, at the chip rate of the transmitted bursts. The samples are processed, such as by a channel estimation device 44 and an efficient joint detection device 46, in the time slot and with the appropriate codes assigned to the received bursts. The channel estimation device 44 uses the midamble training sequence component in the baseband samples to provide channel information, such as channel impulse responses. The channel information is used by the efficient joint detection device 46 to estimate the transmitted data of the received communication bursts as soft symbols.


The efficient joint detection device 46 uses the channel information provided by the channel estimation device 44 and the known spreading codes used by the transmitter 26 to estimate the data of the desired received communication burst(s).


Although efficient joint detection is explained using the third generation partnership project (3GPP) universal terrestrial radio access (UTRA) TDD system as the underlying communication system, it is applicable to other systems. That system is a direct sequence wideband CDMA (W-CDMA) system, where the uplink and downlink transmissions are confined to mutually exclusive timeslots.


The receiver 28 receives a total of K bursts that arrive simultaneously, within one observation interval. For the 3GPP UTRA TDD system, each data field of a time slot corresponds to one observation interval. For a frequency division duplex (FDD) CDMA system, the received signals are continuous, i.e., not in bursts. To handle the continuous signals, FDD systems divide the received signals into time segments prior to applying efficient joint detection.


A code used for a kth burst is represented as c(k). The K bursts may originate from K different transmitters or for multi-code transmissions, less than K different transmitters.


Each data field of a communication burst has a predetermined number of transmitted symbols, Ns. Each symbol is transmitted using a predetermined number of chips, which is the spreading factor, Q (also written SF). Accordingly, each data field has Ns×Q chips. After passing through the wireless radio channel, which can introduce a delay spread of up to W−1 chips, the observation interval at the receiver is of length Q×Ns+W−1 chips.


The symbol response vector b(k) is the convolution of the channel response vector h(k) with the corresponding spreading code c(k), per Equation 1.

b(k)=h(k)∘c(k)  Equation 1

∘ denotes the convolutional operator. The length of b(k) is SF+W−1.
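As an illustration of Equation 1, the following minimal sketch convolves a channel response with a spreading code. The function name `convolve` and the numeric values (W = 3 taps, spreading factor 4) are hypothetical and chosen only to show that the result has length SF+W−1.

```python
def convolve(h, c):
    """Full linear convolution of two sequences (lists of numbers)."""
    out = [0] * (len(h) + len(c) - 1)
    for i, h_i in enumerate(h):
        for j, c_j in enumerate(c):
            out[i + j] += h_i * c_j
    return out

# Hypothetical values: W = 3 channel taps, spreading factor 4.
h_k = [1.0, 0.5, 0.25]      # estimated channel response h(k)
c_k = [1, -1, 1, -1]        # spreading code c(k)
b_k = convolve(h_k, c_k)    # symbol response b(k), length 4 + 3 - 1 = 6
```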


Using the symbol response vectors, the system matrix A is defined as per Equation 2.









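$$A = \begin{bmatrix} B & & & \\ & B & & \\ & & \ddots & \\ & & & B \end{bmatrix}$$  Equation 2

Each block column holds one copy of B, shifted down by Q rows relative to the block column to its left.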







The size of the matrix is (Ns•SF+W−1)×Ns•K. A is a block Toeplitz matrix.


Block B is defined as per Equation 3.

B=[b(1)b(2) . . . b(K)]  Equation 3
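The block Toeplitz layout of Equations 2 and 3 can be sketched as follows. This is an illustrative construction under assumed toy sizes, not the patented implementation; `build_system_matrix` is a hypothetical helper name.

```python
def build_system_matrix(B, num_symbols, Q):
    """Place one copy of B per symbol, shifted down Q rows each time."""
    rows_b, K = len(B), len(B[0])              # rows_b = Q + W - 1
    n_rows = (num_symbols - 1) * Q + rows_b    # = Ns*Q + W - 1
    A = [[0] * (num_symbols * K) for _ in range(n_rows)]
    for n in range(num_symbols):
        for i in range(rows_b):
            for k in range(K):
                A[n * Q + i][n * K + k] = B[i][k]
    return A

# Hypothetical toy sizes: Q = 2, W = 2 (so B has 3 rows), K = 2, Ns = 3.
B = [[1, 4], [2, 5], [3, 6]]
A = build_system_matrix(B, num_symbols=3, Q=2)
```

With these sizes A has Ns•Q+W−1 = 7 rows and Ns•K = 6 columns, and adjacent copies of B overlap in W−1 = 1 row.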

The received vector sampled at the chip rate can be represented by Equation 4.

r=Ad+n  Equation 4


The size of vector r is (Ns•SF+W−1) by 1. This size corresponds to the observation interval.


Data vector d of size Ns•K by 1 has the form of Equation 5.

d=[d1Td2T . . . dNsT]T  Equation 5


The sub-vector dn of size K by 1 is composed of the nth symbol of each user and is defined as Equation 6.

dn=[dn(1)dn(2) . . . dn(K)]T, n=1, . . . , Ns  Equation 6


The vector n of size (Ns•SF+W−1) by 1 is the background noise vector and is assumed to be white.


Determining d using an MMSE solution is per Equation 7.

d=R−1(AHr)  Equation 7

(•)H represents the Hermitian function (complex conjugate transpose). The covariance matrix of the system matrix R for a preferred MMSE solution is per Equation 8.

R=AHA+σ2I  Equation 8

σ2 is the noise variance, typically obtained from the channel estimation device 44, and I is the identity matrix.
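A minimal sketch of forming the covariance matrix of Equation 8 is given below, using plain nested lists. The helper names and the small matrix A are hypothetical; a practical receiver would exploit the block structure rather than form the full product.

```python
def hermitian(M):
    """Complex conjugate transpose of a matrix given as a list of rows."""
    return [[complex(M[r][c]).conjugate() for r in range(len(M))]
            for c in range(len(M[0]))]

def matmul(X, Y):
    return [[sum(X[i][k] * Y[k][j] for k in range(len(Y)))
             for j in range(len(Y[0]))] for i in range(len(X))]

def mmse_covariance(A, noise_var):
    """R = A^H A + sigma^2 I (Equation 8)."""
    R = matmul(hermitian(A), A)
    for i in range(len(R)):
        R[i][i] += noise_var
    return R

# Hypothetical 3 x 2 system matrix and noise variance.
A = [[1, 0], [1j, 1], [0, 2]]
R = mmse_covariance(A, noise_var=0.5)
```

Setting `noise_var` to zero gives the zero forcing form R = A^H A.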


Using block circulant approximation and block DFT using PFA-FFTs, d in Equation 7 can be determined per Equation 9.













$$\underline{d} = F^{-1}\left(F(R^{-1})\,F(A^H\underline{r})\right) \approx F^{-1}\left(\Lambda^{-1}\,\Lambda_A^H\,F(\underline{r}_c)\right)$$  Equation 9








F(•) and F−1(•) indicate the block-DFT function and the inverse block-DFT, respectively. The derivation of the block diagonal matrices Λ and ΛA is described subsequently. Instead of directly solving Equation 9, Equation 9 can be solved using the LU decomposition and the forward and backward substitution of the main diagonal block of Λ. Alternately, Equation 9 can be solved using Cholesky decomposition.



FIG. 4 is a flowchart for a preferred method of determining the data vector d using fast joint detection. The system matrix A is determined using the estimated channel response vector h(k) and the spreading code c(k) for each burst, 48. R, the covariance matrix of the system matrix, is formed, 49. The system matrix A and its covariance matrix R are extended to block-square matrices. The extended A is of size D•Q by D•K and the extended R is of size D•K by D•K. D is chosen as per Equation 10.









$$D \ge N_s + \frac{W-1}{Q}$$  Equation 10








Both extended matrices are approximated to block circulant matrices, Ac and Rc, 50. Because of the extension of A and R, the received vector, r, is extended to the vector rc of size D•SF×1 by inserting zeros, 51. The block-diagonal matrix, Λ, is determined by taking a block DFT using PFA-FFT of the first block column of Rc, 52.
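The choice of D and the zero-extension of the received vector can be sketched as below. The sketch assumes D is taken as the smallest integer satisfying Equation 10; the function names and sizes are hypothetical.

```python
import math

def choose_block_count(num_symbols, W, Q):
    """Smallest integer D with D >= Ns + (W - 1) / Q."""
    return num_symbols + math.ceil((W - 1) / Q)

def extend_received_vector(r, D, Q):
    """Zero-pad r to length D*Q."""
    return list(r) + [0] * (D * Q - len(r))

# Hypothetical sizes: Ns = 3 symbols, W = 3 taps, Q = 4 chips per symbol.
Ns, W, Q = 3, 3, 4
r = list(range(1, Ns * Q + W))          # length Ns*Q + W - 1 = 14
D = choose_block_count(Ns, W, Q)
r_c = extend_received_vector(r, D, Q)   # length D*Q
```

A larger D may also be used, for example when a convenient transform length is wanted.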


The block DFT of matched filtering F(AHr) is approximated by F(AcHrc). It is calculated by taking a block DFT using PFA-FFT of Ac and rc, 53. Due to the block-diagonal structure of Λ and ΛA, the blocks F(d)(i), i=1, . . . , D, are of size K by 1 in F(d). They are determined by performing, on the main diagonal blocks Λ(i) of Λ, LU decomposition, Λ(i)=L(i)U(i), forward substitution, L(i)y(i)=ΛA(i)HF(rc)(i), 55, and backward substitution, U(i)[F(d)](i)=y(i), 56. L(i) is a lower triangular matrix. U(i) is an upper triangular matrix. ΛA(i) is the ith main diagonal block of size SF by K in ΛA and F(rc)(i) is the ith block of size Q×1 in F(rc). ΛA is the block DFT using PFA-FFT of the first block column of Ac and F(rc) is the block DFT using PFA-FFT of the vector, rc. The estimated data vector, d, is determined by an inverse block-DFT of F(d), 57.


Although Equation 9 is an MMSE based solution, fast joint detection can be applied to other approaches, such as a zero forcing approach as per Equation 11.

Rd=(AHA)d=AHr  Equation 11

As shown in Equation 11, in the zero forcing solution, the σ2I term is deleted from Equation 8. The following is a derivation for the MMSE solution, although an analogous derivation can be used for a zero forcing solution.


To reduce the complexity in determining F(AHr), a block DFT using PFA-FFT approach may be used that takes advantage of the block-Toeplitz structure of A shown in Equation 2. First, by repeating B, we extend A to a block-square matrix of size D•Q×D•K, to use all of the chip symbols in the observation interval. The extended A is composed of D² blocks of size Q×K. The extended A is approximated to the block-circulant matrix Ac.


Ac can be decomposed into three matrices per Equation 12.

Ac=F(Q)HΛAF(K)  Equation 12

F(n)=F⊗In is a block DFT using PFA-FFT matrix of size D•n by D•n. ⊗ denotes a Kronecker product. In is an identity matrix of size n×n and F is a DFT matrix of size D×D, whose elements fil, i and l=1, 2, . . . , D, are per Equation 13.










$$f_{il} = \frac{1}{\sqrt{D}}\,\exp\left(-j\,\frac{2\pi\, i l}{D}\right)$$  Equation 13








D is the length of the DFT and FHF=I. I is an identity matrix of size D×D.
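The property FHF=I can be checked numerically with a short sketch. It assumes the DFT matrix carries the 1/√D scaling that this property requires, with indices i and l running from 1 to D; the helper names are hypothetical.

```python
import cmath
import math

def dft_matrix(D):
    """D x D matrix with entries exp(-j*2*pi*i*l/D)/sqrt(D), i, l = 1..D."""
    s = 1 / math.sqrt(D)
    return [[s * cmath.exp(-2j * math.pi * i * l / D)
             for l in range(1, D + 1)] for i in range(1, D + 1)]

def gram(F):
    """Return F^H F."""
    D = len(F)
    return [[sum(F[k][a].conjugate() * F[k][b] for k in range(D))
             for b in range(D)] for a in range(D)]

G = gram(dft_matrix(6))   # should equal the 6 x 6 identity up to rounding
```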


The block diagonal matrix ΛA is of size D•Q×D•K and has the form per Equation 14.










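$$\Lambda_A = \begin{bmatrix} \Lambda_A^{(1)} & 0 & \cdots & 0 \\ 0 & \Lambda_A^{(2)} & \cdots & 0 \\ \vdots & \vdots & \ddots & \vdots \\ 0 & 0 & \cdots & \Lambda_A^{(D)} \end{bmatrix}$$  Equation 14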







Each of its entries ΛA(i), i=1, 2, . . . , D, is a Q by K block per Equation 15.










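$$\Lambda_A^{(i)} = \begin{bmatrix} \lambda_{1,1}^{(A,i)} & \cdots & \lambda_{1,K}^{(A,i)} \\ \vdots & \ddots & \vdots \\ \lambda_{Q,1}^{(A,i)} & \cdots & \lambda_{Q,K}^{(A,i)} \end{bmatrix}$$  Equation 15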







Alternatively, the main diagonal blocks, ΛA(i), i=1, 2, . . . , D, can be computed per Equation 16.

$\left[\Lambda_A^{(1)T}\ \Lambda_A^{(2)T} \cdots \Lambda_A^{(D)T}\right]^T = F_{(Q)}\,A_c(:,1:K)$  Equation 16

Ac(:,1:K) denotes the first block column of Ac, namely the first K columns of Ac. F(Q)Ac(:,1:K) can be calculated by Q•K parallel non-block DFTs of length D, using PFA-FFTs.
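The idea of Q•K parallel length-D DFTs over the first block column can be sketched as follows. For brevity the sketch uses a direct DFT in place of a PFA-FFT, zero-based block indices, and an unnormalized transform; all three are assumptions of this illustration, and the function name is hypothetical.

```python
import cmath
import math

def block_dft_first_column(blocks):
    """blocks: list of D matrices (Q x K), the first block column of A_c.
    Returns D matrices, one length-D DFT per (q, k) entry position."""
    D, Q, K = len(blocks), len(blocks[0]), len(blocks[0][0])
    w = [cmath.exp(-2j * math.pi * n / D) for n in range(D)]
    return [[[sum(blocks[l][q][k] * w[(i * l) % D] for l in range(D))
              for k in range(K)] for q in range(Q)] for i in range(D)]

# Hypothetical first block column: D = 4 blocks of size Q x K = 2 x 1.
col = [[[1], [2]], [[3], [0]], [[0], [0]], [[0], [0]]]
lam = block_dft_first_column(col)   # lam[i] plays the role of a diagonal block
```

Each of the Q•K entry positions is transformed independently, which is why the transforms can run in parallel.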


Due to the extension of A, the received vector r is also extended by inserting zeros, becoming vector rc of size D•Q by 1.


Using the above, F(AHr) is approximated to F(AcHrc). It can be written as Equation 17.

F(AcHrc)=F(K)HΛAHF(Q)rc  Equation 17


The covariance matrix R of size Ns•K×Ns•K has the block-square matrix form shown in Equation 18.









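$$R = \begin{bmatrix}
R_0 & R_1^H & \cdots & R_L^H & 0 & \cdots & 0 \\
R_1 & R_0 & R_1^H & \cdots & R_L^H & \ddots & \vdots \\
\vdots & R_1 & R_0 & \ddots & & \ddots & 0 \\
R_L & \vdots & \ddots & \ddots & \ddots & & R_L^H \\
0 & R_L & & \ddots & R_0 & R_1^H & \vdots \\
\vdots & \ddots & \ddots & & R_1 & R_0 & R_1^H \\
0 & \cdots & 0 & R_L & \cdots & R_1 & R_0
\end{bmatrix}$$  Equation 18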








L is defined per Equation 19.









$$L = \left\lceil \frac{Q+W-1}{Q} \right\rceil$$  Equation 19








Each entry, Ri, in the R matrix is a K by K block and 0 is a K by K zero matrix. Due to the size of the extended A, the matrix R is also extended to size D•K by D•K by inserting zeros.


The extended R is approximated to a block-circulant matrix, Rc, of size D•K by D•K per Equation 20.










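$$R_c = \begin{bmatrix}
R_0 & R_1^H & \cdots & R_L^H & 0 & \cdots & 0 & R_L & \cdots & R_2 & R_1 \\
R_1 & R_0 & R_1^H & \cdots & R_L^H & 0 & \cdots & 0 & R_L & \cdots & R_2 \\
\vdots & & \ddots & & & \ddots & & & & \ddots & \vdots \\
R_1^H & R_2^H & \cdots & R_L^H & 0 & \cdots & 0 & R_L & \cdots & R_1 & R_0
\end{bmatrix}$$  Equation 20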







The block circulant matrix Rc is decomposed into three matrices per Equation 21.

Rc=F(K)HΛF(K)  Equation 21

F(K)=F⊗IK is a block DFT using PFA-FFT matrix of size D•K×D•K. ⊗ denotes a Kronecker product. IK is an identity matrix of size K×K and F is a DFT matrix of size D×D as described in Equation 13.


The block diagonal matrix Λ of size D•K by D•K has the form per Equation 22.









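$$\Lambda = \begin{bmatrix} \Lambda^{(1)} & 0 & \cdots & 0 \\ 0 & \Lambda^{(2)} & \cdots & 0 \\ \vdots & \vdots & \ddots & \vdots \\ 0 & 0 & \cdots & \Lambda^{(D)} \end{bmatrix}$$  Equation 22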








Each of its entries, Λ(i), i=1, 2, . . . , D, is a K by K block, per Equation 23A.










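$$\Lambda^{(i)} = \begin{bmatrix} \lambda_{1,1}^{(i)} & \cdots & \lambda_{1,K}^{(i)} \\ \vdots & \ddots & \vdots \\ \lambda_{K,1}^{(i)} & \cdots & \lambda_{K,K}^{(i)} \end{bmatrix}$$  Equation 23A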








Alternatively, the main diagonal blocks, Λ(i), i=1, 2, . . . , D, can be computed per Equation 23B.

$\left[\Lambda^{(1)T}\ \Lambda^{(2)T} \cdots \Lambda^{(D)T}\right]^T = F_{(K)}\,R_c(:,1:K)$  Equation 23B

Rc(:,1:K) denotes the first block column of Rc, namely the first K columns of Rc. F(K)Rc(:,1:K) can be calculated by K² parallel non-block DFTs of length D, using PFA-FFTs.


The estimated data vector, d, in Equation 7 is preferably approximated per Equation 24A.













$$\underline{d} = R^{-1}A^H\underline{r} \approx R_c^{-1}A_c^H\underline{r}_c = F_{(K)}^H\,\Lambda^{-1}\,\Lambda_A^H\,F_{(Q)}\,\underline{r}_c$$  Equation 24A








The block diagonal matrix Λ−1 is per Equation 24B.










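$$\Lambda^{-1} = \begin{bmatrix} \left(\Lambda^{(1)}\right)^{-1} & 0 & \cdots & 0 \\ 0 & \left(\Lambda^{(2)}\right)^{-1} & \cdots & 0 \\ \vdots & \vdots & \ddots & \vdots \\ 0 & 0 & \cdots & \left(\Lambda^{(D)}\right)^{-1} \end{bmatrix}$$  Equation 24B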








The inversion of Λ requires inverting only the D K×K matrices, Λ(i), i=1, 2, . . . , D.


Equation 24A can be rewritten as Equation 25.

F(d)=Λ−1ΛAHF(rc)  Equation 25

F(rc) and F(d) are per Equations 26A and 26B.

$F(\underline{r}_c) \equiv F_{(Q)}\,\underline{r}_c$  Equation 26A
$F(\underline{d}) \equiv F_{(K)}\,\underline{d}$  Equation 26B
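The frequency-domain solve of Equations 24A and 25 can be illustrated in the simplest case, where every block is a scalar (K=Q=1) and the system is an ordinary circulant. This is a sketch under that simplifying assumption, with a direct DFT standing in for a PFA-FFT; the function names and numbers are hypothetical.

```python
import cmath
import math

def dft(x, sign=-1):
    """Unnormalized DFT (sign=-1) or its unscaled inverse kernel (sign=+1)."""
    D = len(x)
    return [sum(x[n] * cmath.exp(sign * 2j * math.pi * i * n / D)
                for n in range(D)) for i in range(D)]

def idft(X):
    return [v / len(X) for v in dft(X, sign=+1)]

def circulant_solve(first_col, b):
    """Solve C d = b for a circulant C defined by its first column."""
    lam = dft(first_col)                 # diagonal entries of Lambda
    Fb = dft(b)
    return idft([Fb[i] / lam[i] for i in range(len(b))])

first_col = [4, 1, 0, 1]                 # hypothetical circulant R_c
d_true = [1, 2, 3, 4]
D = len(first_col)
b = [sum(first_col[(r - c) % D] * d_true[c] for c in range(D)) for r in range(D)]
d = circulant_solve(first_col, b)        # recovers d_true up to rounding
```

For K greater than 1 the scalar division becomes a K×K solve per block.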

Due to the block-diagonal structure of Λ−1 and ΛAH, Equation 25 can be efficiently calculated as follows. The terms of Equation 25 are partitioned into D blocks, as per Equation 27.










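$$\begin{bmatrix} F(\underline{d})^{(1)} \\ F(\underline{d})^{(2)} \\ \vdots \\ F(\underline{d})^{(D)} \end{bmatrix} = \begin{bmatrix} \left(\Lambda^{(1)}\right)^{-1} & & & \\ & \left(\Lambda^{(2)}\right)^{-1} & & \\ & & \ddots & \\ & & & \left(\Lambda^{(D)}\right)^{-1} \end{bmatrix} \begin{bmatrix} \Lambda_A^{(1)} & & & \\ & \Lambda_A^{(2)} & & \\ & & \ddots & \\ & & & \Lambda_A^{(D)} \end{bmatrix}^H \begin{bmatrix} F(\underline{r}_c)^{(1)} \\ F(\underline{r}_c)^{(2)} \\ \vdots \\ F(\underline{r}_c)^{(D)} \end{bmatrix}$$  Equation 27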







Each block in Equation 27 is solved separately, per Equation 28.

$F(\underline{d})^{(i)} = \left(\Lambda^{(i)}\right)^{-1}\,\Lambda_A^{(i)H}\,F(\underline{r}_c)^{(i)}$  Equation 28


F(d)(i) is a K by 1 vector. Λ(i) is a K by K matrix, per Equation 22. ΛA(i) is a Q by K matrix, per Equation 14. F(rc)(i) is a Q by 1 vector and is composed of elements (1+(i−1)Q) through (i•Q) of F(rc).


To avoid the direct inversion of Λ(i), Equation 28 can be solved by using LU decomposition and forward and backward substitution. Equation 28 is rewritten per Equation 29.

$\Lambda^{(i)}\,F(\underline{d})^{(i)} = \Lambda_A^{(i)H}\,F(\underline{r}_c)^{(i)}$  Equation 29

Λ(i) is decomposed per Equation 30.

Λ(i)=L(i)U(i)  Equation 30

L(i) is a lower triangular matrix and U(i) is an upper triangular matrix.


Using LU decomposition, Equation 28 is represented per Equation 31.

$L^{(i)}\,U^{(i)}\,F(\underline{d})^{(i)} = \Lambda_A^{(i)H}\,F(\underline{r}_c)^{(i)}$  Equation 31

F(d)(i) in Equation 31 is solved by forward substitution, per Equation 32, and backward substitution, per Equation 33.

Forward Substitution: $L^{(i)}\,\underline{y}^{(i)} = \Lambda_A^{(i)H}\,F(\underline{r}_c)^{(i)}$  Equation 32
Backward Substitution: U(i)[F(d)](i)=y(i)  Equation 33
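The per-block solve of Equations 30 through 33 can be sketched with a textbook Doolittle LU factorization. The sketch omits pivoting, which is an assumption that holds for well-conditioned blocks such as the Hermitian positive definite example used here; the helper names and numbers are hypothetical.

```python
def lu_decompose(M):
    """Doolittle LU factorization without pivoting: M = L U."""
    n = len(M)
    L = [[1.0 if i == j else 0.0 for j in range(n)] for i in range(n)]
    U = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i, n):
            U[i][j] = M[i][j] - sum(L[i][k] * U[k][j] for k in range(i))
        for j in range(i + 1, n):
            L[j][i] = (M[j][i] - sum(L[j][k] * U[k][i] for k in range(i))) / U[i][i]
    return L, U

def forward_substitution(L, b):
    """Solve L y = b for lower triangular L."""
    y = []
    for i in range(len(b)):
        y.append((b[i] - sum(L[i][k] * y[k] for k in range(i))) / L[i][i])
    return y

def backward_substitution(U, y):
    """Solve U x = y for upper triangular U."""
    n = len(y)
    x = [0] * n
    for i in reversed(range(n)):
        x[i] = (y[i] - sum(U[i][k] * x[k] for k in range(i + 1, n))) / U[i][i]
    return x

Lam_i = [[4, 1j], [-1j, 3]]      # hypothetical K x K block, K = 2
rhs = [4 + 2j, 6 - 1j]           # stands in for the right side of Equation 29
L, U = lu_decompose(Lam_i)
x = backward_substitution(U, forward_substitution(L, rhs))
```

Repeating this for each of the D blocks yields all of F(d) without ever forming an explicit inverse.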


Finally, d is determined for all blocks as per Equation 34.










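$$\underline{d} = F^{-1}\left(F(\underline{d})\right) = F^{-1}\left(\begin{bmatrix} F(\underline{d})^{(1)} \\ F(\underline{d})^{(2)} \\ \vdots \\ F(\underline{d})^{(D)} \end{bmatrix}\right)$$  Equation 34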








FIG. 6 is a block diagram of a preferred implementation of efficient joint detection in a TDD/CDMA system. Using the received vector, r, rc is formed by inserting zeros, then a block DFT 100 of rc is performed per Equation 26A, to produce F(rc).


Using the received training sequences, the channel impulse response for each transmitted burst, h(k), is determined by an estimate channel response block 102. Using each channelization code, c(k), and the channel impulse response, h(k), the system matrix, A, is determined by compute block matrix A block 104 per Equation 2.


To determine ΛA, the system matrix A is extended by extend block 132, to use all received chips in the observation interval. The first block column of the block-circulant matrix Ac is determined by selecting the first K columns of the extended A matrix by first block column block 114. By taking a block DFT using PFA-FFT 118, ΛA is determined.


To determine Λ, R is first determined by compute R block 140. For an MMSE solution, R=AHA+σ2I is used; for a zero forcing solution, R=AHA is used. Due to the size of the extended A, R is also extended by extend block 134. The first block column of the extended R matrix is determined by selecting the first K columns of the extended R matrix by first block column block 108. The first block column of the extended R matrix is circularized by a circularize block column block 110. It becomes the first block column of a block-circulant Rc. By taking a block DFT using PFA-FFT by block DFT block 112, Λ is determined.


To efficiently compute the estimated data vector d, ΛA, Λ, and F(rc) are divided into blocks ΛA(i), Λ(i), and F(rc)(i), i=1, 2, . . . , D, respectively, exploiting the block-diagonal structures of ΛA and Λ, by partition block 136. The complex conjugate transpose of ΛA(i), ΛA(i)H, is determined by a transpose block 130. A multiplier 128 multiplies ΛA(i)H by F(rc)(i). Λ(i) is decomposed using LU decomposition by a LU decomposition block 126, per Equation 30. By performing forward and backward substitution, per Equations 31-33, using forward and backward substitution blocks 124 and 122, respectively, F(d)(i) is determined. By repeating the LU decomposition and forward and backward substitution D times, F(d) is found. Taking an inverse block DFT using PFA-FFT of F(d) by block inverse DFT block 120, d is estimated.



FIGS. 7, 8 and 9 are simplified diagrams of receivers applying efficient joint detection to multiple reception antennas and/or fractional (multiple chip rate) sampling. A receiver 28 with multiple reception antennas is shown in FIG. 7. Transmitted bursts are received by each antenna 40₁ to 40ₘ (40). Each antenna's version of the received bursts is reduced to baseband, such as by demodulators 42₁ to 42ₘ. The baseband signals for each antenna are sampled by sampling devices 43₁ to 43ₘ to produce a received vector, r1 to rm, for each antenna 40. The samples corresponding to the midamble are processed by a channel estimation device 144 to produce channel response matrices, H1 to Hm, for each antenna 40. The received data vector, d, is determined by an efficient joint detection device 142 using the received vectors and the channel response matrices.


A receiver 28 using fractional sampling is shown in FIG. 8. Transmitted bursts are received by the antenna 40. The received bursts are reduced to baseband, such as by a demodulator 42. The baseband signal is sampled by a sampling device 43 to produce fractional samples as received vectors, r1 to rm. Each received vector represents chip rate samples sampled at a fraction of a chip offset. To illustrate, for twice the chip rate sampling, two received vectors r1 and r2 are produced. Each of those vectors has samples spaced by half a chip in time. Samples corresponding to the midamble are processed by a channel estimation device 144 to produce channel response matrices, H1 to Hm, for each set of fractional samples. The received data vector, d, is determined by an efficient joint detection device 142 using the received vectors and the channel response matrices.


A receiver 28 with multiple reception antennas and using fractional sampling is shown in FIG. 9. Transmitted bursts are received by each antenna 40₁ to 40ᵢ (40). Each antenna's version of the received bursts is reduced to baseband, such as by demodulators 42₁ to 42ᵢ. The baseband signals for each antenna are sampled by sampling devices 43₁ to 43ⱼ to produce received vectors, r1 to rm. The received vectors for each antenna correspond to each multiple of the chip rate samples. The samples corresponding to the midamble are processed by a channel estimation device 144 to produce channel response matrices, H1 to Hm, for each antenna's fractional samples. The received data vector, d, is determined by an efficient joint detection device 142 using the received vectors and the channel response matrices.


In applying efficient joint detection to either receive diversity, fractional sampling or both, the received communication bursts are viewed as M virtual chip rate received bursts. To illustrate, for twice the chip rate sampling and two antenna receive diversity, the received bursts are modeled as four (M=4) virtual chip rate received bursts.
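The virtual-burst view can be sketched by splitting each antenna's oversampled stream into chip-rate vectors, one per fractional offset. The function name and sample values are hypothetical, and the ordering of the resulting vectors is an arbitrary choice of this illustration.

```python
def virtual_bursts(antenna_samples, oversampling):
    """Split each antenna's oversampled stream into chip-rate vectors."""
    bursts = []
    for samples in antenna_samples:
        for offset in range(oversampling):
            bursts.append(samples[offset::oversampling])
    return bursts

# Two antennas sampled at twice the chip rate give M = 4 virtual bursts.
ant1 = [10, 11, 12, 13, 14, 15]
ant2 = [20, 21, 22, 23, 24, 25]
r = virtual_bursts([ant1, ant2], oversampling=2)
```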


Each received burst is a combination of K transmitted bursts. Each of the K transmitted bursts has its own code. The channel impulse response vector of the kth out of K codes and the mth out of the M virtual received bursts is h(k,m). h(k,m) has a length W and is estimated from the midamble samples of the burst of the kth code of the mth virtual received burst.


Each of the N data symbols of the burst of the kth code is per Equation 35.

d(k)=[d1(k)d2(k) . . . dN(k)]T, 1≦k≦K  Equation 35


The code of the kth burst is per Equation 36.

c(k)=[c1(k)c2(k) . . . cQ(k)]T, 1≦k≦K  Equation 36


The symbol response of the kth code's contribution to the mth virtual burst, b(k,m) is per Equation 37.

b(k,m)=h(k,m)∘c(k)  Equation 37


The length of the symbol response is Q+W−1. Q is the spreading factor. The system matrix, A(m), for each mth received burst is per Equation 38.










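$$A^{(m)} = \begin{bmatrix} B^{(m)} & & & \\ & B^{(m)} & & \\ & & \ddots & \\ & & & B^{(m)} \end{bmatrix}$$  Equation 38

Each block column holds one copy of B(m), shifted down by Q rows relative to the block column to its left.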







Each block B(m) is of size (Q+W−1) by K and is per Equation 39.

B(m)=[b(1,m)b(2,m) . . . b(K,m)]  Equation 39


The overall system matrix A is per Equation 40.









$$A = \begin{bmatrix} A^{(1)} \\ A^{(2)} \\ \vdots \\ A^{(M)} \end{bmatrix}$$  Equation 40







As shown in Equation 38, each sub-system matrix A(m) is block Toeplitz. The overall received vector of the M virtual bursts is of size M(NQ+W−1) and is per Equation 41.

r=[r1Tr2T . . . rMT]T  Equation 41

The mth received vector rm is of size NQ+W−1 by 1.


Equation 42 is a model for the overall received vector.

r=Ad+n  Equation 42

n is the noise vector.


Each mth received virtual burst is per Equation 43.

rm=A(m)d+nm  Equation 43

nm is the noise vector for the mth received virtual burst.


To solve for the data vector d in Equation 42, a block linear equalizer with either a zero forcing or minimum mean square error (MMSE) approach may be used per Equation 44.

$\hat{\underline{d}} = R^{-1}A^H\underline{r}$  Equation 44

R is the covariance matrix.


For a zero forcing solution, R is per Equation 45.









$$R = \sum_{m=1}^{M} A^{(m)H} A^{(m)} = A^H A$$  Equation 45







For a MMSE solution, R is per Equation 46.









$$R = \sum_{m=1}^{M} A^{(m)H} A^{(m)} + \sigma^2 I = A^H A + \sigma^2 I$$  Equation 46
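The identity behind Equations 45 and 46, that the sum of the per-burst products equals the product formed from the stacked matrix of Equation 40, can be checked with a small sketch. The matrices, noise variance and helper names are hypothetical.

```python
def gram(A):
    """Return A^H A for a matrix given as a list of rows."""
    n = len(A[0])
    return [[sum(complex(row[a]).conjugate() * row[b] for row in A)
             for b in range(n)] for a in range(n)]

def add(X, Y):
    return [[x + y for x, y in zip(rx, ry)] for rx, ry in zip(X, Y)]

A1 = [[1, 2], [0, 1j]]           # hypothetical sub-system matrix A(1)
A2 = [[1j, 0], [3, 1]]           # hypothetical sub-system matrix A(2)
A = A1 + A2                      # stack the sub-system matrices row-wise
sigma2 = 0.1
R_sum = add(gram(A1), gram(A2))  # sum over m of A(m)^H A(m)
R_stacked = gram(A)              # A^H A, the zero forcing form
R_mmse = [[R_stacked[i][j] + (sigma2 if i == j else 0) for j in range(2)]
          for i in range(2)]     # MMSE form adds sigma^2 on the diagonal
```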







The covariance matrix for either the zero forcing or MMSE solution is block Toeplitz. To apply a discrete Fourier transform to the block-Toeplitz A(m) matrix, a block-circulant approximation of A(m), Ac(m), is used. To make A(m) a block-square matrix, A(m) is extended. The extended A(m) matrix is then approximated to a block circulant matrix Ac(m).


The Ac(m) matrix is composed of D by D blocks. Each block is of size Q by K. Accordingly, the size of Ac(m) becomes DQ by DK. To include all the elements of A(m), D is chosen to be an integer larger than Dmin as determined per Equation 47.










$$D_{min} = \left\lceil N + \frac{W-1}{Q} \right\rceil$$  Equation 47








⌈•⌉ represents a round up to an integer function.


The covariance matrix R is a block-square matrix of size NK by NK with blocks of size K by K. For R to be compatible with the extended Ac(m) matrix, R is extended to the size DK by DK by zero-padding and approximating the extended R to a block circulant covariance matrix Rc. For the received vector, r(m), to be compatible with Ac(m) and Rc, r(m) is extended to a DQ by 1 vector, rc(m) by zero padding.


After extending the received vectors, r(m), the overall received vector is per Equation 48.











$$\underline{r}_c = \left[\underline{r}_c^{(1)T}\ \ \underline{r}_c^{(2)T}\ \cdots\ \underline{r}_c^{(M)T}\right]^T$$  Equation 48







Each block-circulant matrix Ac(m) is diagonalized to a block-diagonal matrix by block discrete Fourier transform matrices per Equation 49.

Ac(m)=F(Q)HΛA(m)F(K)  Equation 49


F(Q) is per Equation 50.

F(Q)=F⊗IQ  Equation 50


F(K) is per Equation 51.

F(K)=F⊗IK  Equation 51

F is a discrete Fourier transform matrix of size D by D, ⊗ denotes a Kronecker product, and In is an n by n identity matrix. ΛA(m) is a block diagonal matrix of the form of Equation 52.










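$$\Lambda_A^{(m)} = \begin{bmatrix} \Lambda_A^{(1,m)} & 0 & \cdots & 0 \\ 0 & \Lambda_A^{(2,m)} & \cdots & 0 \\ \vdots & \vdots & \ddots & \vdots \\ 0 & 0 & \cdots & \Lambda_A^{(D,m)} \end{bmatrix}$$  Equation 52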







ΛA(l,m) for l=1, . . . , D, and m=1, . . . , M is a non-zero block of size Q by K. 0 is a zero matrix of size Q by K having all zero elements.


ΛA(l,m) is alternately computed per Equation 53.

ΛA(m)=diagB(F(Q)Ac(m)(:,1:K))  Equation 53


Ac(m)(:,1:K) is the first block column of Ac(m), which has K columns. To determine ΛA(m), preferably F(Q)Ac(m)(:,1:K) is determined by QK parallel non-block DFTs of length D, using PFA-FFTs. The block circulant matrix Rc is also preferably diagonalized to the block diagonal matrix ΛR by a block DFT matrix F(K)=F⊗IK as per Equation 54.

Rc=F(K)HΛRF(K)  Equation 54


The block diagonal matrix ΛR is composed of blocks ΛR(l), l=1, . . . , D, of size K by K on its main diagonal, per Equation 55.










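$$\Lambda_R = \begin{bmatrix} \Lambda_R^{(1)} & 0 & \cdots & 0 \\ 0 & \Lambda_R^{(2)} & \cdots & 0 \\ \vdots & \vdots & \ddots & \vdots \\ 0 & 0 & \cdots & \Lambda_R^{(D)} \end{bmatrix}$$  Equation 55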







Another approach to determine ΛR is per Equation 56.

$$\Lambda_R = \mathrm{diag}_B\!\left( F_{(K)} R_c(:,1\!:\!K) \right) \qquad \text{Equation 56}$$

Rc(:,1:K) is the first block column of Rc. F(K)Rc(:,1:K) is preferably determined using K² parallel non-block DFTs of length D. In one implementation, the K² parallel non-block DFTs are implemented using K² parallel non-block prime factor algorithm fast Fourier transforms (PFA-FFTs) of length D.
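The text does not detail the PFA-FFT itself. As an illustration only, the sketch below uses the Good-Thomas index mapping, one well-known form of the prime factor algorithm, for a length D = n1·n2 with n1 and n2 coprime. The function name `pfa_fft` is hypothetical, and NumPy's FFT is used for the two short transforms.

```python
import numpy as np
from math import gcd

def pfa_fft(x, n1, n2):
    """Length n1*n2 DFT via the Good-Thomas prime factor mapping."""
    n = n1 * n2
    if len(x) != n or gcd(n1, n2) != 1:
        raise ValueError("need len(x) == n1*n2 with n1 and n2 coprime")
    i1, i2 = np.meshgrid(np.arange(n1), np.arange(n2), indexing="ij")
    # Input map: sample index (i1*n2 + i2*n1) mod n goes to grid cell (i1, i2).
    grid = x[(i1 * n2 + i2 * n1) % n]
    # Short DFTs along each axis; no twiddle factors are needed in between.
    grid = np.fft.fft(np.fft.fft(grid, axis=0), axis=1)
    # Output map (Chinese remainder theorem): bin k sits at (k mod n1, k mod n2).
    k = np.arange(n)
    return grid[k % n1, k % n2]

rng = np.random.default_rng(7)
x = rng.standard_normal(12) + 1j * rng.standard_normal(12)
X = pfa_fft(x, 3, 4)
```

Because the factors are coprime, the two-dimensional transform needs no twiddle multiplications between its stages, which is the usual motivation for choosing D with coprime factors.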


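Preferably, to perform the block equalization of Equation 44, the matched filtering is approximated per Equation 57.

$$
\begin{aligned}
A^H \underline{r} \approx A_c^H \underline{r}_c &= \sum_{m=1}^{M} A_c^{(m)H} \underline{r}_c^{(m)} \\
&= \sum_{m=1}^{M} \left( F_{(Q)}^H \Lambda_A^{(m)} F_{(K)} \right)^H \underline{r}_c^{(m)} \\
&= F_{(K)}^H \sum_{m=1}^{M} \Lambda_A^{(m)H} F_{(Q)} \underline{r}_c^{(m)}
\end{aligned}
\qquad \text{Equation 57}
$$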







The block-diagonalization of Ac(m) is per Equation 58.

$$A_c = \left[ A_c^{(1)T} \;\; A_c^{(2)T} \;\; \cdots \;\; A_c^{(M)T} \right]^T \qquad \text{Equation 58}$$
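The last line of Equation 57 can be compared against the direct matched filter on random data. The sketch below makes several assumptions: NumPy, arbitrary small sizes, hypothetical helper names, and NumPy's unnormalized DFT, so the leading F(K)^H of Equation 57 becomes a block inverse DFT (`np.fft.ifft`).

```python
import numpy as np

def block_circulant(c):
    """Block-circulant matrix whose first block column is c, shape (D, Q, K)."""
    D, Q, K = c.shape
    A = np.zeros((D * Q, D * K), dtype=complex)
    for i in range(D):
        for j in range(D):
            A[i * Q:(i + 1) * Q, j * K:(j + 1) * K] = c[(i - j) % D]
    return A

rng = np.random.default_rng(2)
D, Q, K, M = 4, 2, 3, 2

def crandn(*shape):
    return rng.standard_normal(shape) + 1j * rng.standard_normal(shape)

cols = [crandn(D, Q, K) for _ in range(M)]   # first block columns of A_c^(m)
r = [crandn(D * Q) for _ in range(M)]        # extended received vectors r_c^(m)

# Direct form: sum over m of A_c^(m)H r_c^(m).
direct = sum(block_circulant(c).conj().T @ rm for c, rm in zip(cols, r))

# Transform-domain form.
x = np.zeros((D, K), dtype=complex)
for c, rm in zip(cols, r):
    Lam = np.fft.fft(c, axis=0)                    # D diagonal blocks, each Q by K
    Fr = np.fft.fft(rm.reshape(D, Q), axis=0)      # F_(Q) r_c^(m): Q parallel DFTs
    x += np.einsum("lqk,lq->lk", Lam.conj(), Fr)   # block l: Lambda^(l,m)H times Fr
fast = np.fft.ifft(x, axis=0).reshape(D * K)       # block inverse DFT
```

In the transform domain the matched filter reduces to D independent K by Q block products per sample set, followed by one block inverse DFT.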


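The estimation of the data vector, $\hat{\underline{d}}$, is per Equation 59.

$$
\begin{aligned}
\hat{\underline{d}} = R^{-1} A^H \underline{r} &\approx R_c^{-1} A_c^H \underline{r}_c \\
&= \left( F_{(K)}^H \Lambda_R F_{(K)} \right)^{-1} F_{(K)}^H \sum_{m=1}^{M} \left( \Lambda_A^{(m)H} F_{(Q)} \underline{r}_c^{(m)} \right) \\
&= F_{(K)}^H \Lambda_R^{-1} F_{(K)} F_{(K)}^H \sum_{m=1}^{M} \left( \Lambda_A^{(m)H} F_{(Q)} \underline{r}_c^{(m)} \right) \\
&= F_{(K)}^H \Lambda_R^{-1} \sum_{m=1}^{M} \left( \Lambda_A^{(m)H} F_{(Q)} \underline{r}_c^{(m)} \right) \\
&= F_{(K)}^H \underline{y}
\end{aligned}
\qquad \text{Equation 59}
$$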







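The vector y is of size DK by 1 and is per Equation 60.

$$\underline{y} = \Lambda_R^{-1} \sum_{m=1}^{M} \left( \Lambda_A^{(m)H} F_{(Q)} \underline{r}_c^{(m)} \right) = \left[ \underline{y}^{(1)T} \;\; \underline{y}^{(2)T} \;\; \cdots \;\; \underline{y}^{(D)T} \right]^T \qquad \text{Equation 60}$$

y(l), l=1, . . . , D, is a vector of size K by 1.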


Preferably, to determine y, F(Q)rc(m) is determined using Q parallel non-block DFTs of length D. In one implementation, the Q parallel non-block DFTs are implemented using Q parallel non-block PFA-FFTs of length D. ΛR−1 is a block diagonal matrix having blocks of size K by K in the main diagonal and is per Equation 61.

$$\Lambda_R^{-1} = \begin{bmatrix} \left(\Lambda_R^{(1)}\right)^{-1} & 0 & \cdots & 0 \\ 0 & \left(\Lambda_R^{(2)}\right)^{-1} & \cdots & 0 \\ \vdots & \vdots & \ddots & \vdots \\ 0 & 0 & \cdots & \left(\Lambda_R^{(D)}\right)^{-1} \end{bmatrix} \qquad \text{Equation 61}$$








Each ΛR(l)−1, l=1, . . . , D, is a block of size K by K.


Preferably using the block diagonal structure of ΛR−1, y(l) is determined by the Cholesky decomposition of ΛR(l) and forward and backward substitution in parallel. Alternately, ΛR(l) is directly inverted.


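To perform the Cholesky decomposition, the vector $\sum_{m=1}^{M} \left( \Lambda_A^{(m)H} F_{(Q)} \underline{r}_c^{(m)} \right)$ is divided into D blocks of size K by 1, per Equation 62.

$$\underline{x} = \sum_{m=1}^{M} \left( \Lambda_A^{(m)H} F_{(Q)} \underline{r}_c^{(m)} \right) = \left[ \underline{x}^{(1)T} \;\; \underline{x}^{(2)T} \;\; \cdots \;\; \underline{x}^{(D)T} \right]^T \qquad \text{Equation 62}$$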








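A Cholesky factor G(l) of ΛR(l) is determined using a factorization, per Equation 63.

$$\Lambda_R^{(l)} = G^{(l)} G^{(l)H} \qquad \text{Equation 63}$$

Using the Cholesky factor G(l), each y(l) is determined by forward and backward substitution separately per Equations 64, 65 and 66.

$$\Lambda_R^{(l)} \underline{y}^{(l)} = G^{(l)} G^{(l)H} \underline{y}^{(l)} = \underline{x}^{(l)} \qquad \text{Equation 64}$$

Forward Substitution: Find $\underline{z}^{(l)}$ in $G^{(l)} \underline{z}^{(l)} = \underline{x}^{(l)}$, where $\underline{z}^{(l)} = G^{(l)H} \underline{y}^{(l)}$  Equation 65

Backward Substitution: Find $\underline{y}^{(l)}$ in $\underline{z}^{(l)} = G^{(l)H} \underline{y}^{(l)}$  Equation 66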
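Equations 63 to 66 can be sketched for the D independent K by K blocks as follows. This is a minimal illustration, assuming NumPy and random Hermitian positive definite blocks; the function names are hypothetical, and the explicit loops are written for clarity rather than speed.

```python
import numpy as np

def forward_substitution(G, x):
    """Solve G z = x for lower-triangular G (Equation 65)."""
    z = np.zeros_like(x, dtype=complex)
    for i in range(len(x)):
        z[i] = (x[i] - G[i, :i] @ z[:i]) / G[i, i]
    return z

def backward_substitution(G, z):
    """Solve G^H y = z, where G^H is upper triangular (Equation 66)."""
    U = G.conj().T
    y = np.zeros_like(z, dtype=complex)
    for i in reversed(range(len(z))):
        y[i] = (z[i] - U[i, i + 1:] @ y[i + 1:]) / U[i, i]
    return y

def solve_block(Lam_R_l, x_l):
    """Solve Lam_R_l y = x_l through its Cholesky factor (Equations 63, 64)."""
    G = np.linalg.cholesky(Lam_R_l)          # lower triangular, Lam_R_l = G G^H
    return backward_substitution(G, forward_substitution(G, x_l))

rng = np.random.default_rng(3)
D, K = 4, 3
blocks, xs, ys = [], [], []
for l in range(D):                           # the D solves are independent
    B = rng.standard_normal((K, K)) + 1j * rng.standard_normal((K, K))
    blocks.append(B @ B.conj().T + np.eye(K))    # Hermitian positive definite
    xs.append(rng.standard_normal(K) + 1j * rng.standard_normal(K))
    ys.append(solve_block(blocks[-1], xs[-1]))
```

Since no block depends on another, the loop over l could equally be run in parallel, as the text suggests.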


By performing a block inverse DFT of y, the data vector d is estimated as $\hat{\underline{d}}$. Preferably, the block inverse DFT is implemented using K parallel non-block inverse PFA-FFTs of length D.
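The chain from Equation 56 through Equation 62 to the final block inverse DFT can be exercised end to end on a toy problem. The sketch below is illustrative only and rests on several assumptions: NumPy, arbitrary small sizes, an MMSE-style Rc built directly as Ac^H Ac + σ²I so that it is exactly block circulant, `np.fft` in place of PFA-FFTs, and `np.linalg.solve` standing in for the Cholesky step.

```python
import numpy as np

def block_circulant(c):
    """Block-circulant matrix whose first block column is c, shape (D, Q, K)."""
    D, Q, K = c.shape
    A = np.zeros((D * Q, D * K), dtype=complex)
    for i in range(D):
        for j in range(D):
            A[i * Q:(i + 1) * Q, j * K:(j + 1) * K] = c[(i - j) % D]
    return A

rng = np.random.default_rng(4)
D, Q, K, M, sigma2 = 4, 3, 2, 2, 0.1

def crandn(*shape):
    return rng.standard_normal(shape) + 1j * rng.standard_normal(shape)

cols = [crandn(D, Q, K) for _ in range(M)]   # first block columns of A_c^(m)
r = [crandn(D * Q) for _ in range(M)]        # extended received vectors r_c^(m)

# Dense reference: solve R_c d = A_c^H r_c directly.
A_c = np.vstack([block_circulant(c) for c in cols])       # Equation 58
r_c = np.concatenate(r)                                   # Equation 48
R_c = A_c.conj().T @ A_c + sigma2 * np.eye(D * K)
d_ref = np.linalg.solve(R_c, A_c.conj().T @ r_c)

# Transform-domain path.
Lam_R = np.fft.fft(R_c[:, :K].reshape(D, K, K), axis=0)   # Equation 56
x = np.zeros((D, K), dtype=complex)
for c, rm in zip(cols, r):
    Lam_A = np.fft.fft(c, axis=0)                         # Equation 53
    Fr = np.fft.fft(rm.reshape(D, Q), axis=0)             # F_(Q) r_c^(m)
    x += np.einsum("lqk,lq->lk", Lam_A.conj(), Fr)        # Equation 62
y = np.stack([np.linalg.solve(Lam_R[l], x[l]) for l in range(D)])  # Equation 60
d_hat = np.fft.ifft(y, axis=0).reshape(D * K)             # block inverse DFT
```

The dense solve works on a DK by DK system, while the transform-domain path only ever solves D systems of size K by K.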



FIG. 10 is a block diagram of a preferred implementation of efficient joint detection in a TDD/CDMA system. Although FIG. 10 illustrates using two sets of samples, the figure can be extended to other multiple sets. Using the received vector for each set of chip rate samples, r1 and r2, rc(1) and rc(2) are formed by inserting zeros, by extend blocks 2321 and 2322, respectively. A block DFT 2001 and 2002 of rc(1) and rc(2) is then performed using a PFA-FFT, to produce F(Q)rc(1) and F(Q)rc(2).


Using the received training sequences, the channel impulse responses for each chip rate version of each transmitted burst, h(k)(1) and h(k)(2), are determined by estimate channel response blocks 2021 and 2022. Using each channelization code, c(k), and the channel impulse responses, h(k)(1) and h(k)(2), each system matrix, A(1) and A(2), is determined by compute sub-system matrix blocks 2041 and 2042 per Equations 37 and 38.


To determine ΛA(1) and ΛA(2), each system matrix, A(1) and A(2), is extended by extend blocks 2311 and 2312. The first block column of each block-circulant matrix, A(1) and A(2), is determined by selecting the first K columns of the extended A(m) matrix by first block column blocks 2141 and 2142. By taking block DFTs 2181 and 2182, ΛA(1) and ΛA(2) are determined using a PFA-FFT.


To determine ΛR, the first block column of R is determined by compute first block column R block 240. The first column of R is extended by extend block 234. The first block column of the extended R is determined by a first block column determining device 208. The first block column of the extended R matrix is circularized, to give that of Rc, by a circularize block column block 210. By taking a block DFT by block DFT block 212, ΛR is determined using a PFA-FFT.
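This passage does not spell out how the block column is circularized. One common construction, assumed here purely for illustration, zero-pads the first block column of R to D blocks and wraps the Hermitian transposes of its off-diagonal blocks into the last positions, so that the resulting Rc stays Hermitian. The helper name and the random test blocks are hypothetical.

```python
import numpy as np

def circularized_first_block_column(R_blocks, D):
    """First block column of a Hermitian block-circulant R_c (assumed scheme).

    R_blocks = [R_0, R_1, ..., R_W] are the non-zero K-by-K blocks of the
    first block column of R. The column is zero-padded to D blocks and
    R_W^H, ..., R_1^H are wrapped into the last W block positions.
    """
    W = len(R_blocks) - 1
    K = R_blocks[0].shape[0]
    if D < 2 * W + 1:
        raise ValueError("D must leave room for the wrapped blocks")
    col = np.zeros((D, K, K), dtype=complex)
    for l, R_l in enumerate(R_blocks):
        col[l] = R_l
        if l > 0:
            col[D - l] = R_l.conj().T
    return col

rng = np.random.default_rng(5)
K, D = 2, 5
B = rng.standard_normal((K, K)) + 1j * rng.standard_normal((K, K))
R0 = B @ B.conj().T + np.eye(K)              # Hermitian main-diagonal block
R1 = 0.1 * (rng.standard_normal((K, K)) + 1j * rng.standard_normal((K, K)))
col = circularized_first_block_column([R0, R1], D)
Lam_R = np.fft.fft(col, axis=0)              # diagonal blocks of Lambda_R
```

With this wrapping every diagonal block of ΛR is Hermitian, which the Cholesky factorization of Equation 63 requires.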


To efficiently compute the estimated data vector d, each of ΛA(1), ΛA(2), F(rc(1)), F(rc(2)) and ΛR is used. Each of F(rc(1)), F(rc(2)), ΛA(1), ΛA(2) and ΛR is divided into D blocks, by partition block 236. The complex conjugate transpose of each of ΛA(1,i) and ΛA(2,i), ΛA(1,i)H and ΛA(2,i)H, where i is the ith block, is determined by transpose blocks 230, 231. A multiplier 228 multiplies ΛA(2,i)H by F(Q)(rc(2))(i). A multiplier 229 multiplies ΛA(1,i)H by F(Q)(rc(1))(i). A summer 225 sums the multiplied results per Equation 62. ΛR(i) is decomposed using Cholesky decomposition 226, per Equation 63. By performing forward and backward substitution, per Equations 65 and 66, using forward and backward substitution blocks 224 and 222, respectively, F(d) is determined. By taking an inverse block DFT of F(d) by block inverse DFT block 220 using a PFA-FFT, $\hat{\underline{d}}$ is estimated.

Claims
  • 1. A method of estimating a transmitted data vector d from a received vector r comprising: receiving a wireless communications signal and producing a vector r from the signal, where r has a form r=Ad+n and where A is a block Toeplitz matrix and n is a noise vector; producing a covariance matrix R of the form A^H A+σ²I for a minimum mean square error block linear equalizer (MMSE-BLE) based solution, where A^H is a Hermitian of A, σ² is a noise variance and I is an identity matrix or A^H A for a zero forcing block linear equalizer (ZF-BLE) based solution; extending the A matrix and R matrix; approximating the extended A and R matrices as block circulant matrices; determining a diagonal matrix of each of the extended and approximated A and R matrices, using the block column of the extended and approximated A and R matrices; extending r; taking a Fourier transform of r; taking products of the diagonal matrices and the extended r; summing the products; and estimating d using an inverse Fourier transform and the summed products.
  • 2. The method of claim 1 wherein the A and R matrices and r are extended to be compatible with a prime factor algorithm fast Fourier transform.
  • 3. The method of claim 1 further comprising performing LU decomposition on the diagonal of the R matrix and a forward substitution and a backward substitution devices for producing an inverse Fourier transform of d.
  • 4. The method of claim 1 further comprising performing Cholesky decomposition on the diagonal of the R matrix and a forward substitution and a backward substitution devices for producing an inverse Fourier transform of d.
CROSS REFERENCE TO THE RELATED APPLICATION(S)

This application is a continuation of U.S. patent application Ser. No. 11/926,534 filed Oct. 29, 2007 which is a continuation of U.S. patent application Ser. No. 10/644,361 filed Aug. 20, 2003 which claims the benefit of U.S. Provisional Application No. 60/404,561, filed Aug. 20, 2002, which are incorporated by reference as if fully set forth.

Related Publications (1)
Number Date Country
20090129447 A1 May 2009 US
Provisional Applications (1)
Number Date Country
60404561 Aug 2002 US
Continuations (2)
Number Date Country
Parent 11926534 Oct 2007 US
Child 12358739 US
Parent 10644361 Aug 2003 US
Child 11926534 US