The invention relates to telecommunication methods and systems, more in particular Wide-band Code Division Multiple Access (WCDMA) for wireless communications.
Wideband direct sequence code division multiple access (WCDMA) is emerging as the pre-dominant wireless access mode for forthcoming 3G systems, because it offers higher data rates and supports larger number of users over mobile wireless channels compared to conventional access techniques like TDMA and narrowband CDMA [1]. Demodulation of WCDMA signals in multi-path channels is conventionally achieved with a Maximum Ratio Combining (MRC) RAKE receiver [2]. Although the RAKE receiver is optimal for single-user multi-path channels, multi-user interference (MUI) severely limits its performance in a multi-user setting. Moreover, the MUI is enhanced by the near/far situation in the uplink, where the data signal of a desired farby user can be overwhelmed by the data signal of an interfering nearby user.
Multi-user detection techniques, that use joint code, timing and channel information, alleviate the near/far problem and offer significant performance improvement compared to the RAKE receiver [3]. Linear symbol-level multi-user equalizer receivers, that implicitly depend on the instantaneous code-correlation matrix, have been extensively studied in [4], [5], [6], [7], [8] and [9]. Practical adaptive implementations of these receiver techniques require a constant code-correlation matrix in order to guarantee convergence of the adaptive updating algorithm and therefore call for the use of short spreading codes (the code period equals the symbol period) [10]. Although well-suited for implementation in the base-station (the standard foresees a mode with short spreading codes for the uplink [11], these techniques can not be used in the mobile station since the WCDMA downlink employs long spreading codes (the code period is much longer than the symbol period).
In the WCDMA downlink, the symbol streams of the different active users are multiplexed synchronously to the transmission channel by short orthogonal spreading codes that are user specific and a long overlay scrambling code that is base-station specific. The MUI is essentially caused by the channel, since the different user signals are distorted by the same multi-path channel when propagating to the mobile station of interest. Moreover, power control in the downlink enhances the MUI and creates a far/near situation where the data signal of a desired nearby user is overwhelmed by the data signal intended for a farby user. Linear chip-level equalization, introduced in [12], can restore the orthogonality of the user signals and suppress the MUI. Ideal Zero-Forcing (ZF) and Minimum Mean Squared Error (MMSE) chip-level equalizer receivers have also been investigated in [13], [14], [15] and [16]. They consist of a linear chip-level equalizer mitigating the inter-chip interference (ICI), followed by a descrambler/despreader and a decision device.
However, adaptive implementations of such chip-level equalizer receivers that can track time-varying multi-path channels are hard to realize in practice because of two reasons. On the one hand, sending a training chip sequence at regular time instants in correspondence with the coherence time of the channel, would reduce the system's spectral efficiency. On the other hand, using blind techniques that do not require any training overhead, would reduce the system's performance. Nevertheless, most current proposals use blind algorithms to determine the equalizer coefficients. One method applies a standard single-user blind channel identification algorithm and employs the obtained channel estimate to calculate the equalizer coefficients [17]. Another blind approach, pursued in [18] and [19] makes use of the fact that, in the absence of noise, the received signal after equalization should lie in the subspace spanned by the user codes.
GB 2362 075 A describes a Wideband Code Division Multiple Access (WCDMA) method, wherein according to
One embodiment of the invention provides in a wireless communication system having at least one basestation and at least one terminal, a method of wideband multiple access communication, comprising transmitting a block from a basestation to a terminal, the block comprising a plurality of chip symbols scrambled with a base station specific scrambling code, the plurality of chip symbols comprising a plurality of spread user specific data symbols which are user specific data symbols spread by using user specific spreading codes and at least one pilot symbol, and performing in the terminal generating a plurality of independent signals having at least a channel distorted version of the transmitted block, combining the plurality of independent signals with a combiner filter with filter coefficients which are determined by using the pilot symbol, thus obtaining a combined filtered signal and despreading and descrambling the combined filtered signal with a composite code of the basestation specific scrambling code and one of the user specific spreading codes.
Another embodiment of the invention provides in a wireless communication system having at least one basestation and at least one terminal, a method of wideband multiple access communication, comprising transmitting a block from a basestation to a terminal, the block comprising a plurality of chip symbols scrambled with a base station specific scrambling code, the plurality of chip symbols comprising a plurality of spread user specific data symbols, which are user specific data symbols spread by using user specific spreading codes and at least one pilot symbol, transmitting projection information, enabling projecting of a signal on the orthogonal complement of the space spanned by the composite codes of the basestation specific scrambling code and the user specific codes, and performing in the terminal generating at least two independent signals comprising at least a channel distorted version of the transmitted block, combining with a combiner filter with filter coefficients determined by using the projection information being determined on the signals, thus obtaining a combined filtered signal and despreading and descrambling the combined filtered signal with a composite code of the basestation specific scrambling code and one of the user specific codes.
Another embodiment of the invention provides a receiver system for wireless communication, comprising a receiver configured to receive a block comprising a plurality of chip symbols which are data symbols spread by using scrambling and spreading codes and at least one pilot symbol, at least one combining filter circuit configured to receive at least two independent signals derived from the received block and output a multi-user interference suppressed signal, and at least one despreading circuit configured to despread the multi-user interference suppressed signal with one of the scrambling and spreading codes.
Another embodiment of the invention provides a receiver system for wireless communication, comprising a receiver configured to receive a block comprising a plurality of chip symbols which are data symbols spread by using scrambling and spreading codes, and projection information, enabling projecting of a signal on the orthogonal complement of the space spanned by scrambling and spreading codes, at least one combining filter circuit configured to input at least two independent signals derived from the received block and output a multi-user interference suppressed signal, and at least one despreading circuit configured to despread the multi-user interference suppressed signal with one of the scrambling and spreading codes.
Another embodiment of the invention provides a receiver system for wireless communication, comprising J combining filter circuits, each of the combining filter circuits receiving a plurality of independent signals and outputting a multi-user interference suppressed signal, a circuit configured to determine the coefficients of the combining filters circuit, a plurality of despreading circuits configured to despread the J multi-user interference suppressed signals and a combining circuit configured to combine the J multi-user interference suppressed signals.
Still another embodiment of the invention provides in a wireless communication system having at least one basestation and at least one terminal, a method of wideband multiple access communication, comprising receiving a block comprising a plurality of chip symbols scrambled with a base station specific scrambling code, the plurality of chip symbols comprising a plurality of spread user specific data symbols which are user specific data symbols spread by using user specific spreading codes and at least one pilot symbol, generating at least two independent signals based on the received block, and combining the at least two independent signals with a combiner filter having filter coefficients which are determined by using the pilot symbol, thus obtaining a combined filtered signal.
Still another embodiment of the invention provides a receiver system for wireless communication, comprising means for receiving a block comprising a plurality of chip symbols scrambled with a base station specific scrambling code, the plurality of chip symbols comprising a plurality of spread user specific data symbols which are user specific data symbols spread by using user specific spreading codes and at least one pilot symbol, means for generating at least two independent signals based on the received block and means for combining the at least two independent signals with a combiner filter having filter coefficients which are determined by using the pilot symbol, thus obtaining a combined filtered signal.
Yet another embodiment of the invention provides a method of retrieving a desired user data symbol sequence from a received signal, the method comprising receiving a channel modified version of a transmitted signal comprising a plurality of user data symbol sequences, each being encoded with a user specific known code, determining an equalization filter directly and in a deterministic way from the received signal, and applying the equalization filter on the received signal to thereby retrieve the transmitted signal.
Yet another embodiment of the invention provides a system for retrieving a desired user data symbol sequence from a received signal, comprising means for receiving a channel modified version of a transmitted signal comprising a plurality of user data symbol sequences, each being encoded with a user specific known code, means for determining an equalization filter directly and in a deterministic way from the received signal, and means for applying the equalization filter on the received signal to thereby retrieve the transmitted signal.
In the invention, methods for communication, more in particular, wireless communication, between devices and the related devices are presented (
Recall that the invention exploits spreading with orthogonal codes for separating different users. Unfortunately said channel distortion is destroying the orthogonality of the used codes, leading to a bad separation. This problem is denoted multi-user interference. Hence the invention concerns a method for retrieving a desired user's symbol sequence (denoted by sk[i] in equation (46)), from a received signal transmitted in a communication context with multi-user interference. In said method a step of inputting or receiving said received signal, being a channel distorted version of a transmitted signal comprising a plurality of user data symbol sequences, each being encoded with a user specific known code is found.
It is an aspect of the invention to provide a method which suppresses the multi-user interference by performing operations on the chip symbols, hence before despreading and descrambling. This multi-user interference suppression is obtained by performing combining of said independent signals resulting in a combined filtered signal (320). In an embodiment of the invention said combining, also denoted chip-level equalization, is a linear space-time combining. For said combining (chip-level equalizer) a combiner filter (230) is used. The (chip-level equalization) filter coefficients (420) of said combiner filter are determined directly from said independent signals (310), hence without determining an estimate of the channel characteristic. One can state that from said independent signals (310) in a direct and deterministic way a chip-level equalization filter is determined. Said chip-level equalization filter is such that said transmitted signal is retrieved when applying said filter to said received signal.
More in particular said filter coefficients are determined by using said independent signals and the knowledge of the pilot symbol. Finally despreading and descrambling (240) with a code being the composite of said basestation specific scrambling code and one of said user specific codes (denoted by ck[n] equation (46)) is applied on said combined filtered signal (320) to obtain at least of one of said user specific data symbols. Said last step thus retrieves the desired user data symbol by correlating said retrieved transmitted signal (320) with the desired user specific known code and basestation specific code.
Note that within the despreading and descrambling step steps of performing correlations of modified received spread spectrum signals against selected replica of a spreading sequence are done, here with modified is meant linear space-time combining. Further said despreading step is used to extract the user corresponding to the spreading code used, more in particular the replica of the spreading code used at the transmitting side.
Alternatively said method for detection of a spread-spectrum signal can be formulated as comprising the steps of receiving a spread-spectrum signal including a training sequence, said signal corresponds to a superposition of signals of users; generating a first spread spectrum signal derived from said received spread spectrum signal and a second spread spectrum signal derived from said received spread spectrum signal; directly calculating the coefficients of a filter using knowledge of said training sequence from said derived spread-spectrum signals; and at least passing said first spread-spectrum signal and said second spread-spectrum signal different from said first spread-spectrum signal through said filter to generate a combined spread-spectrum signal; and despreading the filtered spread-spectrum signal, to extract a given user.
Within the invention said generated signals are independent, meaning received through another channel, either by having multiple antenna's, using the polarisation diversity of the available antennas or using temporal oversampling. Said generated signals are not a time-delayed version of each other. The combining of said generated signals is done in order to correct the signal for multi-user interference, more in particular to suppress multi-user interference. The combining filter is independent of the user one finally wants to detect. The combined signal is only then despread, meaning within the context of the description, correlated and decimated. The correlating is performed only with the user specific code of the user one wants to detect. Within the invention the combining is done before correlating steps.
The method of detection of a user signal, thus comprises the steps of generating independent signals, filtering said generated independent signals with a filter being independent of said user, said filtering results in a combined signal; and despreading and descrambling with a code being user specific. Alternatively it can be stated that the method of detection of a user signal, involves in the detection part, only the code being user specific. The combining is done at chip-rate signals, hence before any step of despreading or even correlating is made. For the sake of clarity it should be understood that in some embodiment in the part for determining the combining filter, thus not in the detection part itself, despreading is done as a first step before the filter is determined. However this despreading is done with the pilot code only. Within the method (
In a first embodiment of said method said performing of combining (
In a second embodiment of said method the direct filter coefficient determination is disclosed, indicating that said filter coefficients are determined such that one version or function of the combined filtered signal is as close as possible to a version or function of the pilot symbol. With as close as possible is meant according to a norm defined on signals, for instance a two-norm, resulting in Least Squares (LS) sense minimization. The LS minimization can also be recursified leading to Recursive Least Squares (RLS) minimization. Other minimization approaches such as Least Mean Squares (LMS) are also possible.
In a first embodiment of said direct filter coefficient determination a communication method is presented, denoted code division multiplexed pilot based determination, wherein for each user a user specific code is available but at least one of said user specific codes is used for spreading a known signal, denoted the pilot symbol. The associated user specific code is therefore denoted the pilot code Hence in said transmitted block a pilot symbol being spread by a pilot code is found.
B.1 Training Based Subembodiment
In a first subembodiment thereof a so-called training-based approach is used, relying on the knowledge of said pilot symbol. More in particular said direct filter determination is such that the used version of the combined filtered signal is the combined filtered signal after despreading with a code being the composite code of said basestation specific scrambling code and said pilot code (denoted by cp[n] in equation (46)) while said version of said pilot symbol is said pilot symbol itself. It should be noted however, that in practice said independent signals are first despread before combining takes place (see
In an example thereof said chip-level equalization filter is determined as a multiplication involving said known pilot symbol vector, a matrix involving the composite pilot codes, a Hankel structure output matrix, involving the received signals.
B.2 Semi-Blind Based Subembodiment
In a second subembodiment thereof a so-called semi-blind based approach is used, relying on the knowledge of said pilot symbol but also on characteristics of the codes. More in particular said direct filter determination is such that the used version of the combined filtered signal is the combined filtered signal after projecting on the orthogonal complement on the subspace spanned by the composite codes, being the codes composed of said base station specific scrambling code and said user specific codes, and said version of the pilot symbol is said pilot symbol spread with a composite code of said base station specific scrambling code and said pilot code. It should be noted however, that in practice said independent signals are first projected before combining takes place (see
In an example thereof said chip-level equalization filter is determined as a multiplication involving said known pilot symbol vector, a matrix involving the pilot codes, a Hankel structure output matrix, involving the received signals and a matrix involving the user codes. Hence one can state that said chip-level equalization determination step exploits said known pilot symbol sequence, said pilot specific known code(s) and substantially all (active) user specific known codes.
In a second embodiment of said direct filter coefficient determination a communication method is presented, denoted time division multiplexed pilot based method, wherein for each user a user specific code is available but in said block of symbols a plurality of a known pilot symbols are found, for each user code at least one. One can state that said transmitted block comprises at least two pilot symbols, each being spread by a user specific code. Hence said pilot symbols can be denoted user specific known pilot symbols (represented by sk[i] in equation (64)),each being encoded with a user specific known code.
C.1 Training Based Subembodiment
In a first subembodiment of said second embodiment a so-called training-based approach is used, relying on the knowledge of said pilot symbol. More in particular said direct filter determination is such that the used version of the combined filtered signal is the combined filtered signal after despreading with a code being the composite code of said basestation specific scrambling code and a user specific code and said version of said pilot symbol is said pilot symbol itself. It should be noted however, that in practice said independent signals are first despread before combining takes place (see
In an example thereof said chip-level equalization filter is determined as a multiplication involving said known pilot symbol vector, a matrix involving the pilot codes and a Hankel structure output matrix.
C.2 Semi-blind Based Subembodiment
In a second subembodiment of said second embodiment a so-called semi-blind based approach is used, relying on the knowledge of said pilot symbol but also on characteristics of the codes. More in particular said direct filter determination is such that the used version of the combined filtered signal is the pilot part of the combined filter signal; and said version of the pilot symbol is the sum of the spread pilot symbols within said transmitted block. In said method further said filter coefficients are determined such that the data part of the combined filter signal after projecting on the orthogonal complement on the subspace spanned by the composite codes, being the codes composed of said base station specific scrambling code and said user specific codes being as close as possible to the zero subspace. It should be noted however, that in practice said data part of said independent signals are first projected before combining takes place (see
In an example thereof said chip-level equalization filter is determined as a multiplication involving said known pilot symbol vector, a matrix involving the pilot codes, a Hankel structure output matrix, involving the received signals and a matrix involving the user codes. Hence one can state that said chip-level equalization determination step exploits substantially all (active) user specific known codes.
In a third embodiment of the invention, as shown in
In a fourth embodiment of the invention within said terminal at least M=J+1 independent signals (310) are generated from the receiver (220) with J the amount of basestations. Within said terminal said steps of space-time combining (230) is performed J times as shown in FIG. 4. Each of the resulting combined filtered signals (320) are multi-user interference suppressed Versions of the signal transmitted from one of said basestations. Finally said despreading and descrambling (240) is performed J times as shown in
In a second aspect of the invention a wireless semi-blind communication method wherein besides user data symbols and pilot symbols, additional information is transmitted from the basestation(s) to said terminals. Said additional information (430), received by the receiver block (220), is used in the filter coefficient determination (400). More in particular said additional information is projection information, enabling projecting of said independent signals on the orthogonal complement of the subspace spanned by the composite codes of said basestation specific scrambling code and said user specific codes. Hence the space-time combiner filter has filter coefficients being determined by using said projection information. Depending on the length of the user specific codes N a maximal amount of users, being N in case of time division multiplexed pilots and N−1 in case of code division multiplexed pilots. In practice an amount of active user Ua less than said maximal amount are active.
In a first embodiment of said second aspect said projection information is constructed by using or considering the active user's.
In a second embodiment of said second aspect said projection information is constructed by using or considering a predetermined set of assumed active user's, the size of said set being another upperbound on the amount of active users allowed.
In a third aspect of the invention an apparatus with at least one antenna (210) comprising (i) a receiver (220) capable of receiving a block (300) comprising of a plurality of chip symbols being data symbols spread by using scrambling and spreading codes and at least one pilot symbol, (ii) at least one linear space-time combining filter circuit (230), inputting at least two independent signals (310) derived from said received block and outputting a multi-user interference suppressed signal (320); (iii) a circuit (400) for determination of said combining filter circuit's coefficients (420), said filter circuit's coefficients being determined (directly) by using said pilot symbol; and (iv) at least one despreading circuit (240), used for despreading with one of said scrambling and spreading codes said multi-user interference suppressed signal (320). Alternatively formulated said apparatus can be denoted a radio receiver comprising a front-end circuitry (220) for providing complex baseband samples for a plurality of channels in digital format; an integrated adaptive digital combining circuitry (230), connected to said front-end circuitry, for combining said complex baseband samples; and a digital component (240), connected to said combining circuitry, being arranged to perform correlations of received spread spectrum signals against selected replica offsets of a spreading sequence
Alternatively formulated said apparatus for detecting of data signals from a transmitted signal in a Code-Division Multiple Access (CDMA) communication system, wherein the transmitted signal comprising a plurality of time overlapping coded signals, each coded signal associated to an individual user and distinguishable only by a user specific encoding (signature, spreading sequences), comprising a convolutional coder having at least two independent signals being representative for the transmitted signal as an input and outputting a single vector being representative for said transmitted signal; a decoder having said single vector as an input and outputting symbol information for at least one individual user.
In a fourth aspect of the invention an apparatus with at least one antenna (210) comprising (i) a receiver (220) capable of receiving a block comprising of a plurality of chip symbols being data symbols spread by using scrambling and spreading codes, and projection information, enabling projecting of said independent signals on the orthogonal complement of the subspace spanned by scrambling and spreading codes; (ii) at least one linear space-time combining filter circuit (230), inputting at least two independent signals (310) derived from said received block and outputting a multi-user interference suppressed signal, (iii) a circuit (400) for determination of said combining filter circuit's coefficients with filter circuit's coefficients (420) being determined by using said projection information (430), and (iv) at least one despreading circuit (240), used for despreading with one of said scrambling and spreading codes said multi user interference suppressed signal (320).
In a fifth aspect of the invention an apparatus comprising a plurality J of linear space-time combining filter circuits (230), each of said combining filter circuits inputting at least J+1 independent signals (310) and outputting a multi user interference suppressed signal (320), a circuit (400) for determination of said combining filters circuit's coefficients (420), a plurality of J despreading circuits (240), used for despreading said J multi-user interference suppressed signals (320); and a linear combining circuit (250-260) for combining said J spread multi-user interference suppressed signals.
The last combining circuit can comprise of a substep of determining a soft estimate (250) and a substep of determining a hard estimate (260) based on said soft estimate.
Alternatively formulated said apparatus can be denoted a radio receiver comprising a frontend circuitry (220) for providing complex baseband samples for a plurality of channels in digital format; a plurality of integrated adaptive digital combining circuitry (230), corrected to said front-end circuitry, for combining said complex baseband samples; and a digital component (240), connected to said combining circuitry, being arranged to perform correlations of received spread spectrum signals against selected replica offsets of a spreading sequence.
Alternatively said apparatus for detection of a spread-spectrum signal can comprise of a processor arranged to provide a plurality of multiple input combining filter circuits (230), the plurality of multiple input combining filter circuits being respectively coupled to a plurality of despreading circuits (240).
Alternatively formulated said apparatus for detecting of data signals from a transmitted signal in a Code-Division Multiple Access (CDMA) communication system, wherein the transmitted signal comprising a plurality of time overlapping coded signals, each coded signal associated to an individual user and distinguishable only by a user specific encoding (signature, spreading sequences), comprising a plurality of convolutional coders having at least two independent signals being representative for the transmitted signal as an input and outputting a single vector being representative for said transmitted signal; a decoder having said single vector as an input and outputting symbol information for at least one individual user.
Said apparatus further comprises an algorithm unit (400), said algorithm unit is coupled to said plurality of combining filter circuits. In an embodiment said algorithm unit is an adaptive algorithm unit. The algorithm unit is used for calculating the coefficients of said combining filters based upon a training sequence within said spread-spectrum signal, more in particular exploiting a priori knowledge of the training sequence at the detection apparatus side of the communication link.
Before describing the embodiment of the invention more in detail one should note that the block transmitted from said at least one basestation to an at least one terminal comprises of a plurality of chip symbols scrambled with a base station specific scrambling code, said plurality of chip symbols comprising a plurality of spread user specific data symbols, being user specific data symbols spread by using user specific spreading codes and at least one pilot symbol.
Alternatively formulated in a single-cell concept (meaning a single basestation) the transmitted signal can be a synchronous code division multiplex, employing user specific orthogonal Walsh-Hadamard spreading codes and a base-station specific aperiodic scrambling code. The user aperiodic code sequence is the multiplication of the corresponding spreading code and the base-station specific aperiodic scrambling code. In a code division multiplexed pilot approach the multi-user chip sequence (denoted by x[n] in equation (46)) comprises of K user signals plus at least one continuous pilot signal. In a time division multiplexed pilot approach the multi-user chip sequence (denoted by x[n] in equation (64)) comprises of a first set of user symbols, also denoted data symbols and a second set of known pilot symbols.
Alternatively formulated in a multiple cell concept (meaning a plurality of basestations) the transmitted signal of each basestation can be a synchronous code division multiplex, employing user specific orthogonal Walsh-Hadamard spreading codes and a base-station specific aperiodic scrambling code. The user terminal then receives a sum of essentially all said transmitted signals. The user aperiodic code sequence is the multiplication of the corresponding spreading code and the base-station specific aperiodic scrambling code. In a code division multiplexed pilot approach the multi-user chip sequence comprises of K user signals plus at least one continuous pilot signal. In a time-division multiplexed pilot approach the multi-user chip sequence comprises of a first set of user symbols, also denoted data symbols and a second set of known pilot symbols.
Below various embodiments of the invention are described. The invention is not limited to these embodiments but only by the scope of the claims.
G.1 Data Model for Multi-cell WCDMA Downlink
G.1.a Multi-channel framework. We consider the downlink of a multi-cell WCDMA system with J active base-stations, transmitting to the mobile station of interest (soft handover mode). Each base-station transmits a synchronous code division multiplex, employing short orthogonal Walsh-Hadamard spreading codes that are user specific and a long overlay scrambling code that is base-station specific. As shown in
with
The data symbol sequence skj[i] corresponds to the Dedicated Physical Data CHannel (DPDCH) for the k-th user from the j-th base-station whereas the pilot symbol sequence spj[i] corresponds to the j-th base-station's Common PIlot CHannel (CPICH) in the UTRA specification for 3G systems [11]. For notational simplicity, we assume that the power of a symbol sequence is incorporated in the symbol sequence itself. Each user's data symbol sequence skj[i] (pilot symbol sequence spj[i]) is spread by a factor N with the length-ρN user composite code sequence ckj[n] (pilot composite code sequence cpj[n]). The k-th user's composite code sequence for the j-th base-station ckj[n] (pilot composite code sequence cpj[n]) is the multiplication of the user specific short Walsh-Hadamard spreading code ćkj[n] (pilot-specific Walsh-Hadamard spreading code ćpj[n] and the base-station specific long scrambling code csj[n].
Assume that the mobile station is equipped with M receive antennas and let hmj(t) denote the continuous-time channel from the j-th base-station to the m-th receive antenna, including the transmit and receive filters. The received signal at the m-th receive antenna can then be written as:
with em(t) the continuous-time additive noise at the m-th receive antenna By sampling the received signal at the chiprate Tc, we obtain the following received sequence at the m-th receive antenna:
with em[n] the discrete-time additive noise at the m-th receive antenna and hmj[n]=hmj(nTc) the discrete-time channel from the j-th base-station to the m-th receive antenna. Stacking the received samples obtained from the M receive antennas:
y[n]=[y1[n]y2[n] . . . yM[n]]T
we can write:
where e[n] is similarly defined as y[n] and hj[n] is the discrete-time M×1 vector channel from the j-th base-station to the M receive antennas, given by:
hj[n]=[h1j[n]h2j[n] . . . hMj[]]T
Note that we model hj[n] as an M×1 FIR vector filter of order Lj with delay index δj (hj[n]≠0, for n=δj and n=δj+Lj, and hj[n]=0, for n<δj and n>δj+Lj).
Until now, we have considered chip rate sampling at the receiver side. Since in practice the transmit and receive filters are root raised cosine filters for a rate
with a bandwidth somewhat higher than
we should actually sample at a rate that is higher than
This is called temporal oversampling; Sampling at St times the chip rate, the data model described in Equation 65 would become a multi-channel model with MSt diversity channels per base-station. Similar to temporal oversampling, polarization diversity can increase as well the number of diversity channels per base-station [21]. Sampling at St times the chip rate and applying Sp-fold polarization diversity, the data model described in Equation 65 would become a multi-channel model with MStSp diversity channels per base-station instead of merely M. In practice a temporal oversampling factor of St=2 suffices. Since in real-life only the horizontal and the vertical polarization can be exploited, the polarization diversity order is limited to Sp=2. For simplicity reasons, we will only consider chip rate sampling and 1-fold polarization diversity at the receiver side. Note however that all future discussions remain valid if we replace M by MStSp.
G.1.b Data model for block processing. Let us now introduce the following (Q+1)M×BN output matrix Ya:
where B is the block length, a is the processing delay and Q+1 is the temporal smoothing factor. This output matrix can be written as
where the noise matrix Ea is similarly defined as Ya and the j-th base-station's (Q+1)M×rj (rj=Lj+1+Q) channel matrix j with block Toeplitz structure is given by:
The j-th base-station's rj×BN input matrix Xaj is given by:
Xaj=[xa−δj−L
where the multi-user chip sequence vector xaj, transmitted by the j-th base-station, starting at delay a is defined by:
xaj=[xj[a]xj[a+1] . . . xj[a+BN−1]] (5)
Note that Equation 4 can also be written as:
Ya=Xa+Ea (6)
where is the (Q+1)M×r channel matrix, given by:
=[1 . . . J] (7)
and Xa is the following r×BN input matrix:
Xa=[Xa1
is called the system order. In order to guarantee the existence of Zero-Forcing (ZF) chip-level equalizers, we make the following rather standard assumptions about the data model:
Assumption 1: The channel matrix has full column rank r.
Assumption 2: The input matrix Xa has full row rank r.
The first assumption requires that (Q+1)M≧r, which is equivalent with:
(9)
Therefore, in order to simultaneously track J base-stations the mobile station should be equipped with at least M=J+1 receive antennas for chip rate sampling. However, when we use a temporal oversampling factor of St=2 and we exploit Sp=2-fold polarization diversity at the receiver, we obtain MStSp=4 (good for soft handover between J=3 base-stations) with only M=1 receive antenna at the mobile station.
The second assumption, on the other hand, requires that:
BN≧r (10)
which states that the number of observed chip samples BN should be larger than the system order r.
G.2 Space-time Block Chip-level Equalizer Receivers
In this section, we develop four space-time block chip-level equalizer receivers for the mobile station operating in soft handover mode: the pilot-aided block RAKE receiver, the fully-trained, the pilot-trained and the enhanced pilot-trained block chip-level equalizer receiver. The receivers can detect the desired user's data symbols from each of the J active base-station signals. They address blocks of B symbols at once and, as shown in
G.2.a Preliminary definitions. The multi-user chip vector transmitted by the j-th base-station x0j, starting at delay a=0, consists of two parts: a first part corresponding to the unknown data symbols of the different DPDCHs and a second part corresponding to the known pilot symbols of the CPICH. Using Equation 1 and 67, we can write the 1×BN multi-user chip vector, transmitted by the j-th base-station, starting at delay a=0, as follows:
x0j=sdjCdj+spjCpj (11)
where sdj is the j-th base-station's 1×KjB multi-user data symbol vector that stacks the data symbol vectors of the different active users served by the j-th basestation
sdj=[s1j . . . sK
and skj is the 1×B data symbol vector that stacks the data symbols of the k-th user served by the j-th base-station:
skj=[skj[0]skj[1] . . . skj[B−1]]
The 1×B transmitted pilot symbol vector of the j-th base-station s-p is similarly defined as skj. The KjB×BN multi-user composite code matrix for the j-th base-station stacks the composite code matrices of the different active users:
Cdj=[C1j
where Ckj is the B×BN composite code matrix of the k-th user served by the j-th base-station
and ckj[i] is the k-th user's composite code vector for the j-th base-station used to spread and scramble the data symbol skj[i]:
ckj[i]=[ckj[(i mod ρ)N] . . . ckj[(i mod ρ)N+N−1]]
It is important to note that the k-th user's composite code vector ckj[i] can be written as the component-wise multiplication of the user specific short spreading code vector ćkj and the base-station specific long scrambling code vector csj[i]
ckj[i]=ćkj⊙csj[i] (12)
where ćkj=[ćkj[0] . . . ćkj[N−1]] stacks the Walsh-Hadamard spreading code coefficients for the k-th user served by the j-th base-station and csj[i] is similarly defined as ckj[i]. The pilot composite code matrix Cpj, the pilot composite code vector cpj[i] and the pilot spreading code vector ćpj[i] of the j-th base-station are similarly defined as Ckj, ckj[i] and ćkj respectively.
The vector x0j is a row of every input matrix from the set
and is therefore contained in every output matrix from the set
For this reason, x0j can be determined from the set of Aj output matrices {Ya
The B×AjBN super pilot composite code matrix Cpj corresponding to the j-th base-station is defined by:
Cpj=[Cpj . . . Cpj]
while the KjB×AjBN super multi-user composite code matrix Cdj corresponding to the j-th basestation is given by:
Cdj=[Cdj . . . Cdj]
G.2.b Pilot-aided block RAKE receiver. The j-th branch of the pilot-aided block RAKE receiver estimates the desired-user's data symbol vector sk
The pilot-aided channel estimation part of the j-th branch exploits the knowledge of the pilot composite code matrix and the pilot symbol vector to determine the j-th base-station's space-time channel coefficients [22]. The columns of the (Lj+1)M×B matrix Vj contain the initial estimates of j-th base-station's space-time channel vector at the different symbol instants. Vj is determined as follows:
Vj=(Yδ
where the j-th base-station's (Lj+1)M×B pilot symbol matrix Spj is defined as follows:
Spj=[spj
The (Lj+1)M×BN output matrix Yδ
This final estimate of the space-time channel vector vj may subsequently be used to extract the desired user's soft symbol decisions in the j-th branch:
by despreading the coherently combined output matrix vj
The final soft symbol decisions for the desired user are then obtained by averaging the soft symbol decisions from each of the J branches:
Finally, the soft decisions
G.2.c Fully-trained block chip-level equalizer receiver. The j-th branch of the fully-trained block chip-level equalizer receiver estimates the desired user's data symbol vector sk
We assume first, for the sake of clarity, there is no additive noise present in Ya
wa
where waj is a ZF linear chip-level equalizer with (Q+1)M−r degrees of freedom.
In order to derive an identifiability condition on wa
because the rows of Xa
BN≧r (18)
which states that the number of observed chip samples should be larger than the system order r.
Let us now assume that additive noise is present in Ya
which can be interpreted as follows. The equalized output matrix for the j-th base-station wa
The obtained fully-trained chip-level equalizer
The final soft symbol decisions for the desired user are then obtained by averaging the soft symbol decisions from each of the J branches:
Finally, the soft decisions
G.2.d Pilot-trained block chip-level equalizer receiver. The j-th branch of the pilot-trained block chip-level equalizer receiver estimates the desired user's data symbol vector sk
We assume first, for the sake of clarity, there is no additive noise present in Ya
ga
where gaj is a ZF linear chip-level equalizer with (Q+1)M−r degrees of freedom. By despreading the above equation with the pilot composite code matrix Cpj by using Equation 70 we can then write
ga
because CdjCpj
In order to derive an identifiability condition on ga
because the rows of Xa
B≧r (23)
which states that the block length B should be larger than the system order r.
Note that Equation 22 can be derived for all Aj processing delays a=A1j, A1j+1, . . . , A2j and these results can be combined, leading to:
where gj is the 1×Aj(Q+1)M linear pilot-trained super chip-level equalizer for the j-th base-station:
gj=[gA
For the pilot-trained receiver, choosing Aj>1 corresponds to taking a larger Q. For this reason, we will only focus on one processing delay (Aj=1).
Let us now assume that additive noise is present in Ya
which can be interpreted as follows. The equalized output matrix for the j-th base-station ga
It is easy to prove that the LS problem in Equation 24 can be rewritten as follows:
showing that the equalized output matrix ga
should-then be as close as possible to the known pilot chip vector of the j-th base-station spjCpj in a Least Squares sense.
The user-specific detection part of the pilot-trained block chip-level equalizer receiver is very similar to the one of the fully-trained block chip-level equalizer receiver. Equations 20 and 75 remain valid if we replace
G.2.e Enhanced pilot-trained block chip-level equalizer receiver. The j-th branch of the enhanced pilot-trained block chip-level equalizer receiver estimates the desired user's data symbol vector sk
We assume first, for the sake of clarity, there is no additive noise present in Ya
fa
where fa
fa
which is a ZF problem in both the equalizer vector fa
In order to derive an identifiability condition on fa
because the rows of Xa
B(N−Kj)≧r (27)
which states that the block length B times the number of unused spreading codes N−Kj should be larger than the system order r. Therefore the maximum number of users that can be supported is Kj=N−1, where N is the spreading factor.
Note that Equation 26 can be derived for all Aj processing delays a=A1j, A1j+1, . . . , A2j and these results can be combined, leading to:
fjyj−sdjdj−spjpj=0
where fj is the 1×Aj(Q+1)M linear enhanced pilot-trained super chip-level equalizer for the j-th base-station:
fj=[fA
For the enhanced pilot-trained receiver, taking Aj>1 does not correspond to taking a larger Q but to the mutually referenced equalizer (MRE) approach presented in [23]. However, the performance improvements obtained by taking Aj>1 come at a very high cost. For this reason, we will only focus on one processing delay (Aj=1).
Let us now assume that additive noise is present in Ya
Since the LS cost function is a quadratic form in fa
because CdjCdj
which can be interpreted as follows. The equalized output matrix for the j-th base-station fa
It is easy to prove that the modified LS problem of Equation 30 can be rewritten as follows:
showing that the enhanced pilot-trained LS problem naturally decouples into two different parts: a training-based part and a fully-blind part. On the one hand, the training-based part (described by the first term of Equation 31) corresponds to the pilot-trained chip equalizer of Equation 24. On the other hand, the fully-blind part (described by the second term of Equation 31) projects the equalized output matrix fa
The user-specific detection part of the enhanced pilot-trained block chip equalizer receiver is very similar to the one of the fully-trained block chip equalizer receiver. Equations 20 and 75 remain valid if we replace
G.3 Space-time Adaptive Chip-level Equalizer Receivers
Up till now, we have only discussed the block version of the different chip equalizer receivers. It addresses a block of B symbols at once and is only suited for a block fading channel, that remains constant during the entire duration of the block. Since in practice, the multi-path fading channel is time-varying, we have to devise an adaptive version as well that updates the equalizer coefficients on the fly. In this section, we derive an adaptive version of the different chip-level equalizer receivers from their corresponding block version and provide complexity figures for each of them. The adaptive receivers address a single symbol at once and consist of J parallel branches, one for each base-station connected to the mobile station of interest. The j-th branch basically consists of two parts: a pilot-aided updating part and a user specific detection part.
G.3.a Preliminary definitions. We introduce the following (Q+1)M×N output matrix block Ya[i]:
where i represents the symbol instant and a the processing delay.
The (Lj+1)M×1 pilot symbol vector for the j-th base-station at the i-th symbol instant is defined as follows:
spj[i]=[spj[i] . . . spj[i]]T
The 1×N multi-user chip vector transmitted by the j-th base-station at the i-th symbol instant starting at delay a is defined by:
xaj[i]=[xj[iN+a] . . . xj[(i+1)N+a−1]]
The Kj×N multi-user composite code matrix Cdj[i] stacks the j-th base-station's active user composite code vectors at the i-th symbol instant:
Cdj[i]=[c1j[i]T . . . CK
Note that the Kj×N multi-user spreading code matrix Ćdj can be similarly defined as Cdj[i], stacking the j-th base-station's active user spreading code vectors for each symbol instant.
G.3.b Pilot-aided adaptive RAKE receiver. The j-th branch of the pilot-aided adaptive RAKE receiver estimates the desired user's data symbol skj[i] (we assume the desired user to be the kj-th user of the j-th base-station multiplex) from the set Yδ
The pilot-aided channel updating part of the j-th branch exploits the knowledge of the pilot composite code vectors and pilot symbols of the corresponding j-th base-station [22]. It continuously updates the j-th base-station's space-time channel vector at the symbol rate. By having a closer look at the corresponding block processing algorithm in Equation 13, it is rather straightforward to derive an adaptive processing algorithm for the j-th branch. The new estimate of the space-time channel vector is the weighted sum of the old estimate and some correction term:
vj[i]=(1−α)vj[i−1]+α(Yδ
The correction term is the component-wise multiplication of the new despread output matrix block Yδ
computations in total for all J branches. By setting L=maxjLj, we can upperbound this complexity figure by (JLMN).
The user-specific detection part of the j-th branch utilizes the updated space-time channel vector, provided by the channel updating part, to coherently combine the received output matrix block Yδ
a by despreading the coherently combined output matrix block vj[i]HYδ
computations in total for all J branches. With L=maxj Lj, we finally arrive at (JNLM) computations.
The final soft decision
Finally, the soft decision
G.3.c Fully-trained adaptive chip-level equalizer receiver. The j-th branch of the fully-trained adaptive chip-level equalizer receiver estimates the desired user's data symbol sk
The fully-trained updating part of the j-th branch exploits the knowledge of the multi-user chip vectors of the corresponding j-th base-station. It continuously updates the equalizer coefficients at the symbol rate using a Square Root Information (SRI) RLS type of adaptive algorithm [24]. By having a closer look at the corresponding LS block processing algorithm in Equation 19, it is indeed rather straightforward to derive an RLS type of adaptive processing algorithm for the j-th branch. The new incoming output matrix block Ya
In this equation, λ is the forget factor, that should be chosen in correspondence with the coherence time of the time-varying channel. Ta
wa
The fully-trained SRI-RLS updating part for the j-th branch consists of two computational steps per time update, namely first a triangular QRD-updating step and then a triangular backsubstitution step. On the one hand, the QRD-updating step amounts to triangularizing a structured matrix, namely a triangular matrix plus N extra columns, and requires (NQ2M2) computations. On the other hand, the triangular backsubstitution step requires (Q2M2) computations. So, the fully-trained updating part requires (JNQ2M2) computations in total for all J branches.
The user-specific detection part of the j-th branch utilizes the updated chip equalizer coefficients wa
by despreading the equalized output matrix block wa
The final soft decision
Finally, the soft decision
G.3.d Pilot-trained adaptive chip-level equalizer receiver. The j-th branch of the pilot-trained adaptive chip-level equalizer receiver estimates the desired user's data symbol sk
The pilot-trained updating part of the j-th branch exploits the knowledge of the pilot composite code vectors and pilot symbols of the corresponding j-th base-station [25]. It continuously updates the equalizer coefficients at the symbol rate using a Square Root Information (SRI) RLS type of adaptive algorithm [24]. By having a closer look at the corresponding LS block processing algorithm in Equation 24, it is indeed rather straightforward to derive an RLS type of adaptive processing algorithm for the j-th branch. The new incoming output matrix block Ya
In this equation, λ is again the forget factor and Ta
ga
The pilot-trained SRI-RLS updating part for the j-th branch consists of three computational steps per time update, namely first a pilot despreading step, then a triangular QRD-updating step and finally a triangular backsubstitution step. The pilot despreading step requires (QMN) computations. The QRD-updating step amounts to triangularizing a structured matrix, namely a triangular matrix with one extra column, and requires (Q2M2) computations. The triangular backsubstitution step also requires (Q2M2) computations. So, the pilot-trained updating part requires (JQMN+JQ2M2) computations in total for all J branches.
The user-specific detection part of the pilot-trained adaptive chip-level equalizer receiver is very similar to the one of the fully-trained adaptive chip-level equalizer receiver. Equations 38 and 39 remain valid if we replace wa
G.3.e Enhanced pilot-trained adaptive chip-level equalizer receiver. The j-th branch of the enhanced pilot-trained adaptive chip-level equalizer receiver estimates the desired user's data symbol sk
The enhanced pilot-trained updating part of the j-th branch exploits the knowledge of all composite code vectors (so both pilot and user composite code vectors) and pilot symbols of the corresponding j-th base-station. It continuously updates the equalizer coefficients at the symbol rate using a Square Root Information (SRI) RLS type of adaptive algorithm [24]. By having a closer look at the corresponding LS block processing algorithm in Equation 30, it is indeed rather straightforward to derive an RLS type of adaptive processing algorithm for the j-th branch. Each new incoming output matrix block Ya
Pdj[i]=IN−Cdj[i]HCdj[i] (42)
The new projected output matrix block Ya
In this equation λ is again the forget factor and Ta
fa
The enhanced pilot-trained SRI-RLS updating part for the j-th branch consists of four computational steps per time update, namely first a projection step, then a pilot symbol spreading step, next a triangular QRD-updating step and finally a triangular backsubstitution step. The projection step decomposes into two substeps, namely the projection matrix calculation substep and the projection matrix multiplication substep. The projection matrix calculation substep would normally require (KjN2) computations. Note however that the multi-user composite code correlation matrix Cdj[i]HCdj[i] in Equation 42 can be expressed as follows:
Cdj[i]HCdj[i]=ĆdjHĆdj⊙csj[i]Hcsj[i] (45)
showing that it is the component-wise multiplication of the multi-user spreading code correlation matrix ĆdjHĆdj and the scrambling code correlation matrix csj[i]Hcsj[i]. Since the multi-user spreading code correlation matrix does not depend on the symbol instant i, it can be precalculated at the basestation and broadcasted to all active mobile stations. This leaves only (N2) complexity for the projection matrix calculation substep at the mobile station. The projection matrix multiplication substep requires (QMN2) computations. The pilot symbol spreading step only requires (N) computations. The QRD-updating step amounts to triangularizing a structured matrix, namely a triangular matrix plus N extra columns, and requires (NQ2M2) computations. Finally, the triangular backsubstitution step requires (Q2M2) computations. So, the enhanced pilot-trained updating part requires (JQMN2+JNQ2M2) computations in total for all J branches.
The user-specific detection part of the enhanced pilot-trained adaptive chip equalizer receiver is very similar to the one of the fully-trained adaptive chip equalizer receiver. Equations 38 and 39 remain valid if we replace wa
G.3.f Complexity comparison. Table I compares the complexity of the different adaptive chin level equalizer receivers. The user-specific detection part is common to all receivers. Moreover, all receivers have linear complexity in the number of actively tracked base-stations J. The updating part of the pilot-aided adaptive RAKE receiver (PA-RAKE) has linear complexity in both the total number of RAKE fingers LM and the spreading factor N. The updating part of all other receivers has quadratic complexity in the total number of equalizer coefficients QM. The updating part of both the fully-trained (FT-CE) and the pilot-trained (PT-CE) adaptive chip-level equalizer receiver has linear complexity whereas that of the enhanced pilot-trained (EPT-CE) adaptive chip-level equalizer receiver has quadratic complexity in the spreading factor N. In fact, the updating part of the EPT-CE is N times more complex than that of the PT-CE. Note that the complexity figure of the EPT-CE agrees with the complexity figure of the LS channel estimators for long code DS-CDMA systems recently proposed in [26].
G.4 Simulation Results
G.4.a Block processing. The simulations for the block chip-level equalizer receivers are performed for the downlink of a WCDMA system with J=2 active base-stations, Kj=K users per base-station, QPSK data modulation, real orthogonal Walsh-Hadamard spreading codes of length N=12 along with a random overlay code for scrambling whose period measures
ρ=B symbols. The j-th base-station's interfering user signals and pilot signal are transmitted with a Pij=Pi dB respectively Ppj=Pp dB higher power than the j-th base-station's desired user signal. The mobile station is equipped with the minimum number of receive antennas for chip rate sampling M=J+1=3 to simultaneously track J=2 base-stations. Each base-station's vector channel with order Lj=L=3 has M(L+1)=12 Rayleigh distributed channel taps of equal average power and is assumed to be constant during the entire duration of the block (block fading channel). The temporal smoothing factor Q+1 is chosen in correspondence with Equation 9 to be Q+1=JL=6. Note that the system order in this case equals r=J(J+1)L=18. All figures show the average BER versus the average SNR per bit for the pilot-aided block RAKE receiver (PA-RAKE), the pilot-trained (PT-CE), the enhanced pilot-trained (EPT-CE) and the fully-trained (FT-CE) block chip-level equalizer receiver. Also shown in the figures is the single-user bound (SUB) which is the theoretical BER-curve of QPSK with M(L+1)-th order diversity in Rayleigh fading channels [27].
the EPT-CE outperforms the PA-RAKE. The PT-CE has worse performance than the PA-RAKE. Only at rather high SNR per bit
the PT-CE has better performance than the PA-RAKE. For a relative interference power of Pi=+10 dB, shown in
G.4.b Adaptive processing. The simulations for the adaptive chip-level equalizer receivers are performed for the downlink of a WCDMA system with J=1 active base-station, Kj=K equal power users, QPSK data modulation, real orthogonal Walsh-Hadamard spreading codes of length N=12 along with a random overlay code for scrambling whose period measures ρ=10 symbols. The mobile station is equipped with the minimum number of receive antennas for chip rate sampling M=J+1=2 to simultaneously track J=1 base-station. The base-station's time-varying vector channel with order Lj=L=3 has M(L+1)=8 Rayleigh distributed channel taps of equal average power with the classical Jakes spectrum and a normalized coherence time of τc=2000 symbols.
being
of the normalized coherence time τc. Also shown in the figures is the single-user bound (SUB) which is the theoretical BER-curve of QPSK with M(L+1)-th order diversity in Rayleigh fading channels [27].
For half system load, shown in
For full system load, shown in
G.5 Conclusion
In this paper, we proposed new pilot-trained and enhanced pilot-trained space-time chip-level equalizer receivers for the downlink of WCDMA systems with a continuous code-multiplexed pilot. With MStSp=J+1 diversity channels at the mobile station, the proposed receivers can simultaneously track J active base-station signals in soft handover mode. These diversity channels can be obtained by either M-fold space diversity, St-fold temporal oversampling, Sp-fold polarization diversity or a combination of the three above mentioned techniques. For instance, with a temporal oversampling factor of St=2 and Sp=2-fold polarization diversity at the receiver, we obtain MStSp=4 (good for soft handover between J=3 base-stations) with only M=1 receive antenna at the mobile station. For both receivers a Least Squares algorithm for block processing and a QRD-based Recursive Least Squares algorithm for adaptive processing has been derived. The proposed receivers are compared with the conventional pilot-aided space-time RAKE receiver and the ideal fully-trained space-time chip-level equalizer receiver both in terms of performance and complexity.
For full system load, the pilot-trained and the enhanced pilot-trained receiver have exactly the same performance. However, for low to medium system load, the enhanced pilot-trained receiver outperforms the pilot-trained receiver and its performance comes close to that of the ideal fully-trained receiver. Moreover, the enhanced pilot-trained chip-level equalizer can be viewed as a semi-blind chip equalizer, since its cost function can be written as the sum of a training-based term and a fully-blind term.
The Least Squares block processing algorithms on the one hand, detect blocks of B symbols at once. For large block lengths B→∞, the performance of both the pilot-trained and the enhanced pilot-trained block chip equalizer receiver converges to the performance of the fully-trained block chip equalizer receiver. Conversely, the performance gain of the enhanced pilot-trained block chip-level equalizer receiver compared to the pilot-trained block chip-level equalizer receiver increases for decreasing block lengths B. Both receivers can deal with a severe near/far situation caused by the power control in the downlink. Moreover they are robust against both channel order over- and underestimation and they can cope with channels with a head or tail approaching zero.
The QRD-based Recursive Least Squares adaptive processing algorithms on the other hand, that detect symbol by symbol, are easily derived from their corresponding block processing algorithm. Both the pilot-trained and the enhanced pilot-trained adaptive chip-level equalizer receiver can track time-varying multi-path channels. The pilot-trained adaptive chip-level equalizer receiver outperforms the pilot-aided adaptive RAKE receiver at medium to high SNR per bit while the enhanced pilot-trained adaptive chip-level equalizer receiver always outperforms the pilot-aided adaptive RAKE receiver. Both the pilot-trained and the enhanced pilot-trained chip-level equalizer receiver have linear complexity in the number of tracked base-stations J and quadratic complexity in the total number of equalizer coefficients QMStSp. Whereas the pilot-trained receiver has only linear complexity in the spreading factor N, the enhanced pilot-trained receiver has quadratic complexity in the spreading factor N. In fact, the enhanced pilot-trained adaptive chip-level equalizer receiver exhibits a N times higher complexity than the regular pilot-trained adaptive chip-level equalizer receiver.
We can conclude that the enhanced pilot-trained space-time chip-level equalizer receiver is a promising receiver for future WCDMA terminals, from a performance as well as a complexity point of view.
H.1 WCDMA Forward Link Data Model
H.1.a Multi-channel framework. Let us consider the forward link of a single-cell WCDMA system with K active user terminals. The base-station transmits a synchronous code division multiplex, employing user specific orthogonal Walsh-Hadamard spreading codes and a base-station specific aperiodic scrambling code. The transmitted multi-user chip sequence consists of K user signals and a continuous pilot signal:
with
Each user's data symbol sequence sk[i] (pilot symbol sequence sp[i]) is spread by a factor N with the user code sequence ck[n] (pilot code sequence cp[n]). The k-th user aperiodic code sequence ck[n] (pilot code sequence cp[n]) is the concatenation of the corresponding Walsh-Hadamard spreading code and the base-station specific aperiodic scrambling code.
Assume that the user terminal is equipped with M receive antennas and let hm(t) denote the continuous-time channel from the base-station to the m-th receive antenna, including the transmit and receive filters. By sampling the different received antenna signals at the chiprate
N/T, we obtain the following received vector sequence:
y[n]=[y1[n]y2[n] . . . yM[n]]T
which can be written as:
where e[n] is similarly defined as y[n] and h[n] is the discrete-time M×1 vector channel from the base-station to the M receive antennas, given by:
h[n]=[h1[n]h2[n] . . . hM[n]]T
In this equation
is the discrete-time channel impulse response from the base-station to the m-th receive antenna. Note that we model h[n] as an M−1 FIR vector filter of order L.
H.1.b Data model for block processing. Let us now introduce the (Q+1)M×BN output matrix Ya (with Hankel structure), shown in equation VIII-I.1.b, where B is the block length, a is the processing delay and Q+1 is the temporal smoothing factor. This output matrix can be written as
Ya=Xa+Ea (49)
where the noise matrix Ea is similarly defined as Ya and is the (Q+1)×r (r=L+Q+1) channel matrix (with Toeplitz structure). The r×BN input matrix Xa (with Hankel structure) is given by:
Xa=[xa−LT . . . xa+QT]T
with the transmitted multi-user chip sequence vector at delay a:
xa=[x[a]x[a+1] . . . x[a+BN−1]]
H.2 Semi-blind Chip Equalizer Receiver
In this section, we discuss a new semi-blind chip equalizer receiver, that exploits all code information on one hand and pilot symbol information on the other hand. This receiver is based on the fully blind chip equalizer receiver for the reverse link of WCDMA systems, presented in the second part of [28]. One algorithm for block processing and one for adaptive processing is derived.
H.2.a Block processing. The block processing algorithm for the semi-blind chip equalizer detects B data symbols at once. We can write the transmitted multi-user chip sequence at delay a=0 as follows:
x0=sdCd+spCp (50)
where sd is the 1×KB total transmitted data symbol vector
sd=[s1 . . . sK]
and sk is the k-th user's 1×B transmitted data symbol vector:
sk=[sk[0]sk[1] . . . sk[B−1]]
The transmitted pilot symbol vector sp is similarly defined as sk. The KB×BN user code sequence matrix Cd stacks the code sequence matrices of the individual users:
Cd=[C1T . . . CKT]T
where Ck is the k-th user's B×BN code sequence matrix
and ck[i] is the k-th user's code sequence vector used to spread the data symbol sk[i]:
ck[i]=[ck[iN] . . . ck[iN+N−1]]
The B×BN pilot code sequence matrix Cp and the pilot code sequence vector cp[i] are similarly defined as Ck respectively ck[i].
The vector x0 is a row of every input matrix from the set {Xa}a=−QL and is therefore ‘contained’ in every output matrix from the set {Xa}a=−QL. The semi-blind block processing problem addressed here is to compute the desired user's data symbol sequence s1 (we assume the first user to be the user of interest) from Ya, with −Q≦a≦L, based on the knowledge of the user code sequence matrix Cd, the pilot code sequence matrix Cp and the pilot symbol vector sp. In order to solve this problem we make the following rather standard assumptions:
Assumption 3: The channel matrix has full column rank r.
Assumption 4: The input matrix Xa has full row rank r.
The first assumption requires that:
(Q+1)(M−1)≧L
Therefore the number of antennas should be at least M=2. The second assumption, on the other hand, requires that:
BN≧r
Let us first, for the sake of clarity, assume there is no additive noise present in Ya (−Q≦a≦L). Because of assumptions 5 and 6, the rows of Ya span the row space of Xa. Hence, there exists a 1×(Q+1)M linear chip equalizer fa, for which:
faYa−x0=0
and this linear chip equalizer fa is a ZF linear chip equalizer with (Q+1) M−r degrees of freedom (hence, this linear chip equalizer is only unique when (Q+1)M−r). Using Equation 50 we can then write:
faYa−sdCd−SpCp=0 (51)
In order to guarantee the uniqueness of the solution for fa,sd up to a complex scaling factor, the matrix [XaT−CdT−(spCp)T]T should have at most a one-dimensional left null space. This leads to the following identifiability condition:
B(N−K)≧r (52)
Therefore the maximum number of users that can be supported is K=N−1.
Let us now assume that additive noise is present in Ya (−Q≦a≦L). We then solve the following Least Squares (LS) minimisation problem:
Since the LS cost function is a quadratic form in fa,sd, the minimisation can be done independently for fa and sd. In order to obtain a direct semi-blind equalizer estimation, we first solve for sd, assuming fa to be known and fixed. The LS solution for sd can be simplified to:
because CdCdH=IKB and CpCdH=0B×KB due to the orthogonality of the user code sequences and the pilot code sequence at each symbol instant. Substituting
which can be interpreted as follows. The equalized signal faYa is projected on the orthogonal complement of the space spanned by the user code sequences. Furthermore, the projected equalized signal faYa(IBN−CdHCd) should be as close as possible to the transmitted pilot chip sequence spCp in a Least Squares sense. Eventually, the LS solution for fa can be written as:
H.2.b Adaptive processing. In this subsection, we derive a Square-Root Information (SRI) RLS type of adaptive algorithm for the semi-blind chip equalizer. By having a closer look at
Equation 55, we notice that each new incoming (Q+1)M×N output matrix block Ya[i] is first projected on the orthogonal complement of the space spanned by the user code sequences, by using the projection matrix {tilde over (C)}d[i]:
The new projected output matrix block Ya[i]{tilde over (C)}d[i] is then used together with the new transmitted pilot chip sequence vector sp[i]cp[i] in a QRD-updating step, shown in Equation 57. In this equation, λ is the forget factor, that should be chosen in correspondence with the coherence time of the time-varying channel. The QRD-updating step tracks a lower-triangular factor Ra[i] and a corresponding right-hand side za[i]. The new value for the semi-blind chip equalizer fa[i] then follows from the backward substitution step:
fa[i]·Ra[i]=za[i] (58)
H.3 Training-based Chip Equalizer Receiver
In this section, we discuss a training-based chip equalizer receiver [25], that exploits (besides the desired user's code information) only pilot code information on one hand and pilot symbol information on the other hand. Again, one algorithm for block processing and one for adaptive processing is derived.
H-3.a Block processing. The block processing algorithm for the training-based chip equalizer, that detects B data symbols at once, can again be formulated as a LS minimisation problem:
which can be interpreted as follows. The equalized signal gaYa is despread with the pilot code sequence matrix Cp. The equalized signal after despreading gaYaCpH should then be as close
as possible to the transmitted pilot symbol vector sp in a Least Squares sense. The LS solution for ga can be written as:
In order to guarantee the uniqueness of the solution for & up to a complex scaling factor, the matrix [(XaCpH)T−spT]T should have at most a one-dimensional left null space. This leads to the following identifiability condition:
B≧r(61)
Note that when K=N−1, IBN−CdHCd=CpHCp. This means that for a fully loaded system (K=N−1) the semi-blind method is exactly the same as the training-based method. This is also indicated by the identifiability conditions 52 and 61.
H.3.b Adaptive processing. In this subsection, we derive an SRI-RLS type of adaptive algorithm for the training-based chip equalizer. By having a closer look at Equation 59, we notice that each new incoming (Q+1)M×N output matrix block Ya[i] is first despread with the pilot code sequence vector cp[i]. The new despread output matrix block Ya[i]cp[i]H is then used together with the new pilot symbol sp[i] in a QRD-updating step, shown in Equation 62. In this equation λ is again the forget factor. The QRD-updating step tracks a lower-triangular factor {circumflex over (R)}a[i] and a corresponding right-hand side {circumflex over (z)}a[i]. The new value for the training-based chip equalizer ga[i] then follows from the backward substitution step:
ga[i]·{circumflex over (R)}a[i]={circumflex over (z)}a[i] (63)
H.4 Conclusion
We have developed new training-based and semi-blind space-time chip equalizer receivers for the forward link of WCDMA systems employing a continous code-multiplexed pilot. The proposed receivers can track fast fading multipath channels and outperform the conventional RAKE receiver with perfect channel knowledge. For full system load, the training-based and the semi-blind approach have exactly the same performance. However, for low to medium system load, the semi-blind approach outperforms the training-based approach.
I.1 WCDMA Forward Link Data Model
I.1.a Multi-channel framework. Let us consider the forward link of a single-cell WCDMA system with K active user terminals. The base-station transmits a synchronous code division multiplex, employing user specific orthogonal Walsh-Hadamard spreading codes and a base-station specific aperiodic scrambling code. The transmitted multi-user chip sequence can then be written as:
with
Each user's symbol sequence is transmitted in blocks of B=P+D symbols, where the first P symbols are known pilot symbols and the final D symbols are unknown data symbols. The pilot symbols correspond to the Dedicated Physical Control CHannel (DPCCH) whereas the data symbols correspond to the Dedicated Physical Data CHannel (DPDCH) in the UTRA specification for 3G mobile communications [11]. Each user's symbol sequence sk[i] is spread by a factor N with the length-ρN user code sequence ck[n]. The k-th user aperiodic code sequence is the multiplication of the user specific Walsh-Hadamard spreading code and the base-station specific aperiodic scrambling code.
We assume that the user terminal is equipped with M receive antennas. The received antenna signals are sampled at the chiprate
and the obtained samples are stacked in the M×1 received vector sequence y[n]=[y1[n] y2[n] . . . yM[n]]T, which can be written as:
where e[n] is the M×1 received noise vector sequence and h[n]=[h1[n] h2[n] . . . hM[n]]T is the M×1 discrete-time vector channel from the basestation to the M receive antennas. Note that we model h[n] as an M×1 FIR vector filter of order L.
I.1.b Data model for block processing. Let us now introduce the following (Q+1)M×BN output matrix Ya (with Hankel structure):
where B is the block length, a is the processing delay and Q+1 is the temporal smoothing factor. This output matrix can be written as:
Ya=Xa+Ea (66)
where the noise matrix Ea is similarly defined as Ya, the (Q+1)M×r channel matrix (with block Toeplitz structure) is given by:
and the r×BN input matrix Xa (with Hankel structure) is defined by:
Xa=[xa−LT . . . xa+QT]T
with the transmitted multi-user chip sequence vector xa, starting at delay a, given by:
xa[x[a] x[a+1] . . . x[a+BN−1]] (67)
r=L+1+Q is called the system order. In order to guarantee the existence of Zero-Forcing (ZF) chip equalizers, we make the following rather standard assumptions about the data model:
Assumption 5: The channel matrix has full column rank r.
Assumption 6: The input matrix Xa has full row rank r.
The first assumption requires that:
(Q+1)(M−1)≧L (68)
which states that the number of receive antennas should be at least M=2 and correspondingly the temporal smoothing factor Q+1 should be at least the channel order L. The second assumption requires that:
BN≧r (69)
which states that the number of observed chip samples BN should be larger than the system order r.
I.2 Space-time Block Chip Equalizer Receivers
In this section we develop a DPCCH-trained and an enhanced DPCCH-trained chip equalizer receiver for the mobile user terminal that can detect the desired user's data symbols from the received base-station signal. Both chip equalizer receivers consist of a linear space-time chip equalizer followed by a correlator. The space-time chip equalizer tries to restore the transmitted multi-user chip sequence by linearly combining the discrete-time signals from the different receive antennas. The correlator descrambles and despreads the equalized signal with the desired user's aperiodic code sequence and generates the soft decisions about the desired user's data symbols. These soft decisions are then input to a decision device that generates the final hard decisions.
The DPCCH-trained and the enhanced DPCCH-trained chip equalizer receivers differ in the amount of a-priori information they assume to determine their equalizer coefficients. In the following, we will describe for both receivers a Least Squares (LS) block processing algorithm, that addresses a block of B symbols at once.
I.2.a Preliminary definitions. The transmitted multi-user chip sequence vector x0, starting at delay α=0, consists of two parts: a first part corresponding to the known pilot symbols of the DPCCH and a second part corresponding to the unknown data symbols of the DPDCH. Using Equation 1 and 67, we can write the 1×BN transmitted multi-user chip sequence vector x0 as follows:
x0=spCp+sdCd (70)
where sp is the 1×KP total pilot symbol vector that stacks the pilot symbol vectors of the different active users:
sp=[sp,1 . . . sp,K]
and sp,k is the 1×P pilot symbol vector that stacks the pilot symbols of the k-th user:
sp,k=[sk[0]sk[1] . . . sk[P−1]]
The 1×KD total data symbol vector sd and the 1×D data symbol vector of the k-th user sd,k are similarly defined as sp respectively sp,k. The KP×BN total pilot code sequence matrix Cp stacks the pilot code sequence matrices of the different active users:
Cp=[Cp,1T . . . Cp,KT]T
where Cp,k is the P×BN pilot code sequence matrix of the k-th user:
Cp,k=[
and
The 1×N code sequence vector of the k-th user ck[i] is used to spread the data symbol sk[i]:
ck[i]=[ck[(i mod ρ)N] . . . ck[(i mod ρ)N+N−1]]
The KP×PN total effective pilot code sequence matrix
The KD×BN total data code sequence matrix Cd and the D×BN data code sequence matrix of the k-th user Cd,k are similarly defined as Cp respectively Cp,k. The D×DN effective data code sequence matrix of the k-th user
The vector x0 is a row of every input matrix from the set {Xa}a=−QL and is therefore contained in every output matrix from the set {Ya}a=−QL. Note that the output matrix Ya can also be partitioned into two parts:
Ya=[Yp,aYd,a]
where the (Q+1)M×PN pilot part Yp,a corresponds to the DPCCH and the (Q+1)M×DN data part Yd,a corresponds to the DPDCH.
I.2.b DPCCH-trained block chip equalizer receiver. The DPCCH-trained block chip equalizer receiver estimates the desired user's data symbol vector sd,k (we assume the desired user to be the k-th user of the basestation multiplex) from Ya, with −Q≦a≦L, based on the knowledge of the desired user's data code sequence matrix Cd,k, the desired user's pilot code sequence matrix Cp,k and the desired user's pilot symbol vector sp,k.
We assume first, for the sake of clarity, there is no additive noise present in Ya(−Q≦a≦L). Because of assumptions 5 and 6, the rows of Ya span the row space of Xa. Hence, there exists a 1×(Q+1)M linear DPCCH-trained chip equalizer ga, for which:
gaYa−x0=0
where ga is a ZF linear chip equalizer with (Q+1)M−r degrees of freedom. By despreading the above equation with the desired user's pilot code sequence matrix Cp,k and by using Equation 70 we can then write:
gaYaCp,kH−sp,k=0 (71)
because of the orthogonality between the data code sequence matrices Cd,k (k=1 . . . K) and the pilot code sequence matrices Cp,k(k=1 . . . K) of the different active users.
Let us now assume that additive noise is present in Ya(−Q≦a≦L). We then solve the following Least Squares (LS) minimisation problem:
which can be interpreted as follows. The equalized signal gaYa is first despread with the desired user's pilot code sequence matrix Cp,k. The equalized signal after despreading gaYaCp,kH should then be as close as possible to the desired user's known pilot symbol vector sp,k in a Least Squares sense.
It is easy to prove that the LS problem in Equation 72 can be rewritten as follows:
showing that the DPCCH-trained LS problem only exploits the pilot part Yp,a of the output matrix Ya.
The LS solution for ga can be written as:
The obtained LS DPCCH-trained chip equalizer
Finally, the soft decisions
I.2.c Enhanced DPCCH-trained block chip equalizer receiver. The enhanced DPCCH-trained block chip equalizer receiver estimates the desired user's data symbol vector sd,k from Ya, with −Q≦a≦L, based on the knowledge of the total data code sequence matrix Cd, the total pilot code sequence matrix Cp and the total pilot symbol vector sp.
We assume first, for the sake of clarity, there is no additive noise present in Ya(−Q≦a≦L). Because of assumptions 5 and 6, the rows of Ya span the row space of Xa. Hence, there exists a 1×(Q+1)M linear enhanced DPCCH-trained chip equalizer fa, for which:
faYa−x0=0
where fa is a ZF linear chip equalizer with (Q+1)M−r degrees of freedom. Using Equation 70 we can then write:
faYa−sdCd−spCp=0 (76)
Let us now assume that additive noise is present in Ya(−Q≦a≦L). We then solve the following Least Squares (LS) minimisation problem:
Since the LS cost function is a quadratic form in fa,sd, the minimisation can be done independently for fa and sd. In order to obtain a direct equalizer estimation, we first solve for sd, assuming fa to be known and fixed. The LS solution for sd can be simplified to:
because of the orthogonality between the data code sequence matrices Cd,k(k=1 . . . K) and the pilot code sequence matrices Cp,k(k=1 . . . K) of the different active users. Substituting
which can be interpreted as follows. The equalized signal faYa is projected on the orthogonal complement of the subspace spanned by the total data code sequence matrix Cd. The equalized signal after projecting faYa(IBN−CdHCd) should then be as close as possible to the known total pilot chip sequence vector spCp in a Least Squares sense.
It is easy to prove that the modified LS problem of Equation 79 can be rewritten as in Equation 80 (see the top of the next page) showing that the enhanced DPCCH-trained LS problem naturally decouples into two different parts: a fully-trained part and a fully-blind part. On the one hand, the fully-trained part (corresponding to the first term in Equation 80)
forces the equalized signal to be as close a possible (in a Least Squares sense) to the transmitted multi-user chip sequence during the training mode of the receiver. On the other hand, the fully-blind part (corresponding to the second term of Equation 80) projects the equalized signal on the orthogonal complement of the subspace spanned by the total effective data code sequence matrix during the blind mode of the receiver. Furthermore, the energy in the projected equalized signal should be as small as possible (in a Least Squares sense) which actually corresponds to a Minimum Output Energy (MOE) criterion. For this reason, the enhanced DPCCH-trained chip equalizer is actually a semi-blind chip equalizer.
The LS solution for fa can be written as:
The obtained LS enhanced DPCCH-trained chip equalizer
I.3 Space-time Adaptive Chip Equalizer Receivers
In this section, we derive Square Root Information (SRI) Recursive Least Squares (RLS) type [24] of adaptive chip equalizer receivers from the corresponding LS block chip equalizer receivers. These receivers address a single symbol at once and consist of two parts: a DPCCH-aided updating part and a user specific detection part.
I.3.a DPCCH-trained adaptive chip equalizer receiver. The DPCCH-trained updating part can work in two modes: a training mode and a decision-directed mode. By having a closer look at the corresponding LS block processing algorithm in Equation 73, it is rather straightforward to derive an SRI-RLS type of adaptive processing algorithm. The (Q+1)M×N new incoming output matrix block Ya[i] is first despread with the desired user's code sequence vector for the i-th symbol instant ck[i]. The new despread output matrix block Ya[i]ckH[i] is then used together with the new desired user's pilot symbol sk[i] (during the training mode) or the new desired user's estimated data symbol ŝk[i] (during the decision-directed mode) in a QRD-updating step [24]. The QRD-updating step tracks a (Q+1)M×(Q+1)M lower triangular factor Ra[i] and a corresponding 1×(Q+1)M right-hand side za[i]. The new values for the equalizer coefficients then follow from the backsubstitution step:
ga[i]·Ra[i]=za[i] (82)
I.3.b Enhanced DPCCH-trained adaptive chip equalizer receiver. The enhanced DPCCH-trained updating part can also work in two modes: a training mode and a blind mode. By having a closer look at the corresponding LS block processing algorithm in Equation 80, it is rather straightforward to derive an SRI-RLS type of adaptive processing algorithm. During the training mode, the new incoming output matrix block Ya[i] is used together with the 1×N new multi-user chip sequence vector x0[i] in a QRD-updating step. During the blind mode, the new incoming output matrix block Ya.[i] is first projected on the orthogonal complement of the subspace spanned by the active user codes, using the N×N projection matrix IN−CdH[i]Cd[i]. The new projected output matrix block is then used together with the 1×N zero vector 01×N in a QRD-updating step [24].
I.4 Simulation Results
The simulations are performed for the forward link of a WCDMA system with K active, equal power users, QPSK data modulation, real orthogonal Walsh-Hadamard spreading codes of length N=8 along with a random scrambling code whose period measures ρ=B symbols. The number of receive antennas M=2, the block length B=200 and the number of pilot symbols P=15. The vector channel with order L=3 has M(L+1)=8 Rayleigh distributed channel taps of equal average power. The temporal smoothing factor is chosen to be Q=L and the performance is averaged over 100 randomly generated channels.
I.5 Conclusion
In this paper, we proposed new DPCCH-trained and enhanced DPCCH-trained chip equalizer receivers for the forward link of WCDMA systems employing a time-multiplexed pilot. In reduced as well as full load settings, the enhanced DPCCH-trained receiver outperforms the regular DPCCH-trained receiver and comes close to the performance of the ideal fully-trained receiver.
The LS block chip equalizer receivers on the one hand, that detect B symbols at once, are suited for packet-switched type of communication. The performance gain of the enhanced DPCCH-trained receiver compared to the regular DPCCH-trained receiver increases for decreasing pilot length P. The RLS adaptive chip equalizer receivers on the other hand, that detect symbol by symbol, are suited for circuit-switched type of communication. They are able to track time-varying multipath channels and outperform the conventional RAKE receiver with perfect channel knowledge.
We can conclude that the enhanced DPCCH-trained chip equalizer receiver is a promising receiver for future WCDMA terminals, from a performance as well as complexity point of view.
J.1 Forward Link DS-CDMA Data Model
Let us consider the forward link of a single-cell DS-CDMA system. The base-station transmits a synchronous code division multiplex, employing user-specific orthogonal Walsh-Hadamard spreading codes and a site-specific aperiodic scrambling code. The transmitted multi-user chip sequence during the i-th symbol period consists of K user signals and a continuous pilot signal:
with Ak, bk[i], ck[i,n] and Ap, bp[i], cp[i,n] the amplitude, the transmitted bit and the aperiodic spreading code of user k respectively the pilot signal during the i-th symbol period. ck[i,n]=wk[n]·s[i,n] for n=0 . . . N=1, with N the spreading factor, is the componentwise multiplication of the Walsh-Hadamard spreading code wk[n] and the aperiodic scrambling code s[i,n]. A similar equation also holds for the aperiodic spreading code of the pilot signal cp[i,n]=wp[n]·s[i,n]. The total transmitted multi-user chip sequence is then the sum of different time-shifted versions of x[i,n]:
The impulse response of the time-varying composite channel during the i-th symbol period, consisting of the low-pass transmit and receive filters and the actual propagation channel, can be modeled as follows:
with M the number of paths in the propagation channel. h(i)[m] and τm are the time-varying complex channel gain respectively the delay of the m-th path. The composite chip waveform ΨRC(t) includes both transmit and receive low-pass filters and has a raised-cosine spectrum.
The total received baseband signal at the mobile user terminal can be written as:
where η(t) is a zero-mean complex colored Gaussian noise process with power spectral density σ2. The noise coloring is due to the receive low-pass filter. The signal y(t) is now temporally oversampled at twice the chip-rate to obtain the first phase y1[n]=y(nTc) respectively the second phase
of the oversampled received signal. These discrete-time signals correspond to the first phase h1[i,n]=h(i)(nTc) respectively the second phase
of the oversampled composite channel, that both have a channel order L.
We can now construct the (N+2F)-dimensional received signal vector for the first and the second phase, q=1 . . . 2, during the i-th symbol period yq[i]=[yq[iN−F] . . . yq[(i+1)N−1+F]]T, with 2F+1 the total length of the equivalent symbol-spaced equalizers. The total received signal vector during the i-th symbol period y[i]=[y1T[i] y2T[i]]T stacks the received signal vectors for both phases and relates to the (N+2F+L)-dimensional transmitted multi-user chip sequence vector x[i]=[x[iN−F−L] . . . x[(i+1)N−1+F]]T through the following equation:
y[i]=[i]·x[i]+η[i] (87)
The total channel matrix [i] stacks the channel matrices for the first respectively the second phase during the i-th symbol period:
while η[i] is the received noise vector during the i-th symbol period. Hq[i], for q=1 . . . 2 is a (N+2F)×(N+2F+L) Toeplitz-matrix, as shown in equation 89 on the next page.
Finally, the transmitted multi-user chip sequence vector relates to the transmitted bits through the following equation:
x[i]=C[i]·A·b[i] (90)
The (3K+3)-dimensional bit vector b[i] contains the user and pilot transmitted bits during the i-th symbol period as well as the previous and the next symbol period. The (N+2F+L)×(3K+3) code matrix C[i] contains the aperiodic spreading codes corresponding to each of the transmitted bits while the (3K+3)-dimensional diagonal matrix A contains the amplitudes of the user signals and the pilot signal.
J.2 Adaptive Chip Equalizer Receiver
The proposed pilot-aided adaptive chip equalizer receiver is well-suited for implementation in a mobile user terminal, operating in fast fading multipath channels. By exploiting the presence of a continuous pilot signal in forthcoming third generation cellular and LEO satellite communication systems, it continuously updates at the symbol rate using a simple NLMS or a more advanced RLS adaptive scheme. The proposed receiver basically consists of two parts: a user-specific detection part and a pilot-aided updating part.
J.2.a User-specific detection part. The user-specific detection part operates like the conventional chip equalizer receiver, as shown in
be written as follows:
with ck[i]=[ck[i,0] . . . ck[i, N−1]] the k-th user aperiodic spreading code vector. The total equalizer matrix [i] simply stacks the equalizer matrices for both phases during the i-th symbol period:
[i]=[G1[i]G2[i]] (92)
The equalizer matrix Gq[i], with q=1 . . . 2 for the first respectively the second phase is a N×(N+2F) Toeplitz-matrix of the corresponding equalizer coefficient vector, as shown in equation 93. Note that the equalized signal is common to all codes that have been assigned to the mobile user terminal in a multi-code system, so a single equalizer suffices.
J.2.b Pilot-aided updating part. The updating part employs the pilot signal to adaptively update the equalizer coefficients at the symbol rate, as opposed to a classical chip equalizer that would be updated at the chip rate. The latter requires a continuous multi-user training chip sequence for fast fading channel conditions, which is impossible to realize in practice.
By reversing the order of the equalization and the descrambling/despreading for the pilot signal, one obtains an elegant adaptive scheme operating at the symbol rate, as shown in
of the pilot symbol during the i-th symbol period can be written as:
with Cp[i] the descrambling/despreading matrix for the pilot signal, being a (2F+1)×(N+2F) Toeplitz-matrix of the pilot's aperiodic spreading code vector cp[i]=[c9[i,0] . . . cp[i, N−1]], as shown in equation 95. For a particular symbol the pilot's descrambling/despreading matrix provides 2F+1 correlation values for each phase (so 2·(2F+1) in total), with F the single-sided symbol-spaced equalizer length. These values are the correlator outputs at the correct symbol instant and F chip periods before and after the correct symbol instant. The total equalizer coefficient vector gH[i] (with 2·(2F+1) coefficients) coherently combines these correlation values to obtain an estimate of the transmitted pilot symbol. The optimal equalizer coefficients are determined iteratively by using a simple NLMS scheme or a more advanced RLS scheme [24].
J.3 Simulation Results
The simulations are performed for the forward link of a DS-CDMA system with K active, equal power users, BPSK data modulation, real orthogonal Walsh-Hadamard codes of length N=32 along with a Gold overlay code for scrambling. The period of the Gold overlay scrambling code measures 12 symbols or equivalently 384 chips. Both the first and second phase of the oversampled channel were modeled by L+1=2 (with L the channel order) independent time-varying Rayleigh-distributed channel taps of equal average power. The Symbol-Spaced (SS) Chip Equalizer (CE) receiver only uses the first phase while the Fractionally-Spaced (FS) Chip Equalizer (CE) receiver uses both phases of the received signal. The independence of the different channel taps of the different phases approximately corresponds to a roll-off factor α=1.
The stepsize of the NLMS algorithm was optimally chosen to be μ=0.5. The optimal value of the RLS forgetting factor proved to be λ=0.9, corresponding to a data memory
being
of the normalized coherence time. Only for rather high SNR per bit, the SS-CE achieves better performance than the RAKE receiver. The FS-CE with RLS updating outperforms the corresponding SS-CE by 4 dB for a BER of 10−3 and approaches the single-user bound. Performance of NLMS updating is within one dB of that of RLS updating.
J.4 Conclusion
We have proposed a new pilot-aided adaptive fractionally-spaced receiver structure for the forward link of DS-CDMA systems employing aperiodic overlay scrambling codes and operating in fast fading multipath channels. By equalizing the multipath channel effects, the receiver restores the orthogonality between the different users and therefore suppresses the MUI. An NLMS or RLS symbol rate adaptation scheme, well-suited for fast fading channels, has been obtained by reversing the order of the equalizer and the descrambler/despreader for the updating part. Simulation results show that the proposed receiver outperforms the conventional RAKE receiver with perfect channel knowledge, for both slow and fast fading multipath channels.
Another embodiment of the invention is shown in
This application is a divisional application of U.S. application Ser. No. 10/134,307, filed Apr. 26, 2002, now U.S. Pat. No. 7,158,558 which claims benefit of 60/286,486 filed on Apr. 26, 2001, the full disclosure of which is incorporated herein by reference.
Number | Name | Date | Kind |
---|---|---|---|
5461640 | Gatherer | Oct 1995 | A |
5862186 | Kumar | Jan 1999 | A |
6301298 | Kuntz et al. | Oct 2001 | B1 |
6426972 | Endres et al. | Jul 2002 | B1 |
6522683 | Smee et al. | Feb 2003 | B1 |
6580750 | Aue | Jun 2003 | B2 |
6665545 | Raleigh et al. | Dec 2003 | B1 |
6671334 | Kuntz et al. | Dec 2003 | B1 |
6801565 | Bottomley et al. | Oct 2004 | B1 |
6937292 | Patel et al. | Aug 2005 | B1 |
6990137 | Smee et al. | Jan 2006 | B2 |
20010046255 | Shattil | Nov 2001 | A1 |
20020191568 | Ghosh | Dec 2002 | A1 |
20030165187 | Tesfai et al. | Sep 2003 | A1 |
20070104282 | Vihriala | May 2007 | A1 |
Number | Date | Country | |
---|---|---|---|
20070064775 A1 | Mar 2007 | US |
Number | Date | Country | |
---|---|---|---|
60286486 | Apr 2001 | US |
Number | Date | Country | |
---|---|---|---|
Parent | 10134307 | Apr 2002 | US |
Child | 11599680 | US |