U.S. patent application publication US2014/0140360 published May 22, 2014, assigned to the Assignee of the present disclosure, titled “SYSTEMS AND METHODS FOR SPARSE CODE MULTIPLE ACCESS”, herein incorporated by reference in its entirety, discloses exemplary systems, methods, and devices in which embodiments disclosed herein may be practiced.
The present disclosure relates generally to Sparse Code Multiple Access (SCMA) and Low Density Signature (LDS) Multiple Access and more particularly to low complexity systems, methods, and devices for SCMA/LDS detection.
CDMA is a well-known multiple access technique in which data symbols are spread out over orthogonal or non-orthogonal code sequences. Multicarrier CDMA (MC-CDMA) takes advantages of both OFDMA and CDMA to enable flexible code domain multiplexing with the simplicity of OFDMA transceiver techniques especially for wideband communication.
Low complexity SCMA/LDS detection systems, methods and devices are disclosed. In an embodiment, a SCMA/LDS receiver runs a message passing algorithm only over selective sub-graphs, the sub-graphs approximate an underlying full factor graph that represents a codebook for the multicarrier, multiple access code system.
In an embodiment, the sub-graphs are selected such that least number of branches connect to each function node while similar performance as a full message passing algorithm is achieved.
In an embodiment, sub-graphs are selected which have maximum length of a shortest cycle contained in the sub-graph.
In an embodiment, sub-graphs are selected which include all variable nodes.
In an embodiment, message passing algorithms are only performed on those sub-graphs selected such that the fewest number of branches are connected to each function node but similar performance as a full message passing algorithm is achieved.
In an embodiment, a SCMA/LDS receiver decodes by rotating between the selective sub-graphs for a predetermined number of rotation cycles.
In an alternative instrumentality, combinations for which an EXP(.) function (or Euclidean distance) should be computed are preselected so that substantially the same performance is achieved compared to full EXP(.) calculations.
In another alternative instrumentality, a receiver performs a modified MPA utilizing the fact that the number of projection points over each tone is less than the number of constellation points (a property of the SCMA codebook) while achieving similar performance as a full MPA but with much lower complexity.
Multiple instrumentalities are independent of one another and hence, system benefit may be achieved individually or with a combination of instrumentalities. The features and advantages of the disclosure can be realized and obtained by means set forth in detail herein.
For a more complete understanding of the present disclosure, and the advantages thereof, reference is now made to the following descriptions taken in conjunction with the accompanying drawings, wherein like numbers designate like objects, and in which:
It may be advantageous to first set forth definitions of certain words and phrases used throughout this disclosure. The terms “include” and “comprise,” as well as derivatives thereof, mean inclusion without limitation. The term “or” is inclusive, meaning and/or. The phrases “associated with” and “associated therewith,” as well as derivatives thereof, mean to include, be included within, interconnect with, contain, be contained within, connect to or with, couple to or with, be communicable with, cooperate with, interleave, juxtapose, be proximate to, be bound to or with, have, have a property of, or the like. The term “algorithm” is used herein to describe a method for calculating a function.
As will be appreciated by one skilled in the art, aspects of the present disclosure may be embodied as a method, system, device, or computer program product. Accordingly, aspects of the present disclosure may take the form of an entirely hardware embodiment, an entirely software embodiment (including firmware, resident software, micro-code, etc.) or an embodiment combining software and hardware aspects that may all generally be referred to herein as a “circuit”, “module” or “system”. Field Programmable Gate Arrays (FPGAs), Application Specific Integrated Circuits (ASICs), Digital Signal Processors (DSPs) and general purpose processors alone or in combination, along with associated software, firmware and glue logic may be used to construct the present disclosure.
Unless defined otherwise, all technical and scientific terms used herein have the same meaning as commonly understood by those of skill in the art to which this disclosure pertains. For example, the term sub-channels, subcarriers, and tones are used interchangeably herein.
Sparse Code Multiple Access (SCMA) encoding is a technique that encodes binary data streams directly to multi-dimensional codewords. By directly encoding the binary data to multi-dimensional codewords, the SCMA encoding techniques circumvent QAM symbol mapping, thereby achieving coding gains over conventional CDMA encoding. Notably, SCMA encoding techniques convey binary data using the multi-dimensional codeword, rather than a QAM symbol. Moreover, SCMA encoding techniques provide multiple access through assignment of a different codebook for each multiplexed layer, as opposed to using a unique spreading sequence (e.g., Low Density Signature (LDS)—a special case of multi-carrier CDMA wherein its spreading sequences have low density) as is common in conventional CDMA encoding. Further, the SCMA codebooks comprise sparse codewords such that receivers can use low complexity message passing algorithms (MPAs) to detect their respective codewords amongst the multiplexed codewords, which reduces baseband processing complexity on the receiver side. Multiple SCMA layers share the same time-frequency resources of OFDMA. The sparsity of codewords makes near-optimal detection feasible through an iterative MPA. Such low complexity of multi-layer detection allows excessive codeword overloading in which the dimension of multiplexed layers exceeds the dimension of codewords. Optimization of the overloading factor along with modulation-coding levels of layers provides a more flexible and efficient link adaptation mechanism. The signal spreading feature of SCMA improves link-adaptation as a result of less colored interference.
Although complexity is reduced by using a MPA, decoding is still very complex for longer sequences and/or higher overloading factors (e.g. for massive connectivity or Coordinated Multi-Point (CoMP) applications).
By reducing the decoding complexity, delay, cost, and receiver power consumption is reduced. Therefore, there is a need for low complexity decoding systems, methods, and devices for SCMA/LDS while having similar performance to a decoder employing a full MPA.
Sparse Code Multiple Access (SCMA) code can be represented by a factor graph matrix defined as: F=(f1, . . . , fJ), where fj, j=1, . . . , J represents a binary vector of size K, in which J represents the total number of variable nodes (VNs) and K represents the total number of function nodes (FNs). Variable node j and function node k are connected if and only if (F)kj=1.
As will be appreciated by those skilled in the art, factor graphs are graphical representations of complex functions—that is, representations of how variables of a global function are grouped together locally. These graphical methods have arisen as a description of the structure of probabilistic networks, where one of their primary purposes is to provide a simple visualization of probabilistic dependencies.
Factor graphs describe the network of connections among variables by edges between nodes, along which probabilities can be exchanged locally with the purpose of calculating global probabilistic functions. The factor graph represents a blueprint for a computational machine that evaluates the global function via local processing such as in a baseband processor in a handset or in the base station.
Factor graphs are bipartite graphs wherein their nodes fall into just two groups such that there are no connections inside each group, but only between nodes in different groups. These groups are referred to as variable nodes (VN) and function nodes (FN). In the present disclosure, the VNs represent the SCMA layers whereas the FNs represent the tones over which the layers are mapped.
An example of the factor graph representation for F is illustrated in
The MPA 100 iteratively updates the values at FNs 104a-104d according to the values sent from the VNs 102a-102f (starting from the initial ap values) and subsequently uses the updated values at FNs 104a-104d to update the values at the VNs 102a-102f. Updating the vectors or values back and forth between the VNs 102a-102f and the FNs 104a-104d is referred to as message passing or exchange between the two node sets. This back and forth information passing between the FNs 104a-104d and the VNs 102a-102f is repeated until the probability values at the VNs 102a-102f converge to a solution. The converged probability values at the VNs 102a-102f are then processed to determine each of the six symbols. Additional information concerning MPAs is provided by Hoshyar, et al., in “Novel Low-Density Signature for Synchronous CDMA Systems Over AWGN Channel,” IEEE Transactions on Signal Processing, Vol. 56, No. 4, April 2008, and by Hoshyar, et al., in “Efficient Multiple Access Technique,” IEEE 71st VTC 2010, pp. 1-5, U.S. patent application publication US2014/0169408 published Jun. 19, 2014, titled “SYSTEM AND METHOD FOR OPEN-LOOP MIMO COMMUNICATION IN A SCMA COMMUNICATIONS SYSTEM” to Bayesteh et al., and assigned to the Assignee of the present disclosure, and U.S. patent application publication US2014/0169409 published Jun. 19, 2014, titled “SYSTEMS AND METHODS FOR OPEN-LOOP SPATIAL MULTIPLEXING SCHEMES FOR RADIO ACCESS VIRTUALIZATION” to Ma et al. and assigned to the Assignee of the present disclosure, all of said references are herein incorporated by reference in their entirety.
Reference is now made to
According to the present disclosure, sub-graphs are selected such that less number of branches connect to each FN while providing similar performance as a MPA on a full factor graph. The running of a MPA only over sub-graphs rather than a full factor graph is hereinafter referred to as a clustered MPA (CMPA). The sub-graphs are selected such that the number of branches connected to each FN is reduced to a predefined number. The predefined number of branches is determined by the constraint on the MPA. For this purpose, the list of all possible clusters are first considered. For example in
Once an underlying factor graph is established for a selected number of layers and tones, a set of sub-graphs are determined/selected that provide similar performance as the full factor graph.
In general, factor graphs for different number of VNs and FNs (i.e. layers and tones) may be generated and the optimum sub-graphs for each can be simulated/selected offline (such as in a computer workstation using principles of
In the receiving device, a cluster MPA (CMPA) is run over each selected sub-graph which may be stored in memory. A CMPA is similar to the MPA depicted in
Reducing a branch at FNs however, reduces the computational complexity at FNs by an order of magnitude reducing the decoding complexity thus reducing delay, cost, and receiver power consumption. The exponential terms are weighted according to the passed information from the dashed branches. The operations at FN1 for the sub-graph depicted in
I
c
→u
(xm)=Σx
Where: EXP(.) is shorthand for the function e raised to the power of x (ex)—where e is a constant approximately equal to 2.718281828 and xm denotes the mth constellation point, m=1, . . . , M, f(x2,xm) denotes the weighted EXP(.) functions obtained from
A CMPA is run for each sub-graph and continues to run on subsequent sub-graphs based on a pre-determined order. This is called sub-graph rotation. The VN probabilities obtained from running MPA over previous sub-graphs act as a priori probabilities for the next sub-graph for which CMPA is going to be run over. This procedure is continued for a predetermined number of rotation cycles or until the probabilities converge.
A CMPA early termination mechanism can also be enabled for any of the sub-graphs based on a pre-defined convergence metric, similar to those used in a regular MPA. Furthermore, unnecessary calculations can also be skipped for sufficiently small terms.
Link-level simulation was performed to evaluate the performance of CMPA compared to an ideal MPA. A simulator such as, but not limited to, MATLAB® from The MathWorks, Inc., 3 Apple Hill Drive, Natick, Mass., was used for simulation. For this purpose, both LDS and SCMA detection using CMPA were considered. For CMPA, it was assumed that all of the selected sub-graphs include two branches connected to each FN. Simulation parameters for a LDS scenario are given in Table 1 below.
For CMPA, fifteen sub-graphs were assumed. Sub-graphs were selected based on the process depicted in
Simulation parameters for a SCMA scenario are given in Table 2 below.
For CMPA, fifteen sub-graphs were assumed. Sub-graphs were selected based on the process depicted in
The sources of complexity for SCMA/LDS detection are:
Applicants have found through experimentation and simulation that CMPA does not imply any substantial change in terms of the number of required (internal) MPA iterations or the number of required outer-loop iterations (see
In a regular MPA, the sources of complexity for each MPA iteration are the computational complexity at the VNs and FNs. Computational complexity at the VNs includes addition and multiplication. This complexity is the same for CMPA and MPA as the functionality of VNs does not substantially change. However, VN complexity is negligible compared to FN complexity. Accordingly, VN complexity was disregarded and only the FN complexity considered.
Computational complexity at FNs includes addition, multiplication and the EXP(.) computation (recalling that EXP(.) is shorthand for the function e raised to the power of x (ex)—where e is a constant approximately equal to 2.718281828). The EXP(.) computation is performed once for the entire SCMA/LDS decoding.
It is assumed that at each FN, df(I) number of dashed branches exist for which no information is passed from FN wherein df denotes the total number of branches connected to each FN, N denotes the number of FNs (i.e. spreading factor), and M is the size of constellation points. The number of additions, multiplications, and EXP(.) calculations can be obtained from:
N
add
=N(df−df(I))(Md
N
mul
=N(df−df(I))Md
N
EXP
=M
d
. (4)
In another embodiment, MPA implementation is simplified by using Max-Log MAP (MLM) instead of calculating the sum in equation (1). This is accomplished by calculating log(.) of equation (1) and approximating the log of the sum of EXP(.) terms by MAX(.) of those terms. This converts multiplication to addition and the EXP(.) calculation to a MAX(.) operation. Hence,
N
add
(MLM)
=N(df−df(I))Md
N
MAX
(MLM)
=M
d
. (6)
Tables 3 and 4 below compare the complexity reduction of CMPA compared to MPA for four and eight point SCMA constellations for the original algorithm and Max-Log MAP implementation of MPA. It was assumed that all selected sub-graphs include two branches connected to each FN, i.e. df(I)=1. The total complexity is defined as Nadd+3Nmul 10NEXP for regular implementation and Nadd(MLM)+1.5NMAX(MLM) for Max-Log MAP implementation of MPA.
In alternative complexity reduction SCMA/LDS detection systems and methods, combinations are preselected for which the EXP(.) function (or Euclidean distance) is computed such that substantially similar performance is achieved compared to a full EXP(.) calculation.
Looking at the EXP(.) terms at FN n:
The combinations for {xk}kεV
is sufficiently large to consider those combinations for further computations at FN. A process hereinafter referred to as “SelEXP”, identifies those combinations that are sufficiently large to consider without the need to calculate yn−ΣkεV
The distance term is decomposed into two terms namely, one of the hk,nxk terms (denoted by hk*,nxk*) and the rest of the terms: yn−ΣkεV
SelEXP is now described for a single receiver antenna case, i.e. hk,n's are complex scalars. The EXP(.) term is written as:
Where xR represents the “reference” constellation points, φh=phase(hk*,n), φx=phase(xR)−phase(xk*).
Reference is now made to
The quantization regions are identified by defining a region for each of the scaled reference constellation points x*R=|hk*,n|xR. The simplest regions are rectangular regions. For example, for the QPSK modulation, regions can be defined as follows:
={y∥Re(y)−Re(x*R(i))|<th,|Im(y)−Im(x*R(i))|<th}.
The parameter th controls the size of the regions and hence, the complexity reduction percentage of the algorithm. The lower th is—the more reduction of complexity. These regions are depicted in the
Useful combinations are determined by calculating the term A in equation (8) above for all combinations {xk}kεV
U
k*(c)={i|Acεi} (9)
If the above set is empty, EXP(.) is not computed for that particular combination of {xk}kεV
Two solutions exist for identifying the useful combinations that EXP(.) should be computed for with multiple receive antennas.
Solution 1: U(t), t=1, . . . , NR, is denoted as the set of combinations EXP(.) terms are computed for over the t-th receive antenna. The final set of combinations EXP(.) terms are computed for are defined as the intersection of U(t) over all receive antennas, i.e. U=∩t=1N
Solution 2: The maximum ratio combination of the received signals over all antennas follows the subsequent process to find the set of combinations of EXP(.) terms that should be computed. More precisely, the terms A in equation (8) can be computed from:
A=e
−φ
(hk*,nHyn−ΣkεV
The scaled reference constellation points are defined as x*R=∥hk*,n∥2xR. The remaining method is the same as the single antenna receiver case.
The disclosed process works for any arbitrary selection of k* among all VNs connected to a FN. The process is further optimized by selecting the best choice for k*. One option is to select the one with the largest ∥hk,n∥, i.e. k*=arg maxk∥hk,n∥, which maximizes the minimum distance between the points of the expanded reference constellation points and consequently, selecting the useful combinations more efficiently.
Link-level simulation results illustrate the performance and complexity of the SelEX method. Complexity is defined as the average number of combinations for which EXP(.) is calculated normalized by the total number of combinations. From equation (9), complexity reduction percentage (CRP) is computed from:
Where
Reference is now made to
One of the advantages of SCMA over LDS is that the number of projections of codewords over different tones can be less than the number of constellation points. This property can be seen in some of the proposed SCMA codebooks. For example,
Therefore in another alternative instrumentality, a receiver performs a modified MPA utilizing the fact that the number of projection points over each tone is less than the number of constellation points (a property of the SCMA codebook) while achieving similar performance as a full MPA but with much lower complexity.
The projections of the codewords of VN k over tone n are denoted by the set Pk,n=(Pk,n(1), . . . , Pk,n(M)) in which Pk,n(m) denotes the projection of the m-th codeword of VN k over tone n and suppose that some elements in this vector are the same. Tk,n is denoted as the indices corresponding to the repetitive projection points in Pk,n, i.e. Pk,n(i)=Pk,n(j)ck,n, ∀i,jεTk,n. The operations at FN n is expressed as:
To better understand the instrumentality having a MPA for reduced set of projection points hereinafter referred to as “ProjMPA”, two observations are made.
Calculation at FNs:
Consider a VN m*εV(n)\k. The right hand side of equation (11) can be expressed as:
Looking at the first term, ΠmεV(n)\k,m*Iu
Thus, the first term is expressed as:
In other words, from the perspective of FN, the codeword indices in Tk,n are considered as a single codeword with the probability equal to the summation of the probabilities of the codewords in Tk,n.
Passing Information from FN to VN:
Consider the probabilities Ic
This is the same as Ic
From the above observations, the following four step ProjMPA algorithm is provided. 1) Operation at VN is the same as a regular MPA. 2) There is an additional step when passing information from VN k to FN n which shrinks the probability vector from size M to size M−|Tk,n|+1. This is accomplished by removing all indices in Tk,n except one and setting the probability of this element as a sum of probabilities of the indices in Tk,n. This means that the probability space seen at FN n is (M−|Tk,n|+1)-dimensional instead of M-dimensional. 3) The operation complexity at FN is significantly reduced as the probability vectors are of size M−|Tk,n|+1 instead of M. 4) As the probability space at FN n is (M−|Tk,n|+1)-dimensional while the probability space at VN k is M-dimensional, the probability vector to be passed from FN to VN needs to be expanded. This is accomplished by dividing the probability value of the repetitive component by |Tk,n|, and assigning the value to all indices in |Tk,n|. This is the reverse of the operation of passing information from VN k to FN n.
The ProjMPA algorithm has two additional steps compared to the regular MPA. However, the complexity of steps 2 and 4 is very low (one multiplication and couple of additions), while the complexity of step 3 (computations at FN) is significantly reduced due to lower size of probability vectors.
Applicants have performed link-level simulation to evaluate the performance of ProjMPA compared to an ideal MPA and to verify the validity of the analysis above. An eight-point codebook of T1008 was considered, which has five projection points over each tone. Details of the simulation setup are given in the following table 6:
Using the same notations as above, df denotes the total number of branches connected to each FN, N denotes the number of FNs (spreading factor), and M is the size of constellation points, and MP denotes the number of projection points over each tone (for simplicity, this number is assumed to be the same for all FNs).
The number of additions, multiplications and EXP(.) calculations for the ProjMPA algorithm can be obtained from:
N
add
=Nd
f(Mpd
N
mul
=Nd
f
M
p
d
−1(df−2+Mp), (13)
N
EXP
=M
P
d
. (14)
Similar to MPA, Max-Log MAP (MLM) can also be implemented for ProjMPA which converts multiplication to addition and the EXP(.) calculation to MAX(.) operation. Hence, we have:
N
add
(MLM)
=Nd
f
M
p
d
−1(df−2+Mp), (15)
N
MAX
(MLM)
=M
p
d
. (16)
The following tables 7 and 8 compare the complexity reduction of ProjMPA compared to MPA for four and eight point SCMA constellations (T1003 and T1008 respectively) for the original algorithm and a Max-Log MAP implementation of ProjMPA. Total complexity is defined as Nadd+3Nmul+10NEXP for regular implementation and Nadd(MLM)+1.5NMAX(MLM) for Max-Log MAP implementation.
ProjMPA provides another dimension in codebook design, which is the number of projections per each tone. The lower the number of projections, the lower the receiver complexity. Even with a regular MPA implementation, lower number of projections results in less complexity due to reducing the number of EXP(.) calculations. For example, the number of EXP(.) calculations for T1008 is approximately reduced by 76% compared to a codebook with eight projections per each tone.
Reference is now made to
The receiver 715 receives the received signal with Additive White Gaussian Noise (AWGN). The OFDM Demodulation module 716 removes the guard interval (GI) 718, converts the data stream from serial to parallel 720, and performs a fast Fourier transform (FFT) 722 on the data. The SCMA/LDS detector module 702 receives the data from the fast Fourier transform (FFT) 722 output. The output of SCMA/LDS detector module 702 is coupled to FEC decoders 7241 . . . 724K. The output of FEC decoders 7241 . . . 724K may be fed back to the SCMA/LDS detector module 702 to enable MPA outer-loop detection.
Reference is now made to
A processing module 600 performs radio baseband functions preferably using Digital Signal Processors (DSPs), Field Programmable Gate Arrays (FPGAs), Application Specific Integrated Circuits (ASICs), general purpose processors, software, or a combination thereof. The algorithms used to modulate and demodulate the input and outputs signals may use a variety of methodologies, including but not limited to, middleware, e.g., common object request broker architecture (CORBA), or virtual radio machines, which are similar in function to JAVA virtual machines. Processing module 600 performs the methods of low complexity detection in a multicarrier, multiple access code method, in accordance with principles of the present disclosure.
An antenna or antennas 602 provides gain versus direction characteristic to minimize interference, multipath, and noise.
The RF signal is picked up by the antenna(s) 602, filtered, amplified with a low noise amplifier (LNA), and down converted with a local oscillator (LO) to baseband (or IF) by flexible RF hardware 604. The incoming signal is digitizing with an analog to digital converter (ADC) 606. Similarly, an outgoing digital signal is converted to analog by digital to analog converter (DAC) 606. Digital filtering (channelization) and sample rate conversion are provided by module 608 to interface the output of the ADC 606 to the processing module 600. Likewise, module 608 provides digital filtering and sample rate conversion to interface the processing module 600 that creates the modulated waveforms to the digital to analog converter 606.
While this disclosure has described certain embodiments and generally associated methods, alterations and permutations of these embodiments and methods will be apparent to those skilled in the art. Accordingly, the above description of example embodiments does not define or constrain this disclosure. Other changes, substitutions, and alterations are also possible without departing from the spirit and scope of this disclosure, as defined by the following claims.