The invention relates generally to communication systems; and, more particularly, it relates to parallel decoding of signals within such communication systems employing turbo codes.
Data communication systems have been under continual development for many years. One such type of communication system that has been of significant interest lately is a communication system that employs iterative error correction codes. One type of communication system that has received interest in recent years has been one which employs turbo codes (one type of iterative error correcting code). Communications systems with iterative codes are often able to achieve lower bit error rates (BER) than alternative codes for a given signal to noise ratio (SNR).
A continual and primary directive in this area of development has been to try continually to lower the SNR required to achieve a given BER within a communication system. The ideal goal has been to try to reach Shannon's limit in a communication channel. Shannon's limit may be viewed as being the data rate to be used in a communication channel, having a particular SNR, that achieves error free transmission through the communication channel. In other words, the Shannon limit is the theoretical bound for channel capacity for a given modulation and code rate.
The use of turbo codes providing such relatively lower error rates, while operating at relatively low data throughput rates, has largely been in the context of communication systems having a large degree of noise within the communication channel and where substantially error free communication is held at the highest premium. Some of the earliest application arenas for turbo coding were space related where accurate (i.e., ideally error free) communication is often deemed an essential design criterion. The direction of development then moved towards developing terrestrial-applicable and consumer-related applications. Still, based on the heritage of space related application, the focus of effort in the turbo coding environment then continued to be achieving relatively lower error floors, and not specifically towards reaching higher throughput.
More recently, focus in the art has been towards developing turbo coding, and variants thereof, that are operable to support higher amounts of throughput while still preserving the relatively low error floors offered within the turbo code context.
Generally speaking, within the context of communication systems that employ turbo codes, there is a first communication device at one end of a communication channel with encoder capability and second communication device at the other end of the communication channel with decoder capability. In many instances, one or both of these two communication devices includes encoder and decoder capability (e.g., within a bi-directional communication system).
The present invention is directed to apparatus and methods of operation that are further described in the following Brief Description of the Several Views of the Drawings, the Detailed Description of the Invention, and the claims. Other features and advantages of the present invention will become apparent from the following detailed description of the invention made with reference to the accompanying drawings.
Many communication systems incorporate the use of a turbo code. When performing decoding of turbo coded signals, there are a variety of means to do so. One means of decoding turbo coded signals is to perform parallel decoding such that a number of turbo decoders are arranged in parallel. In addition, such a parallel decoding approach often involves a number of memories that are also arranged in parallel.
However, there is a challenge to ensure that there are no read and write conflicts between the various turbo decoders and the various memories when performing this parallel decoding approach. When the conflicts during memory access are avoided, then that operation is referred to as contention-free. A novel means is presented herein in which contention-free memory mapping is truly achieved in the context of performing parallel decoding of a turbo coded signal. A novel means of performing the contention-free memory mapping is provided to ensure that any one turbo decoder (of a group of parallel arranged turbo decoders) accesses only memory (of a group of parallel arranged memories) at any given time. In doing so, access conflicts between the turbo decoders and the memories are avoided.
While there are many potential applications that can employ turbo codes, means are presented herein that can be applied to the 3GPP channel code to support an arbitrary number of information bits. Some examples of the number of bits that can be supported using the various aspects of the invention presented herein are 40 to 5114 for WCDMA and HSDPA and more for LTE.
Additional information regarding the UTRA-UTRAN Long Term Evolution (LTE) and 3GPP System Architecture Evolution (SAE) can be found at that organization's publicly available Internet web site.
Within the channel coding system in 3GPP LTE, there is a need and desire to supply and provide for a wide range of block sizes (i.e., turbo code block lengths). Furthermore, turbo decoding of this system needs to be implemented using a parallel decoding arrangement because of the very high data throughput and large block size desired. The parallel decoding requires the contention-free memory accessing (i.e., any one turbo decoder (of a group of parallel arranged turbo decoders) accesses only memory (of a group of parallel arranged memories) at any given time). In reference [1] as depicted below, the authors therein assert that there always exists a contention-free memory mapping for parallel decoding of any degree.
Turbo coding was suggested for 3GPP LTE channel coding. For this coding system, the algebraic interleave referred to as the “almost regular permutation (ARP)” in reference [3] is considered as one of the candidates. In reference [3] and reference [4], a memory mapping for parallel decoding is mentioned. However, as shown herein, that particular memory mapping is not actually contention-free for any block length, but rather only for a block size L satisfying gcd(L,C)=1 where C is the degree of parallelism (i.e., the total number of turbo decoders implemented in the parallel arrangement). In fact, the memory mappings for all the examples in reference [3] and reference [4] are not contention-free.
Herein, it is also shown that the memory mapping defined in reference [2] is also not contention-free for ARP except for the case where C2 is a factor of the block size (i.e., the block length, L). However, as mentioned above, the coding system in 3GPP LTE needs to be adaptable to and operable for a wide variety of block sizes. Therefore, a new memory mapping, that actually is a contention-free memory mapping, is required for all types of block sizes.
Herein, a general and algebraic contention-free memory mapping for ARP is presented that is adaptable to turbo coded signals having any arbitrary block size (i.e., any block length). The proof of the contention-free property of the proposed memory mapping is also given in the APPENDIX. With this contention-free mapping, parallel decoding of turbo codes with interleave ARP of all possible block sizes can be performed.
The goal of digital communications systems is to transmit digital data from one location, or subsystem, to another either error free or with an acceptably low error rate. As shown in
Referring to
To reduce transmission errors that may undesirably be incurred within a communication system, error correction and channel coding schemes are often employed. Generally, these error correction and channel coding schemes involve the use of an encoder at the transmitter and a decoder at the receiver.
Contention-free Memory Mapping
We recall the definition from reference [1]. Consider an index set I={0,1, . . . , L−1} of L integers. Suppose L=CM and let E0, . . . , EM−1 be a partition of the set I such that every set Ei has C elements. Then parallel decoding of degree C (i.e., a parallel arrangement of turbo decoders including C turbo decoders) can be considered as follows.
At the i-th cycle the C processors work in parallel on C elements of the index subset Ei. Let Ê0, . . . , ÊM−1 be another partition of the given index set with ∥Êk∥=C, k=0, . . . , M−1. Then a map defined from the index set I to the set ZC={0,1, . . . , C−1} is a contention-free mapping for the two partitions if it satisfy the following condition: for every j,j′ε{0, . . . , L−1}, j≠j′
j,j′εEi(j)≠(j′) and j,j′εÊi(j)≠(j′). (EQ-1)
In other words, the integers belonging to the same subset in either partition are mapped to different bins.
When this definition applies to turbo code, the set I is the set of the index of the coded symbols (a symbol is combined with information bits and redundancy bits) and one partition is on original index sequence and another one is the interleaved index sequence.
Parallel decoding of Turbo code (reference [1]): Let π be an interleave. The partition E0 ∪. . .∪EM−1 and partition Ê0∪. . .∪ÊM−1 have to be defined by
Ei={jM+i,j=0, . . . , C−1} and Êi={π(jM+i), j=0, . . . , C−1}, i=0, . . . , M−1
Therefore, the j-th processor works on the symbol sequence with the indexes jM, jM+1, . . . , (j+1)M−1 (i.e. a consecutive sequence) or the interleaved sequence with the indexes π(jM), π(jM+1), . . . , π((j+1)M−1).
In reference [2], a contention-free memory mapping is given for some turbo code interleave π, which maps π(jk+i) to └π(jk+i)/k┘ and maps π−1(jk+i) to └π−1(jk+i)/k┘. In fact, the definition is equivalent to the mapping
:i|→└i/k┘ (EQ-2)
This turbo decoder 200 includes a plurality of turbo decoders 221-227, a plurality of memories 241-247, and a processing module 230 that is operable to perform the memory mapping between the plurality of turbo decoders 221-227 and the plurality of memories 241-247. As can be seen, more than one turbo decoders tries to access a same memory at a given time. Specifically, turbo decoder 221 and turbo decoder 222 are trying to access memory 241, and turbo decoder 224 and turbo decoder 225 are trying to access memory 245. Also, turbo decoder 227 and another turbo decoder (included in the region as shown by the ellipsis . . .) are trying to access memory 247. This creates conflicts and incurs deleterious performance.
This turbo decoder 300 includes a plurality of turbo decoders 321-327, a plurality of memories 341-347, and a processing module 330 that is operable to perform contention-free memory mapping between the plurality of turbo decoders 321-327 and the plurality of memories 341-347. As can be seen, only one turbo decoder accesses any one memory at a given time. This is a truly contention-free memory mapping between the plurality of turbo decoders 321-327 and the plurality of memories 341-347.
Referring to the communication system 400 of
The other device 490 to which the communication device 410 is coupled via the communication channel 499 can be another communication device 492, a storage media 494 (e.g., such as within the context of a hard disk drive (HDD)), or any other type of device that is capable to receive and/or transmit signals. In some embodiments, the communication channel 499 is a bi-directional communication channel that is operable to perform transmission of a first signal during a first time and receiving of a second signal during a second time. If desired, full duplex communication may also be employed, in which each of the communication device 410 and the device 490 can be transmitted and/or receiving from one another simultaneously.
The decoder 421 of the communication device 410 includes a turbo decoder 420, a processing module 430, and a memory 440. The processing module 430 can be coupled to the memory 440 so that the memory is operable to store operational instructions that enable to the processing module 430 to perform certain functions.
Generally speaking, the processing module 430 is operable to perform contention-free memory mapping between the turbo decoder 420 and the memory 440 during iterative decoding processing.
It is also noted that the processing module 430 can be implemented strictly as circuitry. Alternatively, the processing module 430 can be implemented strictly in software such as can be employed within a digital signal processor (DSP) or similar type device. In even another embodiment, the processing module 430 can be implemented as a combination of hardware and software as well without departing from the scope and spirit of the invention.
In even other embodiments, the processing module 430 can be implemented using a shared processing device, individual processing devices, or a plurality of processing devices. Such a processing device may be a microprocessor, micro-controller, digital signal processor, microcomputer, central processing unit, field programmable gate array, programmable logic device, state machine, logic circuitry, analog circuitry, digital circuitry, and/or any device that manipulates signals (analog and/or digital) based on operational instructions. The processing module 430 can be coupled to the memory 440 that is operable to store operational instructions that enable to processing module 430 to perform the appropriate contention-free memory mapping between the turbo decoder 420 and the memory 440.
Such a memory 440 may be a single memory device or a plurality of memory devices. Such a memory 440 may be a read-only memory, random access memory, volatile memory, non-volatile memory, static memory, dynamic memory, flash memory, and/or any device that stores digital information. Note that when the processing module 430 implements one or more of its functions via a state machine, analog circuitry, digital circuitry, and/or logic circuitry, the memory storing the corresponding operational instructions is embedded with the circuitry comprising the state machine, analog circuitry, digital circuitry, and/or logic circuitry.
Referring to the communication system 500 of
A communication device includes a turbo decoder that is itself composed of a plurality of turbo decoders 621, 622, and 623. Such a communication device also includes a memory that is itself composed of a plurality of memories 641, 642, and 643. A processing module 630 is operable to perform contention-free memory mapping between the plurality of turbo decoders 621, 622, and 623 and the plurality of memories 641, 642, and 643 during iterative decoding processing of a turbo coded signal.
At any given time, the processing module 630 is operable to ensure that only one turbo decoder accesses a given memory at a given time. For example, a processing module 630 is operable to perform a first contention-free memory mapping at a time 1, as shown by reference numeral 601. The processing module 630 is operable to perform a second contention-free memory mapping at a time 2, as shown by reference numeral 602. The processing module 630 is operable to perform a second contention-free memory mapping at a time 3, as shown by reference numeral 603. The processing module 630 is operable to perform a second contention-free memory mapping at a time 4, as shown by reference numeral 604. As can be seen, only one turbo decoder is connected to any one memory at any given time in each of these 4 diagrams.
As can be seen, the contention-free memory mapping between the turbo decoders 621, 622, and 623 and the plurality of memories 641, 642, and 643 changes as a function of time during iterative decoding processing of a turbo coded signal.
Almost Regular Permutation (ARP)
In reference [3] a regular permutation is defined as circular permutation, based on congruence properties. Circular permutation π, for blocks having L information symbols, is devised as follows. The data are read out such that the j-th datum read was written at the position i given by:
i=π(j)=jP mod L (EQ-3)
where the skip value P is an integer, prime with L. The almost regular permutation (ARP) of size L=CM (i.e. C is a divider of L) is introduced in reference [3] defined by modify (EQ-3) to
i=π(j)=jP+A(j mod C)P+B(j mod C) mod L (EQ-4)
To insure the function defined in (EQ-4) is a permutation (i.e. one to one and on to), reference [3] takes A(j)P+B(j)=C(α[j mod C)P+β(j mod C)].
R1-051310 reference [4] gave an example of (EQ-4) for the period C=4 with the following
i=π(j)=jP+D(mod 4)+3 mod L (EQ-5)
where the function D is defined by:
However, as pointed out in reference [3] if Q1,Q2 and Q3 is not a multiple of 4, the function in (EQ-5) may not be a permutation. In fact the Q's in examples given in reference [4] are all chosen to be multiple of 4.
Contradictions on contention-free memory mapping given in reference [3] and reference [4]
Consider ARP π defined in (EQ-4) and (EQ-5) with period C=4. It is easy to see that π(4l) mod 4, π(4l+1) mod 4, π(4l+2) mod 4, π(4l+3)mod 4 are different. With this conclusion reference [3] and reference [4] suggest the following parallel decoding that will have contention-free. “the circle can be evenly divided into four parts, or quadrants, four Soft-in/Soft-Out (SISO) (forward or backward) processors are assigned to the four quadrants of the circle under process. At the same time, the four units deal with data that are located at places corresponding to the four possible congruencies. For each clock period, the processors will change quadrants through cyclic shifts and two processors will therefore never access the same quadrant.”
According to the claim stated in reference [3] and reference [4], the memory mapping defined in the papers is
:i|→i mod C (EQ-6)
where C is the period and C=4 in reference [4].
Based on the discussion in Section 3 and in reference [1], parallel turbo decoding of degree C. has to have the following index partitions:
Ei={jM+i, j=0, . . . . , C−1} and
Êi={π(jM+i), j=0, . . . , C−1}, i=0, . . . , M−1.
That is to say at the i-th clock, the C parallel processors should work on the symbols with indexes i,i+M, . . . , i+(C−1)M. However, according to reference [3] and reference [4], these C indexes should have different congruencies mod C, i.e. the C elements
i mod C, (i+M) mod C, . . . , [i+(C−1)m] mod C
have to be distinct. This is not true when gcd(M,C)≠1. In fact, when gcd(M,C)=α>1, we have C=αC′ with C′<C. Thus C′M mod C=0. Therefore (i)=(i+C′M), which contradicts the contention-free definition and the claim given in [3] and [4]. The examples given in reference [3] with L=5472=12*456 (C=12) and L=408=4*102 (C=4) and examples given in reference [4] with L=320=4*80 (C=4) and L=640=4*160 (C=4) are all belong to this non-contention-free memory mapping. Therefore, the claim in reference [3] and reference [4] is not true for many cases, except the case where gcd(M,C)=1.
Consider the following memory mapping given in reference [2].
One can prove that the mapping works for ARP only on the case gcd(M,C)=C.
Example 1 (reference [3]) L=408=4* 102 (C=4) with gcd(4,102)=2, i.e. M=102.
Using the interleave π given in reference [3], the following is provided:
π(0)=0,π(M)=74, π(2M)=204, π(3M)=278
This gives
Therefore, the mapping is not contention-free.
Example 2 Let L=804, C=4, P=43, Q1=24, Q2=12 and Q3=4 and interleave π is given in (EQ-5) (i.e. based on reference [4]). Then M=201 i.e. gcd(M,C)=1. However,
π(1)=70, π(M+1)=29, π(2M+1)=624, π(3M+1)=247
This gives
Therefore, the mapping is not contention-free.
Generalization of ARP
Before giving a general contention-free memory mapping for all size of ARP, the ARP definition as provided in reference [3] and reference [4] is combined and generalized below.
Consider interleave a size L information symbols. Let C be a divider of L and let P be an integer prime with L. Suppose L=CM and denote m=M mod C. Let A(x) and B(x) be functions defined on the set {0, . . . , C−1} such that
(A(x)P+B(x))mod C=(A(y)P+B(y))mod C if x=y mod m (EQ-7)
Furthermore, let θε{0, . . . . , C−1}. An ARP π of blocks size L is defined as follows by
π(j)=Pj+A(j mod C)P+B(j mod C)+θ mod L,jε{0, . . . , L−1} (EQ-8)
Moreover, to insure π is a permutation, the condition π(j)≠π(j′) when j≠j′ must be satisfied. In fact, as pointed out in reference [3], by taking A(v)=Cα(v), B(v)=Cβ(v) (as defined in reference [3]), one can show that π is indeed a permutation.
Contention-free Mapping for ARP of Arbitrary Size
Let π be an ARP interleave of size L=CM defined in (EQ-7) with gcd(P,L)=gcd(P,C)=1. According to the discussion in Section 3 (or reference [1]), to have parallel turbo decoding of degree C. the indexes set of information sequence and the indexes set of the interleaved sequence have to be partitioned as
Ei={jM+i|j=0, . . . , C−1}, Êi={π(jM+i)|j=0, . . . , C−1} (EQ-9)
This means at the i-th time clock the C processors will update the soft estimate of the information symbols with the indexes in Ei (or Êi) in parallel.
Denote ZC={i mod C|iεZ} where Z is the integer ring and denote m=M mod C. The order of m in ZC, ordC(m), is defined to be an integer q such that qm mod C=0 but am mod C≠0 for any 0<α<q. Since qM mod C=qm mod C=0, q is also referred to as the order of M and denoted to be ordC(M). It is obvious that q|C. Let Cf be the factor of C such that C=ordC(m)Cf. In the following when there is no confusion, q is denoted as ordC(M)=ordC(m). There is a further partition of Ei and Êi to
where
Ei,s={(sq+r)M+i|r=0, . . . , q−1} and
Êi,s={π[(sq+r)M+i]|r=0, . . . , q−1} (EQ-10)
Definition 1 Let η0,η1, . . . , ηC
by
Theorem 1 is a contention-free mapping.
Proof of this theorem is given in Appendix. In the following, examples will be given.
Example 3 Let L=24, C=4. Then M=6. Let P=7. Define
We have m=M mod C=2 and q=ordC(m)=2. Thus
Ei={i,6+i,12+i,18+i}, i=0, . . . , 5 and
Ei,0 mod C={i mod 4, (2+i) mod 4}, Ei,1 mod C={i mod 4, (2+i) mod 4} Since qM=12, the following results:
Take ηj=j,j=0,1 the following results:
(Ei)={i mod 4, (2+i) mod 4, (1+i) mod 4, (3+i) mod 4}=Z4.
Thus is a contention-free map on Ei.
On the other hand, there is
Consider all 6 cases.
Thus (Ê0)={0,2,1,3}, i.e. contention-free on Ê0.
Thus (Ê1)={3,1,0,2}, i.e. contention-free on Ê1.
Thus (Ê2)={3,0,2,1} , i.e. contention-free on Ê2
Thus (Ê3)={1,0,2,3}, i.e. contention-free on Ê3
Thus (Ê4)={0,2,1,3}, i.e. contention-free on Ê4
Thus (Ê5)={0,2,3,1}, i.e. contention-free on Ê5.
Example 4 (Mapping from reference [3]) L=5472, C=12, P=97. Then M=456 and m=M mod C=0. Thus q=ord(m)=1. Then Ei={456r+i|r=0, . . . , 11}, i=0, . . . , 455 with Eimod C={i}. Moreover,
Take ηj=j,j=0, . . . ,11, then, (Ei)={(i+j)mod 12|j=0, . . . , 11}=Z12. Therefore is a contention-free map on Ei.
On the other hand, considering the following
π(rM+i)=[(rM+i)P+A(i mod C)P+B(i mod C)]mod L
with A(i mod C)P+B (i mod C) defined in the following table, where it is supposed that i mod C=ki.
Thus, by M mod C=0, π(rM+i) mod C=[iP+A(i)P+B(i)]mod C. Therefore, Êi mod C={π(rM+i) mod C|r=0, . . . , C−1}={[iP+A(i)P+B(i)]mod C}, π(rM+i)=[rMP+iP+A(i)P+B(i)]mod CM=[rM+iP+A(i)P+B(i)] mod CM
Let π(i)=(C−k)M+γ(i) with 0≦γ<M, then the following results
π(jM+i)=(C−k+j)M+γ(i), j=0, . . . , k−1 and
π(jM+i)=(j−k)M+γ(i), j=k, . . . . , C−1
Thus
Therefore,
Thus is a contention-free map on Êi.
Example 5 (continue of Example 1) L=408=4*102 (C=4) with gcd(4,102)=2, i.e. M=102. Thus, m=M mod C=2, q=ord(m)=2 and Cf=2. Thus
Ei={i,102+e,204+i,306+i}, i=0, . . . , 101
Ei,0 mod C={i mod 4, (2+i) mod 4}, Ei,1 mod C={i mod 4, (2+i) mod 4}.
Since qM=204, the following results:
Take ηj=j,j=0,1, then
(Ei)={i mod 4, (2+i) mod 4, (1+i) mod 4, (3+i) mod 4}. Thus is a contention-free map on Ei.
On the other hand, according to reference [3], P=43, θ=0, A(0)=A(1)=0,A(2)=A(3)=4 and B(0)=0,B(1)=B(2)=B(3)=4. Thus
Then
Thus (Ê0)={0,2,1,3}, i.e. contention-free on Ê0.
Thus (Ê1)={3,1,0,2}, i.e. contention-free on Ê1.
Thus (Ê2)={3,1,2,0}, i.e. contention-free on Ê2.
. . . .
Thus (Ê96)={0,1,2,3}, i.e. contention-free on Ê96.
. . . .
Example 6 (Mapping from reference [4]) L=320=4*80 (C=4) with gcd(C,80)=4, i.e. M=80. Thus, m=M mod C=0, q=ord(m)=1 and Cf=C=4. Thus
Ei={i,80+i,160+i,240+i}, i=0, . . . , 79 and Ei mod C={i mod 4}. Since qM=80, the following results:
Take ηj=j,j=0,1,2,3, the following results:
(Ei)={i mod 4, (1+i) mod 4, (2+i) mod 4, (3+i) mod 4}.
Thus is a contention-free map on Ei.
On the other hand, according to reference [4], P=197, θ=3, A(0)=A(1)=0, A(2)=A(3)=4 and B(0)=0,B(1)=8,B(2)=20, B(3)=12. Thus
Then
Thus (Ê0)={3,0,1,2}, i.e. contention-free on Ê0.
2) Ê1={π(1),π(81),π(161),π(241)}={208,288,48,128} with
Thus (Ê1)={2,3,0,1}, i.e. contention-free on Ê1.
Thus (Ê2)={0,1,2,3}, i.e. contention-free on Ê2.
Thus (Ê3)={3,0,1,2}, i.e. contention-free on Ê3.
. . . .
It is shown herein that for many block size the memory mappings given in reference [3], reference [4], and reference [2] are not contention-free for the interleave defined by ARP.
A general contention-free mapping for ARP of arbitrary block size is proposed. This mapping is algebraic and can be used in the 3GPP LTE turbo coding, among other coding types.
Thereafter, the method 700 continues by reading the first soft information from the first turbo decoder and writing the first soft information to a third turbo decoder, as shown in a block 731. The method 700 also continues by reading the second soft information from the second turbo decoder and writing the second soft information to the first turbo decoder, as shown in a block 732. Each memory is accessed by only one turbo decoder at a time within the method 700.
Thereafter, the method 700 continues by updating the first soft information using the third turbo decoder, as shown in a block 741, and the method 700 continues by updating the second soft information using the first turbo decoder, as shown in a block 742.
After performing a final turbo decoding iteration, the method 700 involves generating a best estimate of at least one bit that has been encoded into a turbo coded signal using the most recently updated first soft information and second soft information, as shown in a block 750.
The present invention has also been described above with the aid of method steps illustrating the performance of specified functions and relationships thereof. The boundaries and sequence of these functional building blocks and method steps have been arbitrarily defined herein for convenience of description. Alternate boundaries and sequences can be defined so long as the specified functions and relationships are appropriately performed. Any such alternate boundaries or sequences are thus within the scope and spirit of the claimed invention.
The present invention has been described above with the aid of functional building blocks illustrating the performance of certain significant functions. The boundaries of these functional building blocks have been arbitrarily defined for convenience of description. Alternate boundaries could be defined as long as the certain significant functions are appropriately performed. Similarly, flow diagram blocks may also have been arbitrarily defined herein to illustrate certain significant functionality. To the extent used, the flow diagram block boundaries and sequence could have been defined otherwise and still perform the certain significant functionality. Such alternate definitions of both functional building blocks and flow diagram blocks and sequences are thus within the scope and spirit of the claimed invention.
One of average skill in the art will also recognize that the functional building blocks, and other illustrative blocks, modules and components herein, can be implemented as illustrated or by discrete components, application specific integrated circuits, processors executing appropriate software and the like or any combination thereof.
Moreover, although described in detail for purposes of clarity and understanding by way of the aforementioned embodiments, the present invention is not limited to such embodiments. It will be obvious to one of average skill in the art that various changes and modifications may be practiced within the spirit and scope of the invention, as limited only by the scope of the appended claims.
[1] A. Tarable, S. Benedetto and G. Montorsi “Mapping Interleaving Laws to Parallel Turbo and LDPC Decoder Architectures,” IEEE Transactions on Information Theory, Vol. 50, No. 9, pp. 2002-2009, September 2004.
[2] A. Nimbalker, T. E. Fuja, D. J. Costello, Jr., T. K. Blankenship and B. Classon, “Contention-Free Interleavers,” IEEE ISIT 2004, Chicago, USA, Jun. 27-Jul. 2, 2004.
[3] C. Berrou, Y. Saouter, C. Douillard, S. Kerouédan, and M. Jézéquel, “Designing good permutations for turbo codes: towards a single model,” 2004 IEEE International Conference on Communications (ICC), Vol.: 1, pp: 341-345, 20-24 Jun. 2004.
[4] France Telecom, GET, Enhancement of Rel. 6 Turbo Code, 3GPP TSG RAN WG1#43, R1-051310, 2005.
Without loss of generality, in the following proof it is assumed that ηi=i for i=0, . . . , Cf−1. Also, the following definition is employed:
x⊕y=(x+y) mod C and X mod C={x mod C|xεX} for any integer subset.
Before proving the theorem, two lemmas are introduced. Define a set
G(m)={0, m, . . . , (q−1)m} (EQ-A1)
where q is the order of m (as well as M). Then G(m) is a subgroup of ZC.
Lemma 1 Let Hj={h0(j), h1(j), . . . , hq−1(j)}, j=0, . . . , Cf−1 be subsets of non-negative integers such that Hj mod C=α⊕G(m) where αεZC. Let δ0, δ1, . . . , δC
a) The sets (H0+δ0) mod C, (H1+δ1) mod C, . . . , (HC
b) {(H0+δ0) mod C}∪{(H1+δ1) mod C}∪. . .∪{(HC
Proof: It is easy to see that (Hj+δj) mod C=(α⊕δj)⊕G(m). Assume a) of the proposition is not true, then there exist two distinct integers u, v, 0≦v,u<Cf such that δu>δv and (α⊕δu)⊕G(m) and (α⊕δv)⊕G(m) are not disjoint. Then there exist s,tε{0, . . . , q−1} such that s≠t but (α⊕δu)⊕sm=(α⊕δv)⊕tm. This implies δu−δv=(t−s)m mod C which is equivalent to (δu−δv)q=0 mod C s. Thus δu=δv. This contradicts to the assumption on u,v. Therefore, a) is correct. b) of the proposition is a direct consequence of a) and the fact that every set in a) has q elements in ZC.
Lemma 2 If s1≠s2 then
Proof: Since for any sε{0, . . . , Cf−1}, [(sq+r)M+i] mod C=(rM+i) mod C, we can define λi,r=[(sq+r)M+i] mod C for every sε{0, . . . , Cf−1}. Since
π(u)={uP+A(u mod C)P+B(u mod C)+θ} mod CM
we have
Êi,s={[(sq+r)M+i]P+Ω(λi,r)+θ|r=0, . . . , q−1}
where ω(λi,r)=A(λi,r)P+B(λi,r). Assume the lemma is not true, say
for some sε{0, . . . , Cf−1}. Then there exist 0≦rj<q, 0≦ij<M for j=1,2 such that
π((sjq+r) M+i)=[(sjq+r)M+i]P+ω(λi,r)+θ−AjCM=(sq+rj)M+ij.
Subtracting these two equations results with
(s1−s2)q MP−(A1−A2)CM=(r1−r2)M+(i1−i2) (EQ-A1)
This implies M|(i1−i2). Therefore i1=i2. Now (EQ-A1) becomes
(s1−s2)qP−(A1−A2)C=(r1−r2)
This implies q|(r1−r2) since q|C . Thus r1=r2. Therefore
π((s1q+r)M+i)=π((sqq+r)M+i)
which contradicts the fact that π is a permutation and (s1q+r)M+i≠(s2q+r)M+i.
The proof of Theorem 1 will be divided to two parts.
Part I: for i=0, . . . , M−1, and j≠j′, j,j′εEi(j)≠(j′).
Part II: for i=0, . . . , M−1 and j≠j′, j,j′εÊ(j)≠(j′).
Proof of Part I: By the definition in (EQ-10), Ei,s mod C=(i mod C)⊕G(m). Thus
(Ei,s)=((s+i) mod C)⊕G(m).
Since i mod C, (1+i) mod C, . . . , (Cf−1+i) mod C are distinct, by Lemma 1 this proves (Ei,0), (Ei,1), . . . , (Ei,C
This proves that the memory mapping on Ei is one to one and on to since ∥Ei∥=C. This proves the first part of Theorem 1.
Proof of Part II: Consider the sets Êi,s={π((sq+r)M+i)|r=0, . . . , q−1}, 0≦s<Cf, 0≦i<M. For rε{0, . . . , q−1}, iε{0, . . . , M−1} define
then Lemma 2 shows α0,r,i, . . . , αC
{α0,r,i, . . . αC
Therefore, given a tε{0, . . . , Cf−1} there exist st,0, st,1, . . . , st,q−1 such that s
αs
Define Fi,t={π((st,rq+r)M+i)|r=0, . . . . , q−1}, then the result is
Moreover, since (st,rq+r)M+i=rM+i mod C the following result is provided,
[(st,rq+r)M+i]mod C={[(st,rq+r′)M+i]mod C} mod m.
for any r, r′ε{0, . . . , C−1}. Then, by (EQ-7) in the definition of the general ARP, the following relationship results
[A(((st,rq+r)M+i) mod C)P+B((st,rq+r) M+i) mod C)] mod C=[A(((st,rq+r′) M+i) mod C) P+B((st,rq+r′)M+i) mod C)]mod C
for any r,r′ε{0, . . . , C−1}. This value is defined by φ(m,i). Then
Fi,t mod C=G(mP)⊕(iP+φ(m,i)+θ) mod C with ordC(mP)=ordC(m)=q since gcd(P,C)=1. Since
(π((sq+r)M+i))=[π((sq+r)M+i)+αs,r,i]mod C=((rm+i)P+φ(m,i)+θ+αs,r,i) mod C
we have (Fi,t)=G(mP)⊕t⊕(iP+φ)(m,i)+θ) mod C=t⊕Fi,t mod C. Thus, by Lemma 1, (Fi,0), . . . , (Fi,C
Therefore
That means the memory mapping on the set Êi is one to one and on to since ∥Êi∥=C. This proves the second part of Theorem 1.
The present U.S. Utility Patent Application claims priority pursuant to 35 U.S.C. § 119(e) to the following U.S. Provisional patent application which is hereby incorporated herein by reference in its entirety and made part of the present U.S. Utility Patent Application for all purposes: U.S. Provisional application Ser. No. 60/850,492, entitled “General and algebraic-constructed contention-free memory mapping for parallel turbo decoding with algebraic interleave ARP (almost regular permutation) of all possible sizes,” filed Oct. 10, 2006, pending.
Number | Name | Date | Kind |
---|---|---|---|
5406570 | Berrou et al. | Apr 1995 | A |
5446747 | Berrou | Aug 1995 | A |
5563897 | Pyndiah et al. | Oct 1996 | A |
6065147 | Pyndiah et al. | May 2000 | A |
6119264 | Berrou et al. | Sep 2000 | A |
6122763 | Pyndiah et al. | Sep 2000 | A |
6392572 | Shiu et al. | May 2002 | B1 |
6715120 | Hladik et al. | Mar 2004 | B1 |
6957376 | Becker et al. | Oct 2005 | B2 |
7185260 | Coombs et al. | Feb 2007 | B2 |
7200798 | Bickerstaff | Apr 2007 | B2 |
7200799 | Wang et al. | Apr 2007 | B2 |
7441174 | Li et al. | Oct 2008 | B2 |
7500169 | Lin | Mar 2009 | B2 |
7530011 | Obuchii et al. | May 2009 | B2 |
7549113 | Lin | Jun 2009 | B2 |
Number | Date | Country |
---|---|---|
0735696 | Oct 1996 | EP |
0735696 | Jan 1999 | EP |
9105278 | Oct 1992 | FR |
10-2004-0034607 | Apr 2004 | KR |
02093755 | Nov 2002 | WO |
Number | Date | Country | |
---|---|---|---|
20080086673 A1 | Apr 2008 | US |
Number | Date | Country | |
---|---|---|---|
60850492 | Oct 2006 | US |