The present invention relates to forward error correction generally and, more particularly, to a method and/or apparatus for implementing simplified parallel address-generation for interleavers.
In a mobile cellular communication network a forward error correction (FEC) component is commonly determined and added to signals communicated between base stations and mobile units of the network. In addition, the signals are commonly interleaved prior to transmission to reduce the effect of possible burst errors in the network. A mobile unit or base station receiving an interleaved signal with added FEC components decodes and de-interleaves the signal. When a concatenated code (i.e., turbo code) is used for FEC, the transmitting station employs a turbo encoder to generate the code and the receiving unit employs a turbo decoder to process the code. The turbo encoder and the turbo decoder internally perform interleaving and de-interleaving.
The conventional approaches to implementing a turbo code internal interleaver include (i) straight forward implementation of the equation, Pi(i)=(f1*i+f2*î2)mod K, (ii) implementation of the turbo code interleaver with a look up table, and (iii) an interleaver address generation scheme presented by Ericsson document 3GPP R1-063137 “Quadratic Permutation Polynomial Interleavers for LTE Turbo Coding,” Ericsson 2006. A disadvantage of the first approach is that multipliers are needed and therefore the hardware and cycle count is high. A disadvantage of the second approach is that a memory is needed for the look up table that requires additional memory accesses. A disadvantage of the third approach is that two recursion operations are needed per one address generation.
It would be desirable to have a turbo code interleaver that reduces the cycle count of a turbo encoder and decoder.
The present invention concerns an apparatus including a first circuit and a second circuit. The first circuit may be configured to store a block of data values arranged in a first order. The first circuit may be further configured to present a plurality of the data values in parallel in response to a plurality of address signals, where the data values are presented in a second order. The second circuit may be configured to generate the plurality of address signals in response to a first signal, a second signal and a third signal. The second circuit generally includes an even number of address generators configured to generate the plurality of address signals in parallel.
The objects, features and advantages of the present invention include providing a method and/or apparatus for implementing simplified parallel address-generation for interleavers and de-interleavers that may (i) generate multiple addresses in parallel, (ii) generate addresses instead of looking up addresses in a lookup table, (iii) generate addresses without multipliers, (iv) generate addresses with only one recursion per address-generation, (v) use only addition and modulo operations to implement a turbo code interleaver, (vi) provide a turbo encoder with reduced cycle counts, and/or (vii) provide a turbo decoder with reduced cycle counts.
These and other objects, features and advantages of the present invention will be apparent from the following detailed description and the appended claims and drawings in which:
Referring to
The interleaver address generator 100 may be connected (e.g., via an address bus, traces on one or more layers of an integrated circuit, etc.) to a storage medium 102 to implement an interleaver. The storage medium 102 may be implemented, in one example, as a memory. However, other types of storage devices may be implemented accordingly to meet the design criteria of a particular implementation. The storage medium 102 may have an input that may receive a block of K data values (e.g., C(0), . . . , C(K-1)), a number of inputs that may receive the signals ADDR(0,J), . . . , ADDR(M-1,J), and a number of outputs that may present a plurality of signals (e.g., C′(0,J), . . . , C′(M-1,J)). The data values C(0), . . . , C(K-1) are generally arranged in a first order in the storage medium 102. The signals C′(0,J), . . . , C′(M-1,J) generally comprise values from the block of data values arranged in a second order. The second order is generally determined by the signals ADDR(0,J), . . . , ADDR(M-1,J). In one example, the circuit 100 may be configured to generate the signals ADDR(0,J), . . . , ADDR(M-1,J) such that a relationship between the signals C′(0,J), . . . , C′(M-1,J), J=0, 1, 2, . . . , K/M-1, and the block of data values C(0), . . . , C(K-1) may be expressed by the following Equation 1:
C′(i)=C(Pi(i)), i=0, 1, . . . , (K-1), Eq. 1
where the relationship between the output index i and the input index Pi(i) satisfies the following quadratic form:
Pi(i)=(F1*i+F2*î2)mod K. Eq. 2
The parameters F1 and F2 may depend on the block size K. In one example, the parameters F1 and F2 may be implemented using turbo code internal interleaves parameters as summarized in Table 5.1.3-3 of section 5.1.3.2.3 of the LTE standard 3GPP TS 36.212 V*.5.0 (2008-12), which is herein incorporated by reference.
In one example, the circuit 100 may comprise a block (or circuit) 110 and a block (or circuit 112). The circuit 110 may be implemented, in one example, as a control circuit. The circuit 112 may be implemented, in one example, as an address generators module. The circuits 110 and 112 may represent modules and/or blocks that may be implemented as hardware, software, a combination of hardware and software, or other implementations. The circuit 110 may have a first input that may receive the signal I, a second input that may receive the signal K, and a third input that may receive the signal M. The circuit 110 also may have a first output that may present a signal (e.g., J), a second output that may present a signal (e.g., F1) and a third output that may present, either serially or in parallel, a plurality of signals (e.g., BIAS(0)-BIAS(M-1). The signal J may be implemented as an index signal. The signal F1 may be implemented as a parameter signal. The signals BIAS(0)-BIAS(M-1) may be implemented as bias (or base) address signals. The circuit 110 may be configured to generate the signals J, F1, and BIAS(0)-BIAS(M-1) based upon the signals I, K, and M).
The circuit 112 may have, in one example, a first input that may receive the signal M, a second input that may receive the signal K, a third input that may receive the signal J, a fourth input that may receive the signal F1, a fifth input that may receive, either serially or in parallel, the plurality of signals BIAS(0)-BIAS(M-1), and an output that may present, either serially or in parallel, the plurality of signals ADDR(0,J), . . . , ADDR(M-1,J). The circuit 112 may be configured to generate the plurality of signals ADDR(0,J), . . . , ADDR(M-1,J) in response to the signals J, K, M, F1, and BIAS(0)-BIAS(M-1).
Referring to
The circuit 120 may have an input that may receive the signal K, a first output that may present the signal F1, and a second output that may present a signal (e.g., F2). The signal F2 may be implemented as a parameter signal. In one example, the signals F1 and F2 may comprise factors F1 and F2 for the turbo encoding internal interleaver equation (e.g., from Table 5.1.3-3, of section 5.1.3.2.3 of the LTE standard 3GPP TS 36.212 V*.5.0 (2008-12), which is herein incorporated by reference). The circuit 122 may have a first input that may receive the signal F1, a second input that may receive the signal F2, a third input that may receive the signal K, a fourth input that may receive the signal M, and an output that may present, either serially or in parallel, the signals BIAS(0)-BIAS(M-1). The circuit 124 may have an input that may receive the signal I and an output that may present the signal J. The circuit 120 may be configured to generate the signals F1 and F2 based upon the signal K. In one example, the signal K may be used to index a table containing the corresponding values for F1 and F2. The circuit 122 may be configured to generate the signals BIAS(0)-BIAS(M-1) based upon the signals F1, F2, K, and M. In one example, the circuit 122 may be configured to generate the signals BIAS(0)-BIAS(M-1) using Eq. 2 above and a number of initial indices (e.g., I0=0, . . . , I(M-1)=M-1).
Each of the circuits 126a-126m-1 (described in more detail below in connection with
ADDR(P,J)=(BIAS(P)+J*M*F1)mod K, Eq. 3
where P=0, . . . , M-1 and J=0, . . . , K/M-1.
Referring to
Referring to
A sequence to be turbo encoded (e.g., S) may be presented (e.g., as a signal S) to an input of the circuit 202 and an input of the circuit 204. In one example, the sequence S may comprise a systematic data sequence. An output of the circuit 202 may present a first output signal (e.g., P1). In one example, the signal P1 may comprise parity information from the circuit 202. An output of the circuit 204 may present a signal (e.g., S′) to an input of the circuit 206. An output of the circuit 206 may present a second output signal (e.g., P2). In one example, the signal P2 may comprise parity information from the circuit 206.
Referring to
The signal XK may be presented to a first input of the block 302. An output of the block 302 may be presented to an input of the block 304 and an input of the block 306. An output of the block 304 may be presented to an input of the block 308 and a second input of the block 306. An output of the block 306 may be presented to a first input of the block 314. An output of the block 308 may be presented to an input of the block 312 and a first input of the block 310. An output of the block 312 may be presented to a second input of the block 314 and a second input of the block 310. An output of the block 310 may be presented to a second input of the block 302. An output of the block 314 may present the signal ZK.
Referring to
A sequence or block (e.g., a systematic data sequence) to be turbo decoded (e.g., in a mobile unit) may comprise triplets of values (e.g., S, P1, P2). The circuit 402 may have a first input that may receive the signal S, a second input that may receive the signal P1 and a third input that may receive a signal (e.g., λ_IN_0). An output of the circuit 402 may present a signal (e.g., λ_OUT_0) to an input of the circuit 404. In one example, the signal λ_OUT_0 may comprise a logarithm of a first likelihood ratio associated with the output of the circuit 402. An output of the circuit 404 may present a signal (e.g., λ_IN_1). An input of the circuit 406 may receive the signal S. An output of the circuit 406 may present a signal (e.g., S′). The signal λ_IN_1 may be presented to a first input of the circuit 408. The signal S′ may be presented to a second input of the circuit 408. A third input of the circuit 408 may receive the signal P2. The circuit 408 may have a first output that may present a signal (e.g., λ_OUT_1) and a second output that may present a signal (e.g. HD). In one example, the signal λ_OUT_1 may comprise a logarithm of a second likelihood ratio associated with the output of the circuit 408. The signal HD may comprise decoded data (e.g., hard decisions). The signal λ_OUT_1 may be presented to an input of the circuit 410. An output of the circuit 410 may present the signal λ_IN_0.
Referring to
When the parameters F1 and F2 have been determined, the process 500 may move to the step 506. In the step 506, the process 500 may generate a plurality of values corresponding to the M bias signals (e.g., BIAS0-BIAS(M-1)). The M bias values may be generated based upon the values K, M, F1, and F2. In one example, values of a number of initial indices (e.g., I0, I1, . . . , I(M-1) may be set based upon the value of M, and the bias values determined using a corresponding initial index and Equation 1 above. When the bias values have been determined, the process 500 may move to the step 508. In the step 508, the process 500 may generate M address signals (e.g., ADDR0-ADDR(M-1)) in parallel based upon the M bias values, the block size K, the number of addresses to be generated in parallel M, the parameter F1, and an index J generated in response to the index I. When the M addresses have been generated, the process 500 may retrieve M data values from memory using the M addresses and move to the step 510. In the step 510, the-process 500 may determine whether all K data values have been interleaved. If all of the K values have not been processed, the process 500 may move to the step 512. In the step 512, the process 500 increments the index J and returns to the step 508. The steps 508 through 512 may be repeated until all K data values have been interleaved. When all K data values have been interleaved, the process 500 may end.
In one example, an embodiment of the present invention may provide a simplified parallel address-generation scheme for interleavers. An address generator in accordance with an embodiment of the present invention may provide a plurality of addresses generated in parallel. An approach in accordance with an example embodiment of the present invention may base parallel address generation on a calculation of a difference between calculated addresses, thus, part of the modulo operations may be omitted. The approach in accordance with an embodiment of the present invention may be described using the following mathematical equations:
(Pi(I)−Pi(I−M))mod K=((F1*I+F2*Î2)mod K−((I−M)*F1+F2*(I−M)̂2)mod K=(F1*I+F2*Î2 (I−M)*F1−F2*(I−M)̂2)mod K=(M*F1+M*F2(2I−M))mod K Eq. 4
Initial values of the index I may be chosen along with M to satisfy the following:
(M*F2(2I−M))=LK→(M*F2(2I−M))mod K=0, Eq. 5
where L is an integer. Using Equation 5, Equation 4 may be simplified to:
(M*F1+M*F2(2I−M))mod K=(M*F1)mod K. Eq. 6
In general, the calculation of the addresses involves one recursion per address-generation, and uses only add and modulo operations.
Operation of an interleaver address generator in accordance with an embodiment of the present invention may be illustrated by the following examples. In a first example, an address generator may be implemented with two address generation units. Selecting K=40, F1=3, F2=10, the expression of Equation 5 becomes:
(M*10(2I−M))mod 40.
By choosing M to be even, (M*10(2I−M))mod 40=0 for every I. Therefore, the generation of addresses may be simplified by using Equation 6 as follows:
For M=2, two initial indices (e.g., I0=0 and I1=1) may be set and two address signals ADDR0 and ADDR1 generated (e.g., parallelism of two address-generation units).
I=I0+J*M, where J=0, 1, 2, 3 . . . K/M
I=0,2,4 . . . .(K-2).
The address output may be expressed as:
BIAS(0)=0 by using Equation 1 with I0=0;
(BIAS(0)+J*M*F1)mod K=(0+J6)mod 40=0, 6, 12, 18, 24, 30, 36, 2, 8 . . .
I=I1+J*M, where J=0, 1, 2, 3 . . . K/M
I=1, 3, 5, 7, . . . , (K-1).
The address output may be expressed as:
BIAS(1)=13 by using equation (1) with I1=1;
(BIAS1+J*M*F1)mod K=(13+J6)mod 40=13, 19, 25, 31, 37, 3, 9, 15, 21 . . .
In another example, the same parameters may be chosen as in the above example (e.g., K=40, F1=3, and F2=10), however, four addresses maybe generated in parallel. Again, for K=40, F1=3, and F2=10, the expression of Equation 5 becomes:
(M*10(2I−M))mod40.
Choosing M to be even, yields (M10(2I−M))mod40=0 for every I. Therefore, the address generation may be simplified using Equation 6 as follows:
For M=4, four initial indices (e.g., I0=0, I1=1, I2=2 and I3=3) may be set and four address signals ADDR0-ADDR3 generated (e.g., parallelism of four address-generation units).
I=I0+J*M, where J=0, 1, 2, 3, . . . , K/M
I=0, 4, 8, 12, . . . , (K-4).
The address output may be expressed as:
BIAS(0)=0 by using Equation 1 with I0=0;
(BIAS(0)+J*M*F1)mod K=(0+J12)mod 40=0, 12, 24, 36, 8, 20, 32, 4, 16 . . .
I=I1+J*M, where J=0,1,2,3, . . . , K/M
I=1, 5, 9, . . . , (K-3).
The address output may be expressed as:
BIAS(1)=13 by using Equation 1 with I1=1;
(BIAS(1)+J*M*F1)mod K=(13+J12)mod 40=13, 25, 37, 9, 21, 33, 5, 17 . . .
I=I2+J*M, where J=0, 1, 2, 3, . . . , K/M
I=2, 6, 10, . . . , (K-2).
The address output may be expressed as:
BIAS(2)=6 by using Equation 1 with I2=2;
(BIAS(2)+J*M*F1)mod K=(6+J12)mod 40=6, 18, 30, 2, 14, . . .
I=I3+J*M, where J=0, 1, 2, 3 . . . K/M
I=3, 7, 11, . . . , (K-1).
The address output may be expressed as:
BIAS(3)=19 by using Equation 1 with I3=3;
(BIAS (3)+J*M*F1)mod K=(19+J12)mod 40=19, 31, 3, 15, 27, 39, . . .
It will be apparent to those of ordinary skill in the relevant art(s) that the address generation units may be extended (scaled) easily to any parallelism level.
Referring to
The functions performed by the diagram of
The present invention may also be implemented by the preparation of ASICs (application specific integrated circuits), Platform ASICs, FPGAs (field programmable gate arrays), PLDs (programmable logic devices), CPLDs (complex programmable logic device), sea-of-gates, RFICs (radio frequency integrated circuits),
ASSPs (application specific standard products) or by interconnecting an appropriate network of conventional component circuits, as is described herein, modifications of which will be readily apparent to those skilled in the art(s).
The present invention thus may also include a computer product which may be a storage medium or media and/or a transmission medium or media including instructions which may be used to program a machine to perform one or more processes or methods in accordance with the present invention. Execution of instructions contained in the computer product by the machine, along with operations of surrounding circuitry, may transform input data into one or more files on the storage medium and/or one or more output signals representative of a physical object or substance, such as an audio and/or visual depiction. The storage medium may include, but is not limited to, any type of disk including floppy disk, hard drive, magnetic disk, optical disk, CD-ROM, DVD and magneto-optical disks and circuits such as ROMs (read-only memories), RAMS (random access memories), EPROMs (electronically programmable ROMs), EEPROMs (electronically erasable ROMs), UVPROM (ultra-violet erasable ROMs), Flash memory, magnetic cards, optical cards, and/or any type of media suitable for storing electronic instructions.
The elements of the invention may form part or all of one or more devices, units, components, systems, machines and/or apparatuses. The devices may include, but are not limited to, servers, workstations, storage array controllers, storage systems, personal computers, laptop computers, notebook computers, palm computers, personal digital assistants, portable electronic devices, battery powered devices, set-top boxes, encoders, decoders, transcoders, compressors, decompressors, pre-processors, post-processors, transmitters, receivers, transceivers, cipher circuits, cellular telephones, digital cameras, positioning and/or navigation systems, medical equipment, heads-up displays, wireless devices, audio recording, storage and/or playback devices, video recording, storage and/or playback devices, game platforms, peripherals and/or multi-chip modules. Those skilled in the relevant art(s) would understand that the elements of the invention may be implemented in other types of devices to meet the criteria of a particular application.
As would be apparent to those skilled in the relevant art(s), the signals illustrated in
While the invention has been particularly shown and described with reference to the preferred embodiments thereof, it will be understood by those skilled in the art that various changes in form and details may be made without departing from the scope of the invention.