This invention relates to coding of data.
A binary constant weight code is a code where each member of the code (i.e., each codeword) has the same number of 1's. Constant weight codes have numerous applications.
A conventional general purpose technique for encoding data into constant weight codes is based on a recursive expression for determining the lexicographic index of an element of a codebook. The operation of encoding is equivalent to determining the codeword, given its index, and the operation of decoding is equivalent to determining the index, given the codeword. If b=(b1, b2, . . . , bn) is used to denote the codeword, bi∈{0, 1}, the lexicographic index v(b) is

v(b)=Σ (m=1 to n) bm·C(n−m, w−wm+1),  (1)

where wm is the number of ones in the m-bit prefix of b, w is the weight of the codeword, and C(·,·) denotes the binomial coefficient. See T. M. Cover, “Enumerative source encoding,” IEEE Trans. Information Theory, vol. 19, no. 1, pp. 73-77, January 1973; and J. P. M. Schalkwijk, “An algorithm for source coding,” IEEE Trans. Information Theory, vol. IT-18, pp. 395-399, May 1972. The resulting code is fully efficient, but the complexity of the technique limits its direct application to small block lengths. This is mainly due to the fact that the binomial coefficients in (1) become extremely large, requiring extended precision arithmetic to prevent overflow errors.
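By way of illustration, the following sketch (not part of the method disclosed herein; the helper names are ours) computes the lexicographic index of equation (1) as reconstructed above, and inverts it. It also makes plain why extended precision is needed: the binomial coefficients grow very large.

```python
from math import comb

def lex_index(b):
    """Lexicographic index v(b) of a constant weight codeword, per (1)."""
    n, w = len(b), sum(b)
    wm, v = 0, 0
    for m, bit in enumerate(b, start=1):
        wm += bit                      # ones in the m-bit prefix of b
        if bit:
            # count codewords that agree on the first m-1 bits, have a
            # 0 in position m, and place the remaining ones further right
            v += comb(n - m, w - wm + 1)
    return v

def codeword(v, n, w):
    """Inverse operation: recover the length-n, weight-w codeword of index v."""
    b = []
    for m in range(1, n + 1):
        c = comb(n - m, w)             # completions if position m holds a 0
        if v < c:
            b.append(0)
        else:
            v -= c
            b.append(1)
            w -= 1
    return b

# All C(5,2) = 10 weight-2 words of length 5 receive the indices 0..9:
words = sorted([int(c) for c in format(i, "05b")] for i in range(32)
               if format(i, "05b").count("1") == 2)
assert [lex_index(b) for b in words] == list(range(10))
assert all(codeword(lex_index(b), 5, 2) == b for b in words)
```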
Arithmetic coding is an efficient variable length coding technique for finite alphabet sources. Given a source alphabet and a simple probability model for sequences, with p(x) and F(x) denoting the probability distribution and cumulative distribution function of sequence x, respectively, an arithmetic encoder represents x by a number in the interval [F(x)−p(x), F(x)]. The implementation of such an arithmetic coder can also run into problems with very long registers, but elegant finite-length implementations are known and are widely used. See I. H. Witten et al., “Arithmetic coding for data compression,” Communications of the ACM, vol. 30, pp. 520-540, June 1987. For constant weight codes, the idea is to reverse the roles of encoder and decoder, i.e., to use an arithmetic decoder as an encoder and an arithmetic encoder as a constant weight decoder. An efficient algorithm for implementing such codes using the arithmetic coding approach is given in T. V. Ramabadran, “A coding scheme for m-out-of-n codes,” IEEE Trans. Communications, vol. 38, no. 8, pp. 1156-1163, August 1990. The probability model used by the coder is adaptive, in the sense that the probability that the incoming bit is a 1 depends on the number of 1's that have already occurred. This approach successfully overcomes the finite-register-length constraints associated with computing the binomial coefficients, and the resulting efficiency is often very high, with a loss of one information bit or less in most cases. The encoding complexity of the method is O(n).
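As an illustration of such an adaptive model (our reading of the cited scheme, stated as an assumption rather than a description of Ramabadran's implementation), the probability that the next bit is a 1 can be taken as the fraction of remaining positions that must still hold a 1:

```python
from fractions import Fraction

def prob_next_is_one(n, w, m, w1):
    """P(next bit = 1) after m bits containing w1 ones (assumed model)."""
    return Fraction(w - w1, n - m)

# For n=5, w=2, every weight-2 word receives probability exactly 1/10,
# i.e., log2 C(5,2) bits in total: the model is fully efficient.
p, w1 = Fraction(1), 0
for m, bit in enumerate("00110"):
    q = prob_next_is_one(5, 2, m, w1)
    p *= q if bit == "1" else 1 - q
    w1 += int(bit)
assert p == Fraction(1, 10)
```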
A different method for encoding and decoding balanced constant weight codes was developed by Knuth, as described in D. E. Knuth, “Efficient balanced codes,” IEEE Trans. Information Theory, vol. 32, no. 1, pp. 51-53, January 1986, and is referred to as the complementation method. The method relies on the key observation that if the bits of a length-k binary sequence are complemented sequentially, starting from the left, there must be a point at which the weight is equal to └k/2┘. Given the transformed sequence, it is possible to recover the original sequence by specifying how many bits were complemented (or the weight of the original sequence). This information is provided using check bits of constant weight, and the resulting code consists of the transformed original sequence followed by the constant weight check bits.
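A minimal sketch of the complementation observation (our illustration; the constant weight encoding of the check bits is omitted):

```python
def balance(bits):
    """Find a prefix length s whose complementation balances the word.

    Complementing bits one at a time changes the weight by exactly 1,
    so on the way from weight(bits) to weight(complement(bits)) every
    intermediate value, including k//2, must be hit.
    """
    k = len(bits)
    word = list(bits)
    for s in range(k + 1):
        if sum(word) == k // 2:
            return s, word        # transmit word plus an encoding of s
        if s < k:
            word[s] ^= 1          # complement the next bit from the left
    raise AssertionError("unreachable: the weight k//2 is always hit")

s, balanced = balance([1, 1, 1, 1, 0, 0, 1, 1])   # weight 6, k = 8
assert s == 2 and sum(balanced) == 4
```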
In a series of papers, Bose and colleagues extended Knuth's method in various ways and determined the limits of this approach. See, for example, J.-H. Youn and B. Bose, “Efficient encoding and decoding schemes for balanced codes,” IEEE Trans. Computers, vol. 52, no. 9, pp. 1229-1232, September 2003, and the references therein. Knuth's method is simple and efficient; even though its overall complexity is also O(n), for n=100 it can be eight times as fast as the method based on arithmetic codes. However, the method only works for balanced codes, which restricts its applicability.
In light of the available prior art, what is still needed is an effective and fast method for encoding and decoding constant weight codes that is not restricted in its applicability.
An advance in the art is realized with a method that employs a piecewise linear algorithm, P, to map m-dimensional symbols into code tuples, followed by the construction of codes of weight m from the code tuples. To reverse the operation, constant weight codes are converted to code tuples, and a reverse piecewise linear algorithm, P′, is used to map the code tuples into symbols, from which the data is recovered. The m-dimensional symbols are obtained by mapping the input data into symbols that are contained within an m-dimensional parallelepiped, with each coordinate having a possibly different span, but with the symbols equally spaced along each coordinate axis. The code tuples, which are obtained by employing process P, are contained within an m-dimensional simplex.
A binary constant weight five-bit code of weight 2 is a code whose members (code words) have 5 bits each, and precisely 2 of the bits are 1's. This is illustrated in the first (left most) column below:

codeword | code tuple
---|---
00011 | (4,5)
00101 | (3,5)
00110 | (3,4)
01001 | (2,5)
01010 | (2,4)
01100 | (2,3)
10001 | (1,5)
10010 | (1,4)
10100 | (1,3)
11000 | (1,2)
This code can be described by two-number tuples as shown in the second column in the above table, where each number describes the ordinal position of the “1” in the code. Thus, the (3,4) tuple (third row of the table), for example, states that there is a 1 in the third and fourth bit (counting from the left) of the associated code word. Henceforth herein, tuples that describe a code word in a constant weight code are referred to as code tuples.
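The codeword/tuple correspondence is simply the list of positions of the 1's, as the following sketch (with hypothetical helper names) makes explicit:

```python
def to_tuple(codeword):
    """Code tuple: 1-based positions of the 1's, counting from the left."""
    return tuple(i for i, bit in enumerate(codeword, start=1) if bit == "1")

def to_codeword(t, n):
    """Rebuild the n-bit codeword from its code tuple."""
    return "".join("1" if i in t else "0" for i in range(1, n + 1))

assert to_tuple("00110") == (3, 4)        # third row of the table above
assert to_codeword((3, 4), 5) == "00110"
```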
It may be noted that the code tuples in the above table are in descending lexicographic order, from one row to the next row, and that the numbers within each tuple are in ascending order (viewed from the left). If the first number is designated by y1 and the second number is designated by y2, then one can say that
0<y1<y2≦n, (2)
where n is the number of bits in a code word, i.e., the code length. It may also be noted that the code tuples reduce dimensionality, in this case from 5 to 2, and that the two-dimensional tuples, when normalized to 1 (i.e., all numbers are divided by 5) and depicted in a two-dimensional graph, occupy a triangle, as shown in the figures.
What can be further realized is that a constant weight code of weight 3 may be described by tuples having 3 numbers each which, when depicted in three dimensional space, are enclosed in a three dimensional polyhedron, a tetrahedron, with each of the four faces being right triangles.
Extending the above concepts to w dimensions, for weight w codes, one can realize that the code tuples, having w numbers each, are circumscribed by a w-dimensional simplex having an edge path consisting of w successive orthogonal vectors. A simplex with these properties is called an orthoscheme (H. S. M. Coxeter, Regular Polytopes, 3rd ed., Macmillan, 1968.)
The process of coding data bits can be viewed as a process of mapping those data bits into points of the simplex that are tuples representing the codewords; and once the tuples are identified, mapping the tuples to the binary codewords. Two difficulties arise in mapping data bits into points of the simplex. First, the number of code tuples in a code is generally not a power of two. For example, the weight 2, n=5, constant weight code shown above has 10 code tuples, and 10 is not a power of 2. Second, the code tuples occupy a simplex rather than a rectangular region, so the coordinates of a tuple cannot be selected independently of one another.
We realized, however, that if a bijective function, or mapping, P (and its inverse, P′) can be found between the w-dimensional orthoscheme and a w-dimensional parallelepiped (a w-dimensional “brick”), then the process of coding and decoding data becomes straightforward and efficient. (Bijective means that P′(P(a))=a for every a in A and P(P′(b))=b for every b in B, where A is the brick and B is the orthoscheme.)
To illustrate, as demonstrated above, the code tuples of a constant weight code of length 10 and weight 2 belong to a right triangle 10 shown in the figures. Triangle 10 can be dissected into two regions: a region 12 that is left in place, and a region 13 that is translated to a new position, 13′, such that regions 12 and 13′ together form a rectangle; that is, a two-dimensional brick.
Given an incoming stream of symbols defined by number pairs a1 and a2 (with dynamic ranges 0 to 4 and 0 to 8, respectively), mapping the number pairs to points in the space defined by regions 12 and 13′ (herein, symbols) is quite simple. What is left, then, is to map the symbols in the space defined by region 13′ of the brick back into region 13 of triangle 10, so that every symbol corresponds to a code tuple. Noting that the normalized coordinates (x1, x2) of the symbols in the brick satisfy
0<x1≦1, and ½<x2≦1 (3)
by construction, it follows that:
if x1≧x2
then set x′1=1−x1 and x′2=1−x2+1/n (4)
else set x′1=x1 and x′2=x2
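By way of illustration, one concrete integer realization of this dissection for n=10, w=2 (consistent with the reflection rule of equation (4), though the 1-based indexing conventions here are our own) is:

```python
# Hypothetical illustration of the weight-2 dissection for n = 10:
# the 5 x 9 brick of symbols (a1, a2), 0 <= a1 <= 4, 0 <= a2 <= 8,
# is mapped one-to-one onto the 45 code tuples (y1, y2), 1 <= y1 < y2 <= 10.

N = 10

def pair_to_tuple(a1, a2):
    """Map a symbol from the brick to a code tuple (forward map)."""
    i, j = a1 + 1, a2 + 1          # 1-based coordinates: i in 1..5, j in 1..9
    if j <= N - i:                 # point already inside the triangle: keep it
        return (i, i + j)
    # otherwise reflect, mirroring the x1 >= x2 branch of (4)
    jp = j - (N - i)               # 1 <= jp <= i - 1
    return (N + 1 - i, N + 1 - i + jp)

def tuple_to_pair(y1, y2):
    """Recover the symbol from a code tuple (inverse map)."""
    if y1 <= N // 2:               # image of the "keep" branch
        i, j = y1, y2 - y1
    else:                          # image of the "reflect" branch
        i = N + 1 - y1
        j = (N - i) + (y2 - y1)
    return (i - 1, j - 1)

# Sanity check: the map is a bijection onto all C(10,2) = 45 code tuples.
images = {pair_to_tuple(a1, a2) for a1 in range(5) for a2 in range(9)}
assert len(images) == 45
assert all(1 <= y1 < y2 <= 10 for (y1, y2) in images)
assert all(tuple_to_pair(*pair_to_tuple(a1, a2)) == (a1, a2)
           for a1 in range(5) for a2 in range(9))
```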
Similarly, for codes of weight 3, an inductive process exists for converting the orthoscheme to a “brick.” This is illustrated in the figures.
Algorithmically, the same result is achieved by first handling the mapping of a point's x1 and x2 coordinates, and then handling the mapping of the x3 coordinate.
It is hard to visualize dissections in dimensions greater than 3, and even harder to visualize the necessary mapping of points from the brick (into which the symbols of a block of data are mapped) to the simplex (to create code tuples), and vice versa, so an alternative approach for visualization is needed.
Returning to two dimensions, it can be seen from equation (4) and the figures that the two-dimensional mapping is a conditional reflection: a point whose coordinates already satisfy x1<x2 is left in place, and every other point is reflected into the triangle.
Once the relationship of x1 and x2 is properly set, that is, once it is ensured that x1 is smaller than x2, one proceeds to the third dimension to handle x3.
Our aim is to convert the orthoscheme of the figures into the brick. After the two-dimensional step has ensured that x1<x2, the third coordinate, x3, may stand in any one of three relationships to x1 and x2:
x1<x2<x3
x1<x3<x2 or (5)
x3<x1<x2
The set of operations that are depicted in the figures accomplishes this conversion by means of shift and switch operations on the coordinates.
We discovered a piecewise algorithm that is not only simple and reversible, but that also embeds, within the symbols, the information needed to carry out the forward map and, within the code tuples, the information needed to reverse it. The mappings shown in the figures have precisely this property.
Before delving into the algorithm's equations, let us observe that the added dimension, x3, is last in the order in only the first of the three cases of (5); in the remaining cases the coordinates must be rearranged, and it is this rearrangement that the piecewise algorithm carries out reversibly.
To reiterate, we discovered a piecewise algorithm that is simple and reversible, and that inherently relies on the data itself to determine how the forward and reverse mappings are to be carried out. Moreover, the algorithm applies to dimensions higher than 3, meaning that it may be used for constant weight codes of any desired weight. The following describes the algorithm in mathematical terms which, as indicated above, is iterative in the sense that it starts by handling 2 coordinates, then handles the third coordinate, then the fourth coordinate, etc.
Expressed formally, the problem is to find a bijection between sets Aw and Bw, assuming that the required bijection between Aw−1 and Bw−1 is already known. (Here Bw denotes the w-dimensional orthoscheme {(x1, . . . , xw): 0<x1<x2< . . . <xw≦1}, and Aw the corresponding brick.) The induction is advanced by finding a bijection between Bw−1×((w−1)/w, 1] and Bw (where × designates the Cartesian product of two sets). The wth step in the forward mapping, fw: Bw−1×((w−1)/w, 1]→Bw, is described by the following.
The input to the forward mapping is the vector (x1, x2, x3, . . . , xw), where (x1, x2, x3, . . . , xw−1)∈Bw−1 and xw∈((w−1)/w, 1]. The mapping produces the vector (x′1, x′2, x′3, . . . , x′w)∈Bw, i.e., x′1<x′2< . . . <x′w≦1.
Forward mapping: fw
The above piecewise equation identifies the shift and switch operations required to obtain x′k for different ranges of the variable k. We follow the convention that if the starting index of a range of k-values is larger than the ending index, the range is empty, and the corresponding transformation is not carried out. Also, if an index for x is not in the range 1, . . . , w, it is regarded as a void index, and thus voids the operation. Note that i0=w implies j0=0, in which case Step 2 is the identity.
The next algorithm describes the wth step in the inverse mapping, gw: Bw→Bw−1×((w−1)/w, 1], which is used to recover symbols from code tuples. The input to the mapping is the vector (x′1, x′2, x′3, . . . , x′w)∈Bw, and the output is the vector x=(x1, x2, x3, . . . , xw), where (x1, x2, x3, . . . , xw−1)∈Bw−1 and xw∈((w−1)/w, 1].
Inverse Mapping: gw
and let i0=j0+m0.
3) then x is obtained from x′ by:
To apply the above algorithm to the problem of encoding and decoding constant weight codes, positive integers must be used, and this results in a certain rate loss. The algorithms remain largely unchanged. In a manner analogous to the real-valued case, we find a bijection between AwN⊂Nw and BwN⊂Nw for given w and n (n>2w), where AwN is a w-dimensional brick of integer points and BwN={(y1, y2, . . . , yw)∈Nw: 1≦y1<y2< . . . <yw≦n}. Note that |AwN|≦|BwN|, and the inequality is usually strict, which means that some rate loss is incurred.
The following algorithm provides the forward mapping, i.e., the mapping from AwN into BwN.
Given w and n=pw+q, where p≧0 and 0≦q≦w−1, we divide the range 1, 2, . . . , n into w partitions, where the first w−q−1 partitions each have p elements, the next q partitions each have p+1 elements, and the last partition has p elements, which accounts for all n elements.
where Ti=(w−i0+i−1)p+max(q−i0+i, 0).
The following algorithm provides the inverse mapping, i.e., the mapping from BwN back into AwN.
Again, assume n=pw+q.
where Si=q+(i−1)p+min(i−q−1,0).
The overall complexity of the transform algorithm is O(w²), because at each induction step the complexity is linear in the weight at that step. Recall that the complexities of the arithmetic coding method and Knuth's complementation method are both O(n). Thus, when the weight w is larger than √n, the geometric approach is less competitive. When the weight is low, the proposed geometric technique is more efficient: Knuth's complementation method is not applicable, and the dissection operations of the proposed algorithm make it faster than the arithmetic coding method. Furthermore, due to the structure of the algorithm, it is possible to parallelize part of the computation within each induction step to further reduce the computation time.
So far, little has been said about mapping a binary sequence to an integer sequence y1, y2, . . . , yw such that yi∈[Li, Ui], where Li and Ui are the lower and upper bounds of the valid range as specified by the algorithm. A straightforward method is to treat the binary sequence as an integer and then use the “quotient and remainder” method to find such a mapping. However, this requires division operations, and when the binary sequence is long, the computation is not very efficient. A simplification is to partition the binary sequence into short sequences, and map each short binary sequence to a pair of integers, as in the case of a weight-two constant weight code. Through proper pairing of the ranges, the loss in rate can be minimized.
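A sketch of the quotient-and-remainder translation follows (the ranges below are illustrative placeholders; in the actual coder they would come from the brick dimensions for the chosen n and w):

```python
# Hypothetical sketch of the "quotient and remainder" translation step:
# a block of bits, read as one integer, is expanded into mixed-radix
# digits, one digit per coordinate, each within its own valid range.

def bits_to_symbol(bits, L, U):
    """Map a bit string to integers y[i] with L[i] <= y[i] <= U[i]."""
    value = int(bits, 2)
    sizes = [u - l + 1 for l, u in zip(L, U)]
    y = []
    for l, size in zip(L, sizes):
        value, r = divmod(value, size)   # one division per coordinate
        y.append(l + r)
    assert value == 0, "bit string too long for these ranges"
    return y

def symbol_to_bits(y, L, U, nbits):
    """Invert bits_to_symbol: recover the original bit string."""
    sizes = [u - l + 1 for l, u in zip(L, U)]
    value = 0
    for yi, l, size in reversed(list(zip(y, L, sizes))):
        value = value * size + (yi - l)
    return format(value, f"0{nbits}b")

# Example with the weight-2, n=10 brick (5 x 9 = 45 points >= 2**5):
L_, U_ = [0, 0], [4, 8]
word = "10110"                      # 5 information bits, since 2**5 <= 45
y = bits_to_symbol(word, L_, U_)
assert symbol_to_bits(y, L_, U_, 5) == word
```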
The overall rate loss consists of two parts: the first part comes from the rounding involved in using natural numbers, and the second from the loss in the above simplified translation step. However, when the weight is on the order of √n and n is in the range of 100-1000, the rate loss is usually 1-3 bits per block. For example, when n=529 and w=23, the rate loss is 2 bits/block compared to the best possible code, which would encode k0=132 information bits.
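The stated example can be checked directly, assuming that “best possible” means floor(log2 C(n, w)) information bits per block of a length-n, weight-w code:

```python
from math import comb

# floor(log2 C(529, 23)) computed via exact integer arithmetic
n, w = 529, 23
k_best = comb(n, w).bit_length() - 1   # bit_length - 1 == floor(log2) for ints
assert k_best == 132                   # matches the k0 quoted above
print(k_best - 2)                      # -> 130 information bits/block after the 2-bit loss
```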