This invention relates to block codes for adding redundancy to transmitted signals. More particularly, the invention relates to block codes that are, or are similar to, linear block codes and have cyclic, or other group-theoretic properties.
It has long been known that when a message word in the form m=(m1, m2, . . . , mk) is to be sent over a noisy channel, recovery of the message at the receiving end of the channel can be enhanced by adding redundancy to the transmitted message. In one well-known approach, m is subjected, before transmission, to a linear transformation, thereby to produce a codeword c=(c1, c2, . . . , cn), in which the total number n of characters ci in the codeword is greater than the total number k of characters in the message word. The codeword, which incorporates desired redundant information, is transmitted over the channel and decoded at the receiving end.
Each character ci of the codeword is a weighted sum of characters of the corresponding message block. The respective weight coefficients can be expressed in the form of, for example, an n×k matrix. At the receiving end, the decoder uses the coding matrix, or equivalent information, in conjunction with well-known techniques to recover the transmitted message word.
Some examples of linear block codes are Reed-Solomon, BCH, Golay, Goppa, and Hamming codes.
One measure of the effectiveness of a linear code is the minimum code distance. Roughly speaking, the minimum code distance measures the amount of corruption a received codeword can sustain without being mistaken for a different codeword. In mathematical terms, the codewords of a given code are pictured as an array of points in an abstract mathematical space. Associated with the space is a measure of distance between pairs of points. One such distance measure is the well-known Hamming distance. With reference to such a space and such a distance measure, the minimum code distance is related to the separation between the closest pair of codewords.
Linear codes are typically characterized in terms of three values: the code length n, the message length k, and the minimum distance d. In linear codes of the prior art, however, it has not generally been possible to make independent selections of n, k, and d. For example, a primitive BCH code will be completely determined by the designed minimum distance d and the finite alphabet from which the message characters mi are to be selected. Since a Reed-Solomon code is a special case of a primitive BCH code, it will have the same property. (It should be noted that the message alphabet is often the binary alphabet {0, 1}.)
Thus, there has been a need for linear codes that can be designed with greater flexibility. There is a general requirement, known as the “singleton bound,”, that d can never be greater than n+k−1. However, given a message alphabet, there has been a need for codes that can be designed, within the constraint imposed by the singleton bound, with greater independence among the choices of n, k, and d.
We have found a new method for designing linear block codes, which leads to hitherto unknown codes. Importantly, our design method will generally provide, for a given choice of alphabet, code length, and message length, a range of distinct codes, each having its own distance properties. Moreover, the codes produced by our design method have group-theoretic properties, analogous to those of, e.g., Reed-Solomon and BCH codes, which can simplify the computation or estimation of minimum code distance, and could lead to efficient decoding algorithms.
The design procedure for our new codes makes reference to a mathematical construction known as an elliptic curve. Those practiced in algebraic geometry will appreciate that there is an addition rule associated with elliptic curves. Under that rule, any two selected points on an elliptic curve can be summed to produce a third point which also lies on the same curve. A “point at infinity” is also considered to belong to each elliptic curve. The point at infinity is the identity element under the addition rule, and thus bears an analogy to zero as the identity element in ordinary addition. Below, the point at infinity will be denoted 0.
Under the addition rule, every point of an elliptic curve will exhibit cyclic behavior. That is, for every point on the curve there exists a positive integer μ such that when the point is added to itself μ times, the resulting sum will equal the identity element 0. Such a point is referred to as a μ torsion point. Thus, P is a μ torsion point if P+P+ . . . +P=0, and P occurs μ times in the just-stated sum. Such a sum is written μP, and referred to as the scalar product of μ times P.
By applying known methods, it is possible to state, for a given elliptic curve, a polynomial whose roots identify the μ-torsion points. Such a polynomial is referred to as a division polynomial of order μ.
In a broad aspect, our invention involves a coding method, which comprises obtaining as input a message word consisting of a finite sequence of characters, and applying a linear transformation to the message word, thereby to produce as output a codeword consisting of a finite sequence of characters. Each character of the codeword is a sum of characters of the message word, computed such that each message-word character that contributes to said sum is weighted by a respective weight coefficient. The weight coefficients are derived from a division polynomial of an elliptic curve.
A mathematical group is a set of elements together with a binary operation, here denoted “+”. The binary operation must satisfy the following: (a) If x and y are elements of the group, then x+y is also an element of the group. That is, the group is closed under the operation “+”. (b) There exists some element of the group, here denoted “0”, for which, given any element x of the group, x+0=x. (c) For every element x of the group, there exists an element y of the group for which x+y=y+x=0. (d) For any three elements x, y, z of the group, the operation “+” must have the associative property that x+(y+z)=(x+y)+z.
A mathematical ring is a set of elements together with two binary operations, which we refer to as ring addition and ring multiplication, and denoted by “+” and “x”, respectively. The set is closed under both of these operations. The operations must satisfy: (a) The set is a group under ring addition. (b) x+y=y+x for all pairs x, y of elements of the set. That is, ring addition is commutative. (c) (x×y)×z=x×(y×z) for all three-tuples x, y, z of elements of the set. (d) x×(y+z)=(x×y)+(x×z) for all three-tuples x, y, z of elements of the set.
A mathematical field is a ring having the following further properties: (a) the ring is a group under the operation of ring addition. (b) Exclusive of the 0 element, the ring forms a group under ring multiplication. (c) the product of the 0 element with any element of the ring, under ring multiplication, is 0. (d) Addition and multiplication in a field are both commutative. The field addition and multiplication operations are analogous to ordinary addition and multiplication of real numbers, except that it is possible for a field to have only a finite number of elements.
The characteristic χ of a ring or field is the least positive integer for which, given any element x of the ring or field, x+x+ . . . +x=0, where x is taken χ times in the just-stated sum.
Given a ring R, an ideal I of R is a subring of R, i.e., a subset of R which itself forms a ring, which has at least one of the following properties: (a) For all elements i of I and all elements r of R, ri is an element of I. (b) For all elements i of I and all elements r of R, ir is an element of I.
In a restricted sense, an elliptic curve over a field K is a curve of the form
y2+a1xy+a3y=x3+a2x2+a4x+a6,
to which is also appended a point at infinity, here denoted 0, and in which x and y and the coefficients a1, a2, a3, a4, and a6 are elements of the field K. The field K may be a finite field; that is, it may have only a finite number of elements. The above definition will be sufficient to impart an understanding of the principles of the present invention. A fuller definition can be found in many well-known references, such as J. Silverman, Arithmetic of Elliptic Curves, Springer-Verlag, 1986.
A block code having codewords (c1, c2, . . . , cn) is said to be linear if the coefficients c1, c2, . . . , cn belong to a field, the sum of any two codewords is a codeword, and the ordered n-tuple (0, 0, . . . , 0) is a codeword.
Mathematical Observations
A. Relationship Between Products of Polynomials and Linear Transformations
Consider the polynomials a=a2x2+x+a and b=b2x2+b1x+b0. The product of these polynomials is
c=a2b2x4+(b2+a2b1)x3+(a2b0+a1b1+a0b2+(a1b0+a0b1)x+a0b0.
Write each of the original two polynomials as a vector, in which the power of x is implied from position within the vector. That is, write the polynomials, respectively, as a=(a2 a1 a0) and b=(b2 b1 b0). Then in like manner, their product is readily written as
c=(a2b2 a2b1+a1b2 a2b0+a1b1+a0b2 a1b0+a0b1 a0b0).
It will be appreciated that each of the five terms of the product vector is a weighted sum of the elements of a, in which each of the weight coefficients is an element of b. That is, the product vector c is obtained from a linear transformation of a, which may be represented in matrix form by c=Ba, where B is given by:
Those skilled in the art will appreciate that multiplication of any polynomial a, of any degree, by a second polynomial b, also of any degree, can be analogously represented in matrix form as a linear transformation of a.
B. Group Behavior of Elliptic Curves
From the above geometric description, those skilled in the art will find it easy to write down an explicit algebraic formula for the addition rule. An explicit formula can also be found in any of many well-known reference works, including book by J. Silverman, cited above. In the discussion below, we will provide an explicit formula for a special case of the addition rule.
As noted above, the points P of the elliptic curve for which the product μP equals 0 are referred to as μ-torsion points. The total number of μ-torsion points on a given elliptic curve, including the point at infinity, is μ2. However, it should be recalled that the elliptic curve is taken over a given field K. The number of μ-torsion points (x, y) for which x and y are elements of K may be less than μ2.
C. Binary Fields
The characteristic of a field must be a prime number. (It should be noted that the characteristic of a ring need not be a prime number.) For every choice of characteristic p and integer r, there exists one and only one field Φq having q=p′ elements.
As is well known in the art, the field Φ2 consists of the binary set {0, 1}, together with the following addition and multiplication tables:
The characteristic of Φ2 is 2, because either element, added to itself, gives a sum of 0. We have adopted the term “binary field” to denote any field of characteristic 2. It should be noted that this terminology has been adopted here for convenience, and does not necessarily reflect the terminology to be found in the published literature.
Another binary field is Φ4, which consists of elements a, b, c, d, together with the following addition and multiplication tables:
The above addition table shows that element a is the identity element for addition, and that any element, added to itself, gives a sum of a. Thus, the characteristic of Φ4 is 2. The elements a, b, c, d can be identified with the binary pairs (0, 0), (0, 1), (1, 0), and (1, 1), respectively. When that identification is made, it becomes clear that the addition operation represented in the above addition table is ordinary vector addition, modulo 2. Those skilled in the art will appreciate that there are extensions to Φq, where q=2r and r is any integer, in which each element of the field can be identified with a binary r-tuple, and the addition operation remains ordinary vector addition modulo 2. Because of the properties of addition modulo 2, any such field will be “binary” according to our usage of that term.
At block 20, a chosen field Φq is determined by selecting the integer r. The number of elements of the field is q=p′. As explained above, the choice of p and r completely determines the field, and thus determines the message alphabet. In the specific example to be illustrated below, q=2 and r=1, and thus the selected field is the binary field Φ2.
At block 30, a particular elliptic curve over Φq is selected. In the specific example to be illustrated below, the selected elliptic curve is defined by the equation y2+y=x3+x. Reference to the addition and multiplication tables for Φ2 will show that over that field, the points of this curve are (0, 0), (0, 1), (1, 0), and (1, 1), as well as the point 0. It will be understood that by, for example, the ordered pair (0, 0) is meant the point whose x-coordinate is 0 and whose y-coordinate is 0.
Reference to the addition and multiplication tables for Φ4 will show that over that field, the points of this curve are (a, a), (a, b), (b, a), and (b, b), as well as the point 0. Because a and b have the same addition and multiplication properties as the elements 0 and 1 of Φ2, those skilled in the art will appreciate that the points (a, a), (a, b), (b, a), and (b, b) are equivalent to the points (0, 0), (0, 1), (1, 0), and (1, 1) as embedded in the field Φ4.
At block 40, the number μ of desired μ-torsion points is selected. This number is also the order μ of the corresponding division polynomial ψμ(X, Y). The possible choices for μ are limited by the requirement that the characteristic p must not be a divisor of μ. Otherwise, the division polynomial (see below) will have repeated roots; that is, it will be “non-separable.” Non-separability is disfavored because it generally leads to codes that perform poorly.
At block 50, the division polynomial ψμ(X, Y) is determined.
At block 60, a polynomial which divides ψμ(X, Y) is selected. This polynomial is here denominated the generator polynomial g(X, Y) of the code. The degree of g(X, Y) in x will be less than the degree of ψμ(X, Y) in x by an integer amount k′. This difference will relate to the length of the message words in the following respect: The sequence of individual characters that make up each message word corresponds to a polynomial over the selected field Φq. As explained above, the coefficients of this polynomial are the respective characters of the message word. This polynomial is of maximum degree k′−1 in x, and of maximum degree 1 in y. Therefore, the maximum number of terms in this polynomial (and hence the maximum number of characters in the message word) is k′−1 (terms in non-zero powers of x)+k′−1 (terms in y times a non-zero power of x)+1 (term in y)+1 (constant term)=2 k′.
The code is defined by the choice of generator polynomial g(X, Y). This is conveniently explained with reference to
At block 120, ĉ is reduced to at most first degree in y. In this procedure, use is made of the fact that the equation defining the selected elliptic curve is quadratic in y. Thus, a term that is first-order in y can be substituted for every second-order term. For example, the elliptic curve used in the specific example illustrated below is defined by the equation y2+y=x+x. According to this equation, every occurrence of y2 can be replaced with x3+x−y. (It should be noted in this regard that for arithmetic in binary fields, subtraction is equivalent to addition.) By repeatedly making such substitutions, c is reduced to a polynomial of no more than first degree in y. In block 120, we refer to this procedure as taking the quotient of c over the elliptic-curve equation.
At block 130, the quotient obtained in block 120 is further reduced to a degree in x that is less than the degree in x of ψμ(X, Y). If the selected field Φq is binary, it will be possible to express ψμ(X,Y) as a polynomial in x only. In that case, the equation ψμ(X, Y)=0 readily yields a substitution of a sum of lower-order terms for the highest power of x in ψμ(X, Y). As above, repeated substitution will ultimately yield a polynomial of the desired reduced order. In block 130, we refer to this procedure as taking the quotient over the division polynomial. The output of blocks 120 and 130 is the codeword c.
Even if the selected field is not binary, the leading term of ψμ(X, Y) will often involve a power of x only, so that substitution of a sum of lower-order terms can readily be made as above. In the most general case, there are well-known techniques, based on the theory of Groebner bases, for obtaining a polynomial of the desired reduced order. Such techniques are described, for example, in D. Cox, J. Little, and D. O'Shea, Ideals, Varieties, and Algorithms, Springer-Verlag, New York, 1992.
When adding and multiplying coefficients of the polynomials, the addition and multiplication tables for the selected field must be obeyed.
It should be noted that a consistent ordering must be chosen for the terms of a polynomial in x and y. Those skilled in the art will appreciate that several alternative orderings are know to be useful in this regard. One such ordering is the so-called lexicographic ordering, in which: (a) the constant term comes first; (b) next come the terms in powers of x only, beginning with the lowest; (c) next come mixed powers of x and y; and (d) last come the terms in powers of y only, beginning with the lowest. As between two terms in mixed powers xmyn and xm′yn′ of x and y, the applicable rules, in order of precedence, are: (a) the term in the greater of m and m′ comes first; and (b) the term in the greater of n and n′ comes last.
When drawing a correspondence between a codeword c as a polynomial, and the same codeword c as a vector, the order of the coefficients is typically maintained.
As noted above, the individual characters of the message word and of the codeword are drawn from an alphabet. If the alphabet corresponds to Φ2, these characters are advantageously sent as binary bits. If the alphabet contains more than two characters, then each character may, for example, be sent as an n-tuple of binary bits, or it may be sent using a multilevel code, or in one of many other forms known to those in the art for sending characters selected from non-binary alphabets.
In a specific, illustrative example, the selected field is Φ2, and the selected elliptic curve is y2+y=x3+x. Over the selected field, this curve has five points, including the point at infinity.
The addition rule for the selected elliptic curve over the selected field reduces to the following, for P1=(x1, y1), P2=(x2, y2), P3=(x3, y3)=P1+P2:
As noted above, addition brings about the same result as subtraction for any binary field. Thus, in particular, addition in Φ2, which is addition modulo 2, has the property that adding an increment of unity (i.e., +1) brings about the same result as subtracting a unity increment.
Next, letting μ=5, we will find the 5-division polynomial ψ5(X, Y), in the expectation that all five points of the elliptic curve will be among the twenty-five 5-torsion points of the curve. By definition, a point P=(x, y) of the curve is a 5-torsion point if 5P=0, where the multiplication by 5 is scalar multiplication. The preceding expression can be rewritten as −P=4P.
By application of the addition rules, −P is found to equal (x, y+1), and 4P is found to equal (x16,x24+x12+x8+x6+x4+x3+x2+x+y+1). The x-component of −P is subtracted from the x-component of 4P to obtain a polynomial expression that is set equal to zero, and similarly for the y-components of −P and 4P. Each of the resulting polynomial expressions is then factored over the field 42. The result is that for the x-components,
0=x(x+1)(x2+x+1)(x4+x+1)(x4+x3+1)(x4+x3+x2+x+1)
and for the y-components,
0=x(x+1)2(x2+x+1)(x3+x+1)(x4+x3+1)(x4+x3+x2+x+1)(x8+x7+x3+x2+1)
The common solution of the just-preceding two equations is the desired 5-division polynomial; that is,
ψ5(x)=x(x+1)(x2+x+1)(x4+x3+1)(x4+x3+x2+x+1)
The above polynomial is a polynomial in x only. This will often be the case for the μ-division polynomial when the elliptic curve is defined over a binary field. It will always be the case when the field is binary and the elliptic curve conforms to what is referred to as “Weierstrass” form. There is an advantageous computational simplification when the μ-division polynomial is a polynomial in x only.
We now construct the code. For g(x), we can take any divisor of ψ5(x). Here, we choose g(x)=x(x+1)(x2+x+1)=x4+x. The degree of g(x) is less than the degree of ψ5(X,Y) by the integer amount k′=8. Consequently, the sequence of individual characters that make up each message word will correspond to a polynomial m(x,y) over Φ2 of maximum degree 7 in x, and of maximum degree 1 in y. Therefore, the maximum number of terms in this polynomial m(x,y) (and hence the maximum number of characters in the message word) is 16.
The code, then, consists of all polynomials over Φ2 of the form m(x,y)g(x). The code length is 24. This length is obtained by counting 12 terms in powers of x only, including a constant term (zeroeth power of x), and a like number of terms in y.
Those skilled in the art will appreciate that the code described above is the ideal generated by g(x) in the quotient ring Φ2[X, Y]/(E, ψ5(X,Y)), where Φ2[X, Y] is the field of polynomials over Φ2, and E is the selected elliptic curve.
The example provided above is illustrative only, and not meant to be limiting. For example, numerous other elliptic curves can be selected, and numerous fields, both binary and non-binary, other than Φ2 can be selected.
Letting E represent any selected elliptic curve, and letting K represent any selected field, the concepts described above can be generalized further by letting the code be any ideal in the quotient ring K [X, Y]/(E, ψμ(X,Y)), where μ, as before, is a selected order for the μ-torsion points and the corresponding division polynomial. In this more general case, it will typically be necessary to define an ordering on the monomials—i.e., on the single-term expressions—in the quotient ring. Such an ordering is readily defined using well-known techniques from Groebner basis theory. Such techniques are described, for example, in D. Cox et al., Ideals, Varieties, and Algorithms, Springer-Verlag, New York, 1992.
In one example of code design, a μ-torsion point P is selected, and the set of the first d scalar, integer multiples {P, 2P, . . . , dP} is taken. The positive integer d is advantageously selected to be a desired minimum code distance for the resulting code. The designed code is the ideal corresponding to the set of d points described above. As is well known, every ideal is generated by a finite set of polynomials. The generator polynomials for the designed code are readily found by applications of Groebner basis theory. The codewords are formed by taking products of message strings with generator polynomials and summing the products. In preliminary studies, we have found that codes designed in this manner tend to have a minimum code distance that is greater than d.
Even more generally, the code can be defined over a ring rather than a field. As noted above, a code defined over a ring which is not a field is, strictly speaking, a nonlinear code. Given a linear code defined as the ideal I in the quotient ring Φp[X, Y]/(E, ψμ(X,Y)), a new, generally non-linear code can be defined by a procedure known as “lifting.” The theoretical basis for lifting is provided by a well-known theorem known as Hensel's Lemma.
Define q, as above, as the integer pr, where p is the characteristic of the field Φp, and r is a positive integer. Generalize the elliptic curve E by now defining it over the ring Zq. The ring Zq has the structure of the integers 0, 1, . . . , (q−1), modulo q. It should be noted that a modular mapping from Zq to Φp is defined by identifying with each x∈□q an image in Φp obtained by taking x modulo p.
The generalized curve is here denominated Eq. The coefficients of the equation defining the original elliptic curve E are replaced by the corresponding elements of Zq. The coefficients of Eq will reduce to the coefficients of E when the coefficients are taken modulo p.
Find the μ-division polynomial
of Eq over the ring Zq. An application of Hensel's Lemma will lead to a generator polynomial gq(x,y) which divides
and which transforms back to the generator function for I when its coefficients are taken modulo p. The lifted code is the ideal generated by gq(x,y) over the quotient ring
The pertinent mathematical procedures, deriving from Hensel's Lemma, are described, for example, in F. P. Gouvea, p-adic Numbers: an Introduction, 2d Ed., Springer-Verlag, 1997.
Even in the general cases described above, the codes described here are readily decoded using any standard algorithm for decoding linear codes. To make it effective, the decoder is provided the parameters used for encoding the message.
The following explicit formulas will be useful for defining the division polynomials when the characteristic of the selected field is at least 5 and the elliptic curve has the form y2=x2+bx+c:
ψ1=1; ψ2=2y; ψ3=3x4+6bx2+12cx−b2;
ψ4=4y(x6+5bx4+20cx3−5b2x2−4bcx−8c2−b3);
Given the integer q=pn, where p is a prime number and n is an integer, an elliptic curve E defined over the field Φq and containing a finite number of points #(E) is said to be supersingular if |#(E)−(q+1)| is divisible by p.
The following explicit formulas will be useful for defining the division polynomials when the characteristic of the selected field is 2, the selected elliptic curve is not supersingular and has the form y2+xy=x3+a6, and a6 is a non-zero element of the field:
ψ0; ψ1=1; ψ2=x; ψ3=x4+x3+a6; ψ4=x6+a6x2;
The codes described above can be used wherever a block code is conventionally used. One possible application of our codes, for example, is for the encoding of short segments of information to be sent on the control channel of a wireless system between a mobile station and the network.
The codes that are described here lend themselves to a highly efficient method of error detection. As noted above, each of the codes described here may be understood as an ideal over a ring of polynomials. Therefore, a received codeword can be checked for error by determining whether it is an element of the ideal that corresponds to the code. Thus, error-detection is carried out by testing for ideal membership. Algorithms for testing for ideal membership are well-known. More specifically, it is known from the theory of Groebner bases that, given an ideal I having a Groebner basis G, a given polynomial will lie in I only if G divides the polynomial, with zero remainder. Standard algorithms are available for carrying out such a division. The pertinent theory of Groebner bases is described, for example, in the book by Cox cited above.
Given a code, the encoding and decoding of messages is readily performed using any conventional apparatus for encoding and decoding block codes. The computational steps involved in designing a code according to the methods described above are readily carried out using, by way of example and not of limitation, a digital computational device under the control of an appropriate software program.
Number | Name | Date | Kind |
---|---|---|---|
4979174 | Cheng et al. | Dec 1990 | A |
5363107 | Gertz et al. | Nov 1994 | A |
5768296 | Langer et al. | Jun 1998 | A |
6038577 | Burshtein | Mar 2000 | A |
6484192 | Matsuo | Nov 2002 | B1 |
6611597 | Futa et al. | Aug 2003 | B1 |
6628728 | McCarty, Jr. | Sep 2003 | B1 |
6671709 | Glaser et al. | Dec 2003 | B1 |
6721771 | Chang | Apr 2004 | B1 |
6728052 | Kondo et al. | Apr 2004 | B1 |
Number | Date | Country | |
---|---|---|---|
20030212945 A1 | Nov 2003 | US |