A portion of the disclosure of this patent document contains material which is subject to copyright protection. The copyright owner has no objection to the facsimile reproduction by anyone of the patent disclosure, as it appears in the Patent and Trademark Office patent files or records, but otherwise reserves all copyright rights whatsoever.
The disclosed embodiments relate generally to asymmetric cryptography, and in particular to small memory implementations of fast elliptic encryption (FEE).
Since the advent of public-key cryptography, numerous public-key cryptographic systems have been proposed. Today, only three types of systems are still considered secure and efficient: integer factorization systems, discrete logarithm systems and elliptic curve cryptography (ECC) systems. The security afforded by integer factorization rests on the difficulty of factoring a large product of two prime numbers. The security of discrete logarithm systems rests on the difficulty of solving the discrete logarithm problem (DLP). The security of ECC systems rests on the difficulty of solving the elliptic curve DLP (ECDLP), which amounts to finding a logarithm in a group of points defined on an elliptic curve over a prime field. ECC's advantage over other systems is that the difficulty of its inverse operation grows faster with increasing key length, so that shorter keys suffice for a given security level, making ECC suitable for portable devices having small form factors with limited power and memory.
In recent years FEE has emerged as a strong option for memory constrained devices due, in part, to its speed and efficiency. FEE uses special primes and fast finite field and modular operations to reduce processor cycles, leading to less power consumption and heat dissipation. Various embodiments of FEE systems are described in U.S. Pat. No. 6,307,935, issued Oct. 23, 2001, entitled “Method and Apparatus For Fast Elliptic Encryption With Direct Embedding,” and U.S. Pat. No. 6,285,760, issued Sep. 4, 2001, entitled “Method and Apparatus For Digital Signature Authentication,” each of which is incorporated herein by reference in its entirety. Although existing FEE systems provide reduced processor cycles, there remains a need for small memory implementations of FEE.
The disclosed embodiments provide small memory implementations of FEE. In one embodiment, a method of generating a digital signature includes generating a first random number from a finite field of numbers, and generating field elements defining a first point on an elliptic curve defined over the finite field of numbers by performing elliptic curve arithmetic on the first random number and an initial public point on the elliptic curve. The method continues by generating a product from a field element, a private key, and a second random number received from a challenger seeking verification of a digital signature, and generating a signature component by summing the product and the first random number. The signature component is reduced using one or more modular reduction operations, using a modulus equal to an order of the elliptic curve, and then the reduced signature component and the field elements are sent to the challenger as a digital signature for verification by the challenger.
The following description of embodiments assumes that the reader has a basic understanding of ECC and its underlying mathematics. A detailed discussion of ECC can be found in numerous publicly available references, such as D. Hankerson, A. Menezes, and S. Vanstone, Guide to Elliptic Curve Cryptography, Springer-Verlag, 2004. Asymmetric cryptographic systems and applications using Fast Elliptic Encryption (FEE) are described in U.S. Pat. Nos. 5,159,632 and 6,285,760.
The SFEE embodiments described herein were developed based on certain assumptions and defining constraints. First, an elliptic curve over finite field Fp for odd prime p was assumed to have an equation (in Montgomery form) given by
$$y^2 = x^3 + cx^2 + x, \quad c \neq \pm 2, \tag{1}$$
wherein the parameters x and y are field elements describing a point (x, y) on the elliptic curve if x and y satisfy equation (1).
Next, it was assumed that the prime characteristic p is given by
$$p = w^s - k, \quad k \in [1, w-1], \quad k \equiv 1 \pmod 4, \tag{2}$$
wherein w is a word size for the field arithmetic (e.g., 16 bits), and s is a chosen exponent, which can be used to set the security level of the SFEE.
For SFEE, specific elliptic curves are selected such that the curve order o is of the form
$$o = w^s - j, \tag{3}$$
wherein j>0 and initial public points are selected with point orders that divide the curve order o and contain the largest prime factor of the curve order o. Note that in some embodiments where, for example, 16×16, 32-bit machine operations are available, $w = 2^{16}$ is an efficient choice for a word size. One may think of the exponent s as a security level, subject to the difficulty of breaking the ECDL problem. Thus, if $w = 2^{16}$ and s=10, then 160-bit encryption can be achieved, i.e., $(2^{16})^{10} = 2^{160}$. In some embodiments, extra optimization is possible when c in equation (1) is set equal to 4.
The foregoing SFEE assumptions and defining constraints imply both $p, o < w^s$. It also follows from the Hasse theorem that $0 < j < w^{1+s/2}$, so that j can be about half the allocation of a typical (mod p) residue, if desired. It is also noted that Montgomery curves typically cannot have a prime curve order o. Rather, the curve order o should be divisible by 4. For maximum security, however, p can be selected such that o is divisible by a large prime.
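The upper bound on j can be checked directly from the Hasse theorem, which bounds the curve order by $|o - (p+1)| \le 2\sqrt{p}$. The short derivation below uses only equations (2) and (3); the conditions $w \ge 4$ and $s \ge 2$ are assumptions added here solely to close the last inequality, and the positivity of j is the defining constraint on equation (3) rather than a consequence of the theorem.

```latex
\begin{align*}
|o - (p + 1)| \le 2\sqrt{p}
  &\;\Longrightarrow\; |(w^s - j) - (w^s - k + 1)| = |k - 1 - j| \le 2\sqrt{p} < 2w^{s/2} \\
  &\;\Longrightarrow\; j \le (k - 1) + 2\sqrt{p} < w + 2w^{s/2} \le w^{1 + s/2}.
\end{align*}
```

For $w = 2^{16}$ and s=10 this gives $j < 2^{96}$, so j fits in six 16-bit words.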
To facilitate discussion regarding the memory saving features of the disclosed embodiments, the software types “lGiant” and “bGiant” will be assumed. lGiant means a “little giant” type and bGiant means a “big giant” type. The actual number of bytes used to represent an lGiant or bGiant is dependent on the size in bits of the lGiant or bGiant.
Although lGiant and bGiant types are platform dependent, in some embodiments, these types have constrained upper limits. For example, an lGiant variable type is defined to have at most s+1 digits (i.e., arithmetic words) and one word to count significant digits. Thus, the number of significant digits in an lGiant is less than or equal to s+1. For example, if s=10 and a word is 16 bits, then an lGiant would occupy 22 bytes of storage (s+1 = 11 digits of 2 bytes each). A bGiant type has at most 2s digits. Thus, the number of significant digits in a bGiant is less than or equal to 2s. For example, if s=10 and a word is 16 bits, then a bGiant would occupy 40 bytes of storage. The reason for the (s+1) digit upper limit on the lGiant type is that certain field arithmetic functions (such as adding two lGiants) result in another lGiant. This is due to each lGiant being a field element in the interval [0, p−1] and $2p - 2 < w^{s+1}$. Similarly, a vector multiply function used in SFEE, such as
vecmulg(a, x); //Replace x with a*x, (4)
where a is one word and x is a field element, results in a modified x which is still an lGiant, because $ax \le (w-1)(w^s-1) < w^{s+1}$. Note that an example code layout for the function vecmulg(a, x) is included in Appendix A hereto.
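The digit bound above can be illustrated with a minimal C sketch of a word-by-giant multiply. This is not the Appendix A listing: the struct layout, the field names `sign` and `n`, and the fixed exponent `S` are assumptions made for illustration, with the type name spelled lGiant (lowercase L). The sketch assumes 16-bit words, so each partial product plus carry fits in 32 bits.

```c
#include <assert.h>
#include <stdint.h>

#define S 10                        /* exponent s; an assumption for this sketch */

/* Hypothetical layout: one count word plus at most s+1 digits, least significant first. */
typedef struct {
    uint16_t sign;                  /* number of significant digits */
    uint16_t n[S + 1];              /* digits in base w = 2^16 */
} lGiant;

/* Replace x with a*x, where a is one word and x has at most s significant digits. */
static void vecmulg(uint16_t a, lGiant *x)
{
    uint32_t carry = 0;
    assert(x->sign <= S);           /* a field element is below w^s */
    for (uint16_t i = 0; i < x->sign; i++) {
        /* At most (w-1)^2 + (w-1) < 2^32, so no overflow. */
        uint32_t t = (uint32_t)a * x->n[i] + carry;
        x->n[i] = (uint16_t)(t & 0xFFFFu);
        carry = t >> 16;
    }
    if (carry != 0)
        x->n[x->sign++] = (uint16_t)carry;   /* the product grows by at most one digit */
    while (x->sign > 0 && x->n[x->sign - 1] == 0)
        x->sign--;                            /* a == 0 collapses x to zero */
}
```

Because the input has at most s digits and the carry out of the top digit is a single word, the result never needs more than the s+1 digits reserved in the structure.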
Various features of SFEE were developed to provide tight control of array sizes to ensure the smallest possible memory footprint. In some embodiments, the arithmetic for the SFEE system is unsigned and field elements are constrained to lGiant integer types by forcing the field elements to reside in the interval [0, p−1], where p is the field prime characteristic. Also, there are no subtract operations in some embodiments of SFEE. Rather, negation of a field element y can be handled modulo p, using the identity
$$(-y) \bmod p \equiv \left(w^s - (y + k)\right) \bmod p. \tag{5}$$
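The identity follows in one step from equation (2), and the intermediate value stays non-negative for every field element, which is what makes it usable in unsigned arithmetic:

```latex
\begin{align*}
w^s - (y + k) &= (w^s - k) - y = p - y \equiv -y \pmod{p}, \\
0 \le y \le p - 1 &\;\Longrightarrow\; 1 \le w^s - (y + k) \le p .
\end{align*}
```

For y=0 the expression evaluates to p itself, which reduces to 0 modulo p.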
In some embodiments of SFEE, modular operations, whether in the field or not, use a “feemod” procedure involving shifts, multiplications, and adds. That is, there are no explicit divisions, which are speed-costly operations. An example of such an efficient “feemod” procedure involving only shifts, multiplications and adds is included in Appendix A. Note that in embodiments involving signature schemes, there can be extra modular reductions with respect to, for example, the point order, which can also benefit from the “feemod” function included in Appendix A. Another feature of SFEE is the avoidance of field inversion operations, which are costly; this is achieved by using Montgomery coordinate pairs (x, z) throughout SFEE.
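The division-free reduction can be sketched in C on a toy field. This is not the Appendix A “feemod” function: the name `feemod_toy` and the toy parameters ($w = 2^{16}$, s=2, k=5, so that $p = 2^{32} - 5$, which is prime) are assumptions chosen so that every value fits in a 64-bit integer. The idea is that $w^s \equiv k \pmod p$, so the high part of a value can be folded into the low part with one shift, one multiply and one add.

```c
#include <assert.h>
#include <stdint.h>

#define TOY_BITS 32u                 /* toy field: w = 2^16, s = 2, so w^s = 2^32 */
#define TOY_K    5u                  /* toy k, congruent to 1 mod 4; p = 2^32 - 5 */
#define TOY_P    4294967291ull       /* p itself, used here only for the final check */
#define TOY_MASK 0xFFFFFFFFull

/* Reduce x modulo p = 2^32 - k using only shifts, multiplies, adds and masks. */
static uint64_t feemod_toy(uint64_t x)
{
    /* Fold: x = hi*2^32 + lo is congruent to hi*k + lo, since 2^32 = k (mod p). */
    while (x >> TOY_BITS)
        x = (x >> TOY_BITS) * TOY_K + (x & TOY_MASK);
    /* Now x < 2^32. x >= p exactly when x + k carries past 2^32,
       and in that case the low 32 bits of x + k equal x - p. */
    if ((x + TOY_K) >> TOY_BITS)
        x = (x + TOY_K) & TOY_MASK;
    assert(x < TOY_P);
    return x;
}
```

Under the same assumptions, reduction modulo a curve order of the form $o = w^s - j$ would follow the same pattern with j in place of k, which is consistent with a single routine serving both moduli.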
For purposes of this embodiment, it is assumed that certain parameters have been specified, including the parameters s, k, j and word-size w, which are variables in equations (2) and (3) for the prime field characteristic p and the curve order o, respectively. There is an initial public point P1=(x1,1) specified on the elliptic curve with point order dividing o. In some embodiments, the signing device 104 has access to a private key K, which is an lGiant variable type and $K \in [2, o-2]$. Because o always has (s lg w) bits, where lg denotes the base-2 logarithm, the constraint on K can be easily handled by forcing the high bit of an s-word random lGiant to be 0. Then K will have at most ((s lg w)−1) significant bits, the values K=0, 1 are excluded, and the key constraint is effected with no modular operations.
It is also assumed that the signing device 104 has access to a public key Ppub=(xp, zp) defined by
$$(x_p, z_p) = K \cdot (x_1, 1), \tag{6}$$
where K·(x1,1) is obtained through elliptic multiplication. For instance, the public key may be obtained from a registry or other trusted source of public keys, or the public key may be obtained from the signing device and then validated by a trusted validation service. In some embodiments, each of the field elements xp and zp is an lGiant. Generally, if some lGiant $x \in [2, o-1]$ is required, it suffices to limit x to ((s lg w)−1) bits and exclude the values x=0, 1.
Referring again to
The signing device 104 receives the random number m from the unsecured channel and performs the signing operation, as described with respect to
The interface circuitry 202 includes circuitry for establishing and maintaining a connection and communication session with other devices or with a network. Such circuitry may include a transmitter, a receiver, line drivers, buffers, logic devices, signal conditioning circuitry, etc. If the signing device 104 is wireless, then the interface circuitry 202 would include appropriate wireless circuitry (e.g., a wireless transceiver) for establishing and maintaining a wireless communication session with another device or network. The random number generator 212 can be implemented in software or hardware or a combination of both.
In some embodiments, the challenging device 102 generates a random lGiant type integer $m \in [2, o-1]$ and sends it to the signing device 104 over a communication channel (shown as an unsecured channel in
$$(x_r, z_r) = r \cdot (x_1, 1), \tag{7}$$
wherein r is a random number generated by the random number generator 212 and (x1, 1) is an initial public point on the elliptic curve in Montgomery form. Note that in some embodiments, the random number r is an lGiant in the interval [2, o−1] and is further constrained to have a low Hamming weight (e.g., 48). The “1” bits, however, can be in any bit position.
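Elliptic multiplication in Montgomery (x, z) coordinates, as in equation (7), can be sketched with a standard Montgomery ladder. This is a generic textbook ladder, not the small memory routine of Appendix A: the names `mpoint`, `ell_double`, `ell_add`, `ell_mul` and `same_point` are hypothetical, the field is a toy one ($p = 2^{32} - 5$, an assumption so that products fit in 64 bits), c=4 as in the example parameters, and the `%` operator stands in for the division-free reduction.

```c
#include <assert.h>
#include <stdint.h>

#define TOY_P 4294967291ull          /* toy prime 2^32 - 5 (assumption) */
#define TOY_C 4ull                   /* curve parameter c of equation (1) */

typedef struct { uint64_t x, z; } mpoint;   /* projective pair (x, z), no y */

static uint64_t mulm(uint64_t a, uint64_t b) { return (a % TOY_P) * (b % TOY_P) % TOY_P; }
static uint64_t addm(uint64_t a, uint64_t b) { return (a % TOY_P + b % TOY_P) % TOY_P; }
static uint64_t subm(uint64_t a, uint64_t b) { return (a % TOY_P + TOY_P - b % TOY_P) % TOY_P; }

/* 2P: X' = (X^2 - Z^2)^2, Z' = 4XZ(X^2 + cXZ + Z^2). */
static mpoint ell_double(mpoint p)
{
    uint64_t xx = mulm(p.x, p.x), zz = mulm(p.z, p.z), xz = mulm(p.x, p.z);
    uint64_t d = subm(xx, zz);
    mpoint r;
    r.x = mulm(d, d);
    r.z = mulm(mulm(4, xz), addm(addm(xx, mulm(TOY_C, xz)), zz));
    return r;
}

/* P + Q, given the difference D = P - Q (differential addition). */
static mpoint ell_add(mpoint p, mpoint q, mpoint d)
{
    uint64_t a = subm(mulm(p.x, q.x), mulm(p.z, q.z));
    uint64_t b = subm(mulm(p.x, q.z), mulm(p.z, q.x));
    mpoint r;
    r.x = mulm(d.z, mulm(a, a));
    r.z = mulm(d.x, mulm(b, b));
    return r;
}

/* n*P for n >= 1. The invariant r1 - r0 = P holds throughout. */
static mpoint ell_mul(uint64_t n, mpoint p)
{
    mpoint r0 = p, r1 = ell_double(p);
    int top = 63;
    assert(n >= 1);
    while (!((n >> top) & 1))
        top--;                                   /* locate the highest set bit */
    for (int i = top - 1; i >= 0; i--) {
        if ((n >> i) & 1) { r0 = ell_add(r0, r1, p); r1 = ell_double(r1); }
        else              { r1 = ell_add(r0, r1, p); r0 = ell_double(r0); }
    }
    return r0;
}

/* Projective equality: x1/z1 == x2/z2 without any field inversion. */
static int same_point(mpoint a, mpoint b) { return mulm(a.x, b.z) == mulm(b.x, a.z); }
```

No field inversion appears anywhere in the ladder; results are compared by cross-multiplication, which mirrors the inversion-free style described for SFEE.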
The multiply module 208 forms a product xrKm using non-field multiplication, wherein xr is the x field element of the point (xr, zr) on the elliptic curve, K is a private key (e.g., a bGiant type) and m is the random number sent by the challenging device 102. Using non-field addition, the summing module 206 adds this product to the random number r to form the sum xrKm+r. The mod module 204 reduces this value by the curve order o using fast modular operations (e.g., the “feemod” function in Appendix A) to produce a signature component u given by
$$u := (x_r K m + r) \bmod o. \tag{8}$$
The signature component u and the field elements xr, zr are then sent to the challenging device 102 as a digital signature packet via the interface circuitry 202. Note that u is an lGiant because of the defining constraint $o < p$, and therefore at most a 3s+3 word signature is sent to the challenging device 102 for verification.
The challenging device 102 receives the signature packet (u, xr, zr) from the signing device 104. The elliptic multiplier module 310 computes the point
$$(x, z) = u \cdot (x_1, 1), \tag{9}$$
wherein u is the signature component of the signature packet received from the signing device 104. The point (x, z) is sent to the compare module 302 where it is used to validate the digital signature.
Next, the multiplication module 306 uses non-field multiplication to form a product xrm from the field element xr received from the signing device 104 and the random number m generated by the random number generator 308. This is the same random number m previously sent by the challenging device and used by the signing device to produce its digital signature. The product xrm is sent to the mod module 304, where it is reduced to a temporary component h using FEE modular operations and a modulus set equal to the curve order o. Thus, the multiplication and modular operations give
$$h = x_r m \bmod o. \tag{10}$$
The elliptic multiplier module 310 receives the temporary component h and a public key represented by the public point (xp, zp) on the elliptic curve, and performs an elliptic multiplication on these values to give
$$(x_v, z_v) = h \cdot (x_p, z_p). \tag{11}$$
After computing equation (11), the points (xv, zv) and (xr, zr) are then sent to the compare module 302 where they are used to validate or invalidate the signature sent by the signing device 104. In some embodiments, the compare module 302 uses the points (xv, zv) and (x, z), and the point (xr, zr) sent by the signing device 104 to determine whether there is an elliptic identity given by
$$(x_r, z_r) \pm (x_v, z_v) = (x, z), \tag{12}$$
wherein the elliptic identity is determined by the algebraic expression
$$(x_r z_v - z_r x_v)^2 x^2 - 2xz\left[(x_r x_v + z_r z_v)(x_r z_v + x_v z_r) + 2c\,x_r x_v z_r z_v\right] + (x_r x_v - z_r z_v)^2 z^2 = 0. \tag{13}$$
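One way to see why a valid signature satisfies identity (12) is to expand equation (9) using equations (6), (7), (8), (10) and (11). Because the point order of (x1, 1) divides o, reducing a multiplier modulo o does not change the resulting point:

```latex
\begin{align*}
u \cdot (x_1, 1) &= \big((x_r K m + r) \bmod o\big) \cdot (x_1, 1) \\
  &= r \cdot (x_1, 1) + (x_r m \bmod o) \cdot \big(K \cdot (x_1, 1)\big) \\
  &= (x_r, z_r) + h \cdot (x_p, z_p) \;=\; (x_r, z_r) + (x_v, z_v).
\end{align*}
```

The ± in identity (12) arises because (x, z) coordinates carry no y value and therefore cannot distinguish a point from its negative.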
In some embodiments, the sigcompare(xr, zr, xv, zv, x, z) function included in Appendix A calculates the algebraic expression modulo the prime p and returns TRUE if and only if the result is 0. In these embodiments, the sigcompare( ) function uses four auxiliary lGiant variables. Note that the sigcompare( ) function determines whether P = P1 ± P2 on an elliptic curve, without explicit elliptic addition, as described in U.S. Pat. No. 6,285,760.
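The comparison can be sketched in C on a toy field. This is not the Appendix A sigcompare( ) function: the name `sigcompare_toy`, the explicit curve parameter argument `c`, the toy prime $p = 2^{32} - 5$ and the use of `%` for reduction are all assumptions made for illustration. The sketch evaluates the homogeneous form of the algebraic expression, in which the final term carries a factor $z^2$ so that every term has the same degree in (x, z).

```c
#include <assert.h>
#include <stdbool.h>
#include <stdint.h>

#define TOY_P 4294967291ull          /* toy prime 2^32 - 5 (assumption) */

static uint64_t mulm(uint64_t a, uint64_t b) { return (a % TOY_P) * (b % TOY_P) % TOY_P; }
static uint64_t addm(uint64_t a, uint64_t b) { return (a % TOY_P + b % TOY_P) % TOY_P; }
static uint64_t subm(uint64_t a, uint64_t b) { return (a % TOY_P + TOY_P - b % TOY_P) % TOY_P; }

/* TRUE iff (x, z) = (xr, zr) +/- (xv, zv) on y^2 = x^3 + c x^2 + x, with no elliptic addition. */
static bool sigcompare_toy(uint64_t c,
                           uint64_t xr, uint64_t zr,
                           uint64_t xv, uint64_t zv,
                           uint64_t x,  uint64_t z)
{
    /* First term: (xr*zv - zr*xv)^2 * x^2 */
    uint64_t a = subm(mulm(xr, zv), mulm(zr, xv));
    uint64_t first = mulm(mulm(a, a), mulm(x, x));

    /* Middle term: 2*x*z * [ (xr*xv + zr*zv)(xr*zv + xv*zr) + 2c*xr*xv*zr*zv ] */
    uint64_t s1 = addm(mulm(xr, xv), mulm(zr, zv));
    uint64_t s2 = addm(mulm(xr, zv), mulm(xv, zr));
    uint64_t cross = mulm(mulm(xr, xv), mulm(zr, zv));
    uint64_t bracket = addm(mulm(s1, s2), mulm(mulm(2, c), cross));
    uint64_t middle = mulm(mulm(2, mulm(x, z)), bracket);

    /* Last term: (xr*xv - zr*zv)^2 * z^2 */
    uint64_t d = subm(mulm(xr, xv), mulm(zr, zv));
    uint64_t last = mulm(mulm(d, d), mulm(z, z));

    return subm(addm(first, last), middle) == 0;
}

/* Usage: with P = (2, 1) and c = 4, doubling gives 2P = (9, 104). */
static void sigcompare_toy_demo(void)
{
    assert(sigcompare_toy(4, 2, 1, 2, 1, 9, 104));     /* P + P = 2P */
    assert(sigcompare_toy(4, 9, 104, 2, 1, 2, 1));     /* 2P - P = P */
}
```

The two cases in the demo can be checked by hand: for the first, the expression reduces to −97344 + 97344, and for the second to 158404 − 165800 + 7396.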
The process 400 begins when a connection is established (step 402) with a challenging device 102. In some embodiments, the challenging device 102 can be plugged directly into a port (e.g., USB, FireWire™, Ethernet, PCI slots, etc.) of the signing device 104 or vice versa, or otherwise attached via a cable or other physical medium. In other embodiments, a wireless connection is established between the challenging device 102 and the signing device 104 using known wireless protocols and techniques (e.g., IEEE 802.11, etc.). The challenging device 102 and signing device 104 can be physically separate devices from the devices that desire to communicate. For example, one or both devices 102, 104 can be a key or dongle (e.g., Xkey™) that is coupled to a port on one or two other devices.
After a connection is established, the challenging device 102 generates and sends a random number m to the signing device 104 as a challenge. The signing device 104 receives the random number m (step 404) and generates another, different, random number r (step 406). In some embodiments, the random numbers m and r are generated local to the devices 102, 104. In other embodiments, the random numbers are generated elsewhere (e.g., network computer) and provided to the devices 102, 104. For example, the random numbers m and r may be downloaded from the Internet or other network as part of a registration process.
Upon generation of a random number r, the signing device 104 computes the public point (xr, zr) from r and an initial public point (x1,1), as previously described with respect to
The process 500 begins when a signing device 104 is detected (step 502). Upon detection of a signing device 104, the challenging device 102 generates a random number m, sends it to the signing device 104 (step 504) as a challenge, then waits for a signature packet (u, xr, zr) from the signing device 104. When the challenging device 102 receives the signature packet (step 506) it computes the public point (x, z) from the signature component u and the initial public point (x1, 1) (step 508), as described with respect to
The signing device 600 can optionally include one or more control devices 605 (e.g., mouse and keyboard, or keypad, touch sensitive display, etc.) and may optionally include a display device 607 (e.g., CRT, LCD, etc.) for enabling a user to communicate and control various aspects of the signing device architecture 600. The communications interface 604 can be a port, network interface card, wireless interface card and the like. In some embodiments, the communications interface is a USB or FireWire™ port for connecting directly with a challenging device 102 or indirectly through a network.
The computer-readable medium 608 includes an operating system 610 (e.g., Mac O/S, Linux, Windows™, Unix, etc.) having various software components and drivers for controlling and managing various tasks (e.g., memory management, hard disc control, power management, etc.). A network communication module 612 includes software programs and/or protocol stacks for establishing and maintaining communication links with other devices or networks via the communications interface 604. The computer-readable medium 608 also includes a signature generation module 614, which includes various software components containing code or instructions for performing or controlling the signature generation process 400 described with respect to
In some embodiments, the curve parameter structure 620 is used to define a complete set of curve parameters. Preferably, the curve parameter structure 620 has a total word size less than a single lGiant's allocation. An example of such a curve parameter structure 620 is as follows:
Note that the curve parameter structure 620 disclosed above does not explicitly store the field prime characteristic p or the curve order o. Only one lGiant type is used and all other entries are significantly smaller “word16” types. In this embodiment, the “word16” type is an unsigned integer of 16 bits. If desired, once j is known, the lGiant type can be changed to an even smaller type, since j will typically be about one half the size of an lGiant type. Assuming a word size of $w = 2^{16}$, a suitable curve parameter structure 620 would be:
par->s = 10; // Selected for desired level of security.
par->k = 57; // Selected so the field prime is p = 2^160 - 57 (which is not explicitly stored).
par->j = 1347399065782960596453580; // Selected so the curve order is o = 2^160 - j.
par->x1 = 30; // Selected so the public point is P1 := (30, 1), with point order dividing o.
par->c = 4; // Selected to provide extra optimization.
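The assignments above can be made concrete with a C sketch of such a structure. The layout, the type name `curveParams`, the field order and the helper names are hypothetical and are not the structure 620 itself; only the five parameter values come from the assignments above. Since a 25-digit decimal literal does not fit any built-in C integer, the sketch stores j as base-$2^{16}$ digits, least significant first, and includes a small helper that cross-checks the digits against the decimal value.

```c
#include <assert.h>
#include <stdint.h>

typedef uint16_t word16;

#define S_MAX 10   /* assumption: largest exponent s supported by this sketch */

/* Hypothetical giant layout: digit count, then digits in base 2^16, least significant first. */
typedef struct {
    word16 sign;
    word16 n[S_MAX + 1];
} lGiant;

/* Hypothetical field names; p and o are never stored, only s, k and j. */
typedef struct {
    word16 s;    /* exponent in p = w^s - k and o = w^s - j */
    word16 k;    /* field prime offset */
    lGiant j;    /* curve order offset */
    word16 x1;   /* x-coordinate of the initial public point (x1, 1) */
    word16 c;    /* Montgomery parameter c of equation (1) */
} curveParams;

static void load_example_params(curveParams *par)
{
    /* j = 1347399065782960596453580 = 0x1 1D52 A653 CF65 575C 08CC */
    static const word16 j_digits[6] = { 0x08CC, 0x575C, 0xCF65, 0xA653, 0x1D52, 0x0001 };
    par->s = 10;
    par->k = 57;
    par->x1 = 30;
    par->c = 4;
    par->j.sign = 6;
    for (int i = 0; i <= S_MAX; i++)
        par->j.n[i] = (i < 6) ? j_digits[i] : 0;
}

/* Reduce a giant modulo a small nonzero integer m, e.g. to cross-check decimal digits. */
static uint32_t lgiant_mod_small(const lGiant *g, uint32_t m)
{
    uint64_t r = 0;
    assert(m != 0);
    for (int i = g->sign - 1; i >= 0; i--)
        r = ((r << 16) | g->n[i]) % m;    /* Horner step in base 2^16 */
    return (uint32_t)r;
}
```

In this sketch j occupies only six of the eleven available digits, which illustrates why a smaller type could be substituted once j is known.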
With the above parameter assignments, P1=(30, 1) has a point order = curve order = $o := w^{10} - j$. The curve order o can be factored as:
Thus, the point order of x1, which is also the curve order o, is minimally composite. However, security is still afforded because of the large prime factor of the order. It is well-known that signature schemes work when the order is minimally composite.
The computer-readable medium 708 includes an operating system 710 (e.g., Mac O/S, Linux, Windows, Unix, etc.) having various software components and drivers, executable by the processor(s) 702, for controlling and managing various tasks (e.g., memory management, hard disc control, power management, etc.). The network communication module 712 includes software programs and/or protocol stacks (executable by the processor(s) 702) for establishing and maintaining communication links with other devices or a network via the communications interface 704. The computer-readable medium 708 also includes a signature verification module 714, which includes various software components containing code or instructions for generating the various steps of the signature verification process 500 described with respect to
An advantage of the disclosed embodiments is the use of unsigned finite field arithmetic. Appendix A includes code layouts of examples of functions that can be used in SFEE calculations. These example functions have been specifically designed for small memory environments and minimize the amount of memory allocated to a memory stack. In some embodiments, the amount of storage allocated for the memory stack to perform a signature generation operation does not exceed the amount of storage required to store several lGiant values, one bGiant value and a predetermined number of n-byte length fields per lGiant or bGiant. In some embodiments, the amount of storage allocated for storing temporary values in a memory stack to perform a signature generation operation does not exceed an amount of storage associated with 7 lGiants and 1 bGiant, plus a 2-byte length field. In some embodiments, the amount of storage allocated for storing temporary values in a memory stack to perform a signature verification operation does not exceed an amount of storage associated with 8 lGiants and 1 bGiant, plus a 2-byte length field. In an embodiment in which an lGiant value requires 22 bytes of storage and a bGiant value requires 40 bytes of storage, the stack allocation required to store temporary values while performing a signature generation operation does not exceed 200 bytes, plus a small amount of memory for storing procedure return information. The stack allocation required to perform a signature verification operation does not exceed 230 bytes, plus a small amount of memory for storing procedure return information.
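The byte counts quoted above can be reproduced with a few lines of C arithmetic. The constant names are hypothetical, and the sketch assumes one reading of the text, namely that a single 2-byte length field is added to the total; the 22-byte and 40-byte sizes are those given above for s=10 and 16-bit words.

```c
#include <assert.h>

enum {
    WORD_BYTES   = 2,                              /* 16-bit words */
    S_EXP        = 10,                             /* exponent s */
    LGIANT_BYTES = (S_EXP + 1) * WORD_BYTES,       /* s+1 digits: 22 bytes */
    BGIANT_BYTES = 2 * S_EXP * WORD_BYTES,         /* 2s digits: 40 bytes */
    LENGTH_BYTES = 2,                              /* one 2-byte length field (assumed reading) */

    /* Temporaries for signature generation: 7 lGiants and 1 bGiant. */
    GEN_STACK_BYTES = 7 * LGIANT_BYTES + BGIANT_BYTES + LENGTH_BYTES,
    /* Temporaries for signature verification: 8 lGiants and 1 bGiant. */
    VER_STACK_BYTES = 8 * LGIANT_BYTES + BGIANT_BYTES + LENGTH_BYTES
};

/* Compile-time checks against the stated 200-byte and 230-byte limits. */
static_assert(GEN_STACK_BYTES <= 200, "signature generation temporaries exceed 200 bytes");
static_assert(VER_STACK_BYTES <= 230, "signature verification temporaries exceed 230 bytes");
```

Under these assumptions the totals come to 196 bytes for generation and 218 bytes for verification, both within the stated limits.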
In some embodiments, the amount of storage allocated for storing temporary variables to perform a signature generation operation does not exceed an amount of storage associated with 10 lGiants, where the storage associated with an lGiant is the amount of storage required to store the largest value in the finite field of integers in which the signature verification operation is performed. Similarly, in some embodiments, the amount of storage allocated for storing temporary variables to perform a signature verification operation does not exceed an amount of storage associated with 11 lGiants, where the storage associated with an lGiant is the amount of storage required to store the largest value in the finite field of integers in which the signature verification operation is performed.
The functions can be implemented in any suitable software language (e.g., “C” code, assembly language, etc.) or hardware (e.g., digital signal processors, ASICs, microprocessors, etc.). The example functions included in Appendix A assume a priori the existence of fundamental giant-integer operations, where each of the example functions listed in Table I below involves either lGiant types, bGiant types, or a combination of both.
Note that the “feemod” function includes the integer “whichmod” for enabling the use of a different modulus in the FEE modular reduction. For this particular embodiment, if whichmod=0, the prime characteristic p is used as the modulus. When whichmod !=0, then the curve order o is used as the modulus. One example of using whichmod !=0 would be for computing the signature component given by
$$u := (K \cdot m + r) \bmod o, \tag{14}$$
where m is a message to be encrypted and K is the public key of the challenging device 102 and not the k parameter in the field prime equation $p := w^s - k$. This basic signature scheme was described more fully in U.S. Pat. No. 6,285,760.
An advantage of the example functions included in Appendix A and summarized in Table I is the ability to operate in a small memory environment while still maintaining robust security. Specifically, these example functions provide small memory implementations of fast elliptic encryption (FEE) through the use of: 1) Montgomery algebra, 2) compact curve parameter structures, 3) small memory finite field arithmetic, 4) special fast primes, and 5) fast modular arithmetic with respect to either field prime characteristic p or curve order o. In addition to the signature generation and verification processes 400, 500, the functions in Appendix A can be used with any of the FEE techniques described in U.S. Pat. No. 6,285,760, with little or no modifications.
The example functions included in Appendix A reduce the total number of stack memory allocations during signature signing and verification.
Note that
The disclosed embodiments are not intended to be exhaustive or limited to the precise forms disclosed. Many modifications and variations to the disclosed embodiments are possible in view of the above teachings.
A. Code Layout For Finite Field Arithmetic
©2005 Apple Computer. All Rights Reserved.
B. Code Layout For FEE Modular Reduction Function
C. Code Layout For Signature Comparison Function
D. Code Layout For Small Memory Elliptic Multiplication
This application is related to U.S. Provisional Application No. 60/642,340, Attorney Docket No. APL1P344P/P3503US1, filed Jan. 7, 2005, entitled “Accessory Authentication For Electronic Devices,” which provisional application is incorporated herein by reference in its entirety. This application is related to U.S. Patent Application No. ______, Attorney Docket No. APL1/P344/P3503US1, filed Feb. 3, 2005, entitled “Accessory Authentication For Electronic Devices,” which application is incorporated herein by reference in its entirety.