Many crytosystems today, e.g. RSA encryption, are based on problems such as prime factorization or modular exponentiation. These problems are hard for conventional computers, but may not be for quantum computers. Thus new advances in quantum computing may render cryptosystems based on those problems unsecure. There is a need for new cryptosystems based on problems that are hard for both conventional computers and quantum computers.
Lattice-based problems can be used as the basis for cryptosystems. One example of a lattice problem may be the short integer solution problem, which relates to finding the shortest vector in an n-dimensional lattice of points. Lattice problems can be shown to be hard for both conventional and quantum computers. However, computing the cryptographic keys for lattice-based cryptosystems can be computationally expensive, and can result in large keys that also make it difficult to sign and verify messages. Therefore, there is a need to make lattice-based cryptography more efficient.
Embodiments of the present disclosure address these and other problems, individually and collectively.
Some embodiments of the present disclosure are directed to methods of generating a lattice-based cryptographic verification key and an associated trapdoor matrix. A generating device can generate a gadget matrix G. The generating device can then drop the first l columns of the gadget matrix G to generate a reduced gadget matrix F. Then, the generating device can sample a distribution to form a trapdoor matrix R. The trapdoor matrix R can function as a secret key for generating a digital signature, and may be an approximate trapdoor. The generating device can then use the reduced gadget matrix F and the trapdoor matrix R to generate a verification matrix A. The generating device can transmit the verification matrix A to one or more verification devices. The one or more verification devices can use the verification matrix A to verify a digital signature x generated by a sending device to sign a message m, if the digital signature x was generated using the trapdoor matrix R.
Other embodiments of the present disclosure are directed to methods of generating a lattice-based signature. A sending device can receive a message m. The message may be a hash h(m), or the sending device may hash the message m to generate the hash h(m). The sending device may also store a verification matrix A and a trapdoor matrix R. The sending device may generate a signature vector x using the verification matrix A and the trapdoor matrix R. The signature vector x may be generated such that, for some modulus q and an error vector e, the relationship Ax=h(m)+e mod q is satisfied. The modulus q can be selected to give a desired level of data security (e.g., selecting q=224 may give 100-bit security). The signing device may transmit the message m and the signature vector x to a verification device. The verification device may have previously received the verification matrix A, or may receive the verification matrix A from the sending device, and may use the verification matrix A to verify that the signature vector x was generated by the sending device. This may allow the verification device to verify that the message m is from the sending device as well.
Other embodiments of the present disclosure are directed to methods of verifying a lattice-based signature. A verification device can store a verification matrix A. The verification device can also receive a message m and a signature vector x from a sending device. The message may be a hash h(m), or the verification device may hash the message m to generate the hash h(m). The verification device may compute an error vector e and by solving Ax=h(m)+e mod q for some modulus q and quantify the size of the error vector e. The verification device can compare the error vector e to a threshold and verify the signature vector x if the error vector e is less than the threshold. For example, if the length of the error vector e is less than the dimension of the hash h(m).
Other embodiments of the present disclosure are directed to systems and computer readable media associated with the above-described methods.
These and other embodiments of the invention are described in further detail below with reference to the Figures and the Detailed Description.
Lattice-based cryptography, cryptography based on lattice problems, can provide cryptographic solutions that are robust especially with the development of quantum computing that can reduce the effectiveness of more traditional cryptosystems. Lattice problems include, for example, the inhomogeneous short integer solution (ISIS) problem, which relates to finding the shortest vector in an n-dimensional lattice of points. The inhomogeneous short integer solution (ISIS) problem asks to find a vector x that satisfies the relationship A·x=y(mod q). In this example, the vector y may be a message, or a hash of the message, and the vector x may be a signature associated with the message. Lattice problems can be computationally difficult, even with quantum computing, making them well suited for cryptographic applications. Solving lattice problems can be made easier with a trapdoor matrix, which can be a matrix that makes a hard problem (such as the ISIS problem) easy, or computable within a reasonable time. In the example of the ISIS problem, a trapdoor matrix can be used to compute the vector x without needing to solve the equation. Thus in a cryptographic application, a trapdoor can function as a secret key.
Different constructions of a trapdoor matrix can have different impacts on the size and usefulness of the lattice problem as a cryptosystem. Trapdoor matrices that solve exactly solve the ISIS problem may result in the public key (the verification matrix A) and the private key (the trapdoor matrix) being very large, and thus not very practical for signing messages and transmitting signatures.
Instead, embodiments can use an approximate trapdoor which relaxes the solution to the ISIS problems to find an approximate solution instead of an exact solution, i.e., A·x=y+e(mod q), with an error vector e. The signature can still be verified if the error vector is small. In particular, a gadget matrix (a matrix where columns comprise ascending powers of a small base) can be used to find an exact trapdoor matrix, and by removing a few columns of the gadget matrix, a reduced gadget matrix can be used to find an approximate trapdoor matrix. With the approximate solution, the signature x, the trapdoor matrix and the verification matrix A can be smaller and thus more practical.
In embodiments, a generating device can compute a verification matrix A and a trapdoor matrix R. The verification matrix A, functioning as a public key, can be distributed or even made public. In particular, the verification matrix A can be sent to a sending device, that will send signed messages, and a verification device, that will verify the signature on the messages to authenticate the sending device. The trapdoor matrix R, functioning as a private key, can be sent securely to the sending device, enabling the sending device to generate a signature.
In the past two decades, lattice-based cryptography has emerged as an active area of research. It can enable both advanced cryptographic capabilities, such as fully homomorphic encryption [Gen09]; and practical post-quantum secure public-key encryptions and signatures, as observed in the ongoing NIST post-quantum cryptography (PQC) standardization procedure [AAAS+19]. A large fraction of the lattice-based cryptosystems uses lattice trapdoors. Those cryptosystems include basic primitives like public-key encryption and signature schemes [GGH97, HPS98, HHP+03, GPV08], as well as advanced primitives such as identity-based encryption [GPV08, ABB10, CHKP12], attribute-based encryption [GVW13], and graded encodings [GGH15].
In this work, we focus on the trapdoor for the lattice-based one-way function defined by Ajtai [Ajt96], and its application in digital signatures [GPV08]. Given a wide, random matrix A, and a target vector y. The inhomogeneous short integer solution (ISIS) problem asks to find a short vector x as a preimage of y, i.e. A·x=y(mod q):
Without the trapdoor, finding a short preimage is proven to be as hard as solving certain lattice problems in the worst case [Ajt96]. A trapdoor for the matrix A, on the other hand, allows its owner to efficiently produce a short preimage. An explicit construction of the trapdoor for Ajtai's function was first given in [Ajt99] and later simplified by [AP11, MP12].
Towards the proper use of lattice trapdoors in cryptography, what really gives the trapdoor a punch is the work of Gentry, Peikert and Vaikuntanathan [GPV08]. They show how to sample a short preimage from a trapdoor-independent distribution, instead of a distribution which may leak information about the trapdoor (as observed by the attacks [GS02, NR06] on the initial attempts of building lattice-based signatures [GGH97, HHP+03]). The trapdoor-independent preimage sampling algorithm allows [GPV08] to securely build a hash-and-sign signature as follows: Let the matrix A be the public verification key, the trapdoor of A be the secret signing key. To sign a message m, first hash it to a vector y, then use the trapdoor to sample a short preimage x as the signature. The secret signing key is guaranteed to be hidden from the signatures, since the signatures are produced from a trapdoor-independent distribution.
Despite its elegant design, the hash-and-sign signature based on Ajtai's function suffers from practical inefficiency due to its large key size and signature size. Indeed, all the three lattice-based signature candidates that enter the second round of NIST PQC standardization [AAAS+19] are built from two alternative approaches—Falcon [FHK+18] is based on the hash-and-sign paradigm over NTRU lattices; Dilithium [DKL+18] and qTESLA [ABB+19] are based on the rejection sampling approach [Lyu12, BG14]. The suggested parameters for the three candidates lead to competitive performance measures. For example, for 128-bit security, the sizes of the public keys & signatures for all the three candidates are below 5 kB & 4 kB (respectively). By contrast, for the hash-and-sign signature based on Ajtai's function, the sizes of the public keys & signatures are more than 35 kB & 25 kB according to the implementation benchmarks of [BB13, BFRS18, GPR+18].
We can define a relaxed notion of lattice trapdoors called approximate trapdoors, which can be used to solve the ISIS problem approximately instead of exactly. The primary motivation is to improve the efficiency of hash-and-sign signatures based on Ajtai's one-way function. With a relaxation of the correctness requirement, it is possible to generate smaller public matrices, trapdoors, and preimages for Ajtai's function, which translate to smaller public-keys, secret-keys, and signatures for the hash-and-sign signature scheme.
Embodiments of the disclosure show that the gadget trapdoor proposed by Micciancio and Peikert [MP12] can be modified to an approximate trapdoor. In particular, we show how to use the approximate gadget trapdoor to sample preimages from a trapdoor-independent distribution. The analysis of the distribution can use linear transformations of discrete Gaussians on lattices.
Our approximate gadget trapdoor can be used together with existing optimization techniques, such as using the Hermite normal form and using a bigger base in the gadget, to improve the concrete performance of the hash-and-sign signature in the random oracle model under RingLWE and RingSIS assumptions. We show that the sizes of the public key & signature can be reduced to 5 kB & 4.45 kB for an estimation of 100-bit security, and 11.25 kB & 9.38 kB for an estimation of 192-bit security. Those are much closer to the sizes of the signatures based on the rejection sampling approach [Lyu12, B G14, DKL+18, ABB+19].
A. Example Cryptographic Context
In step 102, the generating device 105 can generate a verification key vk and a secret key sk. For example, the verification key vk may be a verification matrix A and the secret key sk may be a trapdoor matrix R. The generating device 105 can then publicly transmit the verification key vk to the verification device 125. An attacker 135 may also receive the verification key vk. The generating device 105 can also privately send the secret key sk to the sending device 115.
In step 104, the sending device 115 can receive a message m. The sending device 115 can then hash the message with a hashing function to generate a hashed message h(m). Alternatively, the sending device 115 may generate the message m or may receive the hash h(m).
In step 106, the sending device 115 can generate a signature sig based on the hash h(m) and the secret key sk. For example, the sending device 115 can generate the signature sig compute a function of the hash h(m) and the secret key sk. The sending device 115 may generate the signature sig such that the product of the verification key vk and the signature sig is the hashed message h(m).
In step 108, the sending device 115 can send the message m and the signature sig to a verification device 125. The attacker 135 may be able to intercept the message m and the signature sig as they are sent.
In step 110, the attacker 135 may attempt to modify the message m to generate a fraudulent message m′. The attacker 135 may also attempt to modify the signature sig to generate a fraudulent signature sig′. However, because the attacker 135 does not have access to the secret key sk, they may not be able to generate a valid signature. Therefore, the fraudulent signature sig′ may not be a valid signature. The attacker 135 may then send the fraudulent message m′ and the fraudulent signature sig′ to the verification device 125.
In step 112, the verification device 125 can attempt to verify the message m from the sending device 115 and the fraudulent message m′ from the attacker 135. For example, the verification device 125 may multiply the signature sig and the verification key vk, and compare the result to the hashed message h(m). If the result is equal to the hash h(m), or within an accepted threshold, then the message m can be verified. If the result is not equal to the hashed message h(m), or within an accepted threshold, then the message m may not be verified. More generally, verifying may involve a function of the message, the hashed message, the signature, and/or the verification key. Signatures that were not generated with the secret key sk may not give valid result and may not be verified. The verification device 125 can thus differentiate between authentic messages sent by the sending device 115 and fraudulent messages sent by the attacker 135.
B. Generation of Verification Matrix
A verification matrix can be generated that is used as a public key to verify a signature. A trapdoor matrix (as a private key) can also be generated, which can be used in generating the verification matrix and in generating the signature.
In step 202, the generating device may generate a gadget matrix G. The gadget matrix G may be a matrix where all entries are powers of a small integer b, e.g., 2, 3, 4, or 5. For example, the entries of the gadget matrix G may be ascending powers of 2. The gadget matrix G may have dimensions n×log q.n may be a security parameter, such as 64, 128, 256, and 512.q may be set to n√n. In some embodiments, the modulus q may be between 216 to 224.
In step 204, the generating device may reduce the dimensions of the gadget matrix G by dropping selected columns to generate a reduced gadget matrix F. For example, the generating device may drop the first l columns of the gadget matrix G. The parameter l may be chosen, for example, to be between 0 and
or between 0 and log q. l may be experimentally determined to provide a desired security level and key size. A larger value of l (resulting a smaller reduced gadget matrix F) may result in smaller keys, but may also be less secure. A larger value of l (resulting in a larger reduced gadget matrix F) may result in greater security but with larger keys. For example, for a security parameter n=512 and base b=2, the modulus may be set to q=224. Then l may be chosen from the range between 0 and 12.
In step 206, the generating device may sample a distribution to form a trapdoor matrix R. For example, each element of the trapdoor matrix R may be sampled independently from a Gaussian distribution with a small standard deviation. The trapdoor matrix R can also be used to calculate a small matrix D, where
D may thus be the trapdoor matrix R vertically augmented with an identity matrix of the same dimensions. A small matrix may be a matrix where each entry is less than n. The trapdoor matrix R can function as a secret key for generating a digital signature.
In step 208, the generating device may generate a verification matrix A. The generating device may first generate a matrix Ā, based on a uniformly random matrix Â. Each element in the uniformly random matrix  may be chosen with equal probability. A can then be calculated from F, R, and Ā as A :=[Ā|F−ĀR]. Thus, the verification matrix A is the matrix Ā augmented with the matrix resulting from the expression F−ĀR.
In step 210, the generating device may transmit the verification matrix A to a sending device (e.g., 115 in
C. Signature Generation
In step 302, the sending device can receive a message m. In some embodiments, the sending device may have generated the message m. The message m may be, for example, an authorization request message or a payment transaction message. The message m may be hashed. The sending device can hash the message m to generate a hashed message h(m). For example, the sending device may take the modulus of the message m. The message m and the hash h(m) may be vectors.
In step 304, the sending device can store a verification matrix A and a trapdoor matrix R. The verification matrix A and the trapdoor matrix R may be received from a generating device. In some embodiments, the sending device may generate the verification matrix A and the trapdoor matrix R. The sending device may receive a small matrix D instead of, or in addition to, the trapdoor matrix R. The trapdoor matrix R and the verification matrix A may be received, for example, through a secure channel or using a secret sharing algorithm. A secure channel may include, for example, an Internet communication channel secured with SSH encryption.
In step 306, the sending device may generate a signature vector x using the hash h(m), the verification matrix A and the trapdoor matrix R. The sending device may first sample a probability distribution (e.g., a Gaussian distribution) to form a perturbation vector p. Each element in the perturbation vector p can be independently sampled from the probability distribution. The sending device may then compute a lattice vector v=h(m)−Ap. The lattice vector v may then be used to sample an intermediate vector z from an lattice Gaussian distribution based on a lattice formed form the lattice vector v. A lattice Gaussian distribution may be a discrete Gaussian distribution over a lattice, in particular the lattice defined by lattice vector v. Then, the sending device can then generate the signature vector x from a product of the intermediate vector z and the trapdoor matrix R as
The perturbation p may prevent an attacker from determining the trapdoor matrix R based on a message-signature pair. As the perturbation p can be sampled from a distribution that does not depend on the message m or the trapdoor matrix R, it can mask the trapdoor matrix R in the signature vector x.
In step 308, the sending device may transmit the message m and the signature vector x. These can be transmitted to a verification device. The sending device may also transmit the verification matrix A to the verification device. The verification device can then verify the message m. The verification device can multiply the verification matrix A by the signature vector x, and if the result matches the hashed message with a small error then the message m can be verified. The verification device can also verify that the signature vector x was generated by the sending device.
D. Signature Verification
In step 402, the verification device can store a verification matrix A. The verification matrix A may be received from a generating device. Alternatively, the verification device may receive the verification matrix A from a sending device, or may be publically available. The verification matrix A may alternatively be generated by the verification device.
In step 404, the verification device can receive a message m and a signature vector x from a sending device. The verification device may receive a hashed message h(m) in addition to, or instead of the message m. The verification device may hash the message m to generate the hashed message h(m). The message m and the hash h(m) may be vectors.
In step 406, the verification device can use the verification matrix A to verify the signature vector x. In order to verify the signature vector x, the verification device can compute and error vector e by solving the equation Ax=h(m)+e mod q, for some modulus q. In particular, because Ax−h(m)=e mod q, the verification device can compute Ax−h(m) with the verification matrix A, the signature vector x, and the hash h(m) to determine the error vector e. The modulus q may be determined as n√{square root over (n)} for a security parameter n. For example, for a parameter n=128, q may be 223. In some embodiments, the modulus q may be selected from a range between 216 and 224. Because the signature vector x and verification matrix A are generated with an approximate trapdoor, the product may not be exactly the hash h(m), but instead may differ by the error vector e. If the signature vector x had been generated with an exact trapdoor, the error vector e would be 0. The verification device can then quantify the size of the error vector e, such as by finding the Euclidean length of the error vector e, and compare the size to a threshold. In other embodiments, the size of the error vector e can be quantified using a different lp-norm, or an infinity norm.
If the error vector e is less than the threshold then the signature vector x can be verified. In some embodiments, the threshold may be the dimension of the hash h(m) (e.g., the dimension of a vector of the hash h(m). In other embodiments, the threshold may be chosen between 0 and
For example, if q=223, the threshold may be 212. If the error vector e is large, this can indicate that the message m was not signed with signature vector x generated from the same trapdoor as the verification matrix A, and thus the message m is likely fraudulent or otherwise compromised.
After the signature is verified, the verification device can act upon information in the message. As one example, the message may include a request to access a secure area, such as a building or venue, or to access secure data or other resource that may reside on a networked computer. After verifying the signature and the message, the verification device can then send an indication to an access device to allow access. As another example, the message may include a request to establish a secure connection between the sending device and the verification device. After verifying the message, the verification device can establish the secure connection, e.g., by performing a key exchange. Additionally, or alternatively, the verification device may respond to the sending device. The response may include information such as a cryptogram, an encryption key, or secure data, such as an access token. Some embodiments may be applied to a payment processing system. For example, the message may be a payment request message, and the verification device may use the information in the payment request message as part of generating an authorization response, e.g., by transmitting the authentication (verification) to a processing network or to an authorization server that uses the authentication to prepare the authorization response.
E. Approximate Trapdoors
In
We can now discuss in more detail the difference between approximate trapdoors and exact trapdoors.
Given a public matrix A∈qn×m where m=O(n log q), and a target y. We call a vector x∈m an approximate short preimage of y if
A·x=y+z(mod q)
for some z∈n, and both x and z are short. An approximate trapdoor for A is defined to be a string that allows its owner to efficiently find an approximate short preimage given a target y.
To make sense of the word “trapdoor”, we can argue that solving the approximate version of ISIS is hard without the trapdoor. Under proper settings of parameters, we show the approximate ISIS problem is as hard as the standard ISIS problem, or no easier than LWE. The reductions extensively use the Hermite normal form (HNF) and are pretty straightforward.
The approximate ISIS problem and the approximate trapdoor are natural generalizations of their exact variants. Indeed, both notions have been used in the literature, at least on an informal level. For example, the approximate ISIS problem was used in the work of Bai et al. [BGLS19] to improve the combinatorial algorithms of the exact ISIS problem.
For the approximate trapdoor, an exact trapdoor of a public matrix in the HNF, say a trapdoor for A=[In|A′], can be used as an approximate trapdoor for A′. Such a method was often used in the implementation of signatures to decrease the sizes of the public key and the signature by a dimension of n. Our goal is thus to further reduce the sizes compared to the HNF approach, while preserving the quality of the trapdoor, i.e. at least not increasing the norm of the preimage.
One contribution of embodiments is to show that the gadget trapdoor (G-trapdoor) proposed by Micciancio and Peikert [MP12] can be modified to an approximate trapdoor, in a way that further reduces the sizes of the public matrix, the trapdoor, and the preimage. An aspect of the G-trapdoor is a specific “gadget” matrix of base b,
G :=In⊗gt:=In⊗(1, b, . . . , bk-1)∈n×(nk)
where k:=[logbq]. The base b is can be chosen as 2 for simplicity, or a larger value in practical implementations.
Micciancio and Peikert [MP12] show how to generate a random matrix A together with a matrix D of small norm such that A·D=G (mod q). In particular, A can be designed as
A=[Ā|G−ĀR]
where R is a matrix with small entries and is the actual trapdoor. The matrix D is then equal to
Since the kernel of the G matrix has a public short basis, one can first solve the ISIS problem under the public matrix G, then use D to solve the ISIS problem under the public matrix A.
We observe that if we drop a few (say l) entries corresponding to the small powers of b from the gadget matrix G, i.e. let the following F matrix be a reduced gadget matrix
F :=In⊗ft:=In⊗(bl, . . . , bk−1)∈n×n(k−l)
then we are still able to solve the ISIS problem with respect to the public matrix F up to a bl-approximation of the solution (i.e., the norm of the error vector is proportional to bl). Replacing G by F in A gives
A=[Ā|F−ĀR]
Then the dimensions of the trapdoor R and the public matrix A can be reduced.
Given a public matrix A together with its approximate G-trapdoor R, finding an arbitrary approximate short preimage of a given target u is quite straightforward, but sampling the preimage from a trapdoor-independent distribution can be non-trivial. As mentioned, the ability to sample from a trapdoor-independent distribution is involved in many of the trapdoor applications including digital signatures.
We provide an algorithm that samples an approximate short preimage from a trapdoor-independent distribution. The algorithm itself can build on the perturbation-based discrete Gaussian sampler from [MP12], but the analyses of the preimage distribution from [MP12] are not easy to generalize. Our analyses of the preimage distribution and the approximation error distribution extensively use a linear transformation theorem on lattice distributions (cf. Theorem 2.7, implicitly used in [MP12, MP13, BPMW16]).
While our algorithm works for target images u∈qn and does not cause any blow up in the standard deviation of the distribution, the analysis of trapdoor-independence only applies to a target image u sampled uniformly from qn, as opposed to the analysis for the exact trapdoor in [GPV08, MP12] which is able to spell out the distribution of the preimages of all the u∈qn. To briefly explain the reason for this gap, we observe that the methods we have tried to handle all the target images require significant increases in the smoothing parameters of the lattice intersections required in the linear transformation theorem (Theorem 2.7). In other words, the norm of the resulting preimage increases significantly rendering the result meaningless.
Still, sampling the approximate preimages for uniform targets from a trapdoor-independent distribution suffices for replacing the use of exact lattice trapdoors in many cryptosystems, including the hash-and-sign signature [GPV08], IBE [GPV08, ABB10, CHKP12], ABE [GVW13], and the special purpose obfuscation schemes [GKW17, WZ17], the private-constrained PRF and the reusable garbled circuits [CC17, CVW18] built on the GGH15 graded encoding scheme [GGH15].
We now explain the efficiency gain of using our approximate trapdoor compared to the exact trapdoor and the other existing optimization techniques, with a focus on the signature application. Our goal is to set the parameters to achieve the following “win-win-win” scenario: 1) save on the preimage size (bandwidth); 2) save on the size for the public matrix A; and 3) retain, or even gain, concrete security, which is related to the discrete Gaussian width of the preimage and the norm of the error term.
Let us start with an understanding of the dependency of the savings on the variable l, i.e, the number of entries dropped from the gadget g. In Table 1 we provide a comparison of the parameters between the exact G-trapdoor of [MP12] and the approximate G-trapdoor samplers in this paper. In both cases the public matrices are instantiated in the pseudorandom mode. For the approximate trapdoor, the dimension of the trapdoor decreases from nk to n(k−l). The dimension m of the public matrix and the preimage decreases. The width s of the preimage distribution also decreases slightly following the decreasing of m. However, the norm of the error factor in the image grows with l. So in the concrete instantiation of the hash-and-sign signature discussed later, we need to coordinate the value of l with the norms of the preimage and the error, which can determine the security estimation together.
Our algorithm inherits the 0(log q)-space, 0(n log q)-time G-preimage sample subroutine from [MP12, GM18]. So the saving of space and time in the sampling of the perturbation is proportional to the saving in the dimension m.
Let us make a quick remark for the applications beyond signatures. The saving of the dimension m is of significant importance to the applications built on the GGH15 graded encoding scheme (implemented in [HHSS17, CGM+18]). In those applications, the modulus q is proportional to md and (where d∈ is the number of “levels” of the graded encodings; larger d supports richer functionalities). So reducing the dimension m would dramatically reduce the overall parameters.
The parameters in table 1 are derived under a fixed lattice dimension n, a fixed modulus q≥√{square root over (n)}, and a fixed base b. Let k=[logbq]. Let l denote the number of entries removed from g(1≤l<k). Then we list m as the dimension of the public matrix and the preimage; σ as the width of the gadget preimage distribution; s as the width of the final preimage distribution (where C>0 is a universal constant; τ as the width, or subgaussian parameter, of the distribution of the entries in the trapdoor); ν as the length bound of the error for each entry in the image. Note that some embodiments may use δ in place of τ.
F. Example Parameters for the Signatures
We give a proof-of-concept implementation of the hash-and-sign signature based on our approximate trapdoor. The security is analyzed in the random oracle model, assuming the hardness of RingLWE for the pseudorandomness of the public key and RingSIS for the unforgeability of the signature. Here we provide a short summary and leave more details in Section 5.2.
We also use smaller moduli and bigger bases to reduce the size and increase the security level. The parameters in
Our implementation shows that the sizes of the public-key & signature can be reduced to 5 kB & 4.45 kB for an estimation of 100-bit security, and 11.25 kB & 9.38 kB for an estimation of 192-bit security. Those are much closer to the sizes of the signatures based on the rejection sampling approach [Lyu12, BG14, DKL+18, ABB+19]. As a reference, the sizes of the public-key & signature for qTESLA [ABB+19] are 4.03 kB & 3.05 kB for an estimation of 128-bit security, and 8.03 kB & 6.03 kB for an estimation of 192-bit security. The sizes for Dilithium [DKL+18] are even smaller. Let us remark that our implementation has not used many low-level optimizations like Dilithium [DKL+18] and qTESLA [ABB+19]. So it is reasonable to expect we have more room to improve after adding lower-level optimizations. The parameters for Falcon [FHK+18] are the smallest due to the use of NTRU lattices, so they are rather incomparable with the ones based on RingLWE.
There are many folklore optimizations regarding trapdoors for Ajtai's one-way function. We discuss the comparison and compatibility of a few of them with our construction. Throughout these comparisons we are concerned with the “win-win-win” scenario mentioned in the beginning.
First is the approximate trapdoor from the HNF optimization: A=[I|A′]. This barely achieves the “win-win-win” scenario with a slight savings on the public key and the signature. Our construction can be used in the pseudorandom-mode of the gadget trapdoor which has automatically included the HNF optimization, and saves around 50% in addition.
Our method can also be used together with any base in the gadget, including a large base of size b≈√{square root over (q)} (the resulting gadget is g=[1, √{square root over (q)}]), as was used in [dPLS18] when the modulus is large. This construction suffers from a large Gaussian width (√{square root over (q)}), which can hurts concrete security and may be infeasible in the smaller modulus regime we implement in Section 5. Specifically for the smaller moduli, the signature's Gaussian width is larger than the modulus, as was confirmed both on paper and in our experiments. So we use a moderately large base b.
One may also try to construct a short integer matrix S for A :=[(I, Ā′)|F−(I, Ā′)R] (corresponds to the pseudorandom public key in Eqn. (1)) such that AS=G, and hope this view provides a better approximate trapdoor. From here, the hash-and-sign signature scheme is to return Sz+p where p is a perturbation and z is a G-lattice sample. However, such a matrix S requires a bl term. So this method does save on the public key, but does not improve the signature size and, most importantly, increases the Gaussian width by a factor of bl . The increase of the width decreases the concrete security of the underlying SIS problem. In fact, to achieve the same saving in the public key, one can instead directly increase the base from b to bl in the gadget.
Before describing specific examples in detail, we can describe aspects of lattice-based cryptosystems that may be utilized in embodiments of the present disclosure.
A. Notations and Terminology.
In cryptography, the security parameter (denoted as λ) is a variable that is used to parameterize the computational complexity of the cryptographic algorithm or protocol, and the adversary's probability of breaking security. An algorithm is “efficient” if it runs in (probabilistic) polynomial time over λ.
When a variable v is drawn uniformly random from the set S we denote as v←U(S). We use ≈s and ≈c as the abbreviations for statistically close and computationally indistinguishable. For two distributions D1, D2 over the same support X, we denote D1D2 to denote that each x∈X has D1(x)∈[1±ε]D2(x) and D2(x)∈[1±ε]D1(x).
Let , Δ, be the set of real numbers, integers and positive integers. Denote /(q by q. For n∈, [n]:=1, . . . , n. A vector in (represented in column form by default) is written as a bold lower-case letter, e.g. v. For a vector v, the ith component of v will be denoted by vi. For an integer base b>1, we call a positive integer's “b-ary” decomposition the vector (q0, q1, . . . , qk−1)∈{0, . . . , b−1}k where k:=┌logbq┐, and q=Σqibi.
A matrix is written as a bold capital letter, e.g. A. The ith column vector of A is denoted ai. The length of a vector is the p-norm∥v∥p:=(Σvip)1/p, or the infinity norm given by its largest entry ∥v∥∞:=maxi{|vi|}. The length of a matrix is the norm of its longest column: ∥A∥p:=maxi∥ai∥p. By default we use 2-norm unless explicitly mentioned. When a vector or matrix is called “small” or “short”, we refer to its norm but not its dimension, unless explicitly mentioned. The thresholds of “small” or “short” can be precisely parameterized when necessary. The notations [A B], [A, B], and [A|B] can all denote that a matrix A is augmented or concatenated horizontally with a second matrix B. The notation
can denote that matrices A and B are vertically concatenated, and may also be called a stack.
B. Linear Algebra
Let {ei}i=1n be the canonical basis n, with entries δ(j, k) where δ(j, k)=1 when j=k and 0 otherwise. For any set S⊆n, its span (denoted as span(S)) is the smallest subspace of n containing S. For a matrix, M∈n×m, its span is the span of its columns vectors, written as span(M). We write matrix transpose as Mt. Let {tilde over (B)} denote the Gram-Schmidt orthogonalization of B. The GSO of an ordered basis B=[b1, . . . , bk] is assumed to be from left to right, =b1, unless stated otherwise.
Recall M's singular value decomposition (SVD), i.e. M=VDW∈n×m where V∈n×m along with W∈m×m are unitary ,and D∈n×m is a triangular matrix containing M's singular values. Further, let q=min{n, m} and Dq=diag(s1, . . . , sq) be the diagonal matrix containing M's singular values {0≤si}. Then, D=Dq when n=m, D=[Dq 0] when m>n, and
in the case m<n.
A symmetric matrix Σ∈n×m is positive semi-definite if for all {right arrow over (x)}∈n, we have {right arrow over (x)}tΣ{right arrow over (x)}≥0. It is positive definite, Σ>0, if it is positive semi-definite and {right arrow over (x)}tΣ{right arrow over (x)}=0 implies {right arrow over (x)}=0. We say Σ1>Σ2 (≥) if Σ1−Σ2 is positive-(semi)definite. This forms a partial ordering on the set of positive semi-definite matrices, and we denote Σ≥αl often as Σ≥α for constants αΣ+. For any positive semi-definite matrix Σ, we write √{square root over (Σ)} to be any full rank matrix T such that Σ=TTt. We say T is a square root of Σ. For two positive semi-definite matrices, Σ1 and Σ2, we denote the positive semi-definite matrix formed by their block diagonal concatenation as Σ1⊕Σ2. Let M* denote Hermitian transpose. The (Moore-Penrose) pseudoinverse for matrix M with SVD M=VDW is M+=WD+V* where D+is given by transposing D and inverting M's nonzero singular values. For example, T=sI and T+=s−1 for a covariance Σ=s2I. (An analogous T+ is given for the non-spherical, full-rank case Σ>0 using Σ's diagonalization.)
C. Lattices Background
An n-dimensional lattice of rank k≤n is a discrete additive subgroup of n. Given k linearly independent basis vectors B=b1, . . . , bk∈n, the lattice generated by B is
Given n, m∈ and a modulus q≥2, we often use q-ary lattices and their cosets, denoted as
for A∈qn×m, denote Λ⊥[A] or Λq⊥[A] as {x∈m: A·x=0(modq)};
for A∈qn×m, w∈qn, denote Λw⊥[A] as {x∈m: A·x=w(modq)}
We can define Gaussians on lattices. For any s>0 define the Gaussian function on n with parameter s:
For any c ∈n, real s>0, and n-dimensional lattice Λ, define the discrete Gaussian distribution DΛ+c,s as:
The subscripts s and c are taken to be 1 and 0 (respectively) when omitted.
For any positive semidefinite Σ=T·Tt, define the non-spherical Gaussian function as
and ρT(x)=0 for all x span(Σ). Note that ρT(·) only depends on Σ but not the specific choice of the T, so we may write ρT(·) as ρ√{square root over (Σ)}(·).
For any c∈n, any positive semidefinite Σ, and n-dimensional lattice Λ such that (Λ+c)∩ span(Σ) is non-empty, define the discrete Gaussian distribution
as:
Such a discrete Gaussian distribution may be referred to as a lattice Gaussian distribution.
We recall the definition of smoothing parameter and some useful facts.
Definition 2.1 [Smoothing parameter [MR07]] For any lattice Λ and positive real ε>0, the smoothing parameter ηε(Λ) is the smallest real s>0 such that ρ1/s(Λ*\{0})≤ε.
Notice that for two lattices of the same rank Λ1⊆Λ2, the denser lattice always has the smaller smoothing parameter, i.e. ηε(Λ2)≤ηε(Λ1). We can also use a generalization of the smoothing parameter to the non-spherical Gaussian.
Definition 2.2 For a positive semi-definite Σ=TTt, an ε>0, and a lattice Λ with span(Λ)⊆span(Σ), we say ηε(Λ)≤√{square root over (Σ)} if ηε(T+Λ)≤1.
When the covariance matrix Σ>0 and the lattice Λ are full-rank, √{square root over (Σ)}≥ηε(Λ) is equivalent to the minimum eigenvalue of Σ, λmin (Σ), being at least ηε2(Λ).
Lemma 2.3 [Smoothing parameter bound from [GPV08]] For any n-dimensional lattice (B) and for any ω(√{square root over (log n)}) function, there is a negligible ε(n) for which
The following is a generalization of [GPV08, Corollary 2.8] for non-spherical Gaussian.
Corollary 2.5 [Smooth over the cosets] Let Λ, Λ′ be n-dimensional lattices s.t. Λ′⊆Λ. Then for any ε>0, √{square root over (Σ)}≥ηε(Λ′), and c∈span(Λ), we have
Lemma 2.6. [[PR06, MR07]] Let B be a basis of an n-dimensional lattice , and let s ≥∥B∥·ω(log n), then PrxD
We can use the following general theorem regarding the linear transformation, T, of a discrete gaussian. It states that as long as the original discrete gaussian is smooth enough it the kernel of T, then the distribution transformed by T is statistically close to another discrete Gaussian.
Theorem 2.7 For any positive definite Σ, vector c, lattice coset A:=Λ+a⊂c+span(Σ), and linear transformation T, if the lattice ΛT=Λ∩ker(T) satisfies span(ΛT)=ker(T) and ηε( ΛT)≤√{square root over (Σ)}, then
where
D. Gadgets, or G-Lattices
Let G=In⊗gt∈qn×nk with gt=(1, b, . . . , bk−1), k=[logbq]. G is commonly referred to the gadget matrix. The gadget matrix's q-ary lattice, Λq⊥(G), is the direct sum of n copies of the lattice Λq⊥(gt). Further, Λq⊥(gt) has a simple basis,
where (q0, . . . , qk−1)∈{0,1, . . . , b−1}k is the b-ary decomposition of the modulus, q. When q=bk, we can set q0=q1= . . . =qk−2=0 and qk−1=b. Either way, the integer cosets of Λq⊥(gt) can be viewed as the syndromes of gt as a check matrix, in the terminology of coding theory. These cosets are expressed as Λu⊥(gt)={x∈k:gtx=umodq}=Λq⊥(gt)+u where u can be any coset representative. A simple coset representative of Λu⊥(gt) is the b-ary decomposition of u. The integer cosets of Λq⊥(G) are expressed through the direct-sum construction, Λu⊥(G)=Λu
As an example, consider a gadget matrix G with base b=2, q=8 (and thus k=3), and n=3. The gadget matrix is then G=I3⊗gt ∈83×9 with gt=(1, 2, 22), or
We can then form a reduced gadget matrix F by dropping the l columns with the smallest powers b from the gadget matrix G. For example, if l=1, then this corresponds to dropping any column with a 1. The reduced gadget matrix is thus
E. SIS and LWE
We first recall the short integer solution (SIS) problem.
Definition 2.8[SIS[Ajt96]] For any n, m, q∈ and β∈, define the short integer solution problem SISn,m,q, as follows: Given A∈qn×m, find a non-zero vector x∈m such that ∥x∥≤β, and
Ax=0 mod q
Definition 2.9 [ISIS] For any n, m, q ∈ and ∈, define the inhomogeneous short integer solution problem ISISn,m,q, as follows: Given A∈qn×m, y∈qn, find x∈m such that ∥x∥≤β, and
Ax=y mod q
Lemma 2.10 [Hardness of (I)SIS based on the lattice problems in the worst case [Ajt96, MR07, GPV08]] For any m=poly(n), any β>0, and any sufficiently large q≥β·poly(n), solving ISISn,m,q,β or ISISn,m,q,β (where aryy is sampled uniformly from qn) with non-negligible probability is as hard as solving GapSVPΥ and SIVPΥ on arbitrary n-dimensional lattices with overwhelming probability, for some approximation factor γ=β·poly(n).
All the (I)SIS problems and their variants admit the Hermite normal form (HNF), where the public matrix A is of the form [In|A′] where A′∈qm−n. The HNF variant of (I)SIS is as hard as the standard (I)SIS. This can be seen by rewriting A∈qn×m as A=:[A1|A2]=A1·[ln|A1−1·A2] (we may work with n, q such that A1←U(qn×m) is invertible with non-negligible probability).
We can also recall the decisional learning with errors (LWE) problem.
Definition 2.11 [Decisional learning with errors [Reg09]] For n, m∈ and modulus q≥2, distributions for secret vectors, public matrices, and error vectors θ, π, χ,⊆,q. An LWE sample is obtained from sampling s←θn, A←πn×m, e←χm, and outputting (A, yt:=stA+etmodq).
We say that an algorithm solves LWEn,m,q,θ,π,χ if it distinguishes the LWE sample from a random sample distributed as πn×m×U(qm) with probability greater than ½ plus non-negligible.
Lemma 2.12 [Hardness of LWE based on the lattice problems in the worst case [Reg09, Pei09, BLP+13, PRS17]] Given n∈N, for any m=poly(n), q≤2poly(n). Let θ=π=U(q), χ=D,s where s≥2√{square root over (n)}. If there exists an efficient (possibly quantum) algorithm that breaks LWEn,m,q,θ,π,χ, then there exists an efficient (possibly quantum) algorithm for solving GapSVPγ and SIVPγ on arbitrary n-dimensional lattices with overwhelming probability, for some approximation factor γ=Õ(nq/s).
The next lemma shows that LWE with the secret sampled from the error distribution is as hard as the standard LWE.
Lemma 2.13 [[ACPS09, BLP+13]] For n, m, q, s chosen as was in Lemma 2.12, LW is as hard as LW for m′≤m−(16n+4 loglog q).
The (I)SIS and LWE problems bear similarities. It is sometimes convenient to talk about one of the two problems, and an analogous result immediately applies for the other. On a high level they can be considered equally hard, since breaking one of them would morally break the other. But a careful examine of the current status of the reductions suggests that LWE is a stronger assumption than (I)SIS. Since if there is a polynomial time algorithm that solves the SIS problem with respect to a public matrix A∈n×m, we can simply use the SIS solution x to break the decisional LWE problem by computing the inner-product of x and the LWE challenge vector y; on the other hand, given a polynomial time algorithm that solves the LWE problem, we know of a more involved polynomial time quantum algorithm that solves SIS [SSTX09].
Nevertheless, a trapdoor for a public matrix A ∈qn×m is defined in [Ajt99, GPV08] as anything that allows us to efficiently solve both the (I)SIS and LWE problems w.r.t. A.
Given a matrix A∈qn ×m, define an approximate trapdoor of A as anything that allows us to efficiently solve the approximate version of the ISIS problem w.r.t. A. We first define the approximate ISIS problem.
Definition 3.1 (Approximate ISIS) For any n, m, q∈ and α, β, ∈, define the approximate inhomogeneous short integer solution problem Approx.ISISn,m,q,α,β as follows: Given A∈q n×m, y∈q n, find a vector x∈m, such that ∥x∥≤β, and there is a vector z∈n satisfying
∥z∥≤α and Ax=y+z(mod q)
That is to say, given a verification matrix A and a message (or hash) y, the approximate ISIS relates to the problem of finding a signature vector x such that Ax=y with a small error z. Let us remark that the approximate ISIS is only non-trivial when the bounds α, β are relatively small compared to the modulus q. Also, our definition chooses to allow the zero vector to be a valid solution, which means when ∥y∥≤α, the zero vector is trivially a solution. Such a choice does not cause a problem since the interesting case in the application is to handle all the y∈qn or y sampled uniformly from qn.
Definition 3.2 (Approximate trapdoor) A string τ is called an (α, β)-approximate trapdoor for a matrix A∈qn×m if there is a polynomial time algorithm (in n, m, log q) that given τ, A and any y ∈qn, outputs a non-zero vector x ∈m such that ∥x∥≤β, and there is a vector z ∈n satisfying
∥z∥≤α and Ax=y+z(modq)
To make sense of the approximate trapdoor, we argue that for those who do not have the trapdoor, the approximate ISIS problem is a candidate one-way function under proper settings of parameters.
First, we observe a rather obvious reduction that bases the hardness of approximate ISIS on the hardness of decisional LWE with low-norm secret (e.g. when the secret is sampled from the error distribution). In the theorem statement below, when the norm symbol is applied on a distribution D, i.e. ∥D∥, it denotes the lowest value ν∈+ such that Prd←D[∥d∥<v]>1 −negl(λ).
Theorem 3.3 For n, m, q∈, α, β∈+, θ, π, χ be distributions over such that q>4(∥θ∥·(α+1)+∥θn∥·α+∥χm∥·β).Then LWEn,m,q,θ,π,χ≤p Approx.ISISn,m,q,α,β.
Proof. Suppose there is a polynomial time adversary A that breaks Approx.ISISn,m,q,α,β, we build a polynomial time adversary B that breaks decisional LWE.
Let r=└α┘+1. Given an LWE challenge (A, w) ∈q n×m×qm, where w is either an LWE sample or sampled uniformly from qm. B picks a vector y:=(r, 0, . . . , 0)t ∈qn , sends A and y to the adversary A as an approximate ISIS challenge. A replies with x ∈m such that ∥x∥≤β, and there is a vector z ∈n satisfying
∥z∥≤α and Ax=y+z(modq).
Note that x≠0 since ∥y∥>α.
B then computes v:=w,x. If wt=stA+et for s←θn, e←χm, then
v=(stA+et)x=st(y+z)+etx⇒∥v∥θ∥·r+∥θn∥·α+∥χm∥·β<q/4
Otherwise v distributes uniformly random over q. So B can compare v with the threshold value and wins the decisional LWE challenge with probability ½ plus non-negligible.
Alternatively, we can also prove that the approximate ISIS problem is as hard as the standard ISIS.
Theorem 3.4 ISISn,n+m,q,β≥p Approx.ISISn,m,q,α+β,β; ISISn,n+m,q,α+β≤p Approx.ISISn,m,q,α,β·
Proof. The reductions go through the HNFs of the ISIS and the approximate ISIS problems. We will show ISIS=HNF.ISIS=HNF.Approx.ISIS=Approx.ISIS under proper settings of parameters.
Recall that ISISn,m,q,β=HNF.ISISn,m,q,β as explained in the preliminary. Also, HNF.ISISn,m,q,β≥p HNF.Approx.ISISn,m,q,α,β for any α≥0 by definition. It remains to show the rest of the connections.
Lemma 3.5 HNF.ISISn,m,q,α+β≤p HNF.Approx.ISISn,m,q,α,β.
Proof. Suppose there is a polynomial time algorithm A that solves HNF.Approx.ISISn,m,q,α,β, we build a polynomial time algorithm B that solves HNF.ISISn,m,q,α+β. Given an HNF.ISIS instance [In|A]∈qn×m, y, B passes the same instance to A, gets back a vector x such that
[In|A]·x=y+z(modq)
where ∥x∥≤β, ∥z∥<α. Now write x=: [x1t|x2t] where x1∈n, x2∈m. Then x′:=[(x1−z)t|x2t]t satisfies
[In|A]·x′=y(modq),
and ∥x′∥≤α+β. So x′ is a valid solution to HNF.ISIS.
Lemma 3.6 HNF.Approx.ISISn,n+m,q,α,β≤p Approx.ISISn,m,q,α,β.
Proof. Suppose there is a polynomial time algorithm A that solves Approx.ISISn,m,q,α,β, we build a polynomial time algorithm B that solves HNF.Approx.ISISn,n+m,q,α,β. Given [In|A]∈qn×(n+m), y∈qn as an HNF.Approx.ISIS instance, B passes A∈qn×m, y to A, gets back a short vector x∈m. Then [0nt|xt]t is a valid solution to the HNF.Approx.ISIS instance.
Lemma 3.7 HNF.Approx.ISISn,n+m,q,α,β≥p Approx.ISISn,m,q,α+β,β.
Proof. Suppose there is a polynomial time algorithm A that solves HNF.Approx.ISISn,n+m,q,α,β, we build a polynomial time algorithm B that solves Approx.ISISn,m,q,α+β,β. Given an Approx.ISIS instance A∈qn×m, y ∈n, B passes [In|A]∈qn×(n+m), y as an HNF.Approx.ISIS instance to A, gets back an answer x ∈m+n such that
[In|A]·x=y+z(modq), (1)
where ∥x∥≤β, ∥z∥≤α.
Now write x=: [x1t|x2t]t where x1∈n, x2∈m. Rewriting Eqn. (1) gives
A·x2=y+z−x1(modq),
so x2 is a valid solution to Approx.ISISn,m,q,α+β,β.
Theorem 3.4 then follows the lemmas above.
The following statement immediately follows the proof of Lemma 3.7.
Corollary 3.8 An (α, β)-approximate trapdoor for [I|A] is an (α+β, β)-approximate trapdoor for A.
Lemma 3.9 HNF.Approx.ISISn,n+m,q,α,β≥p Approx.ISISn,m,q,2(α+β),β.
Proof. Suppose there is a polynomial time algorithm A that solves HNF.Approx.ISISn,n+m,q,α,β, we build a polynomial time algorithm B that solves Approx.ISISn,m,q,α+β,β. Given an Approx.ISIS instance A ∈qn×m, y ∈n, B first check if ∥y∥≤α+β. If so, then find y′=: y+Δy such that ∥y′∥>α+β and ∥y′−y∥≤α+β; if not then simply set y′:=y and Δy:=0. B then passes [In|A] ∈qn×(n+m), y′ as an HNF.Approx.ISIS instance to A, gets back an answer x ∈m+n such that
[In|A]·x=y′+z(modq), (2)
where ∥x∥≤β, ∥z∥≤α.
Now write x=: [x1t|x2t]t where x1∈n, x2∈m. Since ∥y′∥>α+β, x2 must be a non-zero short vector. Rewriting Eqn. (2) gives
A·x2=y+Δy+z−x1(modq),
so x2 is a valid solution to Approx.ISISn,m,q,2(α+β),β.
We present an instantiation of an approximate trapdoor based on the gadget-based trapdoor generation and preimage sampling algorithms of [MP12] (without the tag matrices). In short, we show how to generate a pseudorandom A with entries modulo q along with an approximate trapdoor R with small integer entries.
In the rest of this section, we first recall the exact G-trapdoor from [MP12], then present the approximate trapdoor generation algorithm and the approximate preimage sampling algorithm. Finally we analyze the preimage distribution. The analyses make extensive use of Theorem 2.7 (linear transformations of discrete Gaussians).
A. Recall the G-trapdoor from MP12
Let b≥2 be the base for the G-lattice. Let q be the modulus, k=┌logbq┐. b is typically chosen to be 2 for simplicity, but often a higher base b is used for efficiency trade-offs in lattice-based schemes.
Recall the MP12 gadget-lattice trapdoor technique: the public matrix is
A=[Ā|G−ĀR]
where G is the commonly used gadget matrix, G:=In⊗gkt, gkt:=(1, b, . . . , bk−1), and R is a secret, trapdoor matrix with small, random entries. A is either statistically close to uniformly random or pseudorandom, depending on the structure of Ā and the choice of χ (in the pseudorandom case χ⊆ is chosen to be a distribution such that LW is hard). In this paper we focus on the pseudorandom case since the resulting public matrix A and preimage have smaller dimensions.
In order to sample a short element in Λu⊥(A), we use the trapdoor to map short coset representatives of Λq⊥(G) to short coset representatives of Λq⊥(A) by the relation
Using the trapdoor as a linear transformation alone leaks information about the trapdoor. Therefore, we perturb the sample to statistically hide the trapdoor. Let Σp be a positive definite matrix defined as
where σ is at least ηε(Λq⊥(G)). The perturbation can be computed offline as p←D
B. The Algorithms of the Approximate G-trapdoor
As mentioned in the introduction, one objective of obtaining an approximate trapdoor is to adapt the MP12 algorithms with a gadget matrix without the lower-order entries. Let 0<l<k be the number of lower-order entries dropped from the gadget vector g ∈k . Define the resulting approximate gadget vector as f:=(bl, bl+1, . . . , bk−1)t∈(k−l). Let w=n(k−l) be the number of columns of the approximate gadget F:=In⊗ft∈n×w. Then the number of columns of A will be m:=2n+w.
1. Algorithm 1: Sampling from gadget distribution
Algorithm 1 shows an algorithm for sampling a vector from a Gaussian distribution based on lattice derived from a gadget matrix. Algorithm 1 takes in a value v from q (v may be one component of a vector). The gadget sampling algorithm also takes in σ, which defines the width of the distribution. This algorithm may be performed by a sending device, such as sending device 115 of
In step 701, the sending device can sample a vector x comprising k values from . The values may be pulled from a distribution D. D may be a lattice Gaussian distribution, that is, a Gaussian distribution over a lattice, and the lattice may be defined by a gadget matrix, and the deviation σ. The distribution D is described in further detail in Section II.C.
In step 702, the sending device can drop the first l entries from the vector x and take the last k−l entries of the vector x as an intermediate vector z.
In step 703, the sampling algorithm can return the intermediate vector z.
2. Algorithm 2: Generating Trapdoor Matrix
Algorithm 2 shows a trapdoor matrix generation algorithm. As input, this algorithm takes in a security parameter λ. Trapdoor generation can be done by a generating device, such as generating device 105 of
Prior to generating the trapdoor matrix, the generating device can generate a gadget matrix G. The gadget matrix G may be a matrix where each column comprises one value of an ascending power of a base b (e.g., b=2), for k powers (k=┌logbq┐). There may be n repeats of the k columns with ascending powers of b, resulting in an n×k matrix. The generating device can then generate a reduced gadget matrix F that has dimension n×w (where w=n(k−l)), after dropping the first l columns (the columns with the smallest powers) of each group of k columns of the gadget matrix G. More detail about gadget matrices and the generation of a reduced gadget matrix can be found in section II.D.
In step 711, the generating device can randomly sample an n×n uniform matrix Âfrom a uniform distribution over q for a modulus q.
In step 712, the generating device can form a generation matrix Ā. The generation matrix Ā can be a n×2n matrix formed from concatenating the uniform matrix  and an n×n identity matrix In. Thus Ā:=[In, Â].
In step 713, an approximate trapdoor R can be sampled from a distribution χ. The approximate trapdoor R can be a 2n×w matrix. The distribution χ may be the distribution , the Gaussian distribution over integers with width σ.
In step 714, the generating device can form the verification matrix A. The verification matrix A can be A :=[Ā|F−ĀR]. The verification matrix may have dimension n×m, where m=2n+w=2n+n(k−l), with elements from q.
3. Algorithm 3: Generating Lattice-based Signature
Algorithm 3 shows a signature generation algorithm. The signature generation algorithm takes as input the verification matrix A, the trapdoor matrix R, a message vector u, and a distribution width s. The distribution width s may be related to the distribution width σ through the equation in table 1. Signature generation may be done by a sending device, such as sending device 115 of
In step 721, the generating device samples a perturbation vector p from a Gaussian distribution. The perturbation vector p has length m. The covariance of the perturbation is chosen to hide the trapdoor as in the exact gadget-based trapdoor, i.e.
In step 722, the generating device can form a lattice vector v from the message u, the verification matrix A, and the perturbation vector p. The lattice vector v has length n and is formed as v=u−Ap.
In step 723, the generating device can sample an approximate gadget preimage vector z (i.e., the intermediate vector), using the gadget sampling algorithm of Algorithm 1. The gadget sampling algorithm can sample a vector z for each element of the lattice vector v and concatenate them together, thus there are n(k−l) total elements in the total preimage vector z.
In step 724, the generating device can form the signature vector y, which is a vector of length m=2n+w=2n+n(k−l). The perturbation p can be added to a product of the trapdoor matrix R and the preimage vector z. The trapdoor matrix is R has dimension 2n×w, and thus it can be stacked on a w×w identity matrix, in order to have the appropriate dimension of a 2n+w×w matrix. This stack is multiplied by the preimage vector z which has length w, so the product is a vector of length 2n+w=m.
In step 725, the signature generation algorithm can return the signature vector y.
The restults of this section are summarized in the following theorem.
Theorem 4.1 There exists probabilstic, polynomial time algorithms Approx.TrapGen(·) and Approx.SamplePre(·,·,·,·) satisfying the following.
for any σ≥√{square root over (b2+1)}ω(√{square root over (log n)}) and s≥√{square root over (σ2s12(R)+ω(√{square root over (log n)}))}. Furthermore, in the second distribution, A is computationally indistinguishable from random assuming LWE. [0181] Let us remark that while the preimage sampling algorithm works for any target u∈qn, we are able to prove the approximate preimage sample-coset pair (y, u) hides the approximate trapdoor, R, over a uniform target u ∈qn . This is unlike the exact gadget-based trapdoor setting in [MP12] which proves the trapdoor is hidden for each fixed u. In the overview of the proof idea, we will explain where the proof breaks down when we try to spell out the preimage distributions of every u∈qn.
C. Analyze the Preimage and Error Distributions for a Uniformly Random Target
This subsection is dedicated to proving Theorem 4.1.
Let x∈nk denote the short preimage of u−Ap(modq) under the full gadget matrix G, i.e. Gx=u−Ap(modq). The main idea of the proof is to first show that the joint distribution of (p, x) produced in Algorithm 4 is statistically close to
for any u∈qn (this is a stronger theorem than what we need). And then apply the linear transformation theorem on (p, x) to obtain the distributions of y and e. However, applying the linear transformation theorem directly on the lattice coset Λu⊥[A, G] leads to a technical problem. That is, the intermediate lattice intersections ΛT required in Theorem 2.7 have large smoothing parameters. To get around this problem, we use the condition that u is uniformly random to argue that (p,x) is statistically close to
Then the support of (p,x) is m+nk, so we can apply the linear transformation theorem to prove the final distributions of y and e are trapdoor-independent.
Formally, let ε=negl(λ)>0. We first prove three lemmas.
Lemma 4.2 For any σ≥ηε(Λ⊥(G)), the random process of first choosing u←U(qn) then returning x←DΛ
Proof. The proof follows directly from det(Λq⊥(G))=qn and Corollary 2.7. Alternatively, one can use two applications of the fact ρr(Γ+c)∈(1±ε)σn/det(Γ) for any r≥ηε(Γ). The latter yields Pr{Process returns x}
Lemma 4.3 The following random processes are statistically close for any σ≥√{square root over (b2+1)}ω( √{square root over (log n)})≥ηε(gt): sample a uniformly random coset u←(q) then return the error e:=u−Gsamp.Cut(u, σ)modq. Or, return
Proof. For a fixed u ∈q, the error is distributed as
Randomizing over u gives us e˜εL· by Lemma 4.2. Next, we apply Theorem 2.7. Let Bq=[b1, . . . , bk] be the basis of Λq⊥(gt) given in Section 2. Then, the kernel of L is generated by {b1, . . . , bl−1, el, . . . , ek−1} (recall that {ei}i=1n denotes the canonical basis n) and ΛL:=Ker(L)∩k is the set of all integer combinations of these vectors. ΛL spans the kernel of L and the smoothing parameter of ΛL is at most √{square root over (b2+1)}ω(√{square root over (log n)}) since ∥bi∥≤√{square root over (b2+1)} for all i=1, . . . , l−1.
Let
Next, we analyze the distribution given by the linear transformation representing truncating x∈nk in the joint distribution of
to z∈n(k−l) (as in GSAMP.CUT, Algorithm 1) and returning y:=p+R′z. For simplicity, we permute the columns of G to G′:=[F|In⊗(1, b, . . . , bl−1)]. This allows us to express truncation and convolution as a simple linear transformation: y=L(p, x) for L:=[Im|R′ |0].
Lemma 4.4 For any
is statistically close to
Proof. The range and covariance are immediate. Next, we use Theorem 2.7. The kernel of L is given by all vectors (a, b, c) where (b, c) ∈nk and a=−R′b. The integer lattice m+nk contains all such integer vectors so ΛL:=m+nk ∩ ker(L) spans L's kernel.
Now we determine the smoothing parameter of ΛL. Add nl zero columns to R′ to form Rnk:=[R′|0] and rotate the space by
This rotation yields QΛL={0}⊕nk and since rotating a covariance does not change its eigenvalues, we have √{square root over (Σρ⊕σ2Ink)}≥ηε(nk). This implies
is statistically close to .
We are now ready to prove Theorem 4.1.
Proof. (of Theorem 5.1) We prove Theorem 5.1 via a sequence of hybrid distributions. Let M:=[In ⊗(1, b, . . . , bl−1, 0, . . . , 0)]∈n×(nk) be the linear transformation corresponding to obtaining the error vector e ∈n from x∈nk. Let L be the linear transformation used in Lemma 4.4.
Note that the equation u=v+Ap=Gx+Ap=Fz+e+Ap=AL(p, x)+e=Ay+e(modq) holds in all the hybrids. The difference lies in the order and the distributions that these vectors are sampled from. In all the hybrids except the last one, A is generated as A:=[Ā|F−ĀR] for Ā:=[In, A0] ∈qn×2n.
Real distribution: The real distribution of {(A, y, u, e)} is:
Hybrid 1: Here we swap the order of sampling u and v by first sampling v←U(qn) and setting
We keep x, e, and y unchanged: x←DΛ
Hybrid 2: Instead of sampling a uniform v∈qn and a G-lattice sample x←DΛ
Lemma 4.2 implies Hybrid 1 and Hybrid 2 are statistically close.
Hybrid 3: We combine p, x into the joint distribution
Hybrid 4: Here we apply the linear transformation theorem on e, y:
Lemmas 4.3 and 4.4 imply Hybrids 3 and 4 are statistically close.
Final distribution: Sample A←U(qn×m) and keep the rest of the vectors from the same distribution as Hybrid 4 (notice that the trapdoor R of A are not used to sample p, x, e and y). The final distribution is computationally indistinguishable from Hybrid 4 assuming LW.
D. From an Approximate G-trapdoor to an Approximate Kernel-Trapdoor
Given an approximate G-trapdoor for A∈qn×m in the form of R∈(m−w)×w where w=n(k−l) and
We transform it to an approximate kernel of m columns (i.e., smaller than n+m) that still preserves the trapdoor functionality.
The transformation is similar to the one from [MP12] that works over the exact trapdoors. Write A=[A1|A2] where A1∈wx(m−w) . Let W ∈wx(m−w) be an approximate short preimage of A1 under the public matrix F, i.e. F·W=A1+E1(modq). Let S ∈wxw be an approximated kernel of F, i.e. F·S=E2(modq) (both the norms of E1 and E2 are small). Then
So K is an approximate kernel of A of m columns (i.e. lower than n+m) that is at the same time an approximate trapdoor.
We spell out the details of the hash-and-sign signature scheme from [GPV08] instantiated with the approximate G-trapdoor instead of an exact trapdoor.
Recall the parameters from the last section. We set k=[logbq], set l to be the number of entries dropped from the G-trapdoor such that 1≤l<k and m=n(2+(k−l)). Let σ, s∈+ be the discrete Gaussian widths of the distributions over the cosets of Λq⊥(G) and Λq⊥(A) respectively. Let χ be the distribution of the entries of the trapdoor R chosen so that LWEn,n,q,χ,U(
Construction 5.1 Given an approximate trapdoor sampler from Theorem 4.1, a hash function H={Hλ:{0,1}*→Rλ} modeled as a random oracle, we build a signature scheme as follows.
We provide a proof-of-concept implementation of the signature. Embodiments can use several groups of parameters using different dimensions n, moduli q, bases b, targeting different security level (mainly around 100-bit and 192-bit security). In each group of parameters, we use fixed n, q, b, and compare the use of exact trapdoor (under our reference implementation) versus approximate trapdoor. In
Let us first explain how we make the security estimations. The concrete security estimation of lattice-based cryptographic primitive is a highly active research area and more sophisticated methods are proposed recently. Here we use relatively simple methods to estimate the pseudorandomness of the public-key (henceforth “LWE security”), and the hardness of breaking approximate ISIS (henceforth “AISIS security”). Let us remark that our estimations may not reflect the state-of-art, but at least provide a fair comparison of the parameters for the exact trapdoor versus the approximate trapdoor.
LWE security depends on the choices of q, n, and the Gaussian width τ of the trapdoor R. The estimation of LWE security was done with the online LWE bit security estimator with BKZ as the reduction modell [ACD+18].
For the approximate ISIS problem, the only direct cryptanalysis result we are aware of is the work of Bai et al. [BGLS19], but it is not clearly applicable to the parameters we are interested. Instead we estimate AISIS through ISIS,n,m,q,α+β following the reduction in Lemma 3.5, where α and β are the upper-bounds of l2 norm of the error z and preimage x. We estimate the security level of ISISn,m,q,α+β based on how many operations BKZ would take to find a vector in the lattice
of length α+β. Further, we can throw away columns in A. We choose to only use 2n columns of A as done in [BFRS18], denoted A2n, since Minkowski's theorem tells us
has a short enough vector. Following [APS15, ACD+18], we use sieving as the SVP oracle with time complexity 2.292k+16.4 in the block size, k. BKZ is expected to return a vector of length δ2ndet1/2n for a lattice of dimension 2n. Hence, we found the smallest block size k achieving the needed δ corresponding to forging a signature
Finally, we used the heuristic
to determine the relation between k and δ, and we set the total time complexity of BKZ with block-size k, dimension
For an estimation of 100-bit security, our reference implementation for the exact trapdoor under the modulus q≈224 and base b=2 matches the parameters reported in [BB13] (the parameters in the other implementation [BFRS18, GPR+18] are possibly measured in different ways). We also use smaller moduli and bigger bases to reduce the size and increase the security level. The parameters in
Our implementation shows that the sizes of the public-key & signature can be reduced to 5 kB & 4.45 kB for an estimation of 100-bit security, and 11.25 kB & 9.38 kB for an estimation of 192-bit security. Those are still larger than, but much closer to the sizes for the signatures based on the rejection sampling approach [Lyu12, BG14, DKL+18, ABB+19]. As a reference, the sizes of the public-key & signature for qTESLA [ABB+19] are 4.03 kB & 3.05 kB for an estimation of 128-bit security, and 8.03 kB & 6.03 kB for an estimation of 192-bit security.
In the security analysis we use the following properties on the distributions produced by Approx.SamplePre proven in Theorem 4.1:
To prove that the signature satisfies the strong EU-CMA security, we can make use of an additional “near-collision-resistance” property for Ajtai's function, which can be based on the standard SIS assumption. Let us remark that without this property, we can still prove the signature scheme satisfies static security based on the hardness of the approximate ISIS problem.
Lemma 5.2 (The near-collision-resistance of Ajtai's function) For any n, m, q∈ and α, β∈. If there is an efficient adversary A that given A ←U(qn×m), finds x1≠x2∈m such that
∥x1∥≤β and ∥x2∥≤β and ∥Ax1−Ax2(mod q)∥≤2α
Then there is an efficient adversary B that solves
Proof. Suppose B gets an HNF.
challenge (which is as hard as
with the public matrix [In|A], B sends A to A, gets back x1≠x2∈m such that
∥x1∥≤β and ∥x2∥≤β and ∥y:=Ax1−Ax2(modq)∥≤2α
B then sets z:=[−yt|(x1−x2)t]t as the solution. z is then non-zero and satisfies ∥z∥≤2√{square root over (α2+β2)} and [In|A]z=0(modq).
Theorem 5.3 Construction 6.1 is strongly existentially unforgeable under a chosen-message attack in the random oracle model assuming the hardness of
and LW.
Proof. The correctness follows the functionality of the approximate trapdoor.
Suppose there is a polynomial time adversary A that breaks the strong EU-CMA of the signature scheme, we construct a polynomial time adversary B that breaks the near-collision-resistance of Ajtai's function, which is as hard as
To start, B sends Ajtai's function A to A as the public key for the signature scheme. Once A makes a random oracle query w.r.t. a message m, B samples y←Dpre, computes u:=Ay+Derr(modq) as the random oracle response on m. B then replies u to A and stores (m, u) in the random oracle storage, (m, y) in the message-signature pair storage. Once A makes a signing query on the message m (wlog assume m has been queried to the random oracle before, since if not B can query it now), B finds (m, y) in the storage and reply y as the signature. The signatures and the hash outputs produced by B are indistinguishable from the real ones due to the properties of the distributions Dpre and Derr, and the assumption that a real public key is indistinguishable from random under LW.
Without loss of generality, assume that before A tries to forge a signature on m*, A has queried H on m*. Denote the pair that B prepares and stores in the random oracle storage as (m*, u*), and the pair in the signature storage as (m*, y*). Finally A outputs y as the forged signature on m*. So we have ∥A(y−y*)(modq)∥≤2α. It remains to prove that y≠y* so as to use them as a near-collision-pair. If m* has been queried to the signing oracle before, then y≠y* by the definition of a success forgery; if m* has not been queried to the signing oracle before, then y* is with high min-entropy by the settings of the parameter, so y≠y* with overwhelming probability.
Any of the computer systems mentioned herein may utilize any suitable number of subsystems. Examples of such subsystems are shown in
The subsystems shown in
A computer system can include a plurality of the same components or subsystems, e.g., connected together by external interface 81, by an internal interface, or via removable storage devices that can be connected and removed from one component to another component. In some embodiments, computer systems, subsystem, or apparatuses can communicate over a network. In such instances, one computer can be considered a client and another computer a server, where each can be part of a same computer system. A client and a server can each include multiple systems, subsystems, or components.
Aspects of embodiments can be implemented in the form of control logic using hardware circuitry (e.g. an application specific integrated circuit or field programmable gate array) and/or using computer software with a generally programmable processor in a modular or integrated manner. As used herein, a processor can include a single-core processor, multi-core processor on a same integrated chip, or multiple processing units on a single circuit board or networked, as well as dedicated hardware. Based on the disclosure and teachings provided herein, a person of ordinary skill in the art will know and appreciate other ways and/or methods to implement embodiments of the present invention using hardware and a combination of hardware and software.
Any of the software components or functions described in this application may be implemented as software code to be executed by a processor using any suitable computer language such as, for example, Java, C, C++, C#, Objective-C, Swift, or scripting language such as Perl or Python using, for example, conventional or object-oriented techniques. The software code may be stored as a series of instructions or commands on a computer readable medium for storage and/or transmission. A suitable non-transitory computer readable medium can include random access memory (RAM), a read only memory (ROM), a magnetic medium such as a hard-drive, or an optical medium such as a CD (compact disk) or DVD (digital versatile disk), flash memory, and the like. The computer readable medium may be any combination of such storage or transmission devices.
Such programs may also be encoded and transmitted using carrier signals adapted for transmission via wired, optical, and/or wireless networks conforming to a variety of protocols, including the Internet. As such, a computer readable medium may be created using a data signal encoded with such programs. Computer readable media encoded with the program code may be packaged with a compatible device or provided separately from other devices (e.g., via Internet download). Any such computer readable medium may reside on or within a single computer product (e.g. a hard drive, a CD, or an entire computer system), and may be present on or within different computer products within a system or network. A computer system may include a monitor, printer, or other suitable display for providing any of the results mentioned herein to a user.
Any of the methods described herein may be totally or partially performed with a computer system including one or more processors, which can be configured to perform the steps. Thus, embodiments can be directed to computer systems configured to perform the steps of any of the methods described herein, potentially with different components performing a respective step or a respective group of steps. Although presented as numbered steps, steps of methods herein can be performed at a same time or in a different order. Additionally, portions of these steps may be used with portions of other steps from other methods. Also, all or portions of a step may be optional. Additionally, and of the steps of any of the methods can be performed with modules, circuits, or other means for performing these steps.
The specific details of particular embodiments may be combined in any suitable manner without departing from the spirit and scope of embodiments of the invention. However, other embodiments of the invention may be directed to specific embodiments relating to each individual aspect, or specific combinations of these individual aspects.
The above description of exemplary embodiments of the invention has been presented for the purpose of illustration and description. It is not intended to be exhaustive or to limit the invention to the precise form described, and many modifications and variations are possible in light of the teaching above. The embodiments were chosen and described in order to best explain the principles of the invention and its practical applications to thereby enable others skilled in the art to best utilize the invention in various embodiments and with various modifications as are suited to the particular use contemplated.
A recitation of “a”, “an” or “the” is intended to mean “one or more” unless specifically indicated to the contrary. The use of “or” is intended to mean an “inclusive or,” and not an “exclusive or” unless specifically indicated to the contrary.
All patents, patent applications, publications and description mentioned herein are incorporated by reference in their entirety for all purposes. None is admitted to be prior art.
This application is a Divisional of U.S. patent application Ser. No. 17/429,317 filed Aug. 6, 2021, which is a 371 National Phase Application based on PCT Patent Application No. PCT/US2019/044753 filed Aug. 1, 2019, which claims the benefit of the filing date of U.S. Provisional Application No. 62/803,325, filed on Feb. 8, 2019, all of which are herein incorporated by reference in their entirety.
Number | Date | Country | |
---|---|---|---|
62803325 | Feb 2019 | US |
Number | Date | Country | |
---|---|---|---|
Parent | 17429317 | Aug 2021 | US |
Child | 18066732 | US |