METHODS AND SYSTEMS FOR THE IMPLEMENTATION OF NTRU-LIKE CRYPTOSYSTEM RELYING ON OPTICAL FOURIER TRANSFORMS

TECHNICAL FIELD

The invention generally relates to systems for public-key cryptography. More particularly, the invention relates to secure information exchange suitable for implementation on optical devices. The invention has particular applications which are quantum secure (that is, would not be easy to break by a quantum computer) and which thus have long term security as required by banking applications for example.

BACKGROUND AND PRIOR ART KNOWN TO THE APPLICANT(S)

Currently there are two main types of cryptosystems used to securely exchange information over an untrusted channel: private-key, which is generally fast, but requires that keys have already been securely exchanged, and public-key which is generally slower, but has no such requirement. For example, a public-key cryptosystem may be used to securely exchange keys to be used in a private-key cryptosystem.

Some of the most common public-key cryptosystems, such as RSA and elliptic-curve algorithms, are based on arithmetic problems which a quantum computer could solve efficiently, so that they are not future-proof.

The NTRU family of cryptosystems, to which NTRUEncrypt belongs, is one of the main proposals for post-quantum public-key cryptography, combining strong security arguments, resistance to known quantum attacks, and small key size. NTRU cryptosystems are lattice-based and expected to be quantum-secure; NTRU is a second-round candidate in the NIST post-quantum standardization process. The main operations in NTRU involve polynomial multiplication. U.S. Pat. No. 6,081,597A discloses prior art cryptosystems of this kind.

The following prior art is acknowledged: Bagheri Khadijeh et al entitled “A non-commutative cryptosystem based on quaternion algebras”, Designs, Codes and Cryptography, Kluwer Academic Publishers, vol. 86, no. 10, 22 December 2017 (2017-12-22), pages 2345-2377, XP036577232, DOI: 10.1007/S10623-017-0451-4.

This prior art reference requires the use of quaternion algebra over a ring of multinomials. By contrast, embodiments of the invention depart from this teaching as further described in the following section.

Embodiments of the invention seek to improve on the existing prior art methodologies.

SUMMARY OF THE INVENTION

In a broad independent aspect, an embodiment of the invention provides a crypto-method of securely communicating a message; the method comprising the steps of:

- selecting a ring R′ of bi or multi variate multinomials;
- (A multinomial over a ring R′ is a sum of a finite number of terms, each consisting of an element of R′ times powers of several variables.)
- generating a private key which has a multinomial f;
- generating a public key which has a multinomial h;
- encrypting by representing said message as a multinomial m in R′, selecting a random multinomial r, and computing an encrypted message; and
- decrypting said message using said private key.

In preferred embodiments, the crypto-method employs a multinomial ring which employs multinomial algebra over that ring. This difference over the most recently cited prior art is not only formal but very significant in the context of an optical implementation which provides an efficient way to perform multinomial inversion. By contrast the prior art works with a specific ideal. Embodiments of the invention allow for efficient multinomial inversion in any ideal making it more versatile than the prior art methodology. This is further advantageous in embodiments of optical implementation where the choice of ideal will be partially constrained by the optical device input size and output accuracy.

In a subsidiary aspect, the steps comprise multiplications which are performed in a ring of multinomials.

In a further subsidiary aspect, the steps comprise multiplications which are performed in the canonical algebra over R′.

In a further subsidiary aspect, the method further comprises the step of casting said message as a two-dimensional array.

In a further subsidiary aspect, the ring R′ is chosen as the quotient of Z[X,Y] over the ideal generated by two uni-variate polynomials.

In a further subsidiary aspect, the steps comprise multiplications which are performed modulo two polynomials.

In a further subsidiary aspect, the method further comprises the step of providing an optical processing system and thereby performing a two-dimensional discrete Fourier transform.

In a further subsidiary aspect, the ring R′ of multinomials is represented by the formula ℤ[X, Y]/⟨X^N₁−1, Y^N₂−1⟩, where (N₁, N₂)∈(ℕ*)², wherein N₁ and N₂ are prime numbers.

In a further subsidiary aspect, the crypto-method further comprises the step of providing a message digest m in the form of a multinomial.

In a further subsidiary aspect, the crypto-method further comprises the steps of providing 2D optical arrays and wherein the multinomial m represents a discrete 2D array.

In a further subsidiary aspect, the crypto-method further comprises the steps of providing an optical system suitable for performing both a Fourier transform and an inverse Fourier transform and optically realising both said Fourier transform and said inverse Fourier transform.

In a further subsidiary aspect, the crypto-method comprises the steps of performing a multinomial multiplication and reducing coefficients of a product of multinomials.

In a further subsidiary aspect, the method further comprises the step of reducing the amplitude of individual coefficients of the multinomials.

In a further subsidiary aspect, the method further comprises the step of iteratively reducing coefficients of a product of multinomials by writing each factor as a sum of multinomials with smaller coefficients.

In a further subsidiary aspect, the crypto-method further comprises the step of reducing coefficients by reducing degrees of the multinomials to be multiplied.

In a further subsidiary aspect, the method further comprises the step of reducing the coefficients of a product of multinomials by writing each factor as a sum of multinomials with smaller degrees.

In a further subsidiary aspect, the security of the cryptosystem is established by reduction to a short vector problem using tensors.

In a further broad aspect, the system comprises a processor configured to perform the steps of:

- selecting a ring R′ of bi or multi variate multinomials;
- generating a private key which has a multinomial f;
- generating a public key which has a multinomial h;
- encrypting by representing said message as a multinomial m in R′, selecting a random multinomial r, and computing an encrypted message; and
- decrypting said message using said private key.

In a further broad aspect, the system comprises an optical processor configured to perform Fourier transform processing to carry out a multinomial multiplication in the ring R of multinomials.

In a further broad aspect, the system comprises an optical processor configured to perform Fourier transform processing to carry out a multinomial multiplication in the canonical algebra of a ring R′ of two or more variate multinomials.

In a further broad aspect, the method for encrypting and decrypting a digital message, the method comprises the steps of:

- selecting a ring R, an ideal q of R, and a hash function;
- (An ideal is a subset of a ring which has a group structure and is stable under multiplication by any element of the ring.)
- generating elements f and g of the ring R, and generating an element f⁻¹ that is an inverse of f in the ring R modulo q;
- producing a public key that includes h where h is equal to a product that can be derived using g and f⁻¹;
- producing a private key from which f and g can be derived;
- producing a message digest m by applying the hash function to the digital message;
- encrypting the message using the public key;
- decrypting the message using the private key
- wherein in the step of producing the encrypted message, a multiplication is computed in the ring R using Fourier transform processing implemented optically.

In a subsidiary aspect, the steps are performed after reducing the magnitude of the multinomials as in the method of any one of the preceding aspects.

In a further subsidiary aspect, the system comprises an electronic processor configured to perform the steps of:

- selecting a ring R, an ideal q of R, and a hash function;
- generating elements f and g of the ring R, and generating an element f⁻¹ that is an inverse of f in the ring R modulo q;
- producing a public key that includes h where h is equal to a product that can be derived using g and f⁻¹;
- producing a private key from which f and g can be derived;
- producing a message digest m by applying the hash function;
- encrypting the message digest using the public key;
- decrypting the message using the private key
- the system further comprising an optical processor configured to perform Fourier transform processing and compute a multiplication in the ring R for producing the message digest m.

In contrast to the prior art, in certain embodiments, multiplications are performed in a ring of multinomials, instead of polynomials (in prior systems, the message and keys being cast as polynomials before performing encryption or decryption). According to embodiments of the invention, the multiplications are cast as multinomials with two variables (or more), being equivalent to two-dimensional (or higher) convolutions which can be accelerated using optical Fourier transforms.

The public-key cryptosystem methodology outlined herein as “NTRU2D” is based on NTRU but using a different set of public and private keys as well as a different algebraic structure. It is a multi-dimensional (at least two-dimensional) system, which synergistically allows for implementation on an optical device performing a two-dimensional discrete Fourier transform. The resulting system works similarly to NTRUEncrypt from a user point of view, but with different internal mechanics. Advantageously, the system can be straightforwardly extended to a higher number of dimensions, although the two-dimensional version is probably the best-suited for optical implementation.

The method could be efficiently implemented on optical chips of the type developed by Optalysys Ltd, leading to potentially decreased runtimes and smaller power consumption. PCT/EP2020/065740 illustrates examples of optical systems and is incorporated by reference.

In a subsidiary aspect, the ring R of multinomials is represented by the formula ℤ[X,Y]/⟨X^N₁−1, Y^N₂−1⟩, where (N₁, N₂)∈(ℕ*)², wherein N₁ and N₂ are prime numbers.

In a subsidiary aspect, the message digest m is cast as a multinomial representing a 2D array. Accordingly, multiplications may be performed modulo two (or more) polynomials. In the prior art, the correspondence between polynomial multiplication and convolution is due to a reduction (mathematically, a modulo operation) using a fixed polynomial, whose degree is one of the parameters of the cryptosystem. Advantageously, the degrees here are two parameters of the cryptosystem.

In a further subsidiary aspect, the method further comprises the step of performing a multinomial multiplication in the ring R of multinomials using Fourier transform processing. The multiplication may thus be component-wise multiplication. In a further subsidiary aspect, the method further comprises the step of applying an inverse Fourier transform and representing the message digest m as a multinomial.

In a preferred embodiment, the Fourier transform (and inverse Fourier transform) processing is implemented optically. This provides enhanced security and potentially faster processing.

In a further subsidiary aspect, performing the multinomial multiplication comprises the step of reducing coefficients of a product of multinomials. In an embodiment, the step of reducing coefficients comprises reducing the amplitude of the individual coefficients of the multinomials to be multiplied. In an alternative embodiment, the step of reducing coefficients comprises reducing degrees of the multinomials to be multiplied. Advantageously, each of these algorithms compensates for potential low output accuracy of optical systems.

In a preferred embodiment, using a combination of these two algorithms, multinomial multiplication can be performed on a low-accuracy device at the expense of an increased runtime. They can in principle be applied to NTRU or NTRUPrime as well as NTRU2D.
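The amplitude-reduction idea can be illustrated with a minimal sketch, under assumptions that are not taken from the text above: a one-variable cyclic ring, a hypothetical splitting base `base`, and an exact `cyclic_mul` standing in for the low-accuracy multiplier. Each factor is written as a sum of two polynomials with smaller coefficients, and the four partial products are recombined by linearity.

```python
def cyclic_mul(a, b):
    """Exact product in Z[X]/<X^n - 1>; stands in for a low-accuracy multiplier."""
    n = len(a)
    out = [0] * n
    for i, ai in enumerate(a):
        for j, bj in enumerate(b):
            out[(i + j) % n] += ai * bj
    return out

def split(a, base):
    """Write a = low + base * high, with every coefficient of low in [0, base)."""
    return [c % base for c in a], [c // base for c in a]

def mul_by_parts(a, b, base):
    """Product of a and b computed from four products of smaller-coefficient factors."""
    a0, a1 = split(a, base)
    b0, b1 = split(b, base)
    p00, p01 = cyclic_mul(a0, b0), cyclic_mul(a0, b1)
    p10, p11 = cyclic_mul(a1, b0), cyclic_mul(a1, b1)
    return [w + base * (x + y) + base * base * z
            for w, x, y, z in zip(p00, p01, p10, p11)]

# Hypothetical inputs chosen only for illustration.
a = [1000, -37, 512, 7, 0]
b = [-900, 255, 3, 64, 11]
result = mul_by_parts(a, b, base=32)
```

Each of the four partial products involves coefficients of smaller magnitude than the original factors, at the cost of four multiplications instead of one; applying `split` again to the high parts would iterate the reduction.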

In a subsidiary aspect, the security of the cryptosystem is related to the difficulty of solving a short vector problem (which can be proved using tensors, instead of matrices for the prior art). Accordingly, the reduction involves a different type of mathematical objects compared to the prior art.

In a subsidiary aspect, using Fourier transform processing may comprise performing a block decomposition for a discrete 2D Fourier transform. One advantage of such a decomposition is to reduce the maximum modulus of the Fourier coefficients of each block, thus potentially improving the accuracy. For example, a typical workflow can be:

- 1. Perform the Fourier transform on each block using a fast but low-accuracy device.
- 2. Combine the results to compute the full Fourier transform on a slower, high-accuracy device.

In a further broad aspect, the invention provides a system comprising an electronic processor configured to perform the steps of:

- selecting a ring R of multinomials, an ideal q of R, and a hash function;
- generating elements f and g of the ring R, and generating an element f⁻¹ that is an inverse of f in the ring R modulo q;
- producing a public key that includes h where h is equal to a product that can be derived using g and f⁻¹;
- producing a private key from which f and g can be derived;
- producing a message digest m by applying the hash function to the digital message, the message digest being represented as a multinomial in the ring R;
- encrypting the message digest using the public key;
- decrypting the message using the private key.

In a subsidiary aspect, the system comprises an optical processor configured to perform Fourier transform processing to carry out a multinomial multiplication in the ring R of multinomials.

In a further broad aspect, the invention provides a method for encrypting and decrypting a digital message, the method comprising the steps of:

- selecting a ring R, an ideal q of R, and a hash function;
- generating elements f and g of the ring R, and generating an element f⁻¹ that is an inverse of f in the ring R modulo q;
- producing a public key that includes h where h is equal to a product that can be derived using g and f⁻¹;
- producing a private key from which f and g can be derived;
- producing a message digest m by applying the hash function to the digital message;
- encrypting the message digest using the public key;
- decrypting the message using the private key,
- wherein in the step of producing the message digest m, a multiplication is computed in the ring R using Fourier transform processing implemented optically.

In a further broad aspect, the invention provides a system comprising an electronic processor configured to perform the steps of:

- selecting a ring R, an ideal q of R, and a hash function;
- generating elements f and g of the ring R, and generating an element f⁻¹ that is an inverse of f in the ring R modulo q;
- producing a public key that includes h where h is equal to a product that can be derived using g and f⁻¹;
- producing a private key from which f and g can be derived;
- producing a message digest m by applying the hash function to the digital message;
- encrypting the message digest using the public key;
- decrypting the message using the private key,
- the system further comprising an optical processor configured to perform Fourier transform processing and compute a multiplication in the ring R for producing the message digest m.

In a further broad independent aspect, the invention provides a method for encrypting and decrypting a digital message, the method comprising the steps of:

- selecting a ring R of multinomials, an ideal q of R, and a hash function;
- generating elements f and g of the ring R, and generating an element f⁻¹ that is an inverse of f in the ring R modulo q;
- producing a public key that includes h where h is equal to a product that can be derived using g and f⁻¹;
- producing a private key from which f and g can be derived;
- producing a message digest m by applying the hash function to the digital message, the message digest being represented as a multinomial in the ring R;
- encrypting the message digest using the public key;
- decrypting the message using the private key.

As a public-key cryptosystem, embodiments of the present invention can be advantageously used in secure communication, including banking applications. It could be used, for instance, to exchange keys between two participants over an untrusted channel and establish a secure communication protocol.

In a subsidiary aspect, in accordance with any of the preceding aspects, if the number of variables is smaller or larger than 2, the optical Fourier transform is computed by performing sequential one- and two-dimensional Fourier transforms.

In a subsidiary aspect, in accordance with any of the preceding aspects, the two-dimensional transforms are performed either directly using the optical system or in sequential steps using a Cooley-Tukey algorithm.

In a subsidiary aspect, in accordance with any of the preceding aspects, the one-dimensional transforms are performed using a modification of a Cooley-Tukey algorithm with different twiddle factors.

BRIEF DESCRIPTION OF THE FIGURES

FIG. 1 shows a flow diagram in accordance with an embodiment of the invention.

FIG. 2 shows a flow diagram in accordance with an embodiment of the invention.

FIG. 3 shows a flow diagram in accordance with an embodiment of the invention.

FIG. 4 shows a flow diagram in accordance with an embodiment of the invention.

FIG. 5 shows a flow diagram in accordance with an embodiment of the invention.

FIG. 6 shows a flow diagram of multinomial multiplication.

FIG. 7 shows a flow diagram of an optical process.

DETAILED DESCRIPTION
1 INTRODUCTION

NTRU is a public-key cryptosystem first proposed in 1996 [1], published in 1998 [2], and in the public domain since 2017. It consists of two families of systems: NTRUEncrypt, an asymmetric encryption scheme, and NTRUSign, a digital signature scheme. Contrary to most current schemes based on arithmetic problems which are prone to quantum attacks (using for instance Shor's algorithm), NTRU is based on lattice problems against which no efficient attack is known, and which are conjectured to be impossible to break in polynomial time. (The term ‘conjectured’ here is understood in a strong sense: decades of research on these problems and attempts at breaking them have found no evidence that they can be broken in polynomial time.) No security vulnerability was found in the more than twenty years since it was first proposed, and it is thought to be resistant against both classical and quantum attacks.

A provably secure (but less efficient) version was proposed in 2013 [3] and is currently studied by the European Commission [4] as a possible future-proof alternative to current cryptosystems. Another version, called NTRU Prime, was proposed in 2016 [5] to remove some algebraic structures which might introduce weaknesses, specifically to reduce the number of automorphisms and other endomorphisms of the ring of polynomials in which calculations are performed. (However, at the time of writing, no efficient attack making use of these structures is known.)

(A ring (R, +,∘) is a set R with two binary internal operations R×R→R, hereafter denoted by + and ∘, satisfying the following axioms:

- 1. (R, +) is an abelian group, i.e.,
  - + is associative: for each (a, b, c)∈R³, (a+b)+c=a+(b+c).
  - + is commutative: for each (a, b)∈R², a+b=b+a.
  - There exists an element of R, called 0, such that, for each a∈R, a+0=a.
  - For each a∈R, there exists b∈R such that a+b=0.
- 2. (R,∘) is a monoid, i.e.,
  - ∘ is associative: for each (a, b, c)∈R³, (a∘b)∘c=a∘(b∘c).
  - There exists an element of R, called 1, such that, for each a∈R, a∘1=1∘a=a.
- 3. ∘ is distributive over +, i.e.,
  - for each (a, b, c)∈R³, a∘(b+c)=(a∘b)+(a∘c) (left distributivity).
  - for each (a, b, c)∈R³, (a+b)∘c=(a∘c)+(b∘c) (right distributivity).)

Both the original NTRUEncrypt and NTRU Prime have advanced to the second round of the NIST Post-Quantum Cryptography Standardization project (https://csrc.nist.gov/projects/post-quantum-cryptography/round-2-submissions). Besides its expected post-quantum security, the NTRU family also makes key generation particularly efficient [6], opening possible use cases where the key needs to be changed regularly.

Central to the NTRU algorithms are polynomial multiplications in a finite ring. These operations can be mapped to convolutions of vectors, and thus have efficient implementations in Fourier space. For this reason, the optical Fourier transform of embodiments of the invention can significantly increase the speed and decrease the power consumption of the algorithm.

Naive polynomial multiplication has a complexity O(n²) where n is the degree of the polynomials. A recursive algorithm splitting each polynomial in two reduces the number of required scalar multiplications to O(n^(log₂ 3)) at the expense of some overhead. An algorithm based on the fast Fourier transform has a complexity O(n log n). Using an optical Fourier transform can, in certain embodiments, reduce it to O(n), with the Fourier transform being performed in O(1) runtime.
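As an illustration of the first two complexity classes, the following minimal sketch compares the naive product with a recursive splitting product of the Karatsuba type. The function names are hypothetical, and the recursive version assumes, for simplicity, two coefficient lists of equal length that is a power of two.

```python
def naive_mul(a, b):
    """Schoolbook product: one scalar multiplication per pair of coefficients."""
    out = [0] * (len(a) + len(b) - 1)
    for i, ai in enumerate(a):
        for j, bj in enumerate(b):
            out[i + j] += ai * bj
    return out

def karatsuba(a, b):
    """Recursive product using three half-size products instead of four.

    Assumes len(a) == len(b) and that this length is a power of two.
    """
    n = len(a)
    if n == 1:
        return [a[0] * b[0]]
    h = n // 2
    a0, a1, b0, b1 = a[:h], a[h:], b[:h], b[h:]
    low = karatsuba(a0, b0)
    high = karatsuba(a1, b1)
    mid = karatsuba([x + y for x, y in zip(a0, a1)],
                    [x + y for x, y in zip(b0, b1)])
    out = [0] * (2 * n - 1)
    for i in range(2 * h - 1):
        out[i] += low[i]
        out[i + h] += mid[i] - low[i] - high[i]  # cross term a0*b1 + a1*b0
        out[i + 2 * h] += high[i]
    return out

a = [1, 2, 3, 4]   # 1 + 2X + 3X^2 + 4X^3
b = [5, 6, 7, 8]   # 5 + 6X + 7X^2 + 8X^3
product = karatsuba(a, b)
```

The three recursive calls per level are what give the O(n^(log₂ 3)) count of scalar multiplications; the additions and subtractions around them are the overhead mentioned above.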

Example of Possible Application

A public-key cryptosystem can be used, for instance, in inter-device communication in an Internet of Things network. To take a specific example, two devices would need to communicate securely to exchange personal data about a user (e.g., medical data exchanged between one device used to perform a diagnostic and a hospital storage system) which need to be protected from external unauthorized access. A secure communication channel could be opened between the two devices via the exchange of encryption ciphers, or keys for any other encryption system, which would themselves be encrypted using a public-key cryptosystem to prevent interception by any external actor. For devices operating on low power, making the cryptosystem secure enough would be challenging with current technology. The cryptosystem proposed here will alleviate this problem as its most computationally intensive operation can be performed optically, significantly reducing the power usage.

2 THE NTRUEncrypt FAMILY
2.1 Description of the Algorithm

NTRUEncrypt is a family of cryptosystems with three parameters N, p, and q in ℕ* such that

- N is a prime number (see Appendix A.2),
- q>p,
- q and p are coprime.

It involves polynomial multiplications in the ring R=ℤ[X]/⟨X^N−1⟩, which can be recast as a convolution and efficiently performed in Fourier space (see appendix A.1.1).

As any public-key cryptosystem, it involves three steps: generation of private and public keys, encryption of a message, and decryption. We now describe each of these steps.

Generation of public and private keys, with reference to FIG. 1: The private key is a doublet (f, f_p) where

- f is an element of R with coefficients in {−1, 0, +1} which has an inverse modulo p and an inverse modulo q,
- f_p is the inverse of f modulo p, i.e., f·f_p=1 mod p, where · denotes the multiplication in R.

(Technically, f_p can be recovered from f, so the private key may be taken as f only. However, in practice it is generally more convenient to save f_p rather than to re-compute it at the decryption stage [6]. The requirement that the coefficients of f be between −1 and +1 can be relaxed in practice, although they must still be ‘small’.)

The public key is the polynomial h obtained by multiplying p, the inverse f_q of f modulo q, and a polynomial g in R with coefficients in {−1, 0, +1}, and taking the result modulo q: h=(p f_q·g) mod q.

An embodiment of how the inverse of a polynomial modulo X^N−1 can be computed is described in appendix A.3.1. To get f_p and f_q, the Euclidean algorithm described there must be performed with K=ℤ_p and K=ℤ_q, respectively. The relation with a short vector problem is outlined in appendix A.4.1.

Encryption, with reference to FIG. 2: The encryption procedure is:

- 1. Represent the message as a polynomial m in R with coefficients between 1−⌈p/2⌉ and ⌊p/2⌋.
- 2. Choose a random polynomial r in R with relatively small coefficients.
- 3. Compute the encrypted message e given by e=(r·h+m) mod q.
- 4. Discard the polynomial r.

It is important that r never be revealed.

Decryption, with reference to FIG. 3: For the decryption, all the modulo operations are centred: for any integer n, a quantity modulo n is taken between 1−⌈n/2⌉ and ⌊n/2⌋. The decryption procedure is:

- 1. Compute a=(f·e) mod q. Using the definitions of e and h, we have: a=(pr·g+f·m) mod q. If q is sufficiently large and r is sufficiently small, we then have a=pr·g+f·m. In the following, we assume this is true.

NB: This is always true if

$\left\lceil \frac{q}{2} \right\rceil - 1 \geq p \lVert r \rVert_{1} + \left\lfloor \frac{p}{2} \right\rfloor N,$

where ∥·∥₁ denotes the sum of the absolute values of the coefficients. One can relax the assumption on q by imposing some conditions on the private key. Indeed, the condition a=pr·g+f·m is satisfied for any message m provided

$\left\lceil \frac{q}{2} \right\rceil - 1 \geq p \lVert r \rVert_{1} + \left\lfloor \frac{p}{2} \right\rfloor \lVert f \rVert_{1} .$

- 2. Compute b≡a mod p. This gives b≡(f·m) mod p.
- 3. Compute c≡(f_p·b) mod p. It should be equal to m.
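The three steps above (key generation, encryption, decryption) can be sketched end to end on toy parameters. This is an insecure illustration under stated assumptions, not the implementation of the embodiments: N=11, p=3, and a prime q=257 are hypothetical values chosen so that one extended Euclidean routine over a prime field yields both f_p and f_q; the numbers of ±1 coefficients are likewise arbitrary, and all function names are illustrative. With these choices, p∥r∥₁+⌊p/2⌋∥f∥₁ = 25 ≤ ⌈q/2⌉−1 = 128, so decryption cannot fail.

```python
import random

def trim(a):
    """Drop trailing zeros in place; the empty list is the zero polynomial."""
    while a and a[-1] == 0:
        a.pop()
    return a

def poly_divmod(a, b, p):
    """Quotient and remainder of a / b over the prime field F_p (b trimmed, nonzero)."""
    a = a[:]
    quo = [0] * max(len(a) - len(b) + 1, 1)
    inv_lead = pow(b[-1], -1, p)
    while len(a) >= len(b):
        shift = len(a) - len(b)
        c = a[-1] * inv_lead % p
        quo[shift] = c
        for i, bi in enumerate(b):
            a[shift + i] = (a[shift + i] - c * bi) % p
        trim(a)
    return trim(quo), a

def sub_mul(t0, quo, t1, p):
    """Return t0 - quo * t1 over F_p."""
    out = t0 + [0] * (len(quo) + len(t1))
    for i, qi in enumerate(quo):
        for j, tj in enumerate(t1):
            out[i + j] = (out[i + j] - qi * tj) % p
    return trim(out)

def invert(f, N, p):
    """Inverse of f in F_p[X]/<X^N - 1> by extended Euclid, or None if none exists."""
    r0, r1 = [p - 1] + [0] * (N - 1) + [1], trim([c % p for c in f])
    t0, t1 = [], [1]                      # invariant: r_i = t_i * f mod (X^N - 1)
    while r1:
        quo, rem = poly_divmod(r0, r1, p)
        r0, r1, t0, t1 = r1, rem, t1, sub_mul(t0, quo, t1, p)
    if len(r0) != 1:                      # gcd is not a nonzero constant
        return None
    c = pow(r0[0], -1, p)
    out = [0] * N
    for i, ti in enumerate(t0):
        out[i % N] = (out[i % N] + c * ti) % p
    return out

def conv(a, b, N):
    """Product in Z[X]/<X^N - 1>, i.e. a cyclic convolution."""
    out = [0] * N
    for i, ai in enumerate(a):
        for j, bj in enumerate(b):
            out[(i + j) % N] += ai * bj
    return out

def center(a, n):
    """Centred reduction: each coefficient is taken in [1 - ceil(n/2), floor(n/2)]."""
    lo = 1 - (n + 1) // 2
    return [(c - lo) % n + lo for c in a]

def ternary(N, ones, minus_ones, rng):
    coeffs = [1] * ones + [-1] * minus_ones + [0] * (N - ones - minus_ones)
    rng.shuffle(coeffs)
    return coeffs

def keygen(N, p, q, rng):
    while True:                           # retry until f is invertible mod p and mod q
        f = ternary(N, 4, 3, rng)
        f_p, f_q = invert(f, N, p), invert(f, N, q)
        if f_p is not None and f_q is not None:
            break
    g = ternary(N, 3, 3, rng)
    h = center([p * c for c in conv(f_q, g, N)], q)
    return (f, f_p), h

def encrypt(m, r, h, N, q):
    return center([x + y for x, y in zip(conv(r, h, N), m)], q)

def decrypt(e, f, f_p, N, p, q):
    a = center(conv(f, e, N), q)          # equals p r.g + f.m when the bound holds
    b = center(a, p)
    return center(conv(f_p, b, N), p)

N, p, q = 11, 3, 257                      # toy parameters, not secure
rng = random.Random(0)
(f, f_p), h = keygen(N, p, q, rng)
m = [1, 0, -1, 1, 1, 0, 0, -1, 0, 1, -1]
r = ternary(N, 3, 3, rng)
e = encrypt(m, r, h, N, q)
recovered = decrypt(e, f, f_p, N, p, q)
```

The choice of a prime q is a simplification for this sketch only; for a composite q such as a power of two, the inverse modulo q would have to be obtained differently.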

2.2 Practical Considerations

Secure parameters: Provided the parameters are chosen so that a short vector for the lattice of appendix A.4.1 cannot be found using lattice reduction techniques, the most efficient known attacks at the time of writing are meet-in-the-middle attacks. Their complexity is the square root of that of a brute-force attack [6]. Typically, the polynomials f, g, and r are chosen to have a fixed number of 1s, −1s, and 0s. If a polynomial has d₊ positive coefficients and d₋ negative ones, this leaves N!/(d₊! d₋! (N−d₊−d₋)!) possibilities. The complexity C_MITM of a meet-in-the-middle attack is thus, up to a prefactor,

$C_{MITM} (N, d_{+}, d_{-}) \approx \sqrt{\frac{N!}{d_{+}! \, d_{-}! \, (N - d_{+} - d_{-})!}}$

In 2012, the company Security Innovation, which held the NTRU patents, proposed the following sets of parameters in its NTRU tutorial (https://web.archive.org/web/20120606210107/http:/www.securityinnovation.com/security-lab/crypto/155.html, see also the article [6]). Here d_f, d_g, and d_r are the numbers of 1s in the polynomials f, g, and r, respectively. The first has one fewer −1 than it has 1s; the others have as many −1s as they have 1s, and the other coefficients are 0. The ‘security’ columns give the numbers of bits of security against meet-in-the-middle attacks as given in [6]. (We show them only when given by the NTRU team.) The key security is given by:

$\frac{1}{2} \log_{2} \left[ \binom{N}{2 d_{g}} \binom{2 d_{g}}{d_{g}} \right] = \frac{1}{2} \log_{2} \left[ \frac{N!}{(N - 2 d_{g})! \, {(d_{g})!}^{2}} \right],$

i.e., half the logarithm in base 2 of the number of possible polynomials g. Similarly, the message security is given by

$\frac{1}{2} \log_{2} \left[ \binom{N}{2 d_{r}} \binom{2 d_{r}}{d_{r}} \right] = \frac{1}{2} \log_{2} \left[ \frac{N!}{(N - 2 d_{r})! \, {(d_{r})!}^{2}} \right] .$
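The counting behind these security estimates can be evaluated with a short sketch. The function names are hypothetical, and the values N=11, d=3 are toy inputs chosen only so the numbers stay small; the result is the meet-in-the-middle estimate up to the prefactor mentioned above, not a full security analysis.

```python
from math import comb, factorial, log2

def count_ternary(N, d_plus, d_minus):
    """Number of polynomials with d_plus coefficients +1, d_minus coefficients -1, rest 0."""
    return factorial(N) // (factorial(d_plus) * factorial(d_minus)
                            * factorial(N - d_plus - d_minus))

def security_bits(N, d):
    """Half the base-2 logarithm of the number of polynomials with d ones and d minus-ones."""
    return 0.5 * log2(comb(N, 2 * d) * comb(2 * d, d))

# Toy values: the binomial form and the factorial form count the same set.
toy_count = count_ternary(11, 3, 3)
toy_bits = security_bits(11, 3)
```

Working with `log2` of the exact integer count, rather than taking a floating-point square root first, avoids overflow when N is in the hundreds.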

More parameters can be found in table 2.1 of [7].

NB: These parameters are still susceptible to multiple transmission attacks [6]: if the same message is sent several times using different random vectors r, an attacker could recover most of their coefficients by multiplying the difference between encrypted messages by a pseudo-inverse of h. As described in [6], this and some other attacks can be parried by appending a hash and the output of a generating function to the message before encryption. (The modified message is sometimes called a digital envelope, parrying multiple transmission attacks.) Similarly, if g(1)=0, we have e(1)=m(1), so that some information is leaked. This can be prevented by reserving one coefficient of the message to ensure m(1) has a specified value (e.g. 0) independent of the information to be conveyed. The resulting workflow is schematically represented in FIGS. 4 and 5.

FIG. 4 is a schematic representation of the NTRU encryption workflow. The message and digital envelope elements can be publicly revealed without security issues. The random vector and cipher elements should never be revealed. FIG. 5 is a schematic representation of the NTRU decryption workflow. The random vector is the same as that used for encryption.

A chosen ciphertext attack is described in reference [8], making use of the fact that some messages are not decrypted correctly to learn information on the private key. This and other similar attacks can in principle be parried using the construction described in reference [9], which makes NTRUEncrypt indistinguishable against adaptive chosen-ciphertext attack (IND-CCA2) (see also [10, 11] and references therein).

More recent estimates (see for instance the presentation [12]) suggest choosing the polynomial f in the form f=1+pF, where F has d_f coefficients equal to 1, d_f equal to −1, and its other coefficients vanish, with one of the following parameter sets and p=3, shown in the following table:

| N | q | d_f | d_g | d_m | Classical security | Quantum security |
|---|---|---|---|---|---|---|
| 443 | 2048 | 148 | 148 | 115 | 128 | 128 |
| 587 | 2048 | 169 | 196 | 157 | 192 | 128 |
| 743 | 2048 | 247 | 247 | 204 | 156 | 128 |

Performance of electronic implementations: The NTRU project (https://tbuktu.github.io/ntru/) reports (as of 14 May 2020) about 30,000 encryption operations per second, 22,000 decryption operations per second, or 2,000 key generations per second with 256 bits of security on an Intel Xeon™ at 1.6 GHz.

Two low-power implementations of NTRU on specialized hardware are proposed in [13] for the ‘moderate security’ parameters. The encryption-only design requires 1.72 μW and encryption takes a bit more than 56 ms. The encryption-decryption design requires about 6 μW; encryption and decryption take about 56.78 ms and 119.23 ms respectively.

2.3 NTRU Prime

The NTRU Prime family of cryptosystems (see also the ntruprime.cr.yp.to website) is a tweak of the original NTRU proposal using rings with a different structure; see reference [5]. Its development was motivated by recent quantum attacks against the Ideal-SVP [14, 15] casting doubts on the future-proofness of cryptosystems relying on cyclotomic rings. (A cyclotomic ring is a ring of integers of the number field ℚ(ε) where ε is a complex root of unity. One can show that it is equal to ℤ[ε], i.e., the ring of polynomials in ε with integer coefficients. If ε is a root of unity and n is the smallest positive integer such that εⁿ=1, the ring of integers of ℚ(ε) is isomorphic to

$\frac{\mathbb{Z} [X]}{\langle X^{n} - 1 \rangle} .$)

While these are not known to affect the security of NTRU, working with different rings is expected to reduce the probability that successful attacks will be found in the near- or medium-term future. NTRU Prime is currently a second-round candidate in the NIST Post-Quantum Cryptography Standardization Project (https://csrc.nist.gov/projects/post-quantum-cryptography).

The central idea of NTRU Prime is to work in the ring of polynomials ℤ[X]/⟨X^N−X−1⟩ for some prime number N, to reduce the number of automorphisms and other endomorphisms which might be used to construct attacks. It comes in two variants: Streamlined NTRU Prime and NTRU LPRime. The first one is extensively described in [5]. Besides the use of different rings, it also eliminates the possibility of decryption failures (by setting a lower bound on the value of the parameter q) and introduces a rounding mechanism which simplifies protection against chosen-ciphertext attacks. However, using Fourier transform methods to perform polynomial multiplications in this ring is more intricate. It is also not clear to what extent the change of ring precisely affects security beyond the speculation (supported by past examples) that reducing the number of endomorphisms may prevent yet-to-be-discovered attacks.

In general, if G is a subring of a field K and P is a polynomial of degree N with N distinct roots in K, then multiplication in the ring

$R = \frac{G [X]}{〈 P (X) 〉}$

is equivalent to component-wise multiplication of vectors after the change of basis given by the Vandermonde matrix of the roots of P. Indeed, if a and b are two elements of R, A and B are the vectors of their coefficients, c=ab, C is the vector of coefficients of c, W is the Vandermonde matrix of the roots of P, and × denotes component-wise multiplication, then WC=(WA)×(WB). (This works because c(x)=a(x)b(x) whenever x is a root of P.) If P(X)=X^N−1, the change of basis W is a discrete Fourier transform.
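The identity WC=(WA)×(WB) can be checked numerically. The following is a minimal sketch, assuming NumPy is available; the helper names `multiply_mod` and `vandermonde` and the example polynomial X⁵−X−1 are illustrative choices, not part of the described system.

```python
import numpy as np

def multiply_mod(a, b, P):
    """Product of a and b modulo the monic integer polynomial P.

    All coefficient arrays are ordered from the lowest degree to the highest.
    """
    N = len(P) - 1
    prod = np.convolve(a, b).astype(np.int64)
    # Cancel the terms of degree >= N, highest first; P is monic, so
    # subtracting prod[d] * X^(d-N) * P removes the term of degree d.
    for d in range(len(prod) - 1, N - 1, -1):
        prod[d - N:d + 1] -= prod[d] * P
    return prod[:N]

def vandermonde(P):
    """Vandermonde matrix W with W[i, j] = (i-th root of P) ** j."""
    roots = np.roots(P[::-1])  # np.roots expects the highest degree first
    return np.vander(roots, len(P) - 1, increasing=True)

# Illustrative example: P(X) = X^5 - X - 1, which has 5 distinct complex roots.
P = np.array([-1, -1, 0, 0, 0, 1])
A = np.array([1, 0, -1, 1, 0])
B = np.array([0, 1, 1, -1, 1])
C = multiply_mod(A, B, P)
W = vandermonde(P)
identity_holds = np.allclose(W @ C, (W @ A) * (W @ B))
```

With P(X)=X^N−1 the rows of W are, up to ordering, those of the discrete Fourier transform matrix, so the same check reduces to the usual convolution theorem.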

3 NTRU2D
3.1 Description
3.1.1 The NTRU2D Cryptosystem

A bi-variate version of the algorithm described in section 2.1 is described, replacing the ring of polynomials ℤ[X]/⟨X^N−1⟩ by ℤ[X, Y]/⟨X^N₁−1, Y^N₂−1⟩, where (N₁, N₂)∈(ℕ*)².

Parameters

- two prime numbers N₁ and N₂,
- two coprime positive integers p and q such that q>p,
- three subsets ℒ_f, ℒ_g, and ℒ_r of the ring R′≡ℤ[X, Y]/⟨X^N₁−1, Y^N₂−1⟩.

Private key: A multinomial f∈ℒ_f which has an inverse in R′ modulo p, hereafter called f_p, and an inverse in R′ modulo q, hereafter called f_q.

Public key: Choose a multinomial g in ℒ_g. The public key is the multinomial h given by

h=(pf_q·g) mod q.

Encryption

- 1. Represent the message as a multinomial m in R′ with coefficients between 1−⌈p/2⌉ and ⌊p/2⌋.
- 2. Choose randomly a multinomial r in ℒ_r.
- 3. Compute the encrypted message e given by e=(r·h+m) mod q.
- 4. Discard the multinomial r.

It is important that r never be revealed.

Decryption: For the decryption, all the modulo operations are centred: for any integer n, a quantity modulo n is taken between 1−⌈n/2⌉ and ⌊n/2⌋. The decryption procedure is:

- 1. Compute a=(f·e) mod q. Using the definitions of e and h, we have: a=(p r·g+f·m) mod q. If q is sufficiently large and the multinomials involved are sufficiently small, we then have a=p r·g+f·m. In the following, we assume this is true.
- 2. Compute b≡a mod p. This gives b≡(f·m) mod p.
- 3. Compute c≡(f_p·b) mod p. It should be equal to m.
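The key generation, encryption, and decryption steps above can be sketched end to end on a toy instance. The following is only an illustration under stated assumptions: the parameters N₁=N₂=5, p=3, q=257 are far too small to be secure, q is taken prime (unlike q=1024 used elsewhere in this document) so that inverses can be computed by plain Gaussian elimination over a field, and the helper names `cyc_mul`, `centre`, `ring_inverse`, and `ternary` are hypothetical.

```python
import numpy as np

def cyc_mul(a, b, mod):
    """Product in Z[X,Y]/<X^N1-1, Y^N2-1>, reduced modulo `mod`."""
    c = np.zeros_like(b)
    for i in range(a.shape[0]):
        for j in range(a.shape[1]):
            if a[i, j]:
                # Multiplying by X^i Y^j shifts the coefficients cyclically.
                c = c + a[i, j] * np.roll(b, (i, j), axis=(0, 1))
    return c % mod

def centre(x, n):
    """Representative of x modulo n between 1-ceil(n/2) and floor(n/2)."""
    half = (n - 1) // 2
    return (x + half) % n - half

def ring_inverse(f, prime):
    """Inverse of f modulo a prime, or None (linear system over GF(prime))."""
    n1, n2 = f.shape
    n = n1 * n2
    M = np.zeros((n, n + 1), dtype=np.int64)
    for k in range(n):
        e = np.zeros(n, dtype=np.int64)
        e[k] = 1
        # Column k holds the coefficients of f times the k-th monomial.
        M[:, k] = cyc_mul(f, e.reshape(n1, n2), prime).ravel()
    M[0, n] = 1  # right-hand side: the constant multinomial 1
    for col in range(n):
        pivots = [r for r in range(col, n) if M[r, col] % prime]
        if not pivots:
            return None
        M[[col, pivots[0]]] = M[[pivots[0], col]]
        M[col] = M[col] * pow(int(M[col, col]), -1, prime) % prime
        for r in range(n):
            if r != col and M[r, col]:
                factor = M[r, col]
                M[r] = (M[r] - factor * M[col]) % prime
    return M[:, n].reshape(n1, n2)

def ternary(shape, n_nonzero, rng):
    """Random array with n_nonzero entries in {-1, +1} and zeros elsewhere."""
    flat = np.zeros(shape[0] * shape[1], dtype=np.int64)
    idx = rng.choice(flat.size, n_nonzero, replace=False)
    flat[idx] = rng.choice([-1, 1], n_nonzero)
    return flat.reshape(shape)

rng = np.random.default_rng(0)
N1, N2, p, q = 5, 5, 3, 257  # toy values, assumed for illustration only
while True:  # key generation: retry until f is invertible modulo p and q
    f = ternary((N1, N2), 7, rng)
    f_p, f_q = ring_inverse(f, p), ring_inverse(f, q)
    if f_p is not None and f_q is not None:
        break
g = ternary((N1, N2), 6, rng)
h = (p * cyc_mul(f_q, g, q)) % q                 # public key
m = rng.integers(-1, 2, size=(N1, N2))           # message, coefficients in {-1,0,1}
r = ternary((N1, N2), 6, rng)
e = (cyc_mul(r, h, q) + m) % q                   # encryption
a = centre(cyc_mul(f, e, q), q)                  # decryption step 1
b = centre(a, p)                                 # decryption step 2
recovered = centre(cyc_mul(f_p, b, p), p)        # decryption step 3
```

With these weights, every coefficient of p r·g+f·m has absolute value at most 3·6+7=25, well below q/2, so decryption cannot fail in this toy setting.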

Typically, the three sets ℒ_f, ℒ_g, and ℒ_r contain multinomials with small L² norms. As shown in appendix A.4.3, finding the private key from the publicly-known parameters and public key is then as hard as solving a short vector problem.

NB: As for NTRUEncrypt, the message can, and probably should in real-world applications, be enclosed in a digital envelope before the step 3 of encryption to increase the level of security.

NB2: This cryptosystem can be generalized to a higher number of variables.

The encryption and decryption steps both rely on multinomial multiplication, which can be accelerated by making use of an optical computing device. A possible implementation using two devices, an electronic one and an optical one, is proposed here. It is illustrated in FIG. 6.

- The two multinomials to be multiplied, m₁ and m₂, are both cast as two-dimensional arrays by the electronic device. The arrays are sent to the optical device.
- The optical device performs the discrete Fourier transform of each array.
- The results can either be multiplied on the optical device if it supports this operation, or sent to the electronic one, multiplied, and the result sent back to the optical device.
- The optical device performs the inverse Fourier transform of the result.
- The result is sent to the electronic device where it is cast as a multinomial.
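The five steps above can be emulated in software. In the following minimal sketch, NumPy's `fft2` and `ifft2` stand in for the optical device (an assumption made purely for illustration), and `multiply_direct` is a hypothetical reference implementation used to check the result.

```python
import numpy as np

def multiply_fourier(m1, m2):
    """Multiply two multinomials given as 2-D integer arrays of coefficients."""
    # Steps 2 and 3: Fourier transform of each array, component-wise product.
    spectrum = np.fft.fft2(m1) * np.fft.fft2(m2)
    # Step 4: inverse transform; rounding removes floating-point noise.
    return np.rint(np.fft.ifft2(spectrum).real).astype(np.int64)

def multiply_direct(m1, m2):
    """Reference product in Z[X,Y]/<X^N1-1, Y^N2-1> by explicit cyclic shifts."""
    out = np.zeros(m1.shape, dtype=np.int64)
    for i in range(m1.shape[0]):
        for j in range(m1.shape[1]):
            out += m1[i, j] * np.roll(m2, (i, j), axis=(0, 1))
    return out

rng = np.random.default_rng(1)
m1 = rng.integers(-1, 2, size=(5, 7))
m2 = rng.integers(0, 64, size=(5, 7))
product = multiply_fourier(m1, m2)
```

Reduction modulo p or q, when required, would be applied by the electronic device to the rounded output.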

This procedure can be used to compute the products at step 4 in FIG. 2 and steps 2 and 4 in FIG. 3.

In practice, the first and last steps may not be required if the input or desired output are two-dimensional arrays rather than multinomials. For instance, the full NTRU2D workflow (including the key generation, encryption, and decryption) may be performed using two-dimensional arrays in place of multinomials, with each coefficient of the multinomials being identified with the corresponding coefficient in one of the arrays.

The inverse Fourier transform of the fourth step may also be replaced by a direct Fourier transform. A rescaling of the output is then required, which may be performed either immediately or at a later stage.
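As an illustration of this replacement, the sketch below recovers an array from its spectrum using only the direct transform. It assumes NumPy's `fft2` as a stand-in for the device. Besides the rescaling by 1/(N₁N₂) mentioned above, a direct transform applied to a spectrum returns the coefficients with their indices negated modulo N₁ and N₂, so this sketch also undoes that reordering; the function name is hypothetical.

```python
import numpy as np

def inverse_via_forward(spectrum):
    """Inverse 2-D DFT computed with the forward transform only."""
    # Forward transform followed by the rescaling by 1/(N1*N2).
    y = np.fft.fft2(spectrum) / spectrum.size
    # y[i, j] holds the coefficient of index (-i mod N1, -j mod N2):
    # reverse both axes and shift by one to restore the original order.
    return np.roll(y[::-1, ::-1], (1, 1), axis=(0, 1))

rng = np.random.default_rng(2)
x = rng.integers(0, 16, size=(4, 6)).astype(float)
x_back = inverse_via_forward(np.fft.fft2(x))
```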

One possible implementation will now be described in more detail. The data could be sent to the optical device via an optical fibre link, hereafter called the input link. For instance, it could be encoded in the intensity of monochromatic coherent light emitted by a laser upstream and modulated by a series of heaters or Mach-Zehnder interferometers. Said light would be collimated, e.g. by passing through a series of lenses, before passing through a single lens placed one focal distance away from the collimation plane. It could then be focused into another optical fibre link, hereafter called the output link, by another array of lenses. The signal could then either be converted to an electronic signal by photodiodes at the end of the output link and sent to the electronic device, or sent to the input link of the same or another optical device for further optical processing.

In one embodiment, the optical device would compute only the absolute value of the Fourier transform of the signal. The procedure should then be repeated twice, with two different constants added to the input signal, to recover the full Fourier transform.

In another embodiment, the optical device would compute the full Fourier transform of the signal.

In either of said embodiments, the procedure could be repeated several times to increase the accuracy of the output. The input data would first be split into several datasets with a smaller magnitude by the electronic device. Each dataset would be processed separately by the optical device and sent to the same or another electronic device. Said electronic device would then combine the outputs.
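One way to read the splitting step is as a decomposition of each input value into groups of bits, which works because the Fourier transform is linear. The sketch below follows that reading, which is an assumption rather than the procedure of the appendices; `np.fft.fft2` again stands in for the optical device and the function names are hypothetical.

```python
import numpy as np

def split_low_magnitude(data, bits):
    """Split non-negative integers into chunks of `bits` bits, lowest first."""
    chunks = []
    rest = np.array(data, dtype=np.int64)
    mask = (1 << bits) - 1
    while True:
        chunks.append(rest & mask)   # every entry is now below 2**bits
        rest = rest >> bits
        if not rest.any():
            return chunks

def transform_in_pieces(data, bits, transform=np.fft.fft2):
    """Transform each chunk separately and recombine the outputs by linearity."""
    chunks = split_low_magnitude(data, bits)
    return sum(transform(c) * (1 << (bits * k)) for k, c in enumerate(chunks))

rng = np.random.default_rng(3)
data = rng.integers(0, 1024, size=(4, 4))
combined = transform_in_pieces(data, bits=4)
```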

Said optical processing could be performed using the technology patented by Optalysys Ltd.

3.1.2 Probability of Decryption Failure

The decryption process may fail if the polynomial a has at least one coefficient smaller than 1−⌈q/2⌉ or larger than ⌊q/2⌋. Let us estimate the probability of this event. To make things simple, here we assume that

- the polynomials r, g, and f have coefficients in {−1, 0, +1},
- they have at most, respectively, N_r, N_g, and N_f nonvanishing coefficients.

Call N the product N₁N₂.

First, notice that decryption will always succeed if p min(N_r, N_g)+⌊p/2⌋N_f≤⌈q/2⌉−1. To estimate the probability of decryption failure when this condition is not satisfied, assume, in one embodiment, that the probability distribution for the values of each coefficient in each of the polynomials f, g, and r is independent of its position. The probability that at least one coefficient of a is too large is then bounded (from above) by N multiplied by the probability that its first coefficient is too large.
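The sufficient condition can be evaluated directly. The sketch below uses hypothetical function names and, as an example, the weights quoted in section 3.2 (d_f=149, d_g=d_r=148, hence N_r=N_g=296 and N_f=297 with p=3); it ignores the separate requirement that p and q be coprime.

```python
def decryption_always_succeeds(p, q, N_r, N_g, N_f):
    """Check p*min(N_r, N_g) + floor(p/2)*N_f <= ceil(q/2) - 1."""
    ceil_half_q = -(-q // 2)
    return p * min(N_r, N_g) + (p // 2) * N_f <= ceil_half_q - 1

def smallest_safe_q(p, N_r, N_g, N_f):
    """Smallest q satisfying the condition above."""
    bound = p * min(N_r, N_g) + (p // 2) * N_f
    # ceil(q/2) - 1 >= bound is equivalent to q >= 2*bound + 1.
    return 2 * bound + 1

q_min = smallest_safe_q(3, 296, 296, 297)  # 2371
```

For q=1024 and the same weights the condition is not satisfied, which is why a probabilistic estimate is needed.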

An embodiment first looks at the distribution of values for the product r·g. Call N_r⁽⁺⁾ the number of positive coefficients of r, N_r⁽⁻⁾ the number of negative coefficients, and similarly for g with the letter r replaced by g. Optionally, assume these four numbers are fixed. The probability that a number n₊₊ of the +1s in r coincide with +1s in g, a number n₊₋ of them coincide with −1s in g, and similarly for the −1s in r with the first index of n replaced by −, is, for n₊₊+n₊₋≤N_r⁽⁺⁾ and n₋₊+n₋₋≤N_r⁽⁻⁾:

$\frac{(\begin{matrix} N_{g}^{(+)} \\ n_{++} \end{matrix}) (\begin{matrix} N_{g}^{(-)} \\ n_{+ -} \end{matrix}) (\begin{matrix} N_{g}^{(+)} - n_{++} \\ n_{- +} \end{matrix}) (\begin{matrix} N_{g}^{(-)} - n_{+ -} \\ n_{--} \end{matrix}) (\begin{matrix} N - N_{g}^{(+)} - N_{g}^{(-)} \\ N_{r}^{(+)} - n_{++} - n_{+ -} \end{matrix}) (\begin{matrix} N - N_{g}^{(+)} - N_{g}^{(-)} - N_{r}^{(+)} + n_{++} + n_{+ -} \\ N_{r}^{(-)} - n_{- +} - n_{--} \end{matrix})}{(\begin{matrix} N \\ N_{r}^{(+)} + N_{r}^{(-)} \end{matrix}) (\begin{matrix} N_{r}^{(+)} + N_{r}^{(-)} \\ N_{r}^{(+)} \end{matrix})},$

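which may be rewritten as:

$\frac{N_{g}^{(+)}! N_{g}^{(-)}! N_{r}^{(+)}! N_{r}^{(-)}! (N - N_{g}^{(+)} - N_{g}^{(-)})! (N - N_{r}^{(+)} - N_{r}^{(-)})!}{N! n_{++}! n_{+ -}! n_{- +}! n_{--}! (N_{g}^{(+)} - n_{++} - n_{- +})! (N_{g}^{(-)} - n_{+ -} - n_{--})! (N_{r}^{(+)} - n_{++} - n_{+ -})! (N_{r}^{(-)} - n_{- +} - n_{--})! (N - N_{g}^{(+)} - N_{g}^{(-)} - N_{r}^{(+)} - N_{r}^{(-)} + n_{++} + n_{+ -} + n_{- +} + n_{--})!} . \quad (1)$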

For each n∈ℤ, the probability that the first coefficient of r·g be equal to n is the sum of this expression over all non-negative integer values of n₊₊, n₊₋, n₋₊, and n₋₋ such that n₊₊+n₊₋≤N_r⁽⁺⁾, n₋₊+n₋₋≤N_r⁽⁻⁾, and n₊₊−n₊₋−n₋₊+n₋₋=n.
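For small parameters this sum can be evaluated exactly. The following sketch, with hypothetical function names and illustrative values N=9, N_g⁽⁺⁾=N_g⁽⁻⁾=2, N_r⁽⁺⁾=2, N_r⁽⁻⁾=1, uses exact rational arithmetic and can be used to check that the probabilities add up to 1.

```python
from fractions import Fraction
from math import comb

def C(n, k):
    """Binomial coefficient, taken to be 0 outside the range 0 <= k <= n."""
    return comb(n, k) if 0 <= k <= n else 0

def overlap_probability(N, Ng_p, Ng_m, Nr_p, Nr_m, npp, npm, nmp, nmm):
    """Probability of the overlap counts (n++, n+-, n-+, n--), binomial form."""
    num = (C(Ng_p, npp) * C(Ng_m, npm)
           * C(Ng_p - npp, nmp) * C(Ng_m - npm, nmm)
           * C(N - Ng_p - Ng_m, Nr_p - npp - npm)
           * C(N - Ng_p - Ng_m - Nr_p + npp + npm, Nr_m - nmp - nmm))
    den = comb(N, Nr_p + Nr_m) * comb(Nr_p + Nr_m, Nr_p)
    return Fraction(num, den)

def first_coefficient_distribution(N, Ng_p, Ng_m, Nr_p, Nr_m):
    """Map each value n of the first coefficient of r.g to its probability."""
    dist = {}
    for npp in range(Nr_p + 1):
        for npm in range(Nr_p - npp + 1):
            for nmp in range(Nr_m + 1):
                for nmm in range(Nr_m - nmp + 1):
                    pr = overlap_probability(N, Ng_p, Ng_m, Nr_p, Nr_m,
                                             npp, npm, nmp, nmm)
                    if pr:
                        n = npp - npm - nmp + nmm
                        dist[n] = dist.get(n, 0) + pr
    return dist

dist = first_coefficient_distribution(9, 2, 2, 2, 1)
```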

The expression (1), denoted by P_rg below, can be simplified using the Stirling formula in the limit N→∞. Denote with the Greek letter λ the ratio of each quantity (denoted by a letter N or n with subscripts and possibly a superscript) over N, and assume these ratios are fixed as N increases. It follows:

$\log P_{rg} \underset{N \to \infty}{=} N [p \log λ_{g}^{(+)} + p \log λ_{g}^{(-)} + p \log λ_{r}^{(+)} + p \log λ_{r}^{(-)} + p \log (1 - λ_{g}^{(+)} - λ_{g}^{(-)}) + p \log (1 - λ_{r}^{(+)} - λ_{r}^{(-)}) - p \log λ_{++} - p \log λ_{+ -} - p \log λ_{- +} - p \log λ_{--} - p \log (λ_{g}^{(+)} - λ_{++} - λ_{- +}) - p \log (λ_{g}^{(-)} - λ_{+ -} - λ_{--}) - p \log (λ_{r}^{(+)} - λ_{++} - λ_{+ -}) - p \log (λ_{r}^{(-)} - λ_{- +} - λ_{--}) - p \log (1 - λ_{g}^{(+)} - λ_{g}^{(-)} - λ_{r}^{(+)} - λ_{r}^{(-)} + λ_{++} + λ_{+ -} + λ_{- +} + λ_{--})] - 2 \log N + O (1) .$

where the function plog is defined by:

$p \log : (\begin{matrix} ℝ_{+}^{*} \to ℝ \\ x \mapsto x \log x \end{matrix}) .$

Typically, the quantity P_rg thus decreases exponentially with N. It is thus expected that large deviations from typical values in the coefficients of r·g will be (up to polynomial prefactors) exponentially unlikely as N becomes large.

To get a feel for how small this probability typically is, consider the case λ_g⁽⁺⁾=λ_g⁽⁻⁾=λ_r⁽⁺⁾=λ_r⁽⁻⁾=¼ and λ₊₊=λ₊₋=λ₋₊=λ₋₋= 1/16, and work with base-2 logarithms. The following arises:

$\begin{matrix} \log 𝒫_{rg} \underset{N \to \infty}{=} N [4 p \log (\frac{1}{4}) + 2 p \log (\frac{1}{2}) - 4 p \log (\frac{1}{16}) - 4 p \log (\frac{1}{8}) - p \log (\frac{1}{4})] - 2 \log N + O (1), \\ \log 𝒫_{rg} \underset{N \to \infty}{=} N [- 2 - 1 + 1 + \frac{3}{2} + \frac{1}{2}] - 2 \log N + O (1) = - 2 \log N + O (1) . \end{matrix}$

For this particular set of values, the term linear in N vanishes and P_rg scales like N⁻². However, for these values the first coefficient of r·g is 0, so the relatively high probability is not a problem. In one embodiment, now consider the case λ₊₊=λ₊₋=⅛, λ₋₊=λ₋₋=0. It follows:

$\log 𝒫_{rg} \underset{N \to \infty}{=} N [4 p \log (\frac{1}{4}) + 2 p \log (\frac{1}{2}) - 4 p \log (\frac{1}{8}) - 2 p \log (\frac{1}{4})] - 2 \log N + O (1),$

$\log 𝒫_{rg} \underset{N \to \infty}{=} N [- 2 - 1 + \frac{3}{2} + 1] - 2 \log N + O (1) = - \frac{N}{2} - 2 \log N + O (1) .$

The probability of this configuration thus scales (up to polynomial factors) like 2^(−N/2).
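The coefficient of N in the two worked cases can be reproduced numerically. The sketch below evaluates the bracket of the Stirling estimate with base-2 logarithms, using the convention plog(0)=0; the function names are hypothetical, and the second case is taken with two of the four overlap fractions equal to ⅛ and the other two equal to 0, which is the reading consistent with the displayed result.

```python
from math import log2

def plog(x):
    """x * log2(x), extended by continuity with plog(0) = 0."""
    return x * log2(x) if x > 0 else 0.0

def exponent_rate(lg_p, lg_m, lr_p, lr_m, l_pp, l_pm, l_mp, l_mm):
    """Coefficient of N in log2(P_rg), from the Stirling estimate."""
    gains = [lg_p, lg_m, lr_p, lr_m, 1 - lg_p - lg_m, 1 - lr_p - lr_m]
    losses = [l_pp, l_pm, l_mp, l_mm,
              lg_p - l_pp - l_mp, lg_m - l_pm - l_mm,
              lr_p - l_pp - l_pm, lr_m - l_mp - l_mm,
              1 - lg_p - lg_m - lr_p - lr_m + l_pp + l_pm + l_mp + l_mm]
    return sum(map(plog, gains)) - sum(map(plog, losses))

quarter = (0.25, 0.25, 0.25, 0.25)
rate_typical = exponent_rate(*quarter, 1/16, 1/16, 1/16, 1/16)  # first case
rate_atypical = exponent_rate(*quarter, 1/8, 1/8, 0.0, 0.0)     # second case
```

The first rate vanishes and the second equals −½, in agreement with the scalings N⁻² and 2^(−N/2) quoted above.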

Now consider the polynomial f·m. For definiteness, assume p=3 and that each coefficient of m is a random variable chosen uniformly and independently in {−1, 0, +1}. Then, the first coefficient of the product is the sum of N_f independent, identically distributed random variables with a vanishing mean and a variance equal to ⅔. According to the central limit theorem, in the limit N_f→∞ its distribution becomes close to a Gaussian centred on 0 with variance 2N_f/3. Denoting by P_mf this probability distribution, it follows for each n∈ℤ:

$𝒫_{mf} (n) \underset{N_{f} \to \infty}{\sim} \sqrt{\frac{3}{4 π N_{f}}} \exp (- \frac{3 n^{2}}{4 N_{f}}) .$

The probability that n differs from 0 by λN_f, where λ is a real number whose absolute value is noticeably smaller than 1 is thus:

$𝒫_{mf} (λ N_{f}) \underset{N_{f} \to \infty}{\sim} \sqrt{\frac{3}{4 π N_{f}}} \exp (- \frac{3 N_{f}}{4} λ^{2}) .$

For λ₀>0, and up to a polynomial factor, the probability that |(m·r)(0)|>λ₀N goes exponentially to 0 when N→∞.

From these results, and if it is assumed that q scales at least linearly with N, the probability of decryption failure should decrease exponentially in N.

3.2 Choice of Parameters and Security

An embodiment briefly envisages a possible choice of parameters. It is only given for illustration purposes: more research is required to say whether or not they are secure against combinations of lattice-reduction and meet-in-the-middle attacks.

One embodiment is configured to aim for parameters close to those recommended for NTRUEncrypt. The same values for q and p may be chosen: q=1024 and p=3. Optionally, choose N₁ and N₂ close to 20, e.g., N₁=N₂=23. Optionally, choose the polynomials f, g, and r to have respectively d_f=149, d_g=148, and d_r=148 coefficients equal to 1. The polynomials g and r are chosen to have as many coefficients equal to −1 as they have coefficients equal to 1, while f has one fewer of them, and its first coefficient is fixed to be 1. Call N the product N₁N₂.

The number of bits of security s_g of g against meet-in-the-middle attacks is:

$s_{g} = \frac{1}{2} \log_{2} [(\begin{matrix} N \\ 2 d_{g} \end{matrix}) (\begin{matrix} 2 d_{g} \\ d_{g} \end{matrix})] = \frac{1}{2} \log_{2} [\frac{N!}{(N - 2 d_{g})! {(d_{g}!)}^{2}}] \approx 410.$

The number of bits of security s_f of f is:

$s_{f} = \frac{1}{2} \log_{2} [(\begin{matrix} N - 1 \\ 2 (d_{f} - 1) \end{matrix}) (\begin{matrix} 2 (d_{f} - 1) \\ d_{f} - 1 \end{matrix})] = \frac{1}{2} \log_{2} [\frac{(N - 1)!}{(N - 2 d_{f} + 1)! {((d_{f} - 1)!)}^{2}}] \approx 405.$

Finally, the number of bits of security s_r of r is:

$s_{r} = \frac{1}{2} \log_{2} [(\begin{matrix} N \\ 2 d_{r} \end{matrix}) (\begin{matrix} 2 d_{r} \\ d_{r} \end{matrix})] = \frac{1}{2} \log_{2} [\frac{N!}{(N - 2 d_{r})! {(d_{r}!)}^{2}}] \approx 410.$

Notice that decryption failures could be eliminated by choosing q such that q≥4p min(d_r,d_g)+4d_f−1=2371. We now estimate the probability of decryption failure for q=1024. To make the analysis simpler, assume that each coefficient of g is chosen randomly in {−1, 0, +1}. This should provide an overestimate of the result, as a polynomial thus chosen will generally have fewer vanishing coefficients than actual possible choices for g. The probability that the first coefficient of r·g takes a value n₁∈ℤ and that of f·m a value n₂∈ℤ is then, assuming N_f and N_r can be considered large, close to

$\frac{3}{4 π \sqrt{N_{f} N_{r}}} \exp [- \frac{3}{4} (\frac{n_{1}^{2}}{N_{r}} + \frac{n_{2}^{2}}{N_{f}})] .$

The probability P_e that the first coefficient of a be larger than ⌈q/2⌉−1 in absolute value is thus of the order of

$𝒫_{e} \approx \frac{3}{2 π \sqrt{N_{f} N_{r}}} \sum_{n = ⌈ q / 2 ⌉}^{\infty} \sum_{n_{1}} \exp [- \frac{3}{4} (\frac{n_{1}^{2}}{N_{r}} + \frac{{(n - {pn}_{1})}^{2}}{N_{f}})] .$

(The factor 2 in the coefficient of the sum accounts for positive and negative values.) Letting n₁ take real values for a moment, the argument of the exponential is maximized when n₁ takes a value such that n₁/N_r=p(n−pn₁)/N_f, i.e., n₁=pn/(p²+N_f/N_r). The argument of the exponential is then

$- \frac{3 n^{2}}{4 (p^{2} N_{r} + N_{f})} .$

It is maximized for n taking its smallest possible value. Assuming q can be considered large, we thus have:

$\log_{2} 𝒫_{e} \approx - \frac{3 q^{2} / (\ln 2)}{16 (p^{2} N_{r} + N_{f})} \approx - 95.8,$

where ln denotes the natural logarithm. The quantity P_e is thus, assuming the approximations made are not too bad, smaller than 2⁻⁹⁰. The probability that one coefficient of p r·g+f·m is larger than ⌈q/2⌉−1 in absolute value is smaller than N P_e≈1×10⁻²⁶. We thus expect it to be negligible too. If necessary, the probability of decryption error can be further decreased by increasing q.
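The leading-order estimate of log₂ P_e is easy to reproduce. The sketch below uses a hypothetical function name and the values of this section, namely p=3, q=1024, N_r=2d_r=296, and N_f=2d_f−1=297; it evaluates only the exponent and ignores the prefactor and the sums.

```python
from math import log

def log2_failure_estimate(p, q, N_r, N_f):
    """Leading-order estimate -3 q^2 / (16 (p^2 N_r + N_f) ln 2) of log2(P_e)."""
    return -3 * q**2 / (16 * (p**2 * N_r + N_f)) / log(2)

estimate = log2_failure_estimate(p=3, q=1024, N_r=296, N_f=297)  # close to -95.8
```

Because the estimate is quadratic in q, doubling q multiplies the exponent by four, which illustrates how quickly the failure probability drops when q is increased.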

An algorithm to generate parameters for NTRUEncrypt is given in reference [16]. In certain embodiments, it is applicable, with minor modifications, to NTRU2D.

3.2.1 Runtime on a Low-Accuracy Device

In one embodiment, consider two multinomials a and b with at most N terms having, respectively, at most N_a and N_b nonvanishing coefficients, with absolute values bounded by a_M>0 and b_M>0. The maximum possible absolute value of the coefficients of a·b is min(N_a, N_b) a_M b_M.

Assume these multinomials have non-negative coefficients. Optionally, perform the multiplication using a device with a number l_dof bits of accuracy, and which can deal with non-negative integers only.

Assume that the procedure described in appendix B.1.1 is performed n_a times and that described in appendix B.1.2 n_d times. The multiplication of a and b can then be recast as the sum of products of multinomials where each term has coefficients with absolute value smaller than or equal to

$⌈ \frac{N}{2^{n_{d}}} ⌉ 2^{⌈ \log_{2} (a_{M} b_{M}) / 2^{n_{a}} ⌉} .$

Each of them can thus be computed by the device if and only if

$\log_{2} ⌈ \frac{N}{2^{n_{d}}} ⌉ + ⌈ \frac{\log_{2} (a_{M} b_{M})}{2^{n_{a}}} ⌉ \leq l_{d} .$

In the case of NTRU2D, take N=N₁N₂ and a_M=b_M=q−1 for each multinomial product.

The above condition becomes:

$\log_{2} ⌈ \frac{N_{1} N_{2}}{2^{n_{d}}} ⌉ + ⌈ \frac{\log_{2} (q - 1)}{2^{n_{a}}} ⌉ \leq l_{d} .$

Taking N₁=N₂=23 and q=1024, choosing n_a=3 and n_d=5, the left-hand side is smaller than 8. All multinomial multiplications should thus be doable on a device with 8 bits of accuracy in 15552 frames. Choosing N₁=N₂=15 and q=258, a value smaller than 8 can be achieved by choosing n_a=n_d=3, so that each term requires only 1728 frames. Assuming the device can run in the GHz range, we thus expect a throughput of the order of a million multiplications per second for these parameters. More results are given in the following table.

| N₁ | N₂ | p | q | Number of frames on an 8-bit system |
|----|----|---|------|-------|
| 23 | 23 | 3 | 1024 | 15552 |
| 23 | 23 | 3 | 258 | 15552 |
| 15 | 15 | 3 | 258 | 1728 |
| 5 | 5 | 3 | 124 | 48 |
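The left-hand side of the NTRU2D accuracy condition can be evaluated for the parameter sets discussed in the text. The sketch below, with a hypothetical function name, implements the condition as displayed, with N=N₁N₂; it does not attempt to reproduce the frame counts, whose derivation relies on the procedures of appendix B.

```python
from math import ceil, log2

def bits_required(N, q, n_a, n_d):
    """Left-hand side of the accuracy condition for NTRU2D."""
    return log2(ceil(N / 2**n_d)) + ceil(log2(q - 1) / 2**n_a)

bits_23 = bits_required(23 * 23, 1024, n_a=3, n_d=5)
bits_15 = bits_required(15 * 15, 258, n_a=3, n_d=3)
```

Both values are below 8, whereas the same function with n_a=n_d=0 (no splitting) gives a value well above 8 for N₁=N₂=23 and q=1024.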

These estimates are based on current generic algorithms to compute the Fourier transform on a low-accuracy device. In certain embodiments, they can be significantly improved by making use of more specific algorithms designed for multinomial multiplication modulo two polynomials and an integer.

Appendices
A
A.1 Multinomial Multiplication and Discrete Fourier Transform
A.1.1 Case of Polynomials

Here it is shown how polynomial multiplication in the ring R defined in section 2.1 can be performed in Fourier space. First, choose some notations. As in section 2.1, N is a positive integer and R is the ring ℤ[X]/⟨X^N−1⟩ of polynomials with integer coefficients modulo X^N−1. Let a and b be two elements of R. Optionally call their coefficients, respectively, a₀, a₁, …, a_{N−1} and b₀, b₁, …, b_{N−1}, so that

$\begin{matrix} a = \sum_{j = 0}^{N - 1} a_{j} X^{j}, & b = \sum_{j = 0}^{N - 1} b_{j} X^{j} . \end{matrix}$

Let · denote the product in R. We define c=a·b and call its coefficients c₀, c₁, …, c_{N−1}, so that

$c = \sum_{j = 0}^{N - 1} c_{j} X^{j} .$

Let a, b, and c be the N-dimensional vectors with components, respectively, a_j, b_j, and c_j, for j between 0 and N−1. (For simplicity, in this subsection we take vector indices from 0 to N−1.)

Optionally, denote with a ˜ the discrete Fourier transform: for x∈{a, b, c}, define x̃ as the N-dimensional vector with components

${\tilde{x}}_{u} = \sum_{j = 0}^{N - 1} e^{- 2 i π ju / N} x_{j}, u \in [[0, N - 1]] .$

The inverse Fourier transform is given by

$\forall j \in [[0, N - 1]], x_{j} = \frac{1}{N} \sum_{u = 0}^{N - 1} e^{2 i π ju / N} {\tilde{x}}_{u} .$

For each j between 0 and N−1, the coefficient of order j in the polynomial c is:

$c_{j} = \sum_{k = 0}^{N - 1} a_{k} b_{(j - k) modN} .$

For each u between 0 and N−1, it follows (using the equality e^{2iπ}=1 to get the second line):

$\begin{matrix} {\tilde{c}}_{u} = \sum_{j = 0}^{N - 1} e^{- 2 i π ju / N} \sum_{k = 0}^{N - 1} a_{k} b_{(j - k) modN} \\ = \sum_{j = 0}^{N - 1} \sum_{k = 0}^{N - 1} e^{- 2 i π (k + ((j - k) modN)) u / N} a_{k} b_{(j - k) modN} \\ = \sum_{j = 0}^{N - 1} \sum_{k = 0}^{N - 1} [e^{- 2 i π ku / N} a_{k}] [e^{- 2 i π ((j - k) modN) u / N} b_{(j - k) modN}] \\ = [\sum_{k = 0}^{N - 1} e^{- 2 i π ku / N} a_{k}] [\sum_{l = 0}^{N - 1} e^{- 2 i π lu / N} b_{l}], \end{matrix}$

where to get the last line, the new variable l=(j−k) mod N was defined, which varies between 0 and N−1 when k goes from 0 to N−1 for each j∈ℤ. This expression can be simplified as:

$\tilde{c}_{u} = \tilde{a}_{u} \tilde{b}_{u} .$

Polynomial multiplication in R is thus equivalent to component-wise multiplication in Fourier space.
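This equivalence can be verified numerically from the explicit formula for c_j. The sketch below assumes NumPy, whose `np.fft.fft` uses the same sign convention as the transform defined above; the function name and the sample coefficients are illustrative.

```python
import numpy as np

def cyclic_product(a, b):
    """Coefficients c_j = sum_k a_k b_((j-k) mod N) of the product in R."""
    N = len(a)
    return np.array([sum(a[k] * b[(j - k) % N] for k in range(N))
                     for j in range(N)])

a = np.array([1, -1, 0, 1, 0, 0, -1])
b = np.array([3, 0, 2, -1, 0, 5, 1])
c = cyclic_product(a, b)
# Component-wise product in Fourier space, then inverse transform.
c_fourier = np.rint(np.fft.ifft(np.fft.fft(a) * np.fft.fft(b)).real).astype(int)
```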

A.1.2 Case of Multinomials

Let n∈ℕ* and (N₁, N₂, …, N_n)∈(ℕ*)ⁿ. Let R′ be the ring

$\frac{ℤ [X_{1}, X_{2}, \dots, X_{n}]}{〈 X_{1}^{N_{1}} - 1, X_{2}^{N_{2}} - 1, \dots, X_{n}^{N_{n}} - 1 〉} .$

Let

$ℐ = \prod_{i = 1}^{n} [[1, N_{i}]] .$

Optionally, denote the multiplication in R′ by a dot. Let a and b be two elements of R′, and c≡a·b.

Optionally, call a, b, and c the sequences of their coefficients, so that

$a = \sum_{I \in ℐ} a_{I} \prod_{i = 1}^{n} X_{i}^{I_{i}}$

and similarly for b and c. The latter is given by:

$\forall I \in ℐ, c_{I} = \sum_{J \in ℐ} a_{(I_{1} - J_{1}) \mod N_{1}, (I_{2} - J_{2}) \mod N_{2}, \dots, (I_{n} - J_{n}) \mod N_{n}} b_{J} .$

Optionally, denote with a ˜ the discrete Fourier transform: for x∈{a, b, c}, x̃ is the sequence with the same shape whose elements are given by:

$\forall U \in ℐ, {\tilde{x}}_{U} = \sum_{I \in ℐ} x_{I} \prod_{i = 1}^{n} e^{- 2 i π I_{i} U_{i} / N_{i}} .$

The inverse transform is given by:

$\forall I \in ℐ, x_{I} = \frac{1}{\prod_{i = 1}^{n} N_{i}} \sum_{U \in ℐ} {\tilde{x}}_{U} \prod_{i = 1}^{n} e^{+ 2 i π I_{i} U_{i} / N_{i}} .$

Let U∈ℐ. It follows:

$\begin{matrix} {\tilde{c}}_{U} = \sum_{I \in ℐ} c_{I} \prod_{i = 1}^{n} e^{- 2 i π I_{i} U_{i} / N_{i}} \\ = \sum_{(I, J) \in ℐ^{2}} a_{(I_{1} - J_{1}) \mod N_{1}, (I_{2} - J_{2}) \mod N_{2}, \dots, (I_{n} - J_{n}) \mod N_{n}} b_{J} \prod_{i = 1}^{n} e^{- 2 i π I_{i} U_{i} / N_{i}} \\ = \sum_{(I, J) \in ℐ^{2}} a_{(I_{1} - J_{1}) \mod N_{1}, (I_{2} - J_{2}) \mod N_{2}, \dots, (I_{n} - J_{n}) \mod N_{n}} b_{J} \prod_{i = 1}^{n} e^{- 2 i π ((I_{i} - J_{i}) \mod N_{i}) U_{i} / N_{i}} e^{- 2 i π J_{i} U_{i} / N_{i}} \\ = [\sum_{I \in ℐ} a_{I} \prod_{i = 1}^{n} e^{- 2 i π I_{i} U_{i} / N_{i}}] [\sum_{J \in ℐ} b_{J} \prod_{j = 1}^{n} e^{- 2 i π J_{j} U_{j} / N_{j}}] . \end{matrix}$

This is equivalent to:

$\tilde{c}_{U} = \tilde{a}_{U} \tilde{b}_{U} .$

Multinomial multiplication in R′ is thus equivalent to component-wise multiplication in Fourier space.

A.2 Integer Polynomial Factors of X^N−1

The reason why the parameter N must be prime in NTRU is that the polynomial X^N−1 must be the product of two prime polynomials in ℤ[X] to prevent some lattice-based attacks. These attacks are not described here; we only give a simple argument showing that X^N−1 has more than two factors if N is not prime.

Let a and b be two integers larger than or equal to 2 and let us assume that N=a b. Then,

$\begin{matrix} X^{N} - 1 = {(X^{a})}^{b} - 1 \\ = (X^{a} - 1) (X^{a (b - 1)} + X^{a (b - 2)} + \dots + X^{a (b - b)}) \\ = (X^{a} - 1) (\sum_{i = 0}^{b - 1} X^{a i}) \\ = (X - 1) (X^{a - 1} + X^{a - 2} + \dots + X^{a - a}) (\sum_{i = 0}^{b - 1} X^{a i}) \\ = (X - 1) (\sum_{j = 0}^{a - 1} X^{j}) (\sum_{i = 0}^{b - 1} X^{a i}) . \end{matrix}$

So, if N is not prime, X^N−1 can be expressed as the product of three non-unit polynomials.
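The three-factor decomposition can be checked by multiplying coefficient arrays. The following sketch assumes NumPy; the function names are hypothetical and coefficients are stored from the lowest degree to the highest.

```python
import numpy as np

def x_power_minus_one(n):
    """Coefficients of X^n - 1."""
    c = np.zeros(n + 1, dtype=np.int64)
    c[0], c[n] = -1, 1
    return c

def geometric_sum(step, terms):
    """Coefficients of sum_{i=0}^{terms-1} X^(step*i)."""
    c = np.zeros(step * (terms - 1) + 1, dtype=np.int64)
    c[::step] = 1
    return c

def three_factors(a, b):
    """The factors X - 1, sum_j X^j (j < a), and sum_i X^(a*i) (i < b)."""
    return np.array([-1, 1]), geometric_sum(1, a), geometric_sum(a, b)

f1, f2, f3 = three_factors(2, 3)
product = np.convolve(np.convolve(f1, f2), f3)  # coefficients of X^6 - 1
```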

A.3 Inverse of a Polynomial or Multinomial
A.3.1 Inverse of a Polynomial

Let K be a field. Let P and S be two elements of K[X], i.e., two polynomials over K, with S non-zero. Then, there exists a unique pair of polynomials (R, Q)∈K[X]² such that the degree of R is smaller than that of S and P=QS+R. We say that Q is the quotient and R the remainder of the Euclidean division of P by S. (Q can be constructed monomial by monomial by matching the highest-order monomial in P, then the second-highest, and so on. Doing so, one can match all monomials in P with degrees larger than or equal to that of S. What remains is R.)

Let (P, Q)∈K[X]², where the degrees of P and Q are at least 1 and that of P is smaller than that of Q. In order to see if P is invertible modulo Q, the Euclidean algorithm is applied:

- 1. Define P₀=Q and P₁=P, and set i=1.
- 2. Perform the Euclidean division of P_{i−1} by P_i, call the quotient A_i and the remainder P_{i+1}.
- 3. If the degree of P_{i+1} is larger than or equal to 1, increment i and go back to step 2.
- 4. If P_{i+1}=0, P_i is a common divisor of Q and P. The polynomial P is thus not invertible modulo Q.

If the degree of P_{i+1} is zero and P_{i+1}≠0, we can go back to find an inverse of P modulo Q. Indeed, we have:

$P_{i + 1} = P_{i - 1} - A_{i} P_{i} = P_{i - 1} - A_{i} (P_{i - 2} - A_{i - 1} P_{i - 1}) = (1 + A_{i} A_{i - 1}) P_{i - 1} - A_{i} P_{i - 2} = \dots$

Going backwards, we obtain a series of expressions of the form

$P_{i + 1} = L_{j} P_{i - 1 - j} + R_{j} P_{i - j}$

for j=0, 1, …, i−1, where the sequences (L_j) and (R_j) are defined by L₀=1, R₀=−A_i, and, for each j between 0 and i−1, L_{j+1}=R_j and R_{j+1}=L_j−A_{i−j−1}R_j. Taking j=i−1 gives:

$P_{i + 1} = L_{i - 1} Q + R_{i - 1} P .$

Since P_{i+1} is a non-vanishing polynomial of degree 0, it is invertible (it is, up to the identification of the unit polynomial with the unit of K, a nonvanishing element of K). We can thus write:

$\frac{L_{i - 1}}{P_{i + 1}} Q + \frac{R_{i - 1}}{P_{i + 1}} P = 1,$

which gives

$\frac{R_{i - 1}}{P_{i + 1}} P = 1 \mod Q .$

The polynomial R_{i−1}/P_{i+1} is thus the inverse of P in K[X]/⟨Q⟩.
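The procedure can be sketched for K=GF(p) with p prime. The sketch below is an illustration with hypothetical function names; instead of going backwards through the quotients A_i, it carries the coefficient of P forward at each division step, which is the usual extended Euclidean formulation and yields the same inverse. Polynomials are lists of integers ordered from the lowest degree to the highest, and the modular inverse relies on Python 3.8 or later.

```python
def trim(a):
    """Remove vanishing leading coefficients (in place) and return the list."""
    while a and a[-1] == 0:
        a.pop()
    return a

def poly_divmod(a, b, p):
    """Quotient and remainder of the Euclidean division of a by b over GF(p)."""
    a, b = trim([x % p for x in a]), trim([x % p for x in b])
    inv_lead = pow(b[-1], -1, p)
    quot = [0] * max(len(a) - len(b) + 1, 0)
    while len(a) >= len(b):
        shift, coef = len(a) - len(b), a[-1] * inv_lead % p
        quot[shift] = coef
        for k, bk in enumerate(b):
            a[shift + k] = (a[shift + k] - coef * bk) % p
        trim(a)
    return quot, a

def poly_mul(a, b, p):
    if not a or not b:
        return []
    out = [0] * (len(a) + len(b) - 1)
    for i, ai in enumerate(a):
        for j, bj in enumerate(b):
            out[i + j] = (out[i + j] + ai * bj) % p
    return trim(out)

def poly_sub(a, b, p):
    n = max(len(a), len(b))
    a, b = a + [0] * (n - len(a)), b + [0] * (n - len(b))
    return trim([(x - y) % p for x, y in zip(a, b)])

def poly_inverse(P, Q, p):
    """Inverse of P modulo Q over GF(p), or None if P is not invertible."""
    # Invariant: r0 = s0 * P (mod Q) and r1 = s1 * P (mod Q).
    r0, r1 = trim([x % p for x in Q]), trim([x % p for x in P])
    s0, s1 = [], [1]
    while r1:
        quot, rem = poly_divmod(r0, r1, p)
        r0, r1 = r1, rem
        s0, s1 = s1, poly_sub(s0, poly_mul(quot, s1, p), p)
    if len(r0) != 1:  # the last non-zero remainder has degree >= 1
        return None
    c = pow(r0[0], -1, p)
    return poly_divmod([c * x % p for x in s0], Q, p)[1]

Q = [-1, 0, 0, 0, 0, 1]              # X^5 - 1
inv = poly_inverse([1, 1], Q, 3)     # inverse of 1 + X modulo (X^5 - 1, 3)
```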

A.3.2 Inverse of a Multinomial Modulo an Integer: Cyclic Case

Let p be a prime number, n be a positive integer, and N₁, N₂, …, N_n be n positive integers. Optionally, define the set ℐ=∏_{i=1}^{n} ⟦1, N_i⟧. Optionally, work in the ring R′=ℤ[X₁, X₂, …, X_n]/⟨X₁^{N₁}−1, X₂^{N₂}−1, …, X_n^{N_n}−1⟩. Let a∈R′. Optionally, call its coefficients a_I, I∈ℐ. Let b be another element of R′ with coefficients b_I, I∈ℐ. Optionally, call c their product modulo p: c=(a·b) mod p, and its coefficients c_I, I∈ℐ. For each coefficient, it follows:

$c_{I} = \sum_{J \in ℐ} a_{J} b_{(I_{1} - J_{1}) \mod N_{1}, (I_{2} - J_{2}) \mod N_{2}, \dots, (I_{n} - J_{n}) \mod N_{n}} \mod p .$

The multinomial b is the inverse of a modulo p if c_I=0 for I≠0 and c₀=1. Finding it is equivalent to solving a system of N₁N₂…N_n linear equations modulo p. Since p is a prime number, ℤ_p is a field and Gaussian elimination can be used to determine if a is invertible and, if yes, to compute its inverse. This can be extended to the case where p is not a prime by a suitable modification of the Gaussian reduction algorithm, as shown in the example Python code as follows:

    from math import gcd
    from numpy import asarray, identity


    def inverse_mod_p(p):
        '''
        Assumed helper (not given in the original listing): dictionary mapping
        each invertible residue modulo p to its inverse.
        '''
        return {x: pow(x, -1, p) for x in range(1, p) if gcd(x, p) == 1}


    def Gaussian_elimination(m_, p):
        '''
        Compute the inverse of m_ modulo p using Gaussian elimination, return
        None if m_ is not invertible.

        Arguments:
            m_: square 2D array of integers
            p: positive integer

        Return: matrix of integers or None
        '''
        # copy the input matrix, reduced modulo p so that its entries can be
        # looked up in the dictionary of inverses
        m = m_.copy() % p
        # size of the matrix
        size = m_.shape[0]
        # dictionary of inverses modulo p
        inverses = inverse_mod_p(p)
        # matrix which will contain the result
        res = identity(size, dtype=int)

        # Step 1: eliminate the lower-left triangle
        for i in range(size):
            # look for a row having an ith coefficient invertible modulo p
            line = None
            coeffs = []
            for j in range(i, size):
                if m[j, i] in inverses:
                    line = j
                    break
                coeffs.append(m[j, i])
            # if none has, see if we can make one by taking linear combinations
            # of the lines
            if line is None:
                coeffs = asarray(coeffs)
                while True:
                    if 0 in coeffs:
                        return None
                    imax = coeffs.argmax()
                    imin = coeffs.argmin()
                    quotient = coeffs[imax] // coeffs[imin]
                    coeffs[imax] -= quotient * coeffs[imin]
                    # coeffs starts at row i, so rows of m and res are offset by i
                    m[i + imax] -= quotient * m[i + imin]
                    res[i + imax] -= quotient * res[i + imin]
                    if coeffs[imax] in inverses:
                        line = i + imax
                        break
            # exchange the lines
            temp_m = m[i].copy()
            temp_res = res[i].copy()
            m[i] = m[line]
            res[i] = res[line]
            m[line] = temp_m
            res[line] = temp_res
            # divide the ith line by its ith coefficient
            inverse_mii = inverses[m[i][i] % p]
            res[i] = (res[i] * inverse_mii) % p
            m[i] = (m[i] * inverse_mii) % p
            # eliminate the ith coefficient from all lines below
            for j in range(i + 1, size):
                res[j] = (res[j] - m[j][i] * res[i]) % p
                m[j] = (m[j] - m[j][i] * m[i]) % p

        # Step 2: eliminate the upper-right triangle
        for i in range(size - 1):
            for j in range(size - i - 1):
                res[j] = (res[j] - m[j, size - i - 1] * res[size - i - 1]) % p
                m[j] = (m[j] - m[j, size - i - 1] * m[size - i - 1]) % p

        return res

A.4 NTRUEncrypt and Short Vector Problem (SVP)
A.4.1 Finding the Private Key Implies Solving a SVP

This embodiment sketches how finding the private key of the NTRUEncrypt scheme can be related to a Short Vector Problem (SVP). Optionally, the notations of section 2.1 are used. Denote by h₀, h₁, …, h_{N−1} the coefficients of h, so that h(X)=Σ_{i=0}^{N−1} h_i Xⁱ.

Optionally, define the square matrix H of size N by:

$\forall (i, j) \in {[[1, N]]}^{2}, H_{i, j} = h_{(i - j) \mod N},$

$i . e .,$

$H = (\begin{matrix} h_{0} & h_{N - 1} & h_{N - 2} & \dots & h_{1} \\ h_{1} & h_{0} & h_{N - 1} & \dots & h_{2} \\ ⋮ & ⋮ & ⋮ & ⋱ & ⋮ \\ h_{N - 1} & h_{N - 2} & h_{N - 3} & \dots & h_{0} \end{matrix}) .$

For any vector V of size N, HV is the vector of the coefficients of the polynomial h·v, where

v≡Σ_i=0^N−1V_iXⁱ.

Let I_N be the identity matrix in dimension N and 0_N the null matrix. The matrix B_h is defined by

$B_{h} \equiv [\begin{matrix} I_{N} & 0_{N} \\ H & {qI}_{N} \end{matrix}] .$

Optionally, call f₀, f₁, …, f_{N−1} the coefficients of f and g₀, g₁, …, g_{N−1} those of g. Since f·h=(pg) mod q, one can find integer coefficients a₀, a₁, …, a_{N−1} such that

$B_{h} (\begin{matrix} f_{0} \\ f_{1} \\ ⋮ \\ f_{N - 1} \\ a_{0} \\ a_{1} \\ ⋮ \\ a_{N - 1} \end{matrix}) = (\begin{matrix} f_{0} \\ f_{1} \\ ⋮ \\ f_{N - 1} \\ {pg}_{0} \\ {pg}_{1} \\ ⋮ \\ {pg}_{N - 1} \end{matrix}) .$

(Simply choose the a_i's to be the opposite of the coefficients of f·h−pg divided by q.) So, the vector made of the coefficients of f and p times those of g is a vector in the lattice ℒ(B_h) generated by the columns of B_h. If these coefficients are small enough, this is a small vector in the lattice.
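The construction of H and B_h, and the fact that the vector built from f and (f·h) mod q lies in the lattice, can be illustrated numerically. The sketch below assumes NumPy and uses hypothetical function names and toy values (N=7, q=64, random h); it does not generate a genuine NTRUEncrypt key pair, so the lower half of the vector is not claimed to be short here.

```python
import numpy as np

def circulant(h):
    """Matrix H with H[i, j] = h[(i - j) mod N]."""
    N = len(h)
    i, j = np.indices((N, N))
    return h[(i - j) % N]

def ntru_basis(h, q):
    """Block matrix B_h = [[I_N, 0_N], [H, q I_N]]."""
    N = len(h)
    I = np.eye(N, dtype=np.int64)
    Z = np.zeros((N, N), dtype=np.int64)
    return np.block([[I, Z], [circulant(h), q * I]])

def lattice_vector(h, f, q):
    """Return (a, v) with B_h @ concat(f, a) = v = concat(f, (f.h) mod q)."""
    Hf = circulant(h) @ f
    half = (q - 1) // 2
    target = (Hf + half) % q - half  # centred representative of (f.h) mod q
    a = (target - Hf) // q           # exact: target - Hf is a multiple of q
    return a, np.concatenate([f, target])

rng = np.random.default_rng(4)
N, q = 7, 64
h = rng.integers(0, q, size=N)
f = rng.integers(-1, 2, size=N)
a, v = lattice_vector(h, f, q)
B = ntru_basis(h, q)
```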

Optionally, assume there exists an algorithm to find f in time t from the knowledge of h and q. This algorithm could be used to find pg (equal to (f·h) mod q), and thus a short vector in the lattice ℒ(B_h) after performing a number of operations polynomial in N. Finding the secret key of NTRUEncrypt for a given distribution of polynomials f and g is thus at least as hard (up to some polynomial overhead) as finding a ‘short’ (in a sense which depends on the constraints on f and g) vector in the corresponding lattices.

Similarly, recovering the message m from the encrypted e can be mapped to a Close Vector Problem. Indeed, denoting with an italic letter the sequence of the coefficients of the polynomial denoted by the corresponding bold-face letter, it follows:

$E \equiv (\begin{matrix} 0 \\ 0 \\ ⋮ \\ 0 \\ e_{0} \\ e_{1} \\ ⋮ \\ e_{N - 1} \end{matrix}) = B_{h} (\begin{matrix} r_{0} \\ r_{1} \\ ⋮ \\ r_{N - 1} \\ 0 \\ 0 \\ ⋮ \\ 0 \end{matrix}) + (\begin{matrix} - r_{0} \\ - r_{1} \\ ⋮ \\ - r_{N - 1} \\ m_{0} \\ m_{1} \\ ⋮ \\ m_{N - 1} \end{matrix}) .$

From the knowledge of m, one can get r in polynomial time, and thus a vector of ℒ(B_h) close to E.

A.4.2 How ‘Random’ is the Lattice?

The SVP is conjectured to be hard for both classical and quantum computers over random lattices. The lattice generated by the matrix B_habove is, however, not random due to its block form and the circulant nature of the matrix H. In order for the reduction to a SVP to be a convincing security argument, it is crucial that the lattice structure should not make it easy to find short vectors.

First, it is trivial to find vectors of length q. Denoting by ∥·∥₂the L²norm (i.e., for a polynomial, the square root of the sum of its squared coefficients), the vector giving the private key has a length smaller than q if and only if

∥f∥₂²+p²∥g∥₂²<q².

On the other hand, the norms of these polynomials must not be too small. To see this, optionally follow the argument given in section 3.6.1 of reference [6]. Optionally, generalize the matrix B_h by adding a positive parameter α and define

$B_{h} (α) \equiv [\begin{matrix} α I_{N} & 0_{N} \\ H & {qI}_{N} \end{matrix}] .$

It follows:

$B_{h} (α) (\begin{matrix} f_{0} \\ f_{1} \\ ⋮ \\ f_{N - 1} \\ a_{0} \\ a_{1} \\ ⋮ \\ a_{N - 1} \end{matrix}) = (\begin{matrix} α f_{0} \\ α f_{1} \\ ⋮ \\ α f_{N - 1} \\ p g_{0} \\ p g_{1} \\ ⋮ \\ p g_{N - 1} \end{matrix}),$

so that the private key can be recovered by looking for short vectors in ℒ(B_h(α)). The L² norm of this vector is

$\sqrt{α^{2} \| f \|_{2}^{2} + p^{2} \| g \|_{2}^{2}} .$

For a random lattice of dimension n generated by a matrix of determinant D, the smallest vector is typically expected to have a length slightly larger than D^(1/n)√(n/(2πe)), where e is Euler's number [6]. In our case, n=2N and D=α^N q^N, so that D^(1/n)=√(αq). The shortest vector is thus typically expected to be slightly larger than √(αNq/(πe)). Let c(α) be the ratio of the length of the above vector to this quantity. It follows:

$c (α) = \sqrt{\frac{π e (α \| f \|_{2}^{2} + α^{- 1} p^{2} \| g \|_{2}^{2})}{Nq}} .$

An attacker may find this vector faster than would be possible for a typical random lattice if c(α) is significantly smaller than 1.

The minimum value c_min of c(α) is obtained for α=p∥g∥₂/∥f∥₂:

$c_{\min} = \sqrt{\frac{2 π e p \| f \|_{2} \| g \|_{2}}{Nq}} .$

It is larger than or close to 1 provided

$p { f }_{2} { g }_{2} ≳ \frac{Nq}{2 π e} .$

This is compatible with the previous condition (which, since 2p∥f∥₂∥g∥₂ ≤ ∥f∥₂²+p²∥g∥₂², implies 2p∥f∥₂∥g∥₂ < q²) provided

$N ≲ π e q .$
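As a numerical illustration of the ratio c(α) and of its minimum, the following minimal sketch evaluates both for a pair of toy polynomials. The helper names and the parameters f, g, p, and q are hypothetical and chosen only for illustration.

```python
import math


def l2_norm(coeffs):
    """L2 norm of a polynomial given by its list of coefficients."""
    return math.sqrt(sum(c * c for c in coeffs))


def c_ratio(alpha, f, g, p, q):
    """Ratio c(alpha) of the private-key vector length to the random-lattice estimate."""
    n = len(f)
    nf2 = l2_norm(f) ** 2
    ng2 = l2_norm(g) ** 2
    return math.sqrt(math.pi * math.e * (alpha * nf2 + p * p * ng2 / alpha) / (n * q))


def c_min(f, g, p, q):
    """Minimum of c(alpha), reached at alpha = p * ||g|| / ||f||."""
    n = len(f)
    return math.sqrt(2 * math.pi * math.e * p * l2_norm(f) * l2_norm(g) / (n * q))


# Toy ternary polynomials (hypothetical parameters, not a secure choice).
f = [1, -1, 0, 1, 0, -1, 1]
g = [0, 1, -1, 0, 1, 0, -1]
p, q = 3, 64
alpha_star = p * l2_norm(g) / l2_norm(f)
value_at_minimum = c_ratio(alpha_star, f, g, p, q)
```

A value of c_min well below 1 would indicate that the private-key vector is unusually short for a lattice of this dimension and determinant.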

A.4.3 Extension to Higher Dimensions

The main idea of the argument given in section A.5.1 is to relate polynomial multiplication in R to matrix multiplication. This can be extended to multinomials using more general tensors instead of matrices.

To see this, optionally choose

L ∈ ℕ*, (N₁, N₂, . . . , N_L) ∈ (ℕ*)^L,

and define the ring:

$R′ \equiv \mathbb{Z} [X_{1}, X_{2}, …, X_{L}] / \langle X_{1}^{N_{1}} - 1, X_{2}^{N_{2}} - 1, …, X_{L}^{N_{L}} - 1 \rangle .$

Let a be an element of R′. Optionally, define the tensor A of the coefficients of a. (Optionally, make the indices of tensors start from 0 to simplify the notations.) Let h be a multinomial in R′. Optionally, call h the sequence of its coefficients and H the corresponding tensor such that, for each possible value of (i₁, i₂, . . . , i_L, j₁, j₂, . . . , j_L),

$H_{i_{1}, i_{2}, …, i_{L}, j_{1}, j_{2}, …, j_{L}} = h_{(i_{1} - j_{1}) \bmod N_{1}, (i_{2} - j_{2}) \bmod N_{2}, …, (i_{L} - j_{L}) \bmod N_{L}} .$

Then, the coefficients of the product h·a are those of the tensor HA, defined by:

$\forall I \in \prod_{l = 1}^{L} 〚 0, N_{l} 〛, {(HA)}_{I} = \sum_{J \in \prod_{l = 1}^{L} 〚 0, N_{l} 〛} H_{I, J} A_{J} .$
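For L = 2, the equivalence between the tensor product HA and the product h·a in R′ can be checked directly. The sketch below is illustrative only: both function names are assumptions, and coefficients are stored as nested lists indexed from 0.

```python
def cyclic_mul_2d(h, a):
    """Product of two bivariate multinomials modulo X1^N1 - 1 and X2^N2 - 1."""
    n1, n2 = len(h), len(h[0])
    out = [[0] * n2 for _ in range(n1)]
    for i1 in range(n1):
        for i2 in range(n2):
            for j1 in range(n1):
                for j2 in range(n2):
                    out[(i1 + j1) % n1][(i2 + j2) % n2] += h[i1][i2] * a[j1][j2]
    return out


def tensor_apply(h, a):
    """Contract the tensor H_{I,J} = h_{(I - J) mod N} with the coefficient tensor A."""
    n1, n2 = len(h), len(h[0])
    return [[sum(h[(i1 - j1) % n1][(i2 - j2) % n2] * a[j1][j2]
                 for j1 in range(n1) for j2 in range(n2))
             for i2 in range(n2)] for i1 in range(n1)]


h = [[1, 2, 0], [3, -1, 4]]   # toy multinomial with N1 = 2, N2 = 3
a = [[0, 1, -2], [5, 0, 1]]
product_direct = cyclic_mul_2d(h, a)
product_tensor = tensor_apply(h, a)
```

Both computations return the same coefficient array, which is the content of the statement above.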

Optionally also define the unit tensor I of the same shape as H, defined by:

$\forall (I, J) \in {(\prod_{l = 1}^{L} 〚 0, N_{l} 〛)}^{2}, I_{I, J} = \begin{cases} 1 & \text{if } J = I \\ 0 & \text{otherwise} \end{cases} .$

Optionally, work in the space of real tensors of shape (N₁, N₂, . . . , N_L, 2), with indices starting from 0. Linear transformations in this space can be represented by real tensors of shape (N₁, N₂, . . . , N_L, 2, N₁, N₂, . . . , N_L, 2), in the following way: if 𝒜 is such a tensor and 𝒳 a tensor of shape (N₁, N₂, . . . , N_L, 2), their product 𝒴 = 𝒜𝒳 is the tensor of shape (N₁, N₂, . . . , N_L, 2) given by:

$\forall I \in [\prod_{l = 1}^{L} 〚 0, N_{l} 〛] \times \{0, 1\}, 𝒴_{I} = \sum_{J \in [\prod_{l = 1}^{L} 〚 0, N_{l} 〛] \times \{0, 1\}} 𝒜_{I, J} 𝒳_{J} .$

Optionally, define the tensor ℬ in the following way: for each (I, J) ∈ (Π_{l=1}^L 〚0, N_l〛)²,

- ℬ_{I,0,J,0} = I_{I,J},
- ℬ_{I,0,J,1} = 0,
- ℬ_{I,1,J,0} = H_{I,J},
- ℬ_{I,1,J,1} = q I_{I,J}.

Let f and a be two elements of R′. We denote by y the product h·f and by f, a, and y the sequences of their coefficients. Optionally, define the tensor 𝒳 of shape (N₁, N₂, . . . , N_L, 2) by: for each I ∈ Π_{l=1}^L 〚0, N_l〛,

- 𝒳_{I,0} = f_I is the coefficient of X₁^{I₁}X₂^{I₂}. . . X_L^{I_L} in f,
- 𝒳_{I,1} = a_I is the coefficient of X₁^{I₁}X₂^{I₂}. . . X_L^{I_L} in a.

Then, 𝒴 ≡ ℬ𝒳 is the tensor of shape (N₁, N₂, . . . , N_L, 2) defined by: for each I ∈ Π_{l=1}^L 〚0, N_l〛,

- 𝒴_{I,0} = f_I,
- 𝒴_{I,1} = y_I + q a_I.

One can map any tensor 𝒜 of shape (N₁, N₂, . . . , N_L, 2, N₁, N₂, . . . , N_L, 2) to a square matrix of size 2N₁N₂. . . N_L and any tensor 𝒳 of shape (N₁, N₂, . . . , N_L, 2) to a vector of size 2N₁N₂. . . N_L by turning a multi-index I = (I₁, I₂, . . . , I_L, I_{L+1}) into a single index given by

$I_{L + 1} \prod_{l = 1}^{L} N_{l} + \sum_{l = 1}^{L} I_{l} \prod_{1 \leq k < l} N_{k} .$

Optionally, call M this mapping. Using the above notations, it follows that:

M(ℬ)M(𝒳) = M(ℬ𝒳) = M(𝒴).

So, since the two multinomials f and a (and thus the tensor 𝒳, and thus M(𝒳)) have integer coefficients, M(𝒴) is an element of the lattice ℒ(M(ℬ)) generated by M(ℬ).

Finding a short tensor 𝒴 which can be constructed from two elements of R′, f and a, thus gives a short vector in ℒ(M(ℬ)).

The argument then proceeds as for the case of polynomials: assuming an algorithm exists to find the secret key f of NTRU2D from the knowledge of the public key h and of q, one could then compute pg=(f·h) mod q and, by suitably choosing the coefficients a_I so that no 𝒴_{I,1} is larger than ⌊q/2⌋ in absolute value, and assuming the L² norms of f and pg are sufficiently small, obtain a short vector M(𝒴) in the lattice ℒ(M(ℬ)) with an overhead at most polynomial in N.
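The single index used by the mapping M can be sketched as follows. This is an illustration under stated assumptions: the function name is hypothetical, indices start from 0, and the last entry of the multi-index is the additional index taking the values 0 and 1.

```python
from itertools import product


def flatten_index(multi_index, dims):
    """Map (I_1, ..., I_L, I_{L+1}) to I_{L+1} * prod(N_l) + sum_l I_l * prod_{k<l} N_k."""
    *spatial, last = multi_index
    total = 1
    for n in dims:
        total *= n
    flat = last * total
    stride = 1                      # running product of N_k for k < l
    for i_l, n_l in zip(spatial, dims):
        flat += i_l * stride
        stride *= n_l
    return flat


dims = (2, 3, 4)                    # toy values of (N_1, N_2, N_3)
all_flat = sorted(flatten_index(I + (s,), dims)
                  for I in product(*(range(n) for n in dims))
                  for s in (0, 1))
```

Every integer between 0 and 2N₁N₂. . . N_L − 1 is reached exactly once, so the flattening is a bijection and the vector and matrix obtained from a tensor are well defined.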

Similarly, recovering a message m from the ciphertext e can be mapped to a close vector problem. To see this, consider the block tensor ℬ defined above and the tensors ℛ, ℰ, and 𝒟 with the same shape (N₁, N₂, . . . , N_L, 2) as above and coefficients given by, for each value of I:

- ℛ_{I,0} = r_I,
- ℛ_{I,1} = 0,
- ℰ_{I,0} = 0,
- ℰ_{I,1} = e_I,
- 𝒟_{I,0} = −r_I,
- 𝒟_{I,1} = m_I.

It follows:

ℰ = ℬℛ + 𝒟.

So, since the mapping M is linear and preserves multiplication,

M(ℰ) = M(ℬ)M(ℛ) + M(𝒟).

Given e, from the knowledge of m, one can recover r in polynomial time in N. One can thus, also in polynomial time, get a vector which, if r and m are sufficiently small, is a vector of ℒ(M(ℬ)) close to M(ℰ).

B Practical Considerations
B.1 Multinomial Multiplication and Finite Accuracy

One possible difficulty for the optical implementation of NTRU2D is that coefficients of the product of two multinomials with high degrees can be significantly larger than those of each factor, which can be a problem on low-accuracy devices where the output can take a limited number of different values. We here show two techniques which can be used to mitigate the problem. They both rely on writing each factor as a sum of two multinomials and their product as a sum of products of ‘smaller’ ones. These rewritings can be done in succession several times until the two factors in each term are small enough to be dealt with by the device one wishes to implement multiplication on. The results can then be combined by a higher-accuracy device using bit- or register-shifting and additions.

B.1.1 Reducing the Coefficients Amplitudes

Let n ∈ ℕ*, q ∈ ℕ\{0,1}, and let R be either R₀ ≡ ℤ_q[X₁, X₂, . . . , X_n] or R₀/⟨M₁, M₂, . . . , M_L⟩, where L ∈ ℕ* and M₁, M₂, . . . , M_L ∈ R₀.

Denote by · the multiplication in R. All operations are done modulo q.

Let (a, b) ∈ R². Let l ∈ ℕ*.

Optionally assume that the coefficients of a and b are (2l)-bit integers. One can then find four multinomials a₁, a₂, b₁, and b₂ whose coefficients are l-bit integers such that a=2^l a₁+a₂ and b=2^l b₁+b₂.

(One can take the coefficients of a₁ (respectively b₁) to be the integers given by the l highest bits of those of a (respectively b) and those of a₂ (respectively b₂) to be the integers given by the l lowest bits of those of a (respectively b).)

Consequently:

$a · b = 2^{2l} a_{1} · b_{1} + 2^{l} [a_{1} · b_{2} + a_{2} · b_{1}] + a_{2} · b_{2} .$

The product of a and b can thus be computed in the following way:

- 1. Compute the four products a_i·b_j for (i,j)∈{1,2}².
- 2. Perform bit-shifts by 2l and l to compute 2^(2l) a₁·b₁, 2^l a₁·b₂, and 2^l a₂·b₁.
- 3. Sum the results.

Repeating the above procedure n times (n denoting here the number of iterations), the total number of multinomial multiplications is 4ⁿ, and the number of bits needed to write each coefficient of the multinomials a and b is divided by 2ⁿ.
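One iteration of the three steps above can be sketched as follows. The sketch assumes, for illustration only, the univariate ring ℤ_q[X]/⟨X^N − 1⟩ with non-negative coefficients stored as Python integers; the function names are hypothetical, and the low-accuracy device is stood in for by an ordinary software multiplication.

```python
def cyclic_mul_mod(a, b, q):
    """Product in Z_q[X]/(X^N - 1); stands in for the low-accuracy multiplier."""
    n = len(a)
    out = [0] * n
    for i in range(n):
        for j in range(n):
            out[(i + j) % n] = (out[(i + j) % n] + a[i] * b[j]) % q
    return out


def split_bits(a, l):
    """Return (a1, a2) with a = 2^l * a1 + a2 and l-bit coefficients in a2."""
    mask = (1 << l) - 1
    return [c >> l for c in a], [c & mask for c in a]


def mul_by_splitting(a, b, l, q):
    a1, a2 = split_bits(a, l)
    b1, b2 = split_bits(b, l)
    # Step 1: four products of multinomials with l-bit coefficients.
    p11 = cyclic_mul_mod(a1, b1, q)
    p12 = cyclic_mul_mod(a1, b2, q)
    p21 = cyclic_mul_mod(a2, b1, q)
    p22 = cyclic_mul_mod(a2, b2, q)
    # Steps 2 and 3: bit-shifts by 2l and l, then sum, all modulo q.
    return [((x11 << (2 * l)) + ((x12 + x21) << l) + x22) % q
            for x11, x12, x21, x22 in zip(p11, p12, p21, p22)]


a = [12, 3, 0, 7, 9]     # toy 4-bit coefficients (l = 2), reduced modulo q = 13
b = [1, 11, 5, 0, 8]
result = mul_by_splitting(a, b, l=2, q=13)
```

The recombination in the last line is the part that would run on the higher-accuracy device.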

The number of multiplications needed at each step can be reduced to 3 by noting that:

$a_{1} · b_{2} + a_{2} · b_{1} = (a_{1} - a_{2}) · (b_{2} - b_{1}) + a_{1} · b_{1} + a_{2} · b_{2} .$

However, one bit then needs to be reserved for the sign of each coefficient.

B.1.2 Reducing the Degree

A similar technique can be used to reduce the degrees of the multinomials. Unless explicitly stated, optionally use the same notations as above. Let k be an integer between 1 and n, and l a positive integer. We assume that the largest power of X_k in a and b is no larger than 2l. We can then choose four multinomials a₁, a₂, b₁, and b₂ with a highest power in X_k no larger than l and such that X_k^l a₁+a₂=a and X_k^l b₁+b₂=b. Then,

$a · b = X_{k}^{2l} a_{1} · b_{1} + X_{k}^{l} [(a_{1} - a_{2}) · (b_{2} - b_{1}) + a_{1} · b_{1} + a_{2} · b_{2}] + a_{2} · b_{2} .$

The product of a and b can thus be computed by performing 3 multinomial multiplications (a₁·b₁, a₂·b₂, and (a₁−a₂)·(b₂−b₁)). This procedure can be iterated to reduce, possibly several times, the maximum power of several or all variables.
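The three-product formula can be illustrated on univariate polynomials. The sketch below is a simplified illustration: it works in ℤ[X] without the reduction modulo q or modulo the ideal, assumes both factors have exactly 2l coefficients, and uses hypothetical helper names.

```python
def poly_mul(a, b):
    """Schoolbook product of two coefficient lists (lowest degree first)."""
    out = [0] * (len(a) + len(b) - 1)
    for i, x in enumerate(a):
        for j, y in enumerate(b):
            out[i + j] += x * y
    return out


def poly_add(a, b):
    n = max(len(a), len(b))
    return [(a[i] if i < len(a) else 0) + (b[i] if i < len(b) else 0)
            for i in range(n)]


def poly_sub(a, b):
    return poly_add(a, [-c for c in b])


def shift(a, k):
    """Multiply by X^k."""
    return [0] * k + list(a)


def mul_three_products(a, b, l):
    """a = X^l a1 + a2 and b = X^l b1 + b2, each with 2l coefficients."""
    a2, a1 = a[:l], a[l:]
    b2, b1 = b[:l], b[l:]
    high = poly_mul(a1, b1)                                # a1.b1
    low = poly_mul(a2, b2)                                 # a2.b2
    cross = poly_mul(poly_sub(a1, a2), poly_sub(b2, b1))   # (a1-a2).(b2-b1)
    middle = poly_add(poly_add(cross, high), low)          # a1.b2 + a2.b1
    return poly_add(poly_add(shift(high, 2 * l), shift(middle, l)), low)


a = [1, 2, 3, 4]
b = [5, 6, 7, 8]
result = mul_three_products(a, b, l=2)
```

Only three calls to `poly_mul` are made, each on factors of half the original length; the differences a₁−a₂ and b₂−b₁ may be negative, which is why a sign bit is needed.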

B.2 Block 2D Discrete Fourier Transform—FIGS. 6 and 7

FIG. 8 is a flow chart of a multinomial multiplication using a Fourier transform and its inverse (multinomial multiplication using 2D Fourier transform). The Fourier transform and inverse Fourier transform operations can be advantageously performed on an optical chip in a fast manner, at low power. FIG. 7 is a flow chart of the multinomial inversion using a Fourier transform and its inverse. If a is invertible, the algorithm returns its inverse b.

The optical implementation may be realized on any one or a combination of the prior art optical systems which are embodied in any of the following patent applications which are owned by Optalysys Limited:

- EP1420322;
- WO2018167316;
- EP1546838;
- U.S. Pat. No. 10,289,151;
- U.S. Pat. No. 10,409,084;
- WO2019207317;
- PCT/EP2020/065740.

Each one of these documents is incorporated by reference. The prior art system architectures would be configured to operate the method of various embodiments of the invention.

It will be appreciated that it is possible to extend the same algorithms to higher dimensions replacing the 2D arrays by higher-dimensional ones. Different rings of multinomials are also envisaged.

We now describe how to perform a block decomposition for the discrete 2D Fourier transform. One advantage of such a decomposition is to reduce the maximum modulus of the Fourier coefficients of each block, thus potentially improving the accuracy. A typical workflow can be:

- 1. Perform the Fourier transform on each block using a fast but low-accuracy device.
- 2. Combine the results to compute the full Fourier transform on a slower, high-accuracy device.

For consistency with the notations of the rest of the document, the values of matrix indices start at 0.

Let N₁ and N₂ be two positive integers and let A be a matrix of integers with size (N₁, N₂). We define the Fourier transform Ã of A as the complex matrix with the same shape with coefficients given by:

$\forall (u, υ) \in 〚 0, N_{1} - 1 〛 \times 〚 0, N_{2} - 1 〛,$

${\tilde{A}}_{u, υ} = \sum_{i = 0}^{N_{1} - 1} \sum_{j = 0}^{N_{2} - 1} \exp [2 i π (\frac{ui}{N_{1}} + \frac{υ j}{N_{2}})] A_{i, j} .$

Let n₁ be a divisor of N₁ and n₂ a divisor of N₂. Optionally define q₁≡N₁/n₁ and q₂≡N₂/n₂.

Optionally, define the matrices A^(i,j) of size (q₁, q₂) for (i, j) in 〚0, n₁−1〛×〚0, n₂−1〛 by:

$\forall (I, J) \in 〚 0, q_{1} - 1 〛 \times 〚 0, q_{2} - 1 〛, A_{I, J}^{(i, j)} = A_{i + n_{1} I, j + n_{2} J} .$

Optionally, also define their Fourier transforms Ã^(i,j) in the following way: for each couple of integers (i, j) where i is between 0 and n₁−1 and j is between 0 and n₂−1, Ã^(i,j) is the complex matrix of size (q₁, q₂) given by:

$\forall (U, V) \in 〚 0, q_{1} - 1 〛 \times 〚 0, q_{2} - 1 〛,$

${\tilde{A}}_{U, V}^{(i, j)} = \sum_{I = 0}^{q_{1} - 1} \sum_{J = 0}^{q_{2} - 1} \exp [2 i π (\frac{UI}{q_{1}} + \frac{VJ}{q_{2}})] A_{I, J}^{(i, j)} .$

Let (u, υ) ∈ 〚0, N₁−1〛×〚0, N₂−1〛. We have:

${\tilde{A}}_{u, υ} = \sum_{I = 0}^{q_{1} - 1} \sum_{J = 0}^{q_{2} - 1} \sum_{i = 0}^{n_{1} - 1} \sum_{j = 0}^{n_{2} - 1} \exp [2 i π (\frac{uI}{q_{1}} + \frac{υ J}{q_{2}})] \exp [2 i π (\frac{ui}{N_{1}} + \frac{υ j}{N_{2}})] A_{I, J}^{(i, j)},$

${\tilde{A}}_{u, υ} = \sum_{i = 0}^{n_{1} - 1} \sum_{j = 0}^{n_{2} - 1} \exp [2 i π (\frac{ui}{N_{1}} + \frac{υ j}{N_{2}})] {\tilde{A}}_{u \bmod q_{1}, υ \bmod q_{2}}^{(i, j)} .$

Once the discrete Fourier transforms of the ‘blocks’ A^(i,j) are computed, the full Fourier transform can thus be obtained after performing N₁N₂n₁n₂ multiplications by a complex exponential (n₁n₂ for each entry). To make this number small, it is desirable to keep n₁ and n₂ as low as possible.

The main interest of this procedure is that the Fourier coefficients of each ‘block’ are typically smaller than those of Ã. Indeed, if the absolute value of the coefficients of A is bounded from above by some positive number a_max, those of the Fourier transform of each block are bounded from above by N₁N₂a_max/(n₁n₂), versus a_maxN₁N₂ for those of Ã. A device which can reach an acceptable accuracy provided the coefficients have an absolute value no larger than some positive number a_d will thus be able to compute the Fourier transform of each block provided

$n_{1} n_{2} \geq \frac{a_{\max}}{a_{d}} N_{1} N_{2} .$
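The block decomposition can be sketched as follows, with indices starting at 0 as in this section. The sketch assumes that block (i, j) gathers the entries A_{i+n₁I, j+n₂J}, which is what the recombination formula requires; the function names are hypothetical, and a naive software transform stands in for the fast low-accuracy device.

```python
import cmath


def dft2(a):
    """Naive 2D DFT with the sign convention exp(+2*i*pi*...) used above."""
    n1, n2 = len(a), len(a[0])
    return [[sum(cmath.exp(2j * cmath.pi * (u * i / n1 + v * j / n2)) * a[i][j]
                 for i in range(n1) for j in range(n2))
             for v in range(n2)] for u in range(n1)]


def block_dft2(a, n1, n2):
    """Full DFT of a from the DFTs of its n1 * n2 interleaved blocks."""
    N1, N2 = len(a), len(a[0])
    q1, q2 = N1 // n1, N2 // n2
    # Step 1: transform each block of shape (q1, q2).
    blocks = [[dft2([[a[i + n1 * I][j + n2 * J] for J in range(q2)]
                     for I in range(q1)])
               for j in range(n2)] for i in range(n1)]
    # Step 2: recombine with n1 * n2 complex exponentials per entry.
    return [[sum(cmath.exp(2j * cmath.pi * (u * i / N1 + v * j / N2))
                 * blocks[i][j][u % q1][v % q2]
                 for i in range(n1) for j in range(n2))
             for v in range(N2)] for u in range(N1)]


A = [[(3 * i + 5 * j) % 7 for j in range(6)] for i in range(4)]   # toy input
full = dft2(A)
from_blocks = block_dft2(A, n1=2, n2=3)
```

In this toy case each block has shape (2, 2), so its Fourier coefficients are bounded by 4·a_max instead of 24·a_max for the full transform.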

C. Computing a “Large” 2D Discrete Fourier Transform or a Batch of 1D Ones from “Small” 2D Transforms: Extending the Cooley-Tukey FFT Algorithm

The Cooley-Tukey Fast Fourier Transform (FFT) is an algorithm used to compute a discrete Fourier transform with complexity O(N log N), where N is the number of entries. It is often used to compute one-dimensional Fourier transforms on electronic hardware where the fundamental operations are scalar additions and multiplications. This embodiment shows how to use it to accelerate the computation of the Fourier transforms of large images using an optical device of the kind referred to in any previous section.

Before that, some notations are introduced:

- Consider the two-dimensional discrete Fourier transform with shape (N_x, N_y) for some positive integers N_x and N_y. Two-dimensional arrays of real or complex numbers with the same shape will be denoted by bold capital Latin letters, and their coefficients with the non-bold version and two indices denoting their positions. The set of such arrays is denoted by ℂ^(N_x×N_y).
- Denote by FT the discrete two-dimensional Fourier transform, i.e., the function ℂ^(N_x×N_y) → ℂ^(N_x×N_y) defined by, for each array A ∈ ℂ^(N_x×N_y) and each (j, k)∈[[1, N_x]]×[[1, N_y]]:

${FT (A)}_{j, k} = \sum_{u = 1}^{N_{x}} \sum_{v = 1}^{N_{y}} \exp [2 π i (\frac{(j - 1) (u - 1)}{N_{x}} + \frac{(k - 1) (v - 1)}{N_{y}})] A_{u, v} .$

- Denote by FT⁽¹⁾ the discrete one-dimensional Fourier transform along the first axis, i.e., the function ℂ^(N_x×N_y) → ℂ^(N_x×N_y) defined by, for each array A ∈ ℂ^(N_x×N_y) and each (j, k)∈[[1, N_x]]×[[1, N_y]]:

${{FT}^{(1)} (A)}_{j, k} = \sum_{u = 1}^{N_{x}} \exp [2 π i (\frac{(j - 1) (u - 1)}{N_{x}})] A_{u, k} .$

- Assume the device of interest has a rectangular input array of pixels, with sides given by two positive integers n_x and n_y. Two-dimensional arrays of real or complex numbers with the same shape will be denoted by bold lowercase letters. The set of such arrays is denoted by ℂ^(n_x×n_y).
- Denote by OFT the optical Fourier transform. (It is a function from ℂ^(n_x×n_y) to itself.) For the sake of simplicity, assume OFT is an exact two-dimensional discrete Fourier transform defined by, for each array a ∈ ℂ^(n_x×n_y) and each (j, k)∈[[1, n_x]]×[[1, n_y]]:

${OFT (a)}_{j, k} = \sum_{u = 1}^{n_{x}} \sum_{v = 1}^{n_{y}} \exp [2 π i (\frac{(j - 1) (u - 1)}{n_{x}} + \frac{(k - 1) (v - 1)}{n_{y}})] a_{u, v} .$

- Finally, assume that N_x is a multiple of n_x and that N_y is a multiple of n_y. Define the ratios

$d_{x} = \frac{N_{x}}{n_{x}} \text{ and } d_{y} = \frac{N_{y}}{n_{y}} .$

C.1 Smaller Fourier Transform

Notice that, if m_x, l_x, m_y, and l_y are four positive integers such that m_xl_x=n_x and m_yl_y=n_y, the function OFT can be used to compute a Fourier transform of shape (m_x, m_y) as follows.

Let b be a complex array with shape (m_x, m_y). Define the array a with shape (n_x, n_y) by, for each (j, k)∈[[1, n_x]]×[[1, n_y]],

- If j−1≡0 mod l_xand k−1≡0 mod l_y, then

$a_{j, k} = b_{\frac{j - 1}{l_{x}} + 1, \frac{k - 1}{l_{y}} + 1} .$

- Otherwise, a_j,k=0.

Let b̃ be the array obtained from OFT(a) by restricting its first index to [[1, m_x]] and its second one to [[1, m_y]]. For each (j, k)∈[[1, m_x]]×[[1, m_y]],

${\tilde{b}}_{j, k} = \sum_{u = 1}^{m_{x}} \sum_{v = 1}^{m_{y}} \exp [2 π i (\frac{(j - 1) (u - 1)}{m_{x}} + \frac{(k - 1) (v - 1)}{m_{y}})] b_{u, v} .$

So, b̃ is the Fourier transform of b.
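The construction of this subsection can be sketched as follows, with indices starting at 0 in the code. The function names are hypothetical, and the `oft` argument is a software stand-in (a naive exact transform) for the optical device.

```python
import cmath


def dft2(a):
    """Naive 2D DFT with the sign convention exp(+2*pi*i*...) used above."""
    n1, n2 = len(a), len(a[0])
    return [[sum(cmath.exp(2j * cmath.pi * (u * i / n1 + v * j / n2)) * a[i][j]
                 for i in range(n1) for j in range(n2))
             for v in range(n2)] for u in range(n1)]


def small_ft_via_oft(b, lx, ly, oft=dft2):
    """Fourier transform of shape (m_x, m_y) from one transform of shape (n_x, n_y)."""
    mx, my = len(b), len(b[0])
    nx, ny = mx * lx, my * ly
    # Spread the entries of b on a grid of step (l_x, l_y), with zeros elsewhere.
    a = [[b[j // lx][k // ly] if j % lx == 0 and k % ly == 0 else 0
          for k in range(ny)] for j in range(nx)]
    full = oft(a)
    # Keep only the first m_x rows and m_y columns of the output.
    return [row[:my] for row in full[:mx]]


b = [[1, 2, -1], [0, 3, 4]]          # toy array with (m_x, m_y) = (2, 3)
b_tilde = small_ft_via_oft(b, lx=2, ly=2)
reference = dft2(b)
```

Here a device of shape (4, 6) is used to obtain a transform of shape (2, 3).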

C.2 Larger Fourier Transform

The question dealt with here is: Given a device which can perform the function OFT, how can one reconstruct the function FT? An embodiment presents a solution in three steps: divide the input (which is an array of shape (N_x, N_y), i.e., an element of ℂ^(N_x×N_y)) into d_x×d_y arrays of shape (n_x, n_y), perform OFT on each of them, and recombine the results.

Let A be an array of shape (N_x, N_y), i.e., an element of ℂ^(N_x×N_y). For each (j, k)∈[[1, d_x]]×[[1, d_y]], define the array a^(j,k) of shape (n_x, n_y) by: for each (u, v)∈[[1, n_x]]×[[1, n_y]], a^(j,k)_{u,v} = A_{j+(u−1)d_x, k+(v−1)d_y}. Assume the optical Fourier transform of each of these arrays is computed. Then, the Fourier transform of A can be computed as follows. Let (j, k)∈[[1, N_x]]×[[1, N_y]]. Using the definition of the function FT and decomposing each integer between 1 and N_x (respectively between 1 and N_y) in the right-hand side as a multiple of d_x (respectively d_y) plus the remainder of the Euclidean division by d_x (respectively d_y), it follows:

${FT (A)}_{j, k} = \sum_{j^{'} = 1}^{d_{x}} \sum_{k^{'} = 1}^{d_{y}} \sum_{u^{'} = 1}^{n_{x}} \sum_{v^{'} = 1}^{n_{y}} \exp [2 π i (\frac{(j - 1) (j^{'} + (u^{'} - 1) d_{x} - 1)}{N_{x}} + \frac{(k - 1) (k^{'} + (v^{'} - 1) d_{y} - 1)}{N_{y}})] A_{j^{'} + (u^{'} - 1) d_{x}, k^{'} + (v^{'} - 1) d_{y}} .$

This may be rewritten using the smaller arrays just defined as:

${FT (A)}_{j, k} = \sum_{j^{'} = 1}^{d_{x}} \sum_{k^{'} = 1}^{d_{y}} \sum_{u^{'} = 1}^{n_{x}} \sum_{v^{'} = 1}^{n_{y}} \exp [2 π i (\frac{(j - 1) (j^{'} + (u^{'} - 1) d_{x} - 1)}{N_{x}} + \frac{(k - 1) (k^{'} + (v^{'} - 1) d_{y} - 1)}{N_{y}})] a_{u^{'}, v^{'}}^{(j^{'}, k^{'})} .$

Using that exp(c₁+c₂)=exp(c₁)exp(c₂) for any two complex numbers c₁ and c₂, and that d_x/N_x=1/n_x and d_y/N_y=1/n_y, this becomes:

${FT (A)}_{j, k} = \sum_{j^{'} = 1}^{d_{x}} \sum_{k^{'} = 1}^{d_{y}} \exp [2 π i (\frac{(j - 1) (j^{'} - 1)}{N_{x}} + \frac{(k - 1) (k^{'} - 1)}{N_{y}})] \sum_{u^{'} = 1}^{n_{x}} \sum_{v^{'} = 1}^{n_{y}} \exp [2 π i (\frac{(j - 1) (u^{'} - 1)}{n_{x}} + \frac{(k - 1) (v^{'} - 1)}{n_{y}})] a_{u^{'}, v^{'}}^{(j^{'}, k^{'})} .$

Performing the last two sums is equivalent to computing the optical Fourier transform of the array a^(j′,k′). So,

${FT (A)}_{j, k} = \sum_{j^{'} = 1}^{d_{x}} \sum_{k^{'} = 1}^{d_{y}} \exp [2 π i (\frac{(j - 1) (j^{'} - 1)}{N_{x}} + \frac{(k - 1) (k^{'} - 1)}{N_{y}})] {OFT (a^{(j^{'}, k^{'})})}_{j, k} .$

In this equation, the array indices are assumed to be periodic (with period equal to the size of the array in the corresponding direction) to (slightly) simplify the notations, i.e., the indices j and k of the right-most array should be taken modulo n_x and n_y, respectively. To get the formula without the periodicity condition, simply replace OFT(a^(j′,k′))_{j,k} by OFT(a^(j′,k′))_{j % n_x, k % n_y}, where % denotes the modulo operator such that, for two positive integers a and b, a % b is the remainder of the Euclidean division of a by b if this remainder is not 0, and b if this remainder is 0. (This slightly unusual definition is due to the fact that arrays are indexed from 1.)

This expression can be simplified in the following way. Define the two arrays Ω^(x) and Ω^(y) with respective shapes N_x by d_x and N_y by d_y by: for each (j, j′)∈[[1, N_x]]×[[1, d_x]] and (k, k′)∈[[1, N_y]]×[[1, d_y]],

$Ω_{j, j^{'}}^{(x)} = \exp [\frac{2 i π}{N_{x}} (j - 1) (j^{'} - 1)] \text{ and}$

$Ω_{k, k^{'}}^{(y)} = \exp [\frac{2 i π}{N_{y}} (k - 1) (k^{'} - 1)] .$

Then, re-introducing the modulo operator for completeness,

${FT (A)}_{j, k} = \sum_{j^{'} = 1}^{d_{x}} \sum_{k^{'} = 1}^{d_{y}} Ω_{j, j^{'}}^{(x)} Ω_{k, k^{'}}^{(y)} {OFT (a^{(j^{'}, k^{'})})}_{j \% n_{x}, k \% n_{y}} .$

(Notice that this may be seen as performing n_xn_y Fourier transforms with size N_x by N_y, keeping only d_x by d_y arrays for the input and output.)

Let us now estimate the complexity C of the calculation. Call C_OFT the complexity of each OFT operation. There are d_xd_y such operations and then d_xd_y terms to sum for each of the N_xN_y coefficients. The complexity of the full calculation is thus O((N_xN_y+C_OFT)d_xd_y). In general, C_OFT will be much smaller than N_xN_y for large images. Asymptotically, the complexity thus becomes O(N_xN_yd_xd_y). This is better than the naive Fourier transform approach (which has complexity O(N_x²N_y²)) by a factor n_xn_y.

This result can be further improved by performing the decomposition iteratively several times. Indeed, there is nothing special about the use of the OFT function in the above calculation: we only used that it is a Fourier transform on a subset of the full array. Let us assume, for definiteness, that there exists a positive integer m such that N_x=2^m n_x and N_y=2^m n_y. Then, the Fourier transform of A can be computed by first separating A into 4 sub-arrays (the number of recombination operations required to reconstruct the full result from their Fourier transforms will be 4N_xN_y), then each sub-array into 4 smaller arrays (requiring again 4N_xN_y recombination operations), and so on. After m such subdivisions, perform the 2^(2m) optical Fourier transforms, one on each sub-array of shape (n_x, n_y), and recombine the results using the above formula iteratively, with the function OFT replaced by the discrete Fourier transform of the small arrays. The total number of operations scales like O(4mN_xN_y+2^(2m) C_OFT). It may be rewritten using the total number N=N_xN_y of coefficients, in the limit where N and n_xn_y are both large, as

$C = O (N \log_{2} (\frac{N}{n_{x} n_{y}}) + \frac{N}{n_{x} n_{y}} C_{O F T}) .$

Assuming C_OFTis at most linear in n_xn_y, this gives

$C = O (N \log_{2} (\frac{N}{n_{x} n_{y}})),$

which is better than the complexity O(Nlog₂N) of the Cooley-Tukey approach for large values of n_xn_y.
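The iterated subdivision can be sketched recursively, with indices starting at 0 in the code. The sketch assumes N_x = 2^m n_x and N_y = 2^m n_y for a common integer m; the function names are hypothetical, and `counting_oft` is a software stand-in for the optical device which also records how many times it is called.

```python
import cmath


def dft2(a):
    """Naive 2D DFT with the sign convention exp(+2*pi*i*...) used above."""
    n1, n2 = len(a), len(a[0])
    return [[sum(cmath.exp(2j * cmath.pi * (u * i / n1 + v * j / n2)) * a[i][j]
                 for i in range(n1) for j in range(n2))
             for v in range(n2)] for u in range(n1)]


def ft2_recursive(A, nx, ny, oft):
    """Split into 4 interleaved sub-arrays until the shape is (nx, ny)."""
    Nx, Ny = len(A), len(A[0])
    if (Nx, Ny) == (nx, ny):
        return oft(A)
    if Nx <= nx or Ny <= ny or Nx % 2 or Ny % 2:
        raise ValueError("shape must be (2**m * nx, 2**m * ny)")
    hx, hy = Nx // 2, Ny // 2
    sub = [[ft2_recursive([[A[jp + 2 * u][kp + 2 * v] for v in range(hy)]
                           for u in range(hx)], nx, ny, oft)
            for kp in range(2)] for jp in range(2)]
    # Recombination: 4 terms for each of the Nx * Ny output coefficients.
    return [[sum(cmath.exp(2j * cmath.pi * (j * jp / Nx + k * kp / Ny))
                 * sub[jp][kp][j % hx][k % hy]
                 for jp in range(2) for kp in range(2))
             for k in range(Ny)] for j in range(Nx)]


calls = []


def counting_oft(a):
    calls.append(1)
    return dft2(a)


A = [[(3 * i + 5 * j) % 7 for j in range(12)] for i in range(8)]   # m = 2
result = ft2_recursive(A, 2, 3, counting_oft)
```

With m = 2, the device is called 2^(2m) = 16 times, and each of the two recombination levels costs 4N_xN_y multiplications.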

C.3 Re-Formulation in Terms of a Second Fourier Transform

Define, for each (j, k)∈[[1, n_x]]×[[1, n_y]], the array ā^(j,k) with shape (d_x, d_y) by:

$\forall (j^{'}, k^{'}) \in [[1, d_{x}]] \times [[1, d_{y}]], {\overline{a}}_{j^{'}, k^{'}}^{(j, k)} = Ω_{j, j^{'}}^{(x)} Ω_{k, k^{'}}^{(y)} {OFT (a^{(j^{'}, k^{'})})}_{j, k} .$

Then, the above equation becomes:

${FT (A)}_{j, k} = \sum_{j^{'} = 1}^{d_{x}} \sum_{k^{'} = 1}^{d_{y}} \exp [\frac{2 i π}{d_{x}} ⌊ \frac{j - 1}{n_{x}} ⌋ (j^{'} - 1)] \exp [\frac{2 i π}{d_{y}} ⌊ \frac{k - 1}{n_{y}} ⌋ (k^{'} - 1)] {\overline{a}}_{j^{'}, k^{'}}^{(j \% n_{x}, k \% n_{y})},$

where ⌊(j−1)/n_x⌋ denotes the quotient of the Euclidean division of j−1 by n_x (and similarly for k and n_y). For each element (q_x, q_y, r_x, r_y) of [[0, d_x−1]]×[[0, d_y−1]]×[[1, n_x]]×[[1, n_y]], we have:

${FT (A)}_{q_{x} n_{x} + r_{x}, q_{y} n_{y} + r_{y}} = \sum_{j^{'} = 1}^{d_{x}} \sum_{k^{'} = 1}^{d_{y}} \exp [\frac{2 i π}{d_{x}} q_{x} (j^{'} - 1)] \exp [\frac{2 i π}{d_{y}} q_{y} (k^{'} - 1)] {\overline{a}}_{j^{'}, k^{'}}^{(r_{x}, r_{y})} .$

The array FT(A) can thus be computed by performing n_xn_y Fourier transforms with shape (d_x, d_y) as follows. For each (r_x, r_y)∈[[1, n_x]]×[[1, n_y]], define the array Ā^(r_x,r_y) with shape (d_x, d_y) by:

$\forall (s_{x}, s_{y}) \in [[1, d_{x}]] \times [[1, d_{y}]],$

${\overline{A}}_{s_{x}, s_{y}}^{(r_{x}, r_{y})} = \sum_{j^{'} = 1}^{d_{x}} \sum_{k^{'} = 1}^{d_{y}} \exp [\frac{2 i π}{d_{x}} (s_{x} - 1) (j^{'} - 1)] \exp [\frac{2 i π}{d_{y}} (s_{y} - 1) (k^{'} - 1)] {\overline{a}}_{j^{'}, k^{'}}^{(r_{x}, r_{y})} .$

The array Ā^(r_x,r_y) is simply the Fourier transform of ā^(r_x,r_y). Then, for each (j, k)∈[[1, N_x]]×[[1, N_y]], we have:

${FT (A)}_{j, k} = {\overline{A}}_{⌊ \frac{j - 1}{n_{x}} ⌋ + 1, ⌊ \frac{k - 1}{n_{y}} ⌋ + 1}^{(j \% n_{x}, k \% n_{y})} .$

All in all, computing the Fourier transform of A can, in a preferred embodiment, thus be performed in three steps:

- one involving d_xd_y Fourier transforms with shape (n_x, n_y),
- multiplication of the result by some complex numbers with unit modulus,
- one involving n_xn_y Fourier transforms with shape (d_x, d_y).
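These three steps can be sketched as follows, with indices starting at 0 in the code (so that the remainders r_x and r_y run from 0 to n_x−1 and n_y−1). The function names are hypothetical, and the `oft` argument is a software stand-in for the optical device.

```python
import cmath


def dft2(a):
    """Naive 2D DFT with the sign convention exp(+2*pi*i*...) used above."""
    n1, n2 = len(a), len(a[0])
    return [[sum(cmath.exp(2j * cmath.pi * (u * i / n1 + v * j / n2)) * a[i][j]
                 for i in range(n1) for j in range(n2))
             for v in range(n2)] for u in range(n1)]


def ft_three_steps(A, nx, ny, oft=dft2):
    Nx, Ny = len(A), len(A[0])
    dx, dy = Nx // nx, Ny // ny
    # Step 1: d_x * d_y transforms of shape (n_x, n_y) on interleaved sub-arrays.
    small = [[oft([[A[jp + u * dx][kp + v * dy] for v in range(ny)]
                   for u in range(nx)])
              for kp in range(dy)] for jp in range(dx)]
    out = [[0j] * Ny for _ in range(Nx)]
    for rx in range(nx):
        for ry in range(ny):
            # Step 2: multiplication by complex numbers with unit modulus.
            abar = [[cmath.exp(2j * cmath.pi * (rx * jp / Nx + ry * kp / Ny))
                     * small[jp][kp][rx][ry]
                     for kp in range(dy)] for jp in range(dx)]
            # Step 3: one transform of shape (d_x, d_y) per pair (r_x, r_y).
            Abar = dft2(abar)
            for qx in range(dx):
                for qy in range(dy):
                    out[qx * nx + rx][qy * ny + ry] = Abar[qx][qy]
    return out


A = [[(3 * i + 5 * j) % 7 for j in range(6)] for i in range(4)]   # toy input
result = ft_three_steps(A, nx=2, ny=3)
reference = dft2(A)
```

In this toy case, step 1 uses 4 transforms of shape (2, 3) and step 3 uses 6 transforms of shape (2, 2).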

If there exists a positive integer α such that N_x=n_x^α and N_y=n_y^α, this procedure can be iterated to perform the full Fourier transform using the OFT function as a building block. This function will then be called α n_x^(α−1) n_y^(α−1) times in total, and the procedure involves (α−1)N_xN_y multiplications by a complex exponential. If C_OFT denotes the complexity of the OFT function and C_x that of the multiplication by a complex exponential, the total complexity C is thus

$C \approx \frac{N_{x} N_{y}}{n_{x} n_{y}} \frac{\log (N_{x} N_{y})}{\log (n_{x} n_{y})} C_{OFT} + N_{x} N_{y} (\frac{\log (N_{x} N_{y})}{\log (n_{x} n_{y})} - 1) C_{x} .$

(The ≈ symbol is used because some re-ordering of coefficients or matrix transpositions may be required depending on the implementation.) For large values of α, this may be simplified as:

$C \approx N_{x} N_{y} \frac{\log (N_{x} N_{y})}{\log (n_{x} n_{y})} (\frac{C_{OFT}}{n_{x} n_{y}} + C_{x}) .$

C.4 One-Dimensional Fourier Transforms

Using the above equation

${FT (A)}_{j, k} = \sum_{j^{'} = 1}^{d_{x}} \sum_{k^{'} = 1}^{d_{y}} \exp [2 π i (\frac{(j - 1) (j^{'} - 1)}{N_{x}} + \frac{(k - 1) (k^{'} - 1)}{N_{y}})] {OFT (a^{(j^{'}, k^{'})})}_{j \% n_{x}, k \% n_{y}}$

and performing the inverse Fourier transform along the second axis gives, denoting by FT⁽¹⁾ the one-dimensional Fourier transform along the first axis:

${{FT}^{(1)} (A)}_{j, k} = \frac{1}{N_{y}} \sum_{k^{″} = 1}^{N_{y}} \sum_{j^{'} = 1}^{d_{x}} \sum_{k^{'} = 1}^{d_{y}} \exp [2 π i (\frac{(j - 1) (j^{'} - 1)}{N_{x}} + \frac{(k^{″} - 1) (k^{'} - k)}{N_{y}})] {OFT (a^{(j^{'}, k^{'})})}_{j \% n_{x}, k^{″} \% n_{y}} .$

This may be rewritten as:

${{FT}^{(1)} (A)}_{j, k} = \frac{1}{N_{y}} \sum_{k^{″} = 1}^{n_{y}} \sum_{k^{′′′} = 1}^{d_{y}} \sum_{j^{'} = 1}^{d_{x}} \sum_{k^{'} = 1}^{d_{y}} \exp [2 π i (\frac{(j - 1) (j^{'} - 1)}{N_{x}} + \frac{(k^{″} + (k^{′′′} - 1) n_{y} - 1) (k^{'} - k)}{N_{y}})] {OFT (a^{(j^{'}, k^{'})})}_{j \% n_{x}, k^{″}} .$

Summing over k‴ (noting that the sum gives 0 unless k′−k≡0 [d_y] since N_y=n_yd_y, and that k′−k=−d_y⌊(k−1)/d_y⌋ when k′=k % d_y) gives:

${{FT}^{(1)} (A)}_{j, k} = \frac{1}{n_{y}} \sum_{k^{″} = 1}^{n_{y}} \sum_{j^{'} = 1}^{d_{x}} \exp [2 π i (\frac{(j - 1) (j^{'} - 1)}{N_{x}} - \frac{(k^{″} - 1) ⌊ \frac{k - 1}{d_{y}} ⌋}{n_{y}})] {OFT (a^{(j^{'}, k \% d_{y})})}_{j \% n_{x}, k^{″}} .$

Define, for each (j, k)∈[[1, n_x]]×[[1, d_y]], the array a̿^(j,k) with shape (d_x, n_y) by:

$\forall (j^{'}, k^{'}) \in [[1, d_{x}]] \times [[1, n_{y}]],$

${\overset{=}{a}}_{j^{'}, k^{'}}^{(j, k)} = \exp [2 π i \frac{(j - 1) (j^{'} - 1)}{N_{x}}] {OFT (a^{(j^{'}, k)})}_{j, (((n_{y} - k^{'} + 1) \%_{0} n_{y}) + 1)},$

where %₀ denotes the standard modulo operator, i.e., if n and m are two integers with m positive, n %₀ m is the integer between 0 and m−1 (included) such that m divides n−(n %₀ m). The reversal of the second index turns the inverse transform over k″ into a direct one.

Then, the above equation becomes:

${{FT}^{(1)} (A)}_{j, k} = \frac{1}{n_{y}} \sum_{k^{'} = 1}^{n_{y}} \sum_{j^{'} = 1}^{d_{x}} \exp [2 π i (\frac{⌊ \frac{j - 1}{n_{x}} ⌋ (j^{'} - 1)}{d_{x}} + \frac{(k^{'} - 1) ⌊ \frac{k - 1}{d_{y}} ⌋}{n_{y}})] {\overset{=}{a}}_{j^{'}, k^{'}}^{(j \% n_{x}, k \% d_{y})} .$

Let us call, for each (j, k)∈[[1, n_x]]×[[1, d_y]], A̿^(j,k) the Fourier transform (with shape (d_x, n_y)) of a̿^(j,k). Then, the above equation becomes:

${{FT}^{(1)} (A)}_{j, k} = \frac{1}{n_{y}} {\overset{=}{A}}_{⌊ \frac{j - 1}{n_{x}} ⌋ + 1, ⌊ \frac{k - 1}{d_{y}} ⌋ + 1}^{(j \% n_{x}, k \% d_{y})} .$

This gives a way to perform N_y batched one-dimensional Fourier transforms of size N_x from two-dimensional ones.

In particular, choosing N_y=n_y (and thus d_y=1) and N_x=n_x² (and thus d_x=n_x), this procedure makes it possible to compute n_y 1D Fourier transforms with size n_x² by performing

- n_x²n_y memory accesses,
- n_x 2D Fourier transforms with shape (n_x, n_y),
- n_x²n_y complex multiplications and memory accesses,
- n_x 2D Fourier transforms with shape (n_x, n_y),
- n_x²n_y memory accesses.

In total, this algorithm requires 3n_x²n_y memory accesses, n_x²n_y complex multiplications, and 2n_x 2D Fourier transforms with shape (n_x, n_y) to compute n_y 1D Fourier transforms with size n_x².
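The special case d_y = 1 can be sketched as follows, with indices starting at 0 in the code. The function names are hypothetical, a naive software transform stands in for the optical device, and the reversal of the second index is implemented as (n_y − k′) modulo n_y.

```python
import cmath


def dft2(a):
    """Naive 2D DFT with the sign convention exp(+2*pi*i*...) used above."""
    n1, n2 = len(a), len(a[0])
    return [[sum(cmath.exp(2j * cmath.pi * (u * i / n1 + v * j / n2)) * a[i][j]
                 for i in range(n1) for j in range(n2))
             for v in range(n2)] for u in range(n1)]


def dft1_axis0(A):
    """Reference: one-dimensional DFT of every column (first axis)."""
    Nx, Ny = len(A), len(A[0])
    return [[sum(cmath.exp(2j * cmath.pi * j * u / Nx) * A[u][k] for u in range(Nx))
             for k in range(Ny)] for j in range(Nx)]


def batched_dft1_from_2d(A, nx, oft=dft2):
    """N_y = n_y 1D transforms of size N_x = d_x * n_x from 2D transforms (d_y = 1)."""
    Nx, ny = len(A), len(A[0])
    dx = Nx // nx
    # First batch: d_x transforms of shape (n_x, n_y).
    small = [oft([[A[jp + u * dx][v] for v in range(ny)] for u in range(nx)])
             for jp in range(dx)]
    out = [[0j] * ny for _ in range(Nx)]
    for r in range(nx):
        # Complex multiplications and reversal of the second index.
        a2 = [[cmath.exp(2j * cmath.pi * r * jp / Nx) * small[jp][r][(ny - kp) % ny]
               for kp in range(ny)] for jp in range(dx)]
        # Second batch: n_x transforms of shape (d_x, n_y).
        A2 = dft2(a2)
        for qx in range(dx):
            for k in range(ny):
                out[qx * nx + r][k] = A2[qx][k] / ny
    return out


A = [[(2 * i + 3 * k) % 5 for k in range(2)] for i in range(9)]   # N_x = 9, n_y = 2
result = batched_dft1_from_2d(A, nx=3)
reference = dft1_axis0(A)
```

With n_x = 3 and n_y = 2, the sketch uses 2n_x = 6 two-dimensional transforms of shape (3, 2) to obtain 2 one-dimensional transforms of size 9.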

C.5 Higher-Dimensional Fourier Transforms

One- and two-dimensional Fourier transforms can be combined to produce higher-dimensional ones. For any positive integer D, the (D-dimensional) Fourier transform of a D-dimensional array A can be computed, for instance,

- by performing one-dimensional Fourier transforms along each of the dimensions,
- or by performing two-dimensional Fourier transforms in ⌊D/2⌋ planes with no common non-vanishing vector and, if D is odd, one-dimensional ones in the last direction.

For instance, the three-dimensional Fourier transform of an array with shape (N_x, N_y, N_z) can be performed by doing N_z two-dimensional Fourier transforms with shape (N_x, N_y) followed by N_xN_y one-dimensional Fourier transforms with size N_z.
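The three-dimensional example can be sketched as follows, with indices starting at 0 in the code. All function names are hypothetical, and naive software transforms stand in for the two-dimensional device.

```python
import cmath


def dft1(x):
    """Naive 1D DFT with the sign convention exp(+2*pi*i*...)."""
    n = len(x)
    return [sum(cmath.exp(2j * cmath.pi * k * t / n) * x[t] for t in range(n))
            for k in range(n)]


def dft2(a):
    n1, n2 = len(a), len(a[0])
    return [[sum(cmath.exp(2j * cmath.pi * (u * i / n1 + v * j / n2)) * a[i][j]
                 for i in range(n1) for j in range(n2))
             for v in range(n2)] for u in range(n1)]


def dft3_direct(a):
    """Reference: naive three-dimensional DFT."""
    nx, ny, nz = len(a), len(a[0]), len(a[0][0])
    return [[[sum(cmath.exp(2j * cmath.pi * (u * x / nx + v * y / ny + w * z / nz))
                  * a[x][y][z]
                  for x in range(nx) for y in range(ny) for z in range(nz))
              for w in range(nz)] for v in range(ny)] for u in range(nx)]


def dft3_from_2d_and_1d(a):
    nx, ny, nz = len(a), len(a[0]), len(a[0][0])
    # N_z two-dimensional transforms with shape (N_x, N_y), one per slice.
    slices = [dft2([[a[x][y][z] for y in range(ny)] for x in range(nx)])
              for z in range(nz)]
    out = [[[0j] * nz for _ in range(ny)] for _ in range(nx)]
    # N_x * N_y one-dimensional transforms with size N_z.
    for x in range(nx):
        for y in range(ny):
            column = dft1([slices[z][x][y] for z in range(nz)])
            for z in range(nz):
                out[x][y][z] = column[z]
    return out


a = [[[(x + 2 * y + 3 * z) % 4 for z in range(2)] for y in range(3)] for x in range(2)]
result = dft3_from_2d_and_1d(a)
reference = dft3_direct(a)
```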

REFERENCES

[1] Jeffrey Hoffstein, Jill Pipher, and Joseph H. Silverman. NTRU: A new high speed public key cryptosystem. 13 Aug. 1996. preliminary draft.

[2] Jeffrey Hoffstein, Jill Pipher, and Joseph H. Silverman. NTRU: A ring-based public key cryptosystem. In Joe P. Buhler, editor, Algorithmic Number Theory, pages 267-288, Berlin, Heidelberg, 1998. Springer Berlin Heidelberg.

[3] Damien Stehlé and Ron Steinfeld. Making NTRUEncrypt and NTRUSign as Secure as Standard Worst-Case Problems over Ideal Lattices. Cryptology ePrint Archive, Report 2013/004, 2013. https://eprint.iacr.org/2013/004.

[4] Daniel Augot, Lejla Batina, Daniel J. Bernstein, Joppe Bos, Johannes Buchmann, Wouter Castryck, Orr Dunkelman, Tim Güneysu, Shay Gueron, Andreas Hülsing, Tanja Lange, Mohamed Saied Emam Mohamed, Christian Rechberger, Peter Schwabe, Nicolas Sendrier, Frederik Vercauteren, and Bo-Yin Yang. Post-Quantum Cryptography for Long-Term Security. 7 Sep. 2015.

[5] Daniel J. Bernstein, Chitchanok Chuengsatiansup, Tanja Lange, and Christine van Vredendaal. NTRU Prime: reducing attack surface at low cost. 2018.

[6] Jeffrey Hoffstein, Jill Pipher, and Joseph H. Silverman. NTRU: A public key cryptosystem. 1999.

[7] Ruiqing Dong. Efficient Multiplication Architectures for Truncated Polynomial Ring. PhD thesis, 2016. https://scholar.uwindsor.ca/etd/5814.

[8] Eliane Jaulmes and Antoine Joux. A chosen-ciphertext attack against NTRU. In Mihir Bellare, editor, Advances in Cryptology—CRYPTO 2000, pages 20-35, Berlin, Heidelberg, 2000. Springer Berlin Heidelberg.

[9] Nick Howgrave-Graham, Joseph H. Silverman, Ari Singer, and William Whyte. NAEP: Provable Security in the Presence of Decryption Failures. 2003.

[10] Eiichiro Fujisaki and Tatsuaki Okamoto. Secure Integration of Asymmetric and Symmetric Encryption Schemes. Journal of Cryptology, 26:80-101, 2013.

[11] Jeffrey Hoffstein and Joseph H. Silverman. Protecting ntru against chosen ciphertext and reaction attacks. 2000.

[12] Zhenfei Zhang. A short review of NTRU cryptosystem, 07 2017.

[13] Ali Atici, Lejla Batina, Junfeng Fan, Ingrid Verbauwhede, and S. B. Örs Yalçın. Low-cost Implementations of NTRU for pervasive security. pages 79-84, 08 2008.

[14] Jean-François Biasse and Fang Song. Efficient quantum algorithms for computing class groups and solving the principal ideal problem in arbitrary degree number fields. In Proceedings of the Twenty-Seventh Annual ACM-SIAM Symposium on Discrete Algorithms, SODA '16, pages 893-902, USA, 2016. Society for Industrial and Applied Mathematics.

[15] Ronald Cramer, Leo Ducas, and Benjamin Wesolowski. Short stickelberger class relations and application to Ideal-SVP. Cryptology ePrint Archive, Report 2016/885, 2016. https://eprint.iacr.org/2016/885.

[16] Philip Hirschhorn, Jeffrey Hoffstein, Nick Howgrave-Graham, and William Whyte. Choosing NTRUEncrypt parameters in light of combined lattice reduction and MITM approaches. volume 5536, pages 437-455, 06 2009.
