The invention relates to a computation device, a computation method, and a computer readable medium.
Cryptographic devices which execute some cryptographic operation are under various types of threats from attackers on those devices. On one level, an attacker may try to attack the cryptographic operation itself. For example, an attacker may try to collect pairs of inputs and outputs, say of an encryption or decryption, or of a signing operation, and try to derive therefrom secret information, e.g., a secret key, such as a secret symmetric key, or a secret asymmetric key, e.g., a so-called private key. Attacks of this type are sometimes referred to as black box attacks, since the attacker does not need to know anything about the implementation of the operation in the cryptographic device.
On a next level, an attacker may try to obtain additional information by running the cryptographic operation on the device and observing the process. In particular, the attacker may collect, what is known as side-channel information. Typical side-channels include the power usage, or the electromagnetic radiation produced by the device while it is running the operation. If side channels have not been taken into account during the design of an operation, then a device is typically vulnerable to side-channel attacks. Side-channel attacks of some kind are known against many types of algorithms. This includes both symmetric algorithms, such as block ciphers, and asymmetric algorithms. Algorithms that are based on arithmetic using mathematical principles are also vulnerable to side-channel attacks, e.g., the RSA based algorithms for decryption or signing, or the Diffie-Hellman key agreement algorithms. Known countermeasures against side-channels reduce the correlation between the operation that is performed and the side-channel. Side-channel based attacks are sometimes referred to as grey-box attacks, since the attacker has some ability to peer into the device that performs the operation. The sophistication of side-channel attacks is every increasing. For example, not only should individual measurement be uncorrelated to secret information, preferably combinations of measurements should also be uncorrelated, etc.
In addition, attacks known from the far end of the scale, the white-box attacks, inspire attackers to more complicated attacks in grey-box settings as well. In a white-box attack, an attacker has full access to the inner workings of the device. There is a fine line though between grey-box and white-box attacks. For example, in some situations the best known attacks in the white-box model, are in fact taken from side-channel analysis.
There is a difficulty in protecting algorithms that are based on arithmetic. Typically, the numbers on which such algorithms act are very large. For example, the RSA algorithm is routinely applied to 2048 or 4096-bit numbers. One approach to protecting such algorithms is to blind, or mask the numbers. For example, instead storing and calculating with the real variable x, one may instead store and calculate with a blinded number x′. Two types masking have been explored in the context of cryptographic algorithms based on arithmetic: multiplicative masking and additive masking. In the former, one takes x′=cx, for some blinding data c, in the second case one takes x′=x+c. Typically, if multiplications are to be performed, then multiplicative masking is more convenient, whereas if additions are needed then additive masking is more convenient. If both types of operations are needed, one has either to re-encode to another type of masking or to perform a correcting calculation after the operation to account for the errors introduced by the masking. Note that multiplicative masking does not work well in case the variables has the value 0. This is generally not a problem, and may be addressed by avoiding the value 0.
For this reason, algorithms which rely on many multiplications, such as the exponentiation in RSA or Diffie-Hellman have typically been protected using multiplicative blinding. There have been attempts though to use additive masking in cryptographic operations which are based on multiplications.
A first example, is given in US 2009/0086961. In this approach a variable A is masked by replacing it by A+kn, wherein k is a random number and n is the modulus of the multiplication. As the modulus will eventually fall out, e.g., through a modular reduction, the masking has no effect on either additions or multiplications. In this case the additive masking is possible by restricting the allowable masking data. Only multiples of the public modulus are allowed to be added in order to mask. This restricts the randomness that can be introduced.
A second example, is given in “Thwarting side-channel analysis against RSA cryptosystems with additive blinding”, by HeeSeok Kim. In this example, additive blinding is used to secure an RSA cryptosystem against power analyses. In this type of blinding, two input values A and B are blinded as A+MA, and B+MB, respectively. The article next performs a mathematical analysis to compute conditions on the masking so that the multiplication (A+MA)(B+MB) receives a correct and masked outcome. The condition that is imposed is that the inputs are to have a constant ratio, i.e., that
where c is a constant. As it turns out this restriction can be accommodated if the only calculation that is needed is an exponentiation. Thus in this example, full additive blinding is obtained, but the downside is that only a limited number of multiplications are possible.
A computation device is proposed which addresses these and other concerns is defined in the claims. The computation device can perform various arithmetic operations on variables, e.g., multiplication and conditional assignments. These operations can in turn be combined into other secure operations, such as a secure exponentiation.
The inventors made several insights which improve the masking of variables, especially for use in obfuscated arithmetic, e.g., to increase resistance against side-channel attacks. First of all, the inventors created an algorithm that correctly computes a multiplication result even if the inputs are masked additively. Rather than replacing a variable by a blinded variable, a variable may be replaced by a set of values. Each individual member of the set may be randomly chosen, however the sum of the set equals the represented value. This is a representation of a number as multiple additive shares. The algorithms can multiply values even if represented as a set of additive shares, and produce as output a number which is itself masked as a set of additive shares. The number of additive shares in the set can be 2, but it can also be more than 2. For example, the number of elements may be 4 or larger, 8 or larger, etc. In fact, a larger number additive shares implies a more tenuous relationship between side-channels and the actual computation. Most of the computation, even when taken in combination, may thus operate on random data. Only the combination of all additive shares has a relationship to the actual represented variable.
However, the inventors found that yet more complicated representation is compatible with arithmetic operations. For example, in an embodiment the computation device encodes variables in two stages. First a variable is encoded as a one or more multiplicative shares. Then the multiplicative shares are individually encoded as multiple additive shares. Only the latter need to be present in a memory of the device. Although the multiplicative shares can be computed from the sets of additive shares that represent them, this is generally not needed. For example, the latter may be done during a decoding step, but for the most part of the computation, e.g., for multiplication, conditional assignment, or exponentiation it is not needed to compute the multiplicative shares.
The number of multiplicative shares is at least one. If only one multiplicative share is used, then the multiplicative share is equal to the number itself. Such an embodiment has the advantage that number can be added as well as multiplied easily. For example, to add two numbers, one may add the additive shares of the two addends component wise. To multiply, an algorithm as disclosed herein may be used. A downside of using only one multiplicative share is that the relationship between shares and the represented number is linear. Moreover using multiple multiplicative shares has the advantage that an attacker has to monitor more values simultaneously.
Thus, in an embodiment the number of multiplicative shares is one. An example of such an embodiment is an electronic computation device arranged for obfuscated execution of a multiplication. The device comprises
In an embodiment, the number of multiplicative shares is more than one. An advantage of using more than one multiplicative share is that the relationship between the shares and the represented number is non-linear. This means that side-channel attacks that try to establish a correlation between secret information and multiple measurements are more difficult since the relationship between measurement and secret data is not linear. For example, the number of multiplicative shares may be 2 or more, 4 or more, 8 or more, etc. The total number of shares is the product of the number of multiplicative shares and the number of additive shares. In an embodiment, the total number of shares is at least 3.
Below we will generally assume that more than one multiplicative share is used, but such embodiments are easily adjusted to the situation in which only one multiplicative share is used. The inventors had the insight that on such a representation both multiplication and conditional assignments can be effectively computed. Such a computation neither needs a re-encoding into a different encoding format, nor are corrections needed to correct computations results done on encoded variables. Operating on sets of shares is preferable to protect against side channels, since anything less than perfect knowledge of the shares does not leak information about the information that is being encoded. This property makes the representation particularly suitable for operating on secret information, such as may be done in cryptographic operations. Having multiplications and conditional assignments a secure exponentiation can be performed. The latter can be used to secure cryptographic operations that depend on exponentiations with secret exponents, e.g., RSA decryption, RSA signatures and Diffie-Hellman key agreement in finite fields, e.g., with a prime number of elements.
The computation device is an electronic device. For example, it may be a mobile electronic device, such as a mobile phone. The computation device may be a set-top box, a smart-card, a computer, etc. The computation device and method described herein may be applied in a wide range of practical applications. Such practical applications include communication in which confidentiality or authenticity is protected using cryptographic operations.
A method according to the invention may be implemented on a computer as a computer implemented method, or in dedicated hardware, or in a combination of both. Executable code for a method according to the invention may be stored on a computer program product. Examples of computer program products include memory devices, optical storage devices, integrated circuits, servers, online software, etc. Preferably, the computer program product comprises non-transitory program code stored on a computer readable medium for performing a method according to the invention when said program product is executed on a computer.
In a preferred embodiment, the computer program comprises computer program code adapted to perform all the steps of a method according to the invention when the computer program is run on a computer. Preferably, the computer program is embodied on a computer readable medium.
Another aspect of the invention provides a method of making the computer program available for downloading. This aspect is used when the computer program is uploaded into, e.g., Apple's App Store, Google's Play Store, or Microsoft's Windows Store, and when the computer program is available for downloading from such a store.
Further details, aspects, and embodiments of the invention will be described, by way of example only, with reference to the drawings. Elements in the figures are illustrated for simplicity and clarity and have not necessarily been drawn to scale. In the Figures, elements which correspond to elements already described may have the same reference numerals. In the drawings,
100 a computation device
110 a communication interface
120 a storage
132 a convolution unit
134 an addition unit
142 a multiplication unit
144 a conditional assignment unit
150 an exponentiation unit
2 a number
20 multiple multiplicative shares
21-22 a multiplicative share
210, 220 multiple additive shares
211-213, 221-223 an additive share
30, 40 multiple multiplicative shares
31-33, 41-43 a set of additive shares
51, 52 a selection number represented as multiple additive shares
350, 351, 352 a convolution unit
353 an addition unit
1000 a computer readable medium
1010 a writable part
1020 a computer program
1110 integrated circuit(s)
1120 a processing unit
1122 a memory
1124 a dedicated integrated circuit
1126 a communication element
1130 an interconnect
1140 a processor system
While this invention is susceptible of embodiment in many different forms, there are shown in the drawings and will herein be described in detail one or more specific embodiments, with the understanding that the present disclosure is to be considered as exemplary of the principles of the invention and not intended to limit the invention to the specific embodiments shown and described.
In the following, for the sake of understanding, elements of embodiments are described in operation. However, it will be apparent that the respective elements are arranged to perform the functions being described as performed by them.
Further, the invention is not limited to the embodiments, and the invention lies in each and every novel feature or combination of features described herein or recited in mutually different dependent claims.
The inventors have realized that a particular representation of the variables involved in the multiplication is particularly advantageous. The representation combines two different types of obfuscated representation to achieve additional advantages. In particular, the representation allows efficient computation of multiplications and of conditional assignments. More in particular, the representation does not require the re-encoding of variables from one representation to another, or the computation of compensating data to correct the effect of operating on obfuscated representation, rather than on a regular representation.
In general, the computations indicated herein will be modulo some modulus, e.g., depending on the variables being modulo some number, e.g., as is required in RSA exponentiation operations. For example, modulo operations may be inserted at suitable points to avoid number becoming too large, e.g., after each multiplication, etc. It is not necessary that each integer is kept below the modulus at all times. For example, an embodiment may employ so-called pseudo-residues which are only occasionally reduced modulo the modulus. The modulus may, e.g., be a prime number p, or a composite number, e.g., a product of two primes pq.
Computation device 100 comprises storage 120 which comprises variables on which computation device 100 operates in encoded form. A number x, e.g., an integer, e.g., an integer modulo a modulus is initially represented as multiple multiplicative shares X=(x0,x1, . . . ,xm−1). That is the product of the numbers xi equals the number x being represented. We will denote sequence of multiple numbers, e.g., tuples, with capital letters, and particular numbers with small case letters. We will refer to the number xi as multiplicative shares. Note that the multiplicative shares will typically not be stored in storage 120, and in fact will typically not be present at all in storage 120—possibly with some temporary exceptions, e.g., during encoding or decoding of a variable to or from the special representation herein defined. In other words, in an embodiment, an attacker performing, say, a memory scraping attack on device 100, in which the memory is combed to find variables relating to the operation, will not find the multiplicative shares themselves. Instead, the multiplicative shares are represented themselves as additive shares. For example, the multiplicative share xi may be represented in storage 120, e.g., in a memory of device 100, as multiple additive shares, e.g., as a tuple Xi=(xi,0,xi,1, . . . , xi,n−1). Assuming that a number is represented using m multiplicative shares, which are each represented as n additive shares, the number may thus be represented as nm shares. Representation as shares is desirable since even perfect knowledge of any number of the shares, except all shares, does not reveal any information about the underlying number.
For example, the numbers: 55, 40, and 70 may be represented as the tuples (78, 105, 98), (34, 40, 79), (12, 81, 90), since their sums are 55, 40, and 70 respectively modulo 113. In an embodiment, the number 94 modulo 113 may thus be represented as the sequence 78, 105, 98, 34, 40, 79, 12, 81, 90. Further obfuscation techniques may be further applied to this sequence. For example, the sequence need not be stored in this particular order, but may be permuted or scattered throughout storage 120. For example, the additive shares may be encoded.
Returning to
The communication interface 110 may be arranged to communicate with other devices over a computer network. The computer network may be an internet, an intranet, a LAN, a WLAN, etc. The computer network may be the Internet. The computer network may be wholly or partly wired, and/or wholly or partly wireless. For example, the computer network may comprise Ethernet connections. For example, the computer network may comprise wireless connections, such as Wi-Fi, ZigBee, and the like. The connection interface may be arranged to communicate with other devices as needed. For example, the connection interface may comprise a connector, e.g., a wired connector, e.g., an Ethernet connector, or a wireless connector, e.g., an antenna, e.g., a Wi-Fi, 4G or 5G antenna. Communication interface 110 may be used to receive transactions to operate upon, or for receiving secret information, e.g., secret keys. Messages may be digital messages, e.g., received in electronic form.
Computation device 100 may comprise various units, e.g., one or more of a convolution unit 132, an addition unit 134, a multiplication unit 142, a conditional assignment unit 144, and an exponentiation unit 150. If, e.g., exponentiation is not needed, then exponentiation unit 150 may be omitted, etc. If no conditional assignment is needed, then, addition unit 134 and conditional assignment unit 144 may be omitted, etc.
The execution of the computation device, e.g., of an embodiment of a computation method may be implemented in one or more processor circuits, examples of which are shown herein.
The inventors had the insight that numbers represented as indicated above can be multiplied using convolutions. To this end, one or more convolution units 132 are arranged to convolute two sets of multiple additive shares, e.g., that together represent one multiplicative share.
Multiplicative unit 142 may be arranged to multiply a first variable in the storage with a second variable in the storage. For example, the first variable may be the number x represented as multiple multiplicative shares X=(x0,x1, . . . , xm−1), which in turn are represented in the storage as multiple additive shares Xi=(xi,0,xi,1, . . . , xi,n−1), with 0≤x<m. For example, the second variable may be the number y represented as multiple multiplicative shares Y=(y0,y1, . . . , ym−1), said multiplicative shares being represented in the storage as multiple additive shares Yi=(yi,o,yi,1, . . . , yi,n−), with 0≤i<m.
To obtain a multiplication result z=xy, represented as above, the multiplying unit 142 is configured to
Returning to
Conditional assignment unit 144 takes input two variables, say, a first variable x and a second variable y, and a binary condition d. One may take for d a regular binary variable having the values 0 or 1. The operations which depend on d could be integrated with the computer program code which computed the condition d. Furthermore, the variable d could be obfuscated, e.g., encoded. Conditional assignment unit 144 either assigns the first variable to its output, e.g., to a third variable z, or assigns the second variable to its output, depending on the condition d. In an embodiment, the conditional assignment unit 144 uses two selection numbers to do this, e.g., R and R′. The selection numbers may be constants and may be stored in storage 120, or may be hardcoded in the unit 144, etc. The selection numbers are represented as a set of additive shares. A representation using multiplicative shares as well as additive shares is possible for the selection numbers, but it is not needed. However, using multiple multiplicative shares for the selection numbers may require an addition circuit, which is preferably avoided. The conditional assignment unit is configured to
Which selection numbers are used, depends on the conditional d. For example, if storage 120 stores two selection numbers R1 and R2, then the unit 144 may set R=R1, R′=R2 if d is true, e.g., d=1, and R=R2, R′=R1 if d is false, e.g., d=0. In an embodiment, the selection numbers are additive representations of either 0 or 1. For example, one of the selection numbers may be an additive representation of 0 or 1, while the other selection number represents the other of 0 and 1.
It is not needed that each iteration uses the same selection numbers, different numbers may be used instead. In particular, different representations that represent the same value, for example, the value 0 may be used in each iteration, but represented each time, or some time as with a different representation. It is also not needed that the computations are performed in the order indicated herein. Both the multiplication and assignment are highly parallel. This may be exploited in embodiments by permuting the order of the calculations, e.g., depending on a random variable, as a further obfuscation step. Randomization helps against averaging attacks. Note that the selection numbers could be generated randomly before an operation, before an assignment, or even before each or some iterations.
Both multiplication and assignment may be further obfuscated by adding dummy operations. For example, a suitable dummy operation is to do a multiplication with a representation of the number 1. For example, a suitable dummy operation is to add an additive representation of the number 0, to one or more additive share sets. Including one or more dummy operations will randomize the timing between iterations, and/or between different calls of the operation. Especially, if the dummy operations are inserted in dependence on a random process, e.g., a call to a random number generator. Randomized timing thwarts averaging attacks, moreover if the dummy operations are alike to regular operations as above, re-syncing by the attacker is hard.
Note that the obfuscated multiplication, e.g., using multiplication unit 142 and the obfuscated assignment, e.g., using assignment unit 144, can operate on numbers in the same representation. No re-encoding to move from the unit 142 to 144 or vice versa is needed. This means that these operations can be stringed together with ease. For example, in an embodiment, an operation, e.g., a cryptographic operation comprises a conditional assignment followed by a multiplication with the conditional assignment result. For example, in an embodiment, an operation, e.g., a cryptographic operation comprises conditional assignment followed by a multiplication with the assignment result. In an embodiment, an exponentiation comprises repeated obfuscated multiplications.
An even better obfuscated exponentiation can be obtained by also using obfuscated assignments. For example, exponentiation operations can be effectively performed using multiplication and conditional assignments. For example, the exponentiation may comprise repeated multiplications, wherein the multiplications depend on bits in an exponent. A conditional assignment is executed in dependency on the bits in the exponent followed by a multiplication. For example, to effectively perform the exponentiation, a so-called Montgomery ladder may be used.
The Montgomery ladder may be implemented according to:
wherein h represents the base of the exponentiation, and the bits di represent bits of the exponent, the conditinoal assignments (I) and the multiplications (II) being obfuscated as in an embodiment. Note that an exponentiation algorithm that uses an obfuscated assignment, and includes an equal number of multiplications in each iteration has increased resistance against side channel attacks, since it is hard to determine the exponent from either the assignment, nor from the multiplications. The above Montgomery ladder has this property since both the assignment and the multiplications are protected by shares. Furthermore, the multiplications are independent from the secret exponent bits.
If the number of multiplicative shares, and/or the number of additive shares is large then the exponentiation, which is already an expensive operation can become quite expensive indeed. This can be avoided by performing a protected exponentiation with a smaller protected exponent, and a regular exponentiation with an unprotected exponent. The protected exponent, and the unprotected exponent can be chosen such that their combined result equals exponentiation with the intended exponent, in case of RSA, with the secret exponent, e.g., the secret key. For example, in an embodiment, the exponentiation unit 150 may be configured to perform an exponentiation by obtaining a first exponent and a second exponent, a first exponent having fewer bits than the second exponent, said subsequent exponentiation by the first and second exponent being equal to exponentiation by the exponent, wherein the exponentiation with the first exponent comprises obfuscated multiplication and/or conditional assignments. For example, the first and second exponents may be chosen during key generation, and stored in device 100, e.g., in storage 120.
In the various embodiments of device 100, the communication interface may be selected from various alternatives. For example, the interface may be a network interface to a local or wide area network, e.g., the Internet, a storage interface to an internal or external data storage, a keyboard, an application interface (API), etc.
The device 100 may have a user interface, which may include well-known elements such as one or more buttons, a keyboard, display, touch screen, etc. The user interface may be arranged for accommodating user interaction for performing a computation, e.g., a cryptographic operation, e.g., a decrypt or sign operation.
Storage 120 may be implemented as an electronic memory, say a flash memory, or magnetic memory, say hard disk or the like. Storage 120 may comprise multiple discrete memories together making up storage 110. Storage 120 may also be a temporary memory, say a RAM. In the case of a temporary storage 120, storage 120 contains some means to obtain data before use, say by obtaining them over an optional network connection (not shown).
Typically, the device 100 comprises a microprocessor (not separately shown) which executes appropriate software stored at the device 100; for example, that software may have been downloaded and/or stored in a corresponding memory, e.g., a volatile memory such as RAM or a non-volatile memory such as Flash (not separately shown). Alternatively, the device 100 may, in whole or in part, be implemented in programmable logic, e.g., as field-programmable gate array (FPGA). Device 100 may be implemented, in whole or in part, as a so-called application-specific integrated circuit (ASIC), i.e. an integrated circuit (IC) customized for their particular use. For example, the circuits may be implemented in CMOS, e.g., using a hardware description language such as Verilog, VHDL etc.
In an embodiment, the computation device comprises one, or more, or all of a convolution circuit, an addition circuit, a multiplication circuit, a conditional assignment circuit, an exponentiation circuit, a communication interface circuit. The circuits implement the corresponding units described herein. The circuits may be a processor circuit and storage circuit, the processor circuit executing instructions represented electronically in the storage circuits.
A processor circuit may be implemented in a distributed fashion, e.g., as multiple sub-processor circuits. A storage may be distributed over multiple distributed sub-storages. Part or all of the memory may be an electronic memory, magnetic memory, etc. For example, the storage may have volatile and a non-volatile part. Part of the storage may be read-only.
Thus, a device and method are disclosed to perform modular exponentiation with a secret exponent with increased resistance against leaking the exponent through a side channel. Embodiments combine, e.g., a Montgomery ladder, multiplicative sharing and additive sharing. Applications include the RSA algorithm, and the DiffieHellman algorithm in Z*p. In the RSA application, the secret exponent typically is large, e.g., 2048 of 4096 bits. An optimization is shown to reduce the size of the exponent used which greatly speeds up the calculations. Below these and further embodiments are discussed, in a more mathematical language.
If a number x=Σi=0n−1xi, then the n numbers x0, x1, K, xn−1 are an additive n-share representation of x. Notation: x=A(x0, x1, K, xn−1). If a number x=πi=0m−1xi, then the m numbers x0, x1, K, xm−1 are a multiplicative m-share representation of x. Notation:
x=M(x0, x1, K, xm−1).
If x=A(x0, K, xn−1) and y=A(y0, y1, K, yn−1), then the per-share addition gives an additive n-share representation of the sum x+y:
A(x0+y0, x1+y1, K, xn−1+yn−1)=x+y. If x=M(x0, K, xm−1) and y=M(y0, y1, K, ym−1), then the per-share multiplication gives a multiplicative m-share representation of the product xy: M(x0y0, x1y1, K, xn−1, yn−1)=xy.
Let X=(x0, K, xn−1) and Y=(y0, K, yn−1) be two n-tuples of numbers, then the convolution of X and Y, denoted X*Y, is defined as the n-tuple Z=(z0, K, zn−1) where
for 0≤i≤n−1. Example for n=3: if X =(x0, x1, x2) and Y=(y0, y1, y2), then X*Y=(x0y0+x1y2+x2y1, x0y1+x1y0+x2y2, x0y2+x1y1+x2y0).
It holds for all X and Y that A(X*Y)=A(X)A(Y). In other words: the convolution of two additive share representations results in an additive share representation of the product. Let x=M(x0, K, xm−1) and y=M(y0, K, ym−1).
It would be advantageous if there existed a linear operation which plays an analogous role for multiplicative share representations: such a hypothetical operation of two multiplicative share representations would result in a multiplicative share representation of the sum, i.e., for all X and Y, the operation, denoted X⋄Y , would satisfy M(X⋄Y)=M(X)+M(Y). It turns out, that is can be proven through mathematical argument, that such a hypothetical operation cannot exist in general. Nevertheless, even though it is not possible to create multiplicative shares of an arbitrary weighted sum (1−α)x+αy from the multiplicative shares of x and y by taking linear combinations, it is possible when α=0 or α=1: zi=(1−α)xi+αyi for 0≤i≤m−1 satisfies
M(z0, K, zm−1)=(1−α)M(x0, K, xm−1)+αM(y0, K, ym−1)when α ∈{0,1}.
Note that each share of x and y is multiplied by either zero or one. This can be camouflaged, e.g., in the following way.
O
1=(oi,0, K, oi,n−1) with A(oi,0, K, oi,n−1)=0for0≤i≤K−1.
J
i=(ji,0, K, ji,n−1) with A(ji,0, K, ji,n−1)=1for0≤i≤K−1.
x
i=A(xi,0, K, xi,n−1).
y
i=A(yi,0, K, yi,n−1).
Z
i=Jk
where ki and li are arbitrarily chosen integers satisfying 0≤ki, li≤K−1.
If α=1, Zi is calculated as
Z
i=Ok
The two cases can be summarized in the single equation
Z
i=R(1−di)k
where R(0) stands for O and R(1) stands for J.
In RSA and in the Diffie-Hellman protocol it is often required to perform a modular exponentiation of a ‘public’ number h and a ‘secret’ exponent d modulo a public modulus N:
s=h
d mod N.
For example, In RSA signing, we may have: h is the padded message digest, d is the private key and s is the resulting signature. Let λ denote the bit-length of N (and d), and write the binary expansion of d as d=Σi=0λ−1di2i, with di ∈{0,1}. A simple algorithm for calculating s is the following:
This algorithm is side-channel sensitive, because the pattern of squarings and multiplications reveals the exponent d. This holds also when the if-statement is unrolled for a given d.
The Montgomery ladder calculates s in the following way:
The Montgomery ladder offers some protection against side-channel attacks, because in each step both a squaring and a multiplication occur. However, any side channel that allows the attacker to observe whether s or t is squared in each step, still reveals the key.
The same results are obtained with the following variant of the Montgomery ladder:
This variant of the Montgomery ladder uses multiplications and an addition, but no squarings. A side channel that allows an attacker to observe in each step whether s is multiplied by zero or by one, or, equivalently, whether u=s or u=t, leaks the key.
In order to make it even harder for an attacker to obtain the key from a side-channel attack, we the obfuscation techniques from the first section may be used to make it harder to see whether u=s or u=t, or whether something is multiplied by zero or by one.
The implementer chooses numbers m≥1 (number of multiplicative shares representing a Montgomery ladder variable) and n≥2 (number of additive shares of a multiplicative share). A Montgomery ladder variable is thus represented by mn shares. Preferably both m and n are large.
The implementer chooses a set of numbers {Aμ,v}μ=0,v=0m−1,n−1 and {Bμ,v}μ=0,v=0m−1,n−1 such that
to initialize the ladder. The ladder works with mn numbers Sμ,v, Tμ,v and Uμ,nu, the n shares (Sμ,0, K, Sμ,n−1) are denoted Sμ, and similarly for the T and U shares. The numbers ki,μ and li,μ, with 0≤i≤λ−1 and 0≤μ≤m−1 may be arbitrarily chosen integers from [0,K).
Protected Montgomery ladder with multiplicative and additive shares
In an embodiment, the dummy transformations may be performed on the shares that leave the underlying value invariant, e.g., he may choose to apply a random permutation of the n additive shares, or a random permutation of the m multiplicative shares (represented by additive share sets). Preferably, these transformations are chosen during run time, not at compile time. For example, they may be chosen dependent upon the input h. For example, the input h may be used to seed a random number generator.
In some applications, e.g., signing it is primarily the exponent in an exponentiation which is secret, but neither the input nor the output of the algorithms is confidential. In such a case, intermediate results may still be sensitive since they rely on only part of the key, and may thus allow brute forcing of part of the key.
To mask an input variable h as multiple shares one may first compute multiplicative shares, and then represent each of the multiplicative shares as additive shares. For example, in an embodiment, the variable h is used as a seed to a random number generator which in turn generates all multiplicative or additives shares but one, after which the final shares are computed. Alternatively, a set of random multiplicative functions ƒi or additive functions gi may be selected at compile time. The functions satisfy that πƒi(x)=x and Σgi(x)=x, in both cases modulo the public modulus. Thus the product or sum of these functions is independent of the input, at least modulo the modulus. The number of functions is equal to the number of multiplicative shares or additive shares per multiplicative share. If there is only one multiplicative share, the multiplicative functions may be omitted. To represent any variable as shares, e.g., the variable h, one may first compute (ƒ0(h), ƒ1(h), . . . , ƒm−1(h)). To represent a multiplicative share s, one may compute (y0(s), . . . , yn−1(s)).
One may also use this approach in part. For example, the multiplicative shares may be computed on the fly at runtime, e.g., using a random number generator, computing the final share using an inverse algorithm. Each of the multiplicative shares is then mapped to a set of multiple additive shares using a set of additive functions. Note that the additive functions may be different for some or each of the set of additive shares. For example, at compile time all the additive functions but one may be chosen as random polynomials, after which the final polynomial can be computed.
For example, to generate shares one may do the following. Suppose we wish to represent ƒ(l) on input I, then let ri(I) be functions providing random shares on input I for i=1 . . . n−1 and let rn(I) be defined as ƒ(I)−r1(I) . . . −rn−1(I). Now xi=ri(I) gives an additive n-sharing of ƒ(I). Similarly let Ri(I) be functions providing random shares on input I for i=1 . . . n−1 and let Rn(I) be defined as ƒ(I)/(R0(I)* . . . *Rn−1(I)), then Xi=Ri(I) gives an n-multiplicative sharing of f(I). The functions r and R may be pseudo random functions.
The Montgomery ladder with mn multiplicative and additive shares takes approximately mn times as many operations as the Montgomery ladder without shares. This may be a high price to pay for the extra side-channel security. The following method may reduce the number of operations significantly, without compromising on security. Consider the following:
operations. For N≈22048 this translates to roughly 112 bits of security, for N≈24096 about 150 bits.
The secret exponent d may split in a secret part and a public part. For example, choose a random t-bit number ds, the ‘secret’ part of d, and calculate dp=ds−1d modφ(N), the λ-bit ‘public’ part of d. This uses the Euler function φ(N) of the modulus N, e.g., obtained from a factorization of N. Here, φ is the Euler function
The exponentiation is done in two steps:
s=h
d mod N=hd
The inner modular exponentiation with exponent ds is done using mn multiplicative and additive shares using the protected Montgomery ladder above, the outer modular exponentiation with exponent dp is done using a less secure Montgomery ladder. The exponent dp is likely to leak through a side channel, but that does not matter as long as ds does not leak. Note that since
(hd
the ‘inner’ and ‘outer’ modular exponentiations can be swapped.
The total workload for this method is λ+tmn, compared to λmn for the protected method above: a considerable improvement when mn>1 since t is much smaller than λ. For example, in an embodiment, t is less than λ. For example, 10 t is less than λ. For example, t may be less than 256, while λ is larger than 2048.
Splitting the secret exponent into a secret part and a public part can be done at the same time key generation is done. These parts of the exponent can be stored in a storage, e.g., a memory of the computation device.
The multiplying comprises
Many different ways of executing the method are possible, as will be apparent to a person skilled in the art. For example, the order of the steps can be varied or some steps may be executed in parallel. Moreover, in between steps other method steps may be inserted. The inserted steps may represent refinements of the method such as described herein, or may be unrelated to the method. For example, operations 430 and 440 may be executed, at least partially, in parallel. Moreover, a given step may not have finished completely before a next step is started.
A method according to the invention may be executed using software, which comprises instructions for causing a processor system to perform method 400. Software may only include those steps taken by a particular sub-entity of the system. The software may be stored in a suitable storage medium, such as a hard disk, a floppy, a memory, an optical disc, etc. The software may be sent as a signal along a wire, or wireless, or using a data network, e.g., the Internet. The software may be made available for download and/or for remote usage on a server. A method according to the invention may be executed using a bitstream arranged to configure programmable logic, e.g., a field-programmable gate array (FPGA), to perform the method.
It will be appreciated that the invention also extends to computer programs, particularly computer programs on or in a carrier, adapted for putting the invention into practice. The program may be in the form of source code, object code, a code intermediate source, and object code such as partially compiled form, or in any other form suitable for use in the implementation of the method according to the invention. An embodiment relating to a computer program product comprises computer executable instructions corresponding to each of the processing steps of at least one of the methods set forth. These instructions may be subdivided into subroutines and/or be stored in one or more files that may be linked statically or dynamically. Another embodiment relating to a computer program product comprises computer executable instructions corresponding to each of the means of at least one of the systems and/or products set forth.
For example, in an embodiment, the computation device may comprise a processor circuit and a memory circuit, the processor being arranged to execute software stored in the memory circuit. For example, the processor circuit may be an Intel Core i7 processor, ARM Cortex-R8, etc. In an embodiment, the processor circuit may be ARM Cortex M0. The memory circuit may be an ROM circuit, or a non-volatile memory, e.g., a flash memory. The memory circuit may be a volatile memory, e.g., an SRAM memory. In the latter case, the device may comprise a non-volatile software interface, e.g., a hard drive, a network interface, etc., arranged for providing the software.
It should be noted that the above-mentioned embodiments illustrate rather than limit the invention, and that those skilled in the art will be able to design many alternative embodiments.
In the claims, any reference signs placed between parentheses shall not be construed as limiting the claim. Use of the verb ‘comprise’ and its conjugations does not exclude the presence of elements or steps other than those stated in a claim. The article ‘a’ or ‘an’ preceding an element does not exclude the presence of a plurality of such elements. The invention may be implemented by means of hardware comprising several distinct elements, and by means of a suitably programmed computer. In the device claim enumerating several means, several of these means may be embodied by one and the same item of hardware. The mere fact that certain measures are recited in mutually different dependent claims does not indicate that a combination of these measures cannot be used to advantage.
In the claims references in parentheses refer to reference signs in drawings of exemplifying embodiments or to formulas of embodiments, thus increasing the intelligibility of the claim. These references shall not be construed as limiting the claim.
Number | Date | Country | Kind |
---|---|---|---|
17201106.6 | Nov 2017 | EP | regional |
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/EP2018/079537 | 10/29/2018 | WO | 00 |