The present invention relates in general to the technical field of impeding crypto analysis, in particular of protecting at least one data processing device against at least one attack, for example against at least one E[lectro]M[agnetic] radiation attack, or against at least one analysis, for example against at least one D[ifferential]P[ower]A[nalysis].
More specifically, the present invention relates to an arrangement for and a method of protecting at least one data processing device, in particular at least one embedded system, for example at least one chip card or smart card, against at least one attack, in particular against at least one side-channel attack, for example against at least one current trace analysis, the data processing device, in particular at least one integrated circuit of the data processing device, carrying out calculations, in particular cryptographic operations.
Data processing devices, in particular embedded systems, such as chip cards or smart cards, use P[ublic]K[ey]I[nfrastructure] systems for exchanging keys and have to be protected against several forms of attacks targeted on finding out the private key. One such attack is to influence the calculation, in particular the cryptographic operation, by directing
For calculations based on the R[ivest-]S[hamir-]A[dleman] algorithm and/or on the E[lliptic]C[urve]C[ryptography] algorithm, a lot of multiplications are required. Normally, these calculations are performed without protection against side-channel attacks, as for instance current trace analysis.
This might be vulnerable to a D[ifferential]P[ower]A[nalysis] attack because an attacker might take a lot of current traces each time the same multiplication is performed. After adding these traces, most of the noise is removed. When the attacker does the same but for different inputs, the attacker can compare the current traces and learn the secret key bitwise, i.e. bit for bit.
Prior art document WO 01/97009 A1 discloses a method for cryptographic calculation comprising a modular exponentiation routine. This known method works with two random variables to blind intermediate results; in this context, prior art document WO 01/97009 A1 works also with an addition of a random variable but only the multiplication operation is blinded.
However, before the result is used for the next calculation, this result is first unblinded which makes the result again vulnarable; not only the multiplication is sensitive to D[ifferential]P[ower]A[nalysis] but also the access of the R[andom]A[ccess]M[emory] of the unblinded results.
Prior art article “On Boolean and Arithmetic Masking against Differential Power Analysis” by Jean-Sébastien Coron and Louis Goubin discusses the D[ifferential]P[ower]A[nalysis] attack and suggests in the fourth and fifth paragraph of page 2 to mask all inputs and outputs. The fifth paragraph discusses masking of R[ivest-]S[hamir-]A[dleman] by multiplication, wherein reference is made to Thomas S. Messerges, “Securing the AES Finalists Against Power Analysis Attacks”, FSE 2000, Springer-Verlag.
Prior art thesis “Modeling and applications of current dynamics in a complex processor core” by Radu Muresan mentions on pages 33 to 37 the blinding of the point on the elliptic curve before applying E[lliptic]C[urve]C[ryptography].
Regarding the technical background of the present invention, additional reference can be made to
Starting from the disadvantages and shortcomings as described above and taking the prior art as discussed into account, an object of the present invention is to further develop an arrangement as described in the technical field as well as a method of the kind as described in the technical field in order to be capable of securely averting an attack, for example an E[lectro]M[agnetic] radiation attack, or an analysis, for example a D[ifferential]P[ower]A[nalysis], such attack or such analysis in particular targeted on finding out a private key.
The object of the present invention is achieved by an arrangement comprising the features of claim 1 as well as by a method comprising the features of claim 8. Advantageous embodiments and expedient improvements of the present invention are disclosed in the respective dependent claims.
The present invention is principally based on the idea to use an arrangement for as well as a method of blinding intermediate results for providing invulnerability, in particular D[ifferential]P[ower]A[nalysis] invulnerability; in particular, such blinding is employed in multiplications, for example by addition, comprised by the calculations, in particular by the cryptographic operations, by employing at least one random variable, wherein the calculation of the inversion of any operand is not required.
More specifically, a message M can be blinded with a variable V. This variable V can be derived from a randomly chosen variable v. In this way, all intermediate results are also blinded; these intermediate results remain blinded until the end of the calculations, in particular until the end of the cryptographic operations.
According to an expedient embodiment of the present invention, all intermediate results are blinded by a random variable which is kept constant during a complete R[ivest-]S[hamir-]A[dleman] calculation or a complete E[lliptic]C[urve]C[ryptography] calculation but which is changed when a new calculation is started. By this, all current traces are changed, even when all inputs are the same because the random variable is not the same.
In a preferred embodiment of the present invention, the principle of Montgomery reduction is used. The Montgomery reduction is an efficient algorithm for multiplication in modular arithmetic introduced in 1985 by Peter L. Montgomery. More concretely, the Montgomery reduction is a method for computing c=a·b mod(n) where a, b, and n are k-bit binary numbers.
The Montgomery reduction is now applied particularly in cryptography. Let m be a positive integer, and let R and T be integers such that R>m, g[reatest]c[ommon]d[ivisor](m,R)=1, and 0≦T<m·R. To calculate TR−1 mod(m) without using classical method is called the Montgomery reduction of T modulo m with respect to R. With suitable choice of R, the Montgomery reduction can be efficiently computed.
Advantageously, the present invention is not restricted to the Montgomery reduction but the present invention can also be adapted to other reduction principles.
The present invention does not require the ability to calculate the inversion of an operand, which might be favourable for R[ivest-]S[hamir-]A[dleman] applications.
The present invention further relates to a data processing device, in particular to an embedded system, for example to a chip card or to a smart card, comprising at least one integrated circuit carrying out calculations, in particular cryptographic operations, wherein the integrated circuit is protected
by blinding all intermediate results of the calculations by at least one random variable, without inverting any operand of the calculations.
The present invention finally relates to the use of at least one arrangement as described above and/or of the method as described above in at least one data processing device as described above to be protected against D[ifferential]P[ower]A[nalysis].
As already discussed above, there are several options to embody as well as to improve the teaching of the present invention in an advantageous manner. To this aim, reference is made to the claims respectively dependent on claim 1 and on claim 8; further improvements, features and advantages of the present invention are explained below in more detail with reference to a preferred embodiment by way of example and to the accompanying drawings where
The same reference numerals are used for corresponding parts in
The embodiment of a data processing device, namely an embedded system in the form of a chip card or of a smart card comprising an I[ntegrated]C[ircuit] carrying out cryptographic operations refers to a P[ublic]K[ey]I[nfrastructure] system and works according to the method of the present invention, i. e. is protected by a protection arrangement 100 (cf.
The present invention does not require the ability to calculate the inversion of an operand.
The cryptographic calculations of the integrated circuit can be based on the R[ivest-]S[hamir-]A[dleman] algorithm (cf. prior art document U.S. Pat. No. 4,405,829 or prior art article “A Method for Obtaining Digital Signatures and Public-Key Cryptosystems” by Ron Rivest, Adi Shamir, and Len Adleman in Communications of the ACM, 21 (2), pages 120 to 126, February 1978) calculating for encryption C=Me mod(N) wherein
the decryption calculates M=Cd mod(N).
One of the ways to calculate Me (or Cd) is the following:
first step: starting with R=1;
second step: scanning the exponent e from left to right:
third step: always calculating R=R2 mod(N);
fourth step: when the scanned bit of e=1, moreover R=R.M mod(N) is calculated.
Thus, the calculation comprises a number of squarings and multiplications.
It is assumed that the modulus N comprises a number of words m of n bits,
i.e. N=+nm−1Bm−1+nm−2Bm−2 . . . +n1B+n0 with B=2n.
After the modular reduction, the variables comprise also of m words of n bits, although the M[ost]S[ignificant]W[ord] might have a few bits more. Before the modular reduction, the result will have more words, usually one.
As will be shown in more detail below, the present invention initially blinds M with a randomly chosen variable v of one word. This randomly chosen variable v is subtracted from every word of M mod(N). With V=(Bm−1+Bm−2+ . . . +B+1)v, M can be calculated as M=M−V mod(N); in this context, the underlining indicates that the variable is blinded. Then the multiplication and the squarings are modified such that the result R is also blinded in the same way with V; so all intermediate results are also blinded. Then at the very last end, when the exponentiation is ready, the result is unblinded.
In more detail, in the first stage of initial blinding let v be a randomly chosen variable of n bits. An additional condition can be v<nm−1 in order to facilitate the reduction but when nm−1 has a number of leading zeroes, this might jeopardize the blinding because v would always receive at least the same number of leading zeroes.
Then, the randomly chosen variable v is subtracted from every word of M. If the result is negative, N or 2N is added; however, it is expedient to know beforehand whether the result is negative or not.
For this, first Mm−1−v−1+nm−1 is calculated:
The subtraction of v is done by using its 2's complement,
i. e. −V=−Bm+(B−v−1)Bm−1+ . . . +(B−v−1)B+(B−v−1)+1. So all positive numbers except for Bm are added. The term −Bm is not used but when the addition of (B−v−1)Bm−1+ . . . +(B−v−1)B+(B−v−1)+1 to another variable gives a carry bit, the term −Bm is annihilated.
The mathematical implementation of the above-described calculations is as follows:
In the fourth step of R[ivest-]S[hamir-]A[dleman] calculation without protection, i. e. of multiplication, the following calculations are performed:
R=X*Y mod(N)
X=x
m−1
B
m−1
+x
m−2
B
m−2
+ . . . +x
1
B+x
0
Y=y
m−1
B
m−1
+y
m−2
B
m−2
+ . . . +y
1
B+y
0
B=2n,
wherein m is the number of words, for instance m=16, and n is the number of bits of a word, for instance n=64.
In a substep of the multiplication, X·Yj+R is calculated, and then a Montgomery reduction Mr is performed. This is done as follows:
C=0;
for i=0 to m−1. {(B·C+Ri)=XiYi+Ri+C}
Rm=C.
In the case of protection, it is assumed that all operands are blinded with V,
i.e. V=(Bm−1+ . . . +B+1)v
Then, X=X−V mod(N), Y=Y−V mod(N), and R=R−V mod(N) are calculated.
First, Yj, is unblinded:
v is added to every word Yj; when it gives a carry, it is added to the next higher word Yj+1:
B·c+Y
j
=Y
j
+v+c.
Now, R′=X·Yj+R+V·Yj−Bm·v=(X−V)Yj+R−V+V·Yj−Bm·v=X·Yj+R−V−Bm·v=R′−V is calculated. The term −Bm·v is to blind the M[ost]S[ignificant]W[ord] (index m) of the product X·Yj. So the new result R′ is also blinded by V. Therefore, V·Yj−Bm·v has to be added to the multiplication X·Yj+R; it can be written v·Yj=BWH+WL.
This results in the following algorithm:
C=0;
B·c+Y
j
=Y
j
+v+c;
B·W
H
+W
l
=v·Y
j;
for i=0 to m−1: {(B·C+Ri)=XiYj+B·WH+WL+Ri+C}
R
m
=C−v.
For j=0, R=X·Y has to be calculated without the addition of R which performed a part of the blinding; in that case, X·0−V is calculated instead:
C=0;
B·c+Y
0
=Y
0
+v+c;
B·W
H
+W
L
=v·Y
0;
for i=0 to m−1: {(B·C+Ri)=Xi·Y0+B·WH+WL−v+C}
R
m
=C−v.
As to the substep of additional reduction, the Montgomery reduction Mr reduces by one word which might be insufficient. During the multiplication, R′=X·Yj+R+V·Yj−Bm·v is calculated. In this context, it should be noted that for Yj=B−1, it is V·Yj−Bm·v=−v. With 0≦X<Bm, 0≦Yj<B, 0≦R<Bm, 0≦v<B, it follows that −Bm+1<R′<Bm+1. The intermediate result might be negative.
The total result of the multiplication and reduction is R″=(N·Q+X·Yj+R+V·Yj−Bm·v)/B. When it is assumed that Yj has its maximum value B−1, then (V·Yj−Bm·v)/B=−v/B>−1, so those terms can be ignored.
In that case, it can be proven that when R<N+X, then also R″<N+X. So R and therefore R″ is at most one bit larger but it does not accumulate during a number of calculations. Only at the very last end, i. e. when Ym−1 is used, then an additional reduction by subtracting N at most twice might have to be performed.
At the other end, when Q=0 and Yj−1=0, it can be proven that when R>−v·Bm−1, then also R″>−v·Bm−1; so R″ might become negative but it will not accumulate during a number of additional reductions; so the result is left negative. Only, at the very last end, i. e. when Ym−1 is used, then an additional reduction by adding N at most twice might have to be performed.
The above-described calculations additionally imply that
In the third step of R[ivest-]S[hamir-]A[dleman] calculations without protection, i. e. of squaring,
It is XHj=B31 1Xm−1+ . . . +Bj+1Xj+1, i.e. all terms of X starting with Xj+1;
RHj=Bm−1Rm−1+ . . . +Bj+1Rj+1i. e. all terms of R starting with Rj+1;
however, RH0=0.
In general, it is calculated Xj2+Rj and 2XHj·Xj+RHj+C; then, a Montgomery reduction (=reference numeral Mr in
In the third step of R[ivest-]S[hamir-]A[dleman] calculations without protection, i. e. of squaring, it is assumed that all operands are blinded with V, i. e. V=(Bm−1+ . . . +B+1)v.
After calculating X=X−V mod(N) and R=R−V mod(N), first Xj is unblinded; v is added to every word Xj; when it gives a carry, it is added to the next higher word Xj+1: B⇄c+Xj=Xj+v+c.
Now, for the squaring, the following is calculated:
B·C+R
j
=X
j
·X
j
+R
j
+v·X
j=(Xj−v)·Xj+Rj+v·Xj=Xj2+Rj.
The addition of the blinded Rj blinds the term Rj′ again.
For the double products,
R
Hj′=2XHj·Xj+RHj+C+2VHj·Xj−Bm·v=2XHj·Xj+RHj+C−Bm·v,
wherein VHj=(Bm−1+ . . . +Bj+1)v:
The term RHj blinds all terms with index ranging from j+1 to m−1;
the term −Bm·v blinds the the M[ost]S[ignificant]W[ord] of the result (Rm);
all terms R with index ranging from 0 to j−1 are unchanged and therefore blinded.
So, the new result R′ is also blinded by V.
Therefore, v·Xj has to be added to the squaring, and 2VHj·Xj−Bm·v has to be added to the double products.
For j=0, R=X2 has to be calculated without the addition of R which performed a part of the blinding; in that case, X·X0−V is calculated instead.
This gives the following algorithm:
R
m = C − v;
R
m = C − v;
R = Montgomery(R);
In the substep of additional reduction, the Montgomery reduction (=reference numeral Mr in
During the multiplication, it is calculated:
R=(Xj·Xj+Rj+v·Xj)Bj+2XHj·Xj+RHj+2VHj·Xj−Bm·v with
R<2XHj−1·Xj+RHj−1+2VHj−1·Xj−Bm·v.
With Xj<B, XHj−1<Bm−Bj; VHj−1<Bm−Bj; v<B, it follows that R′<3Bm+1. With all variables being zero, except for v, then R′>−Bm+1. The intermediate result might be negative or when positive overflow by two bits.
The total result of the multiplication and reduction is
R″
<(N·Q+2XHj−1·Xj+RHj−1+2VHj−1·Xj−Bmv)/B.
It can be proven that when R<N+2XHj−1+2VHj−1−Bm, then also R″<N+2XHj−1+2VHj−1−Bm<3Bm+N<4Bm.
So R and therefore R″ is at most two bit larger but it does not accumulate during a number of calculations. So, it can be left for all reductions, except for the last one.
The last reduction, however, ends with only a squaring and no double products. This gives the same result as multiplication (see above substep of additional reduction during multiplication without protection).
The above-described calculations additionally imply that also 2(B·WH+WL) has to be added, beside B·WH+WL, during the multiplication; this implies that the additional adder for B·WH+WL has a multiplexer at the input for shifting the input.
For E[lliptic]C[urve]C[ryptography] (cf. prior art article “A Reconfigurable System on Chip Implementation for Elliptic Curve Cryptography over GF(2n)” by M. Ernst, M. Jung, F. Madlener, et al., pages 381 to 399), an elliptic curve and a point P on that curve are chosen. At a first instance A, a random number a is chosen; a·P is calculated and sent as public key to a second instance B. At this second instance B, also a random number b is chosen; b·P is calculated and sent as public key to the first instance A. Then the first instance A calculates K=a·(b·P) and the second instance B calculates K′=b·(a·P). Now K=K′ and this is the common secret of the two instances A and B.
The basic operation is the multiplication of a point P by a scalar a. This is a repeated point addition X=aP=P+P+ . . . +P (a times). It is started with point P, and the scalar a is scanned from left to right:
The algorithm for the so-called point doubling and the algorithm for the so-called point addition use operations as X·Y mod(N) and X2 mod(N) (like the R[ivest-]S[hamir-]A[dleman] algorithm) but also by operations as R=X+Y mod(N) and R=X−Y mod(N).
The point doubling algorithm and the point addition algorithm require also an inversion operation calculating X−1 with X·X−1 mod(N)=1.
The blinding is not suited for inversion, so the operand has first to be unblinded, then inverted and then blinded again; this is not such a problem because most algorithms work with projective coordinates having only one inversion, and this is postponed to the end. There are other known ways to blind the inversion operation.
The number of words for E[lliptic]C[urve]C[ryptography] is much smaller than the number of words for R[ivest-]S[hamir-]A[dleman]. Therefore, first the complete multiplication with addition/subtraction is performed before the reduction. Like RSA, it is also possible to interleave the multiplication and the reduction.
Here, the Montgomery reduction is used but the blinding can also be designed for other types of reduction.
In the first stage of initial blinding, this initial blinding is performed in the same way as described above for the R[ivest-]S[hamir-]A[dleman] algorithm but now both coordinates of point P have to be blinded. All operations give a result which is blinded in the same way.
In the second stage of multiplication (X·Y mod(N)) and squaring (X2 mod(N)), the blinding of these operations is performed in the same way as described above for the R[ivest-]S[hamir-]A[dleman] algorithm.
In the last step of additional reduction (R=X±Y mod(N)),
The implementation of the present invention may be at least partly on software basis; in this context, processors being suited for R[ivest-]S[hamir-]A[dleman] programming and/or for E[lliptic]C[urve]C[ryptography] programming can also implement the additional reduction algorithm as described above.
An exemplary hardware implementation of the protecting arrangement 100 according to the present invention is shown in
B·c+r=x·y+B·u−k·B·x+z+c for k=−2, . . . , 3;
r is the L[east]S[ignificant]W[ord] and c is the M[ost]S[ignificant]W[ord] of the result for example for n2·g+B·r5−B·k·n2+c2+c or for x·y±B·z+r+c for the multiplication of X·Y±Z.
The multiplier 10
As can further be taken from
The building of the multiplier 10 with look-up tables is advantageous, for instance two bit because then the multiples of x (especially 3×) are already available.
Independently thereof or in connection therewith, it is also advantageous to perform the summation with carry-save adders and to use only for the r-register 14r a full-adder 16r. In this case, c then comprises two words, namely the carry and the sum of the carry-save adders. The c-registers 14c are then doubled, and also the input c comprises two words.
The ranger 18 has to decide whether the result B·c+r is smaller than f·2P/Bm−1 where f has the following values: 0, ¾ or 1 (for Re) and −⅞, −¾, −½, 0, ¾, ⅞, 1, 3/2, 7/4 and 2. Whether the result B·c+r is positive or negative can be found by looking at the sign bit. For a value of ⅞ for instance, then the four bits from position p-2m−1 and below are 0111. If p=Bm, then these four bits are the four M[ost]S[ignificant]B[it]s of c.
The multiplier 10 is connected (=reference numerals 12a, 12b in
Furthermore, there is a state machine 30
Number | Date | Country | Kind |
---|---|---|---|
05105806.3 | Jun 2005 | EP | regional |
Filing Document | Filing Date | Country | Kind | 371c Date |
---|---|---|---|---|
PCT/IB06/52055 | 6/23/2006 | WO | 00 | 12/20/2007 |