The present invention relates to methods of generating and protecting cryptographic keys, and is particularly concerned with doing so in a white-box attack environment.
Traditionally, cryptographic implementations were designed to resist only black-box attacks. These are attacks where an adversary attempts to deduce the cryptographic key inside the implementation by analyzing only inputs and outputs. In a black-box attack, it is assumed that the adversary knows what family of cryptographic algorithm they are targeting (e.g., AES, DES, RSA, etc.), but all other details (e.g. execution time, power consumption, memory accesses) are unavailable to them. In essence, a black-box attacker treats the implementation as a mathematical function. However, a white-box attacker is a much more powerful type of adversary and can analyze all parts of the implementation. Rather than just study inputs and outputs, a white-box attacker can see everything that goes on inside the implementation. For example, if the attacker is targeting cryptographic software running on, say, a PC or mobile phone, then they can execute that software inside a debugger and examine memory and register values during its execution. Conventional software implementations of cryptographic algorithms are completely insecure against white-box attackers since the cryptographic key can be observed in memory. New implementations of cryptographic algorithms that resist white-box attacks have recently been proposed (e.g., AES [1], DES [2], ECDSA [3], SHA [4]); however, there have not yet been any proposals for protecting key agreement algorithms, such as Diffie-Hellman [5].
The systems and methods disclosed herein provide techniques for implementing a key agreement scheme that resists white-box attacks.
An object of the present invention is to provide methods for protecting cryptographic keys and other sensitive parameters from a white-box attacker in implementations of key agreement schemes such as Diffie-Hellman.
Elliptic curve Diffie-Hellman (ECDH) is a variant of traditional Diffie-Hellman (DH) where computations are done in an algebraic group defined by points on an elliptic curve. ECDH is defined in the document “SEC 1: Elliptic Curve Cryptography, Version 2.0” [8] and in “Suite B Implementer's Guide to NIST SP 800-56A” [7]. Introductory information on elliptic curve cryptography and ECDH can be found in the text “Guide to Elliptic Curve Cryptography” by Hankerson, Menezes and Vanstone [9]. At high level, ECDH works as follows: two parties exchange public keys and combine them with their respective private keys to derive a shared secret. Even though their respective private keys are different, the mathematics underlying ECDH ensures that each party will derive the same shared secret. Private keys are typically just random numbers, and a corresponding public key is computed from a private key using a mathematical operation that is computationally easy to evaluate but very computationally expensive to invert.
ECDH is utilized in a number of cryptographic protocols including Transport Layer Security (TLS), which is used to secure Internet web browsing, and the Full Authentication protocol, which is part of the Digital Transmission Content Protection (DTCP) system [10]. The shared key established using ECDH can be used to build a secure channel between end-parties (i.e. using symmetric key cryptography for message confidentiality and authentication). Any attacker sitting between the end-parties that records and analyzes messages exchanged during the ECDH scheme should not be able to recover the shared key unless they carry out some computation which is widely believed to be infeasible. However, in some applications, the attacker does not sit between the end-parties; instead, the attacker is able to observe the ECDH computations taking place at one of the end-parties. This is a potential concern in DTCP.
DTCP is a digital rights management (DRM) scheme and is commonly used to enforce permissions on video content in home networks. For example, a user might wish to stream video from their set-top box to a personal device like a laptop or mobile phone. If the rights associated with the video content permit this activity, the set-top box will initiate ECDH with the personal device to derive a shared key; the shared key is used to encrypt video content as it is transmitted to the personal device where it will be played. If an attacker was able to determine the shared key, then it would be able to record the video stream and decrypt it outside the DRM system, thereby freeing it from any restrictions. In this scenario, a legitimate user may act as an attacker if they are motivated to free video content from the DRM system. Users are able to observe the computations that take place on their personal device using a debugger and other tools. For this reason, malicious users can be modeled as white-box attackers. This leads to the following question: can an ECDH shared secret be computed in a white-box environment without exposing the shared secret to an attacker?
In the present disclosure, four embodiments of ECDH that are resistant to white-box attacks are provided; these embodiments can be used to build software implementations that are able to derive cryptographic keys and establish trust in hostile execution environments. Although our description focuses on the elliptic curve variant of Diffie-Hellman, one skilled in the art will appreciate that these embodiments can easily be adapted to any Diffie-Hellman scheme; this includes, in particular, Diffie-Hellman schemes where the underlying algebraic group is a subgroup of a finite field.
The four embodiments generate and manipulate transformed random numbers. A transformed random number is an obfuscated or masked form of an original random number. In each embodiment, a transformed random number, denoted by {circumflex over (r)} or {circumflex over ({circumflex over (r)}, is used to generate an ECDH public key. The same transformed random number is then used to generate the shared secret. The resultant public key, generated using the transformed random number, will be the same as what would be generated using the original random number. This property also holds for generation of the shared secret. To accomplished this, some auxillary elliptic curve points must be pre-computed and utilized in the computations.
In the first embodiment, we generate a transformed random number {circumflex over (r)} using a linear transformation defined by two instance-specific, compile time, random numbers k1 and k2 chosen from the interval 1 to n−1 where n denotes the size of the underlying elliptic curve group. In the second embodiment, we generate a transformed random number {circumflex over (r)} using a linear transformation defined by one instance-specific, compile time, random number k1, chosen from the interval 1 to n−1. This approach is similar to the first embodiment except that only one compile time random number is used to generate the transformed random number {circumflex over (r)}. In the third embodiment, a transformed random number {circumflex over ({circumflex over (r)} is generated by applying a pseudo-Hadamard transformation [11] to the transformed random number {circumflex over (r)} that is generated in the first embodiment. In the fourth embodiment, a transformed random number {circumflex over ({circumflex over (r)} is generated applying a pseudo-Hadamard transformation to the transformed-random number {circumflex over (r)} that is generated in the second embodiment.
The present invention will be further understood from the following detailed description with reference to the drawings in which:
The following symbols are used within this disclosure to describe embodiments of the present disclosure.
Elliptic Curve Diffie-Hellman Schemes
ECDH is recommended by the NSA in their Suite B cryptography guidelines [6]. More precisely, Suite B recommends the use of Ephemeral Unified ECDH (EU-ECDH) and One-Pass ECDH (OP-ECDH). These two ECDH schemes are described in the Suite B implementer's guide [7]. In the Ephemeral Unified scheme, each party generates an ephemeral key pair each time they want to compute a new shared secret. In the One-Pass scheme, an ephemeral key pair is generated by only one party; the other party uses a static key pair, which may be reused in the computation of many shared secrets. The implementer's guide states that EU-ECDH is preferred over OP-ECDH; however, there are some scenarios where EU-ECDH cannot be used. This is the case, for example, in store-and-forward communication systems where one party is not available to contribute an ephemeral public key. In scenarios like this, OP-ECDH can be used.
In both the EU-ECDH and OP-ECDH schemes, there are two main phases of computation: key pair generation and shared secret computation. The steps required in each phase are described in detail in the implementer's guide [7]; however, the computations in those steps leak sensitive parameters to white-box attackers.
The procedure for key pair generation in [7] specifies that the key pair (private key dA and public key QA) be generated based on a random number r as follows: dA=r+1, and QA=dAG. Note that “dAG” denotes the elliptic curve point formed by adding the generator G to itself dA times. The procedure for shared secret computation specifies that the shared secret, Z, be computed as the x-coordinate of an elliptic curve point P defined as follows: P=dAQB=(r+1)QB, where dA is party A's private key and QB is party B's public key.
Straightforward implementations of key pair generation and shared secret computation on party A's device will expose the secret key dA in that device's memory. This is apparent for the computations that produce dA=r+1, QA=dAG, and P=dAQB. Thus, a white-box attacker that examines memory-dumps of party A's device will be able to deduce the private key, dA, as well as the shared secret, Z. To maintain security against white-box attackers, the computations executed for key pair generation and shared secret computation must be protected.
Key Pair Generation.
Each static and ephemeral private key d and public key Q shall be generated using the appropriate domain parameters, D, which must be selected from one of the two sets of domain parameters specified in the list of Suite B elliptic curves and domain parameters (see Appendix A of [7]). Henceforth, we assume that domain parameters have been appropriately selected.
Target:
Compute Q=dG without disclosing d inside the memory of one party's machine, where Q=dG=(r+1)G, r is the random number. Q is the public key, and may represent any one of Qe,A, Qe,B or Qs,B depending on the particular ECDH scheme (EU-ECDH or OP-ECDH). d is the private key, and, similarly, it may represent any one of de,A, de,B or ds,B.
Transformed Random Number:
In this embodiment, the transformation t of a random number r is denoted as {circumflex over (r)} and defined as {circumflex over (r)}=t(r)=k1r+k2 mod n, where k1 and k2 are two instance-specific, compile time, random numbers and k1, k2εZn* (note that Zn* denotes the set of integers in the range 1 to n−1 that are relatively prime to the domain parameter n). Thus, {circumflex over (r)}=k1r+k2 mod n and r=k1−1{circumflex over (r)}−k2k1−1 mod n. It is important to note that the output of the random bit generator (RBG) used in key pair generation is interpreted as producing transformed random numbers. This means that the transformation t(r) does not have to be explicitly applied by the implementation—the transformation is implicit in the way in which the output of the RBG is interpreted.
Precomputation:
Once the instance-specific, compile time, random numbers k1 and k2 are selected, two auxiliary elliptic curve points are pre-computed: Ĝ=k1−1G and U2=(k1−k2)Ĝ. These two points are computed during an off-line set-up phase where the implementation is not subject to white-box attacks (e.g. at compilation time).
The steps of key pair generation are as follows:
Input:
None.
Output:
Process:
N is valid, that is, N=256 or N=384 (the only valid lengths for Suite B).
6. Compute Q=U1+U2 (recall that U2=(k1−k2)Ĝ was precomputed).
Verification:
It can be shown that the previous procedure produces a public key that is compatible with the original procedure given in [7]. The original procedure produces a public key Q=(r+1)G; for the new procedure, we have
In our proposed key pair generation procedure, the cost of elliptic curve computations is as follows. Three elliptic curve point multiplications are required, two of which are offline and one is online. One elliptic curve point addition is required.
Shared Secret Computation.
Party A computes the shared secret Z, which is the x-coordinate of an elliptic curve point, using the appropriate domain parameters D, party B's public key (QB), and their transformed random number {circumflex over (r)}A. Party B computes the shared secret in an analogously, using party A's public key and their own transformed random number. We continue
Target:
Compute P=(rA+1)QB without disclosing rA (party A's random number) inside the memory of party A's machine.
Precomputation:
Let k1(A) and k2(A) denote party A's instance-specific, compile time, random numbers. Party A should pre-compute the integer values s=k1(A)−1 mod n and t=k1(A)−k2(A) mod n. These two values are computed during an off-line set-up phase where the implementation is not subject to white-box attacks (e.g. at compilation time). If party B's public key is known in advance, as is the case in OP-ECDH, then further pre-computation can be done. When doing OP-ECDH, the elliptic curve points {circumflex over (Q)}B
The steps of shared secret computation are as follows:
Input:
Output:
The shared secret Z or an error indicator.
Process:
Verification:
It can be shown that the previous procedure, which is summarized in
Cost:
In our proposed shared secret computation procedure, the cost of elliptic curve computations is as follows. Three elliptic curve point multiplications required. For OP-ECDH, two of the point multiplications are off-line and one is online; for EU-ECDH, all three point multiplications are online. One elliptic curve point addition is required.
The second embodiment of the disclosure is a reduced version of the first embodiment. The second embodiment is described with reference to
Key Pair Generation.
We now describe key pair generation.
Target:
Compute Q=dG without disclosing d inside the memory of one party's machine.
Transformed Random Number:
In this embodiment, the transformation of the random number r is defined as {circumflex over (r)}=t(r)=k1r mod n, where k1 is an instance specific, compile time random number selected from the set Zn*. Thus, {circumflex over (r)}=k1r mode and r=k1−1{circumflex over (r)} mod n.
Precomputation:
Once the instance-specific, compile time, random number k1 is selected, an auxiliary elliptic curve point is pre-computed: Ĝ=k1−1G. As in the first embodiment, this point should be computed during an off-line set-up phase where the implementation is not subject to white-box attacks (e.g. at compilation time).
The steps of key pair generation are as follows:
Input and Output are the same as that of the first embodiment.
Process:
Verification:
The following sequence of identities show that the previous procedure produces a public key that is compatible with the original procedure given in [7]:
Cost:
The costs of elliptic curve computations for key pair generation are now as follows. Two elliptic curve point multiplications are required (one online and one offline). One elliptic curve point addition is required.
Shared Secret Computation:
We explain shared secret computation from the perspective of Party A. Party A will use the appropriate domain parameters D, the other party's public key (QB), and their transformed random number {circumflex over (r)}A to compute Z, which is the x-coordinate of an elliptic curve point P.
Target:
Compute P=(rA+1)QB without disclosing rA inside the memory of party A's machine.
Precomputation:
Let k1(A) denote party A's instance-specific, compile time, random number. Party A should pre-compute the integer values s=k1(A)−1 mod n. This value should be computed during an off-line set-up phase where the implementation is not subject to white-box attacks (e.g. at compilation time). If party B's public key is known in advance, as is the case in OP-ECDH, then further pre-computation can be done. When doing OP-ECDH, the elliptic curve point {circumflex over (Q)}B=sQB=k1(A)−1QB should be computed off-line.
The steps are as follows:
Input:
Output:
The shared secret Z or an error indicator.
Process:
Verification:
It can be shown that the previous procedure, which is summarized in
Cost:
The costs of elliptic curve computations for shared secret computation are now as follows. Two elliptic curve point multiplications are required. For OP-ECDH, one point multiplication is off-line and one is online. For EU-ECDH, both point multiplications are online. One elliptic curve point addition is required (for both OP-ECDH and EU-ECDH).
The second embodiment is a reduced version of the first embodiment. One instance specific random number is removed from the first embodiment, which reduces the number of elliptic curve computations, but possibly sacrifices security strength. This methodology is suitable for those applications where computational efficiency is a prime concern.
The third embodiment is described with reference to
Key Pair Generation:
The third embodiment is an enhanced version of the first embodiment where security strength is a prime concern. The computational cost of this embodiment is higher.
Target:
Compute Q=dG without disclosing d inside the memory of one party's machine.
Transformed Random Numbers:
In this methodology, the Random Bit Generator generates two transformed random numbers {circumflex over ({circumflex over (r)}11 and {circumflex over ({circumflex over (r)}22 that, in combination, encode two random numbers ri and r2. The resultant public key, Q, may be generated using either r1 or r2, and this choice may be made at runtime within the key pair generation procedure. Thus, our proposed procedure can compute Q=dG=(r1+1)G or Q=dG=(r2+1)G. The values {circumflex over ({circumflex over (r)}11 and {circumflex over ({circumflex over (r)}22 output by the RBG are interpreted as being produced by two layers of transformations. One layer corresponds to the same transformation technique of the first embodiment; namely, a linear transform using instance specific, compile time random numbers drawn from the set Zn*. The second layer is a pseudo-Hadamard transformation [11], which ensures that r1 and r2 can only be recovered from the combination of both {circumflex over ({circumflex over (r)}11 and {circumflex over ({circumflex over (r)}22.
More precisely, {circumflex over ({circumflex over (r)}11 and {circumflex over ({circumflex over (r)}22 are interpreted as follows:
{circumflex over ({circumflex over (r)}11={circumflex over (r)}1+{circumflex over (r)}2 mod 2N
{circumflex over ({circumflex over (r)}22={circumflex over (r)}1+2{circumflex over (r)}2 mod 2N
We interpret {circumflex over ({circumflex over (r)}11 and {circumflex over ({circumflex over (r)}22 as being the result of a pseudo-Hadamard transform applied to the values {circumflex over (r)}1 and {circumflex over (r)}2. Note that N is equal to the bit length of the domain parameter n, and {circumflex over (r)}1 and {circumflex over (r)}2 are transformed values that encode r1 and r2 respectively. To recover {circumflex over (r)}2 from {circumflex over ({circumflex over (r)}11 and {circumflex over ({circumflex over (r)}22, we compute {circumflex over ({circumflex over (r)}22−{circumflex over ({circumflex over (r)}11 mod 2N; and to recover {circumflex over (r)}1 we compute 2{circumflex over ({circumflex over (r)}11−{circumflex over ({circumflex over (r)}22 mod 2N. The definition of {circumflex over (r)}1 and {circumflex over (r)}2 is
{circumflex over (r)}
1
=k
11
r
1
+k
12 mod n
{circumflex over (r)}
2
=k
21
r
2
+k
22 mod n
Here, k11, k12, k21, k22 are instance specific, compile time random numbers drawn from the set Zn*. As in the first embodiment, we have r1=k11−1{circumflex over (r)}1−k12k11−1 mod n and r2=k21−1{circumflex over (r)}2−k22k21−1 mod n.
Precomputation:
Once the instance-specific, compile time, random numbers k11, k12, k21, k22 are selected, two sets of auxiliary elliptic curve points are pre-computed: Ĝ11=k11−1G, Ĝ12=k11−1Ĝ11, U12=(k11−k12)Ĝ11 and Ĝ21=k21−1G, Ĝ22=k21−1Ĝ21, U22=(k21−k22)Ĝ21. As in the first embodiment, these points should be computed during an off-line set-up phase where the implementation is not subject to white-box attacks (e.g. at compilation time).
The steps of key pair generation are as follows:
Input and Output are the same as that of the first embodiment.
Process:
Verification:
The procedure is illustrated in
Shared Secret Computation:
We explain shared secret computation from the perspective of Party A. Party A will use the appropriate domain parameters D, the other party's public key (QB), and their own transformed random numbers {circumflex over ({circumflex over (r)}11(A),{circumflex over ({circumflex over (r)}22(A) to compute Z.
Target:
Without disclosing r1(A) and r2(A) inside the memory of the party A's machine, compute P=(r1(A)+1)QB or P=(r2(A)+1)QB depending on whether r1(A)+1 or r2(A)+1 was selected as party A's private key.
Precomputation:
Let k11(A), k12(A), k21(A), k22(A) denote party A's instance-specific, compile time, random numbers. Party A should pre-compute the integer values s1=k11(A)−1 mod n and s2=k21(A)−1 mod n. These values should be computed during an off-line set-up phase where the implementation is not subject to white-box attacks (e.g. at compilation time). If party B's public key is known in advance, as is the case in OP-ECDH, then further pre-computation can be done. When doing OP-ECDH, the elliptic curve points {circumflex over (Q)}B
The steps are as follows:
Input:
Output:
The shared secret Z or an error indicator.
Process:
Verification:
The procedure is illustrated in
The third embodiment may be the most resistant to white-box attacker. In this embodiment, two random numbers use to make it more difficult for the attacker to follow the execution path of the program. Furthermore, two transformations are used to mask the random numbers. As a result, computational efficiency is degraded. The third embodiment is suitable for those applications where security strength is the main concern.
The fourth embodiment is described with reference to
Key Pair Generation:
The fourth embodiment is an enhanced version of the second embodiment.
Target:
Compute Q=dG without disclosing d inside the memory of one party's machine.
Transformed Random Numbers:
As was done the in third embodiment, the fourth embodiment uses the random bit generator to generate two transformed random numbers {circumflex over ({circumflex over (r)}11 and {circumflex over ({circumflex over (r)}22 that, in combination, encode two random numbers r1 and r2. The resultant public key, Q, may be generated using either r1 or r2, and this choice may be made at runtime within the key pair generation procedure. The values {circumflex over ({circumflex over (r)}11 and {circumflex over ({circumflex over (r)}22 output by the RBG are interpreted as being produced by two layers of transformations. One layer corresponds to the transformation technique of the second embodiment; namely, a linear transform using instance specific, compile time random numbers drawn from the set Zn*. The second layer is a pseudo-Hadamard transformation, as in the third embodiment.
More precisely, {circumflex over ({circumflex over (r)}11 and {circumflex over ({circumflex over (r)}22 are interpreted as follows:
{circumflex over ({circumflex over (r)}11={circumflex over (r)}1+{circumflex over (r)}2 mod 2N
{circumflex over ({circumflex over (r)}22={circumflex over (r)}1+2{circumflex over (r)}2 mod 2N
We interpret {circumflex over ({circumflex over (r)}11 and {circumflex over ({circumflex over (r)}22 as being the result of a pseudo-Hadamard transform applied to the values {circumflex over (r)}1 and {circumflex over (r)}2. Note that N is equal to the bit length of the domain parameter n, and {circumflex over (r)}1 and {circumflex over (r)}2 are transformed values that encode r1 and r2 respectively. To recover {circumflex over (r)}2 from {circumflex over (r)}11 and {circumflex over (r)}22, we compute {circumflex over ({circumflex over (r)}22−{circumflex over ({circumflex over (r)}11 mod 2N; and to recover {circumflex over (r)}1 we compute 2{circumflex over ({circumflex over (r)}11−{circumflex over ({circumflex over (r)}22 mod 2N. The definition of {circumflex over (r)}1 and {circumflex over (r)}2 is
{circumflex over (r)}
1
=k
11
r
1 mod n
{circumflex over (r)}
2
=k
21
r
2 mod n
Here, k11, k21 are instance specific, compile time random numbers drawn from the set Zn*. As in the second embodiment, we have r1=k11−1{circumflex over (r)}1 mod n and r2=k21−1{circumflex over (r)}2 mod n.
Precomputation:
Once the instance-specific, compile time, random numbers k11,k21 are selected, two sets of auxiliary elliptic curve points are pre-computed: Ĝ11=k11−1G, Ĝ12=k11−1Ĝ11 and Ĝ21=k21−1G, Ĝ22=k21−1Ĝ21. As in the first embodiment, these points should be computed during an off-line set-up phase where the implementation is not subject to white-box attacks (e.g. at compilation time).
The steps of key pair generation are as follows:
Input and Output are the same as that of the third embodiment.
Process:
Verification:
The computational process is illustrated in
Shared Secret Computation.
As before, we explain shared secret computation from the perspective of Party A. Party A will use the appropriate domain parameters D, the other party's public key (QB), and their own transformed random numbers {circumflex over ({circumflex over (r)}11(A),{circumflex over ({circumflex over (r)}22(A) to compute Z.
Target:
Without disclosing r1(A) and r2(A) inside the memory of the party A's machine, compute P=(r1(A)+1)QB or P=(r2(A)+1)QB depending on whether r1(A)+1 or r2(A)+1 was selected as party A's private key.
Precomputation:
Let k11(A), k21(A) denote party A's instance-specific, compile time, random numbers. Party A should pre-compute the integer values s1=k11(A)−1 mod n and s2=k21(A)−1 mod n. These values should be computed during an off-line set-up phase where the implementation is not subject to white-box attacks (e.g. at compilation time). If party B's public key is known in advance, as is the case in OP-ECDH, then further pre-computation can be done. When doing OP-ECDH, the elliptic curve points {circumflex over (Q)}B
The steps of shared secret computation are as follows:
Input:
Output:
The shared secret Z or an error indicator.
Process:
Verification:
The computational process is illustrated in
The fourth embodiment attempts to reach a compromise between security and computational efficiency.
Table A shows the computational comparison among the proposed White-Box ECDH design embodiments. Table A allows the four embodiments to be ranked in terms of their computational cost. The embodiments can also be approximately ranked in descending order of security strength as embodiment 3, embodiment 1, embodiment 4, embodiment 2.
Numerous modifications, variations and adaptations may be made to the particular embodiments described above without departing from the scope patent disclosure, which is defined in the claims.
This application claims priority to International Patent Application No. PCT/CA2012/000114, filed Feb. 9, 2012, the disclosure of which is hereby incorporated by reference in its entirety
Filing Document | Filing Date | Country | Kind | 371c Date |
---|---|---|---|---|
PCT/CA2012/000114 | 2/9/2012 | WO | 00 | 8/7/2014 |