The aim of the present invention is to propose an alternative scheme to the classical Boneh-Franklin scheme in order to simplify the generation and the use of the asymmetric keys.
Consider the following scenario: a center would like to broadcast some data to l receivers, where only authorized users (typically, those who have paid a fee) can have access to the data. A possible solution, widely deployed in commercial Pay-TV systems or in secured media distribution systems, for instance, consists in encrypting the data using a symmetric key and securely transmitting this key to each authorized receiver, where it is stored in a tamper-proof piece of hardware such as a smartcard.
Unfortunately, tamper-resistant hardware is very difficult and/or costly to design, since it is vulnerable to a wide variety of attacks. Therefore, a malicious user (hereafter called a traitor) can try to retrieve the decryption key from his receiver and distribute it (sell or give away) to unauthorized users (the pirates). Depending on the nature of the encryption schemes in use, we can even imagine situations where a dishonest user will try to mix several legitimate keys in order to build a new one and embed it in a pirate receiver device.
The problem of identifying which receivers were compromised and/or which secret keys were leaked is called traitor tracing. Usually, two modes of traitor tracing are considered: in the black-box mode, the tracing algorithm sends crafted ciphertexts to the rogue receiver and aims at determining which keys it uses by observing its behavior; in the non-black-box mode, we assume that the keys (or their combination) can be extracted from the pirate receiver and are known to the tracing algorithm.
Fiat and Naor [1] introduced the concept of broadcast encryption. In their model, there exists a set of l authorized users and the broadcasting center can dynamically specify a privileged subset of authorized users that can decrypt selected ciphertexts (like high-value content, for instance). Later on, Chor, Fiat, and Naor [2] introduced the concept of traitor tracing to thwart the problem of decryption-key piracy in broadcast encryption schemes; their scheme is k-collusion resistant (or k-resilient) in the sense that at least one traitor is identified with very high probability if there are at most k of them. Subsequently, Naor, Naor and Lotspiech [3, 4] presented more efficient broadcast encryption schemes with tracing capabilities; it was however shown by Kiayias and Pehlivanoglu [5] that the iterative nature of the tracing procedure allows a pirate to significantly leverage the compromise of a few keys.
Boneh and Franklin [6] proposed a new public-key traitor-tracing scheme based on error-correcting codes, more precisely on Reed-Solomon codes. The Boneh-Franklin non-black-box traitor tracing scheme is k-collusion resistant and deterministic in the sense that all of the traitors are identified with probability 1 as long as at most k of them collude to derive new pirate keys.
The aim of the present application is an improved key generation and encryption mechanism for Boneh-Franklin and related schemes.
An immediate benefit of the present application is the possibility to use Reed-Solomon codes that are especially optimized to allow faster decryption and key generation. In practice, for large systems and coalitions of medium size, one obtains a decryption speed improvement by almost an order of magnitude.
The present application also addresses the beyond-threshold security of the Boneh-Franklin scheme: if an adversary is able to recover 2k or more secret keys, where k is the maximal collusion size defined prior to the system deployment, then he is able to compute any other secret key (even if they were not compromised) and thus, the security of the system completely collapses. This is mainly due to the fact that the linear tracing code is public. In the present application we propose a way to protect against this issue.
We now describe the original Boneh-Franklin algorithm in detail, as published in [6]. This description will be the basis for the description of the invention for the fast and secure traceable keys and encryption/decryption mechanisms.
We need a group Gq (i.e., a set of elements equipped with a mathematical operation) of prime order q in which the Decision Diffie-Hellman problem is hard. Three main choices are conceivable, although others exist:
Scenario 1. We can work in a subgroup of order q of the multiplicative group of Z/pZ, where p and q are large prime numbers and where q|p−1. Typically, q is a 160-bit prime number and p is a 1024-bit prime number. Group elements are 1024-bit numbers requiring 1024 bits of storage/bandwidth; implementing the Boneh-Franklin scheme on such a prime-order subgroup requires the ability to perform modular additions, modular subtractions, modular multiplications and modular inversions both on 160-bit and on 1024-bit numbers.
Scenario 2. We can work over a group of points of an elliptic curve [3] over a finite field of characteristic 2 having on the order of $2^{160}$ elements. Group elements typically require 320 bits, but point compression techniques reduce this to 160 bits of storage/bandwidth. Implementing the Boneh-Franklin scheme on such a group requires that the receiver be able to perform additions, subtractions, multiplications and inversions on 160-bit field elements.
Scenario 3. We can work over a group of points of an elliptic curve over a finite field of large prime characteristic having on the order of $2^{160}$ elements. Group elements typically require 320 bits, but point compression techniques reduce this to 160 bits of storage/bandwidth. Implementing the Boneh-Franklin scheme on such a group requires performing modular additions, modular subtractions, modular multiplications and modular inversions on 160-bit numbers.
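As a toy illustration of scenario 1, the following Python sketch derives an element of order q in the multiplicative group modulo p. The function name and the parameters p = 2039, q = 1019 are illustrative assumptions only; they are far smaller than the 1024-bit and 160-bit sizes mentioned above and offer no security.

```python
def subgroup_generator(p: int, q: int) -> int:
    """Return an element of order q in the multiplicative group modulo the prime p."""
    if (p - 1) % q != 0:
        raise ValueError("q must divide p - 1")
    cofactor = (p - 1) // q
    for h in range(2, p):
        g = pow(h, cofactor, p)  # g^q = h^(p-1) = 1 by Fermat's little theorem
        if g != 1:               # q is prime, so any g != 1 has order exactly q
            return g
    raise ValueError("no generator found")


# Toy parameters (assumed for illustration): P = 2*Q + 1, both prime.
P, Q = 2039, 1019
G = subgroup_generator(P, Q)
```

With these values the first candidate h = 2 yields G = 4, which generates the subgroup of order 1019.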
We now describe the traceable key public component γ(i) generation process as done in [6]. For ease of understanding, we assume from now on that we work in a multiplicative group as described in scenario 1. Basically, the approach of Boneh and Franklin is based on the use of Reed-Solomon codes.
and considering a basis b1, . . . , b2k of the nullspace of A, a new matrix is built
Consider Γ as being the set of rows of B. Thus, Γ contains l codewords, each of length 2k. By observing that any vector in the span of the rows of A corresponds to a polynomial of degree at most l−2k−1 evaluated at the points 1, . . . , l, one can construct the rows of B using Lagrange interpolation.
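The Lagrange-interpolation remark can be checked numerically. The sketch below assumes that A is the (l−2k)×l matrix whose t-th row is (1^t, 2^t, . . . , l^t) for t = 0, . . . , l−2k−1, which matches the description of its row span given above. It then takes as i-th row of B the vector λ_i·(1, i, . . . , i^(2k−1)), where λ_i is the inverse of the product of (i−j) over all j≠i. This is one valid choice of nullspace basis, and no claim is made that it coincides with the formula used in [6]. All function names and the toy values q = 101, l = 7, k = 2 are assumptions; q must exceed l so that every λ_i exists.

```python
def lagrange_weight(i: int, l: int, q: int) -> int:
    """lambda_i = 1 / prod_{j != i} (i - j) mod q, over the points 1..l."""
    prod = 1
    for j in range(1, l + 1):
        if j != i:
            prod = prod * (i - j) % q
    return pow(prod, -1, q)  # modular inverse (Python 3.8+)


def build_A(l: int, k: int, q: int) -> list:
    """Assumed form of A: row t holds the monomial x^t evaluated at 1..l."""
    return [[pow(i, t, q) for i in range(1, l + 1)] for t in range(l - 2 * k)]


def build_B(l: int, k: int, q: int) -> list:
    """Row i of B is lambda_i * (1, i, ..., i^(2k-1)) mod q."""
    return [
        [lagrange_weight(i, l, q) * pow(i, j, q) % q for j in range(2 * k)]
        for i in range(1, l + 1)
    ]


def in_nullspace(A: list, B: list, q: int) -> bool:
    """Check that every column of B is orthogonal to every row of A modulo q."""
    return all(
        sum(a * row[c] for a, row in zip(a_row, B)) % q == 0
        for a_row in A
        for c in range(len(B[0]))
    )


A = build_A(7, 2, 101)
B = build_B(7, 2, 101)
```

The check succeeds because the weighted sum of f(i)·λ_i over i = 1, . . . , l vanishes for every polynomial f of degree at most l−2, and each product of a row of A with a column of B is such a sum.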
Let k denote the maximal allowed coalition size (i.e., the maximum number of keys that could potentially be mixed by a pirate while keeping the tracing properties). Let g denote a generator of the group Gq of prime order q in which we implement the Boneh-Franklin scheme. Let l denote the maximum number of receivers in the Boneh-Franklin system. Let 1≦i≦l denote the identity of the i-th receiver. The following values are computed:
1. The i-th Boneh-Franklin traceable key public component is computed as being the following 2k-valued vector over Z/qZ:
2. The public key is computed by generating 2k secret values rj:
$r_j \in_R \mathbb{Z}/q\mathbb{Z}$ for $1 \le j \le 2k$ (4)
and computing
$h_j = g^{r_j}$ for $1 \le j \le 2k$. (5)
Then 2k secret values αj are generated:
$\alpha_j \in_R \mathbb{Z}/q\mathbb{Z}$ for $1 \le j \le 2k$ (6)
and finally the value y is computed as
The public key is then defined as being the (2k+1)-valued vector
(y, h1, . . . , h2k) (8)
3. The i-th private key secret component θi put in the i-th receiver, is derived from the i-th traceable key public component
$\gamma^{(i)} = (\gamma_1^{(i)}, \ldots, \gamma_{2k}^{(i)})$ as
To encrypt a message m∈Gq, we first generate a random value $a \in_R \mathbb{Z}/q\mathbb{Z}$; the ciphertext is then defined as being the (2k+1)-valued vector
$(m \cdot y^a, h_1^a, \ldots, h_{2k}^a)$ (10)
Given a ciphertext $c = (s, \rho_1, \ldots, \rho_{2k})$, it is easy to see that one can recover m, using the i-th private key secret component θi, by computing
where the $\gamma_j^{(i)}$ are the public components of the traceable private key which are used to derive θi.
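The key generation, encryption and decryption steps can be exercised end to end on a toy group. The sketch below is written under stated assumptions rather than as the reference implementation: it takes y as the product of the h_j^α_j, θi as (Σ r_j·α_j)/(Σ r_j·γ_j(i)) mod q, and decryption as m = s/(Π ρ_j^γ_j(i))^θi, as in the published scheme [6]. The group parameters (p = 2039, q = 1019, g = 4), the function names, and the placeholder public components used in the usage lines are likewise assumptions, and the parameters are not secure.

```python
import secrets

P, Q, G = 2039, 1019, 4  # toy group: G has prime order Q modulo P (NOT secure)


def keygen(gammas: dict, k: int):
    """gammas maps a receiver identity i to its 2k public components."""
    while True:
        r = [secrets.randbelow(Q) for _ in range(2 * k)]
        alpha = [secrets.randbelow(Q) for _ in range(2 * k)]
        denoms = {i: sum(rj * gj for rj, gj in zip(r, g_i)) % Q
                  for i, g_i in gammas.items()}
        if all(denoms.values()):  # every denominator must be invertible mod Q
            break
    h = [pow(G, rj, P) for rj in r]
    y = 1
    for hj, aj in zip(h, alpha):
        y = y * pow(hj, aj, P) % P  # assumed: y = prod_j h_j^alpha_j
    num = sum(rj * aj for rj, aj in zip(r, alpha)) % Q
    thetas = {i: num * pow(d, -1, Q) % Q for i, d in denoms.items()}
    return (y, h), thetas


def encrypt(public_key, m: int):
    """Ciphertext (m * y^a, h_1^a, ..., h_2k^a) for a fresh random a."""
    y, h = public_key
    a = secrets.randbelow(Q)
    return m * pow(y, a, P) % P, [pow(hj, a, P) for hj in h]


def decrypt(ciphertext, gamma_i, theta_i: int) -> int:
    """Assumed decryption rule: m = s / (prod_j rho_j^gamma_j)^theta_i."""
    s, rho = ciphertext
    acc = 1
    for rho_j, g_j in zip(rho, gamma_i):
        acc = acc * pow(rho_j, g_j, P) % P
    mask = pow(acc, theta_i, P)  # equals y^a
    return s * pow(mask, -1, P) % P


# Usage with placeholder public components for 5 receivers and k = 2.
gammas = {i: [pow(i, j, Q) for j in range(4)] for i in range(1, 6)}
public_key, thetas = keygen(gammas, 2)
message = pow(G, 123, P)  # the message must be a group element
ciphertext = encrypt(public_key, message)
```

Every receiver recovers the same message from the single broadcast ciphertext, each with its own pair (γ(i), θi).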
In order to simplify the generation and the use of the asymmetric keys, in particular private keys in a public key encryption scheme with traceable private keys formed by a public component and a secret component, we propose a method to generate an i-th private key in a public key encryption scheme with traceable private keys formed by a public component γ(i) and a secret component θi, according to a maximal coalition factor k, with all arithmetic operations performed within the multiplicative group Z/qZ where q is a prime number,
said public component being defined as:
$\gamma^{(i)} = (1, b \bmod q, b^2 \bmod q, \ldots, b^{2k-1} \bmod q)$
and said secret component being defined as:
where rj and αj are uniformly distributed random values in the group Z/qZ, 1≦j≦2k and where the value b may be either public and easily computable or secret and statistically decorrelated.
Furthermore, we propose two possible variants to encrypt any type of message faster than the original Boneh-Franklin scheme with the same tracing and security properties.
We now present a traceable private key public component generation process which allows deriving public components which offer a significantly improved decryption speed.
Previously, we noted that the components γ(i) can be computed using the recursive formula Eq. (3); this operation is typically feasible in the broadcasting center, but not in a receiver. We can furthermore note that, working in a usual security configuration of $2^{80}$ operations, the elements of a public component γ(i) all have a length of 160 bits.
This new method works as follows: in the key generation process described previously, step 1 is replaced by
1′. We compute the i-th fast Boneh-Franklin traceable private key public component as being the following 2k-valued vector over Z/qZ:
$\gamma^{(i)} = (1, i \bmod q, i^2 \bmod q, \ldots, i^{2k-1} \bmod q)$. (12)
The method presented above results in rather small exponent sizes, which can drastically speed up the ciphertext decryption in the receiver: re-writing (11) as
we can replace, for instance for $l = 2^{20}$, 2k+1 modular exponentiations with 160-bit exponents by 2k modular exponentiations with 20-bit exponents and one 160-bit exponentiation. This is more than a 7-times speedup.
According to a particular embodiment of the invention, q is greater than $2^{127}$ in order to avoid generic attacks against the discrete logarithm problem.
A further advantage of this method is that a receiver can compute the public component of the decryption key without the need to evaluate the recursive formula of Eq. (3).
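One way to obtain short exponents from the power-vector form of γ(i) is a Horner-style evaluation in the exponent, sketched below. This is an interpretation offered for illustration, not necessarily the exact rewriting intended by the text: it computes the product of the ρ_j raised to i^(j−1) using only exponentiations by the small identity i, and leaves the single long exponentiation by θi to the caller. The function names and toy parameters are assumptions.

```python
def fast_gamma(i: int, k: int, q: int) -> list:
    """Public component (1, i, i^2, ..., i^(2k-1)) mod q, computable by the receiver."""
    return [pow(i, j, q) for j in range(2 * k)]


def combine_direct(rho: list, gamma_i: list, p: int) -> int:
    """prod_j rho_j^gamma_j mod p, using the (possibly long) reduced exponents."""
    acc = 1
    for rho_j, g_j in zip(rho, gamma_i):
        acc = acc * pow(rho_j, g_j, p) % p
    return acc


def combine_horner(rho: list, i: int, p: int) -> int:
    """Same product, computed as rho_1 * (rho_2 * (rho_3 * ...)^i)^i mod p.

    Every exponentiation uses the short exponent i (about 20 bits for l = 2^20).
    """
    acc = 1
    for rho_j in reversed(rho):
        acc = pow(acc, i, p) * rho_j % p
    return acc


# Toy check: the rho_j must lie in the subgroup of prime order Q.
P, Q = 2039, 1019
rho = [pow(4, e, P) for e in (3, 5, 7, 11)]
```

Both routines agree whenever the ρ_j belong to the order-q subgroup, since exponents can then be reduced modulo q without changing the result.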
In practical scenarios, there might be a situation where an attacker has 2k secret components θi at his disposal. This part of the invention describes specifically how the system can be protected in such a case. We start by describing an attack that might occur in practice and allow the attacker to derive every private key in the system.
Let us suppose that an adversary has managed to get 2k private elements $\theta_{i_s}$, for 1≦s≦2k. The vectors in Γ={γ(1), γ(2), . . . , γ(l)} are assumed to be public. Then, we can rewrite Eq. (9) over Z/qZ as
with $\omega_j = r_j / \sum_{t=1}^{2k} r_t \alpha_t$; note that the ωj are coefficients unknown to the adversary. However, with 2k private elements, we have a system of 2k linear equations in 2k variables with a single solution, which reveals the values of ωj to the adversary using a simple Gaussian elimination. From those coefficients, the adversary can compute any other private key θi in the system.
Not only will the adversary be able to create many untraceable combinations of keys, but he will also be able to distribute newly derived keys so that innocent users (whose keys were a priori never compromised) will be accused of treachery.
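The attack can be reproduced on toy values. The sketch below assumes that the rewritten key equation has the linear form 1/θi = Σ ωj·γj(i) mod q, which follows from the definition of ωj if θi = (Σ r_j·α_j)/(Σ r_j·γ_j(i)), and it uses the public power-vector components γ(i) = (1, i, . . . , i^(2k−1)). The secret values r and α, the modulus q = 1019 and all function names are illustrative assumptions.

```python
def solve_mod(M: list, v: list, q: int) -> list:
    """Solve M x = v over Z/qZ by Gauss-Jordan elimination (M assumed invertible)."""
    n = len(M)
    aug = [row[:] + [b] for row, b in zip(M, v)]
    for c in range(n):
        piv = next(r for r in range(c, n) if aug[r][c] % q)
        aug[c], aug[piv] = aug[piv], aug[c]
        inv = pow(aug[c][c], -1, q)
        aug[c] = [x * inv % q for x in aug[c]]
        for r in range(n):
            if r != c and aug[r][c]:
                f = aug[r][c]
                aug[r] = [(x - f * y) % q for x, y in zip(aug[r], aug[c])]
    return [row[n] for row in aug]


def theta(i: int, r: list, alpha: list, q: int) -> int:
    """Legitimate key derivation, known only to the center (assumed formula)."""
    num = sum(a * b for a, b in zip(r, alpha)) % q
    den = sum(rj * pow(i, j, q) for j, rj in enumerate(r)) % q
    return num * pow(den, -1, q) % q


def forge(leaked: dict, target: int, k: int, q: int) -> int:
    """Recover omega from 2k leaked thetas, then derive the key of `target`."""
    ids = sorted(leaked)
    M = [[pow(i, j, q) for j in range(2 * k)] for i in ids]
    v = [pow(leaked[i], -1, q) for i in ids]  # 1/theta_i = sum_j omega_j * gamma_j(i)
    omega = solve_mod(M, v, q)
    inv_theta = sum(w * pow(target, j, q) for j, w in enumerate(omega)) % q
    return pow(inv_theta, -1, q)


Q = 1019
r, alpha = [3, 5, 7, 11], [2, 4, 6, 8]                 # center's secrets (toy values)
leaked = {i: theta(i, r, alpha, Q) for i in (1, 2, 3, 4)}  # 2k = 4 compromised keys
forged = forge(leaked, 6, 2, Q)
```

The forged value equals the key of receiver 6, whose key was never leaked, which is exactly the framing risk described above.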
We now present a traceable key generation process which allows deriving traceable keys resistant to pirates able to gather 2k keys or more. This new method works as follows:
1″ We compute the i-th fast Boneh-Franklin public component of the traceable private key as being the following 2k-valued vector over Z/qZ:
$\gamma^{(i)} = (1, \zeta \bmod q, \zeta^2 \bmod q, \ldots, \zeta^{2k-1} \bmod q)$ (14)
where ζ ∈R Z/qZ is drawn independently and uniformly at random for each γ(i).
We note that the receivers have to store the entire representation
$d^{(i)} = (\theta_i \gamma_1^{(i)}, \ldots, \theta_i \gamma_{2k}^{(i)})$ (15)
in tamper-proof memory; hence the above-mentioned public component becomes secret.
A possible variant would consist in deriving ζ from i by processing i and/or additional information with a cryptographically secure pseudo-random function (or permutation) parameterized by a secret key.
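A minimal sketch of this variant follows, assuming HMAC-SHA256 as the pseudo-random function and an 8-byte big-endian encoding of the identity i. Both choices, the reduction of the digest modulo q, and the function names are assumptions made for illustration.

```python
import hashlib
import hmac


def derive_zeta(secret_key: bytes, i: int, q: int) -> int:
    """Derive zeta for receiver i as HMAC-SHA256(secret_key, i) reduced mod q."""
    digest = hmac.new(secret_key, i.to_bytes(8, "big"), hashlib.sha256).digest()
    return int.from_bytes(digest, "big") % q


def secret_gamma(secret_key: bytes, i: int, k: int, q: int) -> list:
    """Component (1, zeta, zeta^2, ..., zeta^(2k-1)) mod q, kept secret."""
    zeta = derive_zeta(secret_key, i, q)
    return [pow(zeta, j, q) for j in range(2 * k)]


gamma_5 = secret_gamma(b"center-secret", 5, 2, 1019)
```

The center can recompute any γ(i) on demand from the secret key alone instead of storing one random ζ per receiver. Reducing a 256-bit digest modulo a 160-bit q introduces only a negligible bias.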
To encrypt a message m∈Gq, the standard Boneh-Franklin encryption procedure requires generating a random value $a \in_R \mathbb{Z}/q\mathbb{Z}$; the ciphertext is defined as being the (2k+1)-valued vector
$(m \cdot y^a, h_1^a, \ldots, h_{2k}^a)$ (16)
In most practical situations, the message m consists in a symmetric session key k, which is then used to encrypt some content, since m is of limited length (no more than 20 bytes, usually). Furthermore, one possibly needs a hash function mapping a group element to a symmetric key.
We propose to bypass these intermediate steps and to use one of the two following possible variants to encrypt any type of message faster than the standard Boneh-Franklin scheme, but keeping the same tracing and security properties.
1. To encrypt a message m∈{0, 1}* (i.e., a bitstring of arbitrary length), we first generate a random value $a \in_R \mathbb{Z}/q\mathbb{Z}$ and the ciphertext is defined as being the (2k+1)-valued vector
$(m \oplus \mathrm{PRF}(n, y^a), h_1^a, \ldots, h_{2k}^a)$ (17)
where PRF(., .) denotes a cryptographically secure pseudo-random function. For instance, it can be HMAC-SHA1, HMAC-SHA256 or a block cipher evaluated on a counter, where $y^a$ is considered as being the symmetric key and n is a nonce value (e.g., a counter incremented sufficiently many times to generate enough key stream). Here, the XOR operation ⊕ could be replaced by any group law.
2. To encrypt a message m∈{0, 1}*, we first generate a random value $a \in_R \mathbb{Z}/q\mathbb{Z}$ and the ciphertext is defined as being the (2k+1)-valued vector
$(E(m, y^a), h_1^a, \ldots, h_{2k}^a)$ (18)
where E(., .) is a block cipher or any symmetric encryption scheme based on a block cipher, and where $y^a$ is considered as being the key. A possible variant would consist in mapping the $y^a$ value to a key using a hash function. Another possible variant is an encryption scheme E(.,.) requiring additional information, like an initial vector.
In Pay-TV systems, the use of traceable asymmetric keys is an advantage in terms of fighting against piracy. The Pay-TV receiver (or the security module thereof) is loaded with a private key, i.e., the public component γ(i) and the secret component θi. Each Pay-TV receiver, such as a set-top box, multimedia device or wireless portable device (DVB-H), comprises at least one private key. The secret component is preferably stored in a secure container such as a SIM card, a smartcard or any type of tamper-proof memory.
In a practical example, a video/audio data packet PSpacket will be encrypted in the following way, assuming we are working with a multiplicative group and HMAC-SHA256 as the function PRF (see formula (17)):
generating a uniformly distributed random value a,
computing $h_1^a, h_2^a, \ldots, h_{2k}^a$ using the 2k last elements of the public key (see formula (8)),
computing $y^a$ using the first element of the public key,
dividing the PSpacket into chunks of 256 bits, possibly leaving a residual chunk of less than 256 bits,
initializing an index to an arbitrary constant (usually 0),
for each chunk, computing the HMAC-SHA256 of the index with $y^a$ as key, the index being updated for each chunk, and applying an XOR function (or any group operation) with the respective chunk,
in case a residual chunk exists, adjusting the HMAC-SHA256 value by extracting the number of bits corresponding to the number of bits of the residual chunk before applying the XOR function,
transmitting to the receiver the values resulting from the XOR function together with $h_1^a, h_2^a, \ldots, h_{2k}^a$.
On the receiver side, the received values $h_1^a, h_2^a, \ldots, h_{2k}^a$ are considered as 2k values, i.e. $\rho_1, \rho_2, \ldots, \rho_{2k}$.
In order to extract the audio/video data PSpacket, the following steps will be executed:
computing
using the public component γ(i) and the secret component θi of the private key,
executing the same HMAC-SHA256 operation as made on the sender side, defining the index in the same way as during the encryption operation.
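The chunking and masking steps common to the sender and the receiver can be sketched as follows. The text does not specify how the group element is serialized into an HMAC key or how the index is encoded, so the sketch simply takes the key as a byte string and encodes the index on 8 big-endian bytes; both are assumptions, as are the function and constant names.

```python
import hashlib
import hmac

CHUNK_BYTES = 32  # 256-bit chunks, matching the HMAC-SHA256 output size


def mask_packet(packet: bytes, ya_key: bytes, start_index: int = 0) -> bytes:
    """XOR each 256-bit chunk with HMAC-SHA256(ya_key, index).

    The index is incremented per chunk. For a residual chunk, zip() stops at
    the end of the chunk, so only the needed leading bytes of the HMAC value
    are used.
    """
    out = bytearray()
    offsets = range(0, len(packet), CHUNK_BYTES)
    for index, off in enumerate(offsets, start=start_index):
        chunk = packet[off:off + CHUNK_BYTES]
        pad = hmac.new(ya_key, index.to_bytes(8, "big"), hashlib.sha256).digest()
        out.extend(c ^ s for c, s in zip(chunk, pad))
    return bytes(out)


packet = bytes(range(70))          # two full chunks plus a 6-byte residual chunk
key = b"serialized y^a (assumed)"  # placeholder for the encoded group element
masked = mask_packet(packet, key)
```

Because XOR is its own inverse, the receiver applies the same function with the same key and starting index to recover the packet.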
In this way, the broadcasting center can send a global, encrypted version of the audio/video packet to all receivers; those receivers decrypt the packets using their own private key. A pirate willing to implement an unofficial (unlawful) receiver will necessarily have to embed a unique private key (or a mix of several private keys) in order to decrypt the packets. Having such a rogue receiver in hand, the Pay-TV operator can then recover the pirate private key(s) and possibly revoke it (them) using another mechanism and/or possibly take legal or any other action against the person having purchased the original (broken) receiver(s), provided such a link exists.
According to another embodiment, instead of mixing the packets themselves with the HMAC result, the packets are encrypted with a standard symmetric encryption scheme using a key K, and it is this key K that is mixed with the HMAC result.
According to another embodiment, the encrypted packet is obtained by encrypting the said packet with a symmetric encryption scheme using the $y^a$ value as a key (e.g. TDES in CBC mode). According to an alternative embodiment, a hashing function is first applied to the $y^a$ value before it is used as a key. This is preferably the case when the size of the $y^a$ value differs from the key size of the symmetric encryption scheme.
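The hashing step of the alternative embodiment can be sketched as below. SHA-256, the fixed-length big-endian encoding of the group element, and truncation of the digest to the cipher key length (24 bytes for three-key TDES) are assumptions chosen for illustration; the text does not prescribe a particular hash function or encoding.

```python
import hashlib


def derive_symmetric_key(ya: int, p: int, key_len: int) -> bytes:
    """Map the group element y^a (an integer mod p) to a key of key_len bytes."""
    if key_len > hashlib.sha256().digest_size:
        raise ValueError("key_len exceeds the SHA-256 output size")
    encoded = ya.to_bytes((p.bit_length() + 7) // 8, "big")  # fixed-length encoding
    return hashlib.sha256(encoded).digest()[:key_len]


tdes_key = derive_symmetric_key(1234, 2039, 24)  # toy values: y^a = 1234, p = 2039
```

Sender and receiver obtain the same key because both apply the same encoding and hash to the same value of $y^a$.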
Another possible field of application concerns the protection of software against piracy. We may assume that a software package is sold together with a hardware dongle containing a different private key for every package. This dongle is able to decrypt a global ciphertext contained in the software and obtain a piece of information which is necessary for the use of the software. If a pirate is willing to clone dongles and sell them, he must embed at least one private key. Having such a pirate dongle in hand, the software seller can then recover the involved private key(s) and take legal or any other action against the person having purchased the original (broken) dongle(s), provided such a link exists.