The present invention relates to performing authentication as to whether or not an element possesses a pre-specified property. An example is authenticating validity of a digital revocation in a public key infrastructure, or authenticating validity of an entitlement to use a resource (e.g. to sign onto a World Wide Web site).
A certificate may have to be revoked prior to its expiration date D2. For example, the certificate owner U may change his affiliation or position, or the owner's private key SKU may be compromised. Other parties must be prevented from using the owner's public key if the certificate is revoked.
One approach to prevent the use of public keys of revoked certificates is through a certificate revocation list (CRL). A CRL is a signed and time-stamped list issued by CA 120 and specifying the revoked certificates by their serial numbers SN. These CRLs must be distributed periodically even if there are no new revoked certificates in order to prevent any type of replay attack. The CRL management may be unwieldy with respect to communication, search, and verification costs. The CRL approach can be optimized using so-called delta-CRLs, with the CA transmitting only the list of certificates that have been revoked in the previous time period (rather than for all time periods). The delta-CRL technique still has the disadvantage that the computational complexity of verifying that a certificate is currently valid is basically proportional to the number of time periods, since the verifier must confirm that the certificate is not in any of the delta-CRLs.
Certificate revocation trees (CRTs) can be used instead of CRLs as described in [15] (the bracketed numbers indicate references listed at the end before the claims).
Instead of CRLs and CRTs, CA 120 could answer queries about specific certificates. In
Let f be a predefined public length-preserving function
f: {0,1}n→{0,1}
where {0,1}n is the set of all binary strings of a length n. Let fi denote the f-fold composition; that is, fi(x)=x for i=0, and fi(x)=f(fi−1(x)) for i>0. Let f be one-way, i.e. given f(x) where x is randomly chosen, it is hard (infeasible) to find a pre-image z such that f(z)=f(x), except with negligible probability. “Infeasible” means that given a security parameter k (e.g. k=n), the pre-image z cannot be computed in a time equal to a predefined polynomial in k except with negligible probability. Let us assume moreover that f is one-way on its iterates, i.e. for any i, given y=fi(x), it is infeasible to find z such that f(z)=y.
We can assume, without loss of generality, that CA is required to provide a fresh validity status every day, and the certificates are valid for one year, i.e. 365 days (D2−D1=365 days). To create a certificate 104 (
c365=f(x), c364=f(f(x)), . . . c1=f365(x), c0=f366(x). (1)
We will sometimes denote x as x(SN) for a certificate with a serial number SN, and similarly ci=ci(SN) where i=0, 1, . . . . The value c0 is called a “validation target”. CA 120 inserts c0 into the certificate 104 together with data 104D (
Every day i (i=1, 2, . . . 365), a certificate re-validation is performed for the valid certificates as follows. For each certificate 104, CA distributes to directories 210 a validation data structure which includes, in addition to a validity status indication (not shown in
1. the certificate's “i-token” ci if the certificate is valid on day i;
2. the revocation seed No if the certificate has been revoked.
(We will call ci a “validity proof”, and N0 a “revocation proof”.) This information is distributed unsigned. Each directory 210 provides this information, unsigned, to a requester system 110 in response to a validity status request 150 (
1. If the validity status is “valid”, the verifier 110 checks that fi(ci)=c0.
2. If the validity status is “revoked”, the verifier 110 checks that f(N0)=N1.
Despite the validity information being unsigned, the scheme is secure because given ci, it is infeasible to compute the subsequent tokens ci+1, ci+2, . . . .
To reduce the communication between CA 120 and directories 210, a hash chain (1) can be generated for a set of certificates 104, and a single i-token ci can be distributed for the set if the set is “unrevoked” (i.e. all the certificates are unrevoked in the set). The certificate 140 will contain a separate target c0 for each set containing the certificate and associated with a hash chain (see [1]).
Certificate revocation can also be performed using accumulators. See [37]. An accumulator is a way to combine a set of values (e.g. a set of valid certificates) into a shorter value. A formal definition of a “secure” accumulator is given in Appendix A at the end of this disclosure before the claims. An accumulator example can be constructed as follows. Let us denote all possible values that can be accumulated as p1, . . . , pt. (For example, each pi can be a unique number assigned to a certificate, and we want to accumulate the values corresponding to the valid certificates.) Suppose v0 is the accumulator value for the empty set. Let ƒ be a one-way function. To accumulate p1, we compute the accumulator as follows:
v({p1})=ƒ(v0,p1) (2)
Now to accumulate p2, we set the accumulator to be v2=ƒ(v1,p2), and so on. More generally, the accumulator value for some set {pi
v({pi
The function ƒ can be chosen such that the accumulation order does not matter, i.e.
ƒ(ƒ(v,pi),pj)=ƒ(ƒ(v,pj),pi) (4)
(this is the “quasi-commutative” property).
In each period j, CA 120 can send to the directories 210 a pair (vj,t) where vj is the accumulator value for the set of the valid certificates, and t is a time stamp. The directories can respond to queries 150 with some proof that the accumulator value vj accumulates the value pi corresponding to the certificate of interest.
A common accumulator is an RSA accumulator defined as follows:
ƒ(v,p)=vp mod n (5)
where p is a positive integer, and n=q1q2 is the product of large prime numbers q1 and q2. In this case,
The certificate validation is performed as follows. Without loss of generality, suppose that the values p1, . . . , pm correspond to the valid certificates in a period j. Then the accumulator value distributed by CA 120 to directories 210 is
vj=v0p
If a verifier 110 inquires a directory 210 of the status of a certificate corresponding to the value pi which is one of p1, . . . , pm, the directory sends to the verifier the accumulator value vj and a “witness” value
si,j=v0p
The verifier checks that
si,jp
If this equality holds, the certificate is assumed to be valid.
The witness si,j is hard to forge provided that it is hard to compute the pi-th root of vj. The pi-th root computation is hard if the adversary does not know the factorization of n and the strong RSA assumption is valid (this assumption is defined in Appendix A). However, it is possible to keep pi and si,j secret. For example, instead of providing the values si,j and pi to the verifier, the verifier can be provided with a proof that such values exist and are known to the certificate owner.
Accumulators can be used more generally to prove that an element satisfies some pre-specified property.
This section summarizes some features of the invention. Other features are described elsewhere in this disclosure. The invention is defined by the appended claims.
In some embodiments of the present invention, accumulators are constructed using modular roots with exponents corresponding to the accumulated values. For example, suppose we need an accumulator to accumulate all the elements that possess some property (e.g. all the valid certificates) or all the elements that do not possess that property (e.g. all the revoked certificates). We will associate each element with an integer greater than 1. Let PPm={p1, . . . , pm} be the set of integers associated with the elements to be accumulated (as in (7)), and denote the product of these integers as Pm:
Pm=Πl=1mpl (10)
(By definition herein, the product of the empty set of numbers is 1; i.e. Pm=1 if PPm is empty.) In some embodiments, the accumulator value represents the Pm-th root of some value u, e.g.
vj=u1/P
In some embodiments, the following advantages are achieved.
Suppose the value (11) accumulates valid digital certificates. The value (11) can be modified by exponentiation to de-accumulate all the values pl except for some given value pi, i.e. to compute
vjp
This exponentiation can be performed by a directory 210 or by the certificate owner's system 110, without knowledge of factorization of the modulus n. The value (12) is the accumulator value as if pi were the only accumulated value. The witness value needed for verification can also be computed as if pi were the only accumulated value. The verification (the authentication) can be performed using the values that do not depend on accumulated values other than pi. Alternatively, the verification can be performed with the accumulator and witness values that incorporate some other accumulated values but not necessarily all the accumulated values. Therefore, if a directory 210 or a user 110 are responsible for providing validity proofs for less than all of the certificates, the directory 210 or the user 110 (the “validity prover”) can use an accumulator that accumulates less than all of the valid certificates. In some embodiments, this feature reduces the number of computations needed to be performed by all of the provers.
In some embodiments, the value u=u(j) can depend on the time period j for which the authentication is provided (unlike the value v0 in (7)). Therefore, the time stamp t can be omitted.
In some embodiments, the accumulator accumulates the revoked certificates rather than the valid certificates. In some embodiments, the witness values for an integer pi depend on other p values.
In some embodiments, the witness values are used as encryption keys for encrypting validity proofs. The validity proofs can be constructed using a non-accumulator based validation system, e.g. as in
In each period j, if the certificate 140.i is still valid, CA 120 transmits the decryption key (the witness value) for the period j to the user, enabling the user to recover the token cj(i). The user provides the token to the verifiers to proof the certificate validity as described above in connection with
Some embodiments of the invention are suitable for limited bandwidth or low reliability networks, for example in ad hoc networks, where the nodes 110 may have low computational and transmission power, and where the nodes may have only incomplete information about the topology of the network. (An ad hoc network is a self-configuring wireless network of mobile routers.)
Some embodiments are communication and storage efficient.
The invention is not limited to the features and advantages described above. Other features are described below. The invention is defined by the appended claims.
The embodiments described in this section illustrate but do not limit the invention. The invention is defined by the appended claims.
In the following description, numerous details are set forth. However, the present invention may be practiced without these details. Some portions of the detailed descriptions that follow are presented in terms of algorithms and symbolic representations of operations on data bits within a computer memory. These algorithmic descriptions and representations are the means used by those skilled in the data processing arts to most effectively convey the substance of their work to others skilled in the art. An algorithm is here, and generally, conceived to be a self-consistent sequence of steps leading to a desired result. The steps are those requiring physical manipulations of physical quantities. Usually, though not necessarily, these quantities take the form of electrical or magnetic signals capable of being stored, transferred, combined, compared, and otherwise manipulated. It has proven convenient at times, principally for reasons of common usage, to refer to these signals as bits, values, elements, symbols, characters, terms, numbers, or the like.
It should be borne in mind, however, that all of these and similar terms are to be associated with the appropriate physical quantities and are merely convenient labels applied to these quantities. Unless specifically stated otherwise as apparent from the following discussion, it is appreciated that throughout the description, discussions utilizing terms such as “processing” or “computing” or “calculating” or “determining” or “displaying” or the like, refer to the action and processes of a computer system, or some computing device, that manipulates and transforms data represented as physical (electronic) quantities within the computer system's registers and other storage into other data similarly represented as physical quantities within the computer system storage, transmission or display devices.
The present invention also relates to apparatus for performing the operations herein. This apparatus may be specially constructed for the required purposes, or it may comprise a general-purpose computer selectively activated or reconfigured by a computer program stored in the computer. Such a computer program may be stored in a computer readable storage medium, such as, but is not limited to, any type of disk including floppy disks, optical disks, CD-ROMs, and magnetic-optical disks, read-only memories (ROMs), random access memories (RAMs), EPROMS, EEPROMS, magnetic or optical cards, or any type of media suitable for storing electronic instructions, and each coupled to a computer system.
Some of the algorithms presented herein are not inherently related to any particular computer or other apparatus. Various general-purpose systems may be used with programs in accordance with the teachings herein, or it may prove convenient to construct more specialized apparatus to perform the required operations. The required structure for a variety of these systems will appear from the description below. In addition, the present invention is not described with reference to any particular programming language. A machine-readable medium includes any mechanism for storing or transmitting information in a form readable by a machine (e.g., a computer). For example, a machine-readable medium includes read only memory (“ROM”); random access memory (“RAM”); magnetic disk storage media; optical storage media; flash memory devices; electrical, optical, acoustical or other form of propagated signals (e.g., carrier waves, infrared signals, digital signals, etc.); etc.
M
In a given transaction, VV wishes to ascertain that the certificate has not been revoked prior to its expiration date. To do so, VV must obtain a proof of validity or a proof of revocation that is or has been issued by the certification authority CA. VV may obtain this proof either from the CA directly, or through some other distribution mechanism.
We let {0,1}* denote the set of all bit strings. For a bit string s, we denote its length by |s|. We let H denote a cryptographic compression function that takes as input a b-bit payload as well as a v-bit initialization vector or IV, and produces a v-bit output. We assume b≧2v, which holds for all well-known constructions in use. For the constructions we describe here, we typically take b=2v. We assume these cryptographic compression functions are collision resistant; that is, finding two distinct inputs m1≠m2 such that H(IV, m1)=H(IV, m2) is difficult. We assume that the IV is fixed and publicly known. For notational simplicity, we will not always explicitly list IV as an argument in the hash function. A practical example of such a cryptographic compression function is SHA-1[26]. SHA-1's compression function has an output and IV size of 20-bytes and a 64-byte payload size. In many embodiments, we will not need to operate on data larger than the compression function payload size; however there are numerous standard techniques such as iterated hashing or Merkle-trees [19] for doing so. For simplicity, we will use the term hash function instead of compression function, where it is understood that a hash function can take arbitrary length strings {0,1}* and produce a fixed length output in {0,1}v.
In practice, one often constructs a length preserving function that is one way on its iterates by starting with a hash function Hand padding part of the payload to make it length preserving.
Finally, for a real number r, we set ┌r┐ to be the ceiling of r, that is, the smallest integer value greater than or equal to r. Similarly, └r┘ denotes the floor of r, that is, the largest integer value less than or equal to r.
Let A be an algorithm. By A(·) we denote that A has one input. By A(·, . . . , ·) we denote that A has several inputs). By A(·) we will denote that A is an indexed family of algorithms. y←A(x) denotes that y was obtained by running A on input x. In case A is deterministic, then this y is unique; if A is probabilistic, then y is a random variable. If S is a set, then y←S denotes that y was chosen from S uniformly at random. Let b be a boolean function. The notation (y←A(x): b(y)) denotes the event that b(y) is true after y was generated by running A on input x. Finally, the expression
Pr[{xi←Ai(yi)}1≦i≦n:b(xn)]
denotes the probability that b(xn) is TRUE after the value xn was obtained by running algorithms A1 . . . , An on inputs y1, . . . yn.
Some accumulator schemes will now be described with respect to the digital certificate validation. These schemes can also be used for other authentication operations as described above.
To simplify the description, we will assume that each certificate owner Ui operates just one system 110.i and owns at most one certificate. This is not in fact necessary.
CA SET-UP. The CA 120 generates a composite modulus n. In some embodiments, n is the product of two primes, but n can also be the product of three or more primes. In some embodiments, n is public, but n is hard to factor, and the factorization is known only to the CA 120.
I
The CA computes a unique public value mi∈(Z/nZ)* for the user ui. (Z/nZ is the ring of all residue classes modulo n, and (Z/nZ)* is the multiplicative group of the invertible elements of Z/nZ, i.e. of the elements defined by integers mutually prime with n.) In some embodiments,
mi=H1(ui,pi,j0,wi)∈(Z/nZ)* (13)
where H1 is a predefined function, e.g. a one-way collision resistant function, ui is the user identification (e.g. user name), and wi is an optional string (which may contain ui's public key and/or some other information). The value mi thus binds pi to ui's identity. If desired, any one or more of the values jo, pi, mi and the identification of the function H1 can be made part of the certificate 140. See
At step 420, the CA computes an “initial certificate” value
si,j
The CA keeps this value secret, and transmits it to the user ui's system 110 in an encrypted form.
C
Pj=Πp
i.e., Pj is the product of the integers in PPj. (By definition, the product of the empty set of numbers is 1.) The CA computes
hj=H2(j)∈(Z/nZ)*, hj−1=H2(j−1)∈(Z/nZ)* (16)
where H2 is the same as H1 or some other function. The CA also computes the accumulator
vj=vj(PPj)=(hj/hj−1)1/P
Of note, in some embodiments, the root in (17) exists with a very high probability.
The CA transmits the following data to the system 110 of each user in UUj (this data can also be broadcast to a larger set of users if such broadcast transmission is more efficient in terms of communication):
1. vj;
2. the list of users in UUj, which may be just the list of numbers pi in PPj. (Since there can be overlap among these lists for different periods j, the CA can simply transmit the information that has changed since the previous period; in this case, the CA transmits the list of numbers in PPj to each new user at step 430 of
P
vj(pi)=(hj/hj−1)1/p
This value is computed from the vj value (17) by exponentiation:
At step 620, the user aggregates this personal accumulator with previous personal accumulators to obtain a witness for the period j:
si,j=si,j−1vj(pi)=si,j−1(hj/hj−1)1/p
The equations (20) and (14) imply that
si,j=si,j
U
1. computes mi from (13) or obtains it from the certificate 140 (
2. computes hj=H2(j) and hj
3. confirms that
si,jp
A
The verifier sends to the user a random message m.
The user generates a random number r and computes:
d=H(m∥rp
where H is some predefined public function (e.g. H1 or H2). The user sends the values m, rp
The verifier computes J=mi−1 mod n and checks that the following equations hold:
d=H(m∥JdDp
JdDp
This scheme may reduce total bandwidth because the verifier does not need the certificate 140. Note: for the GQ signature scheme to be secure, it is desirable that each pi>2160.
REMARKS. 1. First, si,j
2. ANONYMOUS AUTHENTICATION. If we want to allow the user nodes 110 to sign messages anonymously—i.e., to be able to authenticate themselves as users that have been certified by the CA, but without revealing their actual identities—we can handle initial certification differently. For example, instead of (13), the CA can set mi to be independent of the user's identity, e.g.:
mi=hj
The mi value is then provided to the user. To hide the fact that the mi value is provided to the user, the mi value may be transmitted to the user via a secure channel (e.g., by encrypting this value such that only ui can decrypt it). Then, it follows from (14) and (21) that:
si,j=hj1/p
As mentioned below in Appendix A, there are efficient zero-knowledge (ZK) proofs through which user ui can prove, in the jth time period, that it knows a pth root of hj modulo n for some (unrevealed) number p that is contained in a specified interval I of integers, i.e.
PK{(α,β): αβ=hj mod nβ∈I}
See [41], [42]. The interval I can be an interval containing all the numbers pi. Using this scheme, the user can authenticate itself anonymously if and only if it has obtained an initial certificate si,j
As mentioned above, the accumulator techniques are not limited to a user possessing a single certificate or to a digital certificate revocation. For example, a user ui may or may not possess one or more of entitlements e1, e2, . . . , er. During initial certification, the user ui is assigned a unique integer pi,k if the user is to be certified for an entitlement ek. The entitlement proof proceeds as described above with respect to equations (13)-(26), with numbers pi replaced by pi,k, with si,j replaced by si,k,j, etc.
In some of these embodiments, however, a value pi is computed as the product of pi,k. For example, if the user is initially certified for all the entitlements, then:
pi=pi,l . . . pi,r (27)
In some embodiments, the integers pi,k are mutually prime relative to each other and to the CA's modulus n. The user ui's initial certificate is si,j
The user can use this value to demonstrate that it possesses the entitlement that corresponds to prime pi,k without revealing the user's other entitlements. The CA can revoke a specific entitlement ek for the user without revoking the user's other entitlements, simply by issuing a validation accumulator in (17) that does not accumulate pi,k (i.e. the product Pj is computed as in (15) except that pi is replaced with pi/pi,k (the number pi,k is not included in the product)). If the user wants to sign anonymously and unlinkably, then the user cannot reveal its specific value of pi,k, but it would seem to make it difficult for a verifier to determine what entitlement the user is claiming (if the verifier cannot see the value of pi,k). We can get around this problem by, for example, associating each entitlement ek to a range of integers Ii. This range will contain all the pi,k values for all the users ui initially certified to have the entitlement ek. The ranges Ik do not overlap in some embodiments. Then the user can prove in ZK that it possesses a modular p-th root (for some unrevealed number p in the range Ik) of the appropriate value, i.e. the user can provide the following proof of knowledge:
PK{(α,β): αβ=C mod nβ∈Ik} (29)
where C=mihj/hj−1 or C=hj (see (21), (26)). See [41], [42]. The user can also prove that it possesses multiple entitlements simultaneously, e.g., by proving (in ZK if desired) its possession of a pth root for some p that is the product of (for example) two integers p1 and p2 in ranges corresponding to the claimed entitlements:
PK{(α,β):αβ=C mod nβ=β1β2β1∈Ik
See [41], [42].
USERS MOVING FROM NETWORK TO NETWORK: Suppose the CA has users (certificate owner systems) 110 in different networks, and the users move from a network to a network (e.g. some of the networks may be ad hoc networks). The CA can calculate a separate accumulator (say, (17)) for each network. Each accumulator will accumulate only the valid users in the corresponding network based on the CA's knowledge of the current composition of users in each network. Each user will compute its personal accumulator value and/or witness value (e.g., as in (18), (20), (26), and/or (28)). The user can move to another network and use the same personal accumulator value and/or witness value in the other network.
AGGREGATION OF PERSONAL ACCUMULATORS: Multiple personal accumulators can be aggregated into a single value, in order to save bandwidth; from this aggregated value, a verifier can batch-verify that multiple users are indeed certified. For example, in the scheme of equations (17), (22), user ui (if it is still valid) possesses a personal value si,j that satisfies
si,jp
where mi is as in (13), and jo,i is the period jo for the user ui. Denote
zi=mi(hj/hj
Then, for multiple users in period j, their personal values can simply be multiplied together
S=Πi=1t′si,j(mod n) (33)
where t′ is the number of users which take part in the aggregation (or the number of certificates or entitlements belonging to a user if each ui is a certificate or an entitlement). A verifier can use the value (33) to confirm that users (or certificates or entitlements) (u1, . . . , ut′) are valid by confirming that:
DE-ACCUMULATION. We will now discuss concrete approaches to the de-accumulation of values from the accumulator computed by the CA; however, we note that the CA does not need to compute a single accumulator value that accumulates all of the values associated to all of the valid nodes 110 in the network. Instead, the CA can use a tradeoff, which we now describe. De-accumulation is somewhat computationally expensive. Procedure Split(v,P) below performs de-accumulation on an accumulator v that accumulates t k-bit numbers P={pi}; the procedure's output is t personal accumulators; the execution time is O(t log2 t) (where we consider k to be a constant). Let us denote the accumulator accumulating an empty set of values as u (in equation (17), u=h(j)/h(j−1)). Then
Procedure Split(v,P):
1. Split the set P into two disjoint halves P1 and P2.
2. Compute
(mod n); this is the accumulator for the set P1.
3. Compute
(mod n); this is the accumulator for the set P2.
4. If P1 has only one member, output (v1,P1), otherwise call Split (v1,P1).
5. If P2 has only one member, output (v2,P2), otherwise call Split (v2,P2).
End of Split (v,P)
As a rule of thumb, exponentiating a number modulo n by a product of t′ numbers takes time proportional to t′; thus, the first split operation (steps 2 and 3 in procedure Split(v,P)) is the most expensive one in the recursion above. (Actually, since there are t distinct numbers (p1, . . . , pt), the pi are O(log t) bits apiece on average, so that the exponentiation is proportional to t′ log t′; since the Split algorithm can recurse to a depth of log t′, the overall computation complexity is t log2 t.) To reduce the amount of de-accumulation that needs to be performed, the CA can (for example) compute two accumulators—one for each of two halves P1 and P2 of P—and transmit these two accumulators to the users, thereby allowing the users to skip the first split operation and thereby reducing their computation. The CA can reduce their computation further by transmitting even more accumulators, each for an even smaller subset of the users. In computing these multiple accumulators, it is advantageous for the CA to use its knowledge of the current network topology to enhance the performance of the scheme. For example, suppose that the users are divided into several (say, 10) topological areas. Then, instead of using one product of integers Pj as in (15), the CA can compute 10 such products Pj,1, . . . , Pj,10, and 10 area-specific accumulators (hj/hj−1)1/P
De-accumulation is also relevant to how the personal accumulators (18)-(19) are computed. Clearly, in terms of computation, it is non-optimal for each of the t users to perform de-accumulation (18), (19) independently; doing it this way would entail O(t2) computation (actually, worse, since the size of the pi must grow at least as fast as log t). In some embodiments, the users compute their personal accumulators cooperatively, as illustrated, for example, in
If the performance metric is minimum communication, a simple broadcast by the CA of the accumulator (17) to t users (see
Referring to
Node 110.1 performs operations similar to those for node 110.0.
This procedure can be extended recursively to any number of nodes. The procedure is not limited to the “binary-tree” type of
Now, we describe one approach for selecting the sub-groups, where nodes only have local topology information. Roughly speaking, the idea is as follows. To partition (a portion of) the network, the designated node D chooses two “group heads” in the network, such as nodes 110.0, 110.1 in
At the beginning, there is only a single group PP of users which covers the whole network. Each recursion step is carried out in four phases as depicted in
S
B
R
T
Considering a set of all users UU with cardinality |UU|=2R=t, and equal distribution of each group PP into two groups of identical size |PP|/2, the recursion will end after R steps with a total number of t sub-groups of size 1. Here, we mention a few aspects of the scheme's computational and communication complexity, assuming that, at each step, the split is into approximately equal halves.
S
B
R
T
Overall, the communication is O(t√{square root over (t)} log t) and the computation is O(t log2 t).
If we assume that there is an entity E of the network with less stringent limitations in terms of computational power and energy such as the certification authority itself, another interesting opportunity arises. By acquiring some topology knowledge this entity may pre-de-accumulate the accumulator into accumulators for constant size groups. In detail, such a scheme works as follows: Some node C which holds the current accumulator for all users and has connectivity to E initiates a distributed clustering algorithm of
Briefly, we mention a distributed approach that splits a connected network into two connected subnetworks, where neither subnetwork is more than twice the size of the other. As before, we begin with a sub-network that has a designated node. We assume that the sub-network is connected, and that its topology is basically constant during the accumulator distribution (though it may change dramatically from period to period). The first step in the preferred algorithm for achieving O(t log t) communication and O(t log t) computation is that, in a distributed fashion, the nodes of the sub-network establish an enumeration of themselves. This can be accomplished as follows. The designated node initiates the procedure by broadcasting a message. The nodes that receive the message transmit an acknowledgement message back to the designated node, and they log the designated node as the node from which they received the message. The designated node logs the acknowledging nodes as the nodes that received its message. This process recurses throughout the network. Specifically, a node that received the message from the designated node broadcasts the message, and nodes that have not sent or received the message before log the identity of the node that they received the message from and they send back an acknowledgment, after which the sending node logs the acknowledgers. If a node receives the message more than once, it only logs and responds back to the first node that sent it. In this fashion, since the subnetwork is connected, every node in the subnetwork (except the designated node) has a unique node from which it received the message, as well as a list of the nodes that received the message directly from it. Each node that has a nonempty list of acknowledgers chooses an arbitrary enumeration of those acknowledgers. In effect, from this procedure, we have constructed a tree (a graph with no loops) from the network, as well as an enumeration of all of the tree nodes given by the depth-first pre-order traversal of the tree. If the subnetwork has t′ nodes, this procedure can be accomplished with O(t′) communication and computation.
The next stage of the algorithm is to use the enumeration to approximately bisect the subnetwork. There are a variety of different ways of doing this. One method is that each node, beginning with the nodes with no acknowledgers, could back-transmit (backwards according to the enumeration) the number of nodes in its subtree including itself; in this fashion, each node computes the number of nodes in its subtree. There must be exactly one node that has at least half of the nodes in its subtree, but such that none of its children have at least half. This node is designated to be the midpoint of the subnetwork. Now, viewing the midpoint point as the root of the tree, it divides its children into two groups, such that the number of nodes that are in a subtree emanating from one of the children in the first group is approximately equal to the number of nodes that are in a subtree emanating from one of the children in the second group. (This can always be accomplished such that ratio between the two numbers is at most two.) Thus, all of the network nodes except the midpoint become members of one of the two groups; the midpoint is considered to be a member of both groups. The step of computing how many nodes are in each subtree requires O(t′ log t′) communication, since there are t transmissions, where the size of each transmission (which is a number between 1 and t′ representing how many nodes are in the given subtree) is log t′ bits. Viewing the midpoint node as the root node, a new enumeration of the nodes is established with the midpoint node as the initiator of the message, beginning with the nodes in the first group. (This new enumeration could be computed as before, or it could actually be derived indirectly from the previous enumeration. Either way, it does not add to the asymptotic communication complexity of the protocol, which is t log2 t overall.)
The personal accumulators vj(pi) and/or the witness values si,j (see e.g. equations (17), (20), (21), (26)) can be used to obtain symmetric keys that users can use to decrypt validation proofs under other validation systems. An example will now be given for the validation system obtained by combining the systems of
INITIAL CERTIFICATION: When a user us joins the system, the CA:
RE-VALIDATION BY CA: At the start of, or shortly before, each period j, the CA:
PROOF DERIVATION BY THE PROVER: If the user ui's certificate is valid (as indicated by the transmission of number pi in the RE-VALIDATION procedure at step 1), the prover:
AUTHENTICATION (VALIDITY PROOF): As in
Many variations are possible. For example, the decryption keys can be some different function of si,j than the encryption keys.
Above, we have described an approach in which an accumulator accumulates the valid certificates; an alternative approach is to accumulate revoked certificates. The valid certificates' owners (or validity provers) then use the “dynamic” feature of dynamic accumulators to compute a new accumulator for the valid nodes, and to compute their personal accumulators with respect to this new accumulator.
As before, we will assume for the sake of illustration that each user operates a corresponding computer system 110 and owns at most one certificate 140. This is not limiting, as a user may own multiple certificates and/or operate multiple systems 110. The scheme is also applicable to controlling resource access and other kinds of authentication. We will use the word “user” to denote both the system 110 and the system's operator where no confusion arises. As described below, each user ui will be assigned a positive integer pi with the same properties as in the scheme of
For each period j, the symbol QQj will denote the set of the revoked pi numbers (i.e. the set of the pi numbers corresponding to the certificates to be declared as revoked in period j). Qj denotes the product of the numbers in QQj:
Qj=Πp
The symbol a will denote an accumulator seed, which is an integer mutually prime with the modulus n. The accumulator of the values in Qj is:
vj=a1/Q
Let RRj=QQj=QQj−1, i.e. RRj is the set of the p numbers corresponding to the certificates declared as revoked in period j but not in period j−1; in period j−1 these certificates were either declared as valid or were not yet part of the validation system. Let Rj denote the product of the numbers in RRj:
Rj=Πp
It is easy to see from (37) that if each certificate cannot be “unrevoked” (i.e. cannot be made valid once revoked), then:
vj=vj−11/R
CA SET UP: The CA generates its modulus n and the accumulator seed a. The CA sets the initial accumulator value vj=a for the initial period j.
I
ti=mi1/p
si,j
Here, vj
si,j=vj1/p
for each period j>jo for which the certificate is valid.
C
W
At step 1320, the user computes the witness
wi,j=tisi,j (45)
U
wi,jp
A
The verifier sends to the user a random message m.
The user generates a random number r and computes:
d=H(m∥rp
where H is some predefined public function. The user sends the values m, mi, rp
The verifier computes J=mi−1 mod n and checks that the following equations hold:
d=H(m∥JdDp
JdDp
This scheme may reduce total bandwidth because the verifier does not need the certificate 140. Note: for the GQ signature scheme to be secure, it is desirable that pi>2160.
The use of accumulator vj allows a user to revoke itself, without the CA's help. To revoke itself in a period j, user ui simply broadcasts vj1/p
In some embodiments, this scheme allows efficient distribution (e.g. broadcast) for the CA at the stage of
The invention is not limited to the embodiments described above. The invention is not limited to secure or dynamic accumulators. An accumulator can be any data that accumulate some elements. Further, the invention is not limited to the accumulators described above. For example, the accumulator seed h(j)/h(j−1) in equation (17) can be replaced with a value independent of the period j, and the accumulator seed a in the accumulator (37) can be replaced with a function of j. The accumulator methods can be used to prove (authenticate) membership in a set or possession of some property. Examples include authentication of valid entitlements, or authentication of people as being members of some organization.
In some embodiments, the CA 120, the directories 210, and the systems 110 are computer systems communicating with each other over a network or networks. Each of these systems may itself be a computer system having components communicating over networks. Each computer system includes one or more computer processors executing computer instructions and manipulating computer data as described above. The term “data” includes “computer data” and covers both computer instructions and computer data manipulated by the instructions. The instructions and data can be stored on a data carrier such as a computer storage, i.e. a computer readable medium (e.g. a magnetic or optical disk, a semiconductor memory, and other types of media, known or to be invented). The data carrier may include an electromagnetic carrier wave transmitted over a network, e.g. through space, via a cable, or by some other means. The instructions and data are operable to cause the computer to execute appropriate algorithms as described above.
The invention is not limited to any particular hash functions, or to cryptographic functions (which are easy to compute but are one-way or collision resistant). In some embodiments, it is desirable that a function f or H be collision resistant not in the sense that it is difficult to find different x and y with the same image but in the sense that if x and y are uniformly drawn from the function's domain, the probability is small that they both will have the same image:
P{H(x)=H(y)}≦α
where α is a small constant (e.g. 1/10, or 1/100, or 2−25, or 2−50, or 2−80, or 2−160, or some other value). Some or all of the techniques used for validity proofs can also be used for invalidity proofs and vice versa. The CA, the directories, and the systems 110 may include software-programmable or hardwired computer systems interconnected via a network or networks. Each function f or H represents an evaluation method performed by a computer system. The invention is not limited to the step sequences shown in the flowcharts, as the step order is sometimes interchangeable and further different steps may be performed in parallel. Other embodiments and variations are within the scope of the invention, as defined by the appended claims.
All of the following references are incorporated herein by reference.
Definition (Secure Accumulator). A secure accumulator for a family of inputs {XXk} is a family of families of functions GG={FFk} with the following properties:
Efficient generation: There is an efficient probabilistic algorithm G that on input 1k produces a random element f of {FFk}. G also outputs some auxiliary information about f, denoted by auxf.
Efficient evaluation: f∈{FFk} is a polynomial-size circuit that, on input (u,x)∈ UUƒ×XXk, outputs a value v∈UUƒ, where UUƒ is an efficiently-samplable input domain for the function f and {XXk} is the intended input domain whose elements are to be accumulated.
Quasi-commutative: For all k, for all f∈FFk, for all u∈UUƒ, for all x1,x2∈XXk, f(f(u,x1),x2)=f(f(u,x2),x1). If X={x1, . . . , xm}⊂XXk, then by f(u,X) we denote f(f( . . . (u,x1), . . . ),xm).
Witnesses: Let v∈UUf and x∈XXk. A value w∈UUf is called a witness for x in v under f if v=f (w,x).
Security: Let UU′f×XX′k denote the domains for which the computational procedure for function f∈FFk is defined (thus UUƒ⊂UU′f, XXk⊂XX′k). For all probabilistic polynomial-time adversaries Ak,
Pr[f←G(1k);u←UUf,(x,w,X)←Ak(f,UUf,u) X⊂XXk;w∈UUf′;x∈XXk;x∉X;ƒ(w,x)=ƒ(u,X)]−neg(k).
Camenisch and Lysyanskaya ([39]) define the notion of a dynamic accumulator:
Definition (Dynamic Accumulator). A secure accumulator is dynamic if it has the following property:
Efficient Deletion: There exist efficient algorithms D and W such that, if v=f(u,X), x, x′∈X, and f(w,x)=v, then:
D(auxf,v,x′)=v′ such that v′=f(u,X−{x′}) and
W(f,v,v′,x,x′)=w′ such that f(w′,x)=v′.
Z
Let x be an input, and let R be a polynomially computable relation. Roughly speaking, a zero-knowledge proof of knowledge of a witness w such that R(x, w)=1 is a probabilistic polynomial-time protocol between a prover P and a verifier V such that, after the protocol, V is convinced that P knows such a witness w, but V does not obtain any explicit information about w. In other words, apart from “proving” that it knows a witness w such that R(x, w)=1, P imparts “zero knowledge” to V.
In the sequel, we may use the notation introduced by Camenisch and Stadler for various proofs of knowledge of discrete logarithms and proofs of the validity of statements about discrete logarithms. For instance,
PK{(α,β,γ):y=gαhβy′=g′αh′γ(u≦α≦v)}
denotes a zero-knowledge Proof of Knowledge of integers α, β, and γ such that y=gαhβ and y′=g′αh′γ, where u≦α≦v and where g, g′, h, h′, y, and y′ are elements of some groups G=<g>=<h> and G′=<g′>=<h′>. The convention is that Greek letters denote quantities the knowledge of which is being proved, while all other parameters are known to the verifier. Using this notation, a proof-protocol can be described by just pointing out its aim while hiding all details.
Often, these proofs of knowledge are instantiated by a three-pass protocol, in which the prover first sends the verifier a commitment to certain values, after which the verifier sends the prover a challenge bit-strings, and the prover finally sends a response that incorporates both the “known value”, the committed values and the challenge value in such a way that it convinces the verifier is convinced of the prover's knowledge.
These proofs of knowledge can be turned into signature schemes via the Fiat-Shamir heuristic. That is, the prover determines the challenge c by applying a collision-resistant hash function H to the commitment and the message m that is being signed and then computes the response as usual. We denote such signature proofs of knowledge by the notation, e.g., SPK{α:y=ƒ(α)}(m). Such SPK's can be proven secure in the random oracle model, given the security of the underlying proofs of knowledge.
ZK proofs are often accomplished with the help of a commitment scheme. A commitment scheme consists of the algorithms Commit and VerifyCommit with properties as follows. The commitment algorithm Commit takes as input a message m, a random string r and outputs a commitment C, i.e., C=Commit(m,r). The (commitment) verification algorithm VerifyCommit takes as input (C,m,r) and outputs 1 (accept) if C is equal to Commit(m,r) and 0 (reject) otherwise. The security properties of a commitment scheme are as follows. The hiding property is that a commitment C=Commit(m,r) contains no (computational) information on m. The binding property is that given C, m, and r, where 1=VerifyCommit(C,m,r), it is (computationally) impossible to find a message m0 and a string r0 such that 1=VerifyCommit(C,m0,r0).
To prove, in ZK, knowledge of a witness w of a value x that has been accumulated—i.e., that f(w,x)=v, where v is the accumulator value—the usual method is to choose a random string r and construct a commitment c=Commit(x, r) and then provide the following proof of knowledge:
PK{(α,β,γ):c=Commit(α,γ)ƒ(γ,α)=v}.
Above α represents the (hidden) x value, while β represents r and γ represents w.
RSA-B
Definition (RSA modulus). A 2 k-bit number n is called an RSA modulus if n=pq, where p and q are k-bit prime numbers.
Of course, one can choose n in a different way—e.g., as the product of three primes, or as the product of two primes of different sizes.
Definition (Euler totient function). Let n be an integer. The Euler totient function φ(n) is the cardinality of the group Zn* (the multiplicative group of elements having an inverse in the ring Zn of the integers modulo n; Zn* is the set of all elements mutually prime with n).
If n=pq is the product of two primes, then φ(n)=(p−1)(q−1).
The security of RSA-based accumulators is based on the following assumption.
Definition (Strong RSA Assumption) The strong RSA assumption is that it is “hard,” on input an RSA modulus n and an element u∈Zn*, to compute values e>1 and v such that ve=u(mod n). By “hard”, we mean that, for all polynomial-time circuit families {Ak}, there exists a negligible function neg(k) such that
Pr[n←RSAmodulus(1k);u←Zn*(v,e)←Ak(n,u): ve=u(mod n)]=neg(k),
where RSAmodulus(1k) is an algorithm that generates an RSA modulus as the product of two random k-bit primes, and a negligible function neg(k) is a function such that for all polynomials p(k), there is a value k0 such that neg(k)<1/p(k) for all k>k0. The tuple (n,u) generated as above, is called a general instance of the strong RSA problem.
Corollary 1. Under the strong RSA assumption, it is hard, on input a flexible RSA instance (n,u), to compute integers e>1 and v such that ve=u(mod n).
The most common concrete instantiation of accumulators is based on the above strong-RSA assumption. Roughly speaking, the idea is as follows: Given a fixed base u(mod n), one can compute an accumulator of values x1 and x2 (for example) as v=ux
Now, we relate the formal description of accumulators to the concrete RSA-based construction. A secure RSA-based accumulator for a family of inputs Xk is a family of functions FFk, where the particular function f∈FFk depends on what the modulus n is. For reasons that will become clear later, we assume that elements of Xk are pairwise relatively prime integers. Then, auxf is the (secret) factorization of n. As alluded to above, given an initial accumulator value v′, an additional value x is added to the accumulator by computing a new accumulator value v=v′x(mod n). Notice that the computational complexity of this algorithm is independent of the number of prior values that have been accumulated. The RSA-based accumulator possesses the quasi-commutative property; e.g., regardless of the order in which x1 and x2 are incorporated into an initial accumulator v′, the result is v=v′x
RSA-based accumulators can be made dynamic. Recall that an accumulator is dynamic if, given an accumulator v that accumulates values of the set X and given the secret information auxf, one can “de-accumulate” a value x′∈X—i.e., compute a new accumulator v′ that accumulates the values of the set X−{x′}. Moreover, given a witness w that a value x has been accumulated (with respect to accumulator v that accumulates members of the set X) and given the accumulator v′ that only accumulates members of X−{x′}, one can compute a new witness w′ for x with respect to the accumulator v′. Specifically, for RSA-based accumulators, one can use the factorization of n to de-accumulate x′ by computing v′=v1/x′ (mod n). And given a witness w for x with respect to v—i.e., wx=v(mod n)—and given the value of v′, one can compute a witness w′ for x with respect to v′—i.e., w′x=v′=v′1/x′(mod n)→w′=v′1/xx′ (mod n) as follows. Assuming that x and x′ are relatively prime, one can compute integers (a,b) such that ax+bx′=1, by using the Extended Euclidean Algorithm. Then w′=v′awb=va/x′vb/x=v(ax+bx′)xx′=v1/xx′ (mod n). Notice that the computation of w′ is efficient, and (after v′ is computed) it doesn't require any secret knowledge (e.g., the factorization of n).
Given a witness w that a value x is accumulated (with respect to an accumulator v(mod n)), it also well-known in the art how to construct a ZK proof of knowledge of a pair (w,x) that satisfies wx=v(mod n) (that hides the values of both w and x from the verifier).
End of Appendix A
SET-UP:
1. A public key generator (PKG, a trusted party), publishes its public key (v,n) where n=q1q2 (a product of two primes) is such that its factorization is hard to find, and v is a prime less than φ(n)=(p−1)(q−1).
2. For a user with an identity ID (e.g., an email address), the PKG computes the secret signing key B such that JBv≡1 mod n, where J=R (ID), where R is a predefined public function, e.g. a redundancy function. In some embodiments, the function R maps ID into an element of Zn. The function R is such that the number of elements in Zn that correspond to mappings from valid ID's is small. The PKG sends B to the user via a secure channel (e.g. encrypted).
SIGNING: To sign a message M, the user:
1. Computes J=R(ID)
2. Generates a random number r and computes
d=H(M∥rv), D=rBd (B-1)
where H is a predefined public function (e.g. a hash function).
3. Sends the values M, rv, d, D to the verifier.
VERIFICATION: The verifier:
1. Computes J=R(ID);
2. Checks that d=H(M∥rv);
3. Checks that JdDv=rv.
End of Appendix B
The present application is a division of U.S. patent application Ser. No. 11/304,200 filed on Dec. 15, 2005 now U.S. Pat. No. 7,266,692, which claims priority under 35 U.S.C. §119(e) to provisional U.S. patent application No. 60/637,177 filed Dec. 17, 2004, both of which are incorporated herein by reference.
Number | Name | Date | Kind |
---|---|---|---|
4914698 | Chaum | Apr 1990 | A |
5140634 | Guillou et al. | Aug 1992 | A |
5666416 | Micali | Sep 1997 | A |
5687235 | Perlman | Nov 1997 | A |
5699431 | Van Oorschot | Dec 1997 | A |
5717757 | Micali | Feb 1998 | A |
5717758 | Micali | Feb 1998 | A |
5793868 | Micali | Aug 1998 | A |
5903651 | Kocher | May 1999 | A |
5960083 | Micali | Sep 1999 | A |
5982892 | Hicks et al. | Nov 1999 | A |
6044462 | Zubeldia | Mar 2000 | A |
6097811 | Micali | Aug 2000 | A |
6128740 | Curry | Oct 2000 | A |
6226743 | Naor et al. | May 2001 | B1 |
6292893 | Micali | Sep 2001 | B1 |
6301659 | Micali | Oct 2001 | B1 |
6442689 | Kocher | Aug 2002 | B1 |
6487658 | Micali | Nov 2002 | B1 |
6532540 | Kocher | Mar 2003 | B1 |
6766450 | Micali | Jul 2004 | B2 |
20020046337 | Micali | Apr 2002 | A1 |
20020165824 | Micali | Nov 2002 | A1 |
20030177352 | Camenisch et al. | Sep 2003 | A1 |
20030217265 | Nakano et al. | Nov 2003 | A1 |
20030221101 | Micali | Nov 2003 | A1 |
20030236976 | Wheeler | Dec 2003 | A1 |
20040049675 | Micali | Mar 2004 | A1 |
20040128504 | Kivinen | Jul 2004 | A1 |
20050053045 | Chmora | Mar 2005 | A1 |
20050055548 | Micali | Mar 2005 | A1 |
20050081037 | Kumagai | Apr 2005 | A1 |
Number | Date | Country |
---|---|---|
WO2005002944 | Mar 2005 | WO |
Number | Date | Country | |
---|---|---|---|
20060242699 A1 | Oct 2006 | US |
Number | Date | Country | |
---|---|---|---|
60637177 | Dec 2004 | US |
Number | Date | Country | |
---|---|---|---|
Parent | 11304200 | Dec 2005 | US |
Child | 11454394 | US |