Various exemplary embodiments disclosed herein relate generally to a method and apparatus for performing a privacy preserving comparison.
Protocols have been developed for comparing private values using homomorphic encryption. Embodiments improving upon the state of the art will be described below.
A brief summary of various exemplary embodiments is presented below. Some simplifications and omissions may be made in the following summary, which is intended to highlight and introduce some aspects of the various exemplary embodiments, but not to limit the scope of the invention. Detailed descriptions of an exemplary embodiment adequate to allow those of ordinary skill in the art to make and use the inventive concepts will follow in later sections.
Various embodiments relate to a method for performing a secure comparison between a first secret data and a second secret data, including: receiving, by a processor of a first party, encrypted bits of the second secret data y from a second party, where ℓ is an integer; computing the Hamming weight h of first secret data x, wherein x has ℓ bits; computing the value of a first comparison bit δA such that δA=0 when h>⌊ℓ/2⌋, δA=1 when h<⌈ℓ/2⌉, and δA is randomly selected when h=ℓ/2; forming a set ℒ of ⌊ℓ/2⌋ indexes that includes at least the indexes i where xi=δA; selecting random invertible scalars ri for each i in ℒ and computing c*i=(1+(1−2δA)xi·yi2δ
Further various embodiments relate to a non-transitory machine-readable storage medium encoded with instructions for performing a secure comparison between a first secret data and a second secret data, including: instructions for receiving, by a processor of a first party, encrypted bits of the second secret data y from a second party, where ℓ is an integer; instructions for computing the Hamming weight h of first secret data x, wherein x has ℓ bits; instructions for computing the value of a first comparison bit δA such that δA=0 when h>⌊ℓ/2⌋, δA=1 when h<⌈ℓ/2⌉, and δA is randomly selected when h=ℓ/2; instructions for forming a set ℒ of ⌊ℓ/2⌋ indexes that includes at least the indexes i where xi=δA; instructions for selecting random invertible scalars ri for each i in ℒ and computing c*i=(1+(1−2δA)xi·yi2δ
Various embodiments are described, wherein the second party sets a value of a second comparison bit δB based upon the decrypted c*i's and wherein δA⊕δB=[x≤y].
Various embodiments are described, wherein the second party sets a second comparison bit δB=1 when any one of the decrypted c*i's is equal to zero.
Various embodiments are described, wherein the second party sets a second comparison bit δB=0 when none of the decrypted c*i's is equal to zero.
Various embodiments are described, wherein the encryption uses the Paillier cryptosystem.
Various embodiments are described, wherein the encryption uses the exponential variant of the ElGamal cryptosystem.
Various embodiments are described, further including receiving an encryption ⟦δB⟧ of the second comparison bit δB from the second party and computing ⟦δ⟧=⟦δB⟧ when δA=0 and ⟦δ⟧=⟦1⟧·⟦δB⟧⁻¹ when δA≠0, wherein δ=[x≤y].
Further various embodiments relate to a method for performing a secure comparison between a first secret data and a second secret data, including: receiving, by a processor of a first party, encrypted second secret data ⟦y⟧ from a second party, wherein ℓ is the number of bits in y and wherein ⟦y⟧ denotes the additive homomorphic encryption of y; choosing a random mask ρ in [0, 2^(ℓ+κ)), where κ is a security parameter; computing ⟦z†⟧=⟦y⟧·⟦x⟧⁻¹·⟦2^ℓ+ρ⟧, wherein x is the first secret data having ℓ bits; sending ⟦z†⟧ to the second party; computing x′=ρ mod 2^ℓ; receiving, by the processor of a first party, encrypted bits of y′ from the second party, wherein y′ is based upon z†; computing the Hamming weight h of x′; computing the value of a first comparison bit δ′A such that δ′A=0 when h>⌊ℓ/2⌋, δ′A=1 when h<⌈ℓ/2⌉, and δ′A is randomly selected when h=ℓ/2; forming a set ℒ of ⌊ℓ/2⌋ indexes that includes at least the indexes i where x′i=δ′A; selecting random invertible scalars ri for each i in ℒ and computing c*i=(1+(1−2δ′A)x′i·y′i2δ′
Further various embodiments relate to a non-transitory machine-readable storage medium encoded with instructions for performing a secure comparison between a first secret data and a second secret data, including: instructions for receiving, by a processor of a first party, encrypted second secret data ⟦y⟧ from a second party, wherein ℓ is the number of bits in y and wherein ⟦y⟧ denotes the additive homomorphic encryption of y; instructions for choosing a random mask ρ in [0, 2^(ℓ+κ)), where κ is a security parameter; instructions for computing ⟦z†⟧=⟦y⟧·⟦x⟧⁻¹·⟦2^ℓ+ρ⟧, wherein x is the first secret data having ℓ bits; instructions for sending ⟦z†⟧ to the second party; instructions for computing x′=ρ mod 2^ℓ; instructions for receiving, by the processor of a first party, encrypted bits of y′ from the second party, wherein y′ is based upon z†; instructions for computing the Hamming weight h of x′; instructions for computing the value of a first comparison bit δ′A such that δ′A=0 when h>⌊ℓ/2⌋, δ′A=1 when h<⌈ℓ/2⌉, and δ′A is randomly selected when h=ℓ/2; instructions for forming a set ℒ of ⌊ℓ/2⌋ indexes that includes at least the indexes i where x′i=δ′A; instructions for selecting random invertible scalars ri for each i in ℒ and computing c*i=(1+(1−2δ′A)x′i·y′i2δ′
Various embodiments are described, wherein the second party decrypts ⟦z†⟧ and defines y′=z† mod 2^ℓ.
Various embodiments are described, wherein the first party sets δA=δ′A when ⌊ρ/2^ℓ⌋ is even, and δA=1−δ′A otherwise, wherein the second party sets a value of a second comparison bit δB=δ′B when ⌊z†/2^ℓ⌋ is odd, and δB=1−δ′B otherwise, and wherein δA⊕δB=[x≤y].
In order to better understand various exemplary embodiments, reference is made to the accompanying drawings, wherein:
To facilitate understanding, identical reference numerals have been used to designate elements having substantially the same or similar structure and/or substantially the same or similar function.
The description and drawings illustrate the principles of the invention. It will thus be appreciated that those skilled in the art will be able to devise various arrangements that, although not explicitly described or shown herein, embody the principles of the invention and are included within its scope. Furthermore, all examples recited herein are principally intended expressly to be for pedagogical purposes to aid the reader in understanding the principles of the invention and the concepts contributed by the inventor(s) to furthering the art, and are to be construed as being without limitation to such specifically recited examples and conditions. Additionally, the term, “or,” as used herein, refers to a non-exclusive or (i.e., and/or), unless otherwise indicated (e.g., “or else” or “or in the alternative”). Also, the various embodiments described herein are not necessarily mutually exclusive, as some embodiments can be combined with one or more other embodiments to form new embodiments.
A comparison protocol compares two ℓ-bit integers x and y to decide whether or not x≤y. This problem arises in a variety of privacy-preserving applications including secure data mining and secure auctions.
There exist different versions of the problem depending on whether the numbers x and y are known to the respective parties or unknown to everyone, and whether the result of the comparison is public or private. In this disclosure, it is assumed that two parties have two integers (in the clear) and that they want to compare these numbers without revealing the value. Embodiments of the comparison protocol described below can also be used in the case where one of the parties has the encrypted integers and the other party has the key to decrypt encrypted integers and in this case both integers will remain unknown to the parties. The final result of the comparison protocol can be public or secretly shared between the parties.
The embodiments of the disclosure introduce new comparison protocols which increase the efficiency in both communication and computational complexities by about a factor of two as compared to the current state of the art. These embodiments include the following features:
bit comparisons, which leads to a better performance.
Damgård, Geisler, and Krøigaard present an elegant protocol for comparing private values. It was later modified in: Zekeriya Erkin, Martin Franz, Jorge Guajardo, Stefan Katzenbeisser, Inald Lagendijk, and Tomas Toft, Privacy-preserving face recognition, In I. Goldberg and M. J. Atallah, editors, Privacy Enhancing Technologies (PETS 2009), volume 5672 of Lecture Notes in Computer Science, pages 235-253. Springer, 2009; and Thijs Veugen. Improving the DGK comparison protocol, In 2012 IEEE International Workshop on Information Forensics and Security (WIFS 2012), pages 49-54. IEEE, 2012.
The comparison protocol utilizes an additively homomorphic encryption scheme. Let ⟦m⟧ denote the encryption of a message m. The homomorphic property implies that for any two messages m and m′, the encryption of m+m′ can be obtained from the encryptions of m and m′ as ⟦m+m′⟧=⟦m⟧·⟦m′⟧ for some public operation “·”. Likewise, for a known constant integer d, the encryption of dm (that is, m+m+ . . . +m (d times)) can be obtained from the encryption of m as ⟦dm⟧=⟦m⟧^d. Examples of additively homomorphic encryption schemes include the Paillier cryptosystem and the exponential variant of the ElGamal encryption scheme.
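The two homomorphic identities can be tried out with a toy Paillier instance. The following is only an illustrative sketch: the primes, the function names (`keygen`, `encrypt`, `decrypt`, `add`, `scale`) and the choice of generator n+1 are assumptions made for this example, not part of the disclosure, and the parameters are far too small to offer any security.

```python
import math
import secrets

def keygen(p=1789, q=1861):
    """Toy Paillier key pair from two small fixed primes (illustration only)."""
    n = p * q
    lam = math.lcm(p - 1, q - 1)
    mu = pow(lam, -1, n)  # valid because the generator is taken as n + 1
    return n, (lam, mu)

def encrypt(n, m):
    """Encrypt m as (1 + n)^m * r^n mod n^2 with fresh randomness r."""
    n2 = n * n
    while True:
        r = secrets.randbelow(n - 1) + 1
        if math.gcd(r, n) == 1:
            break
    return (1 + (m % n) * n) * pow(r, n, n2) % n2

def decrypt(n, sk, c):
    """Recover m from a ciphertext c using the private key (lam, mu)."""
    lam, mu = sk
    return (pow(c, lam, n * n) - 1) // n * mu % n

def add(n, c1, c2):
    """Ciphertext product: an encryption of the sum of the two plaintexts."""
    return c1 * c2 % (n * n)

def scale(n, c, d):
    """Ciphertext power: an encryption of d times the plaintext (d may be negative)."""
    return pow(c, d, n * n)
```

With these helpers, `add` plays the role of the public operation “·” and `scale` the role of exponentiation by a known constant; a negative exponent yields a subtraction, which is how terms such as an encryption of −yi are formed.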
The DGK+ protocol, a variation on the Damgård-Geisler-Krøigaard protocol, will now be described. Alice possesses a private ℓ-bit value x=Σ_{i=0}^{ℓ−1} xi·2^i while Bob possesses a private ℓ-bit value y=Σ_{i=0}^{ℓ−1} yi·2^i. The goal for Alice and Bob is to respectively obtain bits δA and δB such that δA⊕δB=[x≤y]. Here [x≤y] denotes the result of the comparison: [x≤y]=1 (true) if x≤y, and [x≤y]=0 (false) if x>y. The protocol proceeds in four steps:
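$$c_i^* = \Bigl(\llbracket s\rrbracket\cdot\llbracket x_i\rrbracket\cdot\llbracket y_i\rrbracket^{-1}\cdot\Bigl(\prod_{j=i+1}^{\ell-1}\llbracket x_j\oplus y_j\rrbracket\Bigr)^{3}\Bigr)^{r_i}, \quad 0\le i\le \ell-1, \quad\text{with } s = 1-2\delta_A$$
$$c_{-1}^* = \Bigl(\llbracket \delta_A\rrbracket\cdot\prod_{j=0}^{\ell-1}\llbracket x_j\oplus y_j\rrbracket\Bigr)^{r_{-1}}$$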
The correctness of the DGK+ protocol follows from the fact that x=(x_{ℓ−1}, . . . , x0) is smaller than y=(y_{ℓ−1}, . . . , y0) if and only if there exists an index i, with 0≤i≤ℓ−1, such that
xi<yi, and
xj=yj for ℓ−1≥j≥i+1.
When x≠y, this latter condition is equivalent to the existence of some index i, with 0≤i≤ℓ−1, such that xi−yi+1+Σ_{j=i+1}^{ℓ−1}(xj⊕yj)=0. Indeed, because (xi−yi+1)≥0 and Σ_{j=i+1}^{ℓ−1}(xj⊕yj)≥0, it follows that xi−yi+1+Σ_{j=i+1}^{ℓ−1}(xj⊕yj)=0 is equivalent to xi−yi+1=0 and (xj⊕yj)=0 for all j≥i+1, which in turn is equivalent to xi<yi and xj=yj for all j≥i+1.
Let δA∈{0,1}. The above test is replaced to allow the secret sharing of the comparison bit across Alice and Bob as [x≤y]=δA⊕δB. The new test checks the existence of some index i, with 0≤i≤ℓ−1, such that
$$c_i = x_i - y_i + (1-2\delta_A) + 3\sum_{j=i+1}^{\ell-1}(x_j\oplus y_j)$$
is zero. When δA=0 this occurs if x<y; when δA=1 this occurs if x>y. As a result, the first case yields δA=¬[x<y]=1⊕[x<y] while the second case yields δA=[x>y]=¬[x≤y]=1⊕[x≤y]. This discrepancy is corrected by augmenting the set of ci's with an additional value c−1 given by
$$c_{-1} = \delta_A + \sum_{j=0}^{\ell-1}(x_j\oplus y_j).$$
It is worth observing that c−1 can only be zero when δA=0 and x=y. Therefore, in all cases, when there exists some index i, with −1≤i≤ℓ−1, such that ci=0, then δA=1⊕[x≤y], or equivalently, [x≤y]=δA⊕1.
It is easily verified that c*i as computed in step 3 of the DGK+ protocol is the encryption of rici. Clearly, if rici is zero then so is ci because ri≠0. Hence, if one of the c*i's decrypts to 0 then [x≤y]=δA⊕1=δA⊕δB; if not, one has [x≤y]=δA=δA⊕δB. This concludes the proof of correctness.
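The correctness argument can be checked exhaustively on plaintext values for small ℓ. The sketch below is only an illustration: it evaluates the ci's in the clear, omits the encryption and the random scalars ri (which do not change whether a value is zero), and uses hypothetical helper names (`bits`, `dgk_plus_values`, `check`).

```python
from itertools import product

def bits(v, ell):
    """Little-endian bit list of v: bits(v, ell)[i] is bit i of v."""
    return [(v >> i) & 1 for i in range(ell)]

def dgk_plus_values(x, y, ell, delta_a):
    """Plaintext values c_{-1}, c_0, ..., c_{ell-1} of the DGK+ test."""
    xb, yb = bits(x, ell), bits(y, ell)
    xor = [a ^ b for a, b in zip(xb, yb)]
    values = [delta_a + sum(xor)]  # c_{-1}
    for i in range(ell):
        values.append(xb[i] - yb[i] + (1 - 2 * delta_a) + 3 * sum(xor[i + 1:]))
    return values

def check(ell):
    """True if delta_A xor delta_B equals [x <= y] for all ell-bit x, y and both delta_A."""
    for x, y, delta_a in product(range(2 ** ell), range(2 ** ell), (0, 1)):
        # Bob's share is 1 exactly when one of the values is zero.
        delta_b = int(any(c == 0 for c in dgk_plus_values(x, y, ell, delta_a)))
        if delta_a ^ delta_b != int(x <= y):
            return False
    return True
```

Calling `check(4)` walks through all 256 input pairs for both choices of δA; the factor 3 in front of the sum is what keeps ci nonzero whenever a more significant bit differs.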
In section II-A of Veugen, the author does not explicitly mention that c−1 has to be randomized by an invertible scalar r−1. This step is important as otherwise, assuming δA=1 (which occurs with probability ½), if Bob decrypts one of the c*i's to 1, he can deduce that x is very likely equal to y.
The Damgård-Geisler-Krøigaard (DGK) protocol has the disadvantage of being computationally intensive. Step 3 of the DGK protocol is dominated by ℓ+1 exponentiations in the group underlying the homomorphic encryption scheme. Those are costly operations. This issue was addressed in Veugen, which divides the computational workload by approximately a factor of two. However, the resulting implementation is subject to timing attacks. Another drawback of the DGK protocol is the communication cost. Step 3 in the DGK protocol produces ℓ+1 ciphertexts that are transmitted from Alice to Bob.
An embodiment of a comparison method will now be described that reduces by roughly a factor of two both the computational complexity and the necessary bandwidth for step 3. Furthermore, provided it is properly implemented, the proposed method is resistant against timing attacks.
The same setting as described above will be used again where Alice possesses an ℓ-bit integer x and Bob possesses an ℓ-bit integer y. The goal is for Alice and Bob to respectively obtain bits δA and δB such that δA⊕δB=[x≤y].
A first embodiment of the privacy comparison protocol, as illustrated in
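$$c_i^* = \Bigl(\llbracket 1+(1-2\delta_A)x_i\rrbracket\cdot\llbracket y_i\rrbracket^{2\delta_A-1}\cdot\prod_{j=i+1}^{\ell-1}\llbracket x_j\oplus y_j\rrbracket\Bigr)^{r_i}, \quad i\in\mathcal{L}$$
$$c_{-1}^* = \Bigl(\llbracket \delta_A\rrbracket\cdot\prod_{j=0}^{\ell-1}\llbracket x_j\oplus y_j\rrbracket\Bigr)^{r_{-1}}$$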
The correctness of this protocol will now be discussed. It is useful to introduce some notation. For a t-bit integer a=Σ_{i=0}^{t−1} ai·2^i with ai∈{0,1}, let ā denote the complement of a; i.e., ā=2^t−a−1. In particular, for t=1, a=a0 and ā=ā0=1−a0.
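As a first proposition let x=Σ_{i=0}^{ℓ−1} xi·2^i and y=Σ_{i=0}^{ℓ−1} yi·2^i, with xi, yi∈{0,1}, be two ℓ-bit integers. Define
$$c_i = x_i + \bar{y}_i + \sum_{j=i+1}^{\ell-1}(x_j\oplus y_j), \quad 0\le i\le \ell-1.$$
Then x<y if and only if there exists some unique index i with 0≤i≤ℓ−1 such that ci=0.
This may be proved as follows. As defined, ci is the sum of nonnegative terms. Therefore, ci=0 is equivalent to (i) xi=ȳi=0, that is, xi=0 and yi=1, and (ii) xj=yj for all j with i+1≤j≤ℓ−1. These conditions hold exactly when x<y and i is the most significant position at which x and y differ, which also shows that the index is unique.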
As a second proposition, let x=Σ_{i=0}^{ℓ−1} xi·2^i and y=Σ_{i=0}^{ℓ−1} yi·2^i, with xi, yi∈{0,1}, be two ℓ-bit integers. Define
$$c_{-1} = \sum_{j=0}^{\ell-1}(x_j\oplus y_j).$$
Then x=y if and only if c−1=0. The proof is obvious.
By reversing the roles of x and y in the first proposition, the following corollary results: as a third proposition let x=Σ_{i=0}^{ℓ−1} xi·2^i and y=Σ_{i=0}^{ℓ−1} yi·2^i, with xi, yi∈{0,1}, be two ℓ-bit integers. Define
$$c_i = \bar{x}_i + y_i + \sum_{j=i+1}^{\ell-1}(x_j\oplus y_j), \quad 0\le i\le \ell-1.$$
Then x≤y if and only if there exists no index i with 0≤i≤ℓ−1 such that ci=0.
This may be proved as follows. If there were such an index i, this would imply y<x by the first proposition. The absence of such an index therefore implies y≥x.
Suppose first that the Hamming weight of x is greater than ⌊ℓ/2⌋ (and thus δA=0). This means that x has more ones than zeros in its binary representation. Specifically, among the ℓ bits of x, at most ⌊ℓ/2⌋ bits are equal to 0. Furthermore, the first proposition shows that ci needs only to be evaluated when xi=0 because when xi=1, it is already known that the corresponding ci cannot be zero. The case x=y is taken into account using the second proposition.
Now suppose that the Hamming weight of x is less than ⌈ℓ/2⌉ (and thus δA=1). In this case, among the ℓ bits of x, at most ⌊ℓ/2⌋ bits are equal to 1. Then the third proposition can be used: with at most ⌊ℓ/2⌋ tests for ci=0 (i.e., when xi=1), it can be decided whether x≤y.
The last case is when the Hamming weight of x is ℓ/2 (and thus δA is equiprobably equal to 0 or 1). This supposes that ℓ is even. In this case, among the ℓ bits of x, ℓ/2 bits are equal to 0 and ℓ/2 bits are equal to 1. The combination of the first and second propositions or the third proposition can be used indifferently to decide after at most ℓ/2=⌊ℓ/2⌋ tests for ci=0 whether or not x≤y.
The above analysis shows that (i) only the indexes i∈ℒ′ need to be tested, where ℒ′ denotes the set of indexes i with xi=δA, and (ii) #ℒ′≤⌊ℓ/2⌋. If #ℒ′<⌊ℓ/2⌋ then additional indexes are added to ℒ′ to form ℒ. This ensures that #ℒ is always equal to ⌊ℓ/2⌋ and is aimed at preventing timing attacks. Since the value of rici is nonzero for i∉ℒ′, the correctness follows by noting that the c*i's include the encryptions of rici for all i∈ℒ′.
By construction, δB=1 if one of the c*i's decrypts to 0.
If none of the c*i's decrypts to 0, then δB=0. When δA=0, this means x>y; when δA=1, this means x≤y. In both cases, [x≤y]=δA⊕δB, as desired.
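The case analysis above can be exercised on plaintext values. The following sketch is an illustration under stated assumptions: it works in the clear (no encryption, no scalars ri, no shuffling of ciphertexts), it pads the index set with the lowest unused indexes, which is one arbitrary way to reach ⌊ℓ/2⌋ entries, and all function names are hypothetical.

```python
import secrets

def bits(v, ell):
    """Little-endian bit list of v: bits(v, ell)[i] is bit i of v."""
    return [(v >> i) & 1 for i in range(ell)]

def alice_choose(x, ell):
    """Pick delta_A from the Hamming weight of x and build an index set of size ell // 2."""
    xb = bits(x, ell)
    h = sum(xb)
    half = ell // 2
    if h > half:
        delta_a = 0
    elif h < (ell + 1) // 2:
        delta_a = 1
    else:  # h == ell / 2, only possible for even ell
        delta_a = secrets.randbelow(2)
    needed = [i for i in range(ell) if xb[i] == delta_a]
    padding = [i for i in range(ell) if xb[i] != delta_a]
    # Pad so that the set always has exactly ell // 2 entries.
    return delta_a, needed + padding[: half - len(needed)]

def plain_values(x, y, ell, delta_a, index_set):
    """Plaintext c_{-1} followed by c_i for each i in the index set."""
    xb, yb = bits(x, ell), bits(y, ell)
    xor = [a ^ b for a, b in zip(xb, yb)]
    values = [delta_a + sum(xor)]  # c_{-1}
    for i in index_set:
        values.append(1 + (1 - 2 * delta_a) * xb[i]
                      + (2 * delta_a - 1) * yb[i] + sum(xor[i + 1:]))
    return values

def compare(x, y, ell):
    """Return (delta_A, delta_B, number of tested indexes)."""
    delta_a, index_set = alice_choose(x, ell)
    values = plain_values(x, y, ell, delta_a, index_set)
    delta_b = int(any(c == 0 for c in values))
    return delta_a, delta_b, len(index_set)
```

For any ℓ-bit inputs, `compare` returns shares whose XOR is [x≤y] while testing only ⌊ℓ/2⌋ indexes plus c−1, compared with ℓ indexes plus c−1 in the DGK+ test.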
A second embodiment of a comparison method will now be described. In the setting where Alice possesses x and Bob y and where Alice and Bob wish to respectively obtain δA and δB such that δA⊕δB=δ with δ=[x≤y], the previously described first protocol needs special care. In particular, it requires that the Hamming weight of x a priori has the same probability to be greater than ⌊ℓ/2⌋ or less than ⌈ℓ/2⌉. This guarantees that δA is uniformly distributed over {0,1}. Indeed, if Bob knows for example that the Hamming weight of x is more likely greater than ⌊ℓ/2⌋ (and thus δA is more likely equal to 0), a value δB=0 tells Bob that x is more likely greater than y because δA⊕δB=[x≤y].
The second embodiment described below is secure even when Bob has some a priori knowledge on the Hamming weight of x. The distribution of δA will always be uniform over {0,1}, independently of the value of x.
The second embodiment of the privacy comparison protocol, as shown in
The correctness of this protocol will now be discussed.
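Define X′=⌊ρ/2^ℓ⌋, Y′=⌊z†/2^ℓ⌋, and δ′=δ′A⊕δ′B=[x′≤y′]. The values for δA and δB such that δA⊕δB=[x≤y] can be obtained from δ′A and δ′B, respectively. Indeed:
$$[x\le y] = \Bigl\lfloor\frac{y-x+2^{\ell}}{2^{\ell}}\Bigr\rfloor = \Bigl\lfloor\frac{z^{\dagger}-\rho}{2^{\ell}}\Bigr\rfloor = Y' - X' - [y'<x'] = Y' - X' - 1 + \delta'.$$
Therefore, modulo 2, one has:
δA+δB≡Y′+X′+δ′A+δ′B+1 (mod 2);
a solution of which is: δA=(δ′A+X′) mod 2 and δB=(δ′B+Y′+1) mod 2.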
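The share correction can be checked numerically on plaintext values. In the sketch below, the inner comparison of x′ and y′ is replaced by a stand-in that splits [x′≤y′] into two random shares, the mask range [0, 2^(ℓ+κ)) is an assumption of this example, and the function name `corrected_shares` is hypothetical.

```python
import secrets

def corrected_shares(x, y, ell, kappa):
    """Plaintext walk-through of the masking step and the share correction."""
    rho = secrets.randbelow(2 ** (ell + kappa))  # Alice's random mask (assumed range)
    z = y - x + 2 ** ell + rho                   # plaintext of the masked value
    x_prime = rho % 2 ** ell                     # Alice's new input
    y_prime = z % 2 ** ell                       # Bob's new input after decryption
    # Stand-in for the inner comparison protocol: random shares of [x' <= y'].
    delta_prime = int(x_prime <= y_prime)
    delta_a_prime = secrets.randbelow(2)
    delta_b_prime = delta_prime ^ delta_a_prime
    big_x, big_y = rho >> ell, z >> ell          # X' and Y'
    delta_a = (delta_a_prime + big_x) % 2
    delta_b = (delta_b_prime + big_y + 1) % 2
    return delta_a, delta_b
```

Because ρ is drawn independently of x, the parity of X′ is close to uniform, which is what makes Alice's corrected share δA uniform regardless of the Hamming weight of x.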
The first and second embodiments may also produce an encrypted comparison bit as will now be described. Let δ denote the comparison bit; i.e., δ=[x≤y]. In certain settings, Alice wishes to produce an encryption of δ at the end of the protocol, rather than a share δA of δ (the other share, δB, being held by Bob). In this case, the following step may be added to the first and second embodiments of the comparison protocols: Bob sends ⟦δB⟧ to Alice, and Alice computes ⟦δ⟧=⟦δB⟧ when δA=0 and ⟦δ⟧=⟦1⟧·⟦δB⟧⁻¹ when δA≠0.
In yet other embodiments, the inputs may be encrypted. There exists another practical setting for the comparison of private inputs. In this setting, Alice possesses ⟦x̂⟧ and ⟦ŷ⟧, the encryptions of two ℓ-bit values x̂=Σ_{i=0}^{ℓ−1} x̂i·2^i and ŷ=Σ_{i=0}^{ℓ−1} ŷi·2^i. Bob possesses the corresponding decryption key. The goal is for Alice to get ⟦δ̂⟧, the encryption under Bob's public key of the comparison bit δ̂=[x̂≤ŷ]. The protocols described in the embodiments above may be used in that setting as well. Let κ be a security parameter. Alice first chooses a random (ℓ+κ)-bit integer μ and, from ⟦x̂⟧ and ⟦ŷ⟧, computes ⟦z*⟧ where
z*=ŷ+2^ℓ−x̂+μ
as ⟦z*⟧=⟦ŷ⟧·⟦x̂⟧⁻¹·⟦2^ℓ+μ⟧. Alice also defines x=μ mod 2^ℓ. Alice sends ⟦z*⟧ to Bob.
Bob decrypts ⟦z*⟧ to get z* and defines y=z* mod 2^ℓ.
Again, it is worth noting that x and y are ℓ-bit integers privately held by Alice and Bob, respectively. This is a setting similar to the one considered above. Let δ=[x≤y] and assume that Alice obtained ⟦δ⟧ as the output of the first or second comparison protocol embodiments. It will be shown below how Alice can get ⟦δ̂⟧ from ⟦δ⟧.
Define X=⌊μ/2^ℓ⌋ and Y=⌊z*/2^ℓ⌋. Note that the problem of finding δ̂ boils down to the problem of finding δ. Indeed:
$$\hat{\delta} = [\hat{x}\le\hat{y}] = \Bigl\lfloor\frac{\hat{y}-\hat{x}+2^{\ell}}{2^{\ell}}\Bigr\rfloor = \Bigl\lfloor\frac{z^{*}-\mu}{2^{\ell}}\Bigr\rfloor = Y - X - [y<x] = Y - X - 1 + \delta;$$
hence, Alice may compute ⟦δ̂⟧ from ⟦δ⟧ and ⟦Y⟧ as
⟦δ̂⟧=⟦Y⟧·⟦X+1⟧⁻¹·⟦δ⟧.
It suffices that Bob sends the value of ⟦Y⟧ to Alice.
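The encrypted-input flow can be traced end to end with a toy Paillier instance. This is a sketch under several assumptions: the primes are tiny and insecure, both roles run in one function, the encrypted bit ⟦δ⟧ is produced by a stand-in (comparing x and y in the clear) instead of one of the comparison protocols, and the names `enc`, `dec` and `encrypted_comparison_bit` are hypothetical.

```python
import math
import secrets

P, Q = 1789, 1861                 # toy primes, far too small for real use
N, N2 = P * Q, (P * Q) ** 2
LAM = math.lcm(P - 1, Q - 1)
MU = pow(LAM, -1, N)              # valid for the generator N + 1

def enc(m):
    """Toy Paillier encryption of m under Bob's public key N."""
    r = secrets.randbelow(N - 1) + 1
    while math.gcd(r, N) != 1:
        r = secrets.randbelow(N - 1) + 1
    return (1 + (m % N) * N) * pow(r, N, N2) % N2

def dec(c):
    """Toy Paillier decryption with Bob's private key."""
    return (pow(c, LAM, N2) - 1) // N * MU % N

def encrypted_comparison_bit(x_hat, y_hat, ell=4, kappa=8):
    """Return the decryption of the encrypted bit Alice ends up with."""
    enc_x, enc_y = enc(x_hat), enc(y_hat)        # ciphertexts held by Alice
    mu = secrets.randbelow(2 ** (ell + kappa))   # Alice's random mask
    # Encryption of z* = y_hat + 2^ell - x_hat + mu, computed on ciphertexts.
    enc_z = enc_y * pow(enc_x, -1, N2) * enc(2 ** ell + mu) % N2
    x, big_x = mu % 2 ** ell, mu >> ell          # Alice's x and X
    z = dec(enc_z)                               # Bob decrypts z*
    y, big_y = z % 2 ** ell, z >> ell            # Bob's y and Y
    enc_big_y = enc(big_y)                       # Bob sends an encryption of Y
    enc_delta = enc(int(x <= y))                 # stand-in for the comparison protocol
    # Encryption of Y - (X + 1) + delta, computed by Alice on ciphertexts.
    enc_delta_hat = enc_big_y * pow(enc(big_x + 1), -1, N2) * enc_delta % N2
    return dec(enc_delta_hat)
```

The final decryption is included only to check the result; in the setting described above Alice keeps the ciphertext and never learns the comparison bit.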
The embodiments described herein may be used in various applications. The comparison of private values is an essential building block for developing privacy-preserving machine-learning algorithms. These include the very popular SVM (support vector machines) algorithm as well as k-means clustering. In a secure clustering algorithm, a user profile is compared with the cluster centroids.
Comparison protocols also have a pivotal role in authentication services. In fingerprint-based authentication, a biometric device (fingerprint reader) identifies a user by comparing her sample with the database of authorized entities. Such comparisons are also used in face recognition. In private recommender systems, the user value is compared with a threshold. Comparison algorithms also have applications in secure matrix factorization, private bio-informatic services, and secure adaptive filtering.
The embodiments described in this disclosure therefore find numerous applications. Remarkably, the resulting performance is greatly improved compared to state-of-the-art methods, both in computation and communication complexities.
The embodiments described herein represent an improvement in the technology of secure comparisons of data by a single party who does not have access to the underlying secure data or between two parties who are keeping their own information secret from the other party. These embodiments provide a reduction in the amount of computation needed to perform these secure comparisons as well as a reduction in the amount of data that needs to be exchanged between the parties doing the comparison. As a result, the embodiments also lead to an improvement in terms of the number of operations of a computer that may be used to carry out such secure comparisons.
The methods described above may be implemented in software which includes instructions for execution by a processor stored on a non-transitory machine-readable storage medium. The processor may include a memory that stores the instructions for execution by the processor.
Any combination of specific software running on a processor to implement the embodiments of the invention constitutes a specific dedicated machine.
As used herein, the term “non-transitory machine-readable storage medium” will be understood to exclude a transitory propagation signal but to include all forms of volatile and non-volatile memory. Further, as used herein, the term “processor” will be understood to encompass a variety of devices such as microprocessors, field-programmable gate arrays (FPGAs), application-specific integrated circuits (ASICs), and other similar processing devices. When software is implemented on the processor, the combination becomes a single specific machine.
It should be appreciated by those skilled in the art that any block diagrams herein represent conceptual views of illustrative circuitry embodying the principles of the invention.
Although the various exemplary embodiments have been described in detail with particular reference to certain exemplary aspects thereof, it should be understood that the invention is capable of other embodiments and its details are capable of modifications in various obvious respects. As is readily apparent to those skilled in the art, variations and modifications can be effected while remaining within the spirit and scope of the invention. Accordingly, the foregoing disclosure, description, and figures are for illustrative purposes only and do not in any way limit the invention, which is defined only by the claims.