Various exemplary embodiments disclosed herein relate generally to distance-revealing encryption.
Distance-revealing encryption is a primitive related to functional encryption (FE), a generalization of public-key encryption which allows a party to learn a function of the input plaintext (or multiple plaintexts in the case of multi-input functional encryption (MIFE)). More specifically, distance-revealing encryption adds the useful feature that given any two ciphertexts, the Euclidean distance between the corresponding plaintexts—viewed as vectors—can be evaluated publicly; that is, without the knowledge of the private decryption key.
A brief summary of various exemplary embodiments is presented below. Some simplifications and omissions may be made in the following summary, which is intended to highlight and introduce some aspects of the various exemplary embodiments, but not to limit the scope of the invention. Detailed descriptions of an exemplary embodiment adequate to allow those of ordinary skill in the art to make and use the inventive concepts will follow in later sections.
Various embodiments relate to method for performing distance revealing encryption, including: generating a tuple (1,2,T,ê), where 1=g1, 2=g2 and T are groups whose order is |1|=|2|=|T|=p with p prime and ê:1×2→T is a bilinear pairing and is non-degenerate; randomly select a first plurality elements and a second plurality of elements; defining a parameter h that is based upon ê,g1,g2, and the first plurality of elements; selecting a pseudo-random function family indexed by a secret key K; setting public parameters as ê,1,2,T,p,h; setting private parameters as g1, g2, the first plurality of elements, the second plurality of elements, the pseudo-random function family, and the secret key K; encrypting a message {right arrow over (x)}=(x1, . . . ,xn), when n is the number of components in the message by:
choosing a random value r; computing a plurality of values of the pseudo random function; and computing a first plurality of ciphertext values based upon g1, the random value r, the first plurality of elements, the second plurality of elements, the plurality of values of the pseudo random function, and message {right arrow over (x)}; and computing a second plurality of ciphertext values based upon g2, the random value r, the first plurality of elements, the second plurality of elements, the plurality of values of the pseudo random function, and message {right arrow over (x)}.
Further various embodiments relate to a non-transitory machine-readable storage medium encoded with instructions for execution by a processor for performing distance revealing encryption, including: instructions for generating a tuple 1,2,T, ê, where 1=g1, 2=g2 and T are groups whose order is |1|=|2|=|T|=p with p prime and ê: 1×2→T is a bilinear pairing and is non-degenerate; instructions for randomly select a first plurality elements and a second plurality of elements; instructions for defining a parameter h that is based upon ê,g1,g2, and the first plurality of elements; instructions for selecting a pseudo-random function family indexed by a secret key K; instructions for setting public parameters as ê,1,2,T,p,h; instructions for setting private parameters as g1, g2, the first plurality of elements, the second plurality of elements, the pseudo-random function family, and the secret hey K; instructions for encrypting a message {right arrow over (x)}=(x1, . . . ,xn), when n is the number of components in the message by: choosing a random value r; computing a plurality of values of the pseudo random function; and computing a first plurality of ciphertext values based upon g1, the random value r, the first plurality of elements, the second plurality of elements, the plurality of values of the pseudo random function, and message {right arrow over (x)}; and computing a second plurality of ciphertext values based upon g2, random value r, the first plurality of elements, the second plurality of elements, the plurality of values of the pseudo random function, and message {right arrow over (x)}.
Various embodiments are described, wherein the first plurality of elements are α,β,ξ,η and are selected such that (α+β)ξη≢0 (mod p), and the second plurality of elements are μ1, . . . ,μn and are selected such that Σi=1nμi2≡0 (mod p).
Various embodiments are described, wherein the parameter h=ê(b1,g2)(α+β)ξη.
Various embodiments are described, wherein the first plurality of ciphertext values are calculated as ci,1=g1ξ(x
Various embodiments are described, wherein ti=FK(τ,i) where tag τ is a value associated with the message {right arrow over (x)}.
Various embodiments are described, further including determining the distance between two encrypted messages {right arrow over (x)} and {right arrow over (x)}′ by calculating:
Further various embodiments relate to a method for performing distance revealing encryption, including: generating a tuple (,T,ê), where =g and T are groups whose order is ||=|T|=p with p prime and ê:×→T is a bilinear pairing and is non-degenerate; randomly select a first plurality elements and a second plurality of elements; defining a parameter h that is based upon ê, and the first plurality of elements; selecting a pseudo-random function family indexed by a secret key K; setting public parameters as ê,,T,p,h, setting private parameters as g, the first plurality of elements the second plurality of elements, the pseudo-random function family, and the secret key K; encrypting a message {right arrow over (x)}=(x1, . . . ,xn), when n is the number of elements in the message by: choosing a random value r; computing a plurality of values of the pseudo random function; and computing a plurality of ciphertext values based upon g, the random value r, the first plurality of elements, the second plurality of elements, the plurality of values of the pseudo random function, and message {right arrow over (x)}.
Further various embodiments relate to a non-transitory machine-readable storage medium encoded with instructions for execution by a processor for performing distance revealing encryption, including: instructions for generating a tuple (,T,ê), where =g and T are groups whose order is ||=|T|=p with p prime and ê:×→T is a bilinear pairing and is non-degenerate; instructions for randomly select a first plurality elements and a second plurality of elements; instructions for defining a parameter h that is based upon ê, g, and the first plurality of elements; instructions for selecting a pseudo-random function family indexed by a secret key K; instructions for setting public parameters as ê,,T,p,h, instructions for setting private parameters as g, the first plurality of elements, the second plurality of elements, the pseudo-random function family, and the secret key K; instructions for encrypting a message {right arrow over (x)}=(x1, . . . ,xn), when n is the number of elements in the message by: choosing a random value r; computing a plurality of values of the pseudo random function; and computing a plurality of ciphertext values based upon g, the random value r, the first plurality of elements, the second plurality of elements, the plurality of values of the pseudo random function, and message {right arrow over (x)}.
Various embodiments are described, wherein the first plurality of elements are α,β,ξ,η and are selected such that (α+β)ξη≢0 (mod p), and the second plurality of elements are μ1, . . . ,μn and are selected such that Σi=1nμi2≡0 (mod p).
Various embodiments are described, wherein the parameter h=ê(g1,g2)(a+β)ξη.
Various embodiments are described, wherein the first plurality of ciphertext values are calculated as ci,1=g1ξ(x
Various embodiments are described, wherein ti=FK(τ, i) where tag τ is a value associated with the message {right arrow over (x)}.
Various embodiments are described, further including determining the distance between two encrypted messages {right arrow over (x)} and {right arrow over (x)}′ by calculating:
In order to better understand various exemplary embodiments, reference is made to the accompanying drawings, wherein:
To facilitate understanding, identical reference numerals have been used to designate elements having substantially the same or similar structure and/or substantially the same or similar function.
The description and drawings illustrate the principles of the invention. It will thus be appreciated that those skilled in the art will be able to devise various arrangements that, although not explicitly described or shown herein, embody the principles of the invention and are included within its scope. Furthermore, all examples recited herein are principally intended expressly to be for pedagogical purposes to aid the reader in understanding the principles of the invention and the concepts contributed by the inventor(s) to furthering the art, and are to be construed as being without limitation to such specifically recited examples and conditions. Additionally, the term, “or,” as used herein, refers to a non-exclusive or (i.e., and/or), unless otherwise indicated (e.g., “or else” or “or in the alternative”). Also, the various embodiments described herein are not necessarily mutually exclusive, as some embodiments can be combined with one or more other embodiments to form new embodiments.
Two embodiments of distance-revealing encryption schemes are described below. These schemes make use of non-degenerate bilinear maps. Those maps may be constructed from pairings over elliptic curves. These two embodiments provide for a secure and computationally efficient method for encrypting data that can then be processed to determine the distance between two data points from their encrypted values, without ever decrypting.
There are two forms of bilinear maps commonly used in the cryptography literature. The first are of the form ê:1×2→T, where 1, 2 and T are cyclic groups of prime order p. The second are of the form ê:×→T where and T are cyclic groups of prime order p.
The second form can be seen as a particular case of the first one by setting 1=2=. Embodiments using both forms are described. The different groups will be written multiplicatively with identity element 1.
Let 1=g1 and 2=g2. Two properties of map ê are important: 1) bilinearity for any a, b∈p, ê(g1a,g2b)=ê(g1,g2) and 2) non-degeneracy ê(g1,g2)≠1.
The non-degeneracy property implies that ê(g1,g2) is a generator of the target group; that is, T=ê(g1,g2).
By construction, a good encryption scheme should be such that its output is uniformly distributed over the ciphertext space. Assume the message space is the set of n-dimension vectors whose components are l-bit strings, =({0,1}l)n. Consider also a pseudo-random function family, FK:{0,1}*×{0,1}n→p, indexed by a secret key K. A simple construction for distance-revealing encryption could be as follows: given a message {right arrow over (x)}=(x1, . . . ,xn)∈ with tag τ, its encryption is given by =(c1, . . . ,cn) where
c
i
=[F
K(τ,i)+xi]mod p.
Then the (square of) distance between any two plaintexts, given their ciphertexts =(c1, . . . ,cn) and ′=(c′1, . . . ,c′n), can be obtained as:
While this encryption scheme is very efficient, this encryption scheme leaks to much information. In particular, it provides the difference between any two components of plaintexts; namely, ci−c′i=xi−x′i (mod p) for any i∈{1, . . . ,n}. In turn, this implies that the knowledge of a single pair of plaintext/ciphertext allows one to decrypt any ciphertext.
Embodiments will now be described that aim at providing solutions so that only the distance between plaintexts can be inferred from the ciphertexts. Two schemes are proposed. The first one makes use of asymmetric pairings and the second one of symmetric pairings.
The first scheme may be described by the following steps.
First, define a Setup (1K,n) algorithm. The value n indicates the number of components in the message (viewed as a vector of dimension n) to be encrypted. On the input security parameter 1K, the Setup (1k,n) algorithm generates the tuple (1,2,T,ê) where 1=g1, 2=g2 and T are (multiplicatively written) groups whose order is |1|=|2|=|T|=p with p prime, and ê:1×2→T is a bilinear pairing. The message space is a subset of (p)n; for example, =({0,1}l−1)n where l denotes the bit-length of p. The Setup (1K,n) algorithm also selects n+4 elements α,β,ξ,η,μ1, . . . ,μn∈p such that:
(α+β)ξη≢0 (mod p); and
Σi=1nμi2≡0 (mod p).
Next, the Setup (1K,n) algorithm defines h=ê(g1,g2)(α+β)ξη∈T. Finally, the Setup (1K,n) algorithm selects a pseudo-random function family, FK:{0,1}*×{1, . . . ,n}→p, indexed by a secret key and randomly chooses
The public parameters are pp=(ê,1,2,T,p,h) and the secret key is sk=(g1,g2,α,β,ξ,η,{μi}1≤i≤n,FK,K). This is public key and the private key.
Next, define an Enc (sk,{right arrow over (x)},τ) algorithm. The encryption of message {right arrow over (x)}=(x1, . . . ,xn)∈ with tag τ (which is an identifier associated with the message) is obtained by performing the following two steps:
1) choose uniformly at random
and
2) for 1≤i≤n, compute ti=FK(τ,i) and
The ciphertext is =(C,D,τ) with C=(c1,1,c1,2, . . . ,cn,1,cn,2)∈(1)2n and D=(d1,1,d1,2, . . . ,dn,1,dn,2)∈(2)2n.
Finally, define an Eval (pp,,′) algorithm. The Eval (pp,,′) algorithm parses as
((c1,1,c1,2, . . . ,cn,1,cn,2),(d1,1,d1,2, . . . ,dn,1,dn,2),τ) and ′ as
((c′1,1,c′1,2, . . . ,c′n,1,c′n,2),(d′1,1,d1,2, . . . ,d′n,1,d′n,2),τ).
If τ′≠τ, the Eval (pp,,′) algorithm returns ⊥. Otherwise, if τ′=τ, the Eval (pp,,′) algorithm evaluates the product
The Eval (pp,,′) algorithm then obtains Σi=1n(xi−x′i)2 (mod p) as the discrete logarithm of Z with respect to base h in T.
The correctness of the Eval (pp,,′) algorithm is easily verified. From
the following results:
As a result, the following is obtained:
which is the desired result.
The second scheme may be described by the following steps.
First, define a Setup (1K,n) algorithm. On input security parameter 1K, the Setup (1K,n) algorithm generates the tuple (,T,ê) where =g and T are (multiplicatively written) groups whose order is ||=|T|=p with p prime, and ê:×→T is a bilinear pairing. The message space is a subset of (p)n. The Setup (1K,n) algorithm also selects n+6 elements α,β,γ,δ,ξ,η,μ1, . . . ,μn∈p p such that:
(1+α2+γ2)ξ2+(1+β2+δ2)η2≢0 (mod p); and
Σi=1nμi2≡0 (mod p).
The Setup (1K,n) algorithm defines h=ê(g,g)(1+α
The public parameters are pp=(ê,,T,p,h) and the secret key is sk=(g,α,β,γ,δ,ξ,η,{μi}1≤i≤n,FK,K). This is the public key and the private key.
Next, define a Enc (sk,{right arrow over (x)},τ) algorithm. The encryption of message {right arrow over (x)}=(x1, . . . ,xn)∈ with tag τis obtained in two steps as:
1) choose uniformly at random
and
2) for 1≤i≤n, compute t1=FK(τ,i) and
The ciphertext is =(C,τ) with
C=(c1,1,c1,2,c1,3,c1,4,c1,5,c1,6, . . . ,cn,1,cn,2,cn,3,cn,4,cn,5,cn,6)∈()6n.
Finally, define an Eval (pp, ,′) algorithm. The Eval (pp,,′) algorithm parses as
((c1,1,c1,2,c1,3,c1,4,c1,5,c1,6, . . . ,cn,1,cn,2,cn,3,cn,4,cn,5,cn,6),τ)
and ′ as
((c′1,1,c′1,2,c′1,3,c′1,4,c′1,5,c′1,6, . . . ,c′n,1,c′n,2,c′n,3,c′n,4,c′n,5,c′n,6),τ′).
If τ′≠τ, the Eval (pp,,′) algorithm returns ⊥. Otherwise, if τ′=τ, it evaluates the product
The Eval (pp,,′) algorithm then obtains Σi=1n(x1−x′i)2 (mod p) as the discrete logarithm of Z with respect to base h in T.
The correctness of the Eval (pp,,′) algorithm is easily verified. From
the following results:
As a result, the following is obtained:
which is the desired result.
The embodiments described above may be used in various applications. For example, in privacy-preserving applications, the evaluation of distance between two vectors serves many different applications. Two categories are focused on: (i) evaluation of distance between two vectors for machine learning algorithms; and (ii) evaluation of physical proximity.
First, application of the embodiments to machine learning will be described. Very large sample sets can require a lot of storage and computational resources in order to carry out the algorithm. In order to be able to handle large datasets, the computation will be offloaded to a datacenter such a cloud computing platform. However, the raw data or feature vectors can often constitute a valuable training set that cannot be released due to privacy concerns, financial value, etc. Therefore machine learning on the data should be performed without actually exposing the data to the server.
Many useful machine learning algorithms for clustering and classification of data use the distance between data-points or the feature vectors extracted from the data. These include known spectral clustering and classification methods. For instance the Gaussian kernel may be applied with the form
to separate points lying on concentric circles using a linear classifier.
Distances are used to calculate adjacency matrices which are used to compute Laplacian or diffusion matrices. The spectral decomposition of these matrices and choice of the eigenvectors corresponding to the largest eigenvalues result in an often useful non-linear dimensionality reduction of the data. Non-linear dimensionality reduction has been successfully applied to such tasks as computer vision, image classification and segmentation, speaker identification and verification, anomaly detection, etc.
Second, application of the embodiments to private evaluation of geo-proximity will be described. A central service is proposed that users may enroll in to get notifications when they are in proximity to a friend, similar to features supported nowadays by social networks. Two users Ui and Uj, who are interested in discovering when they are near one another, enroll in the service after generating a shared key. They occasionally submit their coordinates to the server, encrypted using the schemes described above, using the pre-shared key and a common tag. The pseudo random function used to generate the common tag may incorporate the time in order to prevent evaluating the distance with a prior location ciphertext.
The server evaluates distances for all pairs i,j that share a common tag, and notifies users whose distance from one another is below a specified threshold. Such a service eliminates the need to broadcast location to all social contacts, and instead just send it to the server.
Next, application of the embodiments to two-factor biometric and digital authentication is described. It is proposed to use distance evaluation for fuzzy-matching of biometric credentials. Biometric patterns matching has to allow for small mismatches due to the fuzzy nature of biometric credentials. The matching of biometric credentials therefore measures whether the distance between the stored reference pattern, and the reading is below a certain predefined threshold.
Due to the sensitivity of biometric credentials, it is desirable to avoid storing the biometric credentials in the clear in a central database. A biometric authentication system may include a central service with a database that stores encrypted biometric credentials, smart-IDs (electronic tokens) distributed to users, and biometric readers. An important point is that the biometric readers do not store any data in the clear, but rather encrypt it using encryption keys provided by the smart-IDs.
Any combination of specific software running on a processor to implement the embodiments of the invention, constitute a specific dedicated machine.
As used herein, the term “non-transitory machine-readable storage medium” will be understood to exclude a transitory propagation signal but to include all forms of volatile and non-volatile memory. Further, as used herein, the term “processor” will be understood to encompass a variety of devices such as microprocessors, field-programmable gate arrays (FPGAs), application-specific integrated circuits (ASICs), and other similar processing devices. When software is implemented on the processor, the combination becomes a single specific machine.
It should be appreciated by those skilled in the art that any block diagrams herein represent conceptual views of illustrative circuitry embodying the principles of the invention.
Although the various exemplary embodiments have been described in detail with particular reference to certain exemplary aspects thereof, it should be understood that the invention is capable of other embodiments and its details are capable of modifications in various obvious respects. As is readily apparent to those skilled in the art, variations and modifications can be effected while remaining within the spirit and scope of the invention. Accordingly, the foregoing disclosure, description, and figures are for illustrative purposes only and do not in any way limit the invention, which is defined only by the claims.