User authentication is a critical component of today's Internet applications. Users login to email applications to send/receive emails and increasingly to enable a second authentication factor for other applications; to bank websites to check balances and execute transactions; and to payment apps to transfer money between family and friends. User authentication is also an integral part of any enterprise access control mechanism that administers access to sensitive data, services and resources.
For the last three decades, password-based authentication has been the dominant approach for authenticating users, by relying on “what users know”. But this approach can have security and usability issues. First, in password-based authentication, the servers store a function of all passwords and hence can be susceptible to breaches and offline dictionary attacks. Indeed, large-scale password breaches in the wild are extremely common. Moreover, password-based authentication is more susceptible to phishing as the attacker only needs to capture the password which serves as a persistent secret credential in order to impersonate users.
Passwords can also pose challenging usability problems. High entropy passwords are hard to remember by humans, while low entropy passwords provide little security, and research has proven that introducing complex restrictions on password choices can backfire. Passwords may also be inconvenient and slow to enter, especially on mobile and Internet of Things devices that dominate user activities on the Internet.
There are major ongoing efforts in the industry to address some of these issues. For example, “unique” biometric features such as finger-print (e.g. Google Pixel's fingerprint [1]), facial scans (e.g. Face-ID used in Apple iPhone [2]), and iris scans (e.g. Samsung Galaxy phones) are increasingly popular first or second factor authentication mechanisms for logging into devices, making payments and identifying to the multitude of applications on consumer devices Studies show that biometrics are much more user-friendly, particularly on mobile devices as users may not have to remember or enter any secret information.
Moreover, the industry is shifting away from transmitting or storing persistent user credentials/secrets on the server-side. This can significantly reduce the likelihood of scalable attacks such as server breaches and phishing. For example, biometric templates and measurements can be stored and processed on the client side where the biometric matching also takes place. A successful match then unlocks a private signing key for a digital signature scheme (i.e. a public key credential) that is used to generate a token on various information such as a fresh challenge, the application's origin and some user information. Only a one-time usable digital signature is transmitted to the server who stores and uses a public verification key to verify the token and identify the user.
This is the approach taken by the FIDO Alliance [3], the world's largest industry-wide effort to enable an interoperable ecosystem of hardware-, mobile- and biometrics-based authenticators that can be used by enterprises and service providers. The framework is also widely adopted by major Internet players and built-into all browser implementations such as most recent versions of Chrome, Firefox, and Edge in form of a W3C standard WebAuthn API.
With biometrics and private keys stored on client devices, a primary challenge is to securely protect them. This is particularly crucial with biometrics since unlike passwords they are not replaceable. The most secure approach for doing so relies on hardware solutions such as secure enclaves and trusted execution environments that provide both physical and logical separation between various applications and the secrets. But hardware solutions are not always available on devices (e.g. not all mobile phones or IoT devices are equipped with secure elements), or can be costly to support at scale. Moreover, they provide very little programmability to developers and innovators. For example, programming a new biometric authentication solution into a Secure Element uses support from all parties involved such as OEMs and OS developers.
Software-based solutions such as white-box cryptography are often based on ad-hoc techniques that are regularly broken. The provably secure alternative, i.e. cryptographic obfuscation, is extremely inefficient and the community's confidence on its mathematical foundation is lacking. An alternative approach is to apply the “salt-and-hash” techniques often used to protect passwords to biometric templates before storing them on the client device. A naive salt-and-hash solution can fail since biometric matching is often a fuzzy match that checks whether the distance between two vectors is above a threshold or not.
A better way of implementing the hash-and-salt approach is via a powerful primitive known as fuzzy extractor [4]. Unfortunately, this is still susceptible to offline dictionary attacks on the biometric (trying different biometric measurements until we find one that generates the correct public key), and does not solve the problem of protecting the signing key which is reconstructed in memory during each authentication session. Moreover, existing fuzzy extractor solutions do not support the wide range of distance metrics (e.g. cosine similarity, Euclidean distance, etc.) and the necessary accuracy level needed in today's practical biometric matching.
Embodiments of the present disclosure address these and other issues individually and collectively.
Some embodiments of the present disclosure are directed to methods of using biometric information to authenticate a first electronic device of a user to a second electronic device. The first electronic device can store a first key share of a private key, wherein the second electronic device stores a public key associated with the private key, and wherein one or more other electronic devices of the user store other key shares of the private key. The first device can store a first template share of a biometric template of the user, wherein the one or more other electronic devices of the user store other template shares of the biometric template. When the first device receives a challenge message from the second electronic device, it can measure, by a biometric sensor, a set of biometric features of the user to obtain a measurement vector comprised of measured values of the set of biometric features, wherein the biometric template includes a template vector comprised of measured values of the set of biometric features previously measured from the user. The first device can send the measurement vector and the challenge message to the one or more other electronic devices. The first device can generate a first partial computation using the first template share, the first key share, and the challenge message and receive at least T other partial computations from the one or more other electronic devices, wherein each of the at least T other partial computations are generated using a respective template share, a respective key share, and the challenge message. The first device can generate a signature of the challenge message using the first partial computation and the at least T other partial computations and send the signature to the second electronic device.
Some embodiments of the invention are directed to generating partial computations with particular distance measures, while another embodiment is directed to generating partial computations with any suitable distance measure.
Other embodiments of the invention are directed to systems and computer readable media associated with the above-described methods.
These and other embodiments of the invention are described in further detail below with reference to the Figures and the Detailed Description.
Embodiments of the disclosure can be motivated by the fact that most users own and carry multiple devices, such as their laptop, smartphone, and smartwatch, and have other Internet of Things devices around when authenticating such as their smart TV or smart-home appliances. Embodiments provide a new framework for client-side biometric-based authentication with the goal of distributing both the biometric templates as well as the secret signing key among multiple devices who collectively perform the biometric matching and signature generation without ever reconstructing the template or the signing key on one device. This framework may be referred to as Fuzzy Threshold Token Generation (FTTG). FTTG can also be used to protect biometric information on the server-side by distributing it among multiple servers who perform the matching and token generation (e.g. for a single sign-on authentication token) in a fully distributed manner.
We initiate a formal study of FTTG and introduce a number of concrete protocols secure within the framework. In protocols according to embodiments, during a one-time registration phase, both the template and the signing key are distributed among n devices, among which any t of them can generate tokens. The exact values of n and t are parameters of the scheme and vary across different protocols. Then, during each authentication session, the initiating device obtains the biometric measurement and exchanges a constant number of messages with t−1 other devices in order to verify that the biometric measurement is close enough, with respect to some distance measure, to the secret shared template and if yes, obtain a token, which may also be referred to as a digital signature, on a message chosen by the initiating parties.
We formally define Fuzzy Threshold Token Generation schemes. We provide a unified definition that captures both privacy of biometrics and unforgeability of tokens in the distributed setting against a malicious adversary. Our definitions follow the real-ideal paradigm but may use a standalone setting for efficiency and simplicity.
We propose a four round protocol for any distance function based on any two-round UC-secure multi-party computation protocol that may use a broadcast channel (for example [5, 6, 7, 8, 9]). This protocol works for any n and t≤n) and tolerates up to t−1 malicious corruptions. Note that a generic application of constant-round MPC protocols may not meet the important consideration that every contacted party only needs to exchange messages with the single initiating party. This protocol is a feasibility result as the resulting protocol is not black-box. To obtain protocols with concrete efficiency, embodiments can address the most popular distance functions used for biometric authentication, including Hamming distance (used for iris scans), cosine similarity and Euclidean distance (both of which are used for face recognition).
For cosine similarity, embodiments can include a very efficient four round protocol which supports any n with threshold t=3 and is secure against the corruption of one party. The protocol can combine techniques such an additively homomorphic encryption (AHE) scheme and associated NIZKs, with garbled circuit techniques to obtain a hybrid protocol wherein arithmetic operations (e.g. inner products) are performed using the AHE and non-arithmetic operations (e.g. comparison) take place in a small garbled circuits. The same construction easily extends to one for Euclidean distance.
A. FIDO Universal Authentication Framework
Let us now take a closer look into FIDO Alliance architecture [3] that proposes a way to authenticate a device to an authentication server.
In step 102, a user device 110 can generate a public key and a secret key for an asymmetric encryption scheme. Alternatively, the public key and the secret key may be provisioned to the user device 110.
In step 104, the user's device 110 can send the public key to an authentication server 140. The authentication server 140 can store the user's public key and register the device. The user's device 110 may securely store the secret key, such as on a hardware security module. The user device 110 may secure the secret key with additional protections, such as with a previously entered biometric (e.g., a fingerprint of the user, a facial scan of the user).
In step 106, the authentication server 140 can send a challenge to the user device 110. The challenge may be a message, and may be sent in response to the user device 110 initiating an access attempt.
In step 108, the user's device 110 can sign the challenge using the secret key and send the signed challenge back to the authentication server 140. For example, the user device 110 may sign the challenge by encrypting the challenge message with the secret key. As another example, the user device 110 may generate a cryptogram comprising information in the challenge and the secret key. The authentication server 140 can use the previously provided public key to verify the challenge. For example, the authentication server 140 can use the public key to decrypt the signed message and determine if the decrypted message matches the challenge. If the signature is valid, then the user may be allowed access to the resource.
The user can make this process more secure with the use of a biometric. During the registration phase the user can enter biometric data into the user device 110. For example, the user may use a camera of the user device 110 to take a picture of their face. A biometric template (e.g., a facial scan) can be extracted from the biometric data and stored “securely” inside the user device 110. At the same time, the secret key/public key pair can be generated. The public key can be communicated with the authentication server 140 whereas the secret key is securely stored in the user device 110. The secret key is virtually “locked” with the template. Later, during sign-on, a candidate template is used to unlock the secret key, which can then used in a standard identification protocol.
FIDO specification emphasizes that during sign-on approximate matching takes place inside the device “securely”. One popular way to instantiate that is to use secure elements (for example Apple iPhones [2]): the template is stored inside the secure elements (SE) (for example hardware security modules) at all times along with other sensitive data. On input a candidate template, an approximate matching is performed inside the SE. However using SE has a number of drawbacks: SEs are usually expensive, platform-specific, offers limited programmability and are not easily updatable (as updating may require changes in hardware). Furthermore they are subject to side-channel attacks, which can be executed on (for example) a stolen device. Therefore providing an efficient, provably secure, flexible, software-only solution is very important.
B. Security Considerations
There are certain security constraints that are relevant to the invention. The biometric template should not ever be fully reconstructed, so that if any device is compromised the template is not revealed. The secret key also should never be fully reconstructed, for similar reasons. The distributed matching should work as long as more than a certain threshold of devices in the network are not compromised.
The secondary devices also should not learn if the biometric matching was successful. This increases security because if the other devices do not learn the result of the biometric matching, they may not be able to leak information, or have information to leak. Say if one of the other devices is compromised, a malicious party cannot use the information about the biometric matching. The secondary devices also don't learn if signature generation was successful. For example, if a device has been compromised but can still participate in the authentication, the hacker would not be able to learn if the information that they have about the biometric template or the secret share is accurate or valid.
It is important to note that other participating devices need not be required to talk to each other or even know each other (all messages are exchanged between the initiating device and other participants). In a typical usage scenario, one or two primary user devices (e.g. a laptop or a smartphone) play the role of the initiating device and all other devices are only paired/connected to the primary device and may not even be aware of the presence of other devices in the system. This makes the design of efficient and constant-round FTTG protocol significantly more challenging. We assume that devices that are connected, have established a point-to-point authenticated channel (but not a broadcast channel).
C. Overview
1. Device Registration
In step 202, a user 205 can enter a biometric measurement into a primary user device 215. A biometric sensor of the primary user device 215 (e.g., a camera, a microphone, a fingerprint sensor) to measure a set of biometric features of the user 205. For example, the user 205 can use a camera of the primary user device 215 to take a picture of their face for a facial scan. Other examples of biometric measurements may include voice recordings, iris scans, and fingerprint scans. The role of the primary device may also performed by any trusted authority which may not be a device of the user. As an example, the primary device may be a computer of an authentication system.
In step 204, the primary user device 215 can generate a public key and a private key for an asymmetric encryption scheme. The primary user device 215 can also use the biometric measurement to create a biometric template. The biometric template may include a template vector, and the template vector may comprise measured values of the set of biometric features of the user. For example, the primary user device 215 may compute distances between facial features identified in the picture of the user's face. The computed distances may comprise the template vector of the biometric template.
In step 206, the primary user device 215 can generate shares of the private key and the template, as well as any other parameters that might be needed for later authentication. The primary user device 215 can then store a first key share of the private key. The user device may also store a first template share of the biometric template.
In step 208, the primary user device 215 can send other key shares of the private key, other template shares of the template, and other parameters to each of a plurality of the user's other user devices 225, 235. Other devices of the user may include, for example, laptops, smartphones, wearable devices, smart TVs, IoT connected devices, etc. Two other devices are shown in
In step 210, the primary user device 215 can send the public key to an authentication server 245. The authentication server 245 can securely store the public key. The public key may be stored with an identifier of the user 205 and/or the primary user device 215. The primary user device 215 may now be registered.
2. Device Authentication
In step 302, the authentication server 245 can send a challenge message to the primary user device 215. The challenge message may be a vector.
In step 304, the user 205 can enter a biometric measurement into the primary user device 215. A biometric sensor of the primary user device 215 (e.g., a camera, a microphone, a fingerprint sensor) to measure a set of biometric features of the user 205. For example, the user 205 may use a camera of the primary user device 215 to take a picture of their face to be used in a facial scan. As another example, the user 205 may use a sensor of the primary user device 215 to scan their fingerprint. The primary user device 215 may then generate a measurement vector that comprises measured values of the set of biometric features. For example, the measured values may be computed distances between facial features of the user.
In step 306, the primary user device 215 and other user devices 225, 235 can match the previously distributed biometric template with the new biometric measurement. The primary user device 215 can send the measurement vector and the challenge message to the other devices. Each user device 215, 225, 235 may generate a partial computation with their template share and the measurement vector. For example, an inner product may be computed between the template share and the measurement vector. In embodiments steps 306 and 308 may occur concurrently.
In step 308, the primary user device 215 and other user devices 225, 235 can sign the challenge message with the key shares of the secret key. Each user device 215, 225, 235 may generate a partial computation with the challenge message and a respective key share. The partial computation may be, for example, a partial encryption of the challenge message with the respective key share. The partial computation may also be generated with the result of the partial computation from the respective template share and the measurement vector. After generating each partial computation, each user device 225, 235 can send the partial computation to the primary user device 215. After receiving the partial computations, the primary user device 215 can generate a signature of the challenge message using the partial computations. The primary user device 215 may need to receive partial computations from a threshold number, T, of other devices. A first partial computation generated by the primary user device 215 may be one of the at least T partial computations. For example, the primary user device 215 may need to receive at least three partial computations, and may receive the partial computations from two other devices. The threshold may be lower than the total number of other devices that received key shares and template shares, so that a signature can still be generated if one or more of the other devices are compromised.
In step 310, the primary user device 215 can send the signature to the authentication server 245. The primary user device 215 may also send the challenge message to the authentication server. If a valid signature is generated, then the authentication server 245 will be able to use the previously provided public key to verify the signature and allow the user access to the resource. If a valid signature is not generated then the device will not be authenticated and the user will not gain access to the resource.
We first briefly describe the notion of a Fuzzy Threshold Token Generation scheme.
Consider a set of n parties and a distribution over vectors ∈. Let's denote Dist to be the distance measure under consideration. Initially, in a registration phase, a biometric template {right arrow over (w)}∈ is sampled and secret shared amongst all the parties. Also, the setup of an unforgeable threshold signature scheme is implemented and the signing key is secret shared amongst the n parties. This is followed by a setup phase where suitable public and secret parameters are sampled by a trusted party and distributed amongst the n parties. Then, in the online authentication session (or sign-on phase), any party P* with input vector {right arrow over (u)}, that wishes to generate a token on a message m can interact with any set consisting of t parties (including itself) to generate a token (signature) on the message m using the threshold signature scheme. Note that P* gets a token only if Dist({right arrow over (u)}, {right arrow over (w)})>d where d is parameterized by the scheme. We recall that in this phase, the parties in set are not allowed to talk to each other. In particular, the communication model only involves party P* to interact individually with each party in .
The security definition for a Fuzzy Threshold Token Generation (FTTG) scheme captures two properties: privacy and unforgeability. Informally, privacy says that the long term secrets—namely, the biometric template {right arrow over (w)} and the signing key of the threshold signature scheme should be completely hidden from every party in the system. In preferred embodiments, the online input vector {right arrow over (u)} used in each authentication session should be hidden from all parties except the initiator. Unforgeability requires that no party should be able to generate a token on any message m without participating in an authentication session as the initiator interacting with at least t other parties, using message m and an input vector {right arrow over (u)} such that Dist({right arrow over (u)},{right arrow over (w)})>d. We formalize both these properties via a Real-Ideal security game against a malicious adversary. We refer the reader to Section V for the detailed definition including a discussion on the various subtleties involved in formally defining the primitive to achieve meaningful security while still being able to build efficient protocols.
In step 402, a first device 405 stores a first share of a private key, a second device 415 stores a public key associated with the private key, and other devices 425 store other shares of the private key. The private key and public key are keys can be keys generated by a trusted authority for a secret key encryption scheme. The trusted authority may be the first device. The second device may be an authorization server.
In step 404, the first device 405 stores the first share of a biometric template and the other devices 425 store other shares of the biometric template. The biometric template includes a template vector comprised of measured values of the set of biometric features previously measured from the user of the first device. The biometric template may have been generated by the trusted authority. In some embodiments, the biometric template may not be divided into shares. The first template share and the other template shares may all be the same value, and may be an encryption of the biometric template.
In step 406, the second device 415 sends a challenge message to the first device 405. This message may have been sent in response to the first device 405 requesting authentication in order to access some secure or protected resource. The challenge message may, for example, be a challenge message associated with the FIDO Universal Authentication Framework.
In step 408, the first device 405 obtains a biometric measurement from the user of the device. The biometric measurement can be obtained by a biometric sensor of the first device 405. The first device 405 may use the biometric sensor to measure a set of biometric features of the user to obtain a measurement vector comprised of measured values of the set of biometric features. For example, the biometric sensor may be a camera that takes a picture of a user's face. The first device may then measure distances between identified facial features in the picture to determine the biometric features. In some embodiments, the biometric measurement may be encrypted.
In step 410, the first device 405 sends the challenge message and the biometric measurement to the other devices 425. The first device 405 may also send other information or computations needed for the other devices 425 to generate partial computations. In some embodiments, the biometric measurement may be encrypted by the first device 405 before it is sent to the other device 425. In other embodiments, the biometric measurement may not be sent to the other devices 425 with the challenge message.
In step 412, the first device 405 can generate a first partial computation using the challenge message, biometric measurement, and/or the first shares of the private key and template. The other devices 425 can generate other partial computations with the challenge message, biometric measurement, and/or their respective key share and respective template share. The first device 405 and other devices 425 may also make use of information provided by the trusted authority, such as keys for a pseudorandom function. The partial computations can be computations to determine if the template and measurement match, according to a pre-established distance measure. Thus the partial computations may be partial distances. The partial distances may be encrypted with an additively homomorphic encryption scheme, which may be a threshold fully homomorphic encryption (TFHE) scheme. The TFHE can allow the first device 405 and the other devices 425 to compute the partial computations with both additions and multiplications of encrypted values, without decrypting the values. The devices can also generate partial computations of a token that can be used to generate a signature of the challenge message. In some embodiments, the devices can partially decrypt a partial computation (or a partial distance).
In step 414, the other devices 425 send the partial computations to the first device 405. The first device 405 receives at least T partial computations, where T is a threshold for the distributed signature generation. The first partial computation may be one of the at least T partial computations. The first device 405 may receive zero knowledge proofs along with each partial computation. The zero knowledge proofs may be used to verify the at least T partial computations.
In step 416, the first device 405 uses the partial computations to generate a signature (e.g., a token) of the challenge message. For example, the first device 405 may evaluate the partial computations to receive partial signatures (e.g., shares of the token), which it can then combine to generate a complete signature. In some embodiments, the partial signatures may be encrypted, such as with an additively homomorphic encryption scheme. The first device 405 may add shares of additively homomorphic encryptions of partial distances between the template shares and the measurement vector to obtain a total distance between the measurement vector and the template vector. The total distance can be compared to a threshold, and if the total distance is less than the threshold, the first device 405 can generate the signature and sign the challenge message. In this way, the generation of a complete token can be tied to the success of the biometric computations. In some embodiments, a zero knowledge proof can be used by the other devices to verify the comparison of the total distance to the threshold. A zero knowledge proof can also be used by the other devices or the first device 405 to verify the at least T partial computations.
In some embodiments, in order to generate the signature, the first device 405 may generate a cryptographic program. The cryptographic program may conditionally use a the set of keys to generate the signature when the measurement vector is within a threshold of the template vector. For example, the cryptographic program may generate the signature if a cosine similarity measure between the measurement vector and the template vector is less than a threshold. In some embodiments, the cryptographic program may include a garbled circuit. The garbled circuit may perform the steps of adding partial distances and comparing the total distance to the template. In some embodiments, the first device 405 may input the measurement vector into the garbled circuit, and thus the measurement vector can be compared to the template vector without the first device 405 sending the measurement vector to the other devices 425. The cryptographic program may also use properties of additively homomorphic encryption to determine if the measurement vector is within the threshold of the template vector. In some embodiments, the cryptographic program may reconstruct the biometric template using the template shares. In some embodiments, the garbled circuit may output a string that can be used to decrypt the partial signatures. In some embodiments, the cryptographic program may reconstruct the biometric template using the template shares in order to compute the distance measure.
In step 418, the first device 405 sends the signed challenge message to the second device 415. The second device 415 can use the stored public key to verify the signature on the challenge message. If the signature is valid, the first device 405 can be authenticated to the second device 415.
A. General Purpose
We now describe the techniques used in a four round Fuzzy Threshold Token Generation scheme that works for any distance measure and is malicious secure against an adversary that can corrupt up to (t−1) parties.
Our starting point is the observation that suppose all the parties could freely communicate in our model, then any r round malicious secure multiparty computation (MPC) protocol with a broadcast channel would also directly imply an r round Fuzzy Threshold Token Generation scheme if we consider the following functionality: the initiator P* has input (m, , {right arrow over (u)}), every party Pi∈ has input (m, ) and their respective shares of the template {right arrow over (w)} and the signing key. The functionality outputs a signature on m to party P* if Dist({right arrow over (u)},{right arrow over (w)})>d and ||=t. Recently, several elegant works [5, 6, 7, 8, 9, 11] have shown how to construct two round UC-secure MPC protocols in the CRS model with a broadcast channel from standard assumptions. However, since the communication model of our FTTG primitive does not allow all parties to interact amongst each other, our goal now is to emulate a two round MPC protocol π in our setting.
For simplicity, let's first consider n=t=3. That is, there are three parties: P1, P2, P3. Consider the case when P1 is the initiator. Now, in the first round of our FTTG scheme, P1 sends m to both parties and informs them the set S that it is going to be communicating with involves both of them. Then, in round 2, we have P2 and P3 send their round one messages of the MPC protocol π. In round 3 of our FTTG scheme, P1 sends its own round one message of the MPC protocol to both parties. Along with this, P1 also sends P2's round one message to P3 and vice versa. So now, at the end of round 3 of our FTTG scheme, all parties have exchanged their first round messages of protocol π.
Our next observation is that since we care only about P1 getting output, in the underlying protocol π, only party P1 needs to receive everyone else's messages in round 2! Therefore, in round 4 of our FTTG scheme, P2 and P3 can compute their round two messages based on the transcript so far and just send them to P1. This will enable P1 to compute the output of protocol π.
While the above FTTG scheme is correct, unfortunately, it is insecure. Note that in order to rely on the security of protocol π, we crucially need that for any honest party Pi, every other honest party receives the same first round message on its behalf. Further, embodiments can also require that all honest parties receive the same messages on behalf of the adversary. In our case, since the communication is being controlled and directed by P1 instead of a broadcast channel, this need not be true if P1 was corrupt and P2, P3 were honest. More specifically, one of the following two things could occur: (i) P1 can forward an incorrect version of P3's round one message of protocol π to P2 and vice versa. (ii) P1 could send different copies of its own round 1 message of protocol π to both P2 and P3.
The first problem can be solved quite easily as follows: we simply enforce that P3 sends a signed copy of its round 1 message of protocol π which is also forwarded by P1 to P2. Then, P2 accepts the message to be valid if the signature verifies. In the setup phase, we can distribute a signing key to P3 and a verification key to P2. Similarly, we can ensure that P2's actual round 1 message of protocol π was forwarded by P1 to P3.
Tackling the second problem is a bit trickier. The idea is that instead of enforcing that P1 send the same round 1 message of protocol π to both parties, we will instead ensure that P1 learns their round 2 messages of protocol π only if it did indeed send the same round 1 message of protocol π to both parties. We now describe how to implement this mechanism. Let us denote msg2 to be P1's round 1 message of protocol π sent to P2 and msg3 (possibly different from msg2) to be P1's round 1 message of protocol π sent to P3. In the setup phase, we distribute two keys k2, k3 of a pseudorandom function (PRF) to both P2, P3. Now, in round 4 of our FTTG scheme, P3 does the following: instead of sending its round 2 message of protocol π as is, it encrypts this message using a secret key encryption scheme where the key is PRF(k3, msg3). Then, in round 4, along with its actual message, P2 also sends PRF(k3, msg2) which would be the correct key used by P3 to encrypt its round 2 message of protocol π only if msg2=msg3. Similarly, we use the key k2 to ensure that P2's round 2 message of protocol π is revealed to P1 only if msg2=msg3.
The above approach naturally extends for arbitrary n, k. by sharing two PRF keys between every pair of parties. Then, each party encrypts its round 2 message of protocol π with a secret key that is an XOR of all the PRF evaluations. We refer the reader to Section VII for more details.
B. Cosine Similarity
We now describe the techniques used in our four round efficient FTTG protocol for t=3 for the Cosine Similarity distance measure. Our protocol is secure against a malicious adversary that can corrupt at most 1 party. Our construction is very similar for the very related Euclidean Distance function but we focus on Cosine Similarity in this section. Recall that for two vectors {right arrow over (u)}, {right arrow over (w)}, CS.
where ∥x∥ denotes the L2-norm of the vector. First, we are going to assume that the distribution samples vectors {right arrow over (w)} such that ∥{right arrow over (w)}∥=1. Then, instead of checking that CS.Dist({right arrow over (u)}, vecw)>d, we are going to check that {right arrow over (u)},{right arrow over (w)}>(d·{right arrow over (u)},{right arrow over (u)})2. This is just a syntactic change that allows us to construct more efficient protocols.
Our starting point is the following. Suppose we had t=2. Then, embodiments can use Yao's [12] two party semi-honest secure computation protocol to build a two round FTTG scheme. In the registration phase, we secret share {right arrow over (w)} into two parts {right arrow over (w)}1, {right arrow over (w)}2 and give one part to each party. The initiator requests for labels (via oblivious transfer) corresponding to his share of {right arrow over (w)} and to his input {right arrow over (u)} and the garbled circuit, which has the other share of {right arrow over (w)} hardwired into it, reconstructs {right arrow over (w)}, checks that {right arrow over (u)}, {right arrow over (w)}>(d·{right arrow over (u)}, {right arrow over (u)})2 and if so, outputs a signature. In this protocol, we would have security against a malicious initiator who only has to evaluate the garbled circuit, if we use an oblivious transfer protocol that is malicious secure in the CRS model. However, to achieve malicious security against the garbler, we would need expensive zero knowledge arguments. Now, in order to build an efficient protocol that achieves security against a malicious garbler and to actually make the protocol work for threshold t=3, the idea is to distribute the garbling process between two parties. That is, consider an initiator P1 interacting with parties P2, P3. Now both P2 and P3 can generate one garbled circuit each using a shared randomness generated during the setup phase and the evaluator can just check if the two circuits are identical. In the registration phase, both P2 and P3 get the share {right arrow over (w)}2 and a share of the signing key. Note that since the adversary can corrupt at most one party, this check would guarantee that the evaluator can learn whether the garbled circuit was honestly generated. In order to ensure that the evaluator does not evaluate both garbled circuits on different inputs, the garbled circuits can check that P1's OT receiver queries made to both parties were the same. While this directly gives a two round FTTG scheme that works for threshold t=3 and is secure against a malicious adversary that can corrupt at most one party, the resulting protocol is inefficient. Notice that in order to work for the Cosine Similarity distance measure, the garbled circuit will have to perform a lot of expensive operations—for vectors of length , we would have to perform O() multiplications inside the garbled circuit. Our goal is to build an efficient protocol that performs only a constant number of operations inside the garbled circuit.
Our strategy to build an efficient protocol is to use additional rounds of communication to offload the heavy computation outside the garbled circuit. In particular, if we can first do the inner product computation outside the garbled circuit in the first phase of the protocol, then the resulting garbled circuit in the second phase would have to perform only a constant number of operations. In order to do so, we leverage the tool of efficient additively homomorphic encryption schemes [14, 15]. In our new protocol, in round 1, the initiator P1 will send an encryption of {right arrow over (u)}. P1 can compute {right arrow over (u)}, {right arrow over (w)}1 by itself. Both P2 and P3 respond with encryptions of ({right arrow over (u)}, {right arrow over (w)}2) computed homomorphically using the same shared randomness. Then, P1 can decrypt this ciphertext and compute û, ŵ. The parties can then run the garbled circuit based protocol as above in rounds 3 and 4 of our FTTG scheme: that is, P1 requests for labels corresponding to û, ŵ and {right arrow over (u)},{right arrow over (u)} and the garbled circuit does the rest of the check as before. While this protocol is correct and efficient, there are still several issues.
The first problem is that the inner product û, ŵ is currently leaked to the initiator P1 thereby violating the privacy of the template {right arrow over (w)}. To prevent this, we need to design a mechanism where no party learns the inner product entirely in the clear and yet the check happens inside the garbled circuit. A natural approach is for P2 and P3 to homomorphically compute an encryption of the result (û, ŵ2) using a very efficient secret key encryption scheme. In our case, a one-time pad may suffice. Now, P1 only learns an encryption of this value and hence the inner product is hidden, while the garbled circuit, with the secret key hardwired into it, can easily decrypt the one-time pad.
The second major challenge is to ensure that the input on which P1 wishes to evaluate the garbled circuit is indeed the output of the decryption. If not, P1 could request to evaluate the garbled circuit on suitably high inputs of his choice, thereby violating unforgeability. In order to prevent this attack, P2 and P3 homomorphically compute not just x={right arrow over (u)}, {right arrow over (w)}2 but also a message authentication code (MAC) y on the value x using shared randomness generated in the setup phase. We use a simple one time MAC that can be computed using linear operations and hence can be done using the additively homomorphic encryption scheme. Now, the garbled circuit also checks that the MAC verifies correctly and from the security of the MAC, P1 can not change the input between the two stages. Also, P1 can send encryptions of {right arrow over (u)}, {right arrow over (u)} in round 1 so that P2, P3 can compute a MAC on this as well, thereby preventing P1 from cheating on this part of the computation too.
Another important issue to tackle is that we need to ensure that P1 does indeed send valid well-formed encryptions. In order to do so, we rely on efficient zero knowledge arguments from literature. We observe that in our final protocol, the garbled circuit does only a constant number of operations and the protocol is extremely efficient.
In order to further optimize the concrete efficiency of our protocol, as done by Mohassel et al. [13], one or both of the two parties P2, P3 can send the entire garbled circuit. The other party can just send a hash of the garbled circuit and P1 can just check that the hash values are equal.
A. Notation
The notation [j: x] can denote that the value x is private to party j. For a protocol π, we write [j: z′]←π([i: (x, y)], [j: z], c) to denote that party i has two private inputs x and y; party j has one private input z; all the other parties have no private input; c is a common public input; and, after the execution, only j receives an output z′. xs denotes that each party i∈S has a private value xi.
B. Basic Primitives
We refer the reader to [18] for the definition of threshold linear secret sharing schemes, secret key encryption, oblivious transfer, garbled circuits, non-interactive zero knowledge arguments, digital signatures, collision resistant hash functions and pseudorandom functions. We refer to [14] for a definition of additively homomorphic encryption schemes and to [19] for a definition of circuit privacy. We refer to [18] for the definition of secure multi-party computation. We refer to Appendix A for a definition of threshold oblivious pseudorandom functions and robust secret sharing.
C. Distance Measures
First, let's recall that the L2 norm of a vector {right arrow over (x)}=({right arrow over (x)}1, . . . , {right arrow over (x)}n) is defined to be ∥{right arrow over (x)}∥=√{square root over ({right arrow over (x)}12+ . . . +{right arrow over (x)}n2)}. We now define the various distance measures that we use in embodiments of the invention. This list is not limiting, as any suitable distance measure may be used.
Definition 1 (Hamming distance) For any two vectors {right arrow over (u)}=({right arrow over (u)}1, . . . , ), {right arrow over (w)}=({right arrow over (w)}1, . . . , )∈, the Hamming Distance between them is defined to be the number of positions j at which {right arrow over (u)}j≠{right arrow over (w)}j.
Hamming distance counts the number of points in which the measurement and template vectors differ. Two vectors of length l each can be said to be close if their Hamming Distance is at most (l−d)—that is, they are equal on at least d positions.
Definition 2 (Cosine similarity) For any two vectors {right arrow over (u)}, {right arrow over (w)}∈, the Cosine Similarity between them is defined as follows:
Cosine similarity uses the inner product of two vectors to determine the cosine of the angle between the vectors. Smaller angles correspond to more similar vectors. Thus if the angle is small, the cosine of the angle is greater, and if the cosine distance is greater than an established threshold, then the vectors are said to match.
Definition 3 (Euclidean Distance) For any two vectors {right arrow over (u)}, {right arrow over (w)}∈, the Euclidean Distance between them is defined as follows:
EC.Dist({right arrow over (u)},{right arrow over (w)})=({right arrow over (u)},{right arrow over (u)})+{right arrow over (w)},{right arrow over (w)}−2·CS.Dist({right arrow over (u)},{right arrow over (w)}))
Euclidean distance measures the distance between the endpoints of two vectors as points in space. Two vectors that have a small Euclidean distance are closer together. Thus if the Euclidean distance is below an established threshold, then the vectors are said to match.
D. Threshold Signature
We now formally define a Threshold Signature Generation Scheme [16] and the notion of unforgeability.
Definition 4 (Threshold Signature) Let n, t∈. A threshold signature scheme TS is a tuple of four algorithms (Gen, Sign, Comb, Ver) that satisfy the correctness condition below.
For all κ∈, any t, n∈ such that t≤n, all (pp, vk, skn) generated by Gen(1κ, n, t), any message m, and any set S⊆[n] of size at least t, if σi=Sign(ski, m) for i∈S, then Ver(vk, (m, Comb({σi}i∈s)))=1.
Definition 5 (Unforgeability) A threshold signatures scheme TS=(Gen, Sign, Comb, Ver) is unforgeable if for all n, t∈, t≤n, and any PPT adversary , the following game outputs 1 with negligible probability (in security parameter).
E. Specific Threshold Signature Schemes
We now describe the threshold signature schemes of Boldyreva [16] based on Gap-DDH assumption and Shoup[17] based on RSA assumption. We will use these schemes in Section IX.
1. Scheme of Boldyreva
Let =g be a multiplicative cyclic group of prime order p that supports pairing and in which CDH is hard. In particular, there is an efficient algorithm VerDDH(ga,gb,gc,g) that returns 1 if and only if c=ab mod p for any a, b, c∈ and 0 otherwise. Let H:{0,1}*→ be a hash function modeled as a random oracle. Let Share be the Shamir's secret sharing scheme.
The threshold signature scheme is as follows:
Verify(vk, x, Token)=:1/0. Return 1 if and only if VerDDH(H(x), vk, Token, g)=1.
2. Scheme of Shoup
Let Share be Shamir's secret sharing scheme and H:{0,1}*→ be a hash function modeled as a random oracle.
The threshold signature scheme is as follows:
F. Zero Knowledge Argument of Knowledge for Additively Homomorphic Encryption
In this section, we list a couple of NP languages with respect to any additively homomorphic encryption scheme in which we use efficient non-interactive zero knowledge argument of knowledge systems.
First, let (AHE.Setup, AHE.Enc, AHE.Add, AHE.ConstMul, AHE.Dec) be the algorithms of an additively homomorphic encryption scheme. Let pk denote a public key sampled by running the setup algorithm AHE.Setup(1λ). Let denote the message space for the encryption scheme.
1. NP Languages
The first NP language is to prove knowledge of a plaintext in a given ciphertext. The second one proves that given three ciphertexts where the first two are well-formed and the prover knows the corresponding plaintexts and randomness used for encrypting them, the third ciphertext was generated by running the algorithm AHE.ConstMul( ). That is, it proves that the third ciphertext contains an encryption of the product of the messages encrypted in the first two ciphertexts.
NP language L1 characterized by the following relation R1.
Statement: st=(ct, pk)
Witness: wit=(x, r)
R1(st, with)=1 if and only if:
NP language L2 characterized by the following relation R2.
Let cti=AHE.Enc(pk, x1; r1), ct2=AHE.Enc(pk, x2; r2).
Statement: st=(cti, ct2, ct3, pk)
Witness: wit=(x1, r1, x2, r2, r3)
R1(st, with)=1 if and only if:
2. Paillier Encryption Scheme.
With respect to the additively homomorphic encryption scheme of Paillier [14], we have efficient NIZK arguments for the above two languages. Formally, we have the following imported theorem:
Imported Theorem 1 [20] Assuming the hardness of the Nth Residuosity assumption, there exists a non-interactive zero knowledge argument of knowledge for the above two languages in the Random Oracle model.
The above zero knowledge arguments are very efficient and only use a constant number of group operations on behalf of both the prover and verifier.
We introduce the notion of fuzzy threshold token generation (FTTG). An FTTG scheme is defined with respect to a function Dist which computes distance between two vectors, say from the space of -dimensional vectors over . In the registration phase, key shares of a threshold signature scheme are generated and a template W is chosen according to a distribution over . Then the shares of the key and template are distributed among the n parties. A separate set-up algorithm generates some common parameters and some secret information for each party. After the one-time set-up has been completed, any one of the n parties can initiate a sign-on session with a new measurement {right arrow over (u)}. If at least t parties participate in the session and {right arrow over (u)} is close to the distributed template {right arrow over (w)}, w.r.t. to the measure Dist, then the initiating party obtains a valid signature.
Definition 6 (Fuzzy Threshold Token Generation). Let n, t∈ and be a probability distribution over vectors in for some q,∈. Let TS=(Gen, Sign, Comb, Ver) be a threshold signature scheme. An FTTG scheme for distance measure Dist: ×→ with threshold d∈ is given by a tuple (Registration, Setup, SignOn, Verify) that satisfies the correctness property stated below.
Correctness. For all κ∈, any n, t∈ such that t≤n, any threshold signature scheme TS, any q, ∈, any probability distribution over , any distance d∈, any measurement {right arrow over (u)}∈, any m, any S⊆[n] such that |S|=t, and any j∈[n], if (sk, {right arrow over (w)}[n], pp, vk)←Registration(1κ, n, t, TS, q, , , d), (ppsetup, s1, . . . , sn)←Setup ( ), and ([j: out], [S: (m, j, S)])←SignOn((sk,{right arrow over (w)})[n], [j:(m, {right arrow over (u)}, S)]), then Verify(vk, m, out)=1 if Dist({right arrow over (w)},{right arrow over (u)})≥d.
For an FTTG scheme, one could consider two natural security considerations. The first one is the privacy of biometric information. A template is sampled and distributed in the registration phase. Clearly, no subset of t−1 parties should get any information about the template from their shares. Then, whenever a party performs a new measurement and takes the help of other parties to generate a signature, none of the participants should get any information about the measurement, not even how close it was to the template. We allow the participants to learn the message that was signed, the identity of the initiating party, and set of all participants. The second natural security consideration is unforgeability. Even if the underlying threshold signature scheme is unforgeable, it may still be possible to generate a signature without having a close enough measurement. An unforgeable FTTG scheme should not allow this.
We propose a unified real-ideal style definition to capture both considerations. In the real world, sessions of sign-on protocol are run between adversary and honest parties whereas in the ideal world, they talk to the functionality DiFuz. Both the worlds are initialized with the help of n, t, an unforgeable threshold signature scheme, parameters q, for the biometric space, a threshold for successful match d, a distribution , and a sequence {right arrow over (U)}=({right arrow over (u)}1, {right arrow over (u)}2, . . . , {right arrow over (u)}h) of measurements for honest parties. The indistinguishability condition that we will define below must hold for all values of these inputs. In particular, it should hold irrespective of the threshold signature scheme used for initialization, as long as it is unforgeable. The distribution over the biometric space could also be arbitrary.
In the initialization phase, Registration is run in both real and ideal worlds to generate shares of a signing key and a template (chosen as per ). In the real world, Setup is also run. The public output of both Registration and Setup is given to the adversary . It outputs a set of parties C to corrupt along with a sequence ((m1, j1, S1), . . . , (mh, jh, Sh)), which will later be used to initiate sign-on sessions from honest parties (together with ({right arrow over (u)}1, . . . , {right arrow over (u)}h)). The secret shares of corrupt parties are given to and the rest of shares are given to appropriate honest parties. On the other hand, in the ideal world, is allowed to pick the output of Setup. We will exploit this later to produce a simulated common reference string. Note, however, that the output of Setup (whether honest or simulated) will be part of the final distribution.
Evaluation phase in the real world can be one of two types. Either a corrupt party can initiate a sign-on session or can ask an honest party to initiate a session using the inputs chosen before. In the ideal world, talks to DiFuz to run sign-on sessions. Again, there are two options. If S sends (SignOn-Corrupt, m, u, j, S) to the functionality (where j is corrupt), then it can receive signature shares of honest parties in S on the message m but only if {right arrow over (u)} is close enough to {right arrow over (w)}. When S sends (SignOn-Honest, sid,i), DiFuz waits to see if S wants to finish the session or not. If it does, then DiFuz computes a signature and sends it to the initiating (honest) party.
We say that an FTTG scheme is secure if the joint distribution of the view of the real world adversary and the outputs of honest parties is computationally indistinguishable from the joint distribution of the view of the ideal world adversary and messages honest parties get from the functionality.
There are several important things to note about the definition. It allows an adversary to choose which parties to corrupt based on the public parameters. It also allows the adversary to run sign-on sessions with arbitrary measurements. This can help it to generate signatures if some measurements turn out to be close enough. Even if none of them do, it can still gradually learn the template. Our definition does not allow inputs for sessions initiated by honest parties to be chosen adaptively during the evaluation phase. Thus, the definition is a standalone definition and not a (universally) composable one. This type of restriction helps us to design more efficient protocols, and in some cases without any trusted setup.
Definition 7 A fuzzy threshold token generation scheme FG=(Registration, Setup, SignOn, Verify) is secure if for any n, t s.t. t≤n, any unforgeable threshold signature scheme TS, any q, ∈, any distance d∈, and any PPT adversary there exists a PPT simulator such that for any probability distribution over and any sequence {right arrow over (U)}=({right arrow over (u)}1, {right arrow over (u)}2, . . . , {right arrow over (u)}h) of measurements (where h=poly(κ) and {right arrow over (u)}i∈),
(,{Out−Reali}i∈[n]\C))≈(,{Out−Ideali}i∈[n]\C)),
where and are the views of and in the real and ideal worlds respectively, Out−Reali is the concatenated output of (honest) party i in the real world from all the SignOn sessions it participates in (plus the parameters given to it during initialization), and Out−Ideali is the concatenation of all the messages that party i gets from the functionality DiFuz in the ideal world, as depicted in
In this section, we show how to construct a four round secure fuzzy threshold token generation protocol using any two round malicious secure MPC protocol using a broadcast channel as the main technical tool. Our token generation protocol satisfies Definition 7 for any n, t and works for any distance measure. Formally, we show the following theorem:
Theorem 1 Assuming the existence of threshold signatures, threshold secret sharing, two round UC-secure MPC protocols in the CRS model that is secure in the presence of a broadcast channel against malicious adversaries that can corrupt up to (t−1) parties, secret key encryption, and pseudorandom functions and strongly unforgeable signatures, there exists a four round secure fuzzy threshold token generation protocol Definition 7 for any n, t and any distance measure.
Note that such a two round MPC protocol can be built assuming DDH/LWE/QR/Afth Residuosity [5, 6, 7, 8, 9, 11]. All the other primitives can be based on the existence of injective one way functions.
Instantiating the primitives used in the above theorem, we get the following corollary:
Corollary 2 Assuming the existence of injective one way functions and A∈{DDH/LWE/QR/Nth Residuosity}, there exists a four round secure fuzzy threshold token generation protocol satisfying Definition 7 for any n, t, any distance measure Dist and any threshold d.
A. Construction
We first list some notation and the primitives used before describing our construction.
Let Dist denote the distance function and d denote the threshold distance value to denote a match. Let the n parties be denoted by P1, . . . , Pn. respectively. Let λ denote the security parameter. Let denote the distribution from which the random vector is sampled. Let's assume the vectors are of length where is a polynomial in λ. Each element of this vector is an element of a field F over some large prime modulus q.
Let TS=(TS.Gen, TS.Sign, TS.Combine, TS.Verify) be a threshold signature scheme. Let (SKE.Enc, SKE.Dec) denote a secret key encryption scheme. Let (Share, Recon) be a (t, n) threshold secret sharing scheme. Let (Gen, Sign, Verify) be a strongly-unforgeable digital signature scheme. Let PRF denote a pseudorandom function.
Let π be a two round UC-secure MPC protocol in the CRS model in the presence of a broadcast channel that is secure against a malicious adversary that can corrupt upto (t−1) parties. Let π. Setup denote the algorithm used to generate the CRS. Let (π.Round1, π.Round2) denote the algorithms used by any party to compute the messages in each of the two rounds and π. Out denote the algorithm to compute the final output. Further, let π.Sim use algorithm (π.Simi, π.Sim2) to compute the first and second round messages respectively. Note that since we consider a rushing adversary, the algorithm π.Sim1(·) does not require the adversary's input or output. Let π. Ext denote the extractor, that, on input the adversary's round one messages, extracts its inputs. Let π.Sim.Setup denote the algorithm used by π.Sim to compute the simulated CRS.
We now describe the construction of our four round secure fuzzy threshold token generation protocol πAny for any n and k.
1. Registration
In the registration phase, the following algorithm is executed by a trusted authority. The trusted authority may be a device that will participate in subsequent biometric matching and signature generation. If the trusted authority will potentially participate in biometric matching and signature generation, it can delete the information that it should not have at the end of the registration phase. That is, it should delete the shares corresponding to the other devices.
The trusted authority can sample a random vector {right arrow over (w)} from the distribution of biometric data and save it as the biometric template. The trusted authority can then compute the appropriate shares of the template ({right arrow over (w)}1, . . . , {right arrow over (w)}n), depending on the key sharing algorithm being used, the number of devices and the threshold, and the security parameter using a share generation algorithm ({right arrow over (w)}1, . . . , {right arrow over (w)}n)←Share(1λ, w, n, t). The public key vkTS, secret key shares sk1TS, . . . , sknTS, and other relevant parameters ppTS for the threshold signature scheme can be generated with the threshold generation algorithm (ppTS, vkTS, sk1TS, . . . , sknTS)←TS.Gen(1λ, n, t). Then the trusted authority can send each device the relevant information (e.g., the template share, secret key share, public key, and other threshold signature parameters). For example, the ith device (party P1) can receive ({right arrow over (w)}i, ppTS, vkTS, skiTS).
2. Setup
Set up can also be done by a trusted authority. The trusted authority may be the same trusted device that completed the registration. If the trusted authority will potentially participate in biometric matching and signature generation, it can delete the information that it should not have at the end of the registration phase. That is, it should delete the shares corresponding to the other devices.
First, the trusted authority can generate a common reference string crs for the multiparty computation scheme being used using a setup algorithm crs←π.Setup(1λ). Generate shares of a secret key and public key for authenticating the computations. Compute shares of relevant parameters and secrets for a distributed pseudorandom function. Then each device is sent the relevant information (e.g., the crs and key shares).
3. SignOn
In the SignOn phase, let's consider party P* that uses input vector {right arrow over (u)}, a message m on which it wants a token. P* interacts with the other parties in the below four round protocol. The arrowhead in Round 1 can denote that in this round messages are outgoing from party P*.
4. Token Verification
Given a verification key vkTS, message m and token Token, the token verification algorithm outputs 1 if TS.Verify(vkTS, m, Token) outputs 1.
The correctness of the protocol directly follows from the correctness of the underlying primitives.
B. Security Proof
In this section, we formally prove Theorem 1.
Consider an adversary who corrupts t* parties where t*<t. The strategy of the simulator Sim for our protocol πAny against a malicious adversary is described below. Note that the registration phase first takes place at the end of which the simulator gets the values to be sent to every corrupt party which it then forwards to .
1. Description of Simulator
Setup: Sim does the following:
SignOn Phase: Case 1—Honest Party as P*
Suppose an honest party P* uses an input vector u and a message m for which it wants a token by interacting with a set of parties . The arrowhead in Round 1 can denote that in this round messages are outgoing from the simulator. Sim gets the tuple (m, ) from the ideal functionality DiFuz and interacts with the adversary as below:
SignOn Phase: Case 2—Malicious Party as P*
Suppose a malicious party is the initiator P*.Sim interacts with the adversary as below:
2. Hybrids
We now show that the above simulation strategy is successful against all malicious PPT adversaries. That is, the view of the adversary along with the output of the honest parties is computationally indistinguishable in the real and ideal worlds. We will show this via a series of computationally indistinguishable hybrids where the first hybrid Hyb0 corresponds to the real world and the last hybrid Hyb4 corresponds to the ideal world.
1. Hyb0—Real World: In this hybrid, consider a simulator SimHyb that plays the role of the honest parties as in the real world.
2. Hyb1—Special Abort: In this hybrid, SimHyb outputs “SpecialAbort” as done by Sim in round 4 of Case 2 of the simulation strategy. That is, SimHyb outputs “SpecialAbort” if all the signatures verify but the adversary does not send the same transcript of the first round of protocol π to all the honest parties.
3. Hyb2—Simulate MPC messages: In this hybrid, SimHyb does the following:
4. Hyb3—Switch DPRF Output in Case 2: In this hybrid, suppose a corrupt party plays the role of P*, SimHyb computes the value of the variable flag as done by the simulator Sim in round 4 of the simulation strategy. That is, SimHyb sets flag=0 if the adversary did not send the same round 1 messages of protocol π to all the honest parties Pj∈. Then, on behalf of every honest party Pj, SimHyb does the following:
5. Hyb4—Switch Ciphertext in Case 2: In this hybrid, suppose a corrupt party plays the role of P*, SimHyb does the following: if flag=0, compute ctj=SKE.Enc(rand, 0|msg2,j|) as in the ideal world. This hybrid corresponds to the ideal world.
We will now show that every pair of successive hybrids is computationally indistinguishable.
Lemma 1 Assuming the strong unforgeability of the signature scheme, Hyb0 is computationally indistinguishable from Hyb1.
Proof. The only difference between the two hybrids is that in Hyb1, SimHyb might output “SpecialAbort”. We now show that SimHyb outputs “SpecialAbort” in Hyb1 only with negligible probability.
Suppose not. That is, suppose there exists an adversary that can cause SimHyb to output “SpecialAbort” in Hyb1 with non-negligible probability, then we will use to construct an adversary Sign that breaks the strong unforgeability of the signature scheme which is a contradiction.
π begins an execution of the DiFuz protocol interacting with the adversary as in Hyb1. For each honest party Pj, Sign interacts with a challenger Sign and gets a verification key vkj which is forwarded to as part of the setup phase of DiFuz. Then, during the course of the protocol, Sign forwards signature queries from to Sign and the responses from Sign to .
Finally, suppose causes Sign to output “SpecialAbort” with non-negligible probability. Then, it must be the case that for some tuple of the form (msg1,i, σ1,i) corresponding to an honest party Pj, the signature σ1,1 was not forwarded to from Sign but still verified successfully. Thus, Sign can output the same tuple (msg1,j,σ1,j) as a forgery to break the strong unforgeability of the signature scheme with non-negligible probability which is a contradiction.
Lemma 2 Assuming the security of the MPC protocol π, Hyb1 is computationally indistinguishable from Hyb1.
Proof. Suppose there exists an adversary that can distinguish between the two hybrids with non-negligible probability. We will use to construct an adversary π that breaks the security of the protocol π which is a contradiction.
π begins an execution of the DiFuz protocol interacting with the adversary and an execution of protocol π for evaluating circuit C (
Case 1: Honest Party as P*
Now, since we consider a rushing adversary for protocol π, on behalf of every honest party Pj, π first receives a message msgj from the challenger . π sets msgj to be the message msg1,j in round 3 of its interaction with and then computes the rest of its messages to be sent to exactly as in Hyb1. π receives a set of messages corresponding to protocol π from on behalf of the corrupt parties in which it forwards to π as its own messages for protocol π.
Case 2: Corrupt Party as P*
As in the previous case, on behalf of every honest party Pj, π first receives a message msgj from the challenger π. π sets msgj to be the message msg1,j in round 2 of its interaction with . Then, in round 4, if the signatures verify, π forwards the set of messages corresponding to protocol π received from on behalf of the corrupt parties in to π as its own messages for protocol π. Then, on behalf of every honest party Pj, π receives a message msgj from the challenger π as the second round message of protocol π. π sets msgj to be the message msg2,j in round 4 of its interaction with and computes the rest of its messages to be sent to exactly as in Hyb1.
Notice that when the challenger π sends honestly generated messages, the experiment between π and corresponds exactly to Hyb1 and when the challenger π sends simulated messages, the experiment corresponds exactly to Hyb2. Thus, if can distinguish between the two hybrids with non-negligible probability, π can use the same guess to break the security of the scheme π with non-negligible probability which is a contradiction.
Lemma 3 Assuming the security of the pseudorandom function, Hyb2 is computationally indistinguishable from Hyb3.
Proof. Suppose there exists an adversary that can distinguish between the two hybrids with non-negligible probability. We will use to construct an adversary PRF that breaks the security of the pseudorandom function which is a contradiction.
The adversary PRF interacts with the adversary in an execution of the protocol DiFuz. For each honest party Pj, PRF also interacts with a challenger PRF in the PRF security game. For each j, PRF sends the PRF keys corresponding to the set of corrupt parties (<k) as requested by PRF which is then forwarded to during the setup phase. Then, PRF continues interacting with up to round 3 as in Hyb1. Now, in round 4, suppose it computes the value of the variable flag to be 0 (as computed in Hyb3), then PRF does the following: for each honest party Pj, forward the message msg1,* received in round 3. Then, set the XOR of the set of responses from PRF to be the value ek1 used for generating the ciphertext ctj.
Now notice that when the challenger PRF responds with a set of honest PRF evaluations for each honest party j, the interaction between PRF and A exactly corresponds to Hyb2 and when the challenger responds with a set of uniformly random strings, the interaction between PRF and exactly corresponds to Hyb3. Thus, if can distinguish between the two hybrids with non-negligible probability, PRF can use the same guess to break the pseudorandomness property of the PRF scheme with non-negligible probability which is a contradiction.
Lemma 4 Assuming the semantic security of the secret key encryption scheme, Hyb3 is computationally indistinguishable from Hyb4.
Proof. Suppose there exists an adversary that can distinguish between the two hybrids with non-negligible probability. We will use to construct an adversary SKE that breaks the semantic security of the encryption scheme which is a contradiction.
The adversary SKE interacts with the adversary as in Hyb3. Then, on behalf of every honest party Pj, before sending its round 4 message, SKE first sends the tuple (msg2,j, 0|msg2,j|) to the challenger SKE of the secret key encryption scheme. Corresponding to every honest party, it receives a ciphertext which is either an encryption of msg2,j or 0|msg2,j| using a secret key chosen uniformly at random. Then, SKE sets this ciphertext to be the value ctj and continues interacting with the adversary exactly as in Hyb2. Notice that when the challenger SKE sends ciphertexts of msg2,i, the experiment between SKE and corresponds exactly to Hyb3 and when the challenger SKE sends ciphertexts of 0|msg2,j|, the experiment corresponds exactly to Hyb4. Thus, if can distinguish between the two hybrids with non-negligible probability, SKE can use the same guess to break the semantic security of the encryption scheme with non-negligible probability which is a contradiction.
In this section, we show how to construct a fuzzy threshold token generation protocol for any distance measure using any FHE scheme with threshold decryption. Our token generation protocol satisfies the definition in Section III for any n, t and works for any distance measure. Formally, we show the following theorem:
Theorem 3. Assuming the existence of the following: FHE with threshold decryption, threshold signatures, secret key encryption, and strongly unforgeable digital signatures, there exists a four round secure fuzzy threshold token generation protocol for any n, t and any distance measure.
A. Construction
We first list some notation and the primitives used before describing our construction.
Let Dist denote the distance function that takes as input a template w and a measurement u, and let d denote the threshold distance value to denote a match between w and u. Let C be a circuit that takes as input a template w, a measurement u and some string K, and outputs K if Dist(w, u)<d; otherwise it outputs 0. Let the n parties be denoted by P1, . . . , Pn respectively. Let λ denote the security parameter. Let W denote the distribution from which a random template vector is sampled. Let's assume the vectors are of length l where l is a polynomial in λ.
Let TFHE=(TFHE.Gen, TFHE.Enc, TFHE.PartialDec, TFHE.Eval, TFHE.Combine) be a threshold FHE scheme. Let TS=(TS.Gen, TS.Sign, TS.Combine, TS.Verify) be a threshold signature scheme. Let SKE=(SKE, Gen, SKE.Enc, SKE.Dec) denote a secret key encryption scheme.
Let (Gen, Sign, Verify) be a strongly-unforgeable digital signature scheme. Let H be a collision-resistant hash function (e.g., modeled as a random oracle).
We now describe the construction of our four round secure fuzzy threshold token generation protocol πAny-TFHE for any n and k.
1. Registration
In the registration phase, the following algorithm can executed by a trusted authority:
A trusted authority (e.g., primary user device) can sample a biometric template w from a distribution of a biometric measurement . The trusted authority can also compute a public key pk and a plurality of private key shares ski for a threshold fully homomorphic encryption scheme TFHE, in addition to public parameters ppTS, a verification key vkTS, and a plurality of private key shares skiTS for a threshold signature scheme TS and a string K for a secret key encryption scheme SKE. Using the public key pk, the trusted authority can encrypt the biometric template w to form ciphertext ct0 and encrypt the string K to form ciphertext ct1.
Then, the trusted authority can compute a plurality of values for each electronic device i of n electronic devices (e.g., a first electronic device and other electronic devices). The trusted authority can compute a secret key share sk′i and a verification key share vk′i for a digital signature scheme. The trusted authority can also compute a hash Ki using the string K and a hash function H. The trusted authority can then send to each electronic device Pi the public key pk, the ciphertexts ct0 and ct1, verification key shares (vki′, . . . , vki′), secret key share sk′i, public parameters ppTS, verification key vkTS, private key share skiTS, and hash Ki.
2. SignOn Phase
In the SignOn phase, let's consider party P* that uses input vector U, a message m on which it wants a token. P* interacts with the other parties in the below four round protocol. The arrowhead in Round 1 can denote that in this round messages are outgoing from party P*
A first electronic device, P* can encrypt the input vector u (e.g., the biometric measurement vector) with the public key pk of the threshold fully homomorphic encryption scheme to generate an encrypted biometric measurement ciphertext ct*. The first electronic device can send the encrypted biometric measurement ct* and the message m to each of the other electronic devices. Each of the other electronic devices can compute a partial signature computation σ′i with the ciphertext ct* and the secret key share sk′i. Each electronic can send the partial signature computation σ′i to the first electronic device. The first electronic device can send all of the partial signature computations (σ′1, . . . , σ′n) to all of the other electronic devices.
Each of the other electronic devices can verify each of the partial signature computations (σ′1, . . . , σ′n) with the ciphertext ct* and the received verification keys (vk1′, . . . , vkn′). If any of the partial signature computations are not verified (e.g., Verify(vk′i, ct*, σ′i)≠1), the electronic device can output ⊥, indicating an error. An unverified signature can indicate that one (or more) of the electronic devices did not compute the partial signature computation correctly, and thus may be compromised or fraudulent. After verifying the partial signature computations, each of the other electronic devices can evaluate the ciphertexts ct*, ct0 and ct1 to generate a new ciphertext ct. Evaluating the ciphertexts may include evaluating a circuit C that computes a distance measure between the template w (in ciphertext ct0) and the measurement u (in ciphertext ct*). If the distance measure is less than a threshold d, the circuit C can output the string K (in ciphertext ct1). Each of the other electronic devices can then compute a partial decryption μi of the ciphertext ct. A partial threshold signature token Tokeni can be generated by each of the other electronic devices using the secret key share ski and the message m. A ciphertext i can be computed as a secret key encryption with the partial threshold signature token Tokeni and the hash Ki. The partial decryption μi and the ciphertext i can then be send to the first electronic device.
The first electronic device can homomorphically combine the partial decryptions (μ1, . . . , μn) to recover the string K. Then the first electronic device can compute hash Ki for each electronic device i using the string K and a hash function H, then use the hash Ki to decrypt the secret key encryption ciphertext i to recover the partial threshold signature token Tokeni. With each received partial threshold signature token Tokeni, the first electronic device can combine them to compute a signature token Token. If the first electronic device can verify the token Token and the message m, the first electronic device can output the token Token; otherwise, the first electronic device can output 1.
3. Token Verification
Given a verification key vkTS, message m and token Token, the token verification algorithm outputs 1 if TS.Verify(vkTS, m, Token) outputs 1.
Correctness: The correctness of the protocol directly follows from the correctness of the underlying primitives.
B. Security Proof
In this section, we formally prove Theorem 3.
Consider an adversary A, who corrupts t*, parties where t*<t. The strategy of the simulator Sim, for our protocol πAny-TFHE, against an adversary A, is sketched below.
1. Description of Simulator
Registration Phase: On receiving the first query of the form (“Register”, sid) from a party Pi, the simulator Sim receives the message (“Register”, sid, Pi) from the ideal functionality FDiFuz, which it forwards to the adversary A.
SignOn Phase: Case 1—Honest Party as P*: Suppose that in some session with id sid, an honest party P* that uses an input vector u and a message m for which it wants a token by interacting with a set S consisting of t parties, some of which could be corrupt. Sim gets the tuple (m, S) from the ideal functionality FDiFuz and interacts with the adversary .
In the first round of the sign-on phase, ˜ sends encryptions of 0 under the threshold FHE scheme to each malicious party in the set S Note that this is indistinguishable from the real world by the CPA security of the threshold FHE scheme. In the subsequent rounds, it receives messages from the adversary A on behalf of the corrupt parties on S and also sends messages to the corrupt parties in the set S.
Sim also issues a query of the form (“SignOn”, sid, msg, Pi,T⊆[n]) to the ideal functionality FFTTG, and proceeds as follows:
SignOn Phase: Case 2—Malicious Party as P*: Suppose that in some session with id sid, a malicious party P* that uses an input vector u and a message m for which it wants a token by interacting with a set S consisting of t parties, some of which could be corrupt. Sim again gets the tuple (m, S) from the ideal functionality FDiFuz, and interacts with the adversary .
The simulator Sim receives the measurement u from the adversary A on behalf of the corrupt party P*, and issues a query of the form (“Test Password”, sid, u, Pi) to the ideal functionality FFTTG. It forwards the corresponding response from FFTTG,to the adversary A.
In the first round of the sign-on phase, ˜ receives ciphertexts under the threshold FHE scheme from the adversary A. on behalf of each honest party in the set S. In the subsequent rounds, it sends messages to the adversary A. on behalf of the honest parties in S and also receives messages from the adversary A on behalf of the hones parties in S.
Sim also issues a query of the form (“SignOn”, sid, msg, Pi,T⊆[n]) to the ideal functionality FFTTG and proceeds as follows:
In this section, we show how to construct an efficient four round secure fuzzy threshold token generation protocol in the Random Oracle model for the Euclidean Distance and Cosine Similarity distance measure. Our token generation protocol satisfies Definition 7 for any n with threshold t=3 and is secure against a malicious adversary that can corrupt any one party. We first focus on the Cosine Similarity distance measure. At the end of the section, we explain how to extend our result for Euclidean Distance as well.
Formally we show the following theorem:
Theorem 3 Assuming the existence of threshold signatures, threshold secret sharing, two message oblivious transfer in the CRS model that is secure against malicious adversaries, garbled circuits, a threshold secret sharing scheme, circuit-private additively homomorphic encryption, secret key encryption, and non-interactive zero knowledge arguments for NP languages L1, L2 defined in Section IV.E, there exists a four round secure fuzzy threshold token generation protocol satisfying Definition 7 with respect to the Cosine Similarity distance function. The protocol works for any n, for threshold t=3, and is secure against a malicious adversary that can corrupt any one party.
We know how to build two message OT in the CRS model assuming DDH/LWE/Quadratic Residuosity/Nth Residuosity assumption [21, 22, 23, 24]. The Paillier encryption scheme [14] is an example of a Circuit-private additively homomorphic encryption from the Nth Residuosity assumption. As shown in Section IV.E, we can also build NIZK arguments for languages L1, L2 from the Nth Residuosity assumption in the Random Oracle model. The other primitives can either be built without any assumption or just make use of the existence of one way functions. Thus, instantiating the primitives used in the above theorem, we get the following corollary:
Corollary 4 Assuming the hardness of the Nth Residuosity assumption, there exists a four round secure fuzzy threshold token generation protocol in the Random Oracle model satisfying Definition 7 with respect to the Cosine Similarity distance function. The protocol works for any n, for threshold t=3, and is secure against a malicious adversary that can corrupt any one party.
A. Construction
We first list some notation and the primitives used before describing our construction.
Let d denote the threshold value for the Cosine Similarity function. Let (Share, Recon) be a (2, n) threshold secret sharing scheme. Let the n parties be denoted by P1, . . . , Pn respectively. Let λ denote the security parameter. Let denote the distribution from which the random vector is sampled. Let's assume the vectors are of length where is a polynomial in λ. Each element of this vector is an element of a field F over some large prime modulus q.
Let TS=(TS.Gen, TS.Sign, TS.Combine, TS.Verify) be a threshold signature scheme. Let (SKE.Enc, SKE.Dec) denote a secret key encryption scheme. Let (Garble, Eval) denote a garbling scheme for circuits. Let Sim.Garble denote the simulator for this scheme. Let OT=(OT.Setup, OT.Round1, OT.Round2, OT.Output) be a two message oblivious transfer protocol in the CRS model. Let OT.Sim denote the simulator. Let OT.Sim.Setup denote the algorithm used by the simulator to generate a simulated CRS and OT.Sim. Round2 denote the algorithm used to generate the second round message against a malicious receiver.
Let AHE=(AHE.Setup, AHE.Enc, AHE.Add, AHE.ConstMul, AHE.Dec) be the algorithms of a circuit-private additively homomorphic encryption scheme. Let (NIZK.Prove, NIZK.Verify) denote a non-interactive zero knowledge argument of knowledge system in the RO model. Let RO denote the random oracle. Let NIZK.Sim denote the simulator of this argument system and let NIZK.Ext denote the extractor. Let PRF denote a pseudorandom function that takes inputs of length .
We now describe the construction of our four round secure fuzzy threshold token generation protocol πCS for Cosine Similarity.
1. Registration
In the registration phase, the following algorithm is executed by a trusted authority. The trusted authority may be a device that will participate in subsequent biometric matching and signature generation. If the trusted authority will potentially participate in biometric matching and signature generation, it can delete the information that it should not have at the end of the registration phase. That is, it should delete the shares corresponding to the other devices.
The trusted device samples a biometric template {right arrow over (w)} from a distribution . can be the raw biometric data that is collected by a biometric sensor. For simplicity, assume that IV is normalized. The trusted device then generates a public key, shares of a secret key and any other needed public parameters for a threshold signature scheme, following any suitable key sharing algorithm. Examples of key sharing algorithms include Shamir's secret sharing.
The ith device, of the n total devices, receives the public parameters, ppTA, the public key, vkTA, and the ith share of the secret key, skiTA for a secret key encryption scheme. Then, the trusted device can compute ({right arrow over (w)}i,{right arrow over (v)}i), where {right arrow over (w)}i is the ith share of the biometric template. The trusted device can also calculate shares of the public key and secret key for an additively homomorphic encryption scheme, (ski, pki). Then the trusted device can encrypt each of the l components of {right arrow over (w)}i as cti,j. The ith device receives its shares of the template, secret key, private key, and encrypted values, as well as the random factor used in the encryption. All of the other devices receive the complement of the template share, the public key share, and the encrypted template, without the random factor. Thus by the end of the registration phase, each device has the full public key and the full encrypted template.
2. Setup
Set up can also be done by a trusted authority. The trusted authority may be the same trusted device that completed the registration. If the trusted authority will potentially participate in biometric matching and signature generation, it can delete the information that it should not have at the end of the registration phase. That is, it should delete the shares corresponding to the other devices.
For each i∈[n], the setup algorithm does the following:
In the setup phase, the primary device can generate and distribute a number of constants that can be used later for partial computations. The trusted device can generate a common reference string for oblivious transfer set up, and a plurality of random keys for a pseudorandom function. The ith device receives the ith crs, and all other parties receive the ith crs and the ith set of random keys.
3. SignOn
In the SignOn phase, let's consider party Pi that uses an input vector {right arrow over (u)} and a message m on which it wants a token. Pi picks two other parties Pj and Pk and interacts with them in the below four round protocol. In the SignOn phase, a biometric measurement is matches to a template and a signature is generated for an authentication challenge. A primary device, Pi, selects two other devices of the n total devices, Pj and Pk to participate. The sign on phase happens with four rounds of communication. The primary device has a measured biometric {right arrow over (u)} and a message m on which it wants a token. The arrowhead on Round 1 can denote that in this round messages are outgoing from party Pi.
Round 1: (Pi→) Party Pi does the following:
i. Let {right arrow over (u)}=(u1, . . . , ul).
ii. Let =(Pj, Pk) with j<k.
iii. For each j∈[], compute the following:
iv. To both parties in , send msg1=(, m, {ct1,j, ct2,j, ct3,j, π1,j, π2,j, π1,j, π2,j,).
Round 1: For each component of the measurement vector, Pi makes a series of computations and non-interactive zero knowledge proofs on those computations. First it encrypts the component and generates a proof that the encryption is valid. Then it homomorphically computes the product of the component with itself as part of the proof that {right arrow over (u)}, is normalized. Finally, it computes the product of {right arrow over (u)} with {right arrow over (w)}i for the component, and generates a proof that the multiplication is valid. Then Pi sends the message, the computed values, and the proofs to the other two devices.
Round 2: (→Pi) Both parties Pj and Pk do the following:
i. Abort if any of the proofs {π1,j, π2,j, don't verify.
ii. Generate the following randomness:
iii. Using the algorithms of AHE and using randomness PRF(ki,enc, msg1), compute ctx,1, ctx,2, cty,1, cty,2, ctz,1, ctz,2 as encryptions of the following:
iv. Send (ctx,2, cty,2, ctz,1, ctz,2) to Pi.
Round 2: Both parties Pj and Pk verify the proofs. If any proofs aren't valid, they can abort the protocol. This ensures that Pi is trustworthy and did not send an invalid value for {right arrow over (u)} to attempt to force a match. Then each device uses the random keys provided in the Setup phase to generate pseudorandom values a, b, c, d, p, q, and rz. Because they received the same random keys associated with a message from Pi, they will generate the same random values.
They can then use the additively homomorphic encryption algorithms to compute some values, and one-time message authentication codes (MACs) for those values. The values are: the inner product of the measurement with Pi's share of the template (x1), the inner product of the measurement with itself (y1), and the inner product of the measurement with the complement {right arrow over (v)}1 plus the random value rz (z1). The associated MACs are x2, y2, and z2. The inner product with Pi's share of the template can be done because that was sent as a ciphertext component wise, and then reconstructed as inner products on the full vector because of the homomorphic addition. The one-time MACs provide another check on Pi. Even if Pi attempts to change the computed values to force a match, the MAC's will no longer correspond to the computed values. Then each party sends everything except for x1 and y1 to Pi.
Round 3: (Pi→) To each party in , party Pi does the following:
i. Abort if the tuples sent by both Pj and Pk in round 2 were not the same.
ii. Compute x1={right arrow over (u)},{right arrow over (w)}i, x2=AHE.Dec(ski, ctx,2).
iii. Compute y1={right arrow over (u)},{right arrow over (u)}, y2=AHE.Dec(ski, cty,2).
iv. Compute z1=AHE.Dec(ski, ctz,1), z2=AHE.Dec(ski, ctz,2).
v. Generate and send msg3={ots,tRec=OT.Round1(crsi, st; rs,tot)}s∈{x,y,z},t∈{1,2} where rs,tot is picked uniformly at random.
Round 3: Pi can compare the tuples sent by the other two parties. If they do not match, Pi can abort the protocol. This check is to ensure that Pj and Pk are trustworthy. Because they began with the same random keys, all of their calculations should be the same. We assume that the two devices did not collude to send invalid values because the devices are presumed to only communicate with the primary device. Then Pi computes x1 and y1 for itself, and decrypts the messages containing z1, x2, y2, and z2. Then Pi sends an oblivious transfer message to each of the other two devices to pass garbled inputs for the garbled circuit used in Round 4.
Round 4: (P1→Pt) Party Pj does the following:
i. Compute rC=PRF(ki,C, msg1).
ii. Compute {tilde over (C)}=Garble(C; rC) for the circuit C described in
iii. Let {rs,tot}s∈{x,y,z},t∈{0,1}=PRF(ki,ot, msg3).
iv. For each s∈{x, y, z} and each t∈{0,1}, let labs,t0, labs,t1 denote the labels of the garbled circuit {tilde over (C)} corresponding to input wires st. Generate ots,tSen=OT.Round2(crsi, labs,t0, labs,t1, ots,tRec; rs,tot).
v. Let otSen={ots,tSen}s∈{x,y,z},t∈{1,2}
vi. Pick a random string Pad=PRF(ki,C, msg3).
vii. Set OneCTj=SKE.Enc(Pad, TS.Sign(skjTS, m)).
viii. Send ({tilde over (C)}, otSen, OneCTj)) to Pij.
Round 4: (Pk→Pi) Party Pk does the following:
i. Compute {tilde over (C)}, otSen, Pad exactly as done by Pj.
ii. Set OneCTk=SKE.Enc(Pad, TS.Sign(skkTS,m)).
iii. Send (RO({tilde over (C)},otSen), OneCTk) to Pi.
Round 4: Both Pj and Pk can generate a garbled circuit, as shown in
Then Pj computes a partial signature for the secret key encryption scheme, using the string Pad and the secret key share from the registration phase. Pk does the same with its own key share. Then Pj sends to Pi sends the garbled circuit and its partial computation. Pk sends their partial computation and a random oracle hash of the circuit.
Output Computation: Parties Pj, Pk output (m, Pi, ). Additionally, party Pi does the following to generate a token:
i. Let ({tilde over (C)}, otSen, OneCTj) be the message received from Pj and (msg4, OneCTk) be the message received from Pk.
ii. Abort if RO({tilde over (C)}, otSen)≠msg4.
iii. For each s∈{x,y, z} and each t∈{0,1}, compute labs,t=OT.Output(ots,tSen,ots,tRec,rs,tot).
iv. Let lab={labs,t}s∈{x,y,z},t∈{0,1}.
v. Compute Pad=Eval({tilde over (C)}, lab).
vi. Compute Tokenj=SKE.Dec(Pad,OneCTj), Tokenk=SKE.Dec(Pad,OneCTk), Tokeni=TS.Sign(skiTS,m).
vii. Compute Token←TS.Combine({Tokens}s∈{i,j,k}).
viii. Output Token if TS.Verify(vkTS, m, Token). Else, output ⊥.
Output Computation: Pi can now generate a token. It can check if the hash of the circuit sent by Pj the matches the hash sent by Pk, and if not, abort the computation. This ensures that the two other parties are behaving correctly. Pi can then compute the appropriate labels for the garbled circuit and evaluate the circuit to determine a string Pad (if there is a match). Using the string Pad, Pi can decrypt the partial computations sent by the other two parties and do its own partial computation of the signature. Finally, Pi can combine the partial computations and create a complete token.
4. Token Verification
Given a verification key vkTS, message m and token Token, the token verification algorithm outputs 1 if TS.Verify(vkTS, m, Token) outputs 1.
The correctness of the protocol directly follows from the correctness of the underlying primitives.
B. Security Proof
In this section, we formally prove Theorem 3.
Consider an adversary who corrupts a party . The strategy of the simulator Sim for our protocol πCS against a malicious adversary is described below. Note that the registration phase first takes place at the end of which the simulator gets the values to be sent to which it then forwards to .
1. Description of Simulator
Setup: For each i∈[n], Sim does the following:
SignOn Phase: Case 1—Honest Party as Pi
Suppose an honest party Pi that uses an input vector {right arrow over (u)} and a message m for which it wants a token by interacting with a set of two parties, one of which is . The arrowhead in Round 1 can denote that in this round messages are outgoing from the simulator. Sim gets the tuple (m,) from the ideal functionality DiFuz and interacts with the adversary A as below:
SignOn Phase: Case 2—Malicious Party as Pi
Suppose a malicious party is the initiator Pi. Sim interacts with the adversary as below:
2. Hybrids
We now show that the above simulation strategy is successful against all malicious PPT adversaries. That is, the view of the adversary along with the output of the honest parties is computationally indistinguishable in the real and ideal worlds. We will show this via a series of computationally indistinguishable hybrids where the first hybrid Hyb0 corresponds to the real world and the last hybrid Hyb8 corresponds to the ideal world.
1. Hyb0—Real World: In this hybrid, consider a simulator SimHyb that plays the role of the honest parties as in the real world.
When Honest Party is Pi:
2. Hyb1—Case 1: Aborts and Message to Ideal Functionality. In this hybrid, SimHyb aborts if the adversary's messages were not generated in a manner consistent with the randomness output in the setup phase and also runs the query to the ideal functionality.
That is, SimHyb runs the “Message To Ideal Functionality” step as done by Sim after round 4 of Case 1 of the simulation strategy. SimHyb also performs the Abort check step in step 1 of round 3 of Case 1 of the simulation.
3. Hyb2—Case 1: Simulate NIZKs. In this hybrid, SimHyb computes simulated NIZK arguments in round 1 of Case 1 as done by Sim in the ideal world.
4. Hyb3—Case 1: Switch Ciphertexts. In this hybrid, SimHyb computes the ciphertexts in round 1 of Case 1 using random messages as done in the ideal world.
5. Hyb4—Case 1: Switch OT Receiver Messages. In this hybrid, SimHyb computes the OT receiver messages in round 3 of Case 1 using random inputs as done in the ideal world.
When Corrupt Party is Pi:
6. Hyb5—Case 2: Message to Ideal Functionality. In this hybrid, SimHyb runs the “Message To Ideal Functionality” step as done by Sim in round 2 of Case 2 of the simulation strategy. That is, SimHyb queries the ideal functionality using the output of the extractor NIZK. Ext on the proofs given by in round 1.
7. Hyb6—Case 2: Simulate OT Sender Messages. In this hybrid, SimHyb computes the CRS during the setup phase and the OT sender messages in round 4 of Case 2 using the simulator OT.Sim as done in the ideal world.
8. Hyb7—Case 2: Simulate Garbled Circuit. In this hybrid, SimHyb computes the garbled circuit and associated labels in round 4 of Case 2 using the simulator Sim.Garble as done in the ideal world.
9. Hyb8—Case 2: Switch Ciphertexts. In this hybrid, SimHyb computes the ciphertexts in round 2 of Case 2 using random messages as done in the ideal world. This hybrid corresponds to the ideal world.
We will now show that every pair of successive hybrids is computationally indistinguishable.
Lemma 5 Hyb0 is statistically indistinguishable from Hyb1.
Proof. When an honest party initiates the protocol as the querying party Pi, let's say it interacts with parties Pj and Pk such that Pj is corrupt. In Hyb0, on behalf of Pi, SimHyb checks that the messages sent by both parties Pj and Pk are same and if so, computes the output on behalf of the honest party. Since Pk is honest, this means that if the messages sent by both parties are indeed the same, the adversary , on behalf of Pj, did generate those messages honestly using the shared randomness generated in the setup phase and the shared values generated in the registration phase.
In Hyb1, on behalf of Pi, SimHyb checks that the messages sent by the adversary on behalf of Pj were correctly generated using the shared randomness and shared values generated in the setup and registration phases and if so, asks the ideal functionality to deliver output to the honest party. Thus, the switch from Hyb0 to Hyb1 is essentially only a syntactic change.
Lemma 6 Assuming the zero knowledge property of the NIZK argument system, Hyb1 is computationally indistinguishable from Hyb2.
Proof. The only difference between the two hybrids is that in Hyb1, SimHyb computes the messages of the NIZK argument system by running the honest prover algorithm NIZK.Prove(·), while in Hyb2, they are computed by running the simulator NIZK.Sim(·). Thus, we can show that if there exists an adversary that can distinguish between the two hybrids with non-negligible probability, we can design a reduction NIZK distinguish between real and simulated arguments with non-negligible probability thus breaking the zero knowledge property of the NIZK argument system which is a contradiction.
Lemma 7 Assuming the semantic security of the additively homomorphic encryption scheme AHE, Hyb2 is computationally indistinguishable from Hyb3.
Proof. The only difference between the two hybrids is that in Hyb2, SimHyb computes the ciphertexts in round 1 by encrypting the honest party's actual inputs ({right arrow over (u)}, {right arrow over (w)}i), while in Hyb3, the ciphertexts encrypt random messages. Thus, we can show that if there exists an adversary that can distinguish between the two hybrids with non-negligible probability, we can design a reduction AHE that can distinguish between encryptions of the honest party's actual inputs and encryptions of random messages with non-negligible probability thus breaking the semantic security of the encryption scheme AHE which is a contradiction.
Lemma 8 Assuming the security of the oblivious transfer protocol OT against a malicious sender, Hyb3 is computationally indistinguishable from Hyb4.
Proof. The only difference between the two hybrids is that in Hyb3, SimHyb computes the OT receiver's messages as done by the honest party in the real world, while in Hyb4, the OT receiver's messages are computed by using random messages as input. Thus, we can show that if there exists an adversary that can distinguish between the two hybrids with non-negligible probability, we can design a reduction OT that can distinguish between OT receiver's messages of the honest party's actual inputs and random inputs with non-negligible probability thus breaking the security of the oblivious transfer protocol OT against a malicious sender which is a contradiction.
Lemma 9 Assuming the argument of knowledge property of the NIZK argument system, Hyb4 is computationally indistinguishable from Hyb5.
Proof. The only difference between the two hybrids is that in Hyb5, SimHyb also runs the extractor NIZK. Ext on the proofs given y the adversary to compute its input U. Thus, the only difference between the two hybrids is if the adversary can produce a set of proofs {πj} such that, with non-negligible probability, all of the proofs verify successfully, but SimHyb fails to extract {right arrow over (u)} and hence SimHyb aborts.
However, we can show that if there exists an adversary that can this to happen with non-negligible probability, we can design a reduction NIZK that breaks the argument of knowledge property of the system NIZK with non-negligible probability which is a contradiction.
Lemma 10 Assuming the security of the oblivious transfer protocol OT against a malicious receiver, Hybs is computationally indistinguishable from Hyb6.
Proof. The only difference between the two hybrids is that in Hyb5, SimHyb computes the OT sender's messages by using the actual labels of the garbled circuit as done by the honest party in the real world, while in Hyb6, the OT sender's messages are computed by running the simulator OT.Sim. In Hyb6, the crs in the setup phase is also computed using the simulator OT.Sim.
Thus, we can show that if there exists an adversary that can distinguish between the two hybrids with non-negligible probability, we can design a reduction OT that can distinguish between the case where the crs an OT sender's messages were generated by running the honest sender algorithm from the case where the crs and the OT sender's messages were generated using the simulator OT.Sim with non-negligible probability thus breaking the security of the oblivious transfer protocol OT against a malicious receiver which is a contradiction.
Lemma 11 Assuming the correctness of the extractor NIZK. Ext and the security of the garbling scheme, Hyb6 is computationally indistinguishable from Hyb7.
Proof. The only difference between the two hybrids is that in Hyb6, SimHyb computes the garbled circuit by running the honest garbling algorithm Garble using honestly generated labels, while in Hyb7, SimHyb computes a simulated garbled circuit and simulated labels by running the simulator Sim.Garble on the value output by the ideal functionality. From the correctness of the extractor NIZK. Ext, we know that the output of the garbled circuit received by the evaluator () in Hyb6 is identical to the output of the ideal functionality used in the ideal world. Thus, we can show that if there exists an adversary that can distinguish between the two hybrids with non-negligible probability, we can design a reduction Garble that can distinguish between an honestly generated set of input wire labels and an honestly generated garbled circuit from simulated ones with non-negligible probability thus breaking the security of the garbling scheme which is a contradiction.
Lemma 12 Assuming the circuit privacy property of the additively homomorphic encryption scheme AHE, Hyb7 is computationally indistinguishable from Hyb8.
Proof. The only difference between the two hybrids is that in Hyb7, SimHyb computes the ciphertexts sent in round 2 by performing the homomorphic operations on the adversary's well-formed ciphertexts sent in round 1 exactly as in the real world, while in Hyb3, SimHyb generates ciphertexts that encrypt random messages. Thus, we can show that if there exists an adversary that can distinguish between the two hybrids with non-negligible probability, we can design a reduction AHE that can break the circuit privacy of the circuit private additively homomorphic encryption scheme AHE which is a contradiction.
C. Euclidean Distance
Recall that given two vectors {right arrow over (u)}, {right arrow over (w)}, the square of the Euclidean Distance EC.Dist between them relates to their Cosine Similarity CS.Dist as follows:
EC.Dist({right arrow over (u)},{right arrow over (w)})=({right arrow over (u)},{right arrow over (u)}+{right arrow over (w)},{right arrow over (w)}−2·CS.Dist({right arrow over (u)},{right arrow over (w)}))
Thus, it is easy to observe that the above protocol and analysis easily extends for Euclidean Distance too.
In this section, we show how to efficiently construct a fuzzy threshold token generation protocol for cosine similarity using any depth-1 FHE scheme with threshold decryption.
A. Construction
We first list some notation and the primitives used before describing our construction.
Let IP be a function that takes as input a template w, a measurement u and outputs the inner product w, u, and let d denote the threshold inner-product value to denote a match between w and u. Let the n parties be denoted by P1, . . . , Pn respectively. Let λ denote the security parameter. Let W denote the distribution from which a random template vector is sampled. Let's assume the vectors are of length where is a polynomial in λ. Let C be a circuit that takes as input a template w, a measurement u and some string K, and outputs K if the distance between w and u is below some (pre-defined) threshold; otherwise it outputs 0.
Let TFHE=(TFHE.Gen, TFHE.Enc, TFHE.PartialDec, TFHE.Eval, TFHE.Combine) be a depth-1 threshold FHE scheme. Let TS=(TS.gen, TS.Sign, TS.Combine, TS.Verify) be a threshold signature scheme. Let SKE=(SKE, Gen, SKE.Enc, SKE.Dec) denote a secret key encryption scheme. Let PKE=(PKE.Gen, PKE.Enc, PKE.Dec) denote a public key encryption scheme.
Let (Gen, Sign, Verify) be a strongly-unforgeable digital signature scheme. Let NIZK=(NIZK.Setup, NIZK.Prove, NIZK.Verify) denote a non-interactive zero knowledge argument. Let H be a collision-resistant hash function (modeled later as a random oracle).
We now describe the construction of our six round secure fuzzy threshold token generation protocol πIP for any n and k.
1. Registration
In the registration phase, the following algorithm is executed by a trusted authority:
A trusted authority (e.g., primary user device) can sample a biometric template w from a distribution of a biometric measurement . The trusted authority can also compute a public key pk and a plurality of private key shares ski for a threshold fully homomorphic encryption scheme TFHE, in addition to public parameters ppTS, a verification key vkTS, and a plurality of private key shares skiTS for a threshold signature scheme TS. The trusted authority can also compute a public key and a secret key for a public key encryption scheme PKE and a string K for a secret key encryption scheme SKE. Using the public key pk, the trusted authority can encrypt the biometric template w to form ciphertext ct0 and encrypt the string K to form ciphertext ct1.
Then, the trusted authority can compute a plurality of values for each electronic device i of n electronic devices (e.g., a first electronic device and other electronic devices). The trusted authority can compute a secret key share sk′i and a verification key share vk′i for a digital signature scheme. The trusted authority can also compute a hash Ki using the string K and a hash function H. The trusted authority can then send to each electronic device Pi the public key pk, the ciphertexts ct0 and ct1, verification key shares (vk′i, . . . , vk′i), secret key share sk′i, public parameters ppTS, verification key vkTS, private key share skiTS, and hash Ki.
2. SignOn Phase
In the SignOn phase, let's consider party P* that uses input vector u, a message m on which it wants a token. P* interacts with the other parties in the below four round protocol. The arrowhead in round 1 can denote that in this round messages are outgoing from party P*.
A first electronic device, P* can encrypt the input vector u (e.g., the biometric measurement vector) with the public key pk of the threshold fully homomorphic encryption scheme to generate an encrypted biometric measurement ciphertext ct*. The first electronic device can send the encrypted biometric measurement ct* and the message m to each of the other electronic devices. Each of the other electronic devices can compute a partial signature computation σ′i with the ciphertext ct* and the secret key share sk′i. Each electronic can send the partial signature computation σ′i to the first electronic device. The first electronic device can send all of the partial signature computations (σ′1, . . . , σ′n) to all of the other electronic devices.
Each of the other electronic devices can verify each of the partial signature computations (σ′1, . . . , σ′n) with the ciphertext ct* and the received verification keys (vk1′, . . . , vkn′). If any of the partial signature computations are not verified (e.g., Verify(vk′i, ct*, σ′i)≠1), the electronic device can output ⊥, indicating an error. An unverified signature can indicate that one (or more) of the electronic devices did not compute the partial signature computation correctly, and thus may be compromised or fraudulent. After verifying the partial signature computations, each of the other electronic devices can evaluate the ciphertexts ct*, and ct0 to generate a new ciphertext ct. Evaluating the ciphertexts may include computing an inner product between the template w (in ciphertext ct0) and the measurement u (in ciphertext ct*). The inner product can then be used, for example, to compute a cosine similarity distance measure or a Euclidean distance. Each of the other electronic devices can then compute a partial decryption μi of the ciphertext ct. The partial decryption μi can then be send to the first electronic device.
The first electronic device can combine the partial decryptions (μ1, . . . , μn) using threshold fully homomorphic encryption to recover the inner product IP(w, u) of the template w and the measurement u. The first electronic device can encrypt the inner product IP(w, u) with a public key encryption scheme to generate a ciphertext C0, and encrypt each partial decryption μi to generate a plurality of ciphertexts Ci. The first electronic device can also generate a non-interactive zero knowledge proof π. The proof π can be a proof that C0 encrypts an inner product μ0 and each ciphertext Ci encrypts a value μi. The proof π can also state that the inner product (and thus the distance measure) is less than a threshold d, and that the inner product μ0 is the result of the combination of the partial decryptions (μ1, . . . , μn). The first electronic device can then send to each of the other electronic devices a tuple with the ciphertexts and the proof (C0, C1, . . . , Cn, π).
Each of the other electronic devices can verify the proof π. If the proof is not verified, the electronic device can output ⊥ and abort. If the proof π is verified, the electronic device can compute a partial decryption μi,1 of ct1, the encryption of the string K. A partial threshold signature token Tokeni can be generated by each of the other electronic devices using the secret key share ski and the message m. A ciphertext i can be computed as a secret key encryption with the partial threshold signature token Tokeni and the hash Ki. The partial decryption μi,1 and the ciphertext i can then be send to the first electronic device.
The first electronic device can homomorphically combine the partial decryptions (μ1,1, . . . , μn,1) to recover the string K. Then the first electronic device can compute hash Ki for each electronic device i using the string K and a hash function H, then use the hash Ki to decrypt the secret key encryption ciphertext to recover the partial threshold signature token Tokeni. With each received partial threshold signature token Tokeni, the first electronic device can combine them to compute a signature token Token. If the first electronic device can verify the token Token and the message m, the first electronic device can output the token Token; otherwise, the first electronic device can output ⊥. Token Verification
Given a verification key vkTS, message m and Token, the token verification algorithm outputs 1 if TS.Verify(vkTS, m, Token) outputs 1.
B. Security Analysis
We prove informal Theorem 3 here. In fact, the proof is very similar to the proof of the general case using TFHE. So we omit the details and only provide a brief sketch. Note that, this protocol realizes FTTG with a slightly weaker guarantee as defined by the leakage functions. In particular, the leakage functions Lc, Lf, Lm return the exact inner product of the template w and the measurement u. This is because of the fact that, in the protocol, (viz. the Sign On phase), the initiator gets to know this value. In the simulation this comes up when the Sign On session is initiated by a corrupt party. In that case, the simulator issues a “Test Password” query on the input of the initiator and learns the inner product that it sends to the adversary then. Other steps are mostly the same as that of the simulator of the generic protocol and therefore we omit the details.
In this section, we show how to construct an efficient two round secure fuzzy threshold token generation protocol in the Random Oracle model for the distance measure being Hamming Distance. Our token generation protocol satisfies Definition 7 for any n, t and is secure against a malicious adversary that can corrupt up to (t−1) parties.
Formally, we show the following theorem:
Theorem 5 Assuming the existence of threshold signatures as defined in Section III.E, threshold linear secret sharing, robust linear secret sharing, secret key encryption, UC-secure threshold oblivious pseudorandom functions, and collision resistant hash functions, there exists a two round secure fuzzy threshold token generation protocol satisfying Definition 7 for the Hamming Distance function. The protocol works for any n, any threshold t, and is secure against a malicious adversary that can corrupt up to (t−1) parties.
The threshold oblivious PRF can be built assuming the Gap Threshold One-More Diffie-Hellman assumption in the Random Oracle model [25]. The threshold signature schemes we use can be built assuming either DDH/RSA [16, 17]. Collision resistant hash functions are implied by the Random Oracle model. We will use the Shamir's secret sharing scheme to instantiate the robust secret sharing scheme as described in Imported Lemma 1. The other primitives can be built either unconditionally or assuming just one way functions. Thus, instantiating the primitives used in the above theorem, we get the following corollary:
Corollary 6 Assuming the hardness of ∈{Gap DDH, RSA} and Gap Threshold One-More Diffie-Hellman (Gap-TOMDH), there exists a two round secure fuzzy threshold token generation protocol in the Random Oracle model satisfying Definition 7 for the Hamming Distance function. The protocol works for any n, any threshold t, and is secure against a malicious adversary that can corrupt upto (t−1) parties.
A. Construction
We first list some notation and the primitives used before describing our construction.
Let the n parties be denoted by P1, . . . , Pn respectively. Let λ denote the security parameter. Let denote the distribution from which the random vector is sampled. Let's assume the vectors are of length where is a polynomial in λ. Each element of this vector is an element of a field F over some large prime modulus q. Let d denote the threshold value for the Hamming Distance function. That is, two vectors {right arrow over (w)} and {right arrow over (u)} of length each can be considered close if their Hamming Distance is at most (−d)—that is, they are equal on at least d positions.
Let (Share, Recon) be a (n, t) linear secret sharing scheme. Let (RSS.Share, RSS.Recon, Thres.Recon) be a
robust linear secret sharing scheme as defined in Appendix A. That is, the secret can be reconstructed by running algorithm Thres.Recon given either exactly d honestly generated shares or by running algorithm RSS.Recon given a collection of shares of which
are honestly generated. Let TS=(TS.Gen, TS.Sign, TS.Combine, TS.Verify) be the threshold signature scheme of Boldyreva [16]. We note that a similar construction also works for other threshold signature schemes. Without loss of generality and for simplicity, we present our scheme here using the construction of Boldyreva [16].
Let (SKE.Enc, SKE.Dec) denote a secret key encryption scheme. Let TOPRF=(TOPRF.Setup, TOPRF.Encode, TOPRF.Eval, TOPRF.Combine) denote a UC-secure threshold oblivious pseudorandom function in the Random Oracle (RO) model. Let TOPRF.Sim=(TOPRF.Sim.Encode, TOPRF.Sim.Eval, TOPRF.Sim.Ext) denote the simulator of this scheme where the algorithm TOPRF.Sim.Encode is used to generate simulated encodings, TOPRF.Sim.Eval is used to generate simulated messages on behalf of algorithm TOPRF.Eval. Algorithm TOPRF.Sim.Ext extracts the message being encoded based on the queries made to RO during the TOPRF.Combine phase. Let's model a collision resistant hash function H via a Random Oracle.
We now describe the construction of our two round secure threshold fuzzy token generation protocol for Hamming Distance.
1. Registration
In the registration phase, the following algorithm is executed by a trusted authority:
2. Setup
The setup algorithm does nothing.
3. SignOn Phase
In the SignOn phase, let's consider party P* that uses an input vector {right arrow over (u)}=(u1, . . . , ) and a message m on which it wants a token. P* interacts with the other parties in the below two round protocol.
Strategy 1:
Matches
Strategy 2: Only d Matches
4. Token Verification
Given a verification key vkTS, message m and token Token, the token verification algorithm outputs 1 if TS.Verify(vkTS, m, Token) outputs 1.
The correctness of the protocol directly follows from the correctness of the underlying primitives.
B. Security Proof
In this section, we formally prove Theorem 5.
Consider an adversary who corrupts t* parties where t*<t. The strategy of the simulator Sim for our protocol πHD against a malicious adversary is described below. Note that the registration phase first takes place at the end of which the simulator gets the values to be sent to every corrupt party which it then forwards to .
1. Description of Simulator
SignOn Phase: Case 1—Honest Party as P*
Suppose an honest party P* that uses an input vector {right arrow over (u)}=(u1, . . . , ul) and a message m for which it wants a token by interacting with a set consisting of t parties, some of which could be corrupt. Sim gets the tuple (m,) from the ideal functionality DiFuz and interacts with the adversary as below:
SignOn Phase: Case 2—Malicious Party as P*
Suppose a malicious party is the initiator P*. Sim interacts with the adversary as below:
2. Hybrids
We now show that the above simulation strategy is successful against all malicious PPT adversaries. That is, the view of the adversary along with the output of the honest parties is computationally indistinguishable in the real and ideal worlds. We will show this via a series of computationally indistinguishable hybrids where the first hybrid Hyb0 corresponds to the real world and the last hybrid Hyb7 corresponds to the ideal world.
1. Hyb0—Real World: In this hybrid, consider a simulator SimHyb that plays the role of the honest parties as in the real world.
When Honest Party is P*:
2. Hyb1—Case 1: Simulate TOPRF Encoding. In this hybrid, SimHyb computes the round 1 message on behalf of P* by running the simulator TOPRF.Sim.Encode(·) to compute the encoding ci for each i∈[] as done in round 1 of the ideal world.
3. Hyb2—Case 1: Message to Ideal Functionality. In this hybrid, SimHyb runs the “Message To Ideal Functionality” step as done by Sim after round 2 of Case 1 of the simulation strategy instead of computing the output as done by the honest party P* in the real world.
When Corrupt Party is P*:
4. Hyb3—Case 2: Message to Ideal Functionality. In this hybrid, SimHyb runs the “Message To Ideal Functionality” step as done by Sim after round 2 of Case 2 of the simulation strategy. That is, SimHyb runs the extractor TOPRF.Sim.Ext to compute {right arrow over (u)}* and queries the ideal functionality with this.
5. Hyb4—Case 2: Simulate TOPRF Evaluation. In this hybrid, in round 2, SimHyb computes the TOPRF evaluation responses by running the algorithm TOPRF.Sim.Eval as done in the ideal world.
6. Hyb5—Case 2: =⊥. In this hybrid, when the output from the ideal functionality =⊥, on behalf of every honest party Pj, SimHyb sets the exponent of each yi,j to be a uniformly random value Ri,j instead of (ski,j+ri,j).
7. Hyb6—Case 2: =⊥. In this hybrid, when the output from the ideal functionality =⊥, instead of computing the ciphertext cti,j as before, SimHyb picks cti,j uniformly at random and responds to the Random Oracle query as in the ideal world to set the decrypted plaintext.
8. Hyb7—Case 2: ≠⊥. In this hybrid, when the output from the ideal functionality ≠⊥, SimHyb computes the ciphertexts cti,j and the responses to the RO queries exactly as in the ideal world. This hybrid corresponds to the ideal world.
We will now show that every pair of successive hybrids is computationally indistinguishable.
Lemma 13 Assuming the security of the threshold oblivious pseudorandom function TOPRF, Hyb0 is computationally indistinguishable from Hyb1.
Proof. The only difference between the two hybrids is that in in Hyb0, SimHyb computes the value ci for each i∈[] by running the honest encoding algorithm TOPRF.Encode while in Hyb1, it computes them by running the simulated encoding algorithm TOPRF.Sim.Encode. Suppose there exists an adversary that can distinguish between the two hybrids with non-negligible probability. We can use to construct an adversary TOPRF that can distinguish between the real and simulated encodings with non-negligible probability and thus break the security of the TOPRF which is a contradiction.
Lemma 14 Assuming the correctness of the threshold oblivious pseudorandom function TOPRF, correctness of the private key encryption scheme and correctness of the robust secret sharing scheme, Hyb1 is computationally indistinguishable from Hyb2.
Proof. The difference between the two hybrids is that in Hyb2, SimHyb checks whether the adversary did indeed computes its messages honestly and if not, aborts. However, in Hyb1, SimHyb aborts only if the output computation phase did not succeed. Thus, we can observe that assuming the correctness of the primitives used—namely, the threshold oblivious pseudorandom function TOPRF, the private key encryption scheme and the robust secret sharing scheme, Hyb1 is computationally indistinguishable from Hyb2.
Lemma 15 Assuming the correctness of the extractor TOPRF.Sim.Ext of the threshold oblivious pseudorandom function TOPRF, Hyb2 is computationally indistinguishable from Hyb3.
Proof. The only difference between the two hybrids is that in Hyb3, SimHyb also runs the extractor TOPRF.Sim.Ext based on the adversary's queries to the random oracle RO to extract the adversary's input {right arrow over (u)}*. Thus, the only difference in the adversary's view is if SimHyb aborts with non-negligible probability because the extractor TOPRF.Sim.Ext aborts with non-negligible probability. Thus, we can show that assuming the correctness of the extractor TOPRF.Sim.Ext of the threshold oblivious pseudorandom function TOPRF, Hyb2 is computationally indistinguishable from Hyb3.
Lemma 16 Assuming the security of the threshold oblivious pseudorandom function TOPRF, Hyb3 is computationally indistinguishable from Hyb4.
Proof. The only difference between the two hybrids is that for every honest party Pj, in Hyb3, SimHyb computes the value zi,j for each i∈[] by running the honest evaluation algorithm TOPRF.Eval while in Hyb4, it computes them by running the simulated evaluation algorithm TOPRF.Sim.Eval. Suppose there exists an adversary that can distinguish between the two hybrids with non-negligible probability. We can use to construct an adversary TOPRF that can distinguish between the real and simulated evaluation responses with non-negligible probability and thus break the security of the TOPRF which is a contradiction.
Lemma 17 Assuming the correctness of the threshold oblivious pseudorandom function, security of the private key encryption scheme, security of the threshold linear secret sharing scheme and the security of the robust linear secret sharing scheme, Hyb4 is computationally indistinguishable from Hyb5.
Proof. In the scenario where the output from the ideal functionality =⊥, we know that the adversary's input {right arrow over (u)}* matches with the vector {right arrow over (w)} in at most (d−1) positions. Therefore, from the correctness of the TOPRF scheme, for each honest party Pj, the adversary learns the decryption key for at most (d−1) of the ciphertexts . Therefore, from the security of the secret key encryption scheme, the adversary learns at most (d−1) of the plaintexts .
Now, in Hyb4, for every party Pj, for every i∈[] for which the adversary learns the plaintext, the exponent in the plaintext yi,j is of the form (ski,j+rt,j) where ri,j is a secret sharing of 0 while in Hyb5, the exponent is picked uniformly at random. Therefore, since the secret sharing scheme statistically hides the secret as long as at most (d−1) shares are revealed, we can show that if there exists an adversary that can distinguish between the two hybrids with non-negligible probability, we can use to construct an adversary Share that can distinguish between a real set of (d−1) shares and a set of (d−1) random values with non-negligible probability, thus breaking the security of the secret sharing scheme which is a contradiction.
Lemma 18 Hyb5 is statistically indistinguishable from Hyb6.
Proof Notice that in going from Hybs to Hyb6, we only make a syntactic change. That is, instead of actually encrypting the desired plaintext at the time of encryption, we use the random oracle to program in the decryption key in such a way as to allow the adversary to decrypt and learn the same desired plaintext.
Lemma 19 Hyb6 is statistically indistinguishable from Hyb7.
Proof In the scenario where the output from the ideal functionality =⊥, we know that the adversary's input {right arrow over (u)}* matches with the vector {right arrow over (w)} in at least d positions. For every honest party Pj, the adversary learns the plaintexts {yi,j} for every position i that has a match.
Now, in Hyb6, for every party Pj, for every i∈[] for which the adversary learns the plaintext, the exponent in the plaintext yi,j is of the form (ski,j+ri,j) where ri,j is a secret sharing of 0. However, in Hyb7, the ski,j values in the exponent are picked not to be secret shares of the threshold signing key share skjTS, but picked in such a manner that the adversary recovers the same output. There is no other difference between the two hybrids and it is easy to observe that the difference between the two hybrids is only syntactic and hence they are statistically indistinguishable.
Secure sketches are fundamental building blocks of fuzzy extractor. Combining any information theoretic secure sketch with any threshold oblivious PRF we construct a FTTG protocol. Due to the information theoretic nature of secure sketch this protocol has a distinct feature, that is the probability of succeeding in an offline attack stays the same irrespective of the computational power of the attacker or the number of brute-force trials. However, if a party initiates the protocol with a “close” measurement it recovers the actual template. Also, due to the same reason the template cannot be completely hidden. In other words, this protocol has an inherent leakage on the template incurred by the underlying secure sketch instantiation. The first distinct characteristic is easily captured under the current definition setting the functions Lc to return the correct template and Lf, Lm to return ⊥ on all inputs. To capture the leakage from template we introduce a new query to the ideal functionality.
Consider another extra parameter, the leakage function Ltmp: →{0,1}. On receiving a query of the form (“Leak on Template”, sid, ) from S, first check whether there is a tuple (sid, , wi) recorded, if it is not found then do nothing, otherwise reply with (“Leakage”, (sid, Ltmp(wi)) to S. The extended ideal functionality that is the same as FTTG except it has this additional query is called FTTG*.
Now we provide a protocol below that realizes FTTG* with some reasonable leakage. For simplicity we only present the protocol only secure against semi-honest adversary. Adding generic NIZK proofs it is possible to make it secure against malicious adversary.
A. Setup
In the Setup phase, the following algorithms are executed by a trusted authority:
B. Registration for Pi
Party Pi interacts with everyone else to register its own template:
C. SignOn Phase
In the SignOn phase, let's consider party Pi that uses an input vector u and a message m on which it wants a token. Pi picks a set S of t−1 other parties {Pj}j∈S and Pk and interacts with them in the below four round protocol. The arrowhead in round 1 can denote that in this round messages are outgoing from party Pi
D. Token Verification
Given a verification key vkTS, message m and token Token, the token verification algorithm outputs 1 if TS.Verify(vkTS, m, Token) outputs 1.
Correctness: The correctness of the protocol directly follows from the correctness of the underlying primitives.
E. Security Analysis
We show that the above protocol realizes the extended ideal functionality FTTG* in presence of semi-honest], static adversary that corrupts up to t−1 parties. In particular we formally prove (informal) Theorem 5 in this section. We prove that by constructing a polynomial time simulator as follows:
The Simulator S:
During the Setup the simulator generates the signing key pairs of a threshold signatures and forward the secret-key shares to the corrupt parties. It also generates the TOPRF keys and forwards the key shares to the corrupt parties. It also generates the parameters for the extractable commitment and keeps the trapdoor secret.
After the registration is done the simulator makes a “Leak on Template” query to obtain some leakage on the template and using that simulate a fake sketch for the corrupt parties.
To simulate the sign on phase it works as follows depending on whether the initiator is corrupt or not. If the initiator is honest, then it can easily simulate the honest parties view by using their shares of TOPRF correctly and using a dummy measurement. If the initiator is corrupt, then it extracts the input measurement from the extractable commitment Com(u) and then makes a “Test Password” query to the ideal functionality FTTG* with that. If u is close to the template then it successfully registers a signature which it then returns to the initiator in a threshold manner (can be done easily by adding secret-shares of zeros).
We argue that for any PPT adversary , the above simulator successfully simulates a view in the ideal world that is computationally indistinguishable from the real world. From the above description notice a few results.
The registration is perfectly simulated due to access of the leakage on template query. Note that, without this access it would have been impossible to simulate this step. To demonstrate that we mention a simple attack when the attacker registers a dummy template, say string of 0s. Then there is no guarantee form secure sketch. So, without the leakage access, the simulator would not have any clue about the template. However, given the leakage access, the simulator can reconstruct the entire template (as there is no entropy and the information can be compressed within the allowed leakage bound).
The sign-on phase, which is initiated from the honest party is simulated correctly due to the template privacy guarantee provided by the underlying TOPRF. Note that, in this case the adversary does not learn the signature and hence in the simulation it is not required to register a correct signature with the ideal functionality. In the other case, when a corrupt party initiates the session, the extractability of the commitment scheme guarantees that, the extracted commitment is indeed the correct input of the attacker and then using the “Test Password” query the simulator is able to generate a signature correctly in case a close match is guessed correctly.
Any of the computer systems mentioned herein may utilize any suitable number of subsystems. Examples of such subsystems are shown in
The subsystems shown in
A computer system can include a plurality of the same components or subsystems, e.g., connected together by external interface 81, by an internal interface, or via removable storage devices that can be connected and removed from one component to another component. In some embodiments, computer systems, subsystem, or apparatuses can communicate over a network. In such instances, one computer can be considered a client and another computer a server, where each can be part of a same computer system. A client and a server can each include multiple systems, subsystems, or components.
Aspects of embodiments can be implemented in the form of control logic using hardware circuitry (e.g. an application specific integrated circuit or field programmable gate array) and/or using computer software with a generally programmable processor in a modular or integrated manner. As used herein, a processor can include a single-core processor, multi-core processor on a same integrated chip, or multiple processing units on a single circuit board or networked, as well as dedicated hardware. Based on the disclosure and teachings provided herein, a person of ordinary skill in the art will know and appreciate other ways and/or methods to implement embodiments of the present invention using hardware and a combination of hardware and software.
Any of the software components or functions described in this application may be implemented as software code to be executed by a processor using any suitable computer language such as, for example, Java, C, C++, C#, Objective-C, Swift, or scripting language such as Perl or Python using, for example, conventional or object-oriented techniques. The software code may be stored as a series of instructions or commands on a computer readable medium for storage and/or transmission. A suitable non-transitory computer readable medium can include random access memory (RAM), a read only memory (ROM), a magnetic medium such as a hard-drive or a floppy disk, or an optical medium such as a compact disk (CD) or DVD (digital versatile disk), flash memory, and the like. The computer readable medium may be any combination of such storage or transmission devices.
Such programs may also be encoded and transmitted using carrier signals adapted for transmission via wired, optical, and/or wireless networks conforming to a variety of protocols, including the Internet. As such, a computer readable medium may be created using a data signal encoded with such programs. Computer readable media encoded with the program code may be packaged with a compatible device or provided separately from other devices (e.g., via Internet download). Any such computer readable medium may reside on or within a single computer product (e.g. a hard drive, a CD, or an entire computer system), and may be present on or within different computer products within a system or network. A computer system may include a monitor, printer, or other suitable display for providing any of the results mentioned herein to a user.
Any of the methods described herein may be totally or partially performed with a computer system including one or more processors, which can be configured to perform the steps. Thus, embodiments can be directed to computer systems configured to perform the steps of any of the methods described herein, potentially with different components performing a respective steps or a respective group of steps. Although presented as numbered steps, steps of methods herein can be performed at a same time or in a different order. Additionally, portions of these steps may be used with portions of other steps from other methods. Also, all or portions of a step may be optional. Additionally, and of the steps of any of the methods can be performed with modules, circuits, or other means for performing these steps.
The specific details of particular embodiments may be combined in any suitable manner without departing from the spirit and scope of embodiments of the invention. However, other embodiments of the invention may be directed to specific embodiments relating to each individual aspect, or specific combinations of these individual aspects.
The above description of exemplary embodiments of the invention has been presented for the purpose of illustration and description. It is not intended to be exhaustive or to limit the invention to the precise form described, and many modifications and variations are possible in light of the teaching above. The embodiments were chosen and described in order to best explain the principles of the invention and its practical applications to thereby enable others skilled in the art to best utilize the invention in various embodiments and with various modifications as are suited to the particular use contemplated.
A recitation of “a” “an” or “the” is intended to mean “one or more” unless specifically indicated to the contrary. The use of “or” is intended to mean an “inclusive or,” and not an “exclusive or” unless specifically indicated to the contrary.
All patents, patent applications, publications and description mentioned herein are incorporated by reference in their entirety for all purposes. None is admitted to be prior art.
A. Threshold Oblivious Pseudorandom Functions
We now define the notion of a UC-secure Threshold Oblivious Pseudorandom Function taken almost verbatim from Jarecki et al. [25]. We refer the reader to Jarecki et al. [25] for the security definition and here only list the algorithms that form part of the primitive.
Definition 8 A threshold oblivious pseudo-random function TOPRF is a tuple of four PPT algorithms (TOPRF.Setup, TOPRF.Encode, TOPRF.Eval, TOPRF.Combine) described below.
B. Robust Secret Sharing
We now give the definition of a Robust Secret Sharing scheme taken almost verbatim from Dupont et al. [26].
Definition 9 Let λ∈, q be a λ-bit prime, Fq be a finite field and n, t, m, r∈ with t<r≤n and m<r. An (n, t, r) robust secret sharing scheme (RSS) consists of two probabilistic algorithms RSS.Share: Fqât†′Fqn and RSS.Recon: Fqn→Fq with the following properties:
In other words, an (n, t, r)-RSS is able to reconstruct the shared secret even if the adversary tampered with up to (nâ{circumflex over ( )}′r) shares, while each set of t shares is distributed independently of the shared secret s and thus reveals nothing about it.
We say that a Robust Secret Sharing is linear if, similar to standard secret sharing, the reconstruction algorithm RSS.Recon only performs linear operations on its input shares.
Imported Lemma 1 The Shamir's secret sharing scheme is a
robust linear secret sharing scheme.
The above lemma is borrowed from instantiating Lemma 5 of Dupont et al. [26] using the work of McEliece and Sarwate [27], as described in Dupont et al. [26].
This application is a 371 application of International Application No. PCT/US2019/054666, filed Oct. 4, 2019, which claims priority to U.S. Provisional Patent Application No. 62/741,431, entitled “Leveraging Multiple Devices To Enhance Security Of Biometric Authentication” and filed on Oct. 4, 2018, which is hereby incorporated by reference in its entirety for all purposes.
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/US2019/054666 | 10/4/2019 | WO | 00 |
Publishing Document | Publishing Date | Country | Kind |
---|---|---|---|
WO2020/072882 | 4/9/2020 | WO | A |
Number | Name | Date | Kind |
---|---|---|---|
6697947 | Matyas, Jr. | Feb 2004 | B1 |
20090177894 | Orsini | Jul 2009 | A1 |
20100119061 | Kawale | May 2010 | A1 |
20140372769 | Kerschbaum et al. | Dec 2014 | A1 |
20140380040 | Albahdal et al. | Dec 2014 | A1 |
20150046989 | Oberheide et al. | Feb 2015 | A1 |
20150270977 | Martins | Sep 2015 | A1 |
20160337131 | De Andrada et al. | Nov 2016 | A1 |
20180129797 | Rush | May 2018 | A1 |
20200259638 | Carmignani | Aug 2020 | A1 |
Number | Date | Country |
---|---|---|
101528112 | Jun 2015 | KR |
Entry |
---|
Application No. PCT/US2019/054666 , International Search Report and Written Opinion, dated Jan. 22, 2020, 9 pages. |
Application No. EP19869144.6 , Extended European Search Report, dated Nov. 2, 2021, 10 pages. |
Mustafa et al., “Frictionless Authentication System: Security & Privacy Analysis and Potential Solutions”, Cornell University Library NY 14853, Feb. 2018, pp. 1-11. |
Number | Date | Country | |
---|---|---|---|
20210336792 A1 | Oct 2021 | US |
Number | Date | Country | |
---|---|---|---|
62741431 | Oct 2018 | US |