Authentication protocols generally rely on private information held by entities in order to establish identity. In traditional systems, the private data may consist of a username and password pair, personal identification numbers (PINs) or cryptographic keys. Multi-factor authentication protocols generally require two or more kinds of identifying information such as information the entity knows (e.g., a username and password), something the entity has (e.g., a smart card or token), and information representing what the entity is (e.g., a fingerprint).
Metadata comprises auxiliary information relating to the identity or state of an entity involved in authentication. Examples of metadata include biometric data, sensor output, global positioning data, passwords or PINs and similar auxiliary information that may be used to construct a characterization of an entity's identity or state. Biometric data comprises measurements of physical characteristics of a user (e.g., fingerprints, retina, iris, voice and vein patterns) that are adequately unique to be used as a proof of identity.
Systems that rely on sensor output, however, may be vulnerable to forged sensor output; and while biometric systems utilize a potent property for authentication, they can face challenges relating to the exposure and/or loss of sensitive biometric data. Since a sensor transforms a measured physical characteristic into a binary string, which is stored (enrolled) by the computer system and then compared to the binary strings subsequently generated by the sensor upon authentication requests, without further measures the system cannot distinguish between a string returned from the sensor and a string supplied by an adversary without the sensor. Thus for example an adversary may attempt to observe the output of a biometric sensor for a particular user and “clone” the user by supplying the surreptitiously-obtained biometric data to the system. An adversary may similarly attempt to clone a user by reading biometric data stored in the system. Further, since the features utilized in biometric systems tend by definition to be substantially immutable, the compromise of a user's biometric data cannot be remedied in the way that a lost user password can simply be changed.
Characteristics that are unique and intrinsic to individual hardware devices (e.g., wire resistance, initial memory state, CPU instruction timing) can also be extracted and used as part of authentication protocols. A leading example of this is the physical unclonable function (PUF). A PUF function ƒ(c) maps an input domain (or challenges) c to an output range (or response) r, where the mapping is defined based on characteristics unique to the device computing ƒ(•). A circuit or hardware description of ƒ(•) may be identical across all devices, yet the mapping from the domain to the range will be unique based on the specific hardware device executing the circuit computing ƒ(•).
U.S. Pat. No. 8,577,091 to Ivanov et al. and U.S. Pat. No. 8,566,579 to Armstrong et al. describe authentication systems wherein intrinsic hardware characteristics (e.g., PUF output) as well as human biometrics are required to successfully complete an authentication, but neither provide a method for inexorably linking the PUF with the authentication or a method for handling non-sensitive sensor output.
Frikken et al. (“Robust Authentication using Physically Unclonable Functions,” Information Security, volume 5735 of Lecture Notes in Computer Science, pages 262-277, Springer, 2009) teach a method for combining metadata (e.g., PIN) into the input of a PUF, but does not provide an extension to arbitrary metadata (e.g., biometrics) or non-sensitive metadata (e.g., temperature, pressure).
Rust (editor) in “D1.1 Report on use case and architecture requirements,” Holistic Approaches for Integrity of ICT-Systems (2013) mentions the idea of merging biometric features with a cell-based PUF, but does not elaborate on a means for achieving this.
U.S. Patent Application Publication No. 20110002461 to Erhart et al. describes a method for authenticating sensor output by employing a PUF, which extracts unique characteristics of the physical sensor hardware. The method does not directly link the output of the sensor to the authentication of the hardware, however, and also requires that sensitive biometric sensor output leave the device.
An intrinsic identity of a device is constructed by generating an enrollment token or public key, which is based on intrinsic characteristics unique to the device such as a physical unclonable function (PUF). An authentication system utilizes the device's enrollment token or public key to verify the device's authenticity, preferably through a zero knowledge proof. Sensitive metadata is preferably also incorporated into the enrollment token or public key, which may be accomplished through an algorithmic means such as a hash function combining the metadata with hardware-intrinsic (e.g., PUF) data. The authentication may be interactive or non-interactive.
Although the invention applies to the output of arbitrary sensors, an exemplary embodiment utilizing biometric sensors is described. The present invention is also described with reference to the example of an embodiment utilizing elliptic curve cryptography (including the associated terminology and conventions), but the inventive concept and teachings herein apply equally to various other cryptographic schemes such as ones employing different problems like discrete logarithm or factoring, and the invention is not limited by the various additional features described herein that may be employed with or by virtue of the invention. Before setting forth details of the invention, basic handling of PUF output, modeling assumptions, and primitives for PUF-based cryptographic schemes and threshold cryptography applicable to the example embodiment are described.
PUF output is noisy in that it varies slightly despite evaluating the same input. This is generally addressed with fuzzy extraction, a method developed to eliminate noise in biometric measurements. (See Juels et al., “A Fuzzy Commitment Scheme.” Proceedings of the 6th ACM conference on Computer and Communications Security, CCS '99, pages 28-36, ACM, 1999). Fuzzy extraction may in part be employed within a device having a PUF such as within an auxiliary control unit, such that the output is constant for a fixed input. Fuzzy extraction (or reverse fuzzy extraction) may for example employ a “secure sketch,” as described by Juels et al.
A secure sketch SS for input string O, where ECC is a binary (n, k, 2i+1) error correcting code of length n capable of correcting t errors and V←{0, 1}k is a k-bit value, may be defined as SS(O; V)=O⊕ECC(V). This definition can be used to build a Gen algorithm, which outputs a set V,P, where V is the value to be reconstructed and P is a helper string (which may be public) that is used to recover V.
Correspondingly, a Rep algorithm can be defined such that, on input O′ within a maximum Hamming distance t of O, the original value V may be recovered. Rep(O′, P), where D is the decoding scheme for the binary (n, k, 2t+1) error-correcting code ECC and O′ is an input such that dist(O, O′)≦t, can be defined as:
This definition can then be used to build a Rep algorithm that allows a PUF output. O′ that differs from the original output O by at most t to reproduce output V such that Rep(O′)=V using the helper string P=O⊕ECC(V):
Gen and Rep algorithms such as these may be used in PUF-based protocols to ensure that the same value V is recovered so long as the PUF outputs O, O′ differ by at, most t bits.
It is desirable that an adversary cannot predict a device's PUF response r for a challenge c with more than negligible probability (at least without physical access to the device), and that helper data does not reveal anything to an adversary about PUF responses. In assessing these security aspects, the following entities may be considered: a set of servers , where each server si∈ controls authentication of devices on its system; a set of devices di∈, each with an embedded PUF; and an adversary that wishes to masquerade as a legitimate device di∈ to obtain resources stored on some subset of the servers ′⊂. It may be assumed that all entities are bound to probabilistic polynomial-time (PPT), i.e., can only perform computation requiring polynomially many operations with respect to a global security parameter λ (which refers to the number of bits in the relevant parameter). Computation requiring exponentially many operations with respect to λ is not efficient for the agents, and will succeed with only negligible probability.
Games can be employed to assess a PPT adversary's advantage in (1) predicting a PUF's output, and (2) distinguishing helper data from a truly random string. It is sufficient to require that an adversary's advantage in a game is negligible with respect to the security parameter of the protocol, where a function ƒ(x): is negligible if for every positive polynomial p(•) and sufficiently large x, the absolute value of ƒ(x) is less than 1/p(x). Although we describe exemplary games for capturing properties (1) and (2), they may be substituted for other game formulations designed to capture the concepts.
The unpredictability of a PUF can be assessed through the following game between an adversary and a PUF device P:{0, 1}κ
P ⊂ P,
′P ⊂ P,
The game proceeds as follows:
In the PUF indistinguishability game, an adversary is asked to differentiate between the output r of the fuzzy extractor for a PUF P and a randomly chosen string s∈{0, 1}l of the same length l.
This game proceeds as follows:
Rührmair et al. (“Modeling Attacks on Physical Unclonable Functions,” Proceedings of the 17th ACM conference on Computer and communications security, CCS '10, pages 237-249, ACM, 2010) define three distinct classes of PUF devices:
One definition for an ideal physical unclonable function Pd: {0, 1}κ
In the example of an embodiment employing elliptic curve cryptography, Algorithms 3 and 4 below can be used to allow a. PUF-enabled device to locally store and retrieve a sensitive value without storing any sensitive information in non-volatile memory. Algorithm 3 illustrates the storing of a sensitive value Vi using a PUF, and Algorithm 4 illustrates the dynamic regeneration of Vi. The challenge ci and helper data helper, can be public, as neither reveals anything about the sensitive value Vi. While the present example uses encryption of Vi by exclusive-or, ⊕, Vi could also be used as a key to other encryption algorithms (e.g., AES) to enable storage and retrieval of arbitrarily sized values.
Whenever O and O′ are t-close, the error correcting code ECC can be passed to a decoding algorithm D which will recover the sensitive value Vi.
In order to construct an intrinsic identity of a device, a public representation of the device's identity (referred to here as an enrollment token or public key) must be generated. In this process of device enrollment, a cryptographic enrollment, token is collected from the device. An elliptic curve mathematical framework for enrollment and authentication may be used, but those skilled in the art will realize that other embodiments (e.g., discrete logarithm frameworks, in which regard U.S. Pat. No. 8,918,647 is incorporated here by reference) will provide the same functionality. Using Algorithm 5, a local device can perform an enrollment protocol using the PUF.
This allows each PUF circuit to generate a local public key pipub, which is useful for bootstrapping more complex key setup algorithms. When the key setup algorithm is performed internal to the device (rather than externally among a set of distinct devices), this bootstrap process may not be necessary.
A zero knowledge proof (ZKP) of knowledge is a method for proving that a given statement is true, while revealing nothing beyond this fact. The ZKP is an interaction between two parties: a prover that wishes to establish the validity of a statement, and a verifier V that must be convinced the statement is true. At, the conclusion of the protocol, the verifier should be convinced with overwhelming probability that a true statement is indeed true. Correspondingly, an adversary should only succeed in convincing the verifier of a false statement with negligible probability. The proof is zero knowledge in the sense that the verifier could not use the messages from a previous proof to convince a new party of the statement's validity, and the messages reveal only a single bit of information: whether or not the prover possesses the secret. There are two general classes of zero knowledge proofs: interactive ZKPs, where a series of messages are exchanged between the prover and verifier V, and non-interactive ZKPs, where the prover publishes a single message without interaction with V, yet V is convinced that possesses the secret.
The requirement for communication from the verifying server in an interactive zero knowledge proof is to obtain a nonce value specific to the current proof. This prevents an eavesdropping adversary from using previous proofs from a valid device to successfully complete an authentication protocol and masquerade as the device. A non-interactive zero knowledge proof removes this communication requirement, and allows the proof to be completed without interacting with the verifying end point.
Achieving a non-interactive construction requires the proving device to generate the nonce on behalf of the verifier in a manner that prevents the proving end device from manipulating the proof. One method for constructing a non-interactive zero knowledge proof is for the device to construct a nonce N as N←H(A∥τ), where A is the device's public key, H(•) is a cryptographic hash function, τ is a timestamp and x∥y denotes concatenation of x and y. The timestamp ensures that previous proofs constructed by the proving device cannot be replayed by an adversary in the future, while the hash function ensures that the proving device cannot manipulate the nonce in an adversarial manner. The reliance on timestamps is substantially less onerous than reliance on globally synchronized clocks. That is, the timestamp need not match the current timestamp on arrival at the prover exactly, which eliminates the potential of network delay to affect the proof. Rather, the verifying end point checks that the timestamp is reasonably current (e.g., second granularity) and monotonically increasing to prevent replay attacks. An exemplary non-interactive zero knowledge proof for a PUF-enabled device is described in Algorithm 6.
In general, a hash function is defined as H(•):{0, 1}*{0, 1}λ, where λ is a fixed constant. That is, a hash function H(•) (or written explicitly as Hash(•)) takes an input of arbitrary size, and maps to a finite output domain. For cryptographic settings, hash functions must satisfy additional properties. In the context of binding metadata in authentication protocols, the following are particularly relevant:
Let metadata binding refer to the process of incorporating auxiliary metadata into the authentication process. Metadata is arbitrary auxiliary information upon which the authentication protocol should depend. That is, without the correct metadata, the authentication should fail. Metadata may be characterized as either sensitive or non-sensitive, where sensitive metadata should not leave the device (e.g., password, PIN, biometric) and non-sensitive metadata may leave the device (e.g., sensor output on temperature, pressure).
Sensitive metadata is incorporated into the public identity token created during enrollment. For example, when no sensitive metadata is provided, device enrollment outputs a public identity that characterizes only the device. However, when sensitive metadata is provided during enrollment (e.g., biometrics, PIN, etc.), the public identity characterizes both the device and the sensitive metadata. One embodiment of the invention never requires the sensitive metadata to leave the device, as the zero knowledge proof protocol is completed without the verifier having access to the sensitive metadata.
Non-sensitive metadata is not incorporated into the enrollment process. Thus, the public identity output from enrollment does not depend on non-sensitive metadata (e.g., sensor output for temperature, pressure, etc.). Rather, non-sensitive metadata is incorporated into the zero knowledge proof protocol, such that the proof of device and/or user authenticity is only valid if the corresponding non-sensitive metadata is also provided to the verifier. This allows the device and/or user to have a single public identity, and yet a verifier given access to the non-sensitive metadata can verify both the authenticity of the device and/or user as well as the origin of the metadata.
Due to the avalanche property of hash functions (where a single bit difference in the input results in each bit of the output flipping with approximately 50% probability), the metadata must be exactly the same in order for the authentication to succeed. However, the exemplary embodiment of biometric authentication frequently results in noise, where scans differ slightly despite observing the same characteristic (e.g., fingerprint, iris, etc.). Thus, means such as a fuzzy extractor may preferably be employed to ensure that the biometric reliably returns a constant value. For example, a constant value i for the metadata may be chosen and linked to an associated public helper data value . A noisy biometric scan can then be used to compute ←ECC(i)⊕ where ECC is an error correcting code, and given access to a new biometric scan that is t-close to , the constant value i can be recovered by computing i←D(⊕) where D is the corresponding error decoding algorithm.
Incorporating metadata into the construction requires re-defining the PUF-Store functionality described in Algorithm 3. Algorithm 7 provides an example of how metadata i may be hashed into the PUF input x, which is used to protect the committed value Vi.
While in the present example the PUF input consists of a hash of a challenge value, a metadata value, the elliptic curve E, base point C, and moduli p and q, various other permutations of values (pertinent to the mathematical framework used) may be hashed to produce a PIF input incorporating metadata in other embodiments. Moreover, one or more values can be iteratively hashed and/or hashed values can be nested (e.g., H(H(ci∥i), E, G, p, q), etc.). Further, other methods for linking and/or combining the parameters (e.g., an all-or-nothing transformation) may be employed.
Similarly, the PUF-Retrieve functionality described in Algorithm 4 must be modified to require the sensitive metadata i in order to recover the committed value Vi. Algorithm 8 describes how metadata i in combination with the PUF is used to recover the committed value Vi.
Returning to the exemplary embodiment of biometric authentication (e.g., fingerprint scanner),
Algorithm 9 provides an example of how a fingerprint, scan may be bound to the authentication protocol such that both the device and fingerprint must match those originally enrolled. Non-sensitive metadata (e.g., sensor output for temperature, pressure, etc.) ipub may be incorporated into the non-interactive authentication algorithm by incorporating it into the construction of the nonce N, and providing ipub to the verifier. Thus, the verifier is only able to construct the nonce N (and, consequently, the variable c′) if ipub matches the output from the sensor.
i
FP ← ECC( ⊕
First, a user's fingerprint scan F
A non-interactive zero knowledge proof may also be constructed by requiring the server to issue a nonce N to the device. This exemplary construction is illustrated in Algorithm 10.
i
FP ← ECC( ⊕
The addition of (sensitive and/or non-sensitive) metadata is optional in embodiments of the invention. That is, non-sensitive metadata may be included while sensitive metadata is excluded. This requires only that the public identity token did not incorporate the sensitive metadata. Similarly, sensitive metadata may be included while non-sensitive metadata is excluded. This requires only that the nonce is not constructed using non-sensitive metadata.
As one embodiment of our invention relies on an elliptic curve mathematical framework, one skilled in the art will realize that it may be extended to support cryptographically-enforced role based access control (RBAC). That is, data access policies and device credentials may be specified mathematically, and the RBAC algorithm computes a function ƒ(,){0, 1} mapping policies and credentials to an access decision in {0, 1}. This is typically accomplished by constructing a bilinear pairing (e.g., Weil or Tate pairing), and is a natural extension of our invention.
While the foregoing embodiments have been described with various features, one of ordinary skill in the art will recognize that, the authentication protocol need not be limited to zero knowledge, and could be based on other cryptographic constructions for establishing identity. For example, the device could use its hardware identity to digitally sign the contents of a packet, and include this signature in the packet header (e.g., TCP Options Header, where an example header would include {B=r·G mod p, m=r+Hash(G, B, AB, N)·rand mod q,τ}) and the hardware identity may be applied to a variety of other cryptographic authentication techniques, and need not be limited by the zero knowledge aspect of the example provided.
This application claims the benefit of the priority of provisional U.S. Patent Application Ser. No. 62/017,045 filed Jun. 25, 2014 and Ser. No. 61/988,848 filed May 5, 2014, both of which applications are incorporated by reference here.
Number | Date | Country | |
---|---|---|---|
61988848 | May 2014 | US | |
62017045 | Jun 2014 | US |