Embodiments of the present disclosure generally relate to the field of electronic verification, and more specifically, embodiments relate to devices, systems and methods for electronic verification of credentials.
The verification of characteristics of an entity is a useful tool in the context of decision making, for example, in relation to access provisioning, goods and service provisioning, among others.
However, an individual when providing credentials for verification may wish to restrict the amount of information being provided to the counterparty. The credentials being provided, to increase trust, may also benefit from verification through association with a third-party verifier (e.g., indicating that the individual is who they purport to be).
Credential assertion with restricted information has been difficult to implement in practice as it is technically challenging to generate and provide sufficient trusted credentials, especially on a high-volume, scalable system adapted to serve a large number of users on an on-demand basis.
A computer implemented system for electronic verification of credentials including at least one processor and data storage is described in various embodiments. The approach describes the architecture and use of a “max gate”, which can be used as an structural component of the neural network. As described, the max gate can be used as an advantageous activation function for improving certain performance of a neural network based approach for zero knowledge proofs. In a practical example, the max gate can be used both in (i) an initial enrollment process flow to train a neural network, obtain weights, and then discard the neural network after training, and (ii) a verification process flow where a verifier device takes a contemporaneous data set of a person who is trying to prove that they are who they purport to be, and the weights of the verification process flow (or commitments thereof) are computationally utilized to generate a corresponding token that can be validated by the verifier device. Upon a successful verification, the verifier device can provide access to the protected resource. As mentioned herein, the use of the max gate can help with practical aspects around computational speed and avoiding certain technical limitations.
Embodiments described herein are directed to computer systems and devices directed to provide a cryptographic platform for generating and transmitting messages that are adapted to assert attributes about various objects (e.g., user profiles) without indicating any more than is actually required, and corresponding methods and computer readable media storing machine-interpretable instruction sets for performing the methods.
The computer systems and devices, in accordance with some embodiments, are adapted to a high-volume, scalable system, which dynamically responds to data credential requests of one or more users or one or more computer systems requesting identity/credential proofs.
In some embodiments, the assertions are conducted using mobile endpoints (e.g., user devices) which may have limited computational performance and resources, and accordingly, an improved cryptographic approach and system is proposed that enables the assertion functionality through the passing of cryptographically generated messages between devices. An improvement associated with the proposed cryptographic approach of some embodiments is that it is able to operate in a secure and scalable way, even on limited computational resources (e.g., those available on an unenhanced smartphone).
Prior approaches required large numbers of large messages being sent, which made the approaches impractical where resources were limited. The approach proposed herein requires less messages and streamlines the amount of cryptographic computations required to make these assertions. For example, Belenkiy describes an approach which requires a large number of computational steps, which can have deleterious impacts on performance.
Credential verification, when conducted manually, is a tedious process prone to falsification and also over-provisioning of information. In an example, Alice is a law-abiding 26 year old, and she would like an alcoholic beverage. Before selling beer to Alice, Bob wants to make sure of two things: She is legally allowed to drink, meaning 21 years of age or more, and that she is not a problem customer.
Alice thinks the conditions are fair, and they both know presenting her ID card would prove that she does satisfy them. She could provide her driver's license, which shows her name and date of birth. She would like to not disclose anything to him other than the fact that she satisfies the conditions. However, by providing her driver's license, Bob ends up knowing more than he needs to know (e.g., age and specific date of birth as opposed to the fact that she is above 21 years of age and is not the problem customer). Further, aside from visual inspect of the license, Bob has practical difficulties in verifying that the driver's license is not a fake driver's license.
Accordingly, a challenge involves providing a robust credential verification whereby Alice is able to prove to Bob that she does satisfy Bob's customer policy, while revealing nothing other than the fact to him. As an example, consider a policy of being older than 21. That is all Bob needs to know. He does not and should not know that Alice is in fact 26.
The system is configured to adduce stripped down credentials to meet Bob's customer policy without exposing additional information. In particular, cryptographic techniques are utilized that undertake specific steps and computational approaches to provide a secure, yet computationally efficient mechanism for proof generation.
Accordingly, an issuer device issues one or more signed token data objects, which are stored on a client's device for later usage. Upon encountering a situation where verification is required, the client's device is configured to dynamically generate proof data messages which are then provided to the verifier's computing device (e.g., the verifier's smart phone, a point of sale device, an access control system, a mantrap gate). The verifier is able to conduct a verification check using the proof data message to see only that the conditions required in the original verification check message are met, without being provided the actual underlying characteristics. As the proof data messages are generated using the token data objects, the verifier is able to validate that such proof data messages are associated with a trusted verifier.
There are two different types of proofs that are proposed in some embodiments, these being exact match proofs (non-zeroness protocol; e.g., this person either matches someone on a whitelist or doesn't match anyone on a blacklist), and conditional proofs (e.g., based on an inequality condition being matched, such as over 18 years old?).
As described in various embodiments herein, improved cryptographic protocols are proposed that, relative to prior approaches, reduce an overall cryptographic complexity without a significant reduction in security. Accordingly, the proofs can be generated more quickly, which improves convenience, especially where a system is being established for mass adoption and client device characteristics are highly variable across the users (e.g., some users may be using devices with extremely limited capabilities).
An enhanced solution is described herein that is adapted for protecting a client's personal information and only providing what is needed by leveraging a client's special space using a secure enclave and a blockchain solution, in accordance with some embodiments.
A blockchain infrastructure and the secure enclave each store data sets representing aspects of signed attributes and, in some embodiments, a proof response logic. The block chain infrastructure can include distributed logic technologies and combination with cascading encryption to provide an immutable ledger. In some embodiments, the proof requests and responses can be conducted using intelligent connected devices such as a mobile device, or wearable devices (e.g., a smartwatch that is connected to a mobile device across Bluetooth low energy).
In an example embodiment, there are multiple authoritative issuers who are able to provide signed attributes (e.g., for storage in secure enclaves or on a distributed ledger blockchain data structure). Secure enclaves can be utilized, or other types of hardware protected spaces are usable.
A registration mechanism and method is utilized to initialize and populate the attributes using public and secret (private) encryption keys. Issuer devices create attribute data records that are generated using a combination of a client's public key and an issuer's secret key (e.g., using digital signatures or encryption/decryption). The attributes can be made publicly available, for example, on a blockchain, whereby the attributes can be signed by an issuer's secret key but encrypted using the client's public key.
A verification mechanism and method is provided whereby a communications channel can be established with an authenticated verifier device, which initiates a proof request, which triggers a process to establish a proof response that is transmitted to the verifier.
An example use case includes a specially configured age verifier terminal which, for example, can include a graphical user interface rendering visual and coded objects such as a quick response code that can be scanned by a mobile device. Upon scanning the quick response code, the verification mechanism is invoked, and the mobile device may share data sets on a backend communications network such as the Internet. The proof response can be transferred to the verifier device based off of identifiers or information stored other on the age verifier terminal, or encoded within the quick response code. The age verifier terminal returns true or false such that both a verifier such as a cashier, and the customer are able to visually confirm. The proof response rendering, for example, may be restricted to a true/false determination (e.g., additional private information is not disclosed or rendered).
Corresponding computer implemented methods and computer readable media are contemplated.
In accordance with a first aspect, a computer implemented system is provided for communicating data messages between a verifier computing device and a portable client computing device, the data messages establishing authentication of one or more characteristics of a client associated with the portable client computing device.
The system includes the portable client computing device including at least a client computing device processor and data storage, the data storage storing one or more token data objects received from or computed jointly in a multiparty protocol with an issuer computing device, the one or more token data objects generated using at least a issuer computing device private issuance key, the one or more token data objects each including one or more signed data elements representing at least one of the one or more characteristics of the client associated with the portable client computing device.
The client computing device processor is configured to: receive a verification request data message from the verifier computing device, the verification request data message including at least a nonce c0; and compute t=x−1 mod p, where x is an attribute value from the one or more token data objects, and p is an order of the discrete log group according to a cryptographic protocol being used; t is a modular inverse of x mod p; uniformly sample a first random number r1 and a second random number, r2, such that
compute
where R is a commitment (e.g., a representation of a value that is both hiding and binding, hiding in the sense that the recipient of the commitment cannot find out anything about what the value of the commitment is, and binding in the sense that the sender later cannot pretend that it was a commitment to another value than it originally was) to random values r1 and r2, Cx is a commitment to attribute x, h is a group generator taken from cryptographic specifications; compute
where c is a proof challenge, based at least on the Fiat-Shamir Heuristic; compute
where z1 and z2 are proof responses in a sigma protocol; and encapsulate and transmit one or more proof data messages including R, z1 and z2 as data objects to the verifier computing device, such that the verifier computing device is able to compute
and contirm that
the verifier computing device controlling provisioning of access to a secured resource responsive to the confirmation that
In another aspect, the client computing device processor is a secure enclave processor or a trusted execution environment that is segregated from a general execution environment of the client computing device.
In another aspect, the data storage is a secure enclave memory region that is segregated from a general memory region of the client computing device, the secure enclave memory region not accessible by general execution environment of the client computing device.
In another aspect, the one or more token data objects are pre-loaded into the data storage such that the generation of the proof can be conducted at a time temporally separated from when the one or more token data objects were generated or preloaded.
In another aspect, the one or more proof data messages are generated such that the one or more proof data messages can be validated using an issuer computing device public encryption key corresponding to the issuer computing device private issuance key.
In another aspect,
indicates that a condition established in the verification request data message is met without having to present the underlying one or more characteristics of the client associated with the portable client computing device.
In another aspect, the verification request data message includes a blacklist data structure and the one or more proof data messages are generated to establish that the client associated with the portable client computing device does not match one or more identities stored in the blacklist data structure, or wherein the verification request data message includes a whitelist data structure and the one or more proof data messages are generated to establish that the client associated with the portable client computing device matches one or more identities stored in the blacklist data structure.
In another aspect, the provisioning of access to the secured resource includes transmitting an electronic signal to a physical lock actuator, or to a digital challenge-response mechanism.
In another aspect, there is provided a computer implemented system for communicating data messages between a verifier computing device and a portable client computing device, the data messages establishing authentication of one or more characteristics of a client associated with the portable client computing device, the system comprising: the portable client computing device including at least a client computing device processor and data storage, the data storage storing one or more token data objects received from or computed jointly in a multiparty protocol with an issuer computing device, the one or more token data objects generated using at least a issuer computing device private issuance key, the one or more token data objects each including one or more signed data elements representing at least one of the one or more characteristics of the client associated with the portable client computing device. In this aspect, the client computing device processor is configured to: receive a verification request data message from the verifier computing device including a request to prove a≤b; generate a proof data message wherein G is defined as a discrete log group of prime order p and g and h be generators with unknown discrete logs, numbers q and l are defined such that
and two whole numbers a and b such that l≤a≤b<q; and G is a discrete log group of prime order p and g and h be generators with unknown discrete logs, considering consider commitments
and B=gbhm
producing bit commitments
where ai, bi and ci are the i'th bits of a−l, b−l and c, respectively, and mai, mbi and mci are sampled randomly; computing
and likewise
; for each
and repeated through each of A, B, and C: randomly sampling rai, d′ai, and z′ai; computing
computing
computing zai=(dai−d′ai)mai+rai; assigning
encapsulating proof data message including at least Ai, Rai,0, Rai,1, d″ai,0, zai,0, zai,1, Bi, Rbi,0, Rbi,1, d″bi,0, zbi,0, zbi,1, Ci, Rci,0, Rci,1, d″ci,0, zci,0, zci,1; wherein the proof data message can be verified by the verifier computing device checking for A, B, and C, that
and for each
In another aspect, the proof data message is encapsulated such that the proof data message is free of A0, B0 and C0 to reduce a filesize of the proof data message, and wherein the verifier computing device is configured to derive A0, B0 and C0 independently.
In the figures, embodiments are illustrated by way of example. It is to be expressly understood that the description and figures are only for the purpose of illustration and as an aid to understanding.
Embodiments will now be described, by way of example only, with reference to the attached figures, wherein in the figures:
Embodiments described herein are directed to computer systems and devices directed to provide a cryptographic platform for generating and transmitting messages that are adapted to assert attributes about various objects (e.g., user profiles) without indicating any more than is actually required, and corresponding methods and computer readable media storing machine-interpretable instruction sets for performing the methods.
There are computing devices that interoperate with one another in concert with the cryptographic platform, including devices associated with issuers, verifiers, and clients. The issuers are trusted entities which provide cryptographically validated credential messages that are issued to the client devices for storage thereon.
The cryptographically validated credential messages are then presentable to a verifier (e.g., a third party organization) that seeks to validate that identity or aspects of the identity of the user associated with the client device. The cryptographically validated credential messages are configured such that the user is able to validate such identity or aspects without providing additional information associated with the user that is not requested (e.g., as opposed to presenting all the information on a driver's license).
The credential assertion platform is a high-volume, scalable system which dynamically responds to data credential requests of one or more users or one or more computer systems requesting identity/credential proofs.
In some embodiments, the assertions are conducted using mobile endpoints (e.g., user devices) which may have limited computational performance and resources, and accordingly, an improved cryptographic approach and system is proposed that enables the assertion functionality through the passing of cryptographically generated messages between devices.
An improvement associated with the proposed cryptographic approach of some embodiments is that it is able to operate in a secure and scalable way, even on limited computational resources (e.g., those available on an unenhanced smartphone).
For example, a device with limited computational resources can include basic smartphones, which may be one or more generations out of date, and also have limited amounts of on-board memory (e.g., 1-4 GB of memory) and storage (e.g., 8-64 GB of solid state memory). The transfer protocols as between the client devices and the verifier devices may also have limited bandwidth (e.g., through near-field communications (NFC), Bluetooth, limiting communications to only several Mbit/s).
Prior approaches required large numbers of large messages being sent, which made the approaches impractical where resources were limited. The approach proposed herein requires less messages and streamlines the amount of cryptographic computations required to make these assertions.
As described herein, an improved cryptographic mechanism and protocol is proposed that reduces an overall number of data messages and/or cryptographic steps required to be taken to generate the proof data messages. For example, the method of Belenkiy requires 4 randomizations, 3 group multiplications and 7 group exponentiations, which includes elliptic curve exponentiations that are computationally expensive (e.g., involves more than 256 operations on 512 long integers). In a proposed non-zeroness approach of some embodiments, a field inversion is provided, which itself is an expensive operation, but reduces a considerable number of group exponentiations.
The proof data messages are designed to have a “soundness” attribute whereby a malicious verifier is unable to find out from the proof data message more information that what is being provided in the proof data message (e.g., can't find out the underlying characteristic values).
A computer implemented identity brokerage solution is described in accordance with various embodiments. The identity brokerage solution is adapted to address problems with identity and attribute verification, using computer implemented cryptographic approaches to provide a robust mechanism for conducting verifications while reducing the provisioning of extraneous information (e.g., information not required for the verification).
Credential verification, when conducted manually, is a tedious process prone to falsification and also over-provisioning of information.
Bob is not looking for a headache so before selling beer to Alice, wants to make sure of two things: She is legally allowed to drink, meaning 21 years of age or more, and that she is not Mallory McFelon, a problem customer. Alice thinks the conditions are fair, and they both know presenting her ID card would prove that she does satisfy them. She could provide her driver's license, which shows her name and date of birth.
Alice also knows that Bob tends to be nosy, so she would like to not disclose anything to him other than the fact that she satisfies the conditions. However, by providing her driver's license, Bob ends up knowing more than he needs to know (e.g., age and specific date of birth as opposed to the fact that she is above 21 years of age and is not Mallory). Further, aside from visual inspect of the license, Bob has practical difficulties in verifying that the driver's license is not a fake driver's license.
Accordingly, a challenge involves providing a robust credential verification whereby Alice is able to prove to Bob that she does satisfy Bob's customer policy, while revealing nothing other than the fact to him. As an example, consider a policy of being older than 21. That is all Bob needs to know. He does not and should not know that Alice is in fact 26.
In accordance with various embodiments, the prover should be able to hide as many attributes as the prover seeks to prove that follows from their attributes having zero knowledge of the underlying attributes: “I've lived in the same city over the last 5 years.”
The prover's client 202 holds credentials that are digitally signed by the issuer (“tokens”) 208. An example token are those provided by U-Prove specifications. A U-Prove token can include a credential similar to a PKI certificate with cryptographic wrapping of attributes to aid in reducing unwanted tracking of users.
For example, a token may have various artifacts wrapped therein and may include information, such as issuer parameters, including issuer public key information (e.g., coupled an issuer's private key) that can be used for signing or encrypting elements of information stored thereon to prove the veracity of such signature or to protect sensitive information. The issuer signature can be used by the prover or verifier to verify issuer parameters being relied upon, and the token itself, in some embodiments, may have one or more data fields storing information such as token usage restrictions, validity period, token metadata.
In some embodiments, the token is jointly created using a combination of issuer information and prover information. For example, there may be information stored thereon that is established in conjunction and hidden from the issuer, such as contact information, encryption key, or verifier supplied nonces, etc.
During issuance of a token, an issuer may authenticate the existence and access/control that the prover has over the prover's device.
Tokens include attributes that can be converted from a natural form to a sequence of large numbers (field elements) suitable for public key operations. These public key operations include anonymous credentials protocols.
Attributes are organized in a tree. An attribute can either come with a value, in which case it is called a leaf attribute, or bundle a number of sub-attribute, in which case it is called a parent attribute.
For example, consider a geographic location attribute. That would be most naturally divided up into a latitude sub-attribute and a longitude sub-attribute. Thus, a credential token can be considered consisting of a single root attribute containing all others as descendants.
Regardless of whether an attribute is disclosed, committed to, or hidden, the prover may wish to communicate metadata about it to the verifier. The most important such property is an attribute's name. The number “170” in an attribute would mean nothing without the name “height” attached. Additionally, such numeric attributes require units as context. The number “170” is absurd if considered in inches but plausible when in centimeters.
It is important to disclose this metadata even when attributes are being committed to. Consider the non-trivial example of heights and units. Consider an attraction park that refuses to admit people taller than 180 cm on a rollercoaster. Without the proper context provided, a 188 cm tall person can abuse an attribute a height attribute of 74 inches and successfully prove 74<180, thereby put him and others in danger.
In some embodiments, the token can include fields that additionally give the users an ability to decide if they want to hide an attribute's metadata. For example, even if hidden, an attribute attesting to a negative syphilis test can carry social stigma.
An attribute will be serialized into one “raw attribute” (a number or string) if the user chooses its metadata to depend on its parent's. If not, it will be serialized into two, the first representing its metadata and the second representing the value.
Every attribute's metadata contain an array called “subAttributes”. If the array is empty, the attribute is considered to be a leaf attribute. Each sub attribute has a corresponding entry in the array. If the sub attribute is encoded independently, the entry will be an integer, denoting how many raw attributes the sub attribute and all of its descendants (subtree) together will take. If it is encoded dependently, the subAttributes entry will be all of its metadata.
In this example, it is describing a token for an individual residing in 35.796682 N, 51.416549 E, and 188 cm tall. In radians, the coordinates are 0.624769962188 N and 0.897388070061 E.
The example token will serialize into the following:
A proof request is issued from the verifier 206 to the prover's client 202, asking the prover to give the verifier 206 cryptographic assurance that according to some issuer trusted by the verifier, the prover's attributes satisfy a certain (arbitrary) policy (e.g. older than 21, as far as provisioning alcohol is concerned.), and these proof requests typically contain one or more challenge messages. A proof request can include a nonce, types of conditions, etc., and these conditions may be encapsulated as inequalities (e.g., intUserAge>18), or logical statements (e.g., intUserID not equal to 22412). One or more lookup reference data structures may also be passed, which can include blacklists, whitelists, values for various constants (e.g., MINIMUMDRINKINGAGE).
A proof is provided by the prover through client 202 as a response to the verifier 206's request, which includes cryptographic assurance that the prover's credentials satisfy the verifier 106's proof request, the cryptographic assurance being held being as good as the issuer 108's word. The proof is a data message that encapsulates various information (e.g., proof responses directed to a sigma protocol). The data message includes sufficient information such that the verifier is able to receive the data message and conduct steps to validate and verify that such proof responses are indeed acceptable. In processing proof responses, the proof data message can include aspects indicative of the identity of an issuer, and a potential step is the validation by the verifier that such issuer is indeed trustworthy as a source of credential authentication.
The proof responses can be processed to generate gatekeeping control signals, which, for example, in an example embodiment, may be as simple as a device that operates a lightbulb whenever someone is validated as being of age (e.g., so that a bouncer at a bar is able to know that this person should be allowed in), or as complex as control mechanisms that automatically open doors, enable access to online accounts (e.g., on a web portal), etc. Accordingly, the verifier systems can include physical and electronic mechanisms which can generate alerts, notifications, actuation/control signals, digital or electronic signals, among others.
Factors for assessing identity brokerage solutions include how light the required infrastructure is (e.g., it may be important to reduce the need for specialized hardware, centralized devices, or complex distributed systems that make deployment and entry difficult), a level of computational efficiency, a simplicity of cryptography, a level of un-linkability between various parties (e.g., the issuer should not be able to aggregate additional data about the client, even in collusion with verifiers), and a flexibility and level of minimalism of disclosed information.
Any solution requiring the issuer to be online at verification time risks exposing additional information about the client to the issuer. This is especially concerning in cases where the issuer and the verifier collude to track client activities.
Reduced complexity is desirable as a solution may be less likely to suffer implementation flaws, be more easily understood, and less likely to theoretically break due to reliance on unusual hardness assumptions. If computational operations that have optimized/low-level implementations, the solution may be able to operate using less computing resources and/or time.
The identity protocols, ideally, should require little time, take little power, have few rounds of message transmission, and pass messages having small sizes and/or overhead. This is especially important where the parties implement portions of the identity brokerage solution on mobile devices to handle one or more verification events. The mobile devices have limited computational, storage, and interface capabilities.
The parties hold corresponding public/secret (e.g., private) key pairs. The public keys can be used to determine the veracity of information signed using the private keys, and to encrypt information that can be decrypted using the corresponding private key.
The private keys can be used to sign information and to decrypt information that has been encrypted using the corresponding public key, and in some cases, produce Zero-Knowledge Proofs of Knowledge. Each secret key is maintained by the corresponding computing device associated with the corresponding entity.
The parties each have corresponding computing systems, which are used to electronically communicate amongst one another (e.g., through a network) and to perform various cryptographic activities, including signing, verifying signatures, encrypting information, decrypting information and various anonymous credential issuance, proof and verification protocol implementations. Each verification event is associated with validating whether all logical conditions of the proof request are satisfied. A positive determination may lead to access/service/goods being provisioned to the prover. A negative determination may lead to access/service/goods not being provisioned to the prover.
A specific technological implementation of providing identity assertions with minimal disclosure is described in various embodiments. Three separate approaches are described, along with variations thereof. These approaches include (1) an O-Auth token based design, (2) a secure enclave based design, and (3) an anonymous credentials based design.
In some embodiments, a proposed approach is provided in an anonymous credentials based design whereby a client receives token data structure(s) that are stored on data storage, and asynchronously, the client gets a verifier request from a verifier. The verifier may, for example, have a list of trusted issuers that the verifier trusts. Certain organizations may be trusted for certain information, such as a bank for employment or financial status, a university for educational attainment characteristics, among others. The client generates a proof (e.g., encapsulated as a proof data message) based on the token and the verifier request, and the proof can be established as either a non-zeroness proof or a conditional proof. Token objects can be received from or computed jointly in a multiparty protocol with an issuer computing device.
For a non-zeroness proof, the proof approach generation can include a first modular inverse, two randomization steps, two group exponentiations, and a group multiplication. In particular, the steps in an example non-limiting embodiment can be established as:
(1) Receive a verification request data message from the verifier computing device, the verification request data message including at least a nonce c0.
(2) Compute t=x−1 mod p, where x is the attribute value from the token, and p is the order (e.g., size, number of elements) of the discrete log group (e.g., elliptic curve, Diffie-Hellman group) according to the cryptographic standards the parties choose to use (e.g., elliptic curve, Diffie-Hellman group); t is the modular inverse of x mod p.
(3) Sample a first random number r1 and a second random number, r2, such that
(4) Compute
where R is effectively a commitment to random values r1 and r2, Cx is a commitment to attribute x,h is a group generator taken from cryptographic specifications (e.g., elliptic curve, Diffie-Hellman group). A commitment is a representation of a value that is both hiding and binding, hiding in the sense that the recipient of the commitment cannot find out anything about what the value of the commitment is, and binding in the sense that the sender later cannot pretend that it was a commitment to another value than it originally was.
(5) Compute
where c is the proof challenge, following the Fiat-Shamir Heuristic.
(6) Compute
where z1 and z2 are proof responses in a sigma protocol.
(7) Encapsulate and transmit one or more proof data messages including R, z1 and z2 as data objects to the verifier computing device, such that the verifier computing device is able to compute
amd confirm that
the verifier computing device controlling provisioning of access to a secured resource responsive to the confirmation that
The verifier independently validates the received proof and the verifier makes a determination of access grant or not grant.
In some embodiments, the verifier is a verifier computing system that automatically grants access to one or more secured resources, such as a physical access entry (e.g., mantrap, revolving doors, locked gateway, locked cabinet), and in other embodiments, the system grants access to one or more virtual resources (e.g., administrator access on a computer system, logging into accounts, access to secured sections of webpages), among others.
In another example, a comparison protocol may be established (e.g., to prove some condition whereby a<=b). This can be utilized to establish proof messages whereby it is necessary to indicate that a person is of a particular age, that a person has a particular minimum creditworthiness, a person has a minimum educational attainment, among others.
Consider G to be a discrete log group of prime order p and g and h be generators with unknown discrete logs.
Let numbers q and l be such that
and two whole numbers a and b such that
Consider commitments
to a and b, respectively.
To prove that a≤b, the following steps can be taken:
(1) Prover computes C=BA−1=gb−ahm
(2) Prover produces bit commitments
for
where ai, bi and ci are the i'th bits of a−l, b−l and c, respectively. mai, mbi and mci are sampled randomly.
(3) Prover computes
and likewise
(4) For each
the prover does the following:
(4.1) Randomly sample rai, d′ai and z′ai.
(4.2) Compute
(4.3) Compute
(4.4) Compute
(4.5) Assign
(4.6) Repeat steps 4.1 through 4.5 for B and C.
(5) Prover sends all Ai, Rai,0, Rai,1, d″ai,0, zai,0, zai,1, Bi, Rbi,0, Rbi,1, d″bi,0, zbi,0, zbi,1, Ci, Rci,0, Rci,1, d″ci,0, zci,0, zci,1.
(6) Verifier checks that
(7) For each
the verifier checks that:
(7.3) Check the same conditions for B and C
Note: It may be that either a or b are known to the verifier. In such a case there is no need to decompose the known number and commitment C will have the same mask exponent as that of the unknown parameter.
In some embodiments, that the client computing device (e.g., the prover) does not send A0, B0 and C0 to reduce the size of its messages. In that case, in step 6, instead of verifying a relation between the bit commitments, the verifier derives A0, B0 and C0 independently. This aspect may be particularly useful in low data throughput situations or where storage space is very limited.
The comparison method of some embodiments reduces the problem of comparison to three bit decompositions. As such, the computational burden on the prover consists of about 12N-3 group exponentiations.
In contrast, the method of Belenkiy involves two bit decompositions and N-1 equality maps each consisting of four 2-variable equations and a total of six distinct variables.
As such, it is estimated that each equality map requires at least 8 group exponentiations.
Using the efficient Bit Decomposition implementations of some proposed embodiments, the two decompositions will require a total of 8N-2 group exponentiations. Accordingly, it is estimated that Belenkiy's method requires 16N-10 group exponentiations. This demonstrates that for N≥2, the proposed method for the comparison protocol is more efficient, and this superiority becomes increasing important as the numbers to be compared scale up.
In particular, the scale up may occur if the credential verification system is instituted across a large number of users.
A computer implemented system 300 for electronic verification of credentials is illustrated. The system includes at least one processor and data storage, and includes a proof request parsing engine 302 configured to receive one or more proof request data structures 304, which in combination represent one or more logical conditions.
A credential parsing engine 306 is provided to receive one or more credentials 308 which in combination, validate one or more characteristics of an identity profile 310 of a prover entity.
A proof generation engine 312 is provided that receives, from a verifier computing system 314, the one or more proof request data structures 304 and the one or more credentials 308; and for each logical condition provided in the one or more proof request data structures, parse the one or more characteristics of the identity profile 310 to determine whether the logical condition has been satisfied.
One or more proof output data structures 316 storing signatures or zero knowledge proofs of satisfaction of a subset or all of the one or more logical conditions is returned by the system (e.g., in the form of data fields). A secure encryption engine 318 and a secure processing enclave 320 may be included, in accordance with various embodiments.
A proof generation engine 312, in some embodiments, resides at or is coupled to a data center of a financial institution, and wherein parsing the one or more characteristics of the identity profile includes invoking an electronic comparison against a stored user profile of the financial institution corresponding to the prover entity. The example implementations are not restricted to such a topology, and other topologies are contemplated, including a cloud/distributed resources based proof generation engine 312.
In other embodiments, the proof generation engine 312 is coupled to the secure processing enclave 320, which may also be coupled to a verifier computing device 314.
In another embodiment, the proof generation engine 312 lies within the prover's user device, thus user data will never be provided to the verifier and the issuer will not be informed of the transaction taking place.
In another aspect, the electronic comparison against the stored user profile of the financial institution corresponding to the prover entity includes querying one or more attributes of the stored user profile and comparing the queried one or more attributes against the one or more logical conditions to determine whether individual logical conditions of the one or more logical conditions have been satisfied. The characteristics and attributes of the user profile can be used established and stored thereon the portable client computing device as one or more token data objects that can be received from or computed jointly in a multiparty protocol with an issuer computing device.
The one or more token data objects are generated (e.g., as signed objects or encrypted objects) using at least an issuer computing device private issuance key. The one or more token data objects each including one or more signed data elements representing at least one of the one or more characteristics of the client associated with the portable client computing device.
In another aspect, the verifier computing system is configured to encapsulate the one or more credentials along with the one or more proof request data structures in a single data container transmitted to the proof generation engine.
However, the issuer computing system needs to be available (e.g., online) to be able to process the request. In response to a proof request, the prover confers an OAuth token (not to be confused with credentials) that the verifier can use to query the issuer and be assured that the prover does indeed satisfy their policy.
The verifier is provided tokens containing the verifier's proof request which can be used to query a computing system associated with an issuer, receiving an answer such as a yes/no response or a Boolean variable (i.e., TRUE/FALSE, 0, 1).
A challenging technical problem occurs in implementing a system where the verifier is able to ensure the prover has the correct credentials, while preserving their privacy. In some embodiments, a secure enclave based approach is described. In order to implement a secure enclave, Intel Software Guard Extensions™ (SGX) can be utilized, among others.
There are different mechanisms for public key cryptography. An approach, for example, supported by the Intel SGX SDK natively supports ECDH for key encapsulation and ECDSA for digital signatures over the PRIME256V1 (also known as SECP256R1) curve. Other approaches are possible, such as Schnorr's, which would serve just as well for a digital signature scheme. 256-bit base fields may potentially provide sufficient security.
For symmetric cryptography, Intel SGX SDK™ natively supports 128-bit AESGCM. This primitive can be used for authenticated encryption. It remains to be seen if larger AES key sizes are necessary. In that case, Galois-Counter Mode cannot be used.
Hashing can be performed using SHA-2, as this is natively supported by the Intel SGX™ SDK. As it supports 256-bit blocks, it would also be useful in case of a migration to larger AES blocks; both as a means of key derivation as well of a MAC building block.
The secure enclave approach improves computational efficiency and minimizes a trusted computing base, rendering it more amenable to formal analysis. The verifier may include a verify oracle, which is a trusted software/hardware component hosted by an untrusted third party. It is allowed to view a prover's attributes in the clear and attest that they satisfy a certain predicate queried by the verifier.
An example registration protocol is provided as follows. First, a prover generates their public key. The issuer hands the prover a random string r, the prover generates sk′p and generates pkp=f(skp) for skp=(sk′p,r) and common knowledge collision resistant function f. In order for the registration to be accepted, the prover should prove to the issuer in zero knowledge that it knows a corresponding sk′p. The (semi-honest) issuer's contribution to key generation is to keep a malicious prover from stealing credentials associated with a compromised private key.
In regard to credential subletting, it may be beneficial that the issuer should demand the prover to incorporate some important secret about their account (not even known by the issuer) into the private key, such that the secret can be inferred from the private key. This will discourage provers from sharing credentials with one another. Alice may be willing to let Bob use some credential issued to her by her bank, but not be comfortable giving him complete control over her bank account.
Another possible technique to approach this is to issue credentials to specific devices, using private keys that the device can create for an application and sign using them on the application's behalf, without ever sharing the key with the application.
An example issuer protocol is described:
The issuer generates a signature on the prover's attributes using an arbitrary signature scheme that is secure against existential forgery. For the construction to be secure, the signature should also involve a description of the token's data structure.
More formally, the prover and the issuer agree on a string ap representing the prover's attributes. The issuer knows the prover as the owner of pkp, satisfying ap. The issuer sends the prover back a signature σi=sig(ski;pkp∥ap) to the prover.
It is not, strictly speaking, necessary for the prover to have a public key at all. However, if the issuer enforces limits on how often it would register a new public key for a client, provers will be discouraged from subletting their credentials to one another. This stands in opposition to keyless credentials, where disclosing the secrets to a credential doesn't cost the original owner anything.
An example protocol for showing verification by the oracle is provided.
Let the prover and the verifier both trust a verification oracle known by the key pair (sko,pko).
The verifier chooses a random challenge c and sends it to the prover. A proof request predicate P is agreed upon. The prover composes the string d=(pki∥skp∥ap∥σp∥c∥P) and sends enc(pko;d) to the oracle.
The oracle decrypts d and checks that the following propositions are satisfied:
sigver(pki;σi;∫(skp)∥ap)
P(pki,ap)
In case of successful verification, the oracle outputs σo=sig(sko;c∥P) or it outputs ⊥ otherwise.
The verifier only needs to check that sigver(pko;σo;c∥P) holds.
Note that as regards propositions P that do not depend on anything outside ap (e.g., time) there is no freshness requirement; therefore the challenge c can simply be regarded to be the empty string in such cases.
For examining the approach security, the following collusion scenarios are considered:
Malicious issuer and prover to break soundness: This attack can be trivially mounted and in some embodiments, there is not attempt to prevent it. The issuer can always issue bogus adaptively chosen untruthful credentials for an accomplice prover. In practice, such a problem is best thwarted by strong and transparent authentication and KYC practices by issuers, as well as careful choice of trusted issuers by verifier consortiums based on thorough vetting.
Malicious issuer and verifier to break zero-knowledge: Zero-knowledge in this context means that an adversary controlling all issuers and verifiers cannot pinpoint which of the trusted issuers implied by the query and which of the credentials obtained from the issuer the credential in use is. For this, the analysis makes use of the CCA2 property of the encryption scheme used in Acquire Proof queries.
More formally, consider the following game, where the adversary is allowed to make polynomially many of the following queries, interleaved with polynomial computations:
The adversary wins if D is non-empty and he can guess the value of d with non-negligible advantage over a random choice.
A simulation argument naturally follows from this intuition and is therefore avoided.
Malicious prover to break soundness: The adversary is allowed polynomially many queries from the following list; arbitrarily interleaved with one another and polynomial-time computations.
There are no queries regarding corrupted or corruptible Issuers and Oracles since such parties can be simulated by the adversary herself.
In
In response, the prover computing device sends encrypted credential, challenge and proof request to a master verifier computing device. The master verifier signs challenge and proof request computing device.
This approach, while requiring additional infrastructure relative to the approach of
A Verify Oracle supports three states:
Trust by Provers and Verifiers is assumed in all the previously described models as a common reference. Also, for obvious performance concerns, it is vital to be able to perform Remote Attestation on an enclave non-interactively. As such, the enclave's host can perform a publicly verifiable remote attestation with its enclave and publish the transcript to it. In order to do so, she may employ the Fiat-Shamir heuristic using the collision-resistant function H(.) modeled as a Random Oracle. If the Remote Attestation Verifier would normally use a probabilistic polynomial-time algorithm m2←A(m1;r) to generate the second message, in this scenario the second message would be derived through m2←A(m1;H(m1)).
A proof request can be defined in accordance with variations of the following examples.
The language describing policies should be simple to interpret so as not to expose the system to security risks.
In order to prevent the execution from leaking information about the attributes, the language should preclude programs with data-dependent access patterns, runtime and execution paths. Here, a C-like language called StraightTalk is described as an example, and it is only capable of describing straight-line programs:
policy
token-definition statement *
expression
token-definition
token ‘{’ variable-definition * ‘}’
variable-definition
type identifier-list ‘;’
identifier-list
identifier
identifier-list ‘,’ identifier
type
basic-type
basic-type ‘[’ integer ‘]’
basic-type
statement
variable-definition
expression ‘;’
argument-list
nonempty-argument-list
nonempty-argument-list
expression
expression ‘?’ expression ‘:’
expression
expression binary-operator
expression
unary-operator expression
function-like-operator ‘ ‘
argument-list ‘ ’
string
base64
identifier ‘[’ integer ‘]’
identifier ‘[’ integer ‘]’‘[’ integer ‘]’
identifier
number
unary-operator
binary-operator
In this example, the secure enclave is adapted for processing encrypted credentials, challenges, and proof requests. The secure enclave can be a processor or a secured memory location that is configured for maintaining a verifier secret key utilized to generate a first signed message.
The verifier computing device receives, from a prover computing device, a second signed message including at least an enclosed issuer signed message representing one or more encrypted containers storing at least one or more characteristics of an identity profile of a prover entity along with a message authentication code based at least on the proof request data structure.
The verifier computing device then transmits the second signed message, the proof request data structure, and the one or more encrypted containers to the secure enclave.
The verifier computing device then receives a response data message from the secure enclave indicative of whether all of the one or more logical conditions were satisfied by the at least one or more characteristics of the identity profile of the prover entity. In some embodiments, the secure enclave is configured to provide publicly verifiable remote attestation with a verifiable threat model and a verifiable proof of security.
A remote attestation protocol involves a zero knowledge proof with a prover and a verifier, the enclave being the prover. A direct run of this protocol by both Identity Brokerage parties (prover and verifier) may compromise efficiency. Therefore, a mechanism is implemented using the Fiat-Shamir heuristic, and the enclave's maintainer is configured to run an instance of remote attestation in a publicly verifiable manner.
Instead of using actual random inputs, the remote attestation verifier (the enclave's maintainer) replaces every randomness with the output of a pseudorandom function applied to the entire protocol transcript up until that moment, and an arbitrary initial nonce. Thus, by presenting the transcript of this protocol instance, the enclave's maintainer can efficiently convince the identity brokerage parties that the enclave is a trustworthy one.
In some embodiments, the verifier enclave or a third party hosted system tracks records transcripts of an exchange, which are exposed to the public. For example, it may be the responsibility of a verifier computing system to run a remote attestation protocol with its enclave once whereby the enclave communicates its public key, which is then stored in on a transcript exposure module, which may be hosted by any one of the computing systems associated with any one of the parties or by a third party hosted system. In order to establish the honesty of the transcript, all the randomness used on the verifier's cryptography are to be created using a pseudorandom function (hash, block cipher, etc.)
involving all or some of the information available to the verifier's computing device at a time of a credential validation transaction.
The secure enclave processor maintains a verification transcript in relation to its own credentials, as the enclave is wholly trusted by both the prover and the verifier, it should be strongly vetted itself.
Chip manufacturers provide mechanisms to verify an enclave involving multi-round interactive protocols. Remote attestation is a protocol based on bilinear group signatures, whereby an enclave proves to a third party that it is running on a legitimate Intel SGX platform, and that is running a particular program.
In such an approach, a custom-built signature mechanism is used for the issuance protocol that is amenable with the type of zero-knowledge proofs of knowledge approaches used in the proof protocol. Notable examples are U-Prove (relying on Diffie-Hellman type hardness assumptions), Idemix (relying on RSA-type hardness assumptions) and later approaches based on bilinear maps.
In this example, the prover has navigated to a page for a skydiving program whereby the user must prove that they are at least 18 years of age, and that they have been recently declared healthy. There is also a student deal, where if the prover wishes to access the student deal, they must also prove that they are currently enrolled at a university.
Processor 2002 may be an Intel or AMD x86 or x64, PowerPC, ARM processor, or the like. Memory 2004 may include a combination of computer memory that is located either internally or externally such as, for example, random-access memory (RAM), read-only memory (ROM), compact disc read-only memory (CDROM).
Each I/O interface 2006 enables computing device 2000 to interconnect with one or more input devices, such as a keyboard, mouse, camera, touch screen and a microphone, or with one or more output devices such as a display screen and a speaker.
Each network interface 2008 enables computing device 2000 to communicate with other components, to exchange data with other components, to access and connect to network resources, to serve applications, and perform other computing applications by connecting to a network (or multiple networks) capable of carrying data including the Internet, Ethernet, plain old telephone service (POTS) line, public switch telephone network (PSTN), integrated services digital network (ISDN), digital subscriber line (DSL), coaxial cable, fiber optics, satellite, mobile, wireless (e.g. Wi-Fi, WiMAX), SS7 signaling network, fixed line, local area network, wide area network, and others.
Computing device 2000 is operable to register and authenticate users (using a login, unique identifier, and password for example) prior to providing access to applications, a local network, network resources, other networks and network security devices. Computing devices 2000 may serve one user or multiple users.
There may be multiple issuer entities, each having their own set of associated computing devices (e.g., computers in data centers). The issuer entities and their computing devices may have heterogeneous data storage mechanisms, which can include local storage and/or specially allocated memory storages, including secured enclave memory and/or processors.
As illustrated in the example of
The blockchain data structure, in some embodiments, is a public blockchain (e.g., publicly accessible, such as an Ethereum blockchain), or a private, permissioned blockchain that operates through a propagated distributed ledger shared across a set of trusted nodes. The blockchain data structure may store immutable attributes (and/or encrypted or signed versions of the attributes).
Multiple authoritative issuer devices 2202, 2204 are configured to provide signed attributes which may either be delivered to the client special space 2206 (e.g., a secured enclave having secured memory and/or secured processors) or onto the blockchain data structure 2208. These attributes represent aspects of client personal information. The client special space 2206 can store some signed attributes and the proof response logic, and can be stored in a data storage remote from the on-board memory of devices associated with the client. A benefit of having a client special space 2206 is that, for example, multiple client devices 2210 are able to connect to the client special space 2206 (e.g., a tablet, a mobile device, a desktop computer), and if a client loses a device, the signed attributes remain accessible.
The blockchain data structure 2208 is adapted to store on one or more distributed ledgers, data blocks representing signed attributes and the proof response logic, according to some embodiments. Similarly, multiple client devices 2210 are able to connect to the blockchain data structure 2208 (e.g., a tablet, a mobile device, a desktop computer), and if a client loses a device 2210, the signed attributes remain accessible.
Proof requests and responses can be conducted using, for example, connected electronic client devices 2210 (e.g., a mobile device, such as a smartphone or a tablet) or other devices that are connected to the mobile device using Bluetooth low energy (e.g., a wearable device). The client devices 2210 may store locally a public key which can be utilized to encrypt data messages for decryption using the corresponding secret key or validate signed attributes that are signed based on the corresponding secret key.
At step 1 (shown by the encircled numbers in
A public/private key pair is generated, and the client device retains the public key (Pk) in data storage, while the private/secret key (Sk) is stored on the client special space data storage.
At step 2, a data communication occurs between the client device and the issuer device. The client authenticates (e.g., 2 way authentication) in relation to the issuer device, selecting which attributes he/she wants to provide (e.g., Age, zip, over 21, etc.) from a set of offered selections. In some embodiments, the issuer device may provide one or more interface options indicative of potential token destinations (e.g., storage destinations), including the client special space data storage or a blockchain-based data storage, such as a permissioned blockchain or a public blockchain.
The issuer is configured to then deliver attributes. For example, in a client secret space data structure, the client's device can provide a pointer (e.g., a uniform resource locator address (URL)) to a special space data structure and can transmit the public key (Pk) to the issuer device. Attributes are then created by the issuer device using a combination of the client's public key and the issuer's secret key (Sk), which can be delivered for storage on the client secret space data structure (3A) or on a data block of a blockchain. Where the attribute is stored on the blockchain, the attribute may be made public (if the issuer is blockchain enabled) and thus publicly viewable. The public blockchain may be configured to store a pre-defined set of attribute types that are thus publicly viewable (e.g., using a blockchain explorer).
In another embodiment, where the issuer device is capable of interacting with the blockchain and the attribute can be made public (e.g., over 21), the issuer device delivers attributes signed by the issuer using the client's public key (Pk) to the blockchain (3B).
In another embodiment, where the issuer device is capable of interacting with the blockchain and the attribute requires the client's permission (e.g., date of birth), the issuer can store on the blockchain signed attributes that are encrypted with the client's public key (3C).
In an alternate embodiment, a sidechain is utilized to keep attestation private between the involved parties.
Sample steps for verification are described herein, whereby the client device forms a communication channel with the authenticated verifier, and the verifier makes a “Proof Request”.
The proof request can be provided, for example, to the client secret space data storage. At step 3A, a “Proof Request” is sent to the Client's special space data storage, where a bounded device has both the URL and public key to access the special space. At step 4A, a “Proof Response” is sent back to the client device.
In an alternate embodiment where the issuer device is blockchain-enabled and the attribute can be made public (e.g., that the user is over 21), at step 3B, data messages representing the “Proof Request” are transmitted to the blockchain (or a blockchain explorer tool configured to interact with the blockchain), and at step 4B, a data message representing the “Proof Response” is sent back to the device as the attribute is public. In an alternate embodiment, the client device can direct the verifier to the blockchain, for example by providing a link or a block number indicative of where the attribute is stored.
In another embodiment, the issuer is blockchain-enabled but the attribute needs client's permission (e.g., date of birth). In this example, at step 3C, a “Proof Request” is sent to the blockchain (e.g., or to a blockchain explorer tool configured to interact with the blockchain), and at step 4C, an “Encrypted Proof Response” is sent back to the device.
The client device may then be required to grant permission to share the attribute at step 5C, and responsive to permission being granted, at step 6C, the “Encrypted Proof Request” is then transmitted to the client special space data storage for decryption. At step 7C, a Decrypted Proof Response is sent back to the client device that could then be provided to the verifier device.
A device is positioned at a point of sale or other location where verification is required, such as in front of a bar serving alcohol. The verification can be used as a gatekeeper mechanism in some cases, or in other cases, as a tool to determine whether a particular individual is entitled to various statuses or discounts (e.g., student discount). The device 2502 is adapted, in a specific, illustrative example, in relation to a requirement to check the age of all customers.
Without such a system, a cashier would have to request an identification card (which could be falsified) and perform mental math to determine if the individual is over 19 years old. This is time consuming; and requires mental work effort. Furthermore, the customer may find the check to be invasive, as the relationship between cashier and customer is unknown.
The device 2502 can be a terminal set up at the point of sale, for example, which could be designated a verifier computing device. The device 2502, in an example embodiment, may be configured to render a visual representation 2504 of a resource locator, such as a quick response code. The quick response code can be related to an underlying data element, such as a URL, which the client device can interact with, for example, by scanning the code to access the URL. On the backend, verification processes as described in various embodiments herein are utilized to transmit or otherwise make available signed attribute information of the client device, which are then provided to the verification device 2502 such that verification device 2502 is able to verify specific attributes of the client (e.g., age>25). The verification device 2502 can be configured to modify a rendering 2506 to visually or audibly indicate that the client has successfully passed or failed the attribute test.
In some embodiments, one or more aspects of the blockchain, tokenization and/or verification/validation/proof processes described herein can involve one or more secure execution environments and/or secure storage elements. For example, in some embodiments, the storage of private keys and tokens, in addition to computations required for issuance and proofs, could be performed on Trusted Execution Environments, Smart Cards, Secure Elements or Trusted Platform Modules on devices such as mobile and personal computers using corresponding APIs.
In some embodiments, a computing system includes or is configured to provide a plurality of distinct execution environments. The isolation of these environments can be enforced using software or hardware. In some embodiments, a distinct execution environment can include one or more secure storage elements (for example, a Secure Element or one or more aspects of a Smart Card).
The distinct execution environments are, in some embodiments, configured to provide access to different storage and processing resources.
In some embodiments, one of the environments may be referred to as a trusted execution environment (TEE) and may have access to isolated and secure storage and processing resources.
In some embodiments, a secure environment may support a distinct operating system, or it may be a set of secure resources accessible to applications that are assigned to use it by the underlying operating system of the overall system. In some embodiments, a computing system includes a dedicated secure storage resource, such as a separate secure storage or a secure storage area within a general storage resource. In some embodiments, the computing system includes a dedicated secure memory device such as a separate secure memory, or a secure area within a general memory resource (e.g., secure memory may be accessible in a different address space or in a different address range).
These resources may be physically and/or logically distinct from the general resources of the same type. In a computing system that includes or is configured to provide two distinct execution environments: the first execution environment is a secure execution environment, and the second execution environment is a potentially unsecure environment.
The secure execution environment is sometimes referred to as a trusted execution environment (TEE) and the potentially unsecure environment is sometimes referred to as a rich execution environment (REE).
The second execution environment (i.e., the potentially unsecure execution environment) is configured to communicate with the first execution environment (i.e., the secure execution environment) to request one or more aspects of the tokenization and/or verification/validation process to be performed.
The second execution environment includes an unsecure portion of a processor, memory, and storage. Software code of the second execution environment can include an unsecure OS which is stored in storage, loaded into memory at run time, and executed by processor to perform OS operations. In some embodiments, software executable by the second execution environment can include one or more APIs or other software components for providing function calls or otherwise interfacing with one or more components of the first execution environment.
For example, in some embodiments, the first (i.e.,. secure) execution environment can include (or store) one or more keys such as root keys, private keys, and the like for generating signed tokens, validating one or more signed data elements, and/or the like. The first execution environment can also include (or store) one or more tokens against which one or more credentials or other data elements can be validated.
In some embodiments, first execution environment can include one or more software components including computer executable code for generating/issuing and/or validating one or more tokens, credentials and/or other data elements.
For example, in one example embodiment, a digitally signed token representing a verified identity or account can be stored in a secure storage element in a secure execution environment. A secure execution environment can include computer executable instructions which receive from an unsecure execution environment one or more data sets representing one or more biometric verification credentials.
The computer executable instructions and the secure execution environment can be configured to perform one or more calculations or data transformations to validate that the data sets representing the biometric verification credentials match or otherwise correspond to the digitally signed token as described herein or otherwise. In some embodiments, the data sets representing the one or more biometric verification credentials can be received at the device on which the secure execution environment resides and/or an external device in communication with the device in which the secure execution environment resides.
In some embodiments, secure execution environment can return one or more signals indicating whether the biometric verification credentials are valid or otherwise match the digitally signed token. Some environments, the signals can include one or more signed data elements to confirm the veracity of the signals.
In some embodiments, the secure execution environment can be configured to respond to proof requests from unsecure execution environment(s).
In some embodiments, a secure execution environment can be used to generate a signed token. In some embodiments, a secure execution environment can receive from an unsecure execution environment one or more tokens and/or credentials. One or more software elements within the secure execution environment can generate a signed token and/or credential using one or more private keys stored within the secure execution environment. The signed token and/or credential can then be returned to the unsecure execution environment.
In some embodiments, one or more aspects of the blockchain verification, transaction and/or other modification processes can be performed within a secure execution environment to ensure that private keys, addresses, credentials and/or the like are only accessible by authorized users and/or processes within the secured environment.
Any other aspect of the tokenization and/or validation process can be similarly applied to using these secure and unsecure execution environments to ensure that sensitive information such as keys, credentials, tokens, tokenization algorithms, biometric data, biometric processing algorithms, blockchain transactions/activities, neural networks, and/or the like are only accessible by authorized users and/or processes.
In some embodiments, sensitive operations using a private key may be performed only in a secure area. In some embodiments, all or additional operations maybe performed in a java card space of a smart card.
Applicant notes that the described embodiments and examples are illustrative and non-limiting. Practical implementation of the features may incorporate a combination of some or all of the aspects, and features described herein should not be taken as indications of future or existing product plans. Applicant partakes in both foundational and applied research, and in some cases, the features described are developed on an exploratory basis.
Consider a dummy example where the group is of order 23. The prover has a commitment
and wants to prove that it is nonzero.
The prover first privately computes
Note that
The prover first samples two random numbers, say r1=6 and r2=21. It computes R=Cx6h21=g9h and hashes Cx and R together to obtain c, say its value is c=7. The prover computes
and sends over Cx, R, z1 and z2.
The verifier independently computes c and checks that
which is indeed true.
It can be seen that the prover was required to perform one modular inverse, two randomizations, two group exponentiations and one group multiplication. The method of Belenkiy requires 4 randomizations, 3 group multiplications and 7 group exponentiations.
Here is an example run of the protocol in a real-world setting. The NIST-recommended SECP256r1 curve has been used as the discrete log group with g being the default generator.
Integers:
x=989286703893525103965231516973559043547920049579737744125956333032737324 97743
y=950080536003610934319230465754247514336093771353579969411105819939381652 36367
x−1=462698403265673518540994510511338275355253401833673009216218520370008469 66515
−yx−1=811883617617445595202675787660756899150690176107941830256968169087277732 41705
r1=641741887648009138242672540494542503838803238375742165794804496772782878 45053
r2=780720476939726975622572766237501598771829933800253448848906197165965692 62161
c=962249901098101212105656204409842333674860682001691664025921912915116318 01053
z1=7755875620401220008047568488431038729137094391010731158546635866168352 4082068
z2=1096792458872982851774402447421842631107855970634305831369242133680040 95632307
Note: The specific value of c depends on how hashing is carried out. Many possible ways of doing so can be valid. The process involves deterministically serializing the objects to be hashed (however seen fit) and pass the resulting byte array through a secure hash (in this example, SHA256).
Note: These integers are displayed in base 10.
Group Elements:
Cx=(e6ab3db4c6691dda4a8b05d79a15559c18181cda6c6dfc7fc77f41dff392e41,f0d8a7a52 e882ff9da9f64cf082db98bbe1db6fa6f965dc96a4150b95270e073,1da350a2e431d51de921 7a218313fb2cc39f8f1dda48ea33ad7b1e561ef00e89)
h=(e6ab3db4c6691dda4a8b05d79a15559c18181cda6c6dfc7fc77f41dff392e41,f0d8a7a52e 882ff9da9f64cf082db98bbe1db6fa6f965dc96a4150b95270e073,1da350a2e431d51de9217 a218313fb2cc39f8f1dda48ea33ad7b1e561ef00e89)
R=(b5e8e60e25842deb89cdb80047e49681b566b8bcf6b6fd6298fdc7dab5857300,54cacd1 79ab2e3fbc892b1001c47408dc1d8559c8a2dce519094ab874b640e87,11b5ec881901a090 1d73d0892402c3f70d96f6d23ca851cd7fe9402f886f6bb4)
Note: These group elements are displayed in projective coordinates base 16.
Consider G to be a discrete log group of prime order p and g and h be generators with unknown discrete logs. (Setting identical to that of all the rest of the protocols)
Let numbers q and l be such that
and two whole numbers a and b such that
Consider commitments
to a and b, respectively.
To prove that a≤b, the following steps can be taken:
for
where ai, bi and ci are the i'th bits of a−l, b−l and c, respectively. mai, mbi and mci are sampled randomly.
and likewise
the prover does the following:
Note: It may be that either a or b are known to the verifier. In such a case there is no need to decompose the known number and commitment C will have the same mask exponent as that of the unknown parameter.
Note: In an embodiment, the prover avoids sending A0, B0 and C0 to reduce the size of its messages. In that case, in step 6, instead of verifying a relation between the bit commitments the verifier derives A0, B0 and C0 independently.
Let p=23, l=0, q=8, a=3, b=5
Appendix: The following section provides discussion on proofs and complexity analysis
Σ-Protocols
A Σ-protocol is a schema for proving knowledge of a witness w to an NP-Statement x∈L. It consists of the following three moves:
The verifier accepts if and only if the components of their view satisfy a certain polynomially-verifiable relation. The view includes are the problem statement P(x) and the three messages exchanged.
A Σ-protocol must also satisfy the following properties:
Note that the concept of a Σ-protocol is much broader than the algebraic cases used in this solution. Namely there is very little limitation on the mathematical nature of the NP-Statement to be proven, its witness, and the commitment scheme in use.
Note that it is not sufficient for the challenge to be unique. It needs to be unpredictable. From the Special Zero Knowledge property it follows that a malicious prover can fool a verifier if able to guess the challenge prior to commitment generation.
Note that even though a proof of knowledge, a Σ-protocol is not necessarily zero-knowledge, nor compose-able in parallel.
Composing Σ-Protocols
Disjunction
In order to prove that at least one of the two statements A and B is true, the prover proves one (one that is actually true) and simulates the other (which the zero knowledge property guarantees to be possible, regardless of its truth). In order to allow the prover to simulate one and only one of the two, for each verifier challenge c an efficient and efficiently invertible bijective function
where C is the set of possible challenges is devised.
The prover may use arbitrary challenges c1 and c2 for its subproofs, as long as
Intuitively, the prover can fix one of the challenges and generate bogus commitments. After receiving the uniformly random verifier challenge, the prover will be bound to a uniformly distributed challenge for the other statement, which forces it to try and prove it honestly.
Special Soundness:
Imagine two accepting executions of the proof with the same mask commitments and different verifier challenges. Represent the two transcripts as
There is the following:
Without loss of generality, assume that
are both accepting transcripts, a witness can be extracted for the first statement, demonstrating that the prover knew it was true, as well as a witness to its truth.
Special Zero-Knowledge:
The simulator can sample and c1 in advance and derive c2 from the two. Because the protocols for the sub-statements are zero-knowledge, the simulator can output
that are indistinguishable from honest transcripts, given the challenges.
But c1 and c2 are uniformly distributed, therefore the simulated transcripts are indistinguishable from honest transcripts.
Cost:
The computational overhead is negligible compared to that of providing the honest and simulated proofs, both in providing honest or simulated proofs and verifying them.
Communication has an overhead of the prover sending one of the sub-challenges to the verifier.
Note that the necessity of actually implementing proof simulators to build disjunction provers makes their efficiency relevant. Therefore, most complexity analyses in this document will include an estimation of the work required to simulate a proof.
Note that such differences do not enable timing attacks when disjunctions of two statements of the same form are proven.
However, whenever
is to be proven, care should be taken to ensure that the timing difference between honest P1 and simulated P1 is the same as the timing difference between honest P2 and simulated P2. In other words, in order to maintain zero knowledge one needs to guarantee that
Conjunction
One may need to prove knowledge of all statements
each with their own Σ-protocol. That is simply achieved through synchronizing the execution of their proof protocols. Meaning that all commitments
are sent first and the same challenge c is used to provide all responses
Special Soundness:
Take two accepting transcripts
From the soundness of each sub-protocol one can deduce that all
can be extracted.
Special Zero-Knowledge:
Given the challenge c beforehand, the sub-simulators can each create honest-looking transcripts
Clearly a sequence of such transcripts will also be honest-looking.
From Σ to ZKPoK
A Σ-protocol is not guaranteed to be Zero Knowledge. However, under certain black-box transformations, a Σ-protocol can be converted into a ZKPoK.
One such transformation (perhaps in the Random Oracle Model? Is that really bad?) uses the Fiat-Shamir heuristic.
Prior to the execution of the Σ-protocol, the verifier hands the prover a challenge string, later used in generating the Σ-protocol challenge. The main body of the protocol is made non-interactive, as the prover self-generates the challenge by hashing together the verifier-provided challenge string and its own commitments.
Roughly speaking, by having both parties contribute freshness to the hash input, the challenge is guaranteed to be unpredictable, in the sense that neither can guarantee any meaningful relation between the prover commitments and the challenge with non-negligible probability, essentially incapacitating them both from cooking up any mischief. With both parties limited to honest behavior, the special soundness and special zero knowledge properties suffice to guarantee the zero knowledge and proof of knowledge conditions.
Note that the challenge string provided by the verifier need not be random; uniqueness suffices.
Zero-Knowledge:
It is observed that the transcript of an honest prover's interaction with any verifier is identically distributed as with an honest verifier. Since good randomness from the prover alone suffices to guarantee randomness of the challenge, regardless of what the verifier contributes into the hash input, the challenge will be honestly produced.
It now remains to show that an honest prover's view under the Σ-protocol is identically distributed as that under the transformed protocol.
Proof of Knowledge:
One still needs to argue why this transformation will not spoil the original Σ-protocol's Proof of Knowledge property.
In the Random Oracle model, one can rewind the challenge creation step and have two distinct challenges that the prover can successfully respond to.
By making the protocol non-interactive, the transformation also addresses the issue of composability. Reducing the number of messages from 3 to 2 somewhat trivially guarantees parallel composability. Moreover, all zero-knowledge protocols are sequentially composeable.
This makes Σ-protocols valuable tools, mainly because proving Special Soundness and Special Honest Verifier Zero Knowledge is much easier than proving Arbitrary-Verifier Zero Knowledge and Proof of Knowledge.
Algebraic Implementations
Operations are carried out in a prime order group G and the field of integers modulo its size. This group has at least two publicly known elements g and h, along with a hash function H. The security of the constructions depends on the following properties.
Security Assumptions
such that
that can be used to transform a Σ-proocol on G into a ZKPoK under the Fiat-Shamir heuristic. A Σ-protocol operates on G iff
for all its executions.
Schnorr Signature
This protocol allows an entity to prove knowledge of the discrete log x (private key) of a group element
(public key) by providing a valid response (signature) to a challenge (message). The non-interactive version of this protocol can serve as a signature scheme with the verifier challenge serving as the message to be signed.
As a Σ-protocol, the following steps are to be taken:
Completeness is evident. The following prove security:
Special Soundness:
Imagine two distinct transcripts with identical commitment and different challenges.
Thus a witness can be extracted.
Special Zero-Knowledge:
To counterfeit an accepting transcript, the simulator can first randomize the challenge and the response c,z∈F|G|. It then generates fake mask commitment
NOTE: In some cases, the simulator may be given the challenge c. Therefore, it's best to prove that a simulator can generate transcripts that are indistinguishable from honest ones given the challenge.
Now it remains to prove that real and fake transcripts are identically distributed:
One can represent random variables with boldface capital letters. C,Z and R represent the challenge, response and the mask respectively.
In both cases, the challenge is uniformly distributed. In other words
In the counterfeit case, the response is generated independently of the challenge, therefore
In the honest case, the response is obtained by adding a uniformly randomized r to a fixed multiple of c. Therefore, in that case also one will have
In both cases, R is uniquely determined from c, z and X. Therefore, one can say that the random variable
is identically distributed across the two cases.
Thus the protocol is Honest-Verifier Zero Knowledge in the strongest sense, namely, information theoretically.
Computational Complexity:
Note on Complexity
All complexity estimates ignore the process of creating and communicating the challenge, because it can be amortized. Moreover, sometimes it is handed down from another protocol.
Note on Computational Complexity
Every group exponentiation involves
group doublings and compositions. This number is considerable, at least about 256.
Every group doubling or composition in turn, involves a constant number of Field multiplications and inversions.
And finally, every field multiplication involves
field additions, which is again at least about 256. This means that compared to a group exponentiation, a field addition, subtraction or negation takes a negligible
time and effort. For the purposes of simplifying this analysis, the analysis will dismiss basic field operations as inexpensive, reasonably certain that the analysis will not suffer much.
Communication Complexity
I field and 1 group element.
Note on Communication Complexity
In many contexts, one may assume that transmitting a group element requires more or less the same number of bits as a field element does. Even so, more work may be required on the receiver's side to reconstruct a representation amenable to manipulation.
Generalized Schnorr's
Schnorr's signature can be naturally extended to efficiently prove knowledge of a group element X's discrete logarithm based multi-generator
In addition, the prover may want to prove that some of these exponents are equal.
Prover knows
where
surjectively maps an exponent's index to that of its equality class. That is to say, there are
equally classes, indexed from 1 to l.
The prover does the following:
The verifier accepts if and only if
Special Soundness:
Consider two transcripts
This gives us witnesses
such that for every
with
one has
Notice the one-sidedness of the above implication. It is not at all asserted that variables belonging to distinct equality classes hold distinct values.
Special Zero-Knowledge:
For a given challenge c∈|G|, the simulator uniformly samples
and computes
This is identically distributed as an honest transcript. Given a challenge c, the zis are completely masked by the ris and therefore uniformly distributed. Finally, as in the simulated case, the commitment R is uniquely determined by the challenge and the responses.
Computational Complexity:
Communication Complexity:
1 field and 1 group elements.
Proving Knowledge of a Pedersen Commitment
Proving, verifying and simulating knowledge of commitment
is reduced to Generalized Schnorr's with the following parameters:
Computational Complexity:
Communication Complexity:
2 field and 1 group elements.
Side note: Soundness of the Pedersen Commitment Scheme:
If a prover can open a commitment in two way
one has
From the assumption that the prover does not know the relative discrete log of g and h, one can infer that
Very Generalized Schnorr's
This is called an “Equality Map” in the Microsoft Specifications. The problem is to prove knowledge of the discrete logarithm of a set of group elements
as:
Moreover, there is an “equality map” as common input, taking the index of an exponent to that of a witness:
The symbol I stands for the equality map as well as the number of non-trivial witnesses.
The prover performs the following:
by convention.
send
send back
by convention.
The verifier accepts if and only if
Special Soundness:
This follows naturally from the special soundness of Generalized Schnorr's. Intuitively, the relevant witnesses for each discrete log relation can be extracted independently of the others. Consistency is still guaranteed.
Special Zero-Knowledge:
Given group elements
challenge c and equality map l, the simulator samples
sets
and for
computes
Similarly to previous arguments, in both simulated and genuine cases, the set
is uniform and independent of the challenge c, and the set
is uniquely determined given the two. Therefore, those cases are identically distributed.
Computational Complexity:
Consider the set
Here, one can make an additional assumption that was not essential to the security analysis but makes complexity analysis easier.
That is, none of the discrete log witness vectors are trivially zero.
Communication Complexity:
I field and M group elements.
Commitment Equality Proof
In order to prove equality of two commitments
it is more efficient to devise a specially tailored proof scheme rather than using the generic equality map.
Both parties simply compute
The prover proves knowledge of
Special Soundness:
From the special soundness of Schnorr's it follows that the extractor can compute rδ such that
Assuming that the prover has knowledge of X1 and X2 as valid commitments, it should be able to compute
Because the prover doesn't know
it must be that
Note that this soundness guarantee is only valid if it is already ascertained that the input commitments are valid, which is not the case with the generic equality map.
Special Zero-Knowledge:
This follows directly from the zero-knowledge property of Schnorr's.
Complexity:
The simulator computes
and simulates a Schnorr's. This results in 2 randomizations, 2 exponentiations and one extra composition. Therefore, there is an extra exponentiation.
Commitment Non-zeroness Proof
In order to prove for a commitment
that
one only needs to prove the existence of a multiplicative inverse for x, namely
In other words, it suffices to prove the discrete log of g base X and h.
Special Soundness:
Since the prover doesn't know a nontrivial discrete log relation between the generators, this guarantees that
Therefore, a is the multiplicative inverse of x, guaranteeing that x is nonzero.
Special Zero Knowledge:
This follows directly from the zero knowledge property of the knowledge of commitment protocol.
Complexity:
To set up the discrete log proof, the prover needs to compute x−1 and −rx−1, requiring one field inversion and one multiplication only. The verifier and simulator need no additional setup.
Inequality Proof—Known Value
Given
with y common to both parties and x and r private to the prover, one can prove
by carrying out a non-zeroness proof on
As a side effect, this also proves knowledge of x and r.
Special Soundness:
An extractor can obtain
and r, with the guarantee that
From that x can also be determined.
Special Zero-Knowledge:
is uniquely determined from common inputs X and
and the subprotocol it is handed to is special zero-knowledge; thus so is this protocol.
Complexity:
There may be an additional exponentiation required if a party doesn't have
cached.
Inequality Proof—Unknown Values
This works very similarly as with known values. Group elements X1 and X2 are common input, and private to the prover are elements
such that
To prove that
the parties both compute
The prover also computes
and proves that xΔ is not zero using the commitment non-zeroness proof.
Note that this protocol doesn't prove knowledge of the witnesses
Complexity:
Bit Commitment Proof
Pedersen commitments to bits will play an important part in future sections, when comparison of two numbers is of concern. The commitment is of the form
where
In order to prove that such a commitment is well-formed (that b is indeed a bit) one may have the following:
Therefore a disjunction of knowing the discrete log of either C or
base h does the trick.
Complexity:
The prover and the verifier both do the additional work of computing Cg−1; namely, an additional group composition.
This results in 3 field randomizations, 3 group exponentiations and 1 composition for the prover, as well as 4 group exponentiations and 3 compositions for the verifier.
There is no additional communication cost.
Bit Decomposition Proof
Certain properties of committed attributes cannot be naturally expressed using algebraic techniques so far covered in this document. Comparing two numbers is one such example as doesn't come with a notion of order. It can be done however by a circuit operating on individual bits making up the operands. This means that one needs to produce commitments to bits and to simulate binary circuits. Fortunately, both of these can be done algebraically. This section describes a method to do the former.
The common input is a commitment
with the prover having x and r as private input.
To decompose x into an N-bit number, the prover generates the following:
The prover sends
to the verifier. Apart from that, the prover will need to provide proofs of commitment for
Special Soundness:
Take two accepting transcripts
From the soundness of the conjunction proof, it can be deduce all witnesses wn can be extracted.
Special Zero-Knowledge:
A simulator can randomize group elements
and have sub-simulators generate honest-looking sub-proofs for their validity as bit commitments.
Complexity:
The prover randomizes
masks and infers r0 from them. The computation is as follows:
This takes N−1 field additions and N−1 field doublings. In other words, it takes
field additions, about as expensive as a couple of multiplications, at most. In most cases though, 2N is much smaller than the group size and therefore the cost of these additions can be disregarded.
Similarly, computing C1 to
takes
group exponentiations and
group compositions.
In the same way as computing r0, computing C0 takes
group doublings and
group compositions. Alternatively, using the availability of r0, C0 can be computed directly as
This takes one group composition and one exponentiation. An exponentiation to the power of a large number takes at least about
exponentiations and
doublingd alone. Therefore, direct computation of C0 would always be less efficient.
Followed by that is N independent bit commitment proofs.
The verifier, takes C1 to
and computes C0, knowing C. The takes
group doublings and
group compositions. The rest of its work is to verify individual bit commitment proofs.
Bit Comparison Proof
Imagine an N-bit comparator design cascaded starting from the most significant bit.
The output of any 1-bit comparator unit comparing the nth digits is a commitment of form
The carry in cN is considered 0 and the carry out c0 is the result.
can be simply computed as
All other values are computed as
This is equivalent to saying
Proving and simulating
An honest proof would involve a proof of non-zeroness on
A simulation would involve a simulation of both subcomponents. Each contributes an extra exponentiation.
Proving and simulating
An honest proof involves a proof of zeroness on
and a proof of equality between
A simulation likewise involves simulating the two.
The most significant work difference between the
cases is an extra field inversion and two field multiplications when
A number of dummy operations may be necessary to close in the timing gap.
Overall Complexity:
When performing an honest proof, one of the predicates is proven and the other is simulated. When doing a simulation, they both are simulated. Disjunction provers and simulators both sample an additional challenge from the field.
Whole Number Comparison—Using Comparator Circuits
Assuming decompositions are already available to both parties, the prover needs to generate the bit comparison commitments
and prove their correctness. This takes N field randomizations, N exponentiations and N group compositions.
What follows is
comparators witn carry ins and a single group composition (providing
Whole Number Comparison—Using Signed Arithmetic (p's Complement)
Imagine a number
For any two whole numbers
one has the following:
q can be as small as the prover feels comfortable, as it puts an upper limit on the values a and b.
This construction essentially reduces comparison between two arbitrary numbers to a comparison between an arbitrary one and one the prover gets to choose. Powers of 2 are very easy to compare against, when equipped with decomposition proofs, which essentially enable one to make statements such as “a is at most N bits long.”
Complexity:
Assuming decompositions of both operands as common input, in order to imply they are at most N bits long, the prover only needs to decompose C=AB−1, again to prove it is at most N bits long. Thus, all that is needed is a group composition and a bit decomposition.
As compared with the circuit based method, this is considerably more efficient, as it requires about half as many exponentiations. However, it is more limited in that it cannot provide a commitment to the comparison result. It only proves the result directly.
Application to Identity Brokerage
Most of these problems start with Pedersen Commitments to finitely many attributes, in the form
These may be provided by U-Prove or another Anonymous Credentials protocol. The prover and the verifier have
as common input, and the prover additionally has
as private input.
A Zero Knowledge Max Gate
A protocol for zero-knowledge proofs is proposed below.
In particular, the maximum has been proposed as an effective activation function for deep convolutional neural networks.
The application of an activation, or transfer, function to a neural network (NN) is an important part of training a NN. Specifically, the activation function is required to be non-linear. This allows nodes to learn more complex tasks, whereas without a non-linear activation function, the entire NN training could be reduced to a simple linear transformation on its inputs.
Various options are available for use as non-linear activation functions in NN training. Among these are Rectified Linear Unit (ReLU), which is now widely accepted as a default activation function, due in part to a much simpler calculation requirement than alternatives such as sigmoid and hyperbolic tangent. ReLU is defined by a one-input maximum function: f(x)=max(0,x). Variations of ReLU are also used as activation functions; these include Leaky ReLU and Parametric ReLU, both of which feature maximum functions in their definition.
Widespread use of ReLU and its variations as NN activation functions demonstrates the effectiveness of maximum functions.
In the approach below, a two-input maximum function (the “max gate”) operating in zero knowledge is proposed.
The max gate takes two inputs to produce a single output, whereas ReLU (including its variations, Leaky ReLU and Parametric ReLU) and many of the other widely used activation functions in NNs map just a single input to a single output.
A function which takes more than one input, such as the max gate, when applied as an activation function to a NN node, may allow the node to better emulate a neuron in a biological neural network. In turn, this may allow for better NN training outcomes.
Two variations, and various combinations thereof are proposed, an if-then-else approach, and an algebraic method. The max gate can be configured with either, or both approaches, and the selection of which can be conducted based on a desired operating context or implementation. In particular, the max gate can be a model component that is utilized as a part of machine learning model computing architecture for improving certain aspects of the functioning of the machine learning model. As noted herein, the max gate can provide a computationally useful mechanism that aids in not only increasing computational speed in certain situations (e.g., for use with mobile GPUs or with dedicated mobile neural processing units (NPUs)), but also potentially in respect of avoidance of the vanishing-gradients or saturation issues that affects sigmoid/tanh approaches.
An improved approach is proposed as a construction for a proof, that can be used as a two-input maximum function for operating in zero knowledge. This can be used as an efficient way to conduct various proofs without having any knowledge of the underlying values. As noted in earlier sections, a prover and a verifier are able to conduct a verification without conveying any additional information, and the verifier is able to prove that the statement is true.
A challenge with zero knowledge proofs is that they can be computationally complex, and an improved approach is proposed herein adapted to improve computational efficiency to increase the viability of the approach.
From the previous sections, in this approach, one can assume that having committed numbers x and y, that one can obtain a commitment to a value c, that is 1 if y>x, and 0 otherwise.
Thus, the maximum of x and y, named m, satisfies the following:
m=x if c=0 and y otherwise (the if-then-else method)
m=x+(y−x)c (the algebraic method)
Two approaches are proposed above for building a zero-knowledge proof protocol. In some embodiments, just the first approach is used, in another embodiment, just the second approach is used, and in a further embodiment, both approaches can be used and selected between based on which is more efficient for a given scenario.
Either of these conditions yields a correct result, and efficiency is considered:
In both of these methods commitments to all bits of x and y are computed, therefore one may compare their performance from that point on.
The if-then-else method requires two honest and two simulated equality proofs, which amounts to 2*1+2*2=6 group exponentiations and no field inversions.
The algebraic method requires one proof of multiplication well-formedness. This means 4 group exponentiations on the prover side, and thus is more efficient than the other option.
How to Get a Carry
Applicants' 2's complement comparison proof method is more efficient than the more conventional circuit-based comparison method. However, it has the drawback that it can only disclose the comparison result. In order to get a commitment to the comparison result using the 2's Complement method, one would have to decompose the difference between the input numbers with the group's entire bit length, f, and then take the most significant bit as the comparison flag. Comparing by group exponentiations on the prover's side, this is more efficient than circuit-based comparison for
However, since in practice the group size is at least 256 bits and the significands are less than 54 bits long, this is unlikely to happen.
Practical Example
A set of practical examples are provided at
1. Alice goes to Financial Institution to train a model of Alice's face that's going to be stored locally on Alice's phone. Alice's phone maintains an uninitialized face NN, and at Financial Institution's backend systems, the model is trained such that the NN model weights are updated.
2. Financial Institution's system issues a credential based on the trained NN model and discards the training data. Financial Institution's system does not hold a copy of trained NN. Credential given to Alice and stored on her phone.
Time elapses, and the approach moves to the verification flow.
As noted above, a neural network is trained using Alice's photo/video, and the system helps protect Alice's privacy by discarding both the trained NN and the training data after getting the weights.
The max gate activation function can be used during model training, in some embodiments.
3. Alice goes to concert hall to see concert that is 18+. Concert hall usher has device to record Alice's face (or a video).
4. This new recording is provided to Alice's phone, and Alice's phone processes the new recording to initiate a presentation protocol whereby Alice's phone generates commitments that can be used alongside Financial Institution's system credential token to signal that Alice's NN was able to correctly process the recording of Alice's face.
5. Commitments are provided back to verifier—maybe with credential token—and credential token can be verified, for example, through a certificate authority.
6. Usher lets Alice into the concert.
The verifier device takes a contemporaneous photo/video of Alice. This gets sent back to Alice's device for NN inference using the stored weights/commitments. The ‘max gate’ activation function is used alongside the model weights when loading the model to perform NN inference.
The freshly captured photo/video is supplied as input to the loaded model to produce the inference result.
In particular, using the trained NN to process new recording to generate commitment proofs can involve using the model weights along with the Max Gate as the activation function. The Max Gate can improve the speed in which the commitments can be generated due to a closer alignment with functions handled by hardware acceleration present on the phone itself. This is hypothesized in respect of the specific type of hardware that is present on mobile devices, namely neural processing units, mobile graphics processing units, etc. In some cases, especially in respect of zero-knowledge aspects, the verification protocol can be assisted by cryptographic coprocessors available on mobile devices.
In addition to improving efficiency, another potential benefit of the max gate approach is that it may help with avoidance of the vanishing-gradients or saturation issues that affects sigmoid/tanh approaches.
The Max Gate function described herein is adapted to used as part another protocol, such as a privacy protecting NN, and that NN itself can be used for authentication. For example, the protocol can be used as extra security layer of biometric data, without having to store biometric data out to the world, and the protocol can be used to establish, as an alternative, a commitment to the model, to be proved in zero knowledge.
In some embodiments, the algebraic approach noted above has improved efficiency relative to the if-then-else approach. As an example implementation, sigma protocols can be used to implement a neural network or simulate a neural network, and as an activation function which sits between the layers of the neural net, one of the max gate embodiments proposed herein can be utilized. In the example above, it can be used to establish a neural net for Alice's face.
In this example, the approach would include creating commitments to the model weights, and have them signed by a trusted authority that's trusted by a verifier (think of the DMV). Alice could go to the bank and they would train a model of her face, and create a signed credential. The bank in this example, doesn't need to keep the model. The model can be, for example, stored locally, such that user will have the model (or the model weights).
The verifier would see a live video of Alice or get a photo of Alice, and then ask Alice to prove that there is a model that matches those. At this point, the ZKP protocol can be used along with presentation protocol, and commitment values showing that the model that Alice has accepts these weights. The ZKP process can be used to take those commitments that the neural network that would be built by those weights that would be committed to would accept the incoming face.
The use of the max gate approach provides a useful technical improvement that can also help improve the viability of the ZKP approach and the neural network functionality, especially as it relates to the computing limitations of Alice's device and/or the verifier device, and the amount of computing time available for such a determination (e.g., it may need to run in real or near-real time).
The approach can be useful for aspects beyond biometric data, for example, other variations where a model needs to be proven. An advantage is that cryptographic primitives that are commonly available are used, and the computational footprint is limited. Phones have hardware accelerators optimized for this type of calculation and additional speed advantages can be had in respect of portable devices.
The term “connected” or “coupled to” may include both direct coupling (in which two elements that are coupled to each other contact each other) and indirect coupling (in which at least one additional element is located between the two elements).
Although the embodiments have been described in detail, it should be understood that various changes, substitutions and alterations can be made herein without departing from the scope. Moreover, the scope of the present application is not intended to be limited to the particular embodiments of the process, machine, manufacture, composition of matter, means, methods and steps described in the specification.
As one of ordinary skill in the art will readily appreciate from the disclosure, processes, machines, manufacture, compositions of matter, means, methods, or steps, presently existing or later to be developed, that perform substantially the same function or achieve substantially the same result as the corresponding embodiments described herein may be utilized. Accordingly, the appended claims are intended to include within their scope such processes, machines, manufacture, compositions of matter, means, methods, or steps.
As can be understood, the examples described above and illustrated are intended to be exemplary only.
Glossary
Issuer/Identity Provider is a party trusted by the prover and the verifier, that attests to the prover's attributes.
Prover/Client the party in contact with the issuer as well as the verifier, attempting to prove properties in zero knowledge.
Verifier the party demanding knowledge of certain properties about the prover.
Attribute a proposition about the prover to which the issuer attests.
Property a proposition about the prover whose truth the verifier wants to ascertain. It may be identical to one of the prover's attributes or logically implied by one or many of them.
Proof Request a formalized message from the verifier specifying the property of which it demands knowledge, containing additional information about the protocols supported, etc.
Proof a message from the client to the verifier, providing cryptographic assurance as to the issuer's attestation to the client's satisfaction of the property specified in the proof request.
Templates for Messages
The protocols message can be communicated as JSON-encoded.
Proof Request
What follows is a sample proof request.
The script “alcohol-consumption.stk” can contain the following:
return expiration_date>expiration_threshold && birth_date<birth_threshold;
Instead of providing a URL, the verifier can provide the source code or bytecode to a straight talk script.
A standard bytecode language for straight talk shall also be provided.
Proof
The following can be a response to the request above:
Another example proof, based on secure enclaves, is provided below:
This application is a non-provisional of, and claims all benefit including priority to, U.S. Application No. 63/421,164, entitled SYSTEM AND METHOD FOR ZERO-KNOWLEDGE FACIAL RECOGNITION, filed 31 Oct. 2022. The contents of the above application(s) are hereby incorporated by reference in their entireties.
Number | Date | Country | |
---|---|---|---|
63421164 | Oct 2022 | US |