Secure multi-party computation (MPC) is a cryptographic technology that enables two or more parties to jointly compute a function over a set of inputs while keeping the inputs private. For example, consider a simple scenario in which three parties A, B, and C have respective private inputs x, y, and z and would like to obtain the sum of these inputs without revealing x, y, and z to each other. In an ideal world, A, B, and C could each submit their input to an incorruptible and perfectly trustworthy third party . could then compute x+y+z on their behalf and output the resulting sum, thereby preventing A, B, and C from learning each other's inputs. In the real world, however, incorruptible and perfectly trustworthy third party does not exist. MPC addresses this problem by providing protocols which allow A, B, and C to collectively evaluate, via a series of message exchanges, x+y+z in a manner that achieves the same input privacy as the ideal world model, without relying on .
A common approach for designing an MPC protocol involves having each party distribute cryptographic shares of its input to the other parties using a threshold secret sharing scheme such as Shamir's secret sharing. Through this process, each party receives partial information regarding the other parties' inputs, which is sufficient for the party to carry out the protocol's computation but is insufficient (at least in isolation) for the party to learn the inputs. In an MPC protocol executed by n parties, the sharing of an input x via a secret sharing scheme with threshold t is secure in the sense that up to t of the n parties may collude and disclose their shares to each other without learning anything regarding x. Such colluding parties are referred to as “corrupted” and it is assumed that all corrupted parties follow a single strategy dictated by a “semi-honest” or “malicious” adversary. If the adversary is semi-honest, the corrupted parties will correctly follow the MPC protocol specification and will only try to learn private information by collecting data arising out of normal protocol execution (e.g., the internal state of each corrupted party, the transcript of messages received, etc.). If the adversary is malicious, the corrupted parties may arbitrarily deviate from the protocol specification in order to learn private information. MPC protocols that can withstand attacks from semi-honest adversaries or malicious adversaries are said to “be semi-honest-secure”/“have semi-honest security” or “be malicious-secure”/“have malicious security” respectively.
In recent years, a number of secret sharing-based MPC protocols with malicious security have been proposed that are close in computational and/or communication cost to semi-honest-secure MPC protocols. However, these proposed protocols suffer from at least two deficiencies. First, they are “secure with abort” (or in other words, are “non-robust”), which means that the adversary can cause the protocol to abort prior to completion, thereby preventing uncorrupted (i.e., honest) parties from obtaining the protocol's output. Second, the proposed protocols only allow parties that participate in protocol computation (referred to herein as “servers”) to submit inputs. This limitation—which arises out of the protocols' inability to distinguish between corrupted clients and corrupted servers at the time of verifying input sharings—is problematic because many MPC applications (e.g., secure auctions, electronic voting, etc.) are designed to receive inputs from a large group of “clients” that do not take part in computation.
In the following description, for purposes of explanation, numerous examples and details are set forth in order to provide an understanding of various embodiments. It will be evident, however, to one skilled in the art that certain embodiments can be practiced without some of these details or can be practiced with modifications or equivalents thereof.
The present disclosure is directed to techniques for implementing robust input verification in a secret sharing-based malicious-secure MPC protocol that is capable of receiving inputs from N clients (and is executed by n servers). As used herein, “input verification” refers to the task of verifying that the shares of an input provided by a client to the servers via a threshold secret sharing scheme are valid, which means that the shares conform to the secret sharing scheme's rules/restrictions.
The techniques of the present disclosure rely on two assumptions: first, at most t<n/4 servers are corrupted (and thus at least 3t+1 servers are honest), and second, at most (1−ρ)N clients are corrupted for 0<ρ≤1 (and thus at least ρN clients are honest). With these assumptions in place, the techniques ensure that (1) the adversary controlling the corrupted clients and corrupted servers cannot abort execution of the protocol, (2) a corrupted client cannot disqualify an honest server from participating in the protocol, and (3) a corrupted server cannot “censor” an honest client (i.e., prevent that client's input from being included in the protocol's computation). The foregoing and other aspects are explained in further detail in the sections that follow.
To provide context for the embodiments described herein,
During input phase 106, each client Ck submits an input xk to servers S1, . . . , Sn by sharing it using a threshold secret sharing scheme, which means client Ck submits a cryptographic share xqk of xk to each server Sq for q=1, . . . , n. If Shamir's secret sharing is used, input xk is an element of a finite field and each share xqk=f(q) where f is t-degree polynomial with f(0)=xk and where t is the maximum number of corrupted servers. It is well known that such a t-degree polynomial (or “t-polynomial”) theoretically hides the secret encoded at f(0) from any subset of at most t shareholders. Vector {right arrow over (xk)}=(x1k, . . . , xnk), which represents the shares of input xk provided to servers S1, . . . , Sn, is said to be “consistent with,” or a “consistent sharing” of, xk if the shares of all honest servers agree with polynomial f (i.e., reside on the polynomial, such that xqk =f(q) for every honest server Sq). By a corollary, there are at most t servers that disagree with the polynomial in this scenario. Further, vector {right arrow over (xk)} is said to be a “perfectly consistent with,” or a “perfectly consistent sharing” of, xk if the shares of all servers (even corrupted ones) agree with polynomial f. For the remainder of this disclosure, [x] is used to denote a consistent sharing of a given input/secret x.
Once all clients have provided their input shares, servers S1, . . . , Sn verify (via input verification phase 108) that the shares of each input are valid in order to detect cheating by corrupted clients/servers. The servers perform this verification in a manner that does not reveal the shares to each other, thereby maintaining the privacy of inputs x1, . . . , xN. There are several ways in which input verification phase 108 can be generally structured based on, e.g., the specific threshold secret sharing scheme used, degree of complexity tolerated, and so on. If Shamir's secret sharing is employed as mentioned above, one approach is for each server Sq to obtain a share rq of a secret random value r where the sharing of r is guaranteed to be consistent. For example, shares r1, . . . , rn can be computed offline and seeded on respective servers S1, . . . , Sn using a secure method. Each server Sq can then (1) add rq to its share xqk of a client input xk to generate aqk, (2) broadcast aqk to every other server, and (3) upon receiving the other servers' a values, invoke a Reconstruct([a]) procedure which attempts to interpolate a t-polynomial f such that f(i)=aik for i=1, . . . n.
If the Reconstruct procedure is successful, that means {right arrow over (xk)} is a consistent sharing of input xk and the shares which agree with interpolated polynomial f are valid shares of xk. Conversely, the shares which do not agree with f in this scenario are invalid shares of xk and are referred to as “bad points” (because they do not reside on the polynomial). In general, the Reconstruct procedure will succeed in interpolating f if there are at most
disagreeing shares (or stated another way, if there are at most
shares which do not reside on f).
Finally, during computation phase 110, servers S1, . . . , Sn compute the function of the MPC protocol over the input shares verified during input verification phase 108 and obtain a protocol output. This specific nature of this computation, which is beyond the scope of this disclosure, will vary based on the characteristics of the function and the protocol implementation.
As noted the Background section, there are a number of secret sharing-based malicious-secure MPC protocols in the art that offer relatively high efficiency, but are non-robust (i.e., can be aborted by the adversary) and are incapable of receiving inputs from non-participant clients. A key reason for the latter limitation is that, when a client input is shared among servers and cheating (i.e., an invalid share) is detected during input verification phase 108, these existing protocols cannot easily determine whether the cheating was carried out by a corrupted client or a corrupted server, which raises significant problems. For example, if the cheating was performed by a corrupted client but it is erroneously blamed on an honest server, that server will be excluded from further participation in the protocol, which means that corrupted clients can arbitrarily disqualify honest servers. If such an attack is repeated n−t times then only corrupted servers will remain for carrying out computation phase 110, which is clearly no longer secure (as corrupted servers may collude and reveal their shares to each other). Conversely, if the cheating was performed by a corrupted server but it is erroneously blamed on an honest client, that client's input will be ignored during the computation phase, which means that corrupted servers can arbitrarily censor honest clients.
To address the foregoing limitations, embodiments of the present disclosure provide a novel input verification protocol—usable by servers S1, . . . , Sn during input verification phase 108 of
For example, as described in further detail below, the novel protocol can initialize a counter ctrq to 0 for each server Sq. The protocol can then increment this counter each time Sq is found to hold an invalid share/bad point for a consistent input sharing (per, e.g., Shamir's Reconstruct procedure). If ctrq exceeds (1−ρ)N at any juncture of input verification phase 108, the protocol can definitively conclude that Sq is corrupted and can eliminate it from participating in computation phase 110. For input sharings that are found to be consistent but include some number of bad points held by servers with counters less than (1=ρ)N, the novel protocol can request that the clients which originated those sharings resubmit them via a stronger (and more expensive) method to definitively determine whether the clients or servers are corrupted.
With regard to property (1) above (i.e., robustness), this is achieved via a combination of properties (2) and (3) and the assumption that at most t<n/4 servers are corrupted. As mentioned previously, with Shamir's secret sharing, the Reconstruct procedure will succeed in interpolating a t-polynomial f for a sharing of a given input/secret x if there are at most
disagreeing shares/servers. When n≥4t+1, this procedure succeeds if there are at most 1.5t bad points (i.e., invalid shares/disagreeing servers). Accordingly, the adversary cannot abort the protocol by, e.g., causing some combination of corrupted servers/clients to broadcast/submit invalid shares (or nothing at all) for a given input xk to the honest servers, because (A) if at least 3t+1 servers (i.e., the honest servers) receive valid shares, Reconstruct[ak] will succeed (and the protocol can proceed to identify corrupted servers as above), and (B) if Reconstruct[ak] fails due to more than n/4 bad points, client Ck can be definitely identified as corrupted. In either case, the protocol can move forward and thus avoid being halted.
Section (3) below presents a “non-batched” implementation of the novel input verification protocol in which verification is performed on a per-client basis (i.e., sequentially on each input sharing submitted by clients C1, . . . , CN). And section (4) below presents a more efficient “batched” implementation of the novel input verification protocol that optimistically assumes all clients and servers are honest and thus attempts to verify the input sharings of all clients C1, . . . , CN together (i.e., as an aggregation). This is possible because, under Shamir's secret sharing, if two or more sharings are combined via a linear operator and those individual sharings are perfectly consistent, the combination is also guaranteed to be perfectly consistent. If the verification of the aggregated sharing of C1, . . . , CN succeeds (i.e., the aggregated sharing is perfectly consistent), servers S1, . . . , Sn can immediately move on to computation phase 110. Alternatively if the verification of the aggregated sharing fails (i.e., the aggregated sharing comprises one or more invalid shares), the servers can perform a binary search to find the invalid shares (i.e., verify aggregations of the first and second halves of the shares), and this can continue recursively until the invalid shares are identified.
Starting with blocks 202 and 204, server Si can receive shares xi1, . . . , xiN of inputs x1, . . . , xN from clients C1, . . . , CN and can initialize a counter ctrq for every server Sq (where q∈[n]) to zero.
At block 206, server Si can initialize a set of corrupted servers {tilde over (S)}, a set of corrupted clients {tilde over (C)}, a set of blocked clients Ĉ, and a set of honest clients H to null (0). It is important to note that during execution of the protocol all honest servers agree on these three sets (so for example, there is no situation of a conflict in which honest server Si decides that server Sc is corrupted while honest server Sj decides that server Sc′ (c≠c′) is corrupted). Server Si can then enter a loop for each input xk provided by a given client Ck for k=1, . . . , N (block 208).
Within the loop, server Si can obtain a share rik of a sharing of a secret random value rk where the sharing is guaranteed to be consistent (block 210). As mentioned previously, in certain embodiments shares r1, . . . , rn can be computed offline and seeded on respective servers S1, . . . , Sn using a secure method.
Upon obtaining rik , server Si can compute aik=rik+xik (block 212), securely broadcast aik to the other servers (block 214), and receive share aqk from every other server Sq, thereby allowing it to build a sharing {right arrow over (ak)} (block 216).
At blocks 218 and 220, server Si can invoke Reconstruct([ak]), which attempts to find/interpolate a t-polynomial f in which f(q)=aqk for at least 3t+1 shares of {right arrow over (ak)}, and check the outcome. If the Reconstruct procedure fails (i.e., no polynomial is found), server Si can conclude that client Ck is corrupted and can add client Ck to the set of corrupted clients {tilde over (C)} (block 222).
If the Reconstruct procedure succeeds but there is a set of servers S′⊂S whose shares disagree with reconstructed polynomial f (i.e., the sharing is consistent), server Si can increment counter ctrq for each server Sq∈S′ (block 224). As part of this step, if there is a server Sq whose counter ctrq has exceeded (1−ρ)N, server Si can add server Sq to the list of corrupted servers {tilde over (S)}.
And if the Reconstruct procedure succeeds and there are no bad points (i.e., the sharing is perfectly consistent), server Si can add client Ck to the set of honest clients H (block 226). Server Si can then reach the end of the current loop iteration (block 228) and return to block 208 to process the next input xk.
Upon processing all client inputs, server Si can populate the set of blocked clients Ĉ with those clients that are not in {tilde over (C)} or H (block 230). Finally, at block 232, server Si can output the set of blocked clients Ĉ, the set of corrupted clients {tilde over (C)}, and the set of corrupted servers {tilde over (S)} and flowchart 200 can end. Generally speaking, the inputs of corrupted clients (i.e., clients in {tilde over (C)}) will be disregarded by the servers during computation phase 110 and the corrupted servers (i.e., servers in {tilde over (S)}) will be blocked from participating in phase 110.
Further, although not specifically shown in
This batched approach results in at least two efficiencies: first, if all clients and servers are honest (which will often be the case in many applications), only a single verification round is needed. This is in contrast to the non-batched approach presented in
It should be noted that the batched approach assumes that (1−ρ) is relatively low (i.e., there are relatively few corrupted clients). In cases where (1−ρ) approaches 1, it may be preferable from a performance perspective to perform per-client verification per flowchart 200 of
Turning now to
Upon completing the loop (block 312), server Si can securely broadcast aik for k=1, . . . , N to the other servers (block 314) and receive aqk for k=1, . . . , N from each other server Sq (block 316), thereby allowing server Si to build sharings {right arrow over (ak)} for k=1, . . . , N.
At blocks 318 and 320, server Si can initialize a binary tree T by ordering the a sharings at the leaves of T (such that {right arrow over (ak)} resides at the k-th leaf) and recursively storing {right arrow over (aL)}+{right arrow over (aR)} at an internal node where {right arrow over (aL)} and {right arrow over (aR)} are the left and right children of that internal node (resulting in a tree height of log N where the root is at level 0 and the leaves at level log N) (block 320). Server Si can further mark the root of T and all nodes below the root as unresolved (block 322).
Server Si can then initialize a set of corrupted servers {tilde over (S)}, a set of corrupted clients {tilde over (C)}, and a set of blocked clients Ĉ to null (0) (block 324) and flowchart 300 can proceed to
At block 326 of
If the Reconstruct procedure succeeds but there is a set of servers S′⊂S whose shares disagree with the reconstructed polynomial, server Si can increment counter ctrq for each server Sq∈′ (block 338). As part of this step, if there is a server Sq whose counter ctrq has exceeded (1−ρ)N, server Si can add server Sq to the list of corrupted servers {tilde over (S)} (which means that from this point, Sq can be safely ignored by all honest servers).
And if the Reconstruct procedure fails (i.e., no polynomial is found) and node m is the k-th leaf of T, server Si can conclude that client Ck is corrupted and can add client Ck to the set of corrupted clients {tilde over (C)} (block 340). In this case, because Ck is considered “corrupted” rather than “blocked,” it is not entitled to the opportunity of using the stronger input scheme mentioned above.
Subsequently to blocks 336-340, server Si can check whether all nodes at level l are now marked as resolved (block 342); if so, server Si can exit the node loop. Otherwise, server Si can reach the end of the current node loop iteration (block 344) and return to block 330 in order to process the next node m at level l. Further, upon completing the node loop, server Si can reach the end of the current level loop iteration (block 346) and return to block 326 in order to process the next level l of binary tree T.
Upon completing the level loop, server Si can, for each leaf of T comprising a sharing {right arrow over (ak)}that is still marked as unresolved, add client Ck (i.e., the client that provided the input xk used to generate {right arrow over (ak)}) to the set of blocked clients Ĉ (block 348). Finally, at block 350, server Si can output the set of blocked clients Ĉ, the set of corrupted clients {tilde over (C)}, and the set of corrupted servers {tilde over (S)} and flowchart 300 can end. As with flowchart 200 of
Certain embodiments described herein can employ various computer-implemented operations involving data stored in computer systems. For example, these operations can require physical manipulation of physical quantities—usually, though not necessarily, these quantities take the form of electrical or magnetic signals, where they (or representations of them) are capable of being stored, transferred, combined, compared, or otherwise manipulated. Such manipulations are often referred to in terms such as producing, identifying, determining, comparing, etc. Any operations described herein that form part of one or more embodiments can be useful machine operations.
Further, one or more embodiments can relate to a device or an apparatus for performing the foregoing operations. The apparatus can be specially constructed for specific required purposes, or it can be a generic computer system comprising one or more general purpose processors (e.g., Intel or AMD ×86 processors) selectively activated or configured by program code stored in the computer system. In particular, various generic computer systems may be used with computer programs written in accordance with the teachings herein, or it may be more convenient to construct a more specialized apparatus to perform the required operations. The various embodiments described herein can be practiced with other computer system configurations including handheld devices, microprocessor systems, microprocessor-based or programmable consumer electronics, minicomputers, mainframe computers, and the like.
Yet further, one or more embodiments can be implemented as one or more computer programs or as one or more computer program modules embodied in one or more non-transitory computer readable storage media. The term non-transitory computer readable storage medium refers to any data storage device that can store data which can thereafter be input to a computer system. The non-transitory computer readable media may be based on any existing or subsequently developed technology for embodying computer programs in a manner that enables them to be read by a computer system. Examples of non-transitory computer readable media include a hard drive, network attached storage (NAS), read-only memory, random-access memory, flash-based nonvolatile memory (e.g., a flash memory card or a solid state disk), a CD (Compact Disc) (e.g., CD-ROM, CD-R, CD-RW, etc.), a DVD (Digital Versatile Disc), a magnetic tape, and other optical and non-optical data storage devices. The non-transitory computer readable media can also be distributed over a network coupled computer system so that the computer readable code is stored and executed in a distributed fashion.
Finally, boundaries between various components, operations, and data stores are somewhat arbitrary, and particular operations are illustrated in the context of specific illustrative configurations. Other allocations of functionality are envisioned and may fall within the scope of the invention(s). In general, structures and functionality presented as separate components in exemplary configurations can be implemented as a combined structure or component. Similarly, structures and functionality presented as a single component can be implemented as separate components.
As used in the description herein and throughout the claims that follow, “a,” “an,” and “the” includes plural references unless the context clearly dictates otherwise. Also, as used in the description herein and throughout the claims that follow, the meaning of “in” includes “in” and “on” unless the context clearly dictates otherwise.
The above description illustrates various embodiments along with examples of how aspects of particular embodiments may be implemented. These examples and embodiments should not be deemed to be the only embodiments and are presented to illustrate the flexibility and advantages of particular embodiments as defined by the following claims. Other arrangements, embodiments, implementations and equivalents can be employed without departing from the scope hereof as defined by the claims.
The present application is a continuation of U.S. patent application Ser. No. 17/010,526, filed Sep. 2, 2020 and entitled “ROBUST INPUT VERIFICATION FOR SECURE MULTI-PARTY COMPUTATION (MPC) WITH CLIENTS,” the entire contents of which are incorporated herein by reference for all purposes.
Number | Date | Country | |
---|---|---|---|
Parent | 17010526 | Sep 2020 | US |
Child | 17966497 | US |