This application relates to the field of electronic transactions and more particularly to the field of securing the contents of sequences of transaction blocks for electronic transactions.
A blockchain consists of an augmentable sequence of blocks: 1, 2, . . . , wherein each block consists of a number of transactions, the hash of the previous block, and other data—e.g., the number of the block, time information, etc. Useful properties of a blockchain are that every user in the system eventually learns the content of every block, no one can alter the content or the order of the blocks, and any valid transaction will eventually eneter a block in the chain.
Users can digitally sign, and thus each user possesses at least one public key and a corresponding secret key. In a blockchain, in general, one knows the public keys, but not necessarily the user who owns it. Accordingly, we may identify a public key with its owner.
A blockchain works by propagating messages (e.g., blocks, transactions, etc.) Typically, but not exclusively, message are propagated by gossiping them in a a peer-to-peer fashion, or via relays.
Several blockchain systems require a block to be certified by the digital signatures of sufficiently many users in the system. In some systems such certifying users belong to a fixed set of users. In some other systems, they belong to a dynamically changing set. This is preferable, because an adversary would have a harder time to corrupt a dynamically changing set, particularly if the set is not only dynamic, but unpredictable as well.
A particularly effective way of selecting a set of users in a verifiable but unpredictable way is the cryptographic sortition employed by Algorand. Here, a user i belongs to a set of users empowered to act in some step s during the production of block number r based on the result of of a computation that i performs via a secret key of his, using inputs s and r, and possibly other inputs and other data (e.g., the fact that the user has joined the system at least k blocks before block r, for some given integer k). For instance, i's computation may involve i's digital signature, sir,s, of such inputs, hashing sir,s, and checking whether the hash is less than a given target t. (In fact, like any other string, a hashed value can in interpreted in some standard way as a number.) If this is the case, then σir,s=sir,s is defined to be the credential of i for step s about block r. Such credential proves to anyone that i is indeed entitled to produce a (preferably signed) message mir,s, his message for step s in round r, that is, in the process aimed at producing block r. In fact i's digital signatures can be checked by anyone, and anyone can hash a given value, and then check whether the result is indeed smaller (or equal to) a given number. Accordingly, i may propagate both sir,s and mir,s. In Algorand, the credential σir,s is computed relative to a long term key, while the signature of mir,s is computed using an ephemeral key, which i only uses to autheticate only one message: his message mir,s. In fact, an honest i erases such ephemeral secret key as soon as he uses it to sign Mir,s.
Using ephemeral keys that are erased after use prevents an adversary who corrupts i, after he propagates mir,s, from forcing i to sign a different message about step s of round r. The system, however, relies on a proper procedure to guarantee to others which is a user i's ephemeral key devoted to authenticate his message for step s of round r. Such guarantee may require additional data to be stored and/or transmitted. It therefore would be nice to lessen this requirement. Particularly, for certifying the blocks of a blockchain.
It is thus desirable to provide public ledgers and electronic money systems that do not need to trust a central authority, and do not suffer from the inefficiencies and insecurities of known decentralized approaches.
According to the system described herein, in a transaction system in which transactions are organized in blocks, a new block Br of valid transactions is constructed, relative to a sequence of prior blocks B0, B1, . . . , Br−1, by having an entity determine a quantity Q from the prior blocks, having the entity use a secret key in order to compute a string S uniquely associated to Q and the entity, having the entity compute from S a quantity T that is S itself, a function of S, and/or hash value of S, having the entity determine whether T possesses a given property, and, if T possesses the given property, having the entity digitally sign Br and make available S and a digitally signed version of Br, wherein the entity is selected based on a random value that varies according to a digital signature of Br. The secret key may be a secret signing key corresponding to a public key of the entity and S is a digital signature of Q by the entity. T may be a number and satisfies the property if T is less than a given number p. S may be made available by making S deducible from Br. Each user may have a balance in the transaction system and p may vary for each user according to the balance of each user. The random value may be a hash of the digital signature of the entity. The entity may be selected if the random value is below a threshold that is chosen to cause a minimum number of entities of the transaction system to be able to digitally sign Br.
According further to the system described herein, selecting a subset of users in a blockchain system to verify a new block Br relative to a sequence of prior blocks B0, B1, . . . , Br−1, includes causing at least some of the users to digitally sign the new block Br together with other information to produce a digital signature, causing at least some of the users to determine a hash value of the digital signature, causing at least some of the users to compare the hash value to a pre-determined threshold, and causing the subset of the users to make available the digital signature to verify the new block Br in response to the hash value being below a pre-determined threshold for each of the subset of the users. A particular one of the users may digitally sign the new block Br only if the particular one of the users verifies information provided in the new block Br. The predetermined value may be chosen to cause the subset of the users to contain a minimum number of the users. The blockchain system may be used in a transaction system in which transactions are organized in blocks.
According further to the system described herein, a blockchain for causes certification of at least one data string m by having a set S of users verify whether m enjoys at least some given property, having users digitally sign m, in response to verification of m by the users, and having the users make available the digital signatures of m that are credentialed signatures of m. The digital signature of m may be credentialed if the digital signature satisfies a given additional property. The digital signature of m may satisfy the given additional property if a hash of the digital signature is smaller than a given target number. The data string m may be certified by at least a given number of credentialed signatures of m.
According further to the system described herein, computer software, provided in a non-transitory computer-readable medium, includes executable code that implements any of the steps described herein.
The present invention dispenses with ephemeral keys for certifying blocks. Typically, a new block is first prepared (e.g., proposed and or agreed upon by at least some users) and then it is certified. We are agnostic about how a block B is prepared: it may be prepared in one or multiple steps, even with the use of ephemeral keys. However, we wish to certify it without relying on ephemeral keys. The certification of a block B guarantees that certain valuable properties apply to the block. A typical main property is to enable a user, even a user who has not participated to or observed the preparation of a block B, to ascertain that B has been added to the blockchain, or even that B is the rth block in the blockchain. Another valuable property (often referred to as finalization) guarantees that B will not disappear from the blockchain, due to a soft fork, even in the presence of a partition of the communication network on which the blockchain protocol is executed.
Assume that a block B has been prepared, in any fashion and in any number of steps. Realizing that a block has been properly prepared requires time and effort, and the verification of various pieces of evidence. A certificate of B consists of a given number of users' digital signatures with valid credentials. Such a certificate of B vouches that the users who have produced such signatures have participated to or observed the preparation of B. At least, it vouches that, if one of the digital signatures of the certificate has been produced by an honest user, then that user has checked that B has been properly prepared.
In the inventive system, multiple users i (even all users), who have seen evidence that B has been properly prepared, digitally sign B.1 These signatures may be relative to long-term (as opposed to ephemeral) keys. Such signatures, however, count for the certification of B if they satisfy a given property P. In the preferred embodiment, a digital signature of i of B, SIGi(B), possesses the given property if (a) its hash (interpreted as a number) is smaller than a given target t, and, preferably, if i has joined the blockchain at least k blocks before B. Note that everyone can verify i's digital signature of B, compute its hash, and check that the result is indeed no larger that t. In addition, any one can verify when i has joined the blockchain, and thus that he has joined the blockchain at least k blocks before. Such SIGi(B) may be considered a specialized credential of i for B as well as a credentialed signature. Thus, in the inventive system, credentials are linked to a specific block, rather than to a given step s in the production of the rth block. Accordingly, a user i may have a credential for a given block B, but not for another block B′. By contrast, for example, in Algorand a user with a proper credential for step s in round r, could sign anything he wanted in that step and round. A block certificate, therefore, consists of a given number n of credentialed signatures for B. Note that a block B may have more than one certificates, if there are more than n credentialed signatures of B. 1Digitally signing a quantity Q includes digitally singing an hashed version of Q, digitally signing Q with other data. Herein we assume that the digital signature is such that, for each message m, each user has a single signature of m, no matter how the public key might be chosen.
The efficiency of the inventive system derives from the fact that a proper SIGi(B) proves both that i certifies B and that i is entitled to certify B. In a traditional system, i would have first obtain a credential for the step s of round r in which he consents to certify B, and then certify B by a separate signature. Thus at least two signatures, rather than one, are needed and may need to be stored and/or transmitted as part of a certificate of B. In addition, if i's signature of B were ephemeral, one would also need some proof that the ephemeral key used was indeed the key that i needed to use just for step s and round r.
The security of the system is derived from a proper choice of the target t and the number n of signatures sufficient to certify a block. For instance, let p be the maximum percentage of malicious users in the system. Typically, malicious users are in a minority—e.g., p<⅓. Then t and n can be chosen so that, with sufficiently high probability, (a) for any possible block value B′, there are n or more credentialed signatures of honest users to form a certificate for B′ and (b) in any certificate of B′, at least one credentialed signature belongs to an honest user.
Also, the set of honest users who are credentialed to certify a block B is sufficiently random that an adversary cannot predict who they are and corrupt them before they certify the block. On the other hand, after an honest user i certifies a block B and propagates SIGi(B), the adversary has no advantage in corrupting i. Indeed, SIGi(B) is already being virally propagated throughout the network, and the adversary cannot stop this propagation process. Second, if, after corrupting i, the adversary forces i to digitally sign a different block B′, then SIGi(B′) may not have a hash that is smaller than t, and to have a fair probability to find n digital signatures of B′, the adversary would have to corrupt more than a fraction p of the users.
As part of the inventive system, a user i may not only have a single credential for B (or none), but also a credential with a weight (essentially a credential associated to a number of votes). Indeed, the weight of i's credentials for B may depend on how much money i has in the system. Indeed, rather that having a single t for all users, each user i may have his own target ti that is higher the higher i's amount of money is. And the weight of i's credential for B may depend on how small the hash of SIGi(B) is relative to ti. For simplicity, but without limitation intended, we shall continue to describe our system treating a user i with a weight-m credential for B as m users, each having a (weight-1) credential for B.
So far, we have discussed certifying a block B via a sufficient number of credentialed signatures of B. More generally, however, the inventive system applies to blockchains in which at least a given message m is certified by a sufficient number of credentialed digital signatures of m. Such a message m may not be a block, but a more general data string. Accordingly, such certification of m may guarantee that different properties apply to m than those applicable or desirable for blocks. For example, but without any limitation intended, the property that m has been approved by a sufficient fraction of a set S of users in the system, or by at least one honest user in S. Indeed, the users in S who have a credentialed signature of m may form a sufficiently randomly selected sample of the users in S. Thus, the fact that a sufficient number of credentialed signatures of m has been produced indicates that, with sufficient high probability, a given fraction of users in S or at least one honest user in S approves m.
Below, after quickly recalling the traditional Algorand system, we provide an example of the preferred embodiment, without any limitation intended, based on Algorand.
Embodiments of the system described herein are explained in more details in accordance with the figures of the drawings, which are briefly described as follows.
The system described herein provides a mechanism for distributing transaction verification and propagation so that no entity is solely responsible for performing calculations to verify and/or propagate transaction information. Instead, each of the participating entities shares in the calculations that are performed to propagate transaction in a verifiable and reliable manner.
Referring to
The description herein focuses on transactions that are payments and on describing the system herein as a money platform. Those skilled in the art will realize that the system described herein can handle all kinds of transactions as well.
The system described herein has a very flexible design and can be implemented in various, but related, ways. We illustrate its flexibility by detailing two possible embodiments of its general design. From them, those skilled in the art can appreciate how to derive all kinds of other implementations as well.
To facilitate understanding the invention, and allow to internal cross reference of its various parts, we organize its presentation in numbered and titled sections. The first sections are common to both of the detailed embodiments.
Money is becoming increasingly virtual. It has been estimated that about 80% of United States dollars today only exist as ledger entries. Other financial instruments are following suit.
In an ideal world, in which we could count on a universally trusted central entity, immune to all possible cyber attacks, money and other financial transactions could be solely electronic. Unfortunately, we do not live in such a world. Accordingly, decentralized cryptocurrencies, such as Bitcoin, and “smart contract” systems, such as Ethereum, have been proposed. At the heart of these systems is a shared ledger that reliably records a sequence of transactions, as varied as payments and contracts, in a tamperproof way. The technology of choice to guarantee such tamperproofness is the blockchain. Blockchains are behind applications such as cryptocurrencies, financial applications, and the Internet of Things. Several techniques to manage blockchain-based ledgers have been proposed: proof of work, proof of stake, practical Byzantine fault-tolerance, or some combination.
Currently, however, ledgers can be inefficient to manage. For example, Bitcoin's proof-of-work approach requires a vast amount of computation, is wasteful and scales poorly. In addition, it de facto concentrates power in very few hands.
We therefore wish to put forward a new method to implement a public ledger that offers the convenience and efficiency of a centralized system run by a trusted and inviolable authority, without the inefficiencies and weaknesses of current decentralized implementations. We call our approach Algorand, because we use algorithmic randomness to select, based on the ledger constructed so far, a set of verifiers who are in charge of constructing the next block of valid transactions. Naturally, we ensure that such selections are provably immune from manipulations and unpredictable until the last minute, but also that they ultimately are universally clear.
Algorand's approach is quite democratic, in the sense that neither in principle nor de facto it creates different classes of users (as “miners” and “ordinary users” in Bitcoin). In Algorand “all power resides with the set of all users”.
One notable property of Algorand is that its transaction history may fork only with very small probability (e.g., one in a trillion, that is, or even 10−18). Algorand can also address some legal and political concerns.
The Algorand approach applies to blockchains and, more generally, to any method of generating a tamperproof sequence of blocks. We actually put forward a new method—alternative to, and more efficient than, blockchains—that may be of independent interest.
Bitcoin is a very ingenious system and has inspired a great amount of subsequent research. Yet, it is also problematic. Let us summarize its underlying assumption and technical problems—which are actually shared by essentially all cryptocurrencies that, like Bitcoin, are based on proof-of-work.
For this summary, it suffices to recall that, in Bitcoin, a user may own multiple public keys of a digital signature scheme, that money is associated with public keys, and that a payment is a digital signature that transfers some amount of money from one public key to another. Essentially, Bitcoin organizes all processed payments in a chain of blocks, B1, B2, . . . , each consisting of multiple payments, such that, all payments of B1, taken in any order, followed by those of B2, in any order, etc., constitute a sequence of valid payments. Each block is generated, on average, every 10 minutes.
This sequence of blocks is a chain, because it is structured so as to ensure that any change, even in a single block, percolates into all subsequent blocks, making it easier to spot any alteration of the payment history. (As we shall see, this is achieved by including in each block a cryptographic hash of the previous one.) Such block structure is referred to as a blockchain.
Bitcoin assumes that no malicious entity (nor a coalition of coordinated malicious entities) controls the majority of the computational power devoted to block generation. Such an entity, in fact, would be able to modify the blockchain, and thus re-write the payment history, as it pleases. In particular, it could make a payment , obtain the benefits paid for, and then “erase” any trace of .
Bitcoin's proof-of-work approach to block generation requires an extraordinary amount of computation. Currently, with just a few hundred thousands public keys in the system, the top 500 most powerful supercomputers can only muster a mere 12.8% percent of the total computational power required from the Bitcoin players. This amount of computation would greatly increase, should significantly more users join the system.
Today, due to the exorbitant amount of computation required, a user, trying to generate a new block using an ordinary desktop (let alone a cell phone), expects to lose money. Indeed, for computing a new block with an ordinary computer, the expected cost of the necessary electricity to power the computation exceeds the expected reward. Only using pools of specially built computers (that do nothing other than “mine new blocks”), one might expect to make a profit by generating new blocks. Accordingly, today there are, de facto, two disjoint classes of users: ordinary users, who only make payments, and specialized mining pools, that only search for new blocks.
It should therefore not be a surprise that, as of recently, the total computing power for block generation lies within just five pools. In such conditions, the assumption that a majority of the computational power is honest becomes less credible.
In Bitcoin, the blockchain is not necessarily unique. Indeed its latest portion often forks: the blockchain may be—say—B1, . . . , Bk, Bk+1, Bk+2, according to one user, and B1, . . . , Bk, B″k+1, B″k+2, B″k+3 according another user. Only after several blocks have been added to the chain, can one be reasonably sure that the first k+3 blocks will be the same for all users. Thus, one cannot rely right away on the payments contained in the last block of the chain. It is more prudent to wait and see whether the block becomes sufficiently deep in the blockchain and thus sufficiently stable.
Separately, law-enforcement and monetary-policy concerns have also been raised about Bitcoin.2 2The (pseudo) anonymity offered by Bitcoin payments may be misused for money laundering and/or the financing of criminal individuals or terrorist organizations. Traditional banknotes or gold bars, that in principle offer perfect anonymity, should pose the same challenge, but the physicality of these currencies substantially slows down money transfers, so as to permit some degree of monitoring by law-enforcement agencies. The ability to “print money” is one of the very basic powers of a nation state. In principle, therefore, the massive adoption of an independently floating currency may curtail this power. Currently, however, Bitcoin is far from being a threat to governmental monetary policies, and, due to its scalability problems, may never be.
Setting Algorand works in a very tough setting. Briefly,
1. A N
Algorand leverages this binary BA protocol to reach agreement, in our different communication model, on each new block. The agreed upon block is then certified, via a prescribed number of digital signature of the proper verifiers, and propagated through the network.
2. S
3. T
To meet this challenge, we purposely construct, and continually update, a separate and carefully defined quantity, Qr, which provably is, not only unpredictable, but also not influentiable, by our powerful Adversary. We may refer to Qr as the rth seed, as it is from Qr that Algorand selects, via secret cryptographic sortition, all the users that will play a special role in the generation of the rth block. The seed Qr will be deducible from the block Br−1.
4. S
To prevent this problem, leaders (and actually verifiers too) secretly learn of their role, but can compute a proper credential, capable of proving to everyone that indeed have that role. When a user privately realizes that he is the leader for the next block, first he secretly assembles his own proposed new block, and then disseminates it (so that can be certified) together with his own credential. This way, though the Adversary will immediately realize who the leader of the next block is, and although he can corrupt him right away, it will be too late for the Adversary to influence the choice of a new block. Indeed, he cannot “call back” the leader's message no more than a powerful government can put back into the bottle a message virally spread by WikiLeaks.
As we shall see, we cannot guarantee leader uniqueness, nor that everyone is sure who the leader is, including the leader himself! But, in Algorand, unambiguous progress will be guaranteed.
5. P
Fortunately we'll prove that protocol BA*, executed by propagating messages in a peer-to-peer fashion, is player-replaceable. This novel requirement means that the protocol correctly and efficiently reaches consensus even if each of its step is executed by a totally new, and randomly and independently selected, set of players. Thus, with millions of users, each small set of players associated to a step of BA* most probably has empty intersection with the next set.
In addition, the sets of players of different steps of BA* will probably have totally different cardinalities. Furthermore, the members of each set do not know who the next set of players will be, and do not secretly pass any internal state.
The replaceable-player property is actually crucial to defeat the dynamic and very powerful Adversary we envisage. We believe that replaceable-player protocols will prove crucial in lots of contexts and applications. In particular, they will be crucial to execute securely small sub-protocols embedded in a larger universe of players with a dynamic adversary, who, being able to corrupt even a small fraction of the total players, has no difficulty in corrupting all the players in the smaller sub-protocol.
A honest user follows his prescribed instructions, which include being online and run the protocol. Since, Algorand has only modest computation and communication requirement, being online and running the protocol “in the background” is not a major sacrifice. Of course, a few “absences” among honest players, as those due to sudden loss of connectivity or the need of rebooting, are automatically tolerated (because we can always consider such few players to be temporarily malicious). Let us point out, however, that Algorand can be simply adapted so as to work in a new model, in which honest users to be offline most of the time. Our new model can be informally introduced as follows.
We shall rely on an efficiently computable cryptographic hash function, H, that maps arbitrarily long strings to binary strings of fixed length. Following a long tradition, we model H as a random oracle, essentially a function mapping each possible string s to a randomly and independently selected (and then fixed) binary string, H(s), of the chosen length.
In our described embodiments, H has 256-bit long outputs. Indeed, such length is short enough to make the system efficient and long enough to make the system secure. For instance, we want H to be collision-resilient. That is, it should be hard to find two different strings x and y such that H(x)=H(y). When H is a random oracle with 256-bit long outputs, finding any such pair of strings is indeed difficult. (Trying at random, and relying on the birthday paradox, would require 2256/2=2128 trials.)
Digital signatures allow users to to authenticate information to each other without sharing any sharing any secret keys. A digital signature scheme consists of three fast algorithms: a probabilistic key generator G, a signing algorithm S, and a verification algorithm V.
Given a security parameter k, a sufficiently high integer, a user i uses G to produce a pair of k-bit keys (i.e., strings): a “public” key pki and a matching “secret” signing key skc. Crucially, a public key does not “betray” its corresponding secret key. That is, even given knowledge of pki, no one other than i is able to compute ski in less than astronomical time.
User i uses ski to digitally sign messages. For each possible message (binary string) m, i first hashes m and then runs algorithm S on inputs H(m) and ski so as to produce the k-bit string
sigpk
3Since H is collision-resilient it is practically impossible that, by signing m one “accidentally signs” a different message m′.
The binary string sigpk
Everyone knowing pki can use it to verify the digital signatures produced by i. Specifically, on inputs (a) the public key pki of a player i, (b) a message m, and (c) a string s, that is, i's alleged digital signature of the message m, the verification algorithm V outputs either YES or NO.
The properties we require from a digital signature scheme are:
In general, a message m is not retrievable from its signature sigi(m). In order to virtually deal with digital signatures that satisfy the conceptually convenient “message retrievability” property (i.e., to guarantee that the signer and the message are easily computable from a signature, we define
SIGpk
We also consider digital signature schemes (G,S,V) satisfying the following additional property.
It is hard to find strings pk′, m, s, and s′ such that
s≠s′ and V(pk′,m,s)=V(pk′,m,s′)=1.
Algorand tries to mimic the following payment system, based on an idealized public ledger.
S
0=(pk1,a1), . . . ,(pkj,aj),
p=SIGpk(pk,pk′,a′,I,H()),
where I represents any additional information deemed useful but not sensitive (e.g., time information and a payment identifier), and I any additional information deemed sensitive (e.g., the reason for the payment, possibly the identities of the owners of pk and the pk′, and so on).
We refer to pk (or its owner) as the payer, to each pk′ (or its owner) as a payee, and to a′ as the amount of the payment p.
L=PAY1,PAY2, . . . ,
Unless otherwise specified, each public key (“key” for short) is long-term and relative to a digital signature scheme with the uniqueness property. A public key i joins the system when another public key j already in the system makes a payment to i.
For color, we personify keys. We refer to a key i as a “he”, say that i is honest, that i sends and receives messages, etc. User is a synonym for key. When we want to distinguish a key from the person to whom it belongs, we respectively use the term “digital key” and “owner”.
A system is permissionless, if a digital key is free to join at any time and an owner can own multiple digital keys; and its permissioned, otherwise.
Each object in Algorand has a unique representation. In particular, each set {(x, y, z, . . . ):x∈X, y∈Y, z∈Z, . . . } is ordered in a pre-specified manner: e.g., first lexicographically in x, then in y, etc.
There is no global clock: rather, each user has his own clock. User clocks need not be synchronized in any way. We assume, however, that they all have the same speed.
For instance, when it is 12 pm according to the clock of a user i, it may be 2:30 pm according to the clock of another user j, but when it will be 12:01 according to i's clock, it will 2:31 according to j's clock. That is, “one minute is the same (sufficiently, essentially the same) for every user”.
Algorand is organized in logical units, r=0, 1, . . . , called rounds.
We consistently use superscripts to indicate rounds. To indicate that a non-numerical quantity Q (e.g., a string, a public key, a set, a digital signature, etc.) refers to a round r, we simply write Qr. Only when Q is a genuine number (as opposed to a binary string interpretable as a number), do we write Q(r), so that the symbol r could not be interpreted as the exponent of Q.
At (the start of a) round r>0, the set of all public keys is PKr, and the system status is
S
r={(i,ai(r), . . . ):i∈PKr},
where ai(r) is the amount of money available to the public key i. Note that PKr is deducible from Sr, and that Sr may also specify other components for each public key i.
For round 0, PK0 is the set of initial public keys, and S0 is the initial status. Both PK0 and S0 are assumed to be common knowledge in the system. For simplicity, at the start of round r, so are PK1, . . . , PKr and S1, . . . , Sr.
In a round r, the system status transitions from Sr to Sr+1: symbolically,
Round r: Sr→Sr+1
In Algorand, the users continually make payments (and disseminate them in the way described in subsection 2.7). A payment of a user i∈PKr has the same format and semantics as in the Ideal System. Namely,
=SIGi(i,i′,a,I,H()).
Payment cg is individually valid at a round r (is a round-r payment, for short) if (1) its amount a is less than or equal to ai(r), and (2) it does not appear in any official payset PAYr′ for r′<r. (As explained below, the second condition means that has not already become effective.
A set of round-r payments of i is collectively valid if the sum of their amounts is at most ai(r).
A round-r payset is a set of round-r payments such that, for each user i, the payments of i in (possibly none) are collectively valid. The set of all round-r paysets is (r). A round-r payset is maximal if no superset of is a round-r payset.
We actually suggest that a payment also specifies a round ρ, =SIGi(ρ,i,i′,a,I,H()), and cannot be valid at any round outside [ρ,ρ+k], for some fixed non-negative integer k.4 4This simplifies checking whether p has become “effective” (i.e., it simplifies determining whether some payset PAYr contains . When k=0, if p=SIGi(r,i,i′,a,I,H()), and ∉PAYr, then i must re-submit .
For every round r, Algorand publicly selects (in a manner described later on) a single (possibly empty) payset, PAYr, the round's official payset. (Essentially, PAYr represents the round-r payments that have “actually” happened.)
As in the Ideal System (and Bitcoin), (1) the only way for a new user j to enter the system is to be the recipient of a payment belonging to the official payset PAYr of a given round r; and (2) PAY determines the status of the next round, Sr+1, from that of the current round, Sr. Symbolically,
PAYr:Sr→Sr+1.
PAY0, . . . ,PAYr.
In Algorand0, the block Br corresponding to a round r specifies: r itself; the set of payments of round r, PAYr; a quantity (Qr−1), to be explained, and the hash of the previous block, H(Br−1). Thus, starting from some fixed block B0, we have a traditional blockchain:
B
1=(1,PAY1,(Q0),H(B0)),B2=(2,PAY2,(Q1),H(B1)),
In Algorand, the authenticity of a block is actually vouched by a separate piece of information, a “block certificate” CERTr, which turns Br into a proven block,
B
1
As we shall see, CERTr consists of a set of digital signatures for H(Br), those of a majority of the members of SVr, together with a proof that each of those members indeed belongs to SVr. We could, of course, include the certificates CERTr in the blocks themselves, but find it conceptually cleaner to keep it separate.)
In Bitcoin each block must satisfy a special property, that is, must “contain a solution of a crypto puzzle”, which makes block generation computationally intensive and forks both inevitable and not rare. By contrast, Algorand's blockchain has two main advantages: it is generated with minimal computation, and it will not fork with overwhelmingly high probability. Each block Bi is safely final as soon as it enters the blockchain.
To analyze the security of Algorand we specify the probability, F, with which we are willing to accept that something goes wrong (e.g., that a verifier set SVr does not have an honest majority). As in the case of the output length of the cryptographic hash function H, also F is a parameter. But, as in that case, we find it useful to set F to a concrete value, so as to get a more intuitive grasp of the fact that it is indeed possible, in Algorand, to enjoy simultaneously sufficient security and sufficient efficiency. To emphasize that F is parameter that can be set as desired, in the first and second embodiments we respectively set
F=10−12 and F=10−18
Note that 10−12 is actually less than one in a trillion, and we believe that such a choice of F is adequate in our application. Let us emphasize that 10−12 is not the probability with which the Adversary can forge the payments of an honest user. All payments are digitally signed, and thus, if the proper digital signatures are used, the probability of forging a payment is far lower than 10−12, and is, in fact, essentially 0. The bad event that we are willing to tolerate with probability F is that Algorand's blockchain forks. Notice that, with our setting of F and one-minute long rounds, a fork is expected to occur in Algorand's blockchain as infrequently as (roughly) once in 1.9 million years. By contrast, in Bitcoin, a forks occurs quite often.
A more demanding person may set F to a lower value. To this end, in our second embodiment we consider setting F to 10−18. Note that, assuming that a block is generated every second, 1018 is the estimated number of seconds taken by the Universe so far: from the Big Bang to present time. Thus, with F=10−18, if a block is generated in a second, one should expect for the age of the Universe to see a fork.
Algorand is designed to be secure in a very adversarial model. Let us explain.
A user is honest if he follows all his protocol instructions, and is perfectly capable of sending and receiving messages. A user is malicious (i.e., Byzantine, in the parlance of distributed computing) if he can deviate arbitrarily from his prescribed instructions.
The Adversary is an efficient (technically polynomial-time) algorithm, personified for color, who can immediately make malicious any user he wants, at any time he wants (subject only to an upperbound to the number of the users he can corrupt).
The Adversary totally controls and perfectly coordinates all malicious users. He takes all actions on their behalf, including receiving and sending all their messages, and can let them deviate from their prescribed instructions in arbitrary ways. Or he can simply isolate a corrupted user sending and receiving messages. Let us clarify that no one else automatically learns that a user i is malicious, although i's maliciousness may transpire by the actions the Adversary has him take.
This powerful adversary however,
We consider a continuum of Honest Majority of Money (HMM) assumptions: namely, for each non-negative integer k and real h>½,
Assuming that all malicious users perfectly coordinate their actions (as if controlled by a single entity, the Adversary) is a rather pessimistic hypothesis. Perfect coordination among too many individuals is difficult to achieve. Perhaps coordination only occurs within separate groups of malicious players. But, since one cannot be sure about the level of coordination malicious users may enjoy, we'd better be safe than sorry.
Assuming that the Adversary can secretly, dynamically, and immediately corrupt users is also pessimistic. After all, realistically, taking full control of a user's operations should take some time.
The assumption HMMk>h implies, for instance, that, if a round (on average) is implemented in one minute, then, the majority of the money at a given round will remain in honest hands for at least two hours, if k=120, and at least one week, if k=10,000.
Note that the HMM assumptions and the previous Honest Majority of Computing Power assumptions are related in the sense that, since computing power can be bought with money, if malicious users own most of the money, then they can obtain most of the computing power.
We envisage message propagation—i.e., “peer-to-peer gossip”5—to be the only means of communication, and assume that every propagated message reaches almost all honest users in a timely fashion. We essentially assume that each message m propagated by honest user reaches, within a given amount of time that depends on the length of m, all honest users. (It actually suffices that m reaches a sufficiently high percentage of the honest users.) 5Essentially, as in Bitcoin, when a user propagates a message m, every active user i receiving m for the first time, randomly and independently selects a suitably small number of active users, his “neighbors”, to whom he forwards m, possibly until he receives an acknowledgement from them. The propagation of m terminates when no user receives m for the first time.
As already emphasized, Byzantine agreement is a key ingredient of Algorand. Indeed, it is through the use of such a BA protocol that Algorand is unaffected by forks. However, to be secure against our powerful Adversary, Algorand must rely on a BA protocol that satisfies the new player-replaceability constraint. In addition, for Algorand to be efficient, such a BA protocol must be very efficient.
BA protocols were first defined for an idealized communication model, synchronous complete networks (SC networks). Such a model allows for a simpler design and analysis of BA protocols. Accordingly, in this section, we introduce a new BA protocol, BA*, for SC networks and ignoring the issue of player replaceability altogether. The protocol BA* is a contribution of separate value. Indeed, it is the most efficient cryptographic BA protocol for SC networks known so far.
To use it within our Algorand protocol, we modify BA* a bit, so as to account for our different communication model and context.
We start by recalling the model in which BA* operates and the notion of a Byzantine agreement.
In a SC network, there is a common clock, ticking at each integral times r=1, 2, . . .
At each even time click r, each player i instantaneously and simultaneously sends a single message mi,jr. (possibly the empty message) to each player j, including himself. Each mi,jr is correctly received at time click r+1 by player j, together with the identity of the sender i.
Again, in a communication protocol, a player is honest if he follows all his prescribed instructions, and malicious otherwise. All malicious players are totally controlled and perfectly coordinated by the Adversary, who, in particular, immediately receives all messages addressed to malicious players, and chooses the messages they send.
The Adversary can immediately make malicious any honest user he wants at any odd time click he wants, subject only to a possible upperbound t to the number of malicious players. That is, the Adversary “cannot interfere with the messages already sent by an honest user i”, which will be delivered as usual.
The Adversary also has the additional ability to see instantaneously, at each even round, the messages that the currently honest players send, and instantaneously use this information to choose the messages the malicious players send at the same time tick.
The notion of Byzantine agreement might have been first introduced for the binary case, that is, when every initial value consists of a bit. However, it was quickly extended to arbitrary initial values. By a BA protocol, we mean an arbitrary-value one.
In a synchronous network, let be a n-player protocol, whose player set is common knowledge among the players, t a positive integer such that n≥2t+1. We say that is an arbitrary-value (respectively, binary) (n,t)-Byzantine agreement protocol with soundness σ∈(0, 1) if, for every set of values V not containing the special symbol I (respectively, for V={0, 1}), in an execution in which at most t of the players are malicious and in which every player i starts with an initial value vi∈V, every honest player j halts with probability 1, outputting a value outi∈V∪{⊥} so as to satisfy, with probability at least o, the following two conditions:
1. Agreement: There exists out ∈V∪{⊥} such that outi=out for all honest players i.
2. Consistency: if, for some value v∈V, vi=v for all players i, then out=v.
We refer to out as 's output, and to each outi as player i's output.
In our BA protocols, a player is required to count how many players sent him a given message in a given step. Accordingly, for each possible value v that might be sent,
#is(v)
(or just #i(v) when s is clear) is the number of players j from which i has received v in step s.
Recalling that a player i receives exactly one message from each player j, if the number of players is n, then, for all i and s, Z, Σv#is(v)=n.
In this section we present a new binary BA protocol, BBA*, which relies on the honesty of more than two thirds of the players and is very fast: no matter what the malicious players might do, each execution of its main loop not only is trivially executed, but brings the players into agreement with probability ⅓.
In BBA*, each player has his own public key of a digital signature scheme satisfying the unique-signature property. Since this protocol is intended to be run on synchronous complete network, there is no need for a player i to sign each of his messages.
Digital signatures are used to generate a sufficiently common random bit in Step 3. (In Algorand, digital signatures are used to authenticate all other messages as well.)
The protocol requires a minimal set-up: a common random string r, independent of the players' keys. (In Algorand, r is actually replaced by the quantity Qr.)
Protocol BBA* is a 3-step loop, where the players repeatedly exchange Boolean values, and different players may exit this loop at different times. A player i exits this loop by propagating, at some step, either a special value 0* or a special value 1*, thereby instructing all players to “pretend” they respectively receive 0 and 1 from i in all future steps. (Alternatively said: assume that the last message received by a player j from another player i was a bit b. Then, in any step in which he does not receive any message from i, j acts as if i sent him the bit b.)
The protocol uses a counter γ, representing how many times its 3-step loop has been executed. At the start of BBA*, γ=0. (One may think of y as a global counter, but it is actually increased by each individual player every time that the loop is executed.)
There are n>3t+1, where t is the maximum possible number of malicious players. A binary string x is identified with the integer whose binary representation (with possible leadings 0s) is z; and lsb(z) denotes the least significant bit of z.
(C
A proof of Theorem 3.1 can be found in https://people.csail.mit.edu/silvio/Selected-ScientificPapers/DistributedComputation/BYZANTINEAGREEMENTMADETRIVIAL.15pdf.
Let us recall, for arbitrary values, a notion of consensus much weaker than Byzantine agreement.
Definition 3.2. Let be a protocol in which the set of all players is common knowledge, and each player i privately knows an arbitrary initial value v′i.
We say that is an (n,t)-graded consensus protocol if, in every execution with n players, at most t of which are malicious, every honest player i halts outputting a value-grade pair (vi,gi), where gi∈{0, 1, 2}, so as to satisfy the following three conditions:
1. For all honest players i and j, |gi−gj|≤1.
2. For all honest players i and j, gi, gj>0⇒vi=vj.
3. If v′1= . . . =v, =v for some value v, then vi=v and gi=2 for all honest players i.
The following two-step protocol CC is a graded consensus protocol in the literature. To match the steps of protocol Algorand′1 of section 4.1, we respectively name 2 and 3 the steps of CC. (Indeed, the first step of Algorand′1 is concerned with something else: namely, proposing a new block.)
S
S
O
Since protocol GOC is a protocol in the literature, it is known that the following theorem holds.
Theorem 3.2. If n≥3t+1, then CC is a (n,t)-graded broadcast protocol.
We now describe the arbitrary-value BA protocol BA* via the binary BA protocol BBA* and the graded-consensus protocol GC. Below, the initial value of each player i is v′i.
S
S
O
Theorem 3.3. Whenever n≥3t+1, BA* is a (n,t)-BA protocol with soundness 1.
Proof. We first prove Consistency, and then Agreement.
P
P
Since both Consistency and Agreement hold, BA* is an arbitrary-value BA protocol.
▪
Protocol BA* works also in gossiping networks, and in fact satisfies the player replaceability property that is crucial for Algorand to be secure in the envisaged very adversarial model.
Let us now provide some intuition of why the protocols BA* and BBA* can be adapted to be executed in a network where communication is via peer-to-peer gossiping, satisfy player replaceability. For concreteness, assume that the network has 10M users and that each step x of BBA* (or BA*) is executed by a committee of 10,000 players, who have been randomly selected via secret cryptographic sortition, and thus have credentials proving of being entitled to send messages in step x. Assume that each message sent in a given step specifies the step number, is digitally signed by its sender, and includes the credential proving that its sender is entitled to speak in that step.
First of all, if the percentage h of honest players is sufficiently larger than ⅔ (e.g., 75%), then, with overwhelming probability, the committee selected at each step has the required ⅔ honest majority.
In addition, the fact that the 10,000-strong randomly selected committee changes at each step does not impede the correct working of either BBA* or BA*. Indeed, in either protocol, a player i in step s only reacts to the multiplicity with which, in Step s−1, he has received a given message m. Since we are in a gossiping network, all messages sent in Step s−1 will (immediately, for the purpose of this intuition) reach all users, including those selected to play in step s. Furthermore because all messages sent in step s−1 specify the step number and include the credential that the sender was indeed authorized to speak in step s−1. Accordingly, whether he happened to have been selected also in step s−1 or not, a user i selected to play in step s is perfectly capable of correctly counting the multiplicity with which he has received a correct step s−1 message. It does not at all matter whether he has been playing all steps so far or not. All users are in “in the same boat” and thus can be replaced easily by other users.
As discussed, at a very high level, a round of Algorand ideally proceeds as follows. First, a randomly selected user, the leader, proposes and circulates a new block. (This process includes initially selecting a few potential leaders and then ensuring that, at least a good fraction of the time, a single common leader emerges.) Second, a randomly selected committee of users is selected, and reaches Byzantine agreement on the block proposed by the leader. (This process includes that each step of the BA protocol is run by a separately selected committee.) The agreed upon block is then digitally signed by a given threshold (TH) of committee members. These digital signatures are propagated so that everyone is assured of which is the new block. (This includes circulating the credential of the signers, and authenticating just the hash of the new block, ensuring that everyone is guaranteed to learn the block, once its hash is made clear.)
In the next two sections, we present two embodiments of the basic Algorand design, Algorand′1 and Algorand′2, that respectively work under a proper majority-of-honest-users assumption. In Section ?? we show how to adopts these embodiments to work under a honest-majority-of-money assumption.
Algorand′1 only envisages that >⅔ of the committee members are honest. In addition, in Algorand′1, the number of steps for reaching Byzantine agreement is capped at a suitably high number, so that agreement is guaranteed to be reached with overwhelming probability within a fixed number of steps (but potentially requiring longer time than the steps of Algorand′2). In the remote case in which agreement is not yet reached by the last step, the committee agrees on the empty block, which is always valid.
Algorand′2 envisages that the number of honest members in a committee is always greater than or equal to a fixed threshold tH (which guarantees that, with overwhelming probability, at least ⅔ of the committee members are honest). In addition, Algorand′2 allows Byzantine agreement to be reached in an arbitrary number of steps (but potentially in a shorter time than Algorand′1).
Those skilled in the art will realize that many variants of these basic embodiments can be derived. In particular, it is easy, given Algorand′2, to modify Algorand′1 so as to enable to reach Byzantine agreement in an arbitrary number of steps.
Both embodiments share the following common core, notations, notions, and parameters.
Ideally, for each round r, Algorand should satisfy the following properties:
1. Perfect Correctness. All honest users agree on the same block Br.
2. Completeness 1. With probability 1, the block Br has been chosen by a honest user.
(Indeed a malicious user may always choose a block whose payset contains the payments of just his “friends”.)
Of course, guaranteeing perfect correctness alone is trivial: everyone always chooses the official payset PAYr to be empty. But in this case, the system would have completeness 0. Unfortunately, guaranteeing both perfect correctness and completeness 1 is not easy in the presence of malicious users. Algorand thus adopts a more realistic objective. Informally, letting h denote the percentage of users who are honest, h>⅔, the goal of Algorand is
Disregarding excessive time and communication for a moment, perfect Correctness could be guaranteed as follows. At the start of round r, each user i proposes his own candidate block Bir. Then, all users reach Byzantine agreement on just one of the candidate blocks. As per our introduction, the BA protocol employed requires a ⅔ honest majority and is player replaceable. Each of its step can be executed by a small and randomly selected set of verifiers, who do not share any inner variables.
Unfortunately, this approach does not quite work. This is so, because the candidate blocks proposed by the honest users are most likely totally different from each other. Indeed, each honest user sees different payments. Thus, although the sets of payments seen by different honest users may have a lot of overlap, it is unlikely that all honest users will construct a propose the same block. Accordingly, the consistency agreement of the BA protocol is never binding, only the agreement one is, and thus agreement may always been reached on I rather than on a good block.
Algorand′ avoids this problem as follows. First, a leader for round r, ′, is selected. Then, r propagates his own candidate block, . Finally, the users reach agreement on the block they actually receive from r. Because, whenever r is honest, Perfect Correctness and Completeness 1 both hold, Algorand′ ensures that r is honest with probability close to h.
In Algorand's, the rth block is of the form
B
r=(r,PAYr,(Qr−1),H(Br−1).
As already mentioned in the introduction, the quantity Qr−1 is carefully constructed so as to be essentially non-manipulatable by our very powerful Adversary. (Later on in this section, we shall provide some intuition about why this is the case.) At the start of a round r, all users know the blockchain so far, B0, . . . , Br−1, from which they deduce the set of users of every prior round: that is, PK1, . . . , PKr−1. A potential leader of round r is a user i such that
.H(SIGi(r,1,Qr−1))≤p.
Let us explain. Note that, since the quantity Qr−1 is deducible from block Br−1 because of the message retrievability property of the underlying digital signature scheme. Furthermore, the underlying signature scheme satisfies the uniqueness property. Thus, SIGi(r,1,Qr−1) is a binary string uniquely associated to i and r. Accordingly, since H is a random oracle, H(SIGi(r,1,Qr−1)) is a random 256-bit long string uniquely associated to i and r. The symbol “.” in front of H(SIGi(r,1,Qr−1)) is the decimal (in our case, binary) point, so that ri.H(SIGi(r,1,Qr−1)) is the binary expansion of a random 256-bit number between 0 and 1 uniquely associated to i and r. Thus the probability that ri is less than or equal to p is essentially p.
The probability p is chosen so that, with overwhelming (i.e., 1−F) probability, at least one potential verifier is honest. (If fact, p is chosen to be the smallest such probability.)
Note that, since i is the only one capable of computing his own signatures, he alone can determine whether he is a potential verifier of round 1. However, by revealing his own credential, σirSIGi(r,1,Qr−1), i can prove to anyone to be a potential verifier of round r.
The leader r is defined to be the potential leader whose hashed credential is smaller that the hashed credentials of all other potential leader j: that is, H(σ
Note that, since a malicious r may not reveal his credential, the correct leader of round r may never be known, and that, barring improbable ties, r is indeed the only leader of round r.
Let us finally bring up a last but important detail: a user i can be a potential leader (and thus the leader) of a round r only if he belonged to the system for at least k rounds. This guarantees the non-manipulatability of Qr and all future Q-quantities. In fact, one of the potential leaders will actually determine Qr.
Each step s>1 of round r is executed by a small set of verifiers, SVr,s. Again, each verifier i∈SVr,s, is randomly selected among the users already in the system k rounds before r, and again via the special quantity Qr−1. Specifically, i∈PKr−k is a verifier in SVr,s, if
.H(SIGi(r,s,Qr−1))≤p′.
Once more, only i knows whether he belongs to SVr,s, but, if this is the case, he could prove it by exhibiting his credential σir,sH(SIGi(r,s,Qr−1)). A verifier i∈SVr,s sends a message, mir,s, in step s of round r, and this message includes his credential σir,s, so as to enable the verifiers f the nest step to recognize that mir,s is a legitimate step-s message.
The probability p′ is chosen so as to ensure that, in SVr,s, letting # good be the number of honest users and # bad the number of malicious users, with overwhelming probability the following two conditions hold.
For embodiment Algorand 1:
(1) # good>2# bad and
(2) # good+4·# bad<2n, where n is the expected cardinality of SVr,s.
For embodiment Algorand′2:
(1) # good>tH and
(2) # good+2# bad<2tH, where tH is a specified threshold.
These conditions imply that, with sufficiently high probability, (a) in the last step of the BA protocol, there will be at least given number of honest players to digitally sign the new block Br, (b) only one block per round may have the necessary number of signatures, and (c) the used BA protocol has (at each step) the required ⅔ honest majority.
If the round-r leader r is honest, then the corresponding block is of the form
B
r=(r,PAYr,(Qr−1),H(Br−1))
where the payset PAYr is maximal. (recall that all paysets are, by definition, collectively valid.)
Else (i.e., if r is malicious), Br has one of the following two possible forms:
B
r=(r,PAYr,SIGi(Qr−1),H(Br−1)) and Br=Bεr(r,Ø,Qr−1,H(Br−1))
In the first form, PAYr is a (non-necessarily maximal) payset and it may be PAYr=Ø; and i is a potential leader of round r. (However, i may not be the leader r. This may indeed happen if if r keeps secret his credential and does not reveal himself.)
The second form arises when, in the round-r execution of the BA protocol, all honest players output the default value, which is the empty block Bεr in our application. (By definition, the possible outputs of a BA protocol include a default value, generically denoted by ⊥. See section 3.2.)
Note that, although the paysets are empty in both cases, Br=(r,ø,SIGi(Qr−1),H(Br−1)) and Bεr are syntactically different blocks and arise in two different situations: respectively, “all went smoothly enough in the execution of the BA protocol”, and “something went wrong in the BA protocol, and the default value was output”.
Let us now intuitively describe how the generation of block Br proceeds in round r of Algorand′. In the first step, each eligible player, that is, each player i∈PKr−k, checks whether he is a potential leader. If this is the case, then i is asked, using of all the payments he has seen so far, and the current blockchain, B0, . . . , Br−1, to secretly prepare a maximal payment set, PAYir, and secretly assembles his candidate block, Br=(r,PAYr,SIGi(Qr−1),H(Br−1)). That, is, not only does he include in Bir, as its second component, the just prepared payset, but also, as its third component, his own signature of Qr−1, the third component of the last block, Br−1. Finally, he propagates his round-r-step-1 message, mir−1, which includes (a) his candidate block Bir, (b) his proper signature of his candidate block (i.e., his signature of the hash of Bir, and (c) his own credential σir,1, proving that he is indeed a potential verifier of round r.
(Note that, until an honest i produces his message mir,1, the Adversary has no clue that i is a potential verifier. Should he wish to corrupt honest potential leaders, the Adversary might as well corrupt random honest players. However, once he sees mir,1, since it contains i's credential, the Adversary knows and could corrupt i, but cannot prevent mir,1, which is virally propagated, from reaching all users in the system.) In the second step, each selected verifier j∈SVr,2 tries to identify the leader of the round. Specifically, j takes the step-1 credentials, σi
Recall that each considered credential is a digital signature of Qr−1, that SIGi(r,1,Qr−1) is uniquely determined by i and Qr−1, that H is random oracle, and thus that each H(SIGi(r,1,Qr−1) is a random 256-bit long string unique to each potential leader i of round r.
From this we can conclude that, if the 256-bit string Qr−1 were itself randomly and independently selected, than so would be the hashed credentials of all potential leaders of round r. In fact, all potential leaders are well defined, and so are their credentials (whether actually computed or not). Further, the set of potential leaders of round r is a random subset of the users of round r−k, and an honest potential leader i always properly constructs and propagates his message mir, which contains i's credential. Thus, since the percentage of honest users is h, no matter what the malicious potential leaders might do (e.g., reveal or conceal their own credentials), the minimum hashed potential-leader credential belongs to a honest user, who is necessarily identified by everyone to be the leader r of the round r. Accordingly, if the 256-bit string Qr−1 were itself randomly and independently selected, with probability exactly h (a) the leader r is honest and (b) j=r for all honest step-2 verifiers j.
In reality, the hashed credential are, yes, randomly selected, but depend on Qr−1 which is not randomly and independently selected. A careful analysis, however, guarantees that Qr−1 is sufficiently non-manipulatable to guarantee that the leader of a round is honest with probability h′ sufficiently close to h: namely, h′>h2(1+h−h2). For instance, if h=80%, then h′>0.7424.
Having identified the leader of the round (which they correctly do when the leader r is honest), the task of the step-2 verifiers is to start executing BA* using as initial values what they believe to be the block of the leader. Actually, in order to minimize the amount of communication required, a verifier j∈SVr,2 does not use, as his input value v′j to the Byzantine protocol, the block Bj that he has actually received from j (the user j believes to be the leader), but the the leader, but the hash of that block, that is, v′j=H(Bi). Thus, upon termination of the BA protocol, the verifiers of the last step do not compute the desired round-r block Br, but compute (authenticate and propagate) H(Br). Accordingly, since H(Br) is digitally signed by sufficiently many verifiers of the last step of the BA protocol, the users in the system will realize that H(Br) is the hash of the new block. However, they must also retrieve (or wait for, since the execution is quite asynchronous) the block Br itself, which the protocol ensures that is indeed available, no matter what the Adversary might do.
Algorand′1 and Algorand′2 have a significant degree of asynchrony. This is so because the Adversary has large latitude in scheduling the delivery of the messages being propagated. In addition, whether the total number of steps in a round is capped or not, there is the variance contribute by the number of steps actually taken.
As soon as he learns the certificates of B0, . . . ,Br−1, a user i computes Qr−1 and starts working on round r, checking whether he is a potential leader, or a verifier in some step s of round r.
Assuming that i must act at step s, in light of the discussed asynchrony, i relies on various strategies to ensure that he has sufficient information before he acts.
For instance, he might wait to receive at least a given number of messages from the verifiers of the previous step (as in Algorand′1), or wait for a sufficient time to ensure that he receives the messages of sufficiently many verifiers of the previous step (as in Algorand′2).
Recall that, ideally, the quantities Qr should random and independent, although it will suffice for them to be sufficiently non-manipulatable by the Adversary.
At a first glance, we could choose Qr−1 to coincide with H(PAYr−1). An elementary analysis reveals, however, that malicious users may take advantage of this selection mechanism.6 Some additional effort shows that myriads of other alternatives, based on traditional block quantities are easily exploitable by the Adversary to ensure that malicious leaders are very frequent. We instead specifically and inductively define our brand new quantity Qr so as to be able to prove that it is non-manipulatable by the Adversary. Namely, 6We are at the start of round r−1. Thus, Qr−2=PAYr−2 is publicly known, and the Adversary privately knows who are the potential leaders he controls. Assume that the Adversary controls 10% of the users, and that, with very high probability, a malicious user w is the potential leader of round r−1. That is, assume that H(SIG
Q
r
H(SIG
The intuition of why this construction of Qr works is as follows. Assume for a moment that Qr−1 is truly randomly and independently selected. Then, will so be Qr? When r is honest the answer is (roughly speaking) yes. This is so because
H((·),r):{0,1}256→{0,1}256
is a random function. When r is malicious, however, Qr is no longer univocally defined from Qr−1 and r. There are at least two separate values for Qr. One continues to be QrH((Qr−1),r), and the other is H(Qr−1,r). Let us first argue that, while the second choice is somewhat arbitrary, a second choice is absolutely mandatory. The reason for this is that a malicious r can always cause totally different candidate blocks to be received by the honest verifiers of the second step.7 Once this is the case, it is easy to ensure that the block ultimately agreed upon via the BA protocol of round r will be the default one, and thus will not contain anyone's digital signature of Qr−1. But the system must continue, and for this, it needs a leader for round r. If this leader is automatically and openly selected, then the Adversary will trivially corrupt him. If it is selected by the previous Qr−1 via the same process, than r will again be the leader in round r+1. We specifically propose to use the same secret cryptographic sortition mechanism, but applied to a new Q-quantity: namely, H(Qr−1,r). By having this quantity to be the output of H guarantees that the output is random, and by including r as the second input of H, while all other uses of H have either a single input or at least three inputs, “guarantees” that such a Qr is independently selected. Again, our specific choice of alternative Qr does not matter, what matter is that r has two choice for Qr, and thus he can double his chances to have another malicious user as the next leader. 7For instance, to keep it simple (but extreme), “when the time of the second step is about to expire”, r could directly email a different candidate block Bi to each user i. This way, whoever the step-2 verifiers might be, they will have received totally different blocks.
The options for Qr may even be more numerous for the Adversary who controls a malicious r. For instance, let x, y, and z be three malicious potential leaders of round r such that
H(σxr,1)<H(σyr,1)<H(σzr,1)
and H(σzr,1) is particularly small. That is, so small that there is a good chance that H(σzr,1) is smaller of the hashed credential of every honest potential leader. Then, by asking x to hide his credential, the Adversary has a good chance of having y become the leader of round r−1. This implies that he has another option for Qr: namely, H(SIGy(Qr−1),r). Similarly, the Adversary may ask both x and y of withholding their credentials, so as to have z become the leader of round r−1 and gaining another option for Qr: namely, H(SIGz(Qr−1),r).
Of course, however, each of these and other options has a non-zero chance to fail, because the Adversary cannot predict the hash of the digital signatures of the honest potential users.
A careful, Markov-chain-like analysis shows that, no matter what options the Adversary chooses to make at round r−1, as long as he cannot inject new users in the system, he cannot decrease the probability of an honest user to be the leader of round r+40 much below h. This is the reason for which we demand that the potential leaders of round r are users already existing in round r−k. It is a way to ensure that, at round r−k, the Adversary cannot alter by much the probability that an honest user become the leader of round r. In fact, no matter what users he may add to the system in rounds r−k through r, they are ineligible to become potential leaders (and a fortiori the leader) of round r. Thus the look-back parameter k ultimately is a security parameter. (Although, as we shall see in section ??, it can also be a kind of “convenience parameter” as well.)
Although the execution of our protocol cannot generate a fork, except with negligible probability, the Adversary could generate a fork, at the rth block, after the legitimate block r has been generated.
Roughly, once Br has been generated, the Adversary has learned who the verifiers of each step of round r are. Thus, he could therefore corrupt all of them and oblige them to certify a new block . Since this fake block might be propagated only after the legitimate one, users that have been paying attention would not be fooled.8 Nonetheless, would be syntactically correct and we want to prevent from being manufactured. 8Consider corrupting the news anchor of a major TV network, and producing and broadcasting today a newsreel showing secretary Clinton winning the last presidential election. Most of us would recognize it as a hoax. But someone getting out of a coma might be fooled.
We do so by means of a new rule. Essentially, the members of the verifier set SVr,s of a step s of round r use ephemeral public keys pkir,s to digitally sign their messages. These keys are single-use-only and their corresponding secret keys skir,s are destroyed once used. This way, if a verifier is corrupted later on, the Adversary cannot force him to sign anything else he did not originally sign.
Naturally, we must ensure that it is impossible for the Adversary to compute a new key and convince an honest user that it is the right ephemeral key of verifier i∈SVr,s to use in step s.
In this section, we construct a version of Algorand′ working under the following assumption.
and there are at most m/3 trials. If all trials fail then Lrm/3. Lr will be used to upper-bound the time needed to generate block Br.
the number of signatures needed in the ending conditions of the protocol.
|HSVr,s|>2|MSVr,s| and |HSVr,s|+4|MSVr,s|<2n.
As already mentioned, we wish that a verifier i∈SVr,s digitally signs his message mir,s of step s in round r, relative to an ephemeral public key pkir,s, using an ephemeral secrete key skir,s that he promptly destroys after using. We thus need an efficient method to ensure that every user can verify that pkir,s is indeed the key to use to verify i's signature of mir,s. We do so by a (to the best of our knowledge) new use of identity-based signature schemes.
At a high level, in such a scheme, a central authority A generates a public master key, PMK, and a corresponding secret master key, SMK. Given the identity, U, of a player U, A computes, via SMK, a secret signature key skU relative to the public key U, and privately gives skU to U. (Indeed, in an identity-based digital signature scheme, the public key of a user U is U itself!) This way, if A destroys SMK after computing the secret keys of the users he wants to enable to produce digital signatures, and does not keep any computed secret key, then U is the only one who can digitally sign messages relative to the public key U. Thus, anyone who knows “U's name”, automatically knows U's public key, and thus can verify U's signatures (possibly using also the public master key PMK).
In our application, the authority A is user i, and the set of all possible users U coincides with the round-step pair (r,s) in—say—S={i}×{r′, . . . , r′+106}×{1, . . . , m+3}, where r′ is a given round, and m+3 the upperbound to the number of steps that may occur within a round. This way, pkir,s(i,r,s), so that everyone seeing i's signature
can, with overwhelming probability, immediately verify it for the first million rounds r following r′.
In other words, i first generates PMK and SMK. Then, he publicizes that PMK is i's master public key for any round r∈[r′,r′+106], and uses SMK to privately produce and store the secret key skir,s for each triple (i,r,s)∈S. This done, he destroys SMK. If he determines that he is not part of SVr,s, then i may leave skir,s alone (as the protocol does not require that he aunthenticates any message in Step s of round r). Else, i first uses skir,s to digitally sign his message mir,s, and then destroys skir,s.
Note that i can publicize his first public master key when he first enters the system. That is, the same payment g that brings i into the system (at a round r′ or at a round close to r′), may also specify, at i's request, that i's public master key for any round r∈[r′,r′+106] is PMK—e.g., by including a pair of the form (PMKi[r′,r′+106]).
Also note that, since m+3 is the maximum number of steps in a round, assuming that a round takes a minute, the stash of ephemeral keys so produced will last i for almost two years. At the same time, these ephemeral secret keys will not take i too long to produce. Using an elliptic-curve based system with 32B keys, each secret key is computed in a few microseconds. Thus, if m+3=180, then all 180M secret keys can be computed in less than one hour.
When the current round is getting close to r′+106, to handle the next million rounds, i generates a new (PMK′,SMK′) pair, and informs what his next stash of ephemeral keys is by—for example—having SIGi(PMK′,[r′+106+1,r′+2·106+1]) enter a new block, either as a separate “transaction” or as some additional information that is part of a payment. By so doing, i informs everyone that he/she should use PMK′ to verify i's ephemeral signatures in the next million rounds. And so on.
(Note that, following this basic approach, other ways for implementing ephemeral keys without using identity-based signatures are certainly possible. For instance, via Merkle trees.11) 11In this method, i generates a public-secret key pair (pkir,s, skir,s) for each round-step pair (r,s) in—say—{r′, . . . , r′+106}×{1, . . . , m+3}. Then he orders these public keys in a canonical way, stores the jth public key in the jth leaf of a Merkle tree, and computes the root value Ri, which he publicizes. When he wants to sign a message relative to key pkir,s, i not only provides the actual signature, but also the authenticating path for pkir,s relative to Ri. Notice that this authenticating path also proves that pkir,s is stored in the jth leaf. Form this idea, the rest of the details can be easily filled.
Other ways for implementing ephemeral keys are certainly possible—e.g., via Merkle trees.
5.3 Matching the Steps of Algorand′1 with those of BA*
As we said, a round in Algorand′1 has at most m+3 steps.
Recall that, in each step s of a round r, a verifier i∈SVr,s uses his long-term public-secret key pair to produce his credential, σir,sSIGi(r,s,Qr−1), as well as SIGi(Qr−1) in case s=1. Verifier i uses his ephemeral secret key skir,s to sign his (r,s)-message mir,s. For simplicity, when r and s are clear, we write esigi(x) rather than sigpk
Instructions for every user i∈PKr−k: User i starts his own Step 1 of round r as soon as he knows Br−1
In practice, to shorten the global execution of Step 1, it is important that the (r,1)-messages are selectively propagated. That is, for every user i in the system, for the first (r,1)-message that he ever receives and successfully verifies,12 player i propagates it as usual. For all the other (r,1)-messages that player i receives and successfully verifies, he propagates it only if the hash value of the credential it contains is the smallest among the hash values of the credentials contained in all (r,1)-messages he has received and successfully verified so far. Furthermore, as suggested by Georgios Vlachos, it is useful that each potential leader i also propagates his credential σir,1 separately: those small messages travel faster than blocks, ensure timely propagation of the mjr,1's where the contained credentials have small hash values, while make those with large hash values disappear quickly. 12That is, all the signatures are correct and both the block and its hash are valid—although i does not check whether the included payset is maximal for its proposer or not.
Instructions for every user i∈PKr−k: User i starts his own Step 2 of round r as soon as he knows Br−1.
Instructions for every user i∈PKr−k: User i starts his own Step 3 of round r as soon as he knows Br−1.
Instructions for every user i∈PKr−k: User i starts his own Step 4 of round r as soon as he knows Br−1.
b
i
0 if gi=2, and bi1 otherwise.
Instructions for every user i∈PKr−k: User i starts his own Step s of round r as soon as he knows Br−1.
Instructions for every user i∈PKr−k: User i starts his own Step s of round r as soon as he knows Br−1.
Instructions for every user i∈PKr−k: User i starts his own Step s of round r as soon as he knows Br−1.
Instructions for every user i∈PKr−k: User i starts his own Step m+3 of round r as soon as he knows Br−1.
Instructions for every user i in the system: User i starts his own round r as soon as he knows Br−1, and waits for block information as follows.
Essentially, in Algorand, blocks are generated in rounds. In a round r
(1) A properly credentialed leader proposes a new block and then
(2) Properly credentialed users run, over several steps, a proper Byzantine agreement (BA) protocol on the block proposed.
The preferred BA protocol is BA*. The block proposal step can be considered step 1, so that the steps of BA* are 2, 3, . . .
Only a proper user i, randomly selected according among the users in the system, is entitled to send a message mir,s in step s of round r. Algorand is very fast and secure because such a user i checks whether he is entitled to speak. If this is the case, user i actually obtains a proof, a credential. If it is his turn to speak in step s of round r, i propagates in the network both his credential, σir,s, and his digitally signed message mir,s. The credential proves to other users that they should take in consideration the message mir,s.
A necessary condition for user i to be entitled to speak in step s of round r is that he was already in the system a few rounds ago. Specifically, k rounds before round r, where k is a parameter termed the ‘look-back’ parameter. That is, to be eligible to speak in round r, i must belong to the PKr−k, the set of all public keys/users already in the system at round r−k. (Users can be identified with their public keys.) This condition is easy to verify in the sense that it is derivable form the blockchain.
The other condition is that
H(SIGi(r,s,Qr−1))<p
where p is a given probability that controls the expected number of verifiers in SVr,s, that is, the set of users entitled to speak in step s of round r. If this condition is satisfied, then i's credential is defined to be
σir,sSIGi(r,s,Qr−1)
Of course, only i can figure out whether he belongs to SVr,s, all other users, who lack knowledge of i secret signing key, have no idea about it. However, if i∈SVr,s, then i can demonstrate that this is the case to anyone by propagating his credential 4a'S given the blockchain so far. Recall in fact that (1) Qr−1 is easily computable from the previous block, Br−1, although essentially unpredictable sufficiently many blocks before, and (2) anyone can verify i's digital signatures (relative to his long-term key in the system).
Also recall that, in the versions of Algorand so far, a verifier i∈SVr,s digitally signs his step-s-round-r message Mir,s relative to an ephemeral public key pkir,s, which anyone can, given the block chain, realizes genuinely corresponds to i and step s of round r. This “ephemeral signature” is denoted by sigi(mir,s), that is using small letters so as to differentiate it from i signatures with his “long-term” key, which are denoted by capital letters.
In sum, a user in SVr,s propagates two separate messages in step s of round r: (a) his credential, σir,s, and (b) his (ephemerally) digitally signed step-s-round-r message, esigi(mir,s). After he does so, i deletes his secret ephemeral key corresponding to pkir,s.
This use of ephemeral keys prevents that an adversary who corrupts sufficiently many verifiers of round r after the block Br has been produced is able to generate a different round-r block.
Recall that, in effect, the verifiers of step 1 are the potential leaders, and that their step-1-round-r messages are the blocks they propose. (The leader t of round r is defined to be the potential leader whose hashed credential is smallest. In case of improbable ties, may choose the potential leader who is lexicographically first.) For any step s>1, the message mir,s of i∈SVr,s is his “control message”, that is, his message in the BA protocol BA*.
Separating a verifier i's credential from his (digitally signed) message mir,s has two main advantages:
Let us first describe a new embodiment of Algorand that dispenses with the use of ephemeral keys for ultimately certifying a block, but uses ephemeral keys for all other steps.
Then, we shall describe how to get rid of ephemeral keys in Algorand in all steps, but the first, block-proposing step.
The new embodiment uses the same step 1 as before. Thus, a potential leader i of round r signs his proposed block Bir relative to his corresponding ephemeral key; erases the corresponding secret ephemeral key; and then propagates his own credential and signature of Bir.
In a round r, every step s of the BA protocol BA* remains the same as before. Thus, in particular, a verifier i∈SVr,s propagates his credential and his own step-s-round-r message mir,s digitally signed relative to his r-s ephemeral public key, and erases the corresponding secret ephemeral key. However, the following change is applied to the ending condition of the first coin-fixed-to-1 step, and all subsequent steps of BBA*.
Assume that a user i has, in such a step s, has reached the end condition for the first time. Then it must be the case that
Accordingly, if i were a verifier in SVr,s, then in prior embodiments of Algorand, he would have stopped the execution right away and would have learned the block Br, for which he would be in possession of CERTr. Recall that CERTr consisted of a given number of ephemeral digital signatures. We now refer to such CERTr as an ‘ephemeral certificate’ of Br.
In our new embodiment of Algorand, user i can be an arbitrary user in PKr−k, where k is a look-back parameter, rather than a verifier in SVr,s (which necessarily belongs to PKr−k). Such an arbitrary user i now no longer stops (simulating) his execution of round r. Rather, using his long-term secret key, he produces a signature of data indicating that he considers block B to be final and guaranteeing that the signature has a proper chance to be taken into proper consideration. For instance, without any limitation intended, i computes
s
i=SIGi(FINAL,r,s,Qr−1,H(B))
where B is the just constructed latest block in the blockchain. If H(si)<p, then i propagates si, and we refer to si as a credentialed certifying signature. (Here, p is a given parameter in [0, 1].)
A given threshold T of such signatures constitute a non-ephemeral certificate for B.
Now, only non-ephemeral certificates really matter. Ephemeral certificates can be considered just a ‘stepping stone’ towards the real: non-ephemeral certificates.
An honest user, who sees a final certificate for a block Br, no longer contributes to the generation or the final certification of a block of round r.
Analysis Even though non-ephemeral certificates consist of long-term signatures, the embodiment remains secure. Essentially, this is so, because, for proper choices of p and T, the adversary cannot feasibly find any string X for which he can produce T signatures sj of the form
s
j=SIGi(FINAL,r,s,Qr−1,X)
where all j are corrupted users and H(sj)≤p.
(In this application, T could be quite small—e.g., around 500. This is so, because it suffices that at least one of the T signatures is from an honest user. In fact, T can be much smaller, because it suffices to produce non-ephemeral certificates very often, but not necessarily for every block.)
Also notice that in the new embodiment the Adversary cannot flood the network by obliging honest users to propagate ‘arbitrary credentialed certifying signatures’ computed by corrupted users. In fact, although any malicious j∈PKr−k could find some arbitrary string xj such that H(SIGi(FINAL,r,s,Qr−1,xj))<p, by a proper use of propagation rules, the signature SIGi(FINAL,r,s,Qr−1,z) will never be relayed by a honest user. In fact, a user a will forward a signature SIGi(FINAL,r,s,Qr−1,H(B)) not only if (1) j∈PKr−k and (2) H(SIGi(FINAL,r,s,Qr−1,H(B))<p, but also if (3) H(B) is the hash of a block B for which u himself has seen a non-ephemeral certificate.
In fact, we could replace the above condition 3 with the following weaker one:
Indeed, when a honest user i has seen a full ephemeral certificate for B, then (in absence of partitions) the other honest users must have seen B approved by a large number of verifiers of the proper step. This number is actually sufficient to identify the only block that stands a chance of being non-ephemerally certified.
The above embodiment requires a minimum number of changes to the original Algorand protocol. Let us now explain how to avoid ephemeral keys in every step, but the first one. The idea is that, for every step s>1, there are not step-s verifiers. Rather, for every round r, every user internally executes step s as if he were a verifier in SVr,s, so as to internally compute his step-s-round-r message mir,s. At this point, instead of digitally sign mir,s with his ephemeral key pkir,s, i checks whether he is entitled to propagate the message mir,s as follows. First, i checks whether he was in the system k round ago: that is, whether i∈PKr−k. If this is the case, then i digitally signs mir,s with his long-term key, together with the quantity Qr−1: for instance, he computes sir,s=SIGi(r,s,mir,s, Qr−1) and checks whether the hash of this signature is ≤p, for a given probability p. If this is the case, then i is entitled to propagate mir,s and actually propagates sir,s. Note that, given sir,s, every one can verify that i was entitled to propagate mir,s. In step s+1, users only consider step-s messages propagated by entitled users.
An honest user, who has (at least internally) executed step s of round r, no longer executes or participates to the execution to such a step.
Note that the mechanism described herein is applicable to other blockchain systems where it is desirable to randomly choose a subset of users for a particular purpose, such as verification, in a way that is generally verifiable. Thus, the system described herein may be adapted to other blockchain schemes, such as Ethereum or Litecoin or even blockchain schemes that do not relate directly to currency.
The system described herein may be adapted to be applied to and combined with mechanisms set forth in any or all of PCT/US2017/031037, filed on May 4, 2017, Ser. No. 15/551,678 filed Aug. 17, 2017, 62/564,670 filed on Sep. 28, 2017, 62/567,864 filed on Oct. 4, 2017, 62/570,256 filed on Oct. 10, 2017, 62/580,757 filed on Nov. 2, 2017, 62/607,558 filed on Dec. 19, 2017, 62/632,944 filed on Feb. 20, 2018 and 62/643,331 filed on Mar. 15, 2018, all of which are incorporated by reference herein.
Software implementations of the system described herein may include executable code that is stored in a computer readable medium and executed by one or more processors. The computer readable medium may be non-transitory and include a computer hard drive, ROM, RAM, flash memory, portable computer storage media such as a CD-ROM, a DVD-ROM, a flash drive, an SD card and/or other drive with, for example, a universal serial bus (USB) interface, and/or any other appropriate tangible or non-transitory computer readable medium or computer memory on which executable code may be stored and executed by a processor. The system described herein may be used in connection with any appropriate operating system.
Other embodiments of the invention will be apparent to those skilled in the art from a consideration of the specification or practice of the invention disclosed herein. It is intended that the specification and examples be considered as exemplary only, with the true scope and spirit of the invention being indicated by the following claims.
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/US2018/053360 | 9/28/2018 | WO | 00 |
Number | Date | Country | |
---|---|---|---|
62643331 | Mar 2018 | US | |
62632944 | Feb 2018 | US | |
62607558 | Dec 2017 | US | |
62580757 | Nov 2017 | US | |
62570256 | Oct 2017 | US | |
62567864 | Oct 2017 | US | |
62564670 | Sep 2017 | US |