This invention relates to a system and method for verifying that any of a set of digital records has not been altered, without leaking any information about the contents of other records.
The digital world is defined by events, many of which are or can be logged. For example, in the context of computer systems, syslog may be used as “a standard for computer data logging. It separates the software that generates messages from the system that stores them and the software that reports and analyzes them. Syslog can be used for computer system management and security auditing as well as generalized informational, analysis, and debugging messages. It is supported by a wide variety of devices (like printers and routers) and receivers across multiple platforms. Because of this, syslog can be used to integrate log data from many different types of systems into a central repository” (http://en.wikipedia.org/wiki/Syslog).
Rsyslog, developed by Rainer Gerhards, extends syslog as “an open source software utility used on UNIX and Unix-like computer systems for forwarding log messages in an IP network. It implements the basic syslog protocol, extends it with content-based filtering, rich filtering capabilities, flexible configuration options and adds important features such as using TCP for transport” (http://en.wikipedia.org/wiki/Rsyslog).
Such logs may be maintained not only for “real” computer systems but for virtualized computers (“virtual machines”—VMs) as well; indeed, the system and state changes of VMs themselves may be logged as events. Events are not restricted to computers, of course. As another example, telephone companies routinely log all uses of their subscribers' phones, including any exchange of voice, text, network communications, often including time-tracking, and not only for purposes of billing. In short, any activity that can be recorded and stored in digital form can be considered to be a loggable event.
Many methods are well known for digitally signing various sorts of records. Loggable events—singularly or grouped—can be treated as such records and signed as any others so as to provide a certain level of assurance that a log of these events, or some individually signed subset, presented later, exactly matches what was signed. One potential problem, however, is that the data contained in event logs or other input data sets may display unacceptably low entropy, that is, the possible input data may be too limited or is too “organized”; for example, it may have relatively few possible variations or a relatively higher probability of occurrence of one entry given another entry. Thus, whereas the universe of all possible, general documents or other digital records is too vast for exhaustive analysis (trying all possibilities) to succeed, this may not be true—or not provable to a level of confidence desired by many users—in the case of events. System event logs may often have this property of a small enough range of possibilities that an exhaustive “brute force” attack may succeed in defeating the otherwise inherent security of any data-signing scheme. Of course, even in general, high-entropy environments, an additional, provable assurance of security is always welcome.
Increasingly, logs from various information systems are used as evidence. With that trend, also the requirements on maintenance and presentation of the log data are growing. Integrity and authenticity, that is, the confidence that the information in the log has not been tampered with or even replaced with another one altogether, are obvious requirements, especially if the log data is to be used for dispute resolution or produced as evidence in legal proceedings, tax audits, etc., to ensure that a virtual machine has not been altered, for example upon migration, to provide proof of financial transactions, to verify telephone usage, etc. As information systems log all their activities in a sequential manner, often the details of the transactions involved in the dispute are interspersed with other information in a log. To protect the confidentiality of the unrelated events, it is then desirable to be able to extract only some records from the signed log and still prove their integrity. In the light of the above, there is therefore a need to addresses all or at least some of the following design goals for a log-signing scheme:
This invention relates to a system and method for verifying that a set of digital input records has not been altered after being entered into and signed by the system. By providing a mechanism for additional security, the invention is particularly useful where the universe of possible inputs is small enough that exhaustive attacks have a higher than acceptable chance of success, that is, where the entropy of the input records is unacceptably low. This will often be the case where the input records are event logs, such as system or other event logs for computer systems (both physical and virtual), telephone and other telecommunications devices, computer-controlled machine and processes, etc. The invention may be used, however, in any situation where increased security is desired, including even for high-entropy data that one wishes to be able to verify.
The primary example used below to describe various inventive aspects of the invention will be for event logs such as syslog, rsyslog, Windows event log, etc. As just explained, these are particularly relevant situations, but, nonetheless, examples only.
Embodiments of the invention provide a digital data-signing scheme that will achieve almost all of the goals mentioned above—there will in some chosen implementations be some trade-offs on the efficiency goals, but typically this will not compromise the security goals. Purely by way of example, aspects of the invention are described below in the context of securely and digitally signing event logs, although, as mentioned elsewhere, the invention can also be used to increase security even in the case of other types of input records to be signed.
A computational process producing a log may, in principle, run indefinitely and thus the log as an abstract entity need not (but of course may) have a well-defined beginning and end. In the following, by way of example, the log is modeled as an ordered sequence of blocks, where each block in turn is an ordered sequence of a finite number of records. In
Many practical logging systems work this way, for example, in the case of syslog output being sent to a log file that is periodically rotated. The most straightforward strategy—simply signing each log block as a unit—would satisfy all the requirements related to processing of the whole block, but would make it impossible to prove the integrity of individual records without exposing everything else in the block. Another possible strategy—signing each record individually—would, of course, have very high overhead in terms of both processing and storage, as signing is quite an expensive operation and the size of a signature may easily exceed the size of a typical log record; more importantly, it would also fail to fully ensure the integrity of the log as a whole since deletion of a record along with its signature would typically not be detected in this scheme.
A possible improvement over both of the above naive strategies would be to compute a hash value of each record in a log block and then sign the sequence of hash values instead of the records themselves. This would ensure the integrity of the whole log block, significantly reduce the overhead compared to signing each record separately and also remove the need to ship the whole log block when a single record is needed as evidence; however, the size of the proof of a record would still be linear in the size of the block, which can easily run into multiple millions of records for a busy system.
This invention does not depend on any particular method for defining the size of a block to be processed. One common and natural choice is, as used by way of example here, to define a block as having a certain number of entries. One advantage of this choice is that one can if desired set the block size to be some number that is easy for processing or indexing, such as a power of two. Another choice could be to define a block as being all entries that occur in certain time intervals. Yet another choice might be to define certain events as “triggers”, such that a new block is started upon the occurrence of one or more starting events (such as migration of a virtual machine, switching to execution of chosen processes, I/O requests, etc.), and is terminated and processed upon the occurrence of any ending event. Skilled programmers will know how to make still other choices, such as those involving multiple selection rules, for example, those combining both numerical and temporal limits and/or triggers.
In implementations of the invention designed for use with more general input sets of digital records, any known method may be used to define them as an unambiguous set of digital data for processing and signature. As mentioned, this invention is particular suitable for verifying individual members of sets of relatively low-entropy input records, but it may also be used more generally to provide additional security even for sets of relatively high-entropy input records such as general documents converted into or originally created in digital form, insurance, financial, legal or medical records or results or other test data, SMS telephone messages (telephone “text message”), a sub-set of a hard disk, one or more files representing the whole or partial state of a virtual machine, or any of countless other types of digital records one might want to securely verify.
To further reduce the size of the evidence for a single record, the records may be aggregated using a Merkle tree data and computational structure, that is, a binary tree whose leaves are the hash values of the records and each non-leaf node is computed as the hash value of the concatenation of the values in its child nodes. Such a Merkle tree structure, or a similar hash tree structure, may be used (with adaptation according to this invention) in conjunction with a data-signing infrastructure. The hash value in the root node of the hash tree may then be digitally signed and for each leaf node a compact (logarithmic in the number of leaves) proof extracted showing that the hash value in the leaf participated in the computation that led to the signed root hash value. There are two complications, however. The first is that the security of such an aggregation scheme can in general be proven only if some restrictions are placed on the shapes of the hash chains allowed as participation proofs. One sufficient method for accomplishing this is appending the height of the sub-tree to the concatenated hash values from the child nodes before hashing; this then limits the length of the hash chains accepted during verification and allows for the security of the scheme to be formally proven.
The second complication is that the hash chain extracted from the Merkle tree for one node contains hash values of other nodes. A strong hash function cannot generally be directly reversed to learn the input value from which the hash value in the chain was created, but a typical log record may contain insufficient entropy for this to hold true—an attacker who knows the pattern of the input could exhaustively test all possible variants to find the one that yields the hash value actually in the chain and thus learn the contents of the record. To prevent this kind of informed brute-force attack, according to this invention, a blinding mask with sufficient entropy is added, preferably to each record before aggregating the hash values.
Merely for the sake of succinctness, the numbers rnd are referred to below as being “random”, even where, strictly, they are “pseudo-random”. As long as they are stored for later recomputational purposes described below, even purely random numbers could be used, but in most cases this will be unnecessarily complicated. In
In this description of various examples of certain aspects of the invention, standard hash-function notation is used and different hash functions are evaluated. Thus, for example, “∥” is used to indicate concatenation. The invention does not require the hashing orders illustrated. For example, the system could just as well compute as mi=hash(rnd ∥xi-1) instead of as mi=hash(xi-1∥rnd), or xa,b=hash(la,b∥xz∥xb) instead of xa,b=hash(xa∥xb∥la,b), as long as any given implementation of the invention uses a chosen hashing order consistently, since hash functions typically do not have commutativity of input values.
Having built and signed such a tree, the hash chain from any leaf to the root can be extracted and presented as a proof that the leaf participated in the computation that yielded the signed root hash value.
For example, to prove that rec2 was part of the signed log block, rec2 itself, the sequence (right; m2); (right; x1); (left; x3,4) and the signature on the root hash value would be presented. Assume an input record is presented that purports to be the “real” rec2; in other words, at first it is a “candidate”. A verifier would then be able to re-compute:
r2=hash(rec2)
x2=hash(m2∥r2∥1)
x1,2=hash(x1∥x2∥2)
xroot=hash(x1,2∥x3,4∥3)
and then verify that the newly computed xroot matches the signature. If so, then the candidate input record is verified to the level of security of xroot. If xroot is then also digitally signed and verified, then the candidate input record itself will be verified to this level of security.
Note that the method described here—with blinding masks added into hash computations in a tree structure—differs from the technique known as “salting”, which associates a random number (the “salt”) one-to-one with a password/pass phrase; the salt+password combination is then hashed. The hash and the salt are then stored. As explained in http://en.wikipedia.org/wiki/Salt_(cryptography), “Salts . . . make dictionary attacks and brute-force attacks for cracking large numbers of passwords much slower (but not in the case of cracking just one password). Without salts, an attacker who is cracking many passwords at the same time only needs to hash each password guess once, and compare it to all the hashes.” In short, because salting typically involves a one-to-one association of some output based on a known function of a single password/passphrase plus a known (at least in the sense of being stored, with the possibility of being hacked) salt, there is still the possibility of reverse computation, albeit generally slower than otherwise. Since the salt and the hash output will be known, a brute force attack will still often be unacceptably feasible if the input data has lower entropy than the universe of passwords typically has.
In contrast, the hashing structure of this invention makes it in practice impossible to compute in reverse from the xi or xa,b value to any of the input records (except the records the attacker already possesses, obviously), because the attacker does not have access to the rnd value, even in implementations where a single rnd is applied for all entries in a given block.
In the discussion above, the shape of the Merkle tree is not specified. If the number of leaves is an even power of two, building a complete binary tree seems natural, but in other cases the appropriate shape is not necessarily obvious. The only requirement, however, is that the tree should be built in a deterministic manner so that a verifier is able to construct the exact same tree as the signer did. A practical consideration, however, is that to achieve the logarithmic size of the integrity proofs of the individual records, the tree should preferably not be overly unbalanced. Thus, one example of a canonical binary tree with n leaf nodes (shown for n=11 in
A useful property of canonical trees is that they can be built on-line, as the new leaf nodes arrive, without knowing in advance the eventual size of the tree, and keeping in memory only a logarithmic number of nodes (the root nodes of the complete trees constructed so far). Therefore, using the scheme outlined here, all the security goals are achieved, and almost the performance goals as well:
In this section, reference example procedures are presented, by way of example only, for aggregating a log block, extracting an integrity proof for an individual record and verifying a record based on such proof. Also discussed are some potential trade-offs where additional security benefits or runtime reductions could be gained at the cost of increased storage overhead. It is stressed that these example procedures are included merely to demonstrate to skilled programmers one way to implement the respective functions in one implementation of the invention. Such programmers will of course have their own design preferences as to details, arrangement of data structures, choice of programming languages, etc., without departing from the main idea of this invention.
Example Procedure 1 aggregates a block of records for signing or verification. The input description numbers the records 1, . . . , N, but the value of N is not used and the example procedure can easily be implemented for processing the records on-line. The amortized processing time per record is constant and the worst-case actual processing time per record is logarithmic in the number of records in the block, as is the size of the auxiliary working memory needed.
To sign a log block:
To verify a signed log block:
Although not strictly required, the last leaf hash value of the previous log block should preferably also be saved along with rnd and the signature in practice; otherwise, the verification process for the current block will need to re-hash the previous block to obtain the required input for the current verification. Assuming a consistent storage policy, for that, in turn, the next previous block would need to be re-hashed, etc. While this would obviously be inefficient, an even more dangerous consequence is that any damage to any log block would make it impossible to verify any following log blocks, as one of the required inputs for verification would no longer be available.
Considering the negative scenarios in more detail, the only conclusion that could be made from a failed verification is that something has been changed in either the log block or the authentication data. If it is desirable to be able to detect the changes more precisely, either the record hash values ri or the leaf hash values xi computed by Example Procedure 1 could be saved along with the other authentication data. Then the sequence of hash values can be authenticated against the signature and each record checked against its hash value, at the expense of small per-record storage overhead. It should also be noted that if the record hashes are saved, they should be kept with the same confidentiality as the log data itself, to prevent them being used for the informed brute-force attack that the blinding masks are to prevent.
Example Procedure 2 extracts the hash chain needed to prove or verify the integrity of an individual record. The core procedure is similar to that in Example Procedure 1, with additional tracking of the hash values that depend on the target record and collecting a hash chain based on that tracking.
Applying the choices in the example procedures above, the output value is a sequence of (direction, sibling hash, level correction) triples. The direction means the order of concatenation of the incoming hash value and the sibling hash value. The level correction value is included to account for cases when two sub-trees of unequal height are merged and the node level value increases by more than 1 on the step from the root of the lower sub-tree to the root of the merged tree. (The step from the lower three-ringed node to the higher one on
Also note that the need to access the full log file in this example procedure is not a compromise of confidentiality goals, since the extraction process may be executed by the owner of the log file and only the relevant log records and the hash chains computed for them by Example Procedure 2 are supplied to outside parties.
Example Procedure 3 computes the root hash value of the Merkle tree from which the input hash chain was extracted in one prototype of the invention. The hash chain produced by Example Procedure 2 and the corresponding log record will typically be fed into Example Procedure 3 and the output hash value verified against the signature to prove the integrity of the record.
In some implementations of the invention, users will be satisfied simply to verify an input event up to the level of the xroot. value associated with the respective block. Recall that, if (as is preferred but not strictly required) the previous value x0 is included in the computation of even the hash value for the first value x1 in a current block, then xroot.. will also encode information from all previous blocks. Digitally signing xroot.. with any standard signature may then also suffice, if desired at all. Nonetheless, a digital signing method is described below that ensures the integrity of xroot even within a potentially much larger framework so as to provide even greater security. In other words, although individual events can be verified within the structure 100, 200 illustrated in
The embodiment of the invention illustrated in
This section outlines some practical concerns regarding the implementation of the example of one embodiment of the invention for signing syslog or similar event messages. Skilled programmers will know how to choose suitable procedures for other implementations. As one example of many possible deployment scenarios, the example here concentrates on signing the output directed to a text file on a log collector device, which is discussed in Rainer Gerhards, The Syslog Protocol, RFC 5424, IETF, 2009.
Assume the log is modeled as an ordered sequence of blocks, where each block in turn is an ordered sequence of a finite number of records, and note that the case of syslog output being sent to a periodically rotated log file could be viewed as an instantiation of this model. The model is here refined to distinguish the physical blocks (the rotated files) from the logical blocks (implied by signing), because it is often desirable to sign the records with a finer granularity than the frequency of rotating the log files. For practical reasons, the system may allow a log file to contain several signed blocks, but prohibit a signed block from spanning across file boundaries. This means that when logs are rotated, the current signature block will always be closed and a new one started from the beginning of the new log file. The hash links from the last record of previous block to the first record of the next block do span the file boundaries, though, and thus still enable verification of the integrity of the whole log, however the files may have been rotated.
The invention can also be implemented for record-level log signing in multi-tenant environments, that is, environments in which two or more differently defined entities generate events that are logged in the same log. In such an environment, it is helpful to make a few general assumptions regarding the handling of logs. A first assumption is that logs will have records from different tenants interleaved and that these logs will need to be separated before delivery to the respective tenants. A second assumption is that in an interleaved log, the origin of each record will be clearly decidable. If this second assumption is violated, the log-separation problem will lack a well-defined solution, such that the question of signature separation is not applicable.
One property of the multi-tenant case is that the separation of the shared log into a set of interleaved threads is pre-determined: one can assume that the first step in any log processing is separation of records by tenant and after that each tenant will perform any further analysis on its own subset. Therefore, it could be beneficial to provide for a signing mechanism that protects the integrity of each thread as well as the integrity of the whole log. One possible solution, considering the small overhead of signing, would be to view each tenant's thread as a virtual log within the shared log, and then to link and sign the records in each of the threads in addition to the global thread containing all records in the log. Assuming roughly equal division of the N records of the shared log among K tenants, in addition to the log(N)-sized root list to be kept in memory and one signature to be archived for the long term for the whole log, the server would need to keep K additional log(N/K)-sized root lists and archive K additional signatures.
At the cost of leaking the number of tenants, the number of signatures could be reduced back to one by adding one extra aggregation layer (corresponding, in
As mentioned, the uppermost value xroot of the tree structure is then preferably digitally signed. Many schemes exist that would be suitable for signing (using time-stamping or otherwise) such data. Three of very many common and known methods are PKCS#7 or OpenPGP signature or a PKI-signed RFC3161 time-stamp.
Guardtime AS of Tallinn, Estonia, has developed a signing infrastructure that includes a distributed hash tree structure that provides exceptionally high reliability for authentication of digital records (defined essentially as any set of digital information) with no need for keys. See, for example, http://www.guardtime.com/signatures/technology-overview/for a summary of the Guardtime technology. Aspects of the Guardtime system are disclosed as well in U.S. Pat. Nos. 7,698,557; 8,347,372; and 8,312,528 (all “System and method for generating a digital certificate”). As mentioned, this invention does not require any particular signing scheme, but the Guardtime system is described here because of its particular advantages (among others, a high level of security, computational efficiency, substantially unlimited scalability, and not requiring keys) in general, and in the specific context of this invention in particular.
As
As
In the illustrated arrangement, a client is the system where digital records are prepared and entered into the verification/signature system. Viewed in the context of the invention shown in
A gateway in the gateway layer 3000 will typically be a computer system such as a server with which one or more of the clients communicates so as to receive requests for registration of each digital record that a client submits. In many implementations, a gateway will be a server controlled by an enterprise or some third-party provider, which may be a server known to and maybe even controlled by an organization to which the client user belongs, or a server accessed through a network such as the Internet. In short, a gateway may generally be any server located anywhere and configured to receive requests from clients for digital record registration. Gateway systems do not need to be of the same type; rather, one gateway might be a server within a company that employs many clients, whereas another gateway might be a server accessible online by arbitrary users. Of course, gateways could also be commercial systems, such that access for verification is granted only upon payment of a fee.
An aggregator in the aggregation layer 4000 will similarly be a computer system such as a server intended to receive registration requests that have been consolidated by respective gateways. Depending upon the scale and design requirements of a given implementation, any aggregator could also be controlled by the owner of the core, or the owner of the same systems as the gateways and clients, or could be provided by an entirely different entity, and in some cases it would also be possible to consolidate the aggregator and gateways for particular set of clients. For example, one design choice would be for the central system to include a set of aggregators as part of the “core” system, with lower-level, non-core aggregators submitting requests by communicating through the “core aggregators.” One could then locate core aggregators geographically, such as one or more aggregators in each of Europe, North America and Asia, to reduce latency or for administrative reasons.
As another example, large corporations or government entities might prefer to implement and benefit from the advantages of the infrastructure using only their own dedicated systems. Nearer the other end of the spectrum of possibilities would be that the gateways and aggregators could all be configured using “cloud computing” such that a user at the client level has no idea where any particular gateway or aggregator is located or who controls the servers. One of the advantages of this infrastructure is that digital input records can still be verified with near total security even in situations where users and others do not know if they can trust the systems in the gateway or aggregation layers 3000, 4000; indeed, it is not even necessary to trust the administrator of the core 5000 in order to have essentially total reliability of verification.
Consider the client system 2010-1, which will be whatever type of system that generates or inputs digital records that are to be registered for later verification. Just a few of the countless physical and software systems that may create digital inputs records, and that can be client systems in the sense of this invention, are a physical or virtual computer, a telecommunications device such as a mobile phone, hybrids of these two classes of devices, other computer-supervised machines for which state changes or other activities are logged (for example, flight data recorders or industrial processes), as well as pure software entities that have logged activities.
In one implementation, each client system that wishes to use the verification infrastructure is loaded with a software package or internal system routines for convenient or even automatic communication and submission “upwards” of digital information. The software package may include some application program interface (API) 2014 that transforms submitted digital records into a proper form for processing. A digital record 2012 created, selected, or otherwise input in any way is then submitted by way of the API 2014 to a software module 2016 that uses the digital data from the record 2012 as at least one argument in a transformation function such as a hash function.
In implementations of the invention designed for verifying event logs, the “client” will typically be a routine within the client system itself capable of extracting and submitting all or any desired portion of an event log as the input record to be signed and verified. In some cases, however, the event log may be separated or even remote from the system that receives or extracts the events or event log. For example, assume that the events relate to interactions between a mobile phone, tablet computer, etc., and a central telephone or wireless network system, or other types of system state changes of these devices. Examples of such events/state changes might be starting and shutting down the device, initiating and ending calls, transmitting or receiving sms messages or email, accessing the internet, moving from one cellular zone to another, receiving software updates, etc. Since these events are also detectable in the central exchange run by the service provider, events may be logged centrally and entered into the verification system either instead of or in addition to in and by the device itself.
Cryptographic hash functions are very well known in many areas of computer science and are therefore not described in greater detail here. Just one of many possible examples of a common class of hash functions that are suitable for use in this infrastructure are the “Message Digest” (MD) hash functions, which include the MD2, MD3, MD4, MDS, . . . functions and the various “secure hash algorithm” family (SHA-1, SHA-2, etc.).
Since the xroot value itself is the result of evaluation of the masking hash tree 200, it will in many implementations not be necessary to further hash it within the client. Additional hashing within the client may be desired, however to include additional information depending on the design protocol of the infrastructure. Just a few of the many possible arguments the system designer might optionally choose to include as arguments of the additional hash function 2016 are an identifier of the person or entity requesting registration, an identifier of the particular client system being used, a time indication, information relating to the geographic location of the client or other system, or any other information desired to be incorporated as part of the registration request. Since the transformation function 2016 will generally (but not necessarily—again, more complicated schemes may be used as long as corresponding bookkeeping for the required data structures is implemented and maintained) output a single number or vector 2018 regardless of the number of input parameters; later authentication through recomputation will succeed as long as the function 2016 is known. A software module 2020 is preferably included to transmit the output of the transformation 2016 to higher layers of the infrastructure as a request (REQ), along with any other parameters and data necessary to communicate with a gateway and initiate the registration request.
It is assumed in this discussion that the transformation function 2016 is a hash function because this will be the most common and efficient design choice, and also because the properties of hash functions are so well understood; moreover, many different hash functions are used in the field of cryptology, security, etc., within commodity computers. One other advantageous property of hash functions is that they can reduce even large amounts of digital information to a size that is more easily processed, with a statistically insignificant chance of two different inputs leading to the same output. In other words, many well-known hash functions will be suitable for use throughout the infrastructure of this infrastructure, and can be chosen using normal design considerations. Nonetheless, the function that transforms digital records into a form suitable for submission as a request need not be a hash function as long as its properties are known. For example, especially for small digital records, it may be more efficient simply to transmit the digital record data as is, in its entirety or some subset; in this case, the transformation function may simply be viewed as an identity function, which may then also append whatever other additional information is needed according to the core system administration to form a proper registration request.
The data structure of a binary hash tree is illustrated within the gateway 3010-2. Each of the lowest level nodes will correspond to the transformed dataset 2018 (which may be either xroot as is, or some augmented function of xroot) submitted as a request from a client, along with any other parameters or data used in any given implementation to form a request. As illustrated, the values represented by each pair of nodes in the data structure form inputs to a parent node, which then computes a combined output value, for example, as a hash of the two input values from its “children” nodes. Each thus combined output/hash value is then submitted as one of two inputs to a “grandparent” node, which in turn computes a combined output/hash value for these two inputs, and so on, until a single combined output/hash value is computed for the top node in the gateway.
Aggregators such as the system 4010-1 similarly include computation modules that compute combined output values for each node of a hash tree data structure. As in the gateways, the value computed for each node in the aggregator's data structure uses its two “children” nodes as inputs. Each aggregator will therefore ultimately compute an uppermost combined output value as the result of application of a hash function that includes information derived from the digital input record(s) of every client that submitted a request to a gateway in the data structure under that aggregator. Although it is of course possible, the aggregator layer 4000 does not necessarily need to be controlled by the same system administrator that is in charge of the core layer 5000. In other words, as long as they are implemented according to the required protocols and use the correct hash functions (or whatever other type of function is chosen in a given implementation), then the client, gateway, and aggregation layers may be configured to use any type of architecture that various users prefer.
In one embodiment, the core 5000 is maintained and controlled by the overall system administrator. Within the core, a hash tree data structure is computed using the root hash values of each aggregator as lowest level inputs. In effect, the hash computations and structure within the core form an aggregation of aggregation values. The core will therefore compute a single current uppermost core hash value at the respective tree node 5001 at each calendar time interval t0, t1, . . . , tn. This uppermost value is referred to here alternatively as the “calendar value” or “current calendar value” for the time interval. Note that the time origin and granularity are both design choices.
Note that the uppermost tree node 5001 represents the root node of the entire tree structure of nodes junior to it. As is explained later, this will change upon recomputation of a new uppermost core hash value at the end of the next period of accumulating requests and generating signature vectors (“also referred to as “data signatures”) containing recomputation parameters. Other arrangements would, however, be possible. For example, to reduce or eliminate single-point-of-failure possibility, it would be possible for requests to be sent upward to and hashed into multiple aggregators as long as some mechanism is included to arbitrate between and/or consolidate the then multiple root hash values that include the lower level's root hash value.
In
In
The core may return the data signature vector 8000 to clients and/or other layers directly, or it can be constructed or passed “downward” as a return. For example, when the core computes the current calendar 5001 at the new calendar time interval, it may return to aggregator 4010-1 its sibling (X-marked) lowest core node value from aggregator 4010-k, and the aggregator 4010-1 can then return downwards the X-marked hash values to the gateway 3010-2, which in turn can return downwards to the client 2010-1 all of the above, plus the X-marked hash values computed within that gateway's hash tree structure. In other words, not only may the hash computation infrastructure be distributed over various layers (vertically) and also “horizontally” at each layer, but the responsibility for communicating requests upward and partial or entire signature vectors downwards can also be distributed and can be carried out simultaneously in many different locations. Of course, since a data signature is unique to the digital record that led to it, the procedure for returning a signature vector for each input digital record 2012 for client 2010-1 (note that a single client may input more than one digital record for verification in each time interval) is preferably duplicated for all digital input records received in the time interval over which values were accumulated for the computation of node value 5001.
Note that the nature of the distributed infrastructure shown in
In most cases, it is unlikely that the number of clients during a given computation interval will be exactly equal to a power of 2. Any known method may be used to adapt to the actual number of clients while still maintaining a binary hash tree structure throughout. As just one example of a solution to this, known dummy values may be used for all of the “missing” sibling node values. Alternatively, it is also possible to adjust the hash tree branches accordingly, in the manner of giving “byes” in single-elimination sports tournaments.
In one embodiment, the gateways 3000 may be more local to various clients whereas the aggregators are more regional. For example, it would be possible to locate aggregators in different parts of the world not only to distribute the workload, but also to increase throughput. Although it appears in
Assume now by way of example that some entity later wishes to verify that a digital record in question—a “candidate digital record”—is an identical copy of digital record 2012. Applying the same transformation function 2016 to the candidate digital record and recomputing upward using the corresponding data signature 8000, the entity should compute to the exact same calendar value that resulted from the original digital record's registration request. In some implementations, this level of verification is sufficient. As one possible example, if the calendar is distributed to enough independent aggregators, then if one malicious actor were to tamper with some calendar value, this could be detected if some procedure is implemented to compare with other copies of the same calendar.
As another example, in some implementations, users may choose or be obligated to rely on the security of the administrator of the core. In particular, government entities might implement a system in which users must simply rely on the government administrators. In these cases, recomputation up to the corresponding calendar value may be considered sufficiently reliable authentication. In the context of this infrastructure, this can be viewed as “first-level” verification. One hypothetical example of where such a system might be implemented would be where a government agency requires companies, laboratories, etc. to submit a copy of its calendar to the government entity every time the company's system updates its calendar. The government would then be able to audit the company's records and verify the authenticity of any given digital record by recomputing up to the proper calendar value, which the government will have stored. In practice, this would amount to requiring the company to keep updated a “calendar audit trail” with the auditing entity (such as the government).
Even in other instances, as long as the highest level system administrator trusts its ability to securely store calendars, it could be satisfied that a candidate digital record is authentic if recomputation leads to the appropriate stored calendar value. In a sense, it would be the system administrator itself in such cases that is looking for proof of the authenticity of candidate digital records as opposed to clients or other third-party entities. Consequently, the system administrator could trust the security of the recomputation and calendar values to the same extent it trusts itself to maintain the calendar copies.
All but the last digital record requesting registration in a calendar time period will typically need to wait for all other requests in the calendar time interval to be processed before a calendar value will be available that will enable authenticating recomputation. If the calendar time interval is kept short enough, this delay may be acceptable. To increase the level of security during the delay, it would also be possible to implement an option, whenever a client submits an authentication registration request, to generate and return not only the data signature vector but also a key-based signed certificate, which may be issued by any higher layer system such as the current gateway, aggregator, or even core.
Because of the various data structures and procedures of the distributed infrastructure, the published composite calendar value may encode information obtained from every input digital record over the entire publication time interval, and if the current calendar value for the current calendar period is hashed together with the previous one, which is hashed with the one before it, and so on, as shown in
This invention involves an extension to this scheme: additional hash nodes comprising blinding masks are generated as random or pseudo-random numbers and are included in hash computations, preferably in the core layer, but optionally in other layers instead or in addition. These additional node values (randomly generated numbers) can then be included in a returned data signature just as is they were any other node value, thereby enabling recomputation.
In
Although it may in many cases be desirable or even required for the published value to encode information from the entire calendar from the beginning of calendar time, other alternatives can also be implemented as long as suitable bookkeeping routines are included. For example, rather than include all calendar values in the Merkle tree, at each publication time all of the most recent calendar values could be included in the publication computation along with a random sampling of calendar values from previous intervals. This would be one way, for example, to ensure that the number of included calendar values is conveniently a power of 2.
Similarly, in some contexts, government authorities require proof of records extending back only for some given time such as three years. In such cases it might be advantageous always to include only calendar values generated during this required period such that only relevant digital records are encoded in the most recent publication value.
Another alternative would be for there to be only a single computation of the publication value, including all calendar values from the beginning of system time. This might be useful, for example, in projects with clear time or digital record limits. For example, in litigation or transactions, parties often submit digital records to a “data room” for easy exchange. Calendar values could then be generated periodically as in other cases (perhaps with a longer calendar time interval since digital records will generally not be submitted as frequently as in large-scale, universally accessible implementations of the infrastructure's infrastructure), but with only a single computation of a publication value when all parties agree to close the data room. The publication value would then be a form of “seal” on the body of submitted digital records, which could later be used for recomputation and verification of any digital record ever submitted into the data room.
It is not absolutely necessary for the publication value to be computed using the Merkle hash tree data structure illustrated in
Notice that a recomputation vector can also be associated with each event input e(i) so as to allow recomputation from its value, up through the blinding mask hash tree (in the illustrated case, a Merkle tree) to xroot. Example Procedure 3 is one example of how this can be done within the computation module 200. As an example of this, assume that the vector (a form of “local” data signature for the tree structure 200) {(left, m2), (left, x1), (right, x3, 4)} is associated with rec2, which corresponds to e(k+2). Given e(k+2) and this information, the component 200, acting now as a verification engine, can compute r2 by hashing rec2 (e(k+2)). Hashing m1 ∥ r1 will then yield x1, hashing x1∥x2 will yield x1,2 and then hashing x1,2∥x3,4 will yield xroot, but only if the value of e(k+2) used in this recomputation is in fact totally identical to the one that led to computation of xroot originally. Observe that in this recomputation to verify e(k+2), it is not necessary to know the value of any other event e(j), and in fact any attempt to try to compute backwards to any other event value would require the practically impossible—backwards computation through one or more hash functions, whose input is high-entropy by virtue of the blinding mask.
In most anticipated implementations of this invention, there will typically be a large number of events e(j) in each block B. It would be possible to compile and associate a full digital signature vector that includes direction information and required sibling node values (plus hash function identifiers if these different hash functions are used within the overall infrastructure) all the way from the event value e(j) up to the calendar value 5001. This would, however, in most cases, require unacceptably great storage and computation burdens and would also be unnecessary. Rather, the preferred implementation is to digitally sign only the xroot. value for each block, and then maintain internal signature vectors for entries within a block for verifying recomputation up to xroot—if each xroot.is globally verified, then it is sufficient to verify entries up to its level only.
It is not a requirement for systems in any given layer to apply the same hash functions. For example, the transformation functions used in different client systems could be different. As long as the functions at each place in the recomputation path are known to whoever later wants to authenticate a digital record through recomputation, the authentication process will work properly. Adding a hash function identifier as an input parameter to the preparation of the registration request would be one convenient way to enable future users to correctly authenticate a digital record through recomputation.
Throughout this description, reference is made to computing values by applying various functions such as hash functions. For example, in
This application claims priority of U.S. Provisional Patent Application No. 61/768,386, which was filed 22 Feb. 2013.
Number | Date | Country | |
---|---|---|---|
61768386 | Feb 2013 | US |