METHODS AND SYSTEMS FOR SOMEWHAT HOMOMORPHIC ENCRYPTION AND KEY UPDATES BASED ON GEOMETRIC ALGEBRA FOR DISTRIBUTED LEDGER/BLOCKCHAIN TECHNOLOGY

Information

  • Patent Application
  • 20220045840
  • Publication Number
    20220045840
  • Date Filed
    August 10, 2021
    3 years ago
  • Date Published
    February 10, 2022
    2 years ago
Abstract
Disclosed are methods and systems to encrypt data with SomeWhat Homomorphic Encryption (SWHE) properties for submission to a distributed ledger/blockchain that allows further open operations retained in the distributed ledger/blockchain on the encrypted data that will be properly reflected when the encrypted result is decrypted by the data owner. The somewhat homomorphic properties include addition and scalar division. Also disclosed is an ability to update a secret key applied for a ciphertext such that a single piece of data may be provided on the distributed ledger/blockchain by a data owner to a new data owner without also exposing other data encrypted with the original secret key of the original data owner.
Description
BACKGROUND OF THE INVENTION

The advancement of science is possible when knowledge is shared and information is exchanged in a seamless manner. In a world where many businesses rely on information as their main assets, analysis over data is a crucial competitive advantage. Consequently, the amount of data processed and stored will continue to increase, creating a demand for virtualized services. To this end, some applications can be provided as cloud computing resources including Internet of Things (IoT), machine learning, virtual reality (VR) and blockchain. As a result, concerns about custody and privacy of data are on the rise.


Modern concealment/encryption employs mathematical techniques that manipulate positive integers or binary bits. Asymmetric concealment/encryption, such as RSA (Rivest-Shamir-Adleman), relies on number theoretic one-way functions that are predictably difficult to factor and can be made more difficult with an ever-increasing size of the encryption keys. Symmetric encryption, such as DES (Data Encryption Standard) and AES (Advanced Encryption Standard), uses bit manipulations within registers to shuffle the concealed text/cryptotext to increase “diffusion” as well as register-based operations with a shared key to increase “confusion.” Diffusion and confusion are measures for the increase in statistical entropy on the data payload being transmitted. The concepts of diffusion and confusion in encryption are normally attributed as first being identified by Claude Shannon in the 1940s. Diffusion is generally thought of as complicating the mathematical process of generating unencrypted (plain text) data from the encrypted (cryptotext) data, thus, making it difficult to discover the encryption key of the concealment/encryption process by spreading the influence of each piece of the unencrypted (plain) data across several pieces of the concealed/encrypted (cryptotext) data. Consequently, an encryption system that has a high degree of diffusion will typically change several characters of the concealed/encrypted (cryptotext) data for the change of a single character in the unencrypted (plain) data making it difficult for an attacker to identify changes in the unencrypted (plain) data. Confusion is generally thought of as obscuring the relationship between the unencrypted (plain) data and the concealed/encrypted (cryptotext) data. Accordingly, a concealment/encryption system that has a high degree of confusion would entail a process that drastically changes the unencrypted (plain) data into the concealed/encrypted (cryptotext) data in a way that, even when an attacker knows the operation of the concealment/encryption method (such as the public standards of RSA, DES, and/or AES), it is still difficult to deduce the encryption key.


Homomorphic Encryption is a form of encryption that allows computations to be carried out on concealed cipher text as it is concealed/encrypted without decrypting the cipher text that generates a concealed/encrypted result which, when decrypted, matches the result of operations performed on the unencrypted plaintext.


The word homomorphism comes from the ancient Greek language: ktoc (homos) meaning “same” and μoρφ{acute over (η)} (morphe) meaning “form” or “shape.” Homomorphism may have different definitions depending on the field of use. In mathematics, for example, homomorphism may be considered a transformation of a first set into a second set where the relationship between the elements of the first set are preserved in the relationship of the elements of the second set.


For instance, a map f between sets A and B is a homomorphism of A into B if






f(a1 op a2)=f(a1) op f(a2)|a1, a2 ∈ A


where “op” is the respective group operation defining the relationship between A and B.


More specifically, for abstract algebra, the term homomorphism may be a structure-preserving map between two algebraic structures such as groups, rings, or vector spaces. Isomorphisms, automorphisms, and endomorphisms are typically considered special types of homomorphisms. Among other more specific definitions of homomorphism, algebra homomorphism may be considered a homomorphism that preserves the algebra structure between two sets.


SUMMARY OF THE INVENTION

An embodiment of the present invention may comprise a method for performing somewhat homomorphic operations on encrypted data in a distributed ledger/blockchain system without decrypting the encrypted data and where data resulting from the somewhat homomorphic operations remains encrypted, the method comprising: generating off-chain by a data owner node device a secret/private key sk and a public evaluation key pk, wherein the secret/private key sk is comprised of a first key multivector (K1), a second key multivector (K2), and an integer g, and wherein the public evaluation key pk is comprised of a prime number q; encrypting off-chain by the data owner node device an integer message m as a function of at least one Geometric Algebra geometric product operation of the first key multivector (K1), the second key multivector (K2), and a message multivector (M), and a scalar multiplication operation with integer g to obtain message ciphertext multivector (C), wherein the message multivector (M) is a representation of the integer message m; submitting on-chain by the data owner node device the message ciphertext multivector (C) as a transaction for the distributed ledger/blockchain system; performing on-chain by a calculation node device at least one somewhat homomorphic operation with the message ciphertext multivector (C) to calculate a result ciphertext multivector (CR), wherein a smart contract for the distributed ledger/blockchain defines available somewhat homomorphic operations and wherein the calculation node device is part of the same distributed ledger/blockchain system as the owner node device; and decrypting off-chain by the data owner node device the result ciphertext multivector (CR) as a function of at least one Geometric Algebra geometric product operation of an inverse of the first key multivector (K1−1), an inverse of the second key multivector (K2−1), the result ciphertext multivector (CR), and a scalar division operation with integer g to obtain a result multivector (R), wherein the result multivector (R) is a representation of a numeric result r.


An embodiment of the present invention may further comprise a distributed ledger/blockchain system that performs somewhat homomorphic operations on encrypted data without decrypting the encrypted data and where data resulting from the somewhat homomorphic operations remains encrypted, the distributed ledger/blockchain system comprising: a data owner node device, wherein the data owner node device further comprises: a key generation subsystem that generates, off-chain, a secret/private key sk and a public evaluation key pk, wherein the secret/private key sk is comprised of a first key multivector (K1), a second key multivector (K2), and an integer g, and wherein the public evaluation key pk is comprised of a prime number q; an encryption subsystem that encrypts, off-chain, an integer message m as a function of at least one Geometric Algebra geometric product operation of the first key multivector (K1), the second key multivector (K2), and a message multivector (M), and a scalar multiplication operation with integer g to obtain message ciphertext multivector (C), wherein the message multivector (M) is a representation of the integer message m; a ciphertext submission subsystem that submits, on-chain, the message ciphertext multivector (C) as a transaction for the distributed ledger/blockchain system; and a decryption subsystem that decrypts, off-chain, a result ciphertext multivector (CR) as a function of at least one Geometric Algebra geometric product operation of an inverse of the first key multivector (K1−1), an inverse of the second key multivector (K2−1), the result ciphertext multivector (CR), and a scalar division operation with integer g to obtain a result multivector (R), wherein the result multivector (R) is a representation of a numeric result r; and, a calculation node device, wherein the calculation node device further comprises: a somewhat homomorphic operation calculation subsystem that performs, on-chain, at least one somewhat homomorphic operation with the message ciphertext multivector (C) to calculate a result ciphertext multivector (CR), wherein a smart contract for the distributed ledger/blockchain defines available somewhat homomorphic operations and wherein the calculation node device is part of the same distributed ledger/blockchain system as the owner node device.





BRIEF DESCRIPTION OF THE DRAWINGS

In the drawings,



FIG. 1 is a block diagram of the hardware implementation for a SomeWhat Homomorphic Encryption (SWHE) distributed ledger/blockchain embodiment.



FIG. 2 is a flow chart of encryption/decryption and somewhat homomorphic calculations for an embodiment.



FIG. 3 is a flow chart of a key update operation for an embodiment.





DETAILED DESCRIPTION OF THE EMBODIMENTS

The various embodiments aim to address the challenge of expanding Blockchain Technologies (BT) by implementing a somewhat homomorphic encryption scheme that not only enables computation on encrypted data but also yields a key update protocol with which one can selectively reveal consolidated data from a blockchain application. Constructions of the various embodiments are meant to be compliant with the fundamental requirements of BT, including ownership control and non-repudiation. In isolation, BT and homomorphic encryption (HE) can both suffer from performance issues. Combining the two only escalates that risk. We rely on Clifford Geometric Algebra as the single algebraic structure for introducing efficient solutions for merging BT with HE. One target application considers a trusted environment with pre-screened parties, which allows the various embodiments to consider cryptographic solutions based on relaxed notions of security. One possible means of implementation is to encode the various embodiments using the Ruby language.


I. Introduction


Blockchain may be offered as a virtualized resource—Blockchain as a Service (BaaS). Blockchain has been used to reduce costs and the complexity of management, but blockchain raises concerns about the custody of data and the classical Trust Model.


Blockchain is a distributed ledger where the state lies on a linked list of interdependent blocks, persisted under consensus. Blockchain defines a conjunction of technologies behind Bitcoin, where anonymous parties would join a network without permission, so blockchain was initially considered permissionless. The replication of the information amongst participants was onerous. So new initiatives approached the state replication paradigm in a more efficient way, increasing throughput by different consensus mechanisms. Companies then employed the new Distributed Ledger Technology (DLT) concept to promote cooperation, on a premise of identified nodes (i.e., permissioned).


Private blockchains provide immutability and nonrepudiation, but blockchain has restrictive analysis over encrypted data when it is done without segregating information, participants or third-party trusted architectures. Public key cryptography is used in blockchain operations, but its limitations towards computability spawned a new race for Homomorphic Encryption (HE) schemes. Additionally, adversarial behaviors can arise in DLT environments from semi-honest partners, such as the use of shared data to leverage a commercial advantage. HE offers the ability to correctly evaluate encrypted data allowing the outsourcing of computation without loss of privacy. Thus, parties can agree on common scripts implementing HE prior to the operation over data assets.


A. The Problems


The various embodiments may be comprised of functional blocks, each of which may be tailored as described in more detail below according to objectives for scope, capability and security. The following sections provide a mathematical and numerical description of these functional blocks.


Exclusivity is ingrained in the meaning of data property and no repossession lawsuit can restore ownership of digital data once the digital data is shared. Moreover, a legal contract cannot prevent misbehavior, being only a prior agreement on posterior punitive actions. Consequently, under the expectancy of misconduct, companies may avoid cooperation even when contracts provide legal protection.


Conversely, a smart contract defines the behavior prior to an accordance between parties, resembling a legal preventive action that avoids undesired executions. If combined with HE, it can realize blind computations (i.e., big data analytics) where the entity performing the calculations does not know the results of the calculations without gaining permission from the data owner. Additionally, if improved with a homomorphic key update, it can offer a mechanism to transfer ownership without leaving the trusted environment.


Cloud computing became a very expensive structure to reproduce in-house and the market made providing cloud computing a premise to stay competitive. Therefore, blockchain, with its ever-growing database and inherent complexity, has an appeal to be used as a third-party service for storage and management. However, blockchain's philosophy is built on the concept of cryptographic proof instead of trust, which creates a conundrum and weakens the ability of the technology to remain trustworthy since cloud suppliers work under the assumption of reputation and legal agreements. Another competence of blockchains is the capability to share digital assets avoiding power imbalance between parties. Nevertheless, fearing a loss of ownership, companies may restrain themselves from sharing sensitive data that could favor analysis and leverage a commercial relationship or research effort.


In summary, cloud services bring uncertainty in the treatment of Confidentiality, Integrity, and Availability (CIA), whereas partnerships can be restrictive due to lack of trust or regulatory concerns. Therefore, the problems that the various embodiments are addressing in this disclosure includes the following issues.


Problem 1: Given an immutable ledger on a permissioned blockchain setting, provide an efficient privacy preserving smart contract for computing arithmetic functions over encrypted data without violating the principles of ownership (the legal right of data access) and non-repudiation (the assurance that one cannot deny the validity of the data).


Problem 2: Given a smart contract that solves Problem 1, provide the ability to transfer ownership of homomorphically encrypted data without revealing to non-owners anything that is supposed to be known only by data owners.


One of the motivations for Bitcoin was the avoidance of a single point of failure. Avoiding a single point of failure may mean the absence of a computational node due to censorship or a dishonest behavior from a participant. Therefore, publicizing transactions was a means to verify integrity, although secrecy was not a concern. User's anonymity was provided by public key encryption and cryptography took care of the trust for the model.


Furthermore, companies realized that their commercial relationships would benefit from the provenance given by the immutable tracking of events. Also, company partnerships are mediated by legal contracts, facilitating a transition to electronic scripts—smart contracts—when transacting digital assets. Therefore, the applicability of cryptography expands and problems such as data ownership become the main concern. On-chain solutions sometimes are based on the segregation of information or just under-covered data or scripts, not allowing computation or any kind of analytical result over encrypted assets. On the other hand, off-chain implementations can violate the credibility that is only earned when every operation is performed within public sight.


For the various embodiments, two main guidelines must be preserved in order to favor CIA and DLT core principles. First, any sensitive data must be encrypted or decrypted off-chain, in possession of the owner. Second, any operation over an encrypted asset must occur on-chain and the algorithm must be known by the parties and consequently agreed upon before execution. Finally, the script implementing the mathematical operations must be sufficiently efficient to not overload the performance of the consensus mechanism at hand.


C. Clifford Geometric Algebra


Clifford geometric algebra is known by the richness, robustness and flexibility of its algebraic structure, which allows us to take advantage of concepts from several different branches of mathematics such as vector and matrix spaces, integer, rational and complex arithmetic, all in a single compact system. Clifford Geometric Algebra (herein simplified to GA) is a very powerful mathematical system. Some advantages commonly associated with GA computing include compactness of algorithms, implicit use of parallelism and high runtime performance and robustness. In working on the various embodiments it was further noted that three major benefits of working with GA based would be: (1) the ability of working with notions from several different branches of mathematics in a single framework (i.e., modular arithmetic, complex arithmetic, matrix algebra, etc.); (2) how much may be accomplished by even a very small set of computationally inexpensive algebraic tools; and (3) the simplicity of the construction itself, which favors understanding, maintenance and analysis.


An embodiment may advantageously utilize Geometric Algebra to provide the encryption and decryption of numeric messages that may be stored and/operated on within the distributed ledger/blockchain. The use of Clifford Geometric Algebra (aka. Geometric Algebra) to provide the encryption and decryption provides the mathematical basis for the homomorphic operations of an embodiment.


Geometric Algebra is an area of mathematics that describes the geometric interaction of vectors and other objects in a context intended to mathematically represent physical interactions of objects in the physical world. As used herein, this area of mathematics encompasses Geometric Algebra, Conformal Geometric Algebra and Clifford Algebra (referred to collectively herein as “Geometric Algebra” or “GA”). Generally, Geometric Algebra defines the operations, such as geometric product, inverses and identities, which facilitate many features of the various embodiments disclosed herein. Further, Geometric Algebra allows for the organization and representation of data into the “payload” of a multivector where the data in the payload may represent, for example, plaintext, ciphertext, or identifying signatures. Consequently, the various embodiments make beneficial use of Geometric Algebra properties to provide encryption, decryption, and homomorphic operations in a relatively computationally simplistic manner while still providing robust security for both data in motion and data at rest (e.g., data stored in the Cloud).


It may be demonstrated that through multivector decompositions and a small subset of operations in the Clifford Geometric algebra it is possible to propose new methods for general-purpose data representation and data encryption with multivectors. The methods of the various embodiments may be used as part of the necessary reconciliation of data availability and privacy preservation. This is important because once data is encrypted, one cannot meaningfully process it, unless the encryption function is homomorphic with respect to one or more operations. Therefore, homomorphism is a key concern in constructions of the various embodiments since there is particular interest in encryption schemes that allow homomorphic computations over concealed data.


Some fields of applications are inherently complex, as is the case for blockchain technologies and cryptography. The combination of blockchain and cryptography could easily increase the associated complexity exponentially should one fail to take into account the additional complexity from any particular tool or approach. In scenarios like combining blockchain and cryptography, it seems critical to consider solutions that are simple to implement, but are still powerful, so one can achieve much without necessarily adding complexity. For the various embodiments GA seems to be an appealing candidate for providing an efficient cryptographic protocol that aims to expand blockchain capabilities without violating its rigid, but necessary, constraints.


Favoring the quick distinction of a multivector from any other data structure, we use capital letters with an overbar as in M. We let the Clifford signature custom-character(3, 0) generate a geometric product space here denoted by custom-characterq3. A multivector is given by M=m0ē0+m1ē1+m2ē2+m3ē3+m12ē12+m13ē13+m23ē23+m123ē123. The four grades of a multivector may be referred to as the scalar part custom-characterMcustom-character0=m0ē0, the vector part custom-characterMcustom-character1=m1ē1+m2ē2+m3ē3, the bivector part custom-characterMcustom-character2=m12ē12+m13ē13+m23ē23, and the trivector or pseudoscalar part custom-characterMcustom-character3=m123ē123. An example of a three-dimension (3D) multivector Ā that includes a scalar, a vector, a bivector, and a trivector is:






Ā=a
0
+a
1
ē
1
+a
2
ē
2
+a
3
ē
3
+a
12
ē
12
+a
13
ē
13
+a
23
ē
23
+a
123
ē
123


where ēi is a unit vector along the i-axis and ē12 represents the orientation of the area created by a12. Notably, a Geometric Algebra multivector in N-space (i.e., a N-dimension multivector) has 2N coefficients whereas a standard N-dimension vector has only N coefficients. Accordingly, the Geometric Algebra multivectors provide a sense of size, direction, and volume while a standard vector would only provide a sense of size and direction. As the concepts involved in Geometric Algebra are part of a deep and rich mathematical file, some general observations may be helpful to the description of the various embodiments disclosed herein, below. First, each of the ai values in the multivector Ā above may be “packed” with information and each ai value may range from zero to very large (e.g., >256,000 bits or an entire message). Secondly, the inverse of Ā when multiplied by Ā yields unity, or:






ĀĀ
−1=1


Thus, if a second multivector B is created and the geometric product ĀB is transmitted, then the destination can recover B through:






ĀĀ
−1

B=B



Computations on the coefficients of M will be reduced to a given modulus q. This space reduced modulo q is denoted by custom-characterq3 and, thus, we write Mcustom-characterq3. The multivector involutions reverse and Clifford conjugation are denoted by M and M, respectively. The inverse of M−1 is computed as:











M
_


-
1


=





M
_

_



(


M
_




M
_

_


)






(


M
_




M
_

_


)



(


M
_




M
_

_


)









Eq
.




1







such that MM−1=1. For compactness, we denote a mod b by |a|b. The expression |└a/c┘|b reads “the floor division of a by c mod b”. The multiplication of two multivectors Ā, Bcustom-characterq3 follow the standard geometric product definition in Cl(3, 0) and have added that the computation for all coefficients is now reduced modulo q. The scalar multiplication for Āα for α ∈ custom-characterq is computed by multiplying each coefficient of Ā by α modulo q. The scalar division A/a is computed by multiplying each coefficient of Ā by x=α−1 mod q where x is the modular multiplicative inverse of α with respect to q such that αx=1 mod q. Thus, the following notations are equivalent: α−1 mod q=|a−1|q=|1/α|q.


As for the basic operations in custom-characterq3, similar to the operations of a vector space, one can add, subtract, scalar multiply and scalar divide multivectors component-wise. Multiplication of multivectors is achieved with the geometric product, which is given by ĀB=Ā·B+Ā{circumflex over ( )}B, where Ā·B is the Clifford dot product and Ā·B is the Clifford wedge product.


D. Homomorphisms


Given two messages a, b, a function f is homomorphic with respect to a given operation ∘ if f(a∘b)=f(a)∘f(b). When we represent the messages a, b as the multivectors Ā, B, we say that the function of this representation will be homomorphic with respect to ∘ if f(Ā∘B)=f(Ā)∘f(B). Homomorphic encryption is a form of encryption that allows computations to be carried out on cipher text as it is encrypted without decrypting the cipher text that generates an encrypted result, which, when decrypted, matches the result of operations performed on the unencrypted plaintext.


The essential purpose of homomorphic encryption is to allow computation on encrypted data without decrypting the data in order to perform the computation. In this way, the encrypted data can remain confidential and secure while the encrypted data is processed for the desired computation. Accordingly, useful tasks may be accomplished on encrypted (i.e., confidential and secure) data residing in untrusted environments. In a world of distributed computation and heterogeneous networking, the ability to perform computations on encrypted data may be a highly desirable capability. Hence, finding a general method for computing on encrypted data is likely a highly desirable goal for cryptography.


A sought-after application of homomorphic encryption may be for distributed ledger/blockchain systems. Encrypting blockchain stored data may mitigate the threat of data being compromised by a breach, but then the owners of the data would not then be able to perform operations (i.e., add, scalar divide, etc.) on the blockchain stored data. In order to perform operations on encrypted data stored in the blockchain, it would be necessary to download the encrypted blockchain stored data, recover/decrypt the data, perform all desired operations on the data locally, encrypt the resulting data and send the resulting data back to the blockchain. Alternatively, if a user wants another blockchain node to perform the computations, the other node would require access to the user's encryption/security keys. It is becoming increasing undesirable to provide others access to a user's security keys as the more entities that have access to the security keys inherently increases the susceptibility of the security keys to being breached, or even stolen by an unscrupulous user. Homomorphic encryption would allow the blockchain to operate on encrypted data without decryption, and without access to the client's security keys.


For the various embodiments, the “payload” may be packed in the values of the scalars and coefficients of the multivector elements. The packing method may define, among many things, the Geometric Algebra operations permissible for an embodiment. For example, the Rationalize operation on multivectors yields zero when all multivector coefficients are equal. Such multivectors having all equal coefficients have no inverse and the geometric product of such multivectors having all equal coefficients with another multivector has no inverse. Different aspects of the various embodiments, including the decryption methodology that utilizes the inverse of the security key(s) multivector to perform the decryption. Therefore, to avoid problems when performing an inverse operation, the various multivectors being utilized in the various embodiments should not have all equal value coefficients, unless specifically identified as being meant to be non-invertible.


II. Target Definitions


Before introducing our specifics of the constructions of the various embodiments to address the problems discussed in Section I-A, a definition of the general syntax and notions aimed to be achieved is presented. This is useful for many reasons including the ability if the desired goals are achieved, but also how well the goals are achieved.


A. SWHE Scheme


The syntax of a SomeWhat Homomorphic Encryption (SWHE) scheme is defined as follows:


Definition 1: A SWHE scheme denoted as:





Π=(Gen, Enc, Dec, Add, SDiv)   Eq. 2


is a tuple of efficient (i.e., probabilistic polynomial-time) algorithms with the syntax given by the following paragraphs.


Gen is a probabilistic polynomial-time key-generation algorithm that takes as input the security parameter 1λ and outputs a private-key sk and a public evaluation key pk. The secret key implicitly defines a ring custom-character that will serve as the message space. We write the syntax as (sk, pk)←Gen(1λ). The security parameter is usually given in unary notation which indicates a λ-bit string of 1s so the efficiency of the algorithm is expected to be polynomial-time in λ.


Enc is a probabilistic polynomial-time encryption algorithm that takes as input a secret key sk and message m and outputs a ciphertext c as a n-dimensional tuple. We write the syntax as c←Enc(sk,c).


Dec is a deterministic polynomial-time encryption algorithm that takes as input a secret key sk and a ciphertext c and outputs a message m. We write the syntax as m=Dec(sk,c).


Add is a deterministic polynomial-time addition algorithm that takes two ciphertexts c1 and c2 and outputs a ciphertext c which corresponds to the component-wise addition of c1 and c2 reduced modulo pk. We write the syntax as c=Add(pk, c1, c2).


SDiv is a deterministic polynomial-time scalar division algorithm that takes a ciphertext c1 and a scalar α and outputs a ciphertext c which corresponds to the scalar division of all elements of c by α reduced modulo pk. We write the syntax as c=SDiv(pk, c1, α).


Correctness requires the following:


1) For all sk, pk output by Gen, and all m ∈ custom-character we have Dec(sk,Enc(sk, m)=m.


2) For all ci←Enc(sk, mi), i=1, 2 and all α∈ custom-character, the following holds:






Dec(sk,Enc(sk, Add(pk, c1, c2)))=m1+m2,






Dec(sk,Enc(sk, SDiv(pk, c1, α)))=m1/α.   Eq. 3


Definition 2: A SWHE scheme Π is secure if for a uniform m ∈ custom-character, all (sk, pk)←Gen(1λ) and all c←Enc(sk,c), no efficient adversary A can recover m by knowing only pk and c.


B. Key Update Protocol


Definition 3: A key update protocol denoted as: Eq. 4





Σ=(TokGen, KeyUpd)   Eq. 3


is a tuple of efficient algorithms with the syntax given by the following paragraphs.


TokGen is a deterministic polynomial-time token generation algorithm that takes an old secret key skold and a new secret key sknew and outputs a token t. We write the syntax as t=TokGen (skold, sknew).


KeyUpd is a deterministic polynomial-time key update algorithm that takes a token t and a ciphertext cold, previously encrypted with skold, and outputs a ciphertext cnew that is encrypted with sknew. We write the syntax as cnew=KeyUpd(t, cold).


Definition 4: The key update protocol Σ is secure if for all uniform skold and sknew output by Gen (1λ) and t output by TokGen, the probability of any efficient adversary A to recover either skold or sknew by knowing t, cold and cnew is negligible.


III. Description of the SWHE Scheme


In this section we propose a construction for an embodiment that aims to satisfy the definitions in Section II-A. But first, let us introduce our motivation and a couple of useful remarks and definitions.


Motivation 1: We want to design a SWHE scheme that is secure based on the assumption that solving an underdetermined system of equations is computationally hard. In order to achieve this goal, we propose a design of an encryption function based on randomness and underdeterminancy. We want to transform a message m into a random multivector M ∈ Gq3 where a particular combination of addition and subtraction of its coefficients results in m, which implies that we have a different ciphertext even if we encrypt the same message multiple times. We also want to perform a modular multiplication using a secret factor, which implies that recovering m requires a modular multiplicative inverse operation with an unknown operand. Finally, we want to “seal” the randomly generated and modular displaced multivector with two secret key multivectors via a triple geometric product. In doing so, we expect to pose a challenge when attempting to recover a plaintext message from any give ciphertext, which is equivalent to solving a non-redundant underdetermined system of equations.


Motivation 2: We want to build an encryption scheme to be applied in a private (permissioned) blockchain among trusted parties. Thus, we are providing privacy in a trusted environment assuming that all the parties must follow a given protocol.


Remark 1: Due to Motivation 2, we assume that a relaxed threat model is in place where the adversary is not supposed to have any knowledge about the message that originated a given ciphertext. This allows us to propose an experimental and compact solution to solve Problems 1 and 2, as well as allowing us to introduce and discuss instances of a new approach for expanding BT capabilities with HE.


In Definition 1, the algorithm SDiv, for any useful result, might imply a fractional output. We will introduce a construction in which the encryption function receives positive integers as inputs and generates ciphertexts where the underlying computation is performed over the integers modulo a prime. Since Enc takes integers in Zq as input and generates ciphertexts also over integers in custom-characterq, the decryption function is expected to output integers in custom-characterq. The algorithm Add performs homomorphic addition of ciphertexts and the decryption of the results is also an integer. However, in the specific case of SDiv, a ciphertext is divided by a scalar which might result in a non-integer rational number. The scalar division is performed over the integers, with the modular multiplicative inverse. In order to map the integer result of a scalar division to its corresponding rational representation we will use the Extended Euclidean Algorithm (EEA) according to Definition 5 below.


Definition 5: Given a prime p and a positive integer c ∈ custom-characterp,


let the EEA be computed as follows:


1) Set a0=p, a1=c; b0=0, b1=1; i=1.


2) While ai>└√{square root over (p/2)}┘ compute

    • q=└ai−1/ai┘ ai+1=ai−1−qai
    • bi+1=bi−1−qbi i=i+1.


3) a/b=ai/bi


4) Return a/b. We write the syntax as a/b=EEA (p, c).


Now we are ready to introduce constructions that satisfy the definitions in II-A. Note that the following constructions take into account the bit size concerns of a computer program. Conceptually, the various embodiments are not limited by the bit size as the conceptual model may theoretically encompass infinite bit size values.


Gen takes as input 1λ and proceeds as follows: (1) set b=λ/8; (2) let q be the smallest prime greater than 2b; (3) choose uniform 16 b-bit integers and define K1, K2 custom-characterq3 such that the first 8 integers are the coefficients of K1 and the second 8 integers are the coefficients of K2—the generated K1, K2 must have an inverse otherwise other 16 b-bit integers must be uniformly chosen and transformed into the secret key multivectors K1, K2 until they have inverse; (4) choose a uniform b-bit integer g; and (5) output the secret key sk=(K1, K2, g) and the public evaluation key pk=(b, q). The message space is originally defined by custom-character=custom-characterq.


Enc takes as input sk=(K1, K2, g) and m and proceeds as follows:

    • 1) Let m0, : : : , m123, with the exception of m12, be uniform b-bit integers and m12 be defined as follows:






m
12
=|−m
0
−m
1
+m
2
+m
3
−m
13
+m
23
+m
123
+m|
q.   Eq. 5

    • 2) For j ∈ {0, 1, 2, 3, 12, 13, 23, 123}, define M such that:






Mjmjēj.   Eq. 6

    • 3) Compute M′ such that M′=Mg.
    • 4) Compute and output C=K1MK2.


Dec takes as input sk=(K1, K2, g) and Ccustom-characterq3 and proceeds as follows:

    • 1) Retrieve M′=K1−1CK2−1.
    • 2) Retrieve M=M′/g
    • 3) Compute m such that:






m=|m
0
+m
1
−m
2
−m
3
+m
12
+m
13
−m
23
−m
123|q.   Eq. 7

    • 4) Update the value m by mapping it to a rational format such that m=a/b=EEA(q, m). Output m.


Add takes as inputpk and C1, C2 custom-characterq3 and computes and outputs C as a component-wise addition of the coefficients of C1, C2.


SDiv takes as inputpk, C1 custom-characterq3 and a scalar α in custom-characterq, and computes and outputs C as a scalar division of all elements in C1 by α which is denoted by C1/α.


Lemma 1: For all uniformly generated coefficients of mj custom-characterq, where j ∈ {0, 1, 2, 3, 12, 13, 23, 123}, q is prime, and for all m12 as defined in Eq. 5, the result in Eq. 7 holds.


Proof Given the definition of m12 in Eq. 5, let's re-write Eq. 7 as m=ma+mb such that:






m
a
=m
0
+m
1
−m
2
−m
3
+m
12,   Eq. 8






m
b
=m
13
−m
23
−m
123.   Eq. 9


If we substitute for mu in Eq. 8 we have:






m
a
=m−m
13
+m
23
−m
123,   Eq. 10


so, when we compute ma+mb adding Eqs. 9 and 10 we obtain:






m
a
+m
b
=m−m
13
+m
23
+m
123
+m
13
−m
23
−m
123
=m,   Eq. 11


Lemma 2: For any prime q, any non-zero g ∈ custom-characterq and any Mcustom-characterq3, we have M′=Mg, M′/g=M.


Proof. For any prime q, all non-zero elements g ∈ custom-characterq have a unique modular multiplicative x=|g−1|q such that |gx|q=1. When we compute M′=Mg, we recover M by performing the scalar division of M′ by g, denoted by M′/g, which is in fact equivalent to the scalar multiplication of M′ by |g−1|q. Since q is prime, for all g>0 we have a x such that |gx|q=1, where g, x ∈ custom-characterq. According to the Bézout's identity, if gcd(g, q)=1, then we can write:






gx+gy=gcd(g, q)=1,   Eq. 12


where x, y have integer solutions. We can then rewrite Eq. 12 as:






gx−1=(−y)q and gx≡1 mod q,   Eq. 13


and, thus, x is the modular multiplicative inverse of g with respect to q.


For small values of q one can naively compute x by iterating x from 1 to q−1 until finding the result that satisfies |gx|q=1. However, a better way is to use the Extended Euclidean Algorithm (EEA) which can efficiently compute modular multiplicative inverses for large values of g and q as long as gcd(g, q)=1.


Theorem 1: For all sk output by Gen and m ∈ custom-characterq, we have Dec(sk, Enc(sk, m))=m.


Proof: Recall that in the definition of Gen, K1, K2 must have an inverse. Therefore, for all sk=(K1, K2, g) and all m ∈ custom-characterq, we obtain M′ as M′=K1−1CK2−1. By applying Lemma 2, we recover M from M′ and we recover m from M by applying Lemma 1.


Lemma 3: For all a, b ∈ custom-characterq that is transformed into Ā, Bcustom-characterq3 according the first two steps in the Enc algorithm, decoding Ā+B back to scalar in custom-characterq results in a+b and therefore the transformation of a, b into Ā, B is homomorphic with respect to addition.


Proof: For all a, b ∈ custom-characterq that are represented by Ā, Bcustom-characterq3, respectively, where the coefficients of Ā, B are all uniform in custom-characterq with the exception of a12 in Ā and b12 in B which are both defined in Eq. 5. Let S=Ā+B. The multivector addition is performed element-wise where sj=aj+bj for j ∈ {0, 1, 2, 3, 13, 23, 123}. For the particular case of s12=a12+b12 we have:






s
12
=a−a
0
−a
1
+a
2
+a
3
−a
13
+a
23
+a
123
+b−b
0
−b
1
+b
2
−b
3
−b
13
+b
23
+b
123.   Eq. 14


If we organize the coefficients of S as:






s
a
=a
0
+b
0
−a
2
−a
3
+b
0
+b
1
−b
2
−b
3   Eq. 15





sb=s12






s
c
=a
13
−a
23
−a
123
+b
13
−b
23
−b
123,


we compute sa+sb to obtain:














s
a

+

s
b


=



a
-

a
13

+

a
23

+

a
123

+
b
-

b
13

+

b
23

+

b
123








=



a
+
b
-

s
c









Eq
.




16







so, essentially, computing sa+sb+sc gives sa+sb+sc=a+b.


Lemma 4: Lemma 2 also applies to scalar multiplication and scalar division of all Ā, Bcustom-characterq3 by all scalar g ∈ custom-characterq. Recall that scalar division by g mod q is a scalar multiplication by g−1 mod q. A multivector scalar multiplication holds the properties of additivity in the scalar and additivity in the (multi)vector [44], and, therefore, we have:






Āg+Bg=(Ā+B)g.   Eq. 17


Lemma 5: For all prime q, C1←Enc (sk, m) and α ∈ custom-characteru where u=└√{square root over (q/2)}┘, the following holds:






m/α=Dec(sk, SDiv(pk, C1, α).   Eq. 18


Proof. On the encrypted domain, where computation is performed modulo q, for q is a prime, the scalar division of C1 by α is achieved via the scalar multiplication of C1 by the modular multiplicative inverse of α with respect to q. If we let C=SDiv(pk, C1, α), the decryption of C will result on the integer representation of m divided by α, that is, |mα−1|q. Since the definition of Dec in Section III requires a rational output in order to accommodate the results from computation including SDiv, we need to use the EEA in Definition 5 to achieve this goal. If we let c=|mα−1|q, the EEA, whose standard implementation computes all the convergents of c/q, will output the first convergent of c/q whose numerator satisfies ai≤u (according to the modified version presented in Definition 5), for u=└√{square root over (q/2)}┘. This result implies c≡m/α. To prove this equivalence, we can rewrite m as a Diophantine equation where there is an integer solution fork such that m=αc+kq. We can now write the solution fork as k=(m−αc)/q. It is clear that c=(m−kq)/α and since (m−kq) mod q=m then we have c≡m/α mod q.


Due to Lemma 5, and since we assume that homomorphic scalar divisions will always occur, in order to guarantee the desired result of scalar divisions over encrypted data, we reduce the message space originally defined as custom-characterq in Gen by custom-characteru, for u=└√{square root over (q/2)}┘.


Theorem 2: For all (sk, pk) output Gen, C1, C2 custom-characterq3, and m1, m2, α ∈ custom-characterq, the following holds:






Dec(sk, Add(pk, Enc(sk, m1), α))=m1·α.   Eq. 19





and






Dec (sk, Add(pk, Enc(sk, m1), Enc(sk, m2)))=m1+m2,   Eq. 20


Proof Given m1, m2, α ∈ custom-characterq, sk=(K1, K2, g), and pk=q, we compute C1, C2 custom-characterq3 as follows:







C

1
=Enc(sk, m1), C2=Enc(sk, m2).   Eq. 21


We compute C=Add(pk, C1, C2). When decrypting C we have:













Dec


(

sk
,

C
_


)


=





K
_

1

-
1




C
_




K
_

2

-
1









=






K
_

1

-
1




(



C
_

1

+


C
_

2


)





K
_

2

-
1









=






K
_

1

-
1





C
_

1




K
_

2

-
1



+



K
_

1

-
1





C
_

2




K
_

2

-
1










=






K
_

1

-
1





K
_

1




M
_

1





K
_

2




K
_

2

-
1



+



K
_

1

-
1





K
_

1




M
_

2





K
_

2




K
_

2

-
1










=





M
_

1


+



M
_

2


.









Eq
.




22







By applying Lemma 3 ad 4, we obtain m1+m2. Similarly, let C=SDiv(pk, C1, α), then:













Dec


(

sk
,

C
_


)


=





K
_

1

-
1




C
_




K
_

2

-
1









=





K
_

1

-
1





C
_

1



α

-
1





K
_

2

-
1









=





K
_

1

-
1





K
_

1




M
_

1





K
_

2




K
_

2

-
1




α

-
1









=





M
_

1





α

-
1


.









Eq
.




23







By applying Lemma 3, 4 and 5, we obtain m1/α.


Theorem 3: If an adversary custom-character can efficiently solve a system of equations with 8 non-redundant equations and 24 unknowns then custom-character can efficiently recover m from C without knowing anything other than C.


Proof: Let a multivector Ā ∈ custom-characterq3 be written as:






Ā=custom-characterAcustom-character0+custom-characterAcustom-character1+custom-characterAcustom-character2+custom-characterAcustom-character3   Eq. 24


where <·>i, for i ∈ {0,1,2,3}, is called a multivector grade. Grades 0 and 3 contain a single element each and grades 1 and 2 contain three elements each, for a total of 8 elements.


Given Ccustom-characterq3 such that C=K1MK2, we can write Ci=03custom-characterK1MK2custom-characteri. Similarly, if one wants to recover M′ they need to compute M′=K1−1CK2−1i=03custom-characterK1−1CK2−1custom-character.


Assuming the adversary custom-character only knows C, an attack to recover M from C can be formulated by solving a system of equations on the form of:






custom-character
C
custom-character
i=custom-characterK1MK2custom-characteri,   Eq. 25


where each element of C can be computed from a combination of the elements of K1, M′ according to the rules of the geometric product, for a total of 8 equations. Since K1, M′, and K2 are unknowns, and each also have a total of 8 elements, the adversary custom-character is faced with a total of 24 unknown variables. This means that the system of equations the adversary needs to solve in order to recover M′ from C is considered an underdetermined system, i.e., a system that has less equations than unknowns. As for any underdetermined system, the number of basic variables is given by the number of equations, thus we have 24−8=16 free variables. Therefore, the system has as an infinite number of solutions for as many values that the free variables can take.


Lemma 6: The proposed SWHE Scheme is secure assuming that no adversary custom-character can efficiently solve (that is, solve under polynomial-time) an underdetermined system of equations which its underdeterminancy is not affected by the number of ciphertexts samples under consideration.


Proof: Given C1=Enc(sk, m1), C2=Enc(sk, m2), and C=Add(pk, C1, C2), an adversary A may try to solve for M1 and M2 and/or simply M′=M1+M2, by organizing a system of equations as in Eq. 25, such that:






custom-character
C
custom-character
i=custom-characterK1M1K2custom-characteri   Eq. 26






custom-character
C
custom-character
i=custom-characterK1M2K2custom-characteri






custom-character
C
custom-character
i=custom-characterK1MK2custom-characteri.


The system would then have a total of 24 equations (8 for each cyphertext) and 32 unknowns if solving for both M1 and M2, or 24 unknowns if solving for M′. However, notice that:






custom-character

C

custom-character
i=custom-characterK1MK2custom-characteri=custom-characterK1M1K2custom-characteri+custom-characterK1M2K2custom-characteri=custom-characterK1(M1+M2)K2custom-characteri,   Eq. 27


i.e., the 8 equations with respect to the elements of C are generated as a sum of the equations with respect to C1 and C2, and, therefore, are redundant, which reduces the total number of non-redundant equations to 16. Hence, the resulting system, despite solving for both M1 and M2, or M′ only, is underdetermined and can have an infinite number of solutions. Similarly, if we compute C=SDiv(pk, C1, α), then:






custom-character

C

custom-character
i=custom-characterK1MK2custom-characteri=custom-characterK1(M1α−1)K2custom-characteri−1custom-characterK1M1K2custom-characteri.   Eq. 28


Notice that the equations for the elements of C are the result of α−1 multiplied by the equations with respect to the elements of C1, and hence are redundant. Therefore, the resulting systems of equations for the scalar division case has a total of 8 nonredundant equations and 24 unknowns, which turns out to be an underdetermined system with an infinite number of possible solutions.


IV. Description of the Key Update Protocol


In this section we propose a construction that aims to satisfy the definitions presented in Section II-B.


Motivation 3: We want to design a key update protocol that securely allows one to update the secret key of an existing ciphertext without revealing the corresponding message, the old key or the new key, also based on the assumption that solving a non-redundant underdetermined system of equation is computationally hard. In order to achieve this goal, we propose a design for a protocol based on underdeterminancy. From the old and the new key, we want to generate a token that is expected to not reveal information about either the old or the new key. Once the token is generated, one should be able to use it for changing the keys on an existing ciphertext under the old key, generating a new ciphertext under the new key. In this process, one should not be able to derive the underlying plaintext message.


TokGen takes as input two secret keys sk1=(K11, K12, g1) and sk2=(K21, K22, g2), the old and the new key, respectively, and computes and returns the token t=(T1, T2) such that T1=K21K11−1g1−1g2, T2=K12−1K22.


KeyUpd takes as input the token t=(T1, T2) and an existing (old) ciphertext Cold and computes and outputs an updated (new) ciphertext Cnew as Cnew=T1ColdT2.


Theorem 4: For all sk1 and sk2 output by Gen, and all T1 and T2 output by TokGen, given that C is a ciphertext such that:






C
old=K11MoldK12, Mold=M′g1,   Eq. 29


it holds: Cnew=K21MnewK22, Mnew=M′g2.


Proof Given the setup in Theorem 4, we verity that:














C
_

new

=





T
_

1




C
_

old




T
_

2








=





K
_

21




K
_

11

-
1




g
1

-
1




g
2




K
_

11




M
_





g
1




K
_

12




K
_

12

-
1





K
_

22


i







=





K
_

21




M
_





g
2




K
_

22








=





K
_

21




M
_

new





K
_

22

.









Eq
.




30







Theorem 5: If an adversary A can efficiently solve a system of equations with more unknowns than non-redundant equations then A can efficiently recover m from Cnew.


Proof Given a token t=(T1, T2) computed according to TokGen, Cold=K11MK12=K11Mg1K12 computed according to Enc, and Cnew=T1ColdT2 computed according to KeyUpd, an adversary custom-character, with knowledge of Cold, t, and Cnew, may try to solve for M, and consequently obtain m according to Dec, by organizing a system of equations on the form of:






custom-character
C
old
custom-character
i=custom-characterK11Mg1K12custom-characteri   Eq. 31






custom-character
C
new
custom-character
i=custom-characterT1ColdT2custom-characteri






custom-character

T

1
custom-character
i
=
custom-character

K

21

K

11
−1
g
1
−1
g
2
custom-character
i






custom-character

T

2
custom-character
i
=
custom-character
K
12
−1

K

22
custom-character
i


Notice that this system of equations contains a total of 32 equations (8 equations for each of the multivectors Cold, Cnew, T1, and T2) and 42 unknowns (40 related to the multivectors K11, K12, K21, K22, and M, and 2 related to the scalars g1 and g2. Therefore, the system is considered underdetermined as the number of unknowns surpasses the number of nonredundant equations.


Theorem 6: If an adversary custom-character can efficiently solve a system of equations with 8 non-redundant equations and 16 unknowns then custom-character can efficiently recover sk1 or sk2 from t.


Proof: The proof of Theorem 6 can be borrowed from the proof of Theorem 5, as the same system of equations and its characteristics apply in this case.


V. Application


In order to provide practical insights on how to connect the proposed constructions to a real-world DLT-based system, we introduce an illustrative design where we apply our SWHE scheme and key update protocol. In our example we describe an instance of the data ownership problem, where regulatory restrictions reduce the solution space for data computation. Due to space limitations, we cannot fully describe the system internals in all its details (i.e., consensus mechanism for persisting data), so we will provide a minimally required high level description of its building blocks.


Motivation 4: $300 billion out of more than $1.7 trillion are spent annually on medical research alone [47] and advancements depend on the reproducibility of experiments and the scientific correctness underlying it. Moreover, healthcare systems operate under strict regulations [48] in order to protect the secrecy of patients, resulting in a very siloed industry [18]. In such scenario, blockchain technologies have the potential to mediate the access to healthcare data [49], avoiding power imbalance over digital assets. With the addition of HE, a DLT system can protect the privacy of individuals' Electronic medical records (EMRs) while offering compliant analysis over their data.


Definition 6: This blockchain application is composed by the building blocks described in the following paragraphs.


User custom-characterA: The original data owner. Responsible for persisting information on-chain and the one that decides when and to whom the ownership is transferred.


User custom-characterB: An existing user from the same consortium of User custom-characterA. User custom-characterB has access to the off-chain cryptographic library and can perform homomorphic computations on-chain at any time. User custom-characterB is interested in getting insights of data processed at the blockchain.


App component custom-characterc: Software that works as an interface between the user and the SWHE scheme and the key update protocol. The App component custom-characterC imports the algorithms Gen, Enc, and Dec from the SWHE scheme and the TokGen from the key update protocol.


Blockchain component custom-characterC: A system composed by the ledger (the blockchain database) and a smart contract that controls the access to the ledger. The smart contract imports Add, SDiv from the SWHE scheme and KeyUpd from the key update protocol.


Definition 7: custom-characterC is a tuple with the following efficient algorithms: NewRecord, GetRecords, GenReport, GenResult, GetReport and GetResult, as described in the following paragraphs.


GenReport generates a report calculating the median of a given number of records. We write the syntax as GenReport (idSLedger), and operates as follows:

    • 1) First, GetRecords is called, retrieving the records represented by idsLedger;
    • 2) Then, Add operates the addition of multivectors inside the records returned by GetRecords;
    • 3) SDiv takes all summed multivectors given by Add and divides by the number of records returned by GetRecords; and,
    • 4) Finally NewRecord is used to persist the aggregated data.


GenResult takes as input an id, idLedger, and the generated tokens t to update the keys of a report. We write the syntax as GenResult (idLedger; t), and operates as follows:

    • 1) First, GetReport is called, retrieving the report of idLedger;
    • 2) Second, KeyUpd is used to change the keys of report idLedger; and,
    • 3) Finally, NewRecord is used to persist the resulting data.


GetResult takes as input idLedger and retrieves a report that had its keys updated. We use the syntax GetResult (idLedger).


Example 1: In our example, custom-characterA represents a hospital that owns patients' records. custom-character8 stands for a research institution that wants to make analysis over patients' data. The medical industry runs under strict regulation and health institutions are forbidden to share personal information from individuals. However, a disease outbreak urged the aforementioned organizations to cooperate. Therefore, the hospital agreed to share information under a security protocol, that could lead to a better triage of patients and, perhaps, a path to a cure.


In the DLT environment, both institutions will have a copy of the data, but their ownership is tied to their keys. Since the smart contract is using a SWHE scheme, computations can be performed homomorphically by custom-characterB and the property over the resulting analysis can be transferred through the key update protocol by custom-characterA.



custom-character
8 wants to calculate the average number of pre-existing conditions of every patient that died from the new illness. Therefore, custom-character8 generates a report over a selection of expired patients. Then, custom-characterA analyzes the result and decides to grant permission. To do so, a symmetric key is shared with custom-character8 through a traditional key exchange protocol. Now, custom-characterA updates the keys of the report, allowing custom-character8 to finally detect a high number of pre-existing conditions in patients that did not recover.


VI. Conclusions


Through practical constructions of the various embodiments the realization of a somewhat homomorphic encryption (SWHE) scheme is demonstrated and a key update protocol as a strategy for expanding the current capabilities of blockchain technologies (BT) is also demonstrated. With a very small set of elementary functions found in Clifford geometric algebra, the various embodiments are able to provide simple and yet efficient cryptographic protocols to equip BT with a homomorphic smart contract. Without violating current business logic constraints in BT, one can use constructions of the various embodiments to homomorphically analyze encrypted data, generate reports and transfer the data ownership without compromising the original key holder's and/or third parties' privacy. The disclosure further provides evidence of the various embodiments' proposed algorithms' correctness as well as the security properties the algorithms carry, under some strong assumptions such as the attacker's knowledge restricted to public information.


Hardware Implementation for Data Concealment Embodiments (FIG. 1)


FIG. 1 is a block diagram 100 of the hardware implementation for a SomeWhat Homomorphic Encryption (SWHE) distributed ledger/blockchain embodiment. The distributed ledger/blockchain system 102 provides for data storage with the principles of ownership and non-repudiation inherent to a distributed ledger/blockchain system. The data owner node device 104, calculation node device 106, and new data owner node device 108 are members of the distributed ledger/blockchain system 102. The data owner node device 104, calculation node device 106, and new data owner node device 108 read/write data 110 from/to the distributed ledger/blockchain system 102. The data owner node 104 may encrypt data and write/submit the encrypted data for storage on the blockchain system 102. The calculation node device 106 may perform somewhat homomorphic operations on the encrypted data supplied by the data owner node device 104, such as adding two ciphertexts from the data owner node device 104 together or dividing a ciphertext by a scalar, all while the data and results remain encrypted during the entire operation. The data owner node device 104 may decrypt ciphertext data owned by the data owner node device 104 as desired. The new data owner node 108 may be given ownership of a ciphertext owned by the data owner node device 104 using a key update protocol that will update the encryption key of the ciphertext to a new encryption key. The new encryption key will be transferred 112 to the new data owner node device 108 so the new data owner may now decrypt the data with the updated security key, but not the old data with the original security stored on the blockchain 102 by the data owner node device 104.


Further, generally, any computing device capable of communication over any form of electronic network or bus communication platform may be one, two or all three of the node devices 104-108 shown in FIG. 1. Additionally, the data owner node device 104, the calculation node device 106, and the new data owner node device 108 may actually be the same physical computing device communicating over an internal bus connection with itself, but still desiring to conceal transferred data to ensure that an attacker cannot monitor the internal communications bus to obtain sensitive data communications in an unconcealed format.


Various embodiments may implement the network/bus communications channel for the blockchain system 102 using any communications channel capable of transferring electronic data. For instance, the network/bus communication connection may be an Internet connection routed over one or more different communications channels during transmission between the node devices 104-108 and the blockchain system 102. Likewise, the network/bus communication connection may be an internal communications bus of a computing device, or even the internal bus of a processing or memory storage Integrated Circuit (IC) chip, such as a memory chip or a Central Processing Unit (CPU) chip. The network/bus communication channel may utilize any medium capable of transmitting electronic data communications, including, but not limited to: wired communications, wireless electro-magnetic communications, fiber-optic cable communications, light/laser communications, sonic/sound communications, etc., and any combination thereof of the various communication channels.


The various embodiments may provide the control and management functions detailed herein via an application operating on the node computing devices 104-108. The node computing devices 104-108 may each be a computer or computer system, or any other electronic devices device capable of performing the communications and computations of an embodiment. The node computing devices 104-108 may include, but are not limited to: a general purpose computer, a laptop/portable computer, a tablet device, a smart phone, an industrial control computer, a data storage system controller, a CPU, a Graphical Processing Unit (GPU), an Application Specific Integrated Circuit (ASI), and/or a Field Programmable Gate Array (FPGA). Notably, the first 102 and/or second 104 computing devices may be the storage controller of a data storage media (e.g., the controller for a hard disk drive) such that data delivered to/from the data storage media is always encrypted so as to limit the ability of an attacker to ever have access to unencrypted data. Embodiments may be provided as a computer program product which may include a computer-readable, or machine-readable, medium having stored thereon instructions which may be used to program/operate a computer (or other electronic devices) or computer system to perform a process or processes in accordance with the various embodiments. The computer-readable medium may include, but is not limited to, hard disk drives, floppy diskettes, optical disks, Compact Disc Read-Only Memories (CD-ROMs), Digital Versatile Disc ROMS (DVD-ROMs), Universal Serial Bus (USB) memory sticks, magneto-optical disks, ROMs, random access memories (RAMs), Erasable Programmable ROMs (EPROMs), Electrically Erasable Programmable ROMs (EEPROMs), magnetic optical cards, flash memory, or other types of media/machine-readable medium suitable for storing electronic instructions. The computer program instructions may reside and operate on a single computer/electronic device or various portions may be spread over multiple computers/devices that comprise a computer system. Moreover, embodiments may also be downloaded as a computer program product, wherein the program may be transferred from a remote computer to a requesting computer by way of data signals embodied in a carrier wave or other propagation medium via a communication link (e.g., a modem or network connection, including both wired/cabled and wireless connections).


Operational Flow Chart for Encryption/Decryption and SWHE Calculations for an Embodiment (FIG. 2)


FIG. 2 is a flow chart 200 of encryption/decryption and somewhat homomorphic calculations for an embodiment. At off-chain process 206, the data owner node device 202 generates a secret/private key sk and a public evaluation key pk. At off-chain process 208, the data owner node device 202 encrypts a numeric message m as a geometric product operation and a scalar multiplication of a message vector M with elements of the secret/private key sk (K1, K2, g) to create message ciphertext multivector C. The message multivector M represents the numeric message m. At on-chain process 210, the data owner node device 202 submits the message ciphertext C to the blockchain. At on-chain process 212, the calculation node device 204 performs somewhat homomorphic operations with the message ciphertext C (add another ciphertext, scalar division) to calculate a result ciphertext multivector CR. At off-chain process 214, the data owner node 202 decrypts the result ciphertext multivector CR as a geometric product operation and scalar division with inverse elements of the secret/private key sk to create a result multivector R and calculates a numeric result r from the result multivector R.


Operational Flow Chart for Key Update Operation for an Embodiment (FIG. 3)


FIG. 3 is a flow chart 300 of a key update operation for an embodiment. At off-chain process 306, the data owner node device 302 generates a second, new secret/private key sknew that is different from the original/old secret private key skold. At off-chain process 308, a token comprised of a first token multivector T1 and a second token multivector T2 is calculated as a function of the original skold and new sknew secret private keys. At off-chain process 310, the data owner device node 302 updates the secret/private key skold of a desired ciphertext Cold to the new secret/private key sknew as a function of the first and second token multivectors T1 and T2 to create a new message ciphertext multivector Cnew. The process 300 of FIG. 3 may be done for any ciphertext owned by the data owner node device including the message ciphertext C as well as the somewhat homomorphic calculation result ciphertext CR. At off-chain process 312, the data owner node device 302 transfers the new secret key sknew to the new data owner node device 304 through a traditional key exchange protocol. At on-chain process 314, the data owner node device 302 submits the new ciphertext multivector Cnew to the blockchain. At off-chain process 316, the new data owner node device 304 decrypts the new ciphertext multivector Cnew as a geometric product operation and scalar division with inverse elements of the new secret/private key sknew to create a message multivector M and calculates a numeric message m from the message multivector M.


The foregoing description of the invention has been presented for purposes of illustration and description. It is not intended to be exhaustive or to limit the invention to the precise form disclosed, and other modifications and variations may be possible in light of the above teachings. The embodiments were chosen and described in order to best explain the principles of the invention and its practical application to thereby enable others skilled in the art to best utilize the invention in various embodiments and various modifications as are suited to the particular use contemplated.

Claims
  • 1. A method for performing somewhat homomorphic operations on encrypted data in a distributed ledger/blockchain system without decrypting said encrypted data and where data resulting from said somewhat homomorphic operations remains encrypted, the method comprising: generating off-chain by a data owner node device a secret/private key sk and a public evaluation key pk, wherein said secret/private key sk is comprised of a first key multivector (K1), a second key multivector (K2), and an integer g, and wherein said public evaluation key pk is comprised of a prime number q;encrypting off-chain by said data owner node device an integer message m as a function of at least one Geometric Algebra geometric product operation of said first key multivector (K1), said second key multivector (K2), and a message multivector (M), and a scalar multiplication operation with integer g to obtain message ciphertext multivector (C), wherein said message multivector (M) is a representation of said integer message m;submitting on-chain by said data owner node device said message ciphertext multivector (C) as a transaction for said distributed ledger/blockchain system;performing on-chain by a calculation node device at least one somewhat homomorphic operation with said message ciphertext multivector (C) to calculate a result ciphertext multivector (CR), wherein a smart contract for said distributed ledger/blockchain defines available somewhat homomorphic operations and wherein said calculation node device is part of said same distributed ledger/blockchain system as said owner node device; anddecrypting off-chain by said data owner node device said result ciphertext multivector (CR) as a function of at least one Geometric Algebra geometric product operation of an inverse of said first key multivector (K1−1), an inverse of said second key multivector (K2−1), said result ciphertext multivector (CR), and a scalar division operation with integer g to obtain a result multivector (R), wherein said result multivector (R) is a representation of a numeric result r.
  • 2. The method of claim 1: wherein said process of generating off-chain by said data owner node device a secret/private key sk and a public evaluation key pk further comprises: setting said prime number q equal to a prime number;randomly generating 16 integer values;setting a first 8 of said 16 integer values as coefficient values of said first key multivector (K1), wherein said first key multivector (K1) has an inverse;setting a last 8 of said 16 integer values as coefficient values of said second key multivector (K2), wherein said second key multivector (K2) has an inverse;setting said integer g to a random integer value;setting said secret/private key sk to be said first key multivector (K1), said second key multivector (K2), and said integer g; andsetting said public evaluation key to be said prime number q;wherein said process of encrypting off-chain by said data owner node device said integer message m to obtain message ciphertext multivector (C) further comprises: setting coefficients of said message multivector (M) equal to random integer values except for a m12 coefficient of said message multivector (M);calculating said m12 coefficient of said message multivector (M) as a function of said other coefficients of said message multivector (M) and said integer message m modulo prime number q (m12=|−m0−m1+m2+m3−m13+m23+m123+m|q);calculating a transitional message multivector (M′) as a scalar multiplication of said integer g and said message multivector (M) (M′=Mg); andcalculating said message ciphertext multivector (C) as a geometric product of said first key multivector (K1), said transitional message multivector (M′) and said second key multivector (K2) (C=K1M′K2); andwherein said process of decrypting off-chain by said data owner node device said result ciphertext multivector (CR) further comprises: calculating a transitional result multivector (R′) as a geometric product of said inverse of said first key multivector (K1−1), said transitional result multivector (R′) and said inverse of said second key multivector (K2−1)(R′=K1−1CRK2−1);calculating a result multivector (R) as a scalar division of said integer g and said transitional result multivector (R′) by said integer g (R=R′/g);calculating said numeric result r as a function of coefficients of said result multivector (R) modulo prime number q (r=|r0+r1−r2−r3+r12+r13−r23−r123|q); andupdating said numeric result r by mapping said numeric result r to a rational number using an Extended Euclidean Algorithm (EEA).
  • 3. The method of claim 1 wherein said process of performing on-chain by a calculation node device at least one somewhat homomorphic operation further comprises calculating said result ciphertext multivector (CR) as a component-wise addition of coefficients of said message ciphertext multivector (C) and coefficients of a second ciphertext multivector (C2) (CR=C+C2) available on said distributed ledger/blockchain system such that when said result ciphertext multivector (CR) is decrypted, the numeric result r is equivalent to adding unencrypted values represented by said message ciphertext multivector (C) and said second ciphertext multivector (C2).
  • 4. The method of claim 1 wherein said process of performing on-chain by a calculation node device at least one somewhat homomorphic operation further comprises calculating said result ciphertext multivector (CR) as a scalar division of all elements of said message ciphertext multivector (C) by scalar value α (CR=C/α) such that when said result ciphertext multivector (CR) is decrypted, the numeric result r is equivalent to dividing integer message m by said scalar value α.
  • 5. The method of claim 1 wherein said distributed ledger/blockchain system is a private/permissioned system.
  • 6. The method of claim 1 further comprising: generating off-chain by said data owner node device a second secret/private key sk2, wherein said second secret/private key sk2 is comprised of a first key multivector (K21) of said second secret/private key sk2, a second key multivector (K22) of said second secret/private key sk2, and a second integer g2, such that said second secret/private key sk2 is not equal to said secret/private key sk;calculating off-chain by said data owner node device a token t comprised of a first token multivector (T1) and a second token multivector (T2) as a function of said secret/private key sk and said second secret/private key sk2;updating off-chain by said data owner node device said message ciphertext multivector (C) to be encrypted by said second secret/private key sk2 instead of by said secret/private key sk as a function of said first token multivector (T1) and said second token multivector (T2) to create a new message ciphertext multivector (Cnew);transferring off-chain by said data owner node device said second secret/private key sk2 to a new data owner node device through a traditional key exchange protocol;submitting on-chain by said data owner node device said new message ciphertext multivector (Cnew) as a transaction for said distributed ledger/blockchain system;decrypting off-chain by said new data owner node device said new message ciphertext multivector (Cnew) as a function of at least one Geometric Algebra geometric product operation of an inverse of said first key multivector (K21−1) of said second secret/private key sk2, an inverse of said second key multivector (K22−1) of said second secret/private key sk2, said new message ciphertext multivector (Cnew), and a scalar division operation with integer g2 of said second secret/private key sk2 to obtain said message multivector (M), wherein said new data owner node device has access only to said new message ciphertext multivector (Cnew) and not to other data encrypted by said data owner node device with said secret/private key sk.
  • 7. The method of claim 6: wherein said process of calculating off-chain by said data owner node device said token t further comprises calculating said first token multivector (T1) as T1=K21K1−1g−1g2 and said second token multivector (T2) as T2=K2−1K22; andwherein said process of updating off-chain by said data owner node device said message ciphertext multivector (C) to create said new message ciphertext multivector (Cnew) further comprises calculating said new message ciphertext multivector (Cnew) as Cnew=T1CT2.
  • 8. The method of claim 7 wherein said result ciphertext multivector (CR) is processed instead of said message ciphertext multivector (C).
  • 9. The method of claim 6 wherein operations of each of said data owner node device, said calculation node device, and said new data owner node device are performed by one or more hardware devices as desired by a user.
  • 10. A distributed ledger/blockchain system that performs somewhat homomorphic operations on encrypted data without decrypting said encrypted data and where data resulting from said somewhat homomorphic operations remains encrypted, the distributed ledger/blockchain system comprising: a data owner node device, wherein said data owner node device further comprises: a key generation subsystem that generates, off-chain, a secret/private key sk and a public evaluation key pk, wherein said secret/private key skis comprised of a first key multivector (K1), a second key multivector (K2), and an integer g, and wherein said public evaluation key pk is comprised of a prime number q;an encryption subsystem that encrypts, off-chain, an integer message m as a function of at least one Geometric Algebra geometric product operation of said first key multivector (K1), said second key multivector (K2), and a message multivector (M), and a scalar multiplication operation with integer g to obtain message ciphertext multivector (C), wherein said message multivector (M) is a representation of said integer message m;a ciphertext submission subsystem that submits, on-chain, said message ciphertext multivector (C) as a transaction for said distributed ledger/blockchain system; anda decryption subsystem that decrypts, off-chain, a result ciphertext multivector (CR) as a function of at least one Geometric Algebra geometric product operation of an inverse of said first key multivector (K1−1), an inverse of said second key multivector (K2−1), said result ciphertext multivector (CR), and a scalar division operation with integer g to obtain a result multivector (R), wherein said result multivector (R) is a representation of a numeric result r; and,a calculation node device, wherein said calculation node device further comprises: a somewhat homomorphic operation calculation subsystem that performs, on-chain, at least one somewhat homomorphic operation with said message ciphertext multivector (C) to calculate a result ciphertext multivector (CR), wherein a smart contract for said distributed ledger/blockchain defines available somewhat homomorphic operations and wherein said calculation node device is part of said same distributed ledger/blockchain system as said owner node device.
  • 11. The distributed ledger/blockchain system of claim 10: wherein said key generation subsystem of said data owner node device further sets said prime number q equal to a prime number, randomly generates 16 integer values, sets a first 8 of said 16 integer values as coefficient values of said first key multivector (K1) wherein said first key multivector (K1) has an inverse, sets a last 8 of said 16 integer values as coefficient values of said second key multivector (K2) wherein said second key multivector (K2) has an inverse, sets said integer g to a random integer value, sets said secret/private key sk to be said first key multivector (K1) said second key multivector (K2), and said integer g, and sets said public evaluation key to be said prime number q;wherein said encryption subsystem of said data owner node device further sets coefficients of said message multivector (M) equal to random integer values except for a m12 coefficient of said message multivector (M), calculates said m12 coefficient of said message multivector (M) as a function of said other coefficients of said message multivector (M) and said integer message m modulo prime number q (m12=|−m0−m1+m2+m3−m13+m23+m123+m|q) , calculates a transitional message multivector (M′) as a scalar multiplication of said integer g and said message multivector (M)(M′=Mg), and calculates said message ciphertext multivector (C) as a geometric product of said first key multivector (K1), said transitional message multivector (M′) and said second key multivector (K2)(C=K1M′K2); andwherein said decryption subsystem of said data owner node device further calculates a transitional result multivector (R′) as a geometric product of said inverse of said first key multivector (K1−1), said transitional result multivector (R′) and said inverse of said second key multivector (K2−1)(R′=K1−1CRK2−1), calculates a result multivector (R) as a scalar division of said integer g and said transitional result multivector (R′) by said integer g(R=R′/g), calculates said numeric result r as a function of coefficients of said result multivector (R) modulo prime number q (r=|r0+r1−r2−r3+r12+r13−r23−r123|q); and updates said numeric result r by mapping said numeric result r to a rational number using an Extended Euclidean Algorithm (EEA).
  • 12. The distributed ledger/blockchain system of claim 10 wherein said somewhat homomorphic operation calculation subsystem calculation node device further calculates said result ciphertext multivector (CR) as a component-wise addition of coefficients of said message ciphertext multivector (C) and coefficients of a second ciphertext multivector (C2)(CR=C+C2) available on said distributed ledger/blockchain system such that when said result ciphertext multivector (CR) is decrypted, the numeric result r is equivalent to adding unencrypted values represented by said message ciphertext multivector (C) and said second ciphertext multivector (C2).
  • 13. The distributed ledger/blockchain system of claim 10 wherein said wherein said somewhat homomorphic operation calculation subsystem calculation node device further calculates said result ciphertext multivector (CR) as a scalar division of all elements of said message ciphertext multivector (C) by scalar value α (CR=C/α) such that when said result ciphertext multivector (CR) is decrypted, the numeric result r is equivalent to dividing integer message m by said scalar value α.
  • 14. The distributed ledger/blockchain system of claim 10 wherein said distributed ledger/blockchain system is a private/permissioned system.
  • 15. The distributed ledger/blockchain system of claim 10: wherein said key generation subsystem of said data owner node device generates, off-chain, a second secret/private key sk2, wherein said second secret/private key sk2 is comprised of a first key multivector (K21) of said second secret/private key sk2, a second key multivector (K22) of said second secret/private key sk2, and a second integer g2, such that said second secret/private key sk2 is not equal to said secret/private key sk;wherein said data owner node device further comprises: a token calculation subsystem that calculates, off-chain, a token t comprised of a first token multivector (T1) and a second token multivector (T2) as a function of said secret/private key sk and said second secret/private key sk2;an update key subsystem that updates, off-chain, said message ciphertext multivector (C) to be encrypted by said second secret/private key sk2 instead of by said secret/private key sk as a function of said first token multivector (T1) and said second token multivector (T2) to create a new message ciphertext multivector (Cnew);a key transfer subsystem that transfers, off-chain, said second secret/private key sk2 to a new data owner node device through a traditional key exchange protocol;wherein said a ciphertext submission subsystem of said data owner node device submits, on-chain, said new message ciphertext multivector (Cnew) as a transaction for said distributed ledger/blockchain system; anddistributed ledger/blockchain system further comprises: said new data owner node device, wherein said new data owner node device further comprises: a new data owner decryption subsystem decrypts, off-chain, said new message ciphertext multivector (Cnew) as a function of at least one Geometric Algebra geometric product operation of an inverse of said first key multivector (K21−1) of said second secret/private key sk2, an inverse of said second key multivector (K22−1) of said second secret/private key sk2, said new message ciphertext multivector (Cnew), and a scalar division operation with integer g2 of said second secret/private key sk2 to obtain said message multivector (M), wherein said new data owner node device has access only to said new message ciphertext multivector (Cnew) and not to other data encrypted by said data owner node device with said secret/private key sk.
  • 16. The distributed ledger/blockchain system of claim 15: wherein said token calculation subsystem of said data owner node device further calculates said first token multivector (T1) as T1=K21K1−1g−1g2 and said second token multivector (T2) as T2=K2−1K22; andwherein said update key subsystem of said data owner device further calculates said new message ciphertext multivector (Cnew) as Cnew=T1CT2.
  • 17. The distributed ledger/blockchain system of claim 16 wherein said result ciphertext multivector (CR) is processed instead of said message ciphertext multivector (C).
  • 18. The distributed ledger/blockchain system of claim 15 wherein operations of each of said data owner node device, said calculation node device, and said new data owner node device are performed by one or more hardware devices as desired by a user.
CROSS REFERENCE TO RELATED APPLICATIONS

This application is based upon and claims the benefit of U.S. provisional application Ser. No. 63/063,719, filed Aug. 10, 2020, entitled “Towards a Somehwat Homomorphic Key Update Protocol based on Clifford Geometric Algebra for Distributed Ledger Technology,” all of which is also specifically incorporated herein by reference for all that it discloses and teaches.

Provisional Applications (1)
Number Date Country
63063719 Aug 2020 US