This patent application claims the benefit and priority of Chinese Patent Application No. 202310354833.7, filed with the China National Intellectual Property Administration on Apr. 4, 2023, the disclosure of which is incorporated by reference herein in its entirety as part of the present application.
The present disclosure relates to the field of data security, and in particular, to an anti-malicious method, device and medium for secure three-party computation.
As the society enters a data element era, the international situation becomes more uncertain, and data element problems become more complex. In the field of privacy computing, personal data security, data analysis and circulation within and among enterprises, and global data cross-border transactions face various challenges. With security standardization and normalization of relevant industry departments, this technology has been steadily developed. Secure multi-party computation is a common technical solution in the field of privacy computing, and plays an important role especially in commercial applications of emerging digital industries such as data governance, data collaboration, and artificial intelligence.
Most of existing secure multi-party computation technologies are based on a hypothesis of a semi-honest model. That is, in a computational task, participants all run according to a rule and a specified protocol. After the computation, each participant can only obtain a data result of the participant itself, and cannot learn of any input/output information about another participant. Since an entire process includes a computation phase and a communication phase, there is a risk of data leakage. First, in a computation phase, although a participant runs strictly according to a protocol instruction, a curious semi-honest node uses an intermediate computational result in an execution protocol to inversely estimate raw data information of another participant. Second, in a communication phase after the computation is completed, there is a case in which participant nodes cooperate with each other, and raw data information of remaining participants is estimated by sharing data with each other. Therefore, different secure multi-party participation solutions respectively adopt the following mainstream solutions according to respective technical features:
In summary, data security of an existing secure multi-party computation solution needs to be improved.
Based on this, embodiments of the present disclosure provide an anti-malicious method, device and medium for secure three-party computation, so as to improve data security of the secure three-party computation.
To achieve the above objective, the present disclosure provides the following technical solutions.
An anti-malicious method for secure three-party computation includes:
An electronic device is provided, including a memory and a processor, where the memory is configured to store a computer program, and the processor runs the computer program, to enable the electronic device to perform the above-described anti-malicious method for secure three-party computation.
A computer-readable storage medium is provided, where the computer-readable storage medium stores a computer program, and the computer program is executed by a processor to implement the above-described anti-malicious method for secure three-party computation.
According to specific embodiments provided in the present disclosure, the present disclosure provides the following technical effects:
Embodiments of the present disclosure provide an anti-malicious method, device and medium for secure three-party computation, separately add corresponding security constraints to a computational process in which a collusion behavior exists and no collusion behavior exists. The security constraints implement a constraint on a rank of an internal matrix, so that any participant in the computational process cannot predict private data matrices of another two participants, thereby improving data security of the secure three-party computation, which is applicable to a scenario in which a security requirement is relatively high.
To describe technical solutions in embodiments of the present disclosure or in the conventional technology more clearly, the following briefly describes accompanying drawings required for describing embodiments. Apparently, the accompanying drawings in the following description merely show some embodiments of the present disclosure, and a person of ordinary skill in the art may still derive other drawings from these accompanying drawings without creative efforts.
Technical solutions of embodiments of the present disclosure are clearly and fully described below with reference to accompanying drawings. Apparently, the described embodiments are merely a part rather than all of the embodiments of the present disclosure. All other embodiments obtained by a person of ordinary skill in the art based on the embodiments of the present disclosure without creative efforts shall fall within the protection scope of the present disclosure.
To make the foregoing objectives, features, and advantages of the present disclosure clearer and more comprehensible, the present disclosure will be further described in detail below with reference to the accompanying drawings and specific implementations.
With reference to
Step 101: Determine a first private data matrix, a second private data matrix, and a third private data matrix.
The first private data matrix is in n×s dimensions, held by a first participant, and stored in a compute node thereof, the second private data matrix is in s×t dimensions, held by a second participant, and stored in a compute node thereof, and the third private data matrix is in t×m dimensions, held by a third participant, and stored in a compute node thereof.
Step 102: The first participant receives a first random matrix pair generated by a commodity server node, the second participant receives a second random matrix pair generated by the commodity server node, and the third participant receives a third random matrix pair generated by the commodity server node.
The first random matrix pair includes a first random matrix in n×s dimensions and a second random matrix in n×m dimensions, the second random matrix pair includes a third random matrix in s×t dimensions and a fourth random matrix in n×m dimensions, and the third random matrix pair includes a fifth random matrix in t×m dimensions and a sixth random matrix in n×m dimensions.
A sum of the second random matrix, the fourth random matrix, and the sixth random matrix is equal to a product of the first random matrix, the third random matrix, and the fifth random matrix.
Step 103: Determine whether a collusion behavior exists when the first participant, the second participant, and the third participant perform a secure computation process.
When no collusion behavior exists, go to step 104: Perform a first computational process that meets a first security constraint, a second computational process without a security constraint, or a third computational process that meets a second security constraint, to obtain a first output matrix output by the first participant, a second output matrix output by the second participant, and a third output matrix output by the third participant.
When a collusion behavior exists, go to step 105: Perform a first computational process that meets a third security constraint, a second computational process that meets a fourth security constraint, or a third computational process that meets a fifth security constraint, to obtain a first output matrix output by the first participant, a second output matrix output by the second participant, and a third output matrix output by the third participant.
Step 106: A computation requestor obtains the first output matrix, the second output matrix, and the third output matrix, and performs an operation on the first output matrix, the second output matrix, and the third output matrix according to a target requirement. Specifically,
The first computational process includes the following steps:
(10) The first participant computes a first internal matrix according to a formula Â=A+Ra, and sends the first internal matrix to the second participant. Â represents the first internal matrix, A represents the first private data matrix, and Ra represents the first random matrix.
(11) The third participant computes a third internal matrix according to a formula Ĉ=C+Rc, and sends the third internal matrix to the second participant, where Ĉ represents the third internal matrix, C represents the third private data matrix, and Rc represents the fifth random matrix.
(12) The second participant computes a second internal matrix, a second M matrix, a first intermediate term matrix, a second intermediate term matrix, a third intermediate term matrix, and a fourth intermediate term matrix according to formulas {circumflex over (B)}=B+Rb, Mb=·Rb·Ĉ, φ1=·{circumflex over (B)}, γ1=·Rb, φ1={circumflex over (B)}·Ĉ and γ2=Rb·Ĉ, sends the third intermediate term matrix and the fourth intermediate term matrix to the first participant, and sends the first intermediate term matrix and the second intermediate term matrix to the third participant, where {circumflex over (B)} represents the second internal matrix, Mb represents the second M matrix, φ1 represents the first intermediate term matrix, γ1 represents the second intermediate term matrix, φ2 represents the third intermediate term matrix, γ2 represents the fourth intermediate term matrix, B represents the second private data matrix, and Rb represents the third random matrix.
(13) The first participant computes a first S matrix and a first M matrix according to formulas Sa=Ra·γ2=Ra·RbĈ and Ma=A·φ2=A·{circumflex over (B)}·Ĉ, where Sa represents the first S matrix, and Ma represents the first M matrix.
(14) The third participant computes a third S matrix and a third M matrix according to formulas Sc=γ1·Rc=ÂRb·Rc and Mc=φ1·Rc=·{circumflex over (B)}·Rc, where Sc represents the third S matrix, and Mc represents the third M matrix.
(15) The second participant splits the second internal matrix into a column full rank matrix and a row full rank matrix by means of full rank decomposition, sends the column full rank matrix to the first participant, and sends the row full rank matrix to the third participant.
(16) The first participant generates a first output matrix based on the column full rank matrix, computes a first T matrix and a first t matrix according to formulas Ta=Ma+Sa−Va−ra and t1=RaB1, and sends the first T matrix and the first t matrix to the second participant, where Ta represents the first T matrix, t1 represents the first t matrix, Va represents the first output matrix, ra represents the second random matrix, B1 represents the column full rank matrix, and a space in which B1 belongs to is in s×r dimensions.
(17) The third participant computes a second t matrix according to a formula t2=B2Rc, and sends the second t matrix to the second participant, where t2 represents the second t matrix, B2 represents the row full rank matrix, and a space in which B2 belongs to is in r×t dimensions.
(18) The second participant generates a second output matrix based on the first T matrix, the first t matrix, and the second t matrix, computes a second S matrix according to a formula Sb=t1·t2=RaB1·B2Rc=Ra{circumflex over (B)}Rc, computes a second T matrix through a formula Tb=Ta−Mb+Sb−Vb−rb according to the second S matrix, and sends the second T matrix to the third participant, where Sb represents the second S matrix, Tb represents the second T matrix, Vb represents the second output matrix, and rb represents the fourth random matrix.
(19) The third participant computes a third output matrix according to a formula Vc=Tb−Mc+Sc−rc, where Vc represents the third output matrix, and Rc represents the sixth random matrix.
The second computational process includes the following steps:
(20) The first participant computes a first internal matrix according to a formula Â=A+Ra, and sends the first internal matrix to the second participant.
(21) The third participant computes a third internal matrix according to a formula Ĉ=C+Rc, and sends the third internal matrix to the second participant.
(22) The second participant computes a second internal matrix, a second M matrix, a first intermediate term matrix, a second intermediate term matrix, a third intermediate term matrix, and a fourth intermediate term matrix according to formulas {circumflex over (B)}=B+Rb, Mb=·Rb·Ĉ, φ1=·{circumflex over (B)}, γ1=·Rb, φ1={circumflex over (B)}·Ĉ and γ2=Rb·Ĉ, sends the third intermediate term matrix and the fourth intermediate term matrix to the first participant, and sends the first intermediate term matrix and the second intermediate term matrix to the third participant.
(23) The first participant computes a first S matrix and a first M matrix according to formulas Sa=Ra·γ2=Ra·RbĈ and Ma=A·φ2=A·{circumflex over (B)}·Ĉ.
(24) The third participant computes a third S matrix and a third M matrix according to formulas Sc=γ1·Rc=ÂRb·Rc and Mc=φ1·Rc=·{circumflex over (B)}·Rc.
(25) The second participant splits the second internal matrix into a column full rank matrix and a row full rank matrix by means of full rank decomposition, sends the column full rank matrix to the first participant, and sends the row full rank matrix to the third participant.
The column full rank matrix and the row full rank matrix meet a constraint condition r({circumflex over (B)})=r(B1)=r(B2)=r.
(26) The first participant generates a first output matrix based on the column full rank matrix, computes a first T matrix and a first t matrix according to formulas Ta=Ma+Sa−Va−ra and t1=RaB1, sends the first T matrix to the second participant, and sends the first t matrix to the third participant.
(27) The third participant computes a second t matrix according to a formula t2=B2Rc.
(28) The second participant generates a second output matrix based on the first T matrix, computes a second T matrix according to a formula Tb=Ta−Mb−Vb−rb, and sends the second T matrix to the third participant.
(29) The third participant computes a second S matrix according to a formula Sb=t1·t2=RaB1·B2Rc=Ra{circumflex over (B)}Rc, and computes a third output matrix through a formula Vc=Tb+Sb−Mc+Sc−rc according to the second S matrix and the second T matrix.
The third computational process includes the following steps:
(30) The first participant computes a first internal matrix according to a formula Â=A+Ra, and sends the first internal matrix to the second participant.
(31) The third participant computes a third internal matrix according to a formula Ĉ=C+Rc, and sends the third internal matrix to the second participant.
(32) The second participant computes a second internal matrix, a second M matrix, a first intermediate term matrix, a second intermediate term matrix, a third intermediate term matrix, and a fourth intermediate term matrix according to formulas {circumflex over (B)}=B+Rb, Mb=·Rb·Ĉ, φ1=·{circumflex over (B)}, γ1=·Rb, φ1={circumflex over (B)}·Ĉ and γ2=Rb·Ĉ, sends the third intermediate term matrix and the fourth intermediate term matrix to the first participant, and sends the first intermediate term matrix and the second intermediate term matrix to the third participant.
(33) The first participant computes a first S matrix and a first M matrix according to formulas Sa=Ra·γ2=Ra·RbĈ and Ma=A·φ2=A·{circumflex over (B)}·Ĉ.
(34) The third participant computes a third S matrix and a third M matrix according to formulas Sc=γ1·Rc=ÂRb·Rc and Mc=φ1·Rc=·{circumflex over (B)}·Rc.
(35) The second participant sends the second internal matrix to the first participant.
(36) The first participant generates a first output matrix based on the second internal matrix, computes a first T matrix and a first t matrix according to formulas Ta=Ma+Sa−Va−ra and t1=Ra{circumflex over (B)}, sends the first T matrix to the second participant, and sends the first t matrix to the third participant.
(37) The third participant computes a second S matrix according to a formula Sb=t·Rc=Ra{circumflex over (B)}Rc.
(38) The second participant generates a second output matrix based on the first T matrix, computes a second T matrix according to a formula Tb=Ta−Mb−Vb−rb, and sends the second T matrix to the third participant.
(39) The third participant computes a third output matrix according to a formula Vc=Tb+Sb−Mc+Sc−rc.
The first security constraint is r({circumflex over (B)})<min{s,t} the second security constraint is r({circumflex over (B)})<s, the third security constraint is r(Â)<s, r({circumflex over (B)})<min{s,t} and r(Ĉ)<t, the fourth security constraint is r(Â)<s, r(Ĉ)<t and r(B1)<s, the fifth security constraint is r(Â)<s, r({circumflex over (B)})<s and, r(Ĉ)<t, and r( ) represents a rank of a matrix.
In actual application, the following further describes the foregoing embodiments in detail by using an example in which a product of a three-party matrix is computed.
First, a secure three-party matrix multiplication problem is defined.
It is known that there are three participants Alice, Bob, Carol, independent of each other and not trusted to each other, Alice holds a private data matrix A which has a dimension of n×s and is only stored at its own compute node, Bob holds a private data matrix B which has a dimension of s×t, and Carol holds a private data matrix C which has a dimension of t×m. The three participants perform three-party matrix multiplication protocol computation f(A, B, C)=ABC=Va+Vb+Vc, and finally respectively corresponding dimensions obtained by each computation participant node are output matrices Va, Vb, Vc of n×m, and are sent to a computation requestor to aggregate to obtain a desired three-party matrix product result. During a computational process, each participant node can only know its own input/output information, and cannot obtain an intermediate settlement result and holding data information of another participant. A specific schematic diagram is shown in
With reference to
Step 1: An auxiliary compute node, also referred to as a commodity server (Commodity Server, CS) node, generates three groups of random matrix pairs. Specific forms of the three groups of random matrix pairs are: a random matrix Rc with a dimension of n×s, a random matrix with a dimension of s×t, a random matrix Rc with a dimension of t×m, and three random matrices ra, rb, rc with a dimension of n×m. The following constraint ra+rb+rc=Ra·Rb·Rc needs to be strictly met among these random matrices. Then, the CS auxiliary node sends a random matrix pair (Ra, ra) to the participant Alice compute node, sends a random matrix pair (Rb, rb) to the participant Bob compute node, and sends a random matrix pair (Rc, rc) to the participant Carol compute node. When an entire computation protocol is performed, the CS auxiliary node needs to strictly meet the following three requirements: (1) Not contact private data information related to Alice, Bob and Carol, whether an input or output result of an intermediate computational process. (2) Not collude with any participant compute node. (3) Strictly follow a protocol process to correctly perform an assigned subtask. The CS auxiliary node does not directly participate in a subsequently actual computational process of the secure three-party multiplication, only provides a random matrix pair that is independent of a private data matrix at an initial phase for performing a protocol, thereby protecting information of a private matrix of a participant and ensuring security of raw data in a subsequent computational process. Therefore, the auxiliary node CS may generate a large quantity of mutually independent random matrix pairs offline in advance, and send random seeds to Alice, Bob, and Carol compute nodes in an initial trial phase for performing a protocol in a manner similar to commodity sale, so that the compute node can obtain corresponding random matrix information, and the commodity server CS gets its name.
Step 2: After receiving a corresponding random matrix pair (Ra, ra) the participant Alice computes Â=A+Ra inside a node and sends it to a participant node Bob.
Step 3: After receiving a corresponding random matrix pair (Rc, rc) the participant Carol computes Ĉ=C+Rc inside a node and sends it to a participant node Bob.
Step 4: The participant Bob computes {circumflex over (B)}=B+Rb, Mb=·Rb·Ĉ inside a node of the participant Bob, sends φ1=·{circumflex over (B)} and γ1=·Rb to the node Carol, and sends φ1={circumflex over (B)}·Ĉ and γ2=Rb·Ĉ to the node Alice.
Step 5: After receiving the matrix φ2, γ2 sent from the Bob node, the participant Alice node successively computes Sa=Ra·γ2=Ra·RbĈ and Ma=A·φ2=A·{circumflex over (B)}·Ĉ locally.
Step 6: After receiving the matrix φ1, γ1 sent from the Bob node, the participant Carol node successively computes Sc=γ1·Rc=ÂRa·Rc and Mc=φ1·Rc=·{circumflex over (B)}·Rc locally.
Step 7: The participant Bob node internally splits the matrix {circumflex over (B)} by means of full rank decomposition, and two submatrices obtained after decomposition are a column full rank matrix B1∈s×r and a row full rank matrix B2∈r×t, where ranks of a non-zero matrix {circumflex over (B)} and split matrices B1, B2 meet a constraint condition r({circumflex over (B)})=r(B1)=r(B2)=r. The node Bob sends the matrix B1 to the node Alice, and sends the matrix B2 to the node Carol.
Step 8: After receiving the matrix B1 from the Bob node, the participant Alice node internally generates a random matrix of Va∈n×m secretly, computes locally Ta=Ma+Sa−Va−ra and t1=RaB1, and sends Ta and t1 to the Bob node.
Step 9: After receiving the matrix B2 from the Bob node, the participant Carol node secretly computes t2=B2Rc, and sends a result t2 to the Bob node.
Step 10: After receiving the matrix Ta and t1 sent from the Alice node and the matrix t2 sent from the Carol node, the participant Bob node internally generates a random matrix Vb∈n×m secretly, and secretly computes the matrix Mb=·Rb·Ĉ and Sb=t1·t2=RaB1·B2Rc=Ra{circumflex over (B)}Rc locally, finally obtains Tb=Ta−Mb+Sb−Vb−rb, and sends it to the Carol node.
Step 11: After receiving Tb, the participant Carol node secretly computes and obtains a matrix Vc=Tb−Mc+Sc−rc locally.
Step 12: The participants Alice node, Bob node, and Carol node separately sends final obfuscation split results Va, Vb and Vc corresponding to the three participants to a three-party matrix multiplication computation requestor, and the final obfuscation split results are aggregated by the three-party matrix multiplication computation requestor to obtain a final product ABC=Va+Vb+Vc.
It can be readily verified that
Next, an anti-malicious mechanism of secure three-party matrix multiplication is introduced.
Generally, a secure multi-party computation problem in different privacy computing application scenarios involves a security hypothesis, including a capability of an opponent, a behavior, and a quantity of malicious nodes in a system. Only privacy computing protocols established under a corresponding security hypothesis have a meaning of security interpretability. Generally, a security behavior model is classified into a semi-honest model and a malicious model according to whether a malicious participant exists in the model. In a semi-honest model, all semi-honest or honest nodes participate in a multi-party computation protocol honestly, and strictly follow the protocol procedure to perform each step. However, some semi-honest, unilateral curious participant nodes attempt to infer raw data privacy information of other participant nodes through content obtained during a process of performing the protocol. Another passive attack behavior existing in the semi-honest model is that after a plurality of participant nodes are corrupted, information about each other is shared in a conspiracy and collusion manner to obtain data information about remaining honest participant nodes. The malicious model is a malicious participant that has an active attack in the participant node. This type of participant does not strictly follow a protocol execution procedure, and may maliciously tamper with input and output results of an intermediate computation, or even terminate the protocol. In the malicious model, the attack behavior is active. At present, under constraints of various data security laws and network security protection laws, extremely malicious attacks generally occur rarely. However, a large quantity of cryptography techniques are required for security computation under the malicious model. This also sacrifices computation efficiency and causes great communication overheads while pursuing high security. This is an uncompensated loss for some commercial acts that require no extreme confidentiality for security levels. Therefore, this solution is mainly based on a premise of the semi-honest model. An opponent evaluates and guarantees computational security of the model by using a malicious means of which a single node curiously pries into raw data or two nodes collude to infer the raw data as an analysis target.
In this embodiment, in an initial phase, raw data of participant nodes have been protected by means of information encryption by means of adding random matrices provided by the CS auxiliary node in an obfuscation manner, and a final result is also split into three random result submatrices through a random obfuscation mechanism, so as to ensure security of a computational result. Therefore, a step in which a security risk exists is mainly reflected in step 7 to step 10 of interactions among three-party data of Alice, Bob, and Carol in an intermediate computational process. An ultimate purpose of these steps is to construct an effective intermediate computing component Ra{circumflex over (B)}Rc. Therefore, in a process of disassembling and merging the item, there is a risk of data leakage caused by a plurality of interactions of three-party intermediate results. There are five construction solutions for this item. As shown in
(1) Security Analysis without Collusion
When three participant compute nodes do not perform a collusion behavior in a security computation process, because the protocol is performed asynchronously and cooperatively, a step involving a security risk needs to be analyzed only to step 10. In this case, each participant node does not need to exchange a key information item involving raw data. Therefore, for solution 1, when a node performs step 10, a data item held by the participant node Alice is Â, {circumflex over (B)}Ĉ, RbĈ, B1, a data item held by the participant node Bob is Â, {circumflex over (B)}, Ĉ, RaB1, B2Rc, B1, B2, and a data item held by the participant Carol is Ĉ, Â{circumflex over (B)}, ÂRb, Ba. Therefore, according to data item distribution of each node, it can be observed that due to lack of information about private matrices of Bob and Carol, it is impossible for the node Alice to infer raw data matrices B, C of Bob and Carol. Similarly, Carol cannot infer private matrix information of the other two participants. Bob may inversely infer related information of a private matrix Ra involving Alice and a private matrix Rc involving Carol according to held data items RaB1, B2Rc and B1, B2.
indicates data missing or illegible when filed
Herein, a rank constraint method is introduced to ensure that a single node cannot reversely infer target matrix information according to a single coefficient matrix in a matrix equation. A principle thereof is that a necessary and sufficient condition of having matrices A∈m×t, B∈m×n which make a matrix equation AX=B have an infinity solution is that ranks of the matrices A, B strictly meet r(A)=r(A:B)<t. Similarly, a necessary and sufficient condition of a matrix equation XA=B having an infinity solution is that ranks of the matrices A, B strictly meet r(A)<m. Therefore, in order to prevent Bob from reversely inferring Ra and Rc according to RaB1, B2Rc and B1, B2, further inferring information of primitive matrices A, C of Alice and Carol according to Â, Ĉ only a rank constraint condition r({circumflex over (B)})<min{s,t} is required. A specific operation manner is that when {circumflex over (B)} is computed in step 4, a restriction is performed, if a rank constraint condition is met, a next process can be performed, and otherwise, a generation step of a random matrix Rb is re-performed until the condition is met. Similarly, for solution 2, it can be learned that a rank constraint condition of solution 2 is ø by using the foregoing analysis manner, indicating that it is ensured that a single node cannot infer, according to held data item information, raw data information held by another participant in a computational process without adding a constraint condition. Similarly, for solution 3, in the foregoing analysis manner, it can be seen that due to lack of key information involving raw data items Ra, Rc of the other two participants, the node Bob cannot infer matrix information of raw data A, C of the other two participants, and similarly, Carol cannot infer matrix information of A, B. Because Alice holds intermediate data items {circumflex over (B)}Ĉ, {circumflex over (B)}, RbĈ, it needs to be prevented that after inferring Ĉ through {circumflex over (B)}Ĉ, {circumflex over (B)}, then Alice and an intermediate item RbĈ infer a private matrix B of Bob. Therefore, according to the foregoing rank constraint method, it is only required to ensure r({circumflex over (B)})<t, that is, only the matrix {circumflex over (B)} needs to be ensured to meet a condition of column non-full rank when a procedure in step 4 is performed. It can be learned from Table 1 that, when there is no collusion node colluding malicious behaviors, solution 2 in the three solutions of the secure three-party multiplication problem is optimal, because solution 2 has a relatively small quantity of communication rounds and does not need to introduce a rank constraint with relatively high computational overheads.
(2) Security Analysis with Collusion
When there is a collusion behavior of two-party nodes in a procedure in which three participant compute nodes perform a security computation, a risk of data disclosure to the remaining participant node is further increased. Similar to the foregoing analysis manner, the analysis is still performed from step 7 to step 10, and data item information held by participant nodes Alice, Bob, and Carol in this step is sorted and arranged into a malicious behavior analysis table shown in table 2. With reference to the information shown in solution 1 in the table, it can be seen that when the computation participants Alice and Bob collude and share data, because the computation participants Alice and Bob hold an intermediate computational result B2Rc and a submatrix B1, there is a risk of inferring a secret matrix Rc held by Carol through collusion. Once Rc leaks, Alice and Bob may obtain primitive matrix information of Carol according to Ĉ−Rc. Therefore, a rank constraint r(B2)<t is added when step 7 is performed to ensure raw data security of Carol. Similarly, after Alice and Carol colludes, because Alice and Carol hold data items RbĈ, ÂRb and Â, Ĉ, there is a risk of reversely inferring Rb, and then inferring a secret matrix B of Bob through {circumflex over (B)}−Rb. Therefore, rank constraints r(Â)<s, r(Ĉ)<t needs to be added after step 2 and step 3 are performed, to perform a security check. If a rank constraint condition is met, perform an execution procedure of the following step, and otherwise, terminate and go back to the previous step, to cyclically perform. Similarly, when the participant nodes Bob and Carol collude, because the participant nodes Bob and Carol jointly hold an intermediate computation data item RaB1 and the submatrix B1, there is a risk of inversely inferring a secret matrix Ra of Alice by Bob and Carol. Therefore, a security check of a rank constraint r(B1)<s needs to be added after step 7 is performed. Because submatrices B1, B2 are obtained by means of full rank decomposition, constraints of the submatrices are equivalent to r({circumflex over (B)})=r(B1)=r(B2)<min{s,t} with reference to the foregoing, that is, only a rank constraint r({circumflex over (B)})<min{s,t} needs to be added after step 4 is performed. This is equivalent to that an obfuscated primitive matrix {circumflex over (B)} is required to meet a non-full rank condition. For solution 2, the collusion of participant nodes Alice and Bob lacks a critical matrix Rc involving Carol. Therefore, a secret matrix C of Carol cannot be inferred, and no constraint needs to be added. When the participants Alice and Carol collude, because the participants Alice and Carol jointly hold data items RbĈ, ÂRb and Â, Ĉ, there is a risk of reversely inferring Rb and then inferring the secret matrix B of Bob through {circumflex over (B)}−Rb, and a rank constraint r(Â)<s, r(Ĉ)<t needs to be added after step 2 and step 3 are performed to perform a security check. If the rank constraint condition is met, perform an execution procedure of the following step, and otherwise, terminate and go back to the previous step to cyclically perform. When the participant nodes Bob and Carol collude, because the participant nodes Bob and Carol jointly hold an intermediate computation data item RaB1 and the submatrix B1, there is a risk of inversely inferring a secret matrix Ra of Alice by Bob and Carol in collusion, and a rank constraint r(B1)<s needs to be added after step 7 is performed to perform a row non-full rank security check on the matrix B1. For solution 3, the collusion of the participant nodes Alice and Bob lacks the critical matrix Rc involving Carol. Therefore, the secret matrix C of Carol cannot be inferred, and no constraint needs to be added. When the participants Alice and Carol collude, because the participants Alice and Carol jointly hold data items RbĈ, ÂRb and Â, Ĉ, there is a risk of reversely inferring Rb, and then inferring the secret matrix B of Bob through {circumflex over (B)}−Rb, and the rank constraint r(Â)<s, r(Ĉ)<t needs to be added after step 2 and step 3 are performed to perform a column non-full rank security check on a matrix  and a row non-full rank security check on a matrix Ĉ. If the rank constraint condition is met, perform an execution procedure of the following step, and otherwise, terminate and go back to the previous step to cyclically perform. When the participant nodes Bob and Carol collude, because the participant nodes Bob and Carol jointly hold the intermediate computation data item Ra{circumflex over (B)} and the submatrix {circumflex over (B)}, there is a risk of inversely inferring the secret matrix Ra of Alice by Bob and Carol in collusion, and the rank constraint r({circumflex over (B)})<s needs to be added after step 7 is performed to perform a row non-full rank security check on the matrix {circumflex over (B)}.
indicates data missing or illegible when filed
The foregoing embodiments have the following technical effects:
An essential difference among three types of technical solutions lies in different splitting and merging execution methods are used about constructing an intermediate data item Ra{circumflex over (B)}Rc, and a specific execution procedure thereof is embodied in step 7 to step 11 in a secure three-party matrix multiplication procedure. More specifically,
Step 7: The participant Bob node internally splits the matrix {circumflex over (B)} by means of full rank decomposition, and two submatrices obtained after decomposition are a column full rank matrix B1∈s×r and a row full rank matrix B2∈r×t, where ranks of a non-zero matrix {circumflex over (B)} and split matrices B1, B2 meet a constraint condition r({circumflex over (B)})=r(B1)=r(B2)=r. The node Bob sends the matrix B1 to the node Alice, and sends the matrix B2 to the node Carol.
Step 8: After receiving the matrix B1 from the Bob node, the participant Alice node internally generates a random matrix of Va∈n×m secretly, computes locally Ta=Ma+Sa−Va−ra and t=RaB1, and sends Ta and t1 to the Bob node.
Step 9: After receiving the matrix B2 from the Bob node, the participant Carol node secretly computes t2=B2Rc, and sends a result t2 to the Bob node.
Step 10: After receiving the matrix Ta and t1 sent from the Alice node and the matrix t2 sent from the Carol node, the participant Bob node internally generates a random matrix Vb ∈n×m secretly, and secretly computes the matrix Mb=·Rb·Ĉ and Sb=t1·t2=RaB1·B2Rc=Ra{circumflex over (B)}Rc locally, finally obtains Tb Ta−Mb+Sb−Vb−rb and sends it to the Carol node.
Step 11: After receiving Tb, the participant Carol node secretly computes and obtains a matrix Vc=Tb−Mc+Sc−rc locally.
Technical means and effects of solution 1 are as follows:
In solution 1, obfuscated primitive matrices Â, {circumflex over (B)}, Ĉ are checked first to determine whether the primitive matrices meet rank constraints r(Â)<s, r({circumflex over (B)})<min{s,t}, r(Ĉ)−t, that is, it is required that matrices Â, Ĉ meet a row non-full rank condition and the matrix {circumflex over (B)} meets a non-full rank condition before decomposition. After the condition constraint is met, the matrix {circumflex over (B)} is decomposed into two submatrices B1, B2 by means of full rank decomposition, and then, after being modulated by two participant nodes Alice and Carol, RaB1, B2Rc carrying data information of Alice and Carol are used as computation intermediate items to be jointly transferred to the participant Bob, and are aggregated by the participant Bob to obtain a final target component Ra{circumflex over (B)}Rc. Because a rank constraint is added before the solution is performed, raw data information of another third participant cannot be inferred even if two-party nodes collude to share data information in a computational process. This is because when a rank constraint condition is met, solution space is an infinite set, so that real information of raw data can be effectively protected. In addition, because the constraint condition is a strong constraint and meets a constraint condition that a single node cannot inversely infer raw data information of another participant in a case of no collusion, solution 1 that meets the constraint can ensure security of a model against passive attack regardless of whether a collusion node exists. From a perspective of an application scenario, because the constraint condition may be checked before matrix decomposition, the solution is more suitable for a scenario in which a security requirement is relatively high from a perspective of higher prior security and easier to add a constraint operation.
A specific execution procedure of solution 2a or 2b is as follows:
Step 7: The participant Bob node internally splits the matrix {circumflex over (B)} by means of full rank decomposition, and two submatrices obtained after decomposition are a column full rank matrix B1∈s×r and a row full rank matrix B2∈r×t, where ranks of a non-zero matrix {circumflex over (B)} and split matrices B1, B2 meet a constraint condition r({circumflex over (B)})=r(B1)=r(B2)=r. The node Bob sends the matrix B1 to the node Alice, and sends the matrix B2 to the node Carol.
Step 8: After receiving the matrix B1 from the Bob node, the participant Alice node internally generates a random matrix of Va∈n×m secretly, computes locally Ta=Ma+Sa−Va−ra and t1=RaB1, sends Ta to the node Bob and sends t1 to the node Carol.
Step 9: After receiving the matrix B2 from the Bob node, the participant Carol node secretly computes t=B2Rc.
Step 10: After receiving the matrix Ta sent from the Alice node, the participant Bob node internally generates a random matrix Vb∈n×m secretly, secretly computes a matrix Mb=·Rb·Ĉ locally, finally obtains Tb=Ta−Mb−Vb−rb, and sends it to the Carol node.
Step 11: After receiving the matrix t1 sent from the Alice node and the matrix Tb sent from the Bob node, the participant Carol node secretly computes Sb=t1·t2=RaB1·B2Rc=Ra{circumflex over (B)}Rc and the matrix Vc=Tb+Sb−Mc+Sc−ra locally.
Technical means and effects of solution 2a or 2b are as follows:
In solution 2, a matrix {circumflex over (B)} is decomposed into two submatrices B1, B2 by means of full rank decomposition, and then, after being modulated by two participant nodes Alice and Carol, RaB1 carrying data information of Alice is used as a computation intermediate item to be jointly transferred to the participant Carol, and are aggregated by the participant Carol to obtain a final target component Ra{circumflex over (B)}Rc. In this solution, when there is no collusion behavior, there is no need to add any rank constraint, and security can be ensured because raw data information of another participant cannot be inversely inferred by a single node according to an intermediate computational result. Therefore, when a security requirement is relatively low and a trusted execution environment is involved, this solution can be adopted to ensure computational security in a case of no collusion.
However, for a scenario in which collusion exists, a constraint condition similar to that in solution 1 needs to be adopted, and obfuscated primitive matrices Â, Ĉ need to be checked first to determine whether the primitive matrices meet rank constraint r(Ä)<s, r({umlaut over (C)})<t, that is, it is required that the matrices Â, Ĉ meet a row non-full rank condition before decomposition. After the condition constraint is met, the matrix {circumflex over (B)} is decomposed into two submatrices B1, B2 by means of full rank decomposition. However, different from solution 1, a posterior rank constraint needs to be performed in solution 2. When the decomposed submatrix B1 meets a rank constraint condition r({circumflex over (B)})<s, step 7 is allowed to continue, or step 7 is not allowed to continue until the decomposition meets this condition.
A specific execution procedure of solution 3a or 3b is as follows:
Step 7: The participant Bob node sends the matrix {circumflex over (B)} to the node Alice.
Step 8: After receiving the matrix {circumflex over (B)} from the Bob node, the participant Alice node internally generates a random matrix of Va∈n×m secretly, computes locally Ta=Ma+Sa−Va−ra and t=Ra{circumflex over (B)}, sends Ta to the node Bob and sends t to the node Carol.
Step 9: After receiving the matrix t from the Alice node, the participant Carol node secretly computes Sb=t·Rc=Ra{circumflex over (B)}Rc.
Step 10: After receiving the matrix Ta sent from the Alice node, the participant Bob node internally generates a random matrix Vb∈n×m secretly, secretly computes a matrix Mb=·Rb·Ĉ locally, finally obtains Tb=Ta−Mb−Vb−rb, and sends it to the Carol node.
Step 11: After receiving the matrix Tb sent from the Bob node, the participant Carol node secretly computes the matrix Vc Tb+Sb+−Mc+Sc−rc locally.
Technical means and effects of solution 3a or 3b are as follows:
In solution 1, obfuscated primitive matrices Â, {circumflex over (B)}, Ĉ are checked first to determine whether the primitive matrices meet rank constraints r(Â)<s, r({circumflex over (B)})<s, r(Ĉ)<t, that is, it is required that matrices Â, {circumflex over (B)}, Ĉ meet a row non-full rank condition before decomposition. After the condition constraint is met, the matrix is not decomposed, and directly transfers Ra{circumflex over (B)} carrying data information of Alice used as a computation intermediate item to the participant Carol, and are aggregated by the participant Carol to obtain a final target component Ra{circumflex over (B)}Rc. Because a rank constraint is added before the solution is performed, raw data information of another third participant cannot be inferred even if two-party nodes collude to share data information in a computational process. This is because when a rank constraint condition is met, solution space is an infinite set, so that real information of raw data can be effectively protected. In addition, because the constraint condition is a strong constraint and meets a constraint condition that a single node cannot inversely infer raw data information of another participant in a case of no collusion, solution 3 that meets the constraint can ensure security of a model against passive attack regardless of whether a collusion node exists. From a perspective of an application scenario, because the constraint condition may be checked before matrix decomposition, and a quantity of communication rounds in an entire procedure is less than the other two solutions, the solution is more suitable for a scenario in which a security requirement is relatively high and a computing resource is relatively limited from a perspective of higher prior security and lower communication overheads.
In any one of the foregoing solutions, there are many practical scenarios for specific application. Herein, a financial industry is used as an example for description. It is assumed that three financial institutions Alice, Bob, and Carol exist, where an Alice organization has a private matrix A with a dimension of n×m1, n indicates that the bank has n customer samples, and m1 indicates that these samples involve m1 characteristic attributes of the customer (for example, parameters related to revenue and tax). A Bob organization has a private matrix B with a dimension of n×m2, where n indicates that the bank has n customer samples, and m2 indicates that these samples involve m2 characteristic attributes of the customer (for example, parameters related to liabilities and mortgage loans). A Carol organization has a private matrix C with a dimension of n×m3, where n indicates that the bank has n customer samples, and m3 indicates that these samples involve m3 label attributes of the customer (for example, whether the user is a blacklisted user or whether the user is a high-level customer of the bank). In this case, the superior financial control regulator of three financial organizations hopes to implement joint modeling of the three-party privacy data matrix without exposing respective private customer information of three financial subsidiary companies to each other, to classify loan levels of user groups. The modeling result is obtained by the financial control supervisors of the requestor, and only local model parameter information is obtained by the three participant subsidiary companies. In this computational process, the private matrices A, B, and C that participate in the computation may generate a three-party matrix multiplication A×B×C computation problem in a joint modeling process. This method is used to implement security computation of the process. Without disclosing each other's information, three sub-financial institutions obtain a parameter matrix of local output results Va, Vb, and Vc of joint modeling. When three parameter matrices are transferred to the superior financial control regulator, complete output result information of joint modeling model parameters may be obtained by adding the three parameter matrices.
This embodiment proposes a constraint method for effectively preventing a malicious behavior of which a single-node is curious to inversely infer raw matrix data information of remaining computation participants based on a semi-honest model; proposes a constraint method for effectively preventing a malicious behavior in which after secure three-party multiplication computation is completed, two nodes collude to inversely infer primitive matrix data information of remaining participants based on a semi-honest model, and proposes a set of anti-collusion constraint solutions for secure three-party matrix multiplication computation under different security requirements and computing resource conditions.
The method proposed in this embodiment eliminates a risk of node collusion faced by the conventional technology, such as a secret sharing solution in a semi-honest model with fewer participants (for example, three parties). At the same time, a multi-party security computation problem related to a limitation of three paradox of model security, availability, and low overheads proposes an alternative solution facing different scenarios. Specifically: (1) In a computational process, a security problem in which an existing secure multi-party matrix multiplication technology based on a semi-honest model hypothesis how to effectively ensure that raw participant matrix data cannot be inversely inferred by a curious and semi-honest participant node is solved. (2) A security problem in which an existing secure multi-party matrix multiplication technology how to effectively prevent participant nodes cause raw data leakage due to a corrupted and collusive behavior based on the semi-honest model hypothesis, after computation is completed is solved. (3) An optimal implementation solution in different scenario requirements is proposed to solve constraints of which the existing secure multi-party matrix multiplication technology based on the semi-honest model hypothesis cannot balance three indexes security, availability and low overheads related to model performance.
The present disclosure further provides an electronic device, including a memory and a processor, where the memory is configured to store a computer program, and the processor runs the computer program to enable the electronic device to perform the foregoing the anti-malicious method for secure three-party computation.
The present disclosure further provides a computer-readable storage medium. The computer-readable storage medium stores a computer program, where the computer program is executed by a processor to implement the anti-malicious method for secure three-party computation.
Each embodiment of the present specification is described in a progressive manner, each embodiment focuses on the difference from other embodiments, and the same and similar parts between the embodiments may refer to each other.
Specific examples are used herein to explain the principles and embodiments of the present disclosure. The foregoing description of the embodiments is merely intended to help understand the method of the present disclosure and its core ideas; besides, various modifications may be made by those of ordinary skill in the art to specific embodiments and the scope of application in accordance with the ideas of the present disclosure. In conclusion, the content of the present specification shall not be construed as limitations to the present disclosure.
Number | Date | Country | Kind |
---|---|---|---|
2023103548337 | Apr 2023 | CN | national |