The present invention relates to a secure computation technique. In particular, the present invention relates to a technique for generating a filter that eliminates duplication while maintaining secrecy and a technique for eliminating duplication using the generated filter.
In the field of secure computation techniques, there is a need for a technique that eliminates duplication of attribute values of a certain attribute in a table while maintaining secrecy.
However, in the field of secure computation techniques, a technique is not known which eliminates duplication of attribute values of a certain attribute in a table while maintaining secrecy.
An object of the present invention is to provide a secure deduplication filter generation system that generates a filter for eliminating duplication while maintaining secrecy, a secure deduplication system that uses the generated filter to eliminate duplication while maintaining secrecy, methods for these, a secure computation apparatus, and a program.
A secure deduplication filter generation system according to one aspect of the present invention is a secure deduplication filter generation system comprising a plurality of secure computation apparatuses, wherein assuming that F denotes an arbitrary field, m denotes an integer of two or more, [α] denotes a share where α is securely shared supposing α to be an arbitrary vector, {{β}} denotes a share where β is securely shared supposing β to be an arbitrary permutation, and v denotes an m-dimensional vector v∈Fm, the plurality of secure computation apparatuses include a plurality of permutation calculation parts for generating a share {{σ}} of a permutation σ that stably sorts the vector v in ascending order using a share [v] of the vector v, a plurality of permutation application parts for generating a share [σ(v)] of a vector σ(v) obtained by applying the permutation σ to the vector v using the share [v] and the share {{σ}}, a plurality of vector generation parts for generating a share [e] of a vector e having 1 as an element corresponding to a certain element when the certain element of the vector σ(v) and an element before the certain element are different, and having 0 otherwise using the share [σ(v)], and a plurality of inverse permutation application parts for generating a share [σ−1(e)] of a vector σ−1(e) obtained by applying an inverse permutation σ−1 of the permutation σ to the vector e using the share [e] and the share {{σ}}.
A secure deduplication system according to one aspect of the present invention comprises the secure deduplication filter generation system, wherein the secure computation apparatuses further include deduplication parts for generating a share [vσ−1(e)] of vσ−1(e) using the share [v] and the share [σ−1(e)].
A secure computation apparatus according to one aspect of the present invention is the secure computation apparatus of the secure deduplication filter generation system or the secure deduplication system.
According to the invention, it is possible to generate a filter for eliminating duplication while maintaining secrecy. Also, it is possible to eliminate duplication while maintaining secrecy using the generated filter.
Hereinafter, an embodiment of the present invention will be described in detail. Note that component parts having the same functions in the drawings will be denoted by the same reference numerals, and redundant description will be omitted.
A configuration example of a secure deduplication system and a secure deduplication filter generation system of the embodiment will be described with reference to
A configuration example of the secure computation apparatus 1n (n=1, . . . , N) included in the secure deduplication system and secure deduplication filter generation system will be described with reference to
The permutation calculation part 11n, permutation application part 12n, vector generation part 13n, and inverse permutation application part 14n surrounded by a dashed line inside the secure computation apparatus 1n in
A secure deduplication method and a secure deduplication filter generation method of the embodiment are implemented by performing processing of each step described later while each component part of the secure computation apparatus 1n (1≤n≤N) is cooperating with each component part of the other secure computation apparatuses 1n′ (n′=1, . . . , N, where n≠n′). Step S1 to step S4 is an example of processing of the secure deduplication filter generation method. Step S1 to step S5 is an example of processing of the secure deduplication method.
Note that processing of each step is performed by secure computation. That is, the secure computation apparatus 1n performs the processing of each step without restoring a share, in other words, without knowing contents of the share.
The secure computation apparatus 1n is a special apparatus in which a special program is read into a known or dedicated computer including, for example, a central processing unit (CPU) and a random access memory (RAM). The secure computation apparatus 1n performs each processing, for example, under control of the central processing unit. Data input into the secure computation apparatus 1n or data obtained in each processing is stored in, for example, the random access memory, and the data stored in the random access memory is read out to the central processing unit and used for other processing as needed. At least a part of each processing part of the secure computation apparatus 1n may be made of hardware such as an integrated circuit.
In the following description, it is assumed that [α] is a share where α is securely shared supposing α to be an arbitrary vector, whereas it is assumed that {{β}} is a share where β is securely shared supposing β to be an arbitrary permutation.
A processing procedure of the secure deduplication filter generation method executed by the secure deduplication system of the embodiment will be described with reference to
A share [v] of a vector v is input into the permutation calculation parts 111, . . . , 11N.
The permutation calculation parts 111, . . . , 11N generate {{σ}} of the share [v].
In more detail, the permutation calculation parts 111, . . . , 11N use the share [v] of the vector v to generate the share {{σ}} of a permutation σ that stably sorts the vector v in ascending order (step S1). The stable sorting means that order of equivalent data before sorting is preserved after sorting as well.
The generated share {{σ}}; is output to the permutation application parts 121, . . . , 12N.
F denotes an arbitrary field, and m denotes an integer of two or more. Further, v denotes an m-dimensional vector v∈Fm. The vector v is, for example, a vector composed of attribute values of a certain attribute in a table.
For example, it is supposed that the vector v is a vector composed of attribute values of an attribute of “height” and v=(160, 150, 160, 170, 160). In this case, the permutation σ is as shown in the following Formula (1).
Note that each element of the vector v may be bit-decomposed. That is, each element of the vector v may be represented by bits of 0 and 1. In addition, the vector v may be a vector composed of values obtained by combining attribute values of a plurality of attributes of a table.
The share [v] and share {{σ}} are input into the permutation application parts 121, . . . , 12N.
The permutation application parts 121, . . . , 12N apply the share {{σ}} to the share [v] to generate a share [σ(v)].
In more detail, the permutation application parts 121, . . . , 12N use the share [v] and share {{σ}} to generate the share [σ(v)] of a vector σ(v) obtained by applying the permutation σ to the vector v (step S2).
The generated share [σ(v)] is output to the vector generation parts 131, . . . , 13N.
For example, the vector v=(160, 150, 160, 170, 160), and when the permutation σ is a permutation represented by Formula (1), the vector σ(v)=(150, 160, 160, 160, 170).
The share [σ(v)] is input into the vector generation parts 131, . . . , 13N.
The vector generation parts 131, . . . , 13N generate [e] in which [ei]:=[σ(v)i≠σ(v)i−1] where [e1]=1, and where i=1, m. Here, σ(v)i is an i-th element of the vector σ(v), and ei is an i-th element of a vector e.
In more detail, the vector generation parts 131, . . . , 13N use the share [σ(v)] to generate the share [e] of the vector e that has 1 as an element corresponding to a certain element when the certain element of the vector σ(v) and an element before the certain element are different, and has 0 otherwise (step S3). However, a first element of the vector e is set to 1.
The generated share [e] is output to the inverse permutation application parts 141, . . . , 14N.
For example, when the vector σ(v)=(150, 160, 160, 160, 170), the vector e=(1, 1, 0, 0, 1).
The share [e] and share {{σ}} are input into the inverse permutation application parts 141, . . . , 14N.
The inverse permutation application parts 141, . . . , 14N inversely apply the share {{σ}} to the share [e] to generate a share [σ−1(e)].
In more detail, the inverse permutation application parts 141, . . . , 14N use the share [e] and share {{σ}} to generate the share [σ−1(e)] of the vector σ−1(e) obtained by applying an inverse permutation σ−1 of the permutation σ to the vector e (step S4). The vector σ−1(e) is a vector used for deduplication processing and is also called a filter.
The generated share [σ−1(e)] is an output of the secure deduplication filter generation system. When secure deduplication processing is performed subsequent to the secure deduplication filter generation processing, the generated share [σ−1(e)] is output to the secure deduplication system.
For example, when the vector e=(1, 1, 0, 0, 1) and the permutation σ is a permutation shown in Formula (1), the vector σ−1(e)=(1, 1, 0, 1, 0).
The share [v] and share [σ−1(e)] are input into the deduplication parts 151, . . . , 15N.
The deduplication parts 151, . . . , 15N generate a share [vσ−1(e)] of vσ−1(e).
In more detail, the deduplication parts 151, . . . , 15N use the share [v] and share [σ−1(e)] to generate the share [vσ−1(e)] of vσ−1(e) (step S5).
For example, when the vector v=(160, 150, 160, 170, 160) and the vector σ−1(e)=(1, 1, 0, 1, 0), the vector vσ−1(e)=(160, 150, 0, 170, 0). In this way, the vector vσ−1(e) is a vector obtained by removing duplicate elements from the vector v.
According to the embodiment, it is possible to generate a filter that eliminates duplication while maintaining secrecy. In addition, according to the embodiment, it is possible to eliminate duplication while maintaining secrecy using the generated filter.
As described above, the embodiment of the present invention has been described, but specific configurations are not limited to the embodiment, and it goes without saying that even if there is a change or the like in design as appropriate without departing from the scope of the present invention, it is included in the invention.
For example, the deduplication parts 151, . . . , 15N may use a share [w] and the share [σ−1(e)] to obtain a share [wσ−1(e)] of wσ−1(e) obtained by applying the filter σ−1(e) to a vector w of attribute values of an attribute other than the attribute based on which the share [σ−1(e)] is generated.
For example, when the vector σ−1(e) obtained from the attribute of “height” is the vector σ−1(e)=(1, 1, 0, 1, 0) and a vector w of attribute values of an attribute of “gender” is the vector w=(man, woman, man, woman, man), the vector wσ−1(e)=(man, woman, 0, woman, 0). The deduplication parts 151, . . . , 15N may use the share [w] and share [σ−1(e)] to obtain the share [wσ−1(e)] of such a vector wσ−1(e).
The deduplication parts 151, . . . , 15N may also obtain shares obtained by applying the filter σ−1(e) to respective vectors of attribute values of a plurality of attributes.
The various processes described in the embodiment may be performed not only in chronological order according to the described order, but also in parallel or individually according to processing capability of a apparatus that performs the processes or as needed.
When various processing functions in each apparatus described in the embodiment are implemented by a computer, processing contents of functions which each apparatus should include are described by a program. Then, the computer executes the program, and thereby the various processing functions in each apparatus are implemented on the computer.
The program describing the processing contents can be recorded in a computer-readable recording medium. The computer-readable recording medium may be any recording medium, for example, a magnetic recording apparatus, an optical disk, a magneto-optical recording medium, and a semiconductor memory.
Distribution of this program is carried out, for example, by selling, transferring, or lending a portable recording medium such as a DVD or a CD-ROM on which the program is recorded. Furthermore, the program may be stored in a storage apparatus of a server computer, transferred from the server computer to another computer via a network, and thereby distributed.
A computer that executes such a program, for example, first stores the program recorded on the portable recording medium or the program transferred from the server computer temporarily in its own storage apparatus. Then, when performing processing, the computer reads the program stored in its own storage apparatus and performs the processing according to the read program. As another execution form of the program, the computer may directly read the program from the portable recording medium, and perform the processing according to the program, or further may sequentially execute processing according to a received program every time the program is transferred from the server computer to the computer. In addition, the above-described processing may be performed by a so-called ASP (Application Service Provider) type service that implements a processing function only by execution instructions and result acquisition, without transferring the program from the server computer to the computer. Note that the program in the embodiment includes information which is used for processing by the computer and is similar to the program (data or the like that is not a direct command to the computer but has properties that define processing of the computer).
In the embodiment, the apparatus is configured by executing the predetermined program on the computer, but at least a part of the processing contents may be implemented by hardware.
Number | Date | Country | Kind |
---|---|---|---|
JP2018-058270 | Mar 2018 | JP | national |
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/JP2019/010457 | 3/14/2019 | WO |
Publishing Document | Publishing Date | Country | Kind |
---|---|---|---|
WO2019/188320 | 10/3/2019 | WO | A |
Number | Name | Date | Kind |
---|---|---|---|
20130182836 | Hamada | Jul 2013 | A1 |
20130272521 | Kipnis | Oct 2013 | A1 |
20150149763 | Kamara | May 2015 | A1 |
20160321958 | Ikarashi | Nov 2016 | A1 |
20190166105 | Romain | May 2019 | A1 |
Number | Date | Country | |
---|---|---|---|
20210026812 A1 | Jan 2021 | US |