The field relates to searchable encryption. In particular, the present invention provides a computer-implemented method, system and computer-readable medium for searching over encrypted keywords in a database.
The advent of storage as a service offered by cloud providers, together with efficient analytics over the stored data, has opened a new area of research into methods for secure data storage and efficient search and retrieval. Numerous works address this problem, with Boneh et al.'s method for trapdoor generation and the corresponding search pioneering the area. Boneh et al.'s method was based on double hashing of the keyword for trapdoor generation. Park et al.'s scheme improved trapdoor generation by using a single hash function over the keyword. However, all these schemes suffer from offline keyword guessing attacks. The trapdoors are generated statically, i.e., the same trapdoor is generated every time for a particular keyword, which makes them vulnerable to a brute-force attack over the limited range of dictionary words by a misfeasor, masquerader or clandestine user.
Foundational work on an architectural design framework was proposed in the cryptographic cloud storage scheme. In the existing schemes, to search for a keyword, the data user supplies the keyword to the data managing authority and obtains a corresponding trapdoor, which is then searched for at the cloud database. Since keywords are drawn from a very limited range of dictionary words, the present schemes are liable to an offline keyword guessing attack, i.e., an attacker can sniff a trapdoor from the network and then, by applying a brute-force attack to it, guess the corresponding keyword.
The existing processes have several limitations. Searchable encryption uses the same (unique, static) trapdoor for the same keyword, i.e., there is a one-to-one mapping between keyword and trapdoor. The existing methods are vulnerable to online and offline dictionary attacks. All existing schemes are based on a single-trapdoor search.
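The weakness of a one-to-one mapping between keyword and trapdoor can be illustrated with a minimal Python sketch. It assumes a deliberately simplified static trapdoor, namely the SHA-256 digest of the keyword; this is a stand-in chosen for illustration only and is not the construction of Boneh et al. or Park et al. The function names and the word list are hypothetical.

```python
import hashlib
from typing import List, Optional

def static_trapdoor(keyword: str) -> str:
    """Hypothetical static trapdoor: the same keyword always yields the same value."""
    return hashlib.sha256(keyword.encode("utf-8")).hexdigest()

def guess_keyword(sniffed: str, dictionary: List[str]) -> Optional[str]:
    """Offline guessing: recompute the trapdoor of every candidate word and compare."""
    for word in dictionary:
        if static_trapdoor(word) == sniffed:
            return word
    return None

# Keywords come from a small dictionary, so exhaustive search is cheap.
dictionary = ["diabetes", "insulin", "fracture", "asthma"]
sniffed = static_trapdoor("insulin")      # trapdoor observed on the network
recovered = guess_keyword(sniffed, dictionary)
```

Because the trapdoor never changes, the same sniffed value also reveals whenever the same keyword is searched again.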
Thus, there is a need to overcome the problems of the existing technologies. Therefore, the present inventors have developed a computer-implemented method, system and computer-readable medium for searching over encrypted keywords in a database, which provide secure search by generating a one-time trapdoor, i.e., the trapdoor changes dynamically every time, even for the same keyword.
According to one aspect of the invention, there is provided a computer-implemented method, executed by one or more computing devices, for searching over encrypted keywords in a database. The method comprises the steps of generating at least one keyword; generating a plurality of different encrypted keywords corresponding to said keyword; storing said at least one encrypted keyword in said database; generating a plurality of different trapdoors for said keyword; verifying said plurality of different trapdoors with said plurality of different encrypted keywords corresponding to said keyword; and determining said keyword if said plurality of different trapdoors match with one said encrypted keyword corresponding to said keyword, else determining that said keyword is not found.
According to another aspect of the invention, there is provided a system for searching over encrypted keywords in a database. The system comprises a memory and a processor operatively coupled to the memory. The processor is configured to perform the steps of generating at least one keyword; generating a plurality of different encrypted keywords corresponding to said keyword; storing said at least one encrypted keyword in said database; generating a plurality of different trapdoors for said keyword; verifying said plurality of different trapdoors with said plurality of different encrypted keywords corresponding to said keyword; and determining said keyword if said plurality of different trapdoors match with one said encrypted keyword corresponding to said keyword, else determining that said keyword is not found.
According to another aspect of the invention, there is provided computer-readable code stored on a non-transitory computer-readable medium that, when executed by a computing device, performs a method for searching over encrypted keywords in a database. The method comprises the steps of generating at least one keyword; generating a plurality of different encrypted keywords corresponding to said keyword; storing said at least one encrypted keyword in said database; generating a plurality of different trapdoors for said keyword; verifying said plurality of different trapdoors with said plurality of different encrypted keywords corresponding to said keyword; and determining said keyword if said plurality of different trapdoors match with one said encrypted keyword corresponding to said keyword, else determining that said keyword is not found.
Features, aspects, and advantages of the present invention will be better understood when the following detailed description is read with reference to the accompanying drawings in which like characters represent like parts throughout the drawings, wherein:
While system and method are described herein by way of example and embodiments, those skilled in the art recognize that system and method for providing a searching over encrypted keywords in a database are not limited to the embodiments or drawings described. It should be understood that the drawings and description are not intended to be limiting to the particular form disclosed. Rather, the intention is to cover all modifications, equivalents and alternatives falling within the spirit and scope of the appended claims. Any headings used herein are for organizational purposes only and are not meant to limit the scope of the description or the claims. As used herein, the word “may” is used in a permissive sense (i.e., meaning having the potential to) rather than the mandatory sense (i.e., meaning must). Similarly, the words “include”, “including”, and “includes” mean including, but not limited to.
The following description is a full and informative description of the best method and system presently contemplated for carrying out the present invention which is known to the inventors at the time of filing the patent application. Of course, many modifications and adaptations will be apparent to those skilled in the relevant arts in view of the following description, the accompanying drawings and the appended claims. While the system and method described herein are provided with a certain degree of specificity, the present technique may be implemented with either greater or lesser specificity, depending on the needs of the user. Further, some of the features of the present technique may be used to advantage without the corresponding use of other features described in the following paragraphs. As such, the present description should be considered as merely illustrative of the principles of the present technique and not in limitation thereof, since the present technique is defined solely by the claims.
As a preliminary matter, the definition of the term “or” for the purpose of the following discussion and the appended claims is intended to be an inclusive “or.” That is, the term “or” is not intended to differentiate between two mutually exclusive alternatives. Rather, the term “or” when employed as a conjunction between two elements is defined as including one element by itself, the other element by itself, and combinations and permutations of the elements. For example, a discussion or recitation employing the terminology “A” or “B” includes: “A” by itself, “B” by itself and any combination thereof, such as “AB” and/or “BA.” It is worth noting that the present discussion relates to exemplary embodiments, and the appended claims should not be limited to the embodiments discussed herein.
Data User
The data user (102) is an individual or a corporation accessing the data. In a healthcare scenario, the users include patients, doctors, hospitals, pharmaceutical companies, diagnostic labs, research scientists, the health ministry, blood banks and related organizations. The data user communicates with the data owner and the data provider as required.
The data user (102) provides the keyword W (108) to be searched for to the data owner (104) and receives back the trapdoor $T_W$ (110). Using this trapdoor, the data user queries the data provider (106) for the keyword, and the search results are returned in encrypted form (112). The data user may then request the decryption key (114), or use one already in its possession, to decrypt the selected files.
Data Owner
The data owner (104) is the enterprise which owns the data and has outsourced it to the data provider (106). The data provider (106) can be a cloud provider, a database provider, etc. The data owner (104) encrypts the data, outsources it for storage, provides trapdoors for searching, and distributes the appropriate decryption keys to authorized users only. Its three essential components are the data component (116), the trapdoor generator (118) and the key distributor (120). The data component stores the encrypted data at the provider. The trapdoor generator (118) receives the keyword to be searched for from the data user and returns the corresponding trapdoor to the user. The key distributor (120), as the name suggests, maintains the public parameters and decryption keys and distributes them to users on request. The data verifier component (not shown in the figure), if included, checks the integrity of the data, i.e., whether the data stored at the provider has been modified by anyone unauthorized to do so. It is implemented by storing an additional tag with each data file; the tag changes on every change to the file and hence tracks the integrity of the data. The tag can be a checksum, which stays the same as long as the file is unchanged. The detailed description of integrity verification is omitted here. The major focus here is on trapdoor generation and searching in the three-tier architecture.
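As a rough illustration of the tag mentioned for the data verifier, the following Python sketch uses a SHA-256 digest as the checksum. The choice of SHA-256 and the function names are assumptions made for this example; the description itself does not specify how the tag is computed.

```python
import hashlib

def make_tag(file_bytes: bytes) -> str:
    """Compute a checksum tag for a file (SHA-256 is an assumed choice)."""
    return hashlib.sha256(file_bytes).hexdigest()

def is_intact(file_bytes: bytes, stored_tag: str) -> bool:
    """Recompute the tag and compare it with the one stored alongside the file."""
    return make_tag(file_bytes) == stored_tag

original = b"encrypted patient record"
tag = make_tag(original)                      # stored with the data file
unchanged = is_intact(original, tag)          # same bytes, same tag
tampered = is_intact(original + b"!", tag)    # any modification changes the tag
```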
Data Provider
The data provider (106) stores the data provided by the data owner (104). It also enables data search by the user by comparing the trapdoor with the data field and returning the results in an encrypted format.
The key generation module (302) performs the key generation process and provides the required keys for encryption and decryption.
Key Generation
Based on a security parameter k, the system parameters and keys are generated. $G_1$ and $G_2$ are two cyclic groups of prime order q with an admissible pairing $\hat{e}: G_1 \times G_1 \to G_2$.
A generator $P_0$ for $G_1$ is generated. Three cryptographic hash functions $H_1$, $H_2$ and $H_3$ are selected, where $H_1: \{0,1\}^* \to \{0,1\}^*$, $H_2: G_2 \to \{0,1\}^*$ and $H_3: \{0,1\}^* \to G_1$.
Let M denote the message and C the ciphertext. The secret key $s_t$ is selected at random, where $s_t \in \mathbb{Z}/q\mathbb{Z}$.
$$P_{ID} = H_3(ID) \tag{1}$$
where $P_{ID}$ is the encryption key and ID is the user identity used for identity-based encryption. The decryption key is $K_d$, where
$$K_d = P_{ID} \cdot s_t \tag{2}$$
The public key is $Q_t$ and is defined as
$$Q_t = s_t P_0 \tag{3}$$
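The key relations (1) to (3) can be traced in a small Python sketch. It is a toy model under loudly stated assumptions, not a secure implementation: every group element is represented by its discrete logarithm modulo q, so the pairing reduces to multiplication of those logarithms. The prime q = 2^127 − 1, the use of SHA-256 for H3 and all function names are choices made only for this illustration.

```python
import hashlib
import secrets

Q = 2**127 - 1  # toy prime order q (assumption; a Mersenne prime)
P0 = 1          # generator of G1, stored as its discrete logarithm

def H3(identity: str) -> int:
    """Toy H3: map an identity to a G1 element (tracked as a scalar mod q)."""
    digest = hashlib.sha256(b"H3" + identity.encode("utf-8")).digest()
    return int.from_bytes(digest, "big") % Q

def pairing(a: int, b: int) -> int:
    """Toy bilinear map on discrete logarithms: e(a*P0, b*P0) -> a*b mod q."""
    return (a * b) % Q

def keygen(identity: str):
    """Return (s_t, P_ID, K_d, Q_t) following equations (1) to (3)."""
    s_t = secrets.randbelow(Q - 1) + 1   # secret key in [1, q-1]
    p_id = H3(identity)                  # P_ID = H3(ID)
    k_d = (p_id * s_t) % Q               # K_d = P_ID * s_t
    q_t = (s_t * P0) % Q                 # Q_t = s_t * P0
    return s_t, p_id, k_d, q_t

s_t, p_id, k_d, q_t = keygen("doctor@example.org")
```

In this model the identity ê(K_d, P0) = ê(P_ID, Q_t) holds by construction, which is the relation later used for decryption.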
The encryption module (304) encrypts the data and the keywords, i.e., it generates a plurality of different encrypted keywords corresponding to the keyword.
Encryption
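For encryption, randomly select r such that $r \in \mathbb{Z}/q\mathbb{Z}$. The message is M, the keyword is W and the ciphertext is C.
C is defined as
$$C = [V, J, N] \tag{4}$$
where
$$V = M \oplus H_2\left(\hat{e}(P_{ID}, Q_t)^{r}\right) \tag{5}$$
$$J = r H_1(W) P_0 + r Q_t$$
and
$$N = r \cdot P_0 \tag{6}$$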
The message M is stored in the above form C at the data provider. V is the encryption of the message M, using identity based encryption. J and N components of C are useful for keyword search.
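The construction of C = [V, J, N] can be sketched in Python as follows. This is a toy model and not a secure implementation: group elements are represented by their discrete logarithms modulo q, so exponentiation in G2 becomes multiplication by r. The prime q = 2^127 − 1, SHA-256 for H1, SHAKE-256 for H2 and all function names are assumptions made for this example.

```python
import hashlib
import secrets

Q = 2**127 - 1  # toy prime order q (assumption)
P0 = 1          # generator of G1, stored as its discrete logarithm

def H1(keyword: str) -> int:
    """Toy H1: hash a keyword to a scalar modulo q."""
    digest = hashlib.sha256(b"H1" + keyword.encode("utf-8")).digest()
    return int.from_bytes(digest, "big") % Q

def H2(g2_element: int, length: int) -> bytes:
    """Toy H2: derive a mask of the requested length from a G2 element."""
    return hashlib.shake_256(g2_element.to_bytes(16, "big")).digest(length)

def pairing(a: int, b: int) -> int:
    """Toy bilinear map on discrete logarithms: e(a*P0, b*P0) -> a*b mod q."""
    return (a * b) % Q

def encrypt(message: bytes, keyword: str, p_id: int, q_t: int):
    """Return C = [V, J, N] following equations (4) to (6)."""
    r = secrets.randbelow(Q - 1) + 1                       # fresh r per ciphertext
    mask = H2((r * pairing(p_id, q_t)) % Q, len(message))  # H2(e(P_ID, Q_t)^r)
    v = bytes(m ^ k for m, k in zip(message, mask))        # V = M xor mask
    j = (r * H1(keyword) * P0 + r * q_t) % Q               # J = r*H1(W)*P0 + r*Q_t
    n = (r * P0) % Q                                       # N = r*P0
    return v, j, n

s_t = 123456789            # example secret key of the data owner
q_t = (s_t * P0) % Q       # public key Q_t = s_t * P0
v, j, n = encrypt(b"blood report", "insulin", 42, q_t)
```

Because r is drawn afresh for every call, encrypting the same keyword twice yields different J and N components.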
The trapdoor generator module (306) performs trapdoor generation, i.e., it generates a plurality of different trapdoors (208) for the keyword.
Trapdoor Generation
The most important component of the architecture is the generation of the trapdoor and its security. For any keyword W the user wants to search for, a trapdoor $T_W$ is generated. The architecture uses a one-time trapdoor approach, so a new trapdoor is generated every time, even for the same keyword; this is designed to eliminate online keyword guessing attacks. The trapdoor generation method introduces a random parameter y, freshly generated for each search, so that every search yields a new trapdoor. Breaking the one-time trapdoor is a discrete logarithm problem, which is designed to eliminate offline keyword guessing attacks.
To search for a keyword W, the user sends W to the data owner (104).
The data owner (104) generates the trapdoor $T_W$ and sends it back to the data user (102). For keyword W, the trapdoor $T_W$ is defined as
$$T_W = \left[\, y \cdot (H_1(W) + s_t)^{-1} P_0,\; y \cdot P_0 \,\right] \tag{7}$$
where y is the random parameter, selected such that $y \in \mathbb{Z}/q\mathbb{Z}$.
For simplicity of expression, the trapdoor may be written as
$$T_W = [L, K] \tag{8}$$
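Equation (7) can be sketched in Python to show that two requests for the same keyword produce different trapdoors. The sketch is a toy model, not a secure implementation: group elements are represented by their discrete logarithms modulo q, which exposes exactly the values a real pairing group would hide, so it illustrates the algebra only. The prime q = 2^127 − 1, SHA-256 for H1 and the function names are assumptions.

```python
import hashlib
import secrets

Q = 2**127 - 1  # toy prime order q (assumption)
P0 = 1          # generator of G1, stored as its discrete logarithm

def H1(keyword: str) -> int:
    """Toy H1: hash a keyword to a scalar modulo q."""
    digest = hashlib.sha256(b"H1" + keyword.encode("utf-8")).digest()
    return int.from_bytes(digest, "big") % Q

def trapdoor(keyword: str, s_t: int):
    """Return the one-time trapdoor T_W = [L, K] of equations (7) and (8)."""
    y = secrets.randbelow(Q - 1) + 1                # fresh random y per request
    inverse = pow((H1(keyword) + s_t) % Q, -1, Q)   # (H1(W) + s_t)^-1 mod q, Python 3.8+
    part_l = (y * inverse * P0) % Q                 # L = y * (H1(W) + s_t)^-1 * P0
    part_k = (y * P0) % Q                           # K = y * P0
    return part_l, part_k

s_t = 987654321                    # example secret key of the data owner
first = trapdoor("insulin", s_t)
second = trapdoor("insulin", s_t)  # same keyword, new y, new trapdoor
```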
The search module (308) performs the operation of search in the database.
Search
The architecture provides searchable encryption using a one-time trapdoor. To search for a keyword W, the user sends the keyword to the data owner (104) and receives back the trapdoor $T_W$. The user then sends the trapdoor to the data provider (106), where the search is performed. All files which match the search are returned, but in encrypted form only. The user can decrypt the files using a decryption key which it already has or obtains from the data owner. The search mechanism is explained below.
The trapdoor sent to the data provider is $T_W$, where
$$T_W = [L, K].$$
The provider checks whether
$$\hat{e}(N, K) = \hat{e}(J, L) \tag{9}$$
If the equality holds, the keyword matches and the file is returned; otherwise it is not.
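The provider-side test of equation (9) can be exercised end to end with the following Python sketch. As a toy model it represents every group element by its discrete logarithm modulo q and the pairing by multiplication of those logarithms, so it demonstrates only that the check accepts a matching keyword and rejects a different one; it says nothing about security. The prime q = 2^127 − 1, SHA-256 for H1 and all function names are assumptions.

```python
import hashlib
import secrets

Q = 2**127 - 1  # toy prime order q (assumption)
P0 = 1          # generator of G1, stored as its discrete logarithm

def H1(keyword: str) -> int:
    """Toy H1: hash a keyword to a scalar modulo q."""
    digest = hashlib.sha256(b"H1" + keyword.encode("utf-8")).digest()
    return int.from_bytes(digest, "big") % Q

def pairing(a: int, b: int) -> int:
    """Toy bilinear map on discrete logarithms: e(a*P0, b*P0) -> a*b mod q."""
    return (a * b) % Q

def index_entry(keyword: str, q_t: int):
    """Keyword part (J, N) of a stored ciphertext."""
    r = secrets.randbelow(Q - 1) + 1
    return (r * H1(keyword) * P0 + r * q_t) % Q, (r * P0) % Q

def trapdoor(keyword: str, s_t: int):
    """One-time trapdoor (L, K) of equations (7) and (8)."""
    y = secrets.randbelow(Q - 1) + 1
    inverse = pow((H1(keyword) + s_t) % Q, -1, Q)  # requires Python 3.8+
    return (y * inverse * P0) % Q, (y * P0) % Q

def matches(entry, trap) -> bool:
    """Provider-side test of equation (9): e(N, K) == e(J, L)."""
    (j, n), (part_l, part_k) = entry, trap
    return pairing(n, part_k) == pairing(j, part_l)

s_t = secrets.randbelow(Q - 1) + 1   # data owner's secret key
q_t = (s_t * P0) % Q                 # public key Q_t
stored = index_entry("insulin", q_t)
hit = matches(stored, trapdoor("insulin", s_t))   # same keyword
miss = matches(stored, trapdoor("asthma", s_t))   # different keyword
```

Every fresh trapdoor for the stored keyword passes the same check, even though no two trapdoors are equal.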
The verification module (310) performs the operation of verification of the plurality of different trapdoors with the plurality of different encrypted keywords corresponding to the keyword.
Verification
The search approach described above is consistent. The requirement is that, for any keyword W and trapdoor $T_W$, if the search evaluates to true, then the stored keyword is the one being searched for.
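The consistency of the search using the trapdoor is checked by expanding $\hat{e}(J, L)$. Substituting the values for J and L,
$$\hat{e}(J, L) = \hat{e}\left(r H_1(W) P_0 + r Q_t,\; y (H_1(W) + s_t)^{-1} \cdot P_0\right)$$
It is known that $Q_t = s_t \cdot P_0$. Using it, and then the bilinearity of the pairing,
$$\begin{aligned}
\hat{e}(J, L) &= \hat{e}\left(r H_1(W) P_0 + r s_t \cdot P_0,\; y (H_1(W) + s_t)^{-1} \cdot P_0\right) \\
&= \hat{e}\left(r (H_1(W) + s_t) \cdot P_0,\; y (H_1(W) + s_t)^{-1} \cdot P_0\right) \\
&= \hat{e}(r \cdot P_0,\; y \cdot P_0)^{(H_1(W) + s_t)(H_1(W) + s_t)^{-1}} \\
&= \hat{e}(r \cdot P_0,\; y \cdot P_0) \\
&= \hat{e}(N, K)
\end{aligned}$$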
Based on the process, it is observed that the mechanism works for the keyword search using one time trapdoor.
The decryption module (312) performs the operation of the decryption of the keyword.
Decryption
On completion of the search, the user requests the required documents. The user obtains the decryption key $K_d$ from the data owner, or uses the one it already possesses. Using the ciphertext and the decryption key, the user can recover the data M as
$$M = V \oplus H_2\left(\hat{e}(K_d, N)\right) \tag{10}$$
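Equation (10) recovers M because ê(K_d, N) = ê(s_t·P_ID, r·P_0) = ê(P_ID, Q_t)^r, which is the value masked in equation (5). The Python sketch below checks this round trip in a toy model where group elements are represented by their discrete logarithms modulo q; it is not a secure implementation. The prime q = 2^127 − 1, SHA-256 for H3, SHAKE-256 for H2, the example identity and all function names are assumptions.

```python
import hashlib
import secrets

Q = 2**127 - 1  # toy prime order q (assumption)
P0 = 1          # generator of G1, stored as its discrete logarithm

def H2(g2_element: int, length: int) -> bytes:
    """Toy H2: derive a mask of the requested length from a G2 element."""
    return hashlib.shake_256(g2_element.to_bytes(16, "big")).digest(length)

def H3(identity: str) -> int:
    """Toy H3: map an identity to a G1 element (tracked as a scalar mod q)."""
    digest = hashlib.sha256(b"H3" + identity.encode("utf-8")).digest()
    return int.from_bytes(digest, "big") % Q

def pairing(a: int, b: int) -> int:
    """Toy bilinear map on discrete logarithms: e(a*P0, b*P0) -> a*b mod q."""
    return (a * b) % Q

def xor(data: bytes, mask: bytes) -> bytes:
    return bytes(d ^ m for d, m in zip(data, mask))

def encrypt_message(message: bytes, p_id: int, q_t: int):
    """Return (V, N) following equations (5) and (6)."""
    r = secrets.randbelow(Q - 1) + 1
    v = xor(message, H2((r * pairing(p_id, q_t)) % Q, len(message)))
    return v, (r * P0) % Q

def decrypt_message(v: bytes, n: int, k_d: int) -> bytes:
    """Recover M = V xor H2(e(K_d, N)) following equation (10)."""
    return xor(v, H2(pairing(k_d, n), len(v)))

s_t = secrets.randbelow(Q - 1) + 1   # data owner's secret key
p_id = H3("doctor@example.org")      # P_ID = H3(ID)
k_d = (p_id * s_t) % Q               # K_d = P_ID * s_t
q_t = (s_t * P0) % Q                 # Q_t = s_t * P0
v, n = encrypt_message(b"blood report", p_id, q_t)
recovered = decrypt_message(v, n, k_d)
```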
The present invention provides secure storage and retrieval of data in a database, particularly in the cloud, using a one-time trapdoor scheme. Though the scheme has been developed over the three-tier architecture model, it is equally applicable to other models of data storage. With the one-time trapdoor scheme, a new trapdoor is generated every time for the same keyword using a random seed. The scheme is also designed to rule out offline and online keyword guessing attacks. Thus, the proposed architectural model is an efficient and secure method for searchable encryption over the cloud.
Exemplary Computing Environment
One or more of the above-described techniques may be implemented in or involve one or more computer systems.
With reference to
A computing environment may have additional features. For example, the computing environment 400 includes storage 430, one or more input devices 440, one or more output devices 450, and one or more communication connections 460. An interconnection mechanism (not shown) such as a bus, controller, or network interconnects the components of the computing environment 400. Typically, operating system software (not shown) provides an operating environment for other software executing in the computing environment 400, and coordinates activities of the components of the computing environment 400.
The storage 430 may be removable or non-removable, and includes magnetic disks, magnetic tapes or cassettes, CD-ROMs, CD-RWs, DVDs, or any other medium which may be used to store information and which may be accessed within the computing environment 400. In some embodiments, the storage 430 stores instructions for the software 470.
The input device(s) 440 may be a touch input device such as a keyboard, mouse, pen, trackball, touch screen, or game controller, a voice input device, a scanning device, a digital camera, or another device that provides input to the computing environment 400. The output device(s) 450 may be a display, printer, speaker, or another device that provides output from the computing environment 400.
The communication connection(s) 460 enable communication over a communication medium to another computing entity. The communication medium conveys information such as computer-executable instructions, audio or video information, or other data in a modulated data signal. A modulated data signal is a signal that has one or more of its characteristics set or changed in such a manner as to encode information in the signal. By way of example, and not limitation, communication media include wired or wireless techniques implemented with an electrical, optical, RF, infrared, acoustic, or other carrier.
Implementations may be described in the general context of computer-readable media. Computer-readable media are any available media that may be accessed within a computing environment. By way of example, and not limitation, within the computing environment 400, computer-readable media include memory 420, storage 430, communication media, and combinations of any of the above.
Having described and illustrated the principles of our invention with reference to described embodiments, it will be recognized that the described embodiments may be modified in arrangement and detail without departing from such principles.
In view of the many possible embodiments to which the principles of our invention may be applied, we claim as our invention all such embodiments as may come within the scope and spirit of the claims and equivalents thereto.
While the present invention has been related in terms of the foregoing embodiments, those skilled in the art will recognize that the invention is not limited to the embodiments depicted. The present invention may be practiced with modification and alteration within the spirit and scope of the appended claims. Thus, the description is to be regarded as illustrative instead of restrictive on the present invention.
As will be appreciated by those of ordinary skill in the art, the foregoing examples, demonstrations, and method steps may be implemented by suitable code on a processor-based system, such as a general-purpose or special-purpose computer. It should also be noted that different implementations of the present technique may perform some or all of the steps described herein in different orders or substantially concurrently, that is, in parallel. Furthermore, the functions may be implemented in a variety of programming languages. Such code, as will be appreciated by those of ordinary skill in the art, may be stored or adapted for storage in one or more tangible machine-readable media, such as on memory chips, local or remote hard disks, optical disks or other media, which may be accessed by a processor-based system to execute the stored code. Note that the tangible media may comprise paper or another suitable medium upon which the instructions are printed. For instance, the instructions may be electronically captured via optical scanning of the paper or other medium, then compiled, interpreted or otherwise processed in a suitable manner if necessary, and then stored in a computer memory.
The detailed description is presented to enable a person of ordinary skill in the art to make and use the invention and is provided in the context of the requirements for obtaining a patent. The present description is the best presently-contemplated method for carrying out the present invention. Various modifications to the preferred embodiment will be readily apparent to those skilled in the art and the generic principles of the present invention may be applied to other embodiments, and some features of the present invention may be used without the corresponding use of other features. Accordingly, the present invention is not intended to be limited to the embodiment shown but is to be accorded the widest scope consistent with the principles and features described herein.
Number | Date | Country | Kind |
---|---|---|---|
4463/CHE/2012 | Oct 2012 | IN | national |