The present invention relates to a method for anonymously reading database records, where the records may have a different price.
Digital items are often sold by a website that charges per purchased item, and that sells different items at different prices. But there is the risk that the owners of the website are making a lucrative parallel business out of selling information about the shopping behaviour of customers. For example, this can be a problem for a pharmaceutical company that buys information about particular deoxyribonucleic acid genome sequences from a database, or for a high-tech company that buys patent documents from a patent database. The list of purchased records from either of these databases certainly reveals precious information about the company's research strategies. The question then appears, how can one prevent the database from gathering information about shopping behaviour while still allowing the database to correctly charge for the purchased items?
Network traffic obfuscation techniques like mixes and onion routing can be used to hide a network address, but do not help to hide which record is purchased. Anonymous payment systems are not very useful here because they require that the merchant knows the price of the item, which may already reveal too much information about the item.
As a solution to this problem it was proposed to use a priced oblivious transfer protocol (POT) by W. Aiello, Y. Ishai, and O. Reingold “Priced oblivious transfer: How to sell digital goods”, Proc. of EUROYCRYPT 2001, Vol. 2045 of LNCS, pp. 119-135, Springer Verlag, 2001. Here, customers load an initial amount of money into their pre-paid accounts, and can then start downloading records so that (1) the database does not learn which record is being purchased, nor the price of the record that is being purchased; (2) the customer can only obtain a single record per purchase, and cannot spend more than his account balance; (3) the database does not learn the customer's remaining balance.
All known POT protocols require the database to maintain customer-specific state information across the different purchases by the same customer to keep track of his (encrypted) account balance. Each customer's transactions thereby necessarily become linkable. Thus, none of these protocols allow the customer to purchase records anonymously: even if an anonymous payment system is used to pre-charge the initial balance, the customer could be at most pseudonymous, defeating the purpose of protecting the customer's privacy. For example, the database still learns the number of records bought by each customer, the time that these records were bought, and their average price. This clearly reveals information about the customer and might lead to identification of the customer or of the records he is buying. To overcome this, it is further required that the POT satisfy that (4) the database does not learn any information about who purchases a record. Existing POT protocols also lack recharge functionality: Once a customer's balance does not contain enough credit to buy a record, but is still positive, the customer cannot use up the balance, but will have to open a new account/balance for further purchases.
According to one embodiment of the present invention, a computer system is described comprising:
According to another embodiment of the present invention, a method and a corresponding computer program and a corresponding computer program product for anonymously reading records from a database provided by a database server is described, wherein the database comprises at least one record with an associated index and price and wherein the database provider publishes an encrypted form of the database, and wherein for each record contained in the encrypted form of the database the following steps are performed:
Each record may have a different price in the database. The database server encrypts each record with a key that is derived from not only its index but also its price and then publishes the whole encrypted database. To be able to access records, a customer as a user of the database first contacts the database server to create a new, empty wallet. Customers can load more money into their wallet at any time. The payment mechanism used to recharge customers' wallets can be implemented using prior art methods. But for full customer anonymity, an anonymous e-cash scheme is preferable. When a customer wants to purchase a record with index σ with price p from the database, the database server and the customer essentially run a two party protocol at the end of which the customer will have obtained the decryption key for the record as well as an updated wallet being worth p monetary units less. This is done is such a way that the provider does not learn anything about σ or p. More precisely, wallets are modelled as one-time-use anonymous credentials with the worth of the wallet being encoded as an attribute. When the customer buys a record (or charges her wallet), he basically uses the credential and gets in exchange a new credential with the updated worth as attribute without the server learning anything about the wallet's worth. The properties of onetime-use credentials ensure that a customer cannot buy records worth more than what he has (pre-)paid to the server.
If κ∈, then 1κ is the string consisting of κ ones. The empty string is denoted ∈. If A is a randomized algorithm, then
denotes the assignment to y of the output of A on input x when run with fresh random coins. Unless noted, all algorithms are probabilistic polynomial-time (PPT) and it is implicitly assumed that they take an extra parameter 1κ in their input, where κ is a security parameter. A function v:→[0,1] is negligible if for all c∈
there exists a κc∈
such that v(κ)<κ−c for all κ>κc.
Let Pg(1κ) be a pairing group generator that on input 1κ outputs descriptions of multiplicative groups ,
T of prime order p where |p|>κ. Let Pg(1κ) be a pairing group generator that on input p outputs descriptions of multiplicative groups
,
T of prime order p. Let
*=
T\{1} and let g∈
*. The generated groups are such that there exists an admissible bilinear map e:
×
→
T, meaning that (1) for all a,b∈
p it holds that e(ga,gb)=e(g,g)ab, (2) e(g,g)≠1; and (3) the bilinear map is efficiently computable.
Definition: We say that the l-power decision Diffie-Hellman (l-PDDH) assumption [see J. Camenisch, G. Neven, and Abhi Shelat “Simulatable adaptive oblivious transfer”, Proc. of EUROCRYPT 2007, vol. 4515 of LNCS, pp. 573-590, Springer Verlag 2007] holds in groups Γ,ΓT if for all polynomial-time adversaries A the advantage AdvΓ,Γ
Pr└A(g,gα,K,gα
is a negligible function in κ, where
Definition: We say that the l-strong Diffie-Hellman (l-SDH) assumption holds in group Γ1 of order p>2κ if for all polynomial-time adversaries A the advantage
is a negligible function in κ, where Pr is the probability,
The following modification of the weakly-secure signature scheme from D. Boneh and X. Boyen “Short signatures without random oracles”, Proc. of EUROCRYPT 2004, vol. 3027 of LNCS, pp. 56-73, Spinger Verlag 2004, is used. The scheme uses a pairing generator Pg as defined above. The signer's secret key is
the corresponding public key is (g,ym=gx. The signature on the tuple of messages (m,c1,K,cl) is
verification is done by checking whether e(s,ym·gm·y1c
Security against weak chosen message attacks is defined through the following game. An adversary begins by outputting N tuples of messages ((m1,c1,1,K,c1,l),K,(mN,cN,1,K,cN,l)). A challenger then generates the key pair and gives the public key to the adversary, together with signatures s1,K,sN on the message tuples. The adversary wins if it succeeds in outputting a valid signature s on a tuple (m,c1,K,cl)∉{(m1,c1,1,K,c1,l),K(mN,cN,1,K,cN,l)}. This scheme is said to be unforgeable under weak chosen-message attack if no PPT adversary has non-negligible probability of winning this game. An adaptation of the proof by Boneh and Boyen can be used to show that this scheme is unforgeable under weak chosen message attack if the (N+1)-SDH assumption holds.
Definitions from the following articles are used in the following: M. H. Au, W. Susilo, and Y. Mu “Constant-size dynamic k-TAA”, Proc. of SCN 06, vol. 4116 of LNCS, pp. 111-125, Springer Verlag 2006; and R. Cramer, I. Damgård, and P. D. MacKenzie “Efficient zero-knowledge proofs of knowledge without intractability assumptions”, Proc. of PKC 2000, vol. 1751 of LNCS, pp. 354-372, Springer Verlag 2000. A pair of interacting algorithms (P,V) is a proof of knowledge (PoK) for a relation R={(α,β)}⊂{0,1}*×{0,1}* with knowledge error κ∈[0,1] if (1) for all (α,β)βR, V(α) accepts a conversation with P(β) with probability 1; and (2) there exists an expected polynomial-time algorithm E, called the knowledge extractor, such that if a cheating prover {circumflex over (P)} has probability ∈ of convincing V to accept α, then E, with a given rewindable black-box access to {circumflex over (P)}, outputs a witness β for with probability ∈−κ.
A proof system (P,V) is perfect zero-knowledge if there exists a PPT algorithm Sim, called the simulator, such that for any polynomial-time cheating verifier {circumflex over (V)} and for any (α,β)∈R, the output of {circumflex over (V)}(α) after interacting with P(β) and the output of Sim{circumflex over (V)}(α)(α) are identically distributed. A Σ-protocol is a proof system (P,V) where the conversation is of the form (a,c,z), where a and z are computed by P, and c is a challenge chosen at random by V. The verifier accepts if φ(α,a,c,z)=1 for some efficiently computable predicate φ. Given two accepting conversations (a,c,z) and (a,c′,z′) for c≠c′, one can efficiently compute a witness β. Moreover there exists a polynomial-time simulator Sim that on input α and a random string c outputs an accepting conversation (a,c,z) for α that is perfectly indistinguishable from a real conversation between P(β) and V(α). For a relation R={(α,β)} with Σ-protocol (P,V), the commitment relation R′={(α,a),(c,z)} holds if φ(α,a,c,z)=1. If both R and R′ have Σ-protocols, then Cramer et al. cited above show how to construct a four-move perfect zero-knowledge PoK for R with knowledge error
where C is the space from which the challenge c is drawn.
In the common parameters model, several previously known results for proving statements about discrete logarithms are used, such as (1) proof of knowledge of a discrete logarithm modulo a prime [see C. P. Schnorr “Efficient signature generation for smart cards”, Journal of Cryptology, 4(3):239-252, 1991], (2) proof of knowledge of equality of (elements of) representations [see D. Chaum and T. P. Pedersen “Wallet databases with observers”, Proc. of CRYPTO '92, vol. 740 of LNCS, pp. 89-105, Springer Verlag 1993], (3) proof that a commitment opens to the product of two other committed values [see S. Brands “Rapid demonstration of linear relations connected by Boolean operators”, Proc. of EUROCRYPT '97, vol. 1233 of LNCS, pp. 318-333, Springer Verlag 1997; J. Camenisch, M. Michels “Proving in zero-knowledge that a number n is the product of two safe primes”, Proc. of EUROCRYPTT '99, vol. 1592 of LNCS, Springer Verlag 1999; J. Camenisch “Group Signature Schemes and Payment Systems Based on the Discrete Logarithm Problem”, PhD Thesis, ETH Zurich, 1998], and also (4) proof of the disjunction or conjunction of any two of the previous [see R. Cramer, I. Damgård, B. Schoenmakers “Proofs of partial knowledge and simplified design of witness hiding protocols”, Proc. of CRYPTO '94, vol. 839 of LNCS, pp. 174-187, Springer Verlag 1994].
When referring to the proofs above, the notation introduced by Camenisch and Stadler [“Efficient group signature schemes for large groups”, Proc. of '97, vol. 1296 of LNCS, pp. 410-424, Springer Verlag 1997] will be followed for various proofs of the validity of statements about discrete logarithms. For instance, PK{(a,b,c): y=gahb{tilde over (y)}={tilde over (g)}ahc} denotes a “zero-knowledge Proof of Knowledge of integers a,b,c such that y=gahb and {tilde over (y)}={tilde over (g)}a{tilde over (h)}c holds,” where y,g,h, {tilde over (y)}, {tilde over (h)}, {tilde over (h)} are elements of some groups G=
g
=
h
and {tilde over (G)}=
{tilde over (g)}
=
{tilde over (h)}
. The convention is that the letters in the parenthesis denote quantities of which knowledge is being proven, while all other values are known to the verifier. The Fiat-Shamir heuristic [A. Fiat and A. Shamir “How to prove yourself; Practical solutions to identification and signature problems”, Proc. of CRYPTO '86, vol. 263 of LNCS, pp. 186-194, Springer Verlag 1987] is applied to turn such proofs of knowledge into signatures on some message m; denoted as, e.g., SPK{(a): y=gα}(m).
Given a protocol in this notation, it is straightforward to derive an actual protocol implementing the proof [see the PhD Thesis of Camenisch cited above and J. Camenisch, A. Kiayias, M. Yung “On the portability of generalized schnorr proofs”, Proc. of EUROCRYPT 2009, LNCS, Springer Verlag 2009]. Indeed, the computational complexities of the proof protocol can be easily derived from this notation: basically for each term y=gahb, the prover and the verifier have to perform an equivalent computation, and to transmit one group element and one response value for each exponent.
The signature scheme proposed and proved secure by Au et al. [M. H. Au, W. Susilo, and Y. Mu “Constant-size dynamic k-TAA”, Proc. of SCN 06, vol. 4116 of LNCS, pp. 111-125, Springer Verlag 2006] is used, which is based on the schemes of Camenisch and Lysyankaya [J. Camenisch, A. Lysyanskaya “Signature schemes and anonymous credentials from bilinear maps”, Proc. of CRYPTO 2004, vol. 3152 of LNCS, pp. 56-72, Springer Verlag 1999] and of Boneh et al. [D. Boneh, X. Boyen, h. Shacham “Short group signatures”, Proc. of CRYPTO 2004, vol. 3152 of LNCS, Springer Verlag 2004]. It assumes cyclic groups Γ and ΓT of order p and a bilinear map e:Γ×Γ→ΓT. The signer's secret key is a random element
The public key contains a number of random bases
where l∈ is a parameter, and y←g1x.
A signature on messages m0,K,ml∈q is a tuple (A,r,s) where
are values chosen at random by the signer and
Such a signature can be verified by checking whether e(A,g1sy)=e(g1h0m
Now it is assumed that a signature (A,r,s) is given on messages m0,K,ml∈q and that it will be proved if indeed such a signature is possessed. The public key needs to be augmented with values u,v∈Γ such that logg
and compute Ã=Aut, B=vtut′;
It was proved by Au et al. cited above that the above signature is unforgeable under adaptively chosen message attack if Q-SDH assumption holds, where Q is the number of signature queries, and that the associated PoK is perfect honest-verifier zero knowledge.
An unlinkable priced oblivious transfer (UP-OT) scheme is parameterized by a security parameter κ∈, a maximum wallet balance bmax∈
and a maximum record price pmax≦bmax. Let us consider a setting with one database and one or more customers. A database consists of a list of N couples ((R1,p1),K,((RN,pN)), containing the database records and associated prices p1,K,pN. An UP-OT scheme is a tuple of polynomial-time algorithms and protocols UP-OT=(DBSetup, CreateWallet, Recharge, Purchase) run between customers C1,K,CM, and the database server DB in the following way:
The database server DB runs the randomized DBSetup algorithm to initiate the database DBase containing its records with the associated prices. It generates a pair of a secret and corresponding public key (skDB,pkDB) for the security parameter κ, and uses it to encrypt the individual records. The encrypted database ωDB consists of the public key pkDB and the encrypted records ER1,K,ERN. The encrypted database ωDB is made available to all customers, e.g. by publishing it on a website or by distributing it on a storage media. The database server DB keeps the secret key skDB to himself.
A customer creates an empty wallet with a zero balance, signed by the database server DB, by engaging in the CreateWallet protocol with the database server DB. The server's public key pkDB is a common input, the corresponding secret key skDB is a secret input to the database server DB. At the end of the protocol, the customer outputs an empty wallet W0 or ⊥ to indicate failure.
When the customer wants to add money to his wallet Wi (which may or may not be his initial wallet W0) he can engage in a Recharge protocol with the database server DB. The server's public key pkDB and the amount of money that the customer wants to add to his balance are common inputs. The server's secret key skDB and the current wallet Wi of the customer are private inputs to the database server DB and the customer, respectively. At the end of the protocol the customer either outputs the new wallet Wi+1 or ⊥ to indicate failure.
To purchase a record from the database, a customer engages in a Purchase protocol with the database server DB. The server's public key pkDB is a common input. The customer has as a private input his selection index σ∈{1,K,N}, the encrypted record ERσ and its price pσ, and his current wallet Wi. The server uses its secret key skDB as a private input. At the end of the protocol, the customer outputs the database record Rσ and an updated wallet Wi+1. An output containing Rσ=⊥ or Wi+1=⊥ indicates that the record transfer or the wallet update failed, respectively.
An oblivious transfer with access control protocol is described by J. Camenisch, M. Dubovitskaya, and G. Neven “Oblivious transfer with access control”, Proc. of the 16th ACM CCS 2009, p. 131-140, ACM, 2009. In the preferred embodiment of the invention, the database server DB fulfils the role of the issuer described in this paper and issues wallets as one-time show credentials, hence also playing the role of the bank.
Let
be the maximal balance that can be stored in a customer's wallet. To prove that the customer's new balance after buying a record remains positive and is not more than maximum balance that is used as a signature-based set membership protocol (see J. Camenisch et al “Efficient protocols for set membership and range proofs”, Proc. of ASIACRYPT 2008, Vol. 5350 of LNCS, pp. 234-252, Springer Verlag, 2008). They consider a zero-knowledge protocol which allows a prover to convince a verifier that a digitally committed value is a member of a given public set. This protocol has the verifier to produce signatures on the set elements, send them to the prover and then has the prover to show that he knows a signature (by the verifier) and the element he holds. Concretely, they employed the weak signature scheme by Boneh and Boyen cited above. They prove that their protocol is a zero-knowledge argument of set membership for a set Φ, if the |Φ|-SDH assumption holds.
Here the set contains all possible balances from the customer's wallet {0,K,bmax}. So for each possible balance 0≦i≦bmax the database provider uses xb to compute a signature {yb(i)}. These values are included in the public key of the database server DB; they will be used by the customer to prove that his balance remains positive after subtracting the price of the purchased record. The encrypted database consists of a public key pkDB and the encrypted records ER1,K,ERN. It is made available to all customers, e.g. by publishing it on a website or by distributing it on a storage media. The database server DB keeps the secret key skDB to itself.
The database setup algorithm can be implemented using the PBC library, which is a free portable C library allowing the rapid prototyping of pairing-based cryptosystems. It provides an abstract interface to a cyclic group with a bilinear pairing, insulating the programmer from mathematical details. The following code fragment provides an example implementation for steps 310, 320, 330 using the PBC library:
An example implementation for step 320 is given by the following code fragement:
//generate private and public keys:
Step 330 can be implemented using the following code fragment:
Before purchasing any records, customers first need to create an empty wallet and then charge it with money. To create a wallet, the customer runs the CreateWallet protocol with the database server DB shown in
Customers can recharge the balance of their wallets by engaging in a Recharge protocol with the database server DB as shown in
Next the customer proves in zero-knowledge that he correctly increased his balance by the amount m being deposited. The database server DB checks if the serial number ni is fresh, i.e., whether it previously saw the number ni. If not, then the database server decides that the customer is trying to overspend and aborts. Otherwise, if the database server DB accepts the proof, it signs the customer's new wallet skeleton with updated balance and sends it to the customer. The customer checks the validity of the signature on her new wallet, and if it verifies correctly, outputs an updated state containing the new wallet Wi+1.
When the customer wants to purchase a record from the database server DB, he engages in a Purchase protocol with the database DB as shown in
Next the customer proves in zero-knowledge that K is correctly formed as blinding of some Eσ
It should be noted that PK{(h):H=e(g,h)̂L=e(K,h)} can be realized in the standard way as e(g,•) is a group (one-way) homomorphism that maps into
T.
It should also be noted that the database server DB has to calculate a signature of every element in the set of all possible balances in the customer's wallet {0,K,bmax} and encrypt all records {1,K,N} only once at the setup phase, and the customer has to download the entire encrypted database and balance signatures only once as well. So the communication and computation complexity of the protocol depends on a cardinality of a set of all possible balances in the customer's wallet and a number of the records in the database only at the setup phase. The rest parts of the protocol (CreateWallet, Recharge and Purchase), however, require only a constant number of group elements to be sent and a constant number of exponentiations and pairings to be calculated.
The terminology used herein is for the purpose of describing particular embodiments only and is not intended to be limiting of the invention. As used herein, the singular forms “a”, “an” and “the” are intended to include the plural forms as well, unless the context clearly indicates otherwise. It will be further understood that the terms “comprises” and/or “comprising,” when used in this specification, specify the presence of stated features, integers, steps, operations, elements, and/or components, but do not preclude the presence or addition of one or more other features, integers, steps, operations, elements, components, and/or groups thereof.
The corresponding structures, materials, acts, and equivalents of all means or step plus function elements in the claims below are intended to include any structure, material, or act for performing the function in combination with other claimed elements as specifically claimed. The description of the present invention has been presented for purposes of illustration and description, but is not intended to be exhaustive or limited to the invention in the form disclosed. Many modifications and variations will be apparent to those of ordinary skill in the art without departing from the scope and spirit of the invention. The embodiment was chosen and described in order to best explain the principles of the invention and the practical application, and to enable others of ordinary skill in the art to understand the invention for various embodiments with various modifications as are suited to the particular use contemplated.
As will be appreciated by one skilled in the art, the present invention may be embodied as a system, method or computer program product. Accordingly, the present invention may take the form of an entirely hardware embodiment, an entirely software embodiment (including firmware, resident software, micro-code, etc.) or an embodiment combining software and hardware aspects that may all generally be referred to herein as a “circuit,” “module” or “system.” Furthermore, the present invention may take the form of a computer program product embodied in any tangible medium of expression having computer usable program code embodied in the medium.
Any combination of one or more computer usable or computer readable medium(s) may be utilized. The computer-usable or computer-readable medium may be, for example but not limited to, an electronic, magnetic, optical, electromagnetic, infrared, or semiconductor system, apparatus, device, or propagation medium. More specific examples (a non-exhaustive list) of the computer-readable medium would include the following: an electrical connection having one or more wires, a portable computer diskette, a hard disk, a random access memory (RAM), a read-only memory (ROM), an erasable programmable read-only memory (EPROM or Flash memory), an optical fiber, a portable compact disc read-only memory (CDROM), an optical storage device, a transmission media such as those supporting the Internet or an intranet, or a magnetic storage device. Note that the computer-usable or computer-readable medium could even be paper or another suitable medium upon which the program is printed, as the program can be electronically captured, via, for instance, optical scanning of the paper or other medium, then compiled, interpreted, or otherwise processed in a suitable manner, if necessary, and then stored in a computer memory. In the context of this document, a computer-usable or computer-readable medium may be any medium that can contain, store, communicate, propagate, or transport the program for use by or in connection with the instruction execution system, apparatus, or device. The computer-usable medium may include a propagated data signal with the computer-usable program code embodied therewith, either in baseband or as part of a carrier wave. The computer usable program code may be transmitted using any appropriate medium, including but not limited to wireless, wireline, optical fiber cable, RF, etc.
Computer program code for carrying out operations of the present invention may be written in any combination of one or more programming languages, including an object oriented programming language such as Java, Smalltalk, C++ or the like and conventional procedural programming languages, such as the “C” programming language or similar programming languages. The program code may execute entirely on the user's computer, partly on the user's computer, as a stand-alone software package, partly on the user's computer and partly on a remote computer or entirely on the remote computer or server. In the latter scenario, the remote computer may be connected to the user's computer through any type of network, including a local area network (LAN) or a wide area network (WAN), or the connection may be made to an external computer (for example, through the Internet using an Internet Service Provider). The present invention is described below with reference to flowchart illustrations and/or block diagrams of methods, apparatus (systems) and computer program products according to embodiments of the invention. It will be understood that each block of the flowchart illustrations and/or block diagrams, and combinations of blocks in the flowchart illustrations and/or block diagrams, can be implemented by computer program instructions. These computer program instructions may be provided to a processor of a general purpose computer, special purpose computer, or other programmable data processing apparatus to produce a machine, such that the instructions, which execute via the processor of the computer or other programmable data processing apparatus, create means for implementing the functions/acts specified in the flowchart and/or block diagram block or blocks.
These computer program instructions may also be stored in a computer-readable medium that can direct a computer or other programmable data processing apparatus to function in a particular manner, such that the instructions stored in the computer-readable medium produce an article of manufacture including instruction means which implement the function/act specified in the flowchart and/or block diagram block or blocks.
The computer program instructions may also be loaded onto a computer or other programmable data processing apparatus to cause a series of operational steps to be performed on the computer or other programmable apparatus to produce a computer implemented process such that the instructions which execute on the computer or other programmable apparatus provide processes for implementing the functions/acts specified in the flowchart and/or block diagram block or blocks.
The flowchart and block diagrams in the Figures illustrate the architecture, functionality, and operation of possible implementations of systems, methods and computer program products according to various embodiments of the present invention. In this regard, each block in the flowchart or block diagrams may represent a module, segment, or portion of code, which comprises one or more executable instructions for implementing the specified logical function(s). It should also be noted that, in some alternative implementations, the functions noted in the block may occur out of the order noted in the figures. For example, two blocks shown in succession may, in fact, be executed substantially concurrently, or the blocks may sometimes be executed in the reverse order, depending upon the functionality involved. It will also be noted that each block of the block diagrams and/or flowchart illustration, and combinations of blocks in the block diagrams and/or flowchart illustration, can be implemented by special purpose hardware-based systems that perform the specified functions or acts, or combinations of special purpose hardware and computer instructions.
Number | Date | Country | Kind |
---|---|---|---|
10151389.3 | Jan 2010 | EP | regional |
Filing Document | Filing Date | Country | Kind | 371c Date |
---|---|---|---|---|
PCT/EP10/67336 | 11/12/2010 | WO | 00 | 7/19/2012 |