1. Field of Invention
The present invention relates to the field of communications and, more specifically, to secure communications systems and methods.
2. Description of Related Art
Communications systems and methods that are secure from eavesdropping are highly desirable. Such secure communications systems encourage the use of communication systems because users may be assured that the information they exchange will remain private. Various cryptographic techniques have been developed to address secure communications. Such techniques often require estimation of arbitrary or random functions. However, the complexity and processing bandwidth required to estimate arbitrary functions increases exponentially as the dimension of the sample space grows.
Thus, there is a need for systems and methods to address these limitations as well as others readily discernable from review of this disclosure.
Embodiments are directed generally to systems and methods for secure communications. Various embodiments can comprise cryptographic techniques for providing secure communications.
Various embodiments may be directed to a cryptographic method and system for forming a multidimensional vector from a random corner of a hypercube, providing the multidimensional vector to a receiver, computing a keyed one-way function using the multidimensional vector and a private key, receiving a response from the receiver, and comparing the received response to the computed one-way function.
The utility, objects, features and advantages of the invention will be readily appreciated and understood from consideration of the following detailed description of the embodiments of this invention, when taken with the accompanying drawings, in which same numbered elements are identical and:
Embodiments are directed generally to cryptographic systems and methods. For example, the various embodiments can comprise generating and using fixed length keyed cryptographic one-way functions by exploiting the exponential complexity asymmetries in estimation theory and statistical regression that arise in high dimension. These asymmetries may include, for example, those occurring between function evaluation and function fitting, between parameter estimation and density estimation, and between set-membership testing and set separation. By framing cryptography in an estimation setting, various embodiments can use and apply powerful theorems from the field of statistics to ensure the intractability of attempts to defeat the cryptographic systems and methods discussed herein, in contrast to algebraic cryptographic protocols.
Various embodiments can comprise, for example, authentication schemes that use and apply such complexity asymmetries. For example, various embodiments can comprise general secure authentication schemes between a user and terminal, or between users of terminals or devices. Because of the relative simplicity of implementation, various embodiments can be implemented in embedded low complexity devices with relatively little computational overhead. Furthermore, various embodiments can comprise protocols that take advantage of physical sources of randomness for generating samples from the cryptographically secure distributions.
The inventors have discovered that in the field of statistical learning, bounds on the ability to approximate arbitrary functions imply a curse of dimensionality in which estimation can grow exponentially difficult as the dimension of the sample space grows; however, asymmetry in function fitting and function evaluation can be exploited to provide systems and methods comprising a fixed length keyed one-way function that can be implemented for secure communication such as, for example, secure authentication.
A. Function Fitting and the Bias-Variance Trade-Off
A one-way function is a mapping which may be computed efficiently but for which it is intractable to find an input giving rise to a fixed output. Further details regarding one-way functions may be obtained from, for example, O. Goldreich, “Modern Cryptography, Probabilistic Proofs, and Pseudorandomness,” Springer, New York, 1999. In the language of approximation theory, a one-way function is a mapping of queries “q” to responses “r” where r=ƒ(q), such that given “N” query-response pairs (q1, r1), . . . , (qN, rN) and a new query qN+1, it is intractable to determine rN+1 to within a threshold, ε.
A statistician would interpret this requirement to say that the regression problem for the function ƒ which maps q to r is intractable. Finding tractable means of calculating ƒ is the heart of all regression, classification, and generalization problems in statistics. For example, see V. N. Vapnik, “The Nature of Statistical Learning Theory,” Springer, New York, 2nd edition, 2000, and F. Girosi, “An Equivalence Between Sparse Approximation and Support Vector Machines,” Neural Computation, 10:1455-1480, 1998. In various embodiments, hard limits in function fitting may be exploited to find functions which are intractable to fit.
We can make the notion of function fitting precise as follows. Let X and Y be two spaces with distribution μX×Y on their product. Given n pairs from X and Y, infer a functional mapping ƒ: X→Y such that given a new sample from X we can predict the paired sample from Y. Mathematically, we define the risk of the function ƒ to be:
I[ƒ]:=∫(ƒ(x)−y)2dμX×Y. Eq. (1)
In Eq. 1, I[ƒ] measures how much squared error on average we would see for a pair (x,y) drawn at random from X×Y. Since we are only able to infer ƒ from a finite set of data, we define the empirical risk as:
Thus, IN[ƒ] measures the error of ƒ on the data set we are presented.
The regression problem is to compute:
This optimization problem is fraught with difficulty because the distribution μX×Y is often unknown, and even when it is known, the optimization usually remains intractable. A solution is to find the minimizer of the empirical risk:
That is, to minimize the error only on the data that we have seen. Surprisingly, under very mild assumptions, ƒ*N converges to ƒ* as N goes to infinity. The proofs of such convergence are essentially due to the law of large numbers. Additional information regarding this relationship is available from, for example, F. Cucker and S. Smale, “On the mathematical foundations of learning,” Bulletin of the American Mathematical Society, 39(1): 1-49, 2001.
Unfortunately, there is no guarantee that ƒ can be written in closed form even if we know a priori that it is continuous. To make the problem computationally tractable, we must restrict the search for ƒ in a hypothesis space “H” such as, for example, the set of all cubic splines or the sum over a finite number of Parzen Windows. Further detail regarding Parzen Windows is available from, for example, T. Evgeniou, M. Pontil, and T. Poggio, “Regularization networks and support vector machines,” Advances in Computational Mathematics, 13(1): 1-50, 2000. Let:
be the best we can do in the space H and let:
be the best we can do in H with only N examples. To keep track of all of the approximations we are making define the generalization error as:
εN,H=I[ƒ*N,H]−I[ƒ*]=(I[ƒ*N,H]−I[ƒ*H]+(I[ƒ*H]−I[ƒ*]) Eq. (7)
With respect to Eq. 7 above, the first term in parentheses is called the “estimation error.” It represents how much our regression function is skewed by partial data. The second term in parentheses is called the “approximation error.” It measures how close we can get to the regression function by restricting ourselves to the hypothesis space, H. Further detail regarding these terms is available from, for example, P. Niyogi and F. Girosi, “On the relationship between generalization error, hypothesis complexity and sample complexity for radial basis functions,” Neural Computation, 8:819-842, 1996, and P. Niyogi and F. Girosi, “Generalization bounds for function approximation from scattered noisy data,” Advances in Computational Mathematics, 10:51-80, 1999. This splitting of the error into two parts is often referred to as the bias-variance trade-off. One can get low approximation error by having a rich hypothesis space, but then, on small data sets, over-fitting is inevitable and the estimation error is high. On the other hand, by fitting linear functions to data, the estimation error is quite low, but the class of functions that can be fit is fairly small. The approximation error amid estimation error must be balanced to guarantee low generalization error.
From the cryptographer's perspective, a one-way function is a distribution μX on X and a polynomial time computable function ƒ: X→Y such that the generalization error is greater than a constant for any choice of hypothesis space. In various embodiments, this can be achieved by making the dimension of the space X large enough, as described herein.
B. The Curse of Dimensionality
The inspiration for intractability comes from the principle of sampling. Given a continuous function which we want to digitize (or convert to a digital function), Nyquist's sampling theorem requires that we must sample at a rate that is twice as fast as the highest frequency in the signal we wish to reconstruct. If the function is defined over “d” variables, the number of samples required for perfect reconstruction is Nd. On the other hand, it is often quite easy to compute high dimensional functions. For example, dot products in “d” dimensions can be calculated in O(d) steps. This is an inherent asymmetry: function evaluation can scale linearly while the difficulty of function fitting and reconstruction can scale exponentially.
The first appearance of the curse of dimensionality can arise because of the exponential number of coefficients required to specify the Fourier series of high dimensional functions. Suppose we want to estimate ƒ: Rd→R using Taylor polynomials:
If we approximate ƒ with all Taylor coefficients such that wi<l, i.e., with
the approximation error is
since the number of parameters is n=ld, we have
or, to achieve an estimation of quality c, the number of parameters satisfies
From the foregoing, it is shown that the number of parameters is exponential in the desired precision. It turns out that this rate is optimal whether or not Fourier series are used as the approximating functions. Further details regarding this relationship are available from, for example, A. Pinkus, “N-widths in approximation theory,” Springer, New York, 1985. The above discussion also shows that the Fischer information, which is the quantity which determines how much one can learn about non-random parameters in a probability distribution, vanishes exponentially as the dimension grows. Further details regarding Fischer information are available from, for example, M. Cover and J. A. Thomas, “Elements of Information Theory,” Wiley, New York, 1991.
Furthermore, errors in estimation can occur because in high dimensions there are exponentially many functions which precisely fit the given data but do not generalize to unseen data. This phenomenon is called over-fitting. For example, suppose we are given samples xi sampled uniformly from the d-dimensional unit cube {−1, 1}d and values:
yi=sin(ω0Tx+φ0) Eq. (13)
with ω0εRd and φ0ε[0, 2π]. It is well known that the estimation error of this class of functions cannot be bounded; see, for example, V. N. Vapnik and A. Y. Chervonenkis, “The necessary and sufficient conditions for the uniform convergence of averages to their expected values,” Teoriya Veroyatnostei i Ee Primeneniya, 26(3):543-564, 1981. However, we can heuristically discuss why predicting responses from queries is intractable. For example, if we search over functions of the form ƒ(x)=sin(ωTx+φ), it is clear that more than one (ω, φ) pair can fit a particular sample point in the data. For example, let αi=arcsin(yi). Then, we must have for some integers ki that
This over-fitting characteristic is illustrated in
From the above discussion it is clear that the difficulty of almost any functional approximation problem grows exponentially with the dimension of the domain. In turn, even the deceptively simple looking problem of finding the parameters of:
ƒ(x;ω,φ)=sin(ωTx+φ) Eq. (15)
is exponentially difficult in the dimension of x. That is, ƒ is a keyed one-way function. Knowledge of the parameters ω and φ allow one to compute the function, but it is impossible to predict fix ƒ(x; ω, φ) from a small set of data.
With respect to
From the discussion above, it is shown that an eavesdropper cannot predict function ƒ values even if the would-be eavesdropper had seen half of the 2d possible query response pairs. For d=128, for example, the cryptographic method can provide 1018 queries with no compromised security.
Thus, in various embodiments, a processor can be configured for example, but not limited to, using a sequence of programmed instructions to calculate ƒ by computing the dot product ωTx+φ modulo some preset phase maximum. Secure communications can then be accomplished using a simple multiply accumulate step and a logical “AND” with a bit mask, requiring low processing bandwidth. Various embodiments can comprise a processor thus configured to compute f. Such embodiments can comprise, for example, but not limited to, a personal computer, a laptop computer, a wireless device, a wireless handset, a cellular telephone or terminal, a communications module or terminal, or other computing device including a lightweight computing device. In alternative embodiments, sawtooth waves can be used in the keyed one-way function, ƒ, instead of sine waves without impact to the degree of security provided. A sawtooth wave is a superposition of sine waves with harmonic frequencies.
As shown in
q←getRandomCorner( )
send(q);
r←receive( )
if (abs(r−f(q))<tol)
Further, a receiver 2 can be configured to execute the following sequence of programmed instructions:
q←receive( )
R←f(q);
Send(r);
With respect to
In various embodiments, the cryptographic system 10 can comprise a single sender-receiver pair or multiple sender-receiver pairs, or a group of senders and receivers, with each pair or group being associated with a common keyed one-way function as described hereinabove and vector, q, and private key (ω, φ).
The sender or sending device 11 can comprise a key manager 21, a vector generator 22, a keyed one-way function generator 23, a comparator 25, and a communication module 27. The key manager 21 and the vector generator 22 can be coupled to the keyed one-way function generator 23 to provide the private key and vector q, respectively, to the keyed one-way function generator 23. The vector generator 22 can be coupled to the communication module 27 to send the vector q to the receiving device 12. The comparator 25 may be coupled to the keyed one-way function generator 23 and the communication module 27 for receiving the computed function f(q) and a received response, r, respectively. The vector generator 22 can be coupled to the communication module 27 to send the vector q from the sending device 11 to the receiving device 12.
The receiver or receiving device 12 can comprise the key manager 21, the keyed one-way function generator 23, a response generator 24, and the communication module 27. The key manager 21 and the communication module 27 can be coupled to the keyed one-way function generator 23 for receiving the private key and the vector q, respectively. The response generator 24 can be coupled to the keyed one-way function generator 23 for receiving the response, r, and to the communication module 27 for sending the response r from the receiving device 12 to the sending device 11.
In various embodiments, the network 13 can comprise a wired or wireless network such as, but not limited to, a Local Area Network (LAN), Wide Area Network (WAN), RS-232, Universal Serial Bus (USB), Institute of Electrical and Electronics Engineers (IEEE) 1394 Firewire™, or other such private network, or a public-switched network such as the Internet, satellite, wireless, Radio Frequency (RF), the Public Switched Telephone Network (PSTN), another packet-based network, or any other electronic or optical communication medium.
In various embodiments, the key manager 21 can select a private key comprising a (ω, φ) pairing. For example, co can comprise a random vector, which may be a binary vector, and φ can comprise a phase. In various embodiments, the key manager 21 can comprise a sequence of programmed instructions that when executed cause a processor to select the private key. In various alternative embodiments, the key manager 21 can comprise a hardware module such as a card or a microcircuit device configured to perform these operations. In various embodiments, the private key can be prepositioned at the receiver 12. For example, the key manager 21 can obtain one or more private keys from local memory. The key manager 21 can maintain multiple private keys each corresponding to a different sender-receiver pair or group.
In various embodiments, the private key can be obtained or assigned from a key management system 14 provided in communication with the sending device 11. Alternatively, the key manager 21 can generate the private key using a random number generator algorithm.
In various embodiments, the vector generator 22 can form a vector, q, by selecting a random corner of a hypercube. In various embodiments, the vector q can comprise a multidimensional vector. For example, the vector q can comprise a binary vector of “d” dimension. In various embodiments, selection of the corner of the hypercube can be obtained using a random physical process such as, for example, atmospheric radio noise. In various embodiments, the vector generator 22 can also generate the hypercube of “d” dimensions. In various embodiments, the vector generator 22 can comprise a sequence of programmed instructions that when executed cause a processor to be configured to generate the vector, q, as described above. In various alternative embodiments, the vector generator 22 can comprise a hardware module such as a card or a microcircuit device configured to perform these operations.
In various embodiments, the keyed one-way function generator 23 can calculate a keyed one-way function using the private key (ω, φ) and the vector, q, modulo a predetermined phase maximum. In various embodiments, the function calculated by the keyed one-way function generator 23 can be, for example:
ƒ(q)=sin(ωTq+φ) Eq. (16)
In various embodiments, the keyed one-way function generator 23 can comprise a sequence of programmed instructions that when executed cause a processor to be configured to compute the keyed one-way function as described above. In various alternative embodiments, the keyed one-way function generator 23 can comprise a hardware module such as a card or a microcircuit device configured to perform these operations.
In various embodiments, the response generator 24 can compute a response, r, using the keyed one-way function and the private key (ω, φ) and the received vector, q, modulo a predetermined phase maximum. In various embodiments, the response calculated by the response generator 24 can be, for example:
r=ƒ(q)=sin(ωTq+φ) Eq. (17)
In various embodiments, the response generator 24 can comprise a sequence of programmed instructions that when executed cause a processor to be configured to compute the response as described above. In various alternative embodiments, the response generator 24 can comprise a hardware module such as a card or a microcircuit device configured to perform these operations.
In various embodiments, the comparator 25 can determine if a received response matches the expected response computed using the keyed one-way function and the private key. For example, the comparator 25 can perform a bit-by-bit compare of the response, r, and a locally computed one-way function f(q). In various embodiments, the comparison can use the first “t” bits of the response, r. In various embodiments, the comparator 25 can comprise a sequence of programmed instructions that when executed cause a processor to be configured to determine a correct response as described above. In various alternative embodiments, the comparator 25 can comprise a hardware module such as a card or a microcircuit device configured to perform these operations.
In various embodiments, the communication module 27 can provide the vector q from the sending device 11 to the receiving device 12. The communication module 27 can also receive the response r at the sending device 11 from the receiver 12. In various embodiments, the receiving device 12 can send the first “t” bits of the response, r, to the sending device 11. In various embodiments, the communication module 27 can comprise a sequence of programmed instructions that when executed cause a processor to be configured to send the vector q and receive the response r as described above. In various alternative embodiments, the response generator 24 can comprise a hardware module such as a card or a microcircuit device configured to perform these operations.
In various embodiments, f(q) can be any sinusoidal or sinusoidally-derived function, such as a sine wave or a sawtooth wave. Further, in various embodiments, the vector, q, can comprise a multidimensional vector based on random corners of a hypercube.
As shown in
With respect to
With respect to
Control can then proceed to 54, at which the sender or sending device can form a multidimensional vector from a random corner of a hypercube. Control can then proceed to 55, at which the sender can provide or send the multidimensional vector to a receiver. Control can then proceed to 56 and 57. At 56, the sender can compute a keyed one-way function using the multidimensional vector and a private key.
At 57, the receiving device or receiver can receive the multidimensional vector q from the sender. Control can then proceed to 58, at which the receiver can compute a keyed one-way function using the multidimensional vector received from the sender, and the private key. Control can then proceed to 59, at which the receiver can form a response using the keyed one-way function and private key. Control can then proceed to 60, at which the receiver can provide or send the response, r, to the sender. In an embodiment, the receiver can send only the first “t” bits of the response r.
At 61, the sender can receive the response, r, from the receiver. Control can then proceed to 62, at which the receiver can compare the received response from 61 to the locally computed one-way function at 56. If the comparison indicates a match condition, then control can proceed to 63, at which the sender 11 can register a successful authentication condition and permit further communication. If the comparison indicates that a match has not occurred, then control can proceed to 64, at which the sender 11 can register a failed authentication condition and prohibit further communication. Control can then proceed to 65, at which a method can end.
While the present teachings are described in conjunction with various embodiments, it is not intended that the present teachings be limited to such embodiments. On the contrary, the present teachings encompass various alternatives, modifications, and equivalents, as will be appreciated by those of skill in the art.
This application claims the benefit of U.S. Provisional Application No. 60/615,829, filed Oct. 4, 2004, the entire disclosure of which is hereby incorporated by reference as if set forth fully herein.
The U.S. Government has a paid-up license in this invention and the right in limited circumstances to require the patent owner to license others on reasonable terms as provided for by the terms of Contract No. CCR0122419 awarded by the National Science Foundation (NSF) and Contract No. F30602-03-2-0090 awarded by the Advanced Research and Development Activity (ARDA).
Number | Name | Date | Kind |
---|---|---|---|
5664016 | Preneel et al. | Sep 1997 | A |
6788788 | Kasahara et al. | Sep 2004 | B1 |
20030016770 | Trans et al. | Jan 2003 | A1 |
20030086515 | Trans et al. | May 2003 | A1 |
20040103325 | Priebatsch | May 2004 | A1 |
20060062389 | Mukherjee et al. | Mar 2006 | A1 |
Number | Date | Country | |
---|---|---|---|
20060088164 A1 | Apr 2006 | US |
Number | Date | Country | |
---|---|---|---|
60615829 | Oct 2004 | US |