This disclosure relates to the technical field of circuit-model quantum computation, and more particularly, to using quantum computers to perform computations for classical spin models.
In quantum algorithms for performing quantum computations for classical spin models, a quantum computer with n qubits can be used to store and manipulate a classical spin model with n spins, where each classical spin is mapped to a single qubit. This scheme has the property that the quantum state can represent a superposition of multiple spin configurations simultaneously. However, a notable disadvantage of this scheme is that a quantum computer with n qubits cannot represent a single spin configuration with more than n spins.
The disclosure is in the technical field of circuit-model quantum computation. Generally, it concerns methods to use quantum computers to perform computations on classical spin models, where the classical spin models involve a number of spins that is exponential in the number of qubits that comprise the quantum computer. (In this disclosure, the terms quantum computer, quantum processor, and quantum processing unit are used interchangeably.) Examples of such computations include optimization and calculation of thermal properties but extend to a wide variety of calculations that can be performed using the configuration of a spin model with an exponential number of spins. Spin models encompass optimization problems, physics simulations, and neural networks (there is a correspondence between a single spin and a single neuron). This disclosure has applications in these three areas as well as any other area in which a spin model can be used.
Some embodiments relate to a method. A classical spin model for I spins defined by a Hamiltonian is received. The Hamiltonian specifies an energy of a configuration of I spins. A quantum circuit is determined. The quantum circuit, when executed by a quantum computer, prepares an n-qubit quantum state |ψ in the quantum computer, where n<I. The quantum state |ψ is a superposition representing the configuration of I spins of the classical spin model, |ψ=Σi=0I−1αi|i, where |i is a spin index of spin i in the spin configuration and αi is an amplitude that represents the spin value of spin i in the spin configuration.
In some embodiments, the number of spins I is equal to 2n.
In some embodiments, αi may be a real number or a complex number.
In some embodiments, sign(αi) represents the spin value of spin i.
In some embodiments, a magnitude of αi represents the spin value of spin i.
In some embodiments, the spin model is an Ising model, αi is a real number, and sign(αi) represents the spin value of spin i.
In some embodiments, the spin model is an Ising model, αi is a real number or complex number, and a magnitude of αi represents the spin value of spin i.
In some embodiments, the spin model is an XY model, αi is a complex number, and angle (αi) represents the spin value of spin i.
In some embodiments, the spin model is an XY model, αi is a real number or complex number, and a magnitude of αi represents the spin value of spin i.
In some embodiments, the quantum circuit is executed by the quantum computer. A readout quantum circuit is executed by the quantum computer, where execution of the readout quantum circuit modifies the quantum state |ψ to form an output quantum state. The output quantum state of the quantum computer is measured. A property of the spin configuration (e.g., a macroscopy or microscopic property) is determined based on the measured output state. The property may be a spin value of a spin in the spin configuration, a magnetization of the spin configuration, a spin-spin correlation between two spins in the spin configuration, a sum of spin-spin correlations, an energy of the spin configuration, or a spin structure factor of the spin configuration. In some embodiments, the readout quantum circuit includes quantum gates that perform a Hadamard Test or a Swap Test with the quantum state |ψ. In some embodiments, the readout circuit includes quantum gates that apply a Quantum Fourier Transform to the quantum state |ψ.
In some embodiments, the quantum circuit is repeatedly executed by the quantum computer. For each execution of the quantum circuit: (1) a measurement basis is selected from a set of measurement bases and (2) the quantum state |ψ of the quantum computer is measured using the selected measurement basis. A neural network model is applied to the measurements, where the neural network model is trained to determine a property of the spin configuration based on the measurements.
In some embodiments, determining the quantum circuit includes the following steps. Quantum gates to be applied to the n qubits of the quantum computer are determined. An arrangement of the quantum gates in the circuit is determined. One or more gate parameters for one or more of the quantum gates are determined. Examples of the quantum gate parameters include at least one of: a single-qubit-rotation-gate angle, a controlled-phase-gate phase, or a generic multi-qubit-gate matrix entry.
In some embodiments, the quantum circuit is executed by the quantum computer. Executing the circuit may include the following steps: (1) initializing each qubit of the n qubits in state 0, (2) applying a Hadamard gate to each qubit of the n qubits, and (3) subsequent to applying the Hadamard gate to each qubit, applying a combination of (e.g., only) Z gates and controlled-Z gates to the n qubits.
In some embodiments, the quantum circuit is executed by the quantum computer. Executing the circuit may include the following steps: (1) initializing each qubit of the n qubits in state 0, (2) applying a Hadamard gate to each qubit of the n qubits, and (3) after applying the Hadamard gate to each qubit, applying a combination of (e.g., only) multi-control-qubit Controlled-U gates to the n qubits. U may be a Z gate or a group of XZX gates.
In some embodiments, the quantum circuit is executed by the quantum computer. A readout quantum circuit is executed by the quantum computer to form an output quantum state. The output quantum state of the quantum computer is measured. A non-quantum computer determines a (e.g., macroscopic or microscopic) property of the spin configuration based on the measured output quantum state. The non-quantum computer determines a second set of quantum circuit parameters based on the determined property. The quantum circuit may be a variational quantum circuit. The second set of quantum circuit parameters may be determined using a Markov Chain Monte Carlo procedure.
Some embodiments relate to a non-transitory computer-readable storage medium comprising stored instructions to determine a quantum circuit. The quantum circuit, when executed by a quantum computer, prepares an n-qubit quantum state |ψ in the quantum computer. The quantum state |ψ is a superposition representing a configuration of I spins of a classical spin model, |ψ=Σi=0I−1αi|i, where |i is a spin index of spin i in the spin configuration and αi is an amplitude that represents the spin value of spin i in the spin configuration, wherein n<I.
Some embodiments relate to a quantum circuit that, when executed by a quantum computer, prepares an n-qubit quantum state |ψ in the quantum computer, the quantum state |ψ being a superposition representing a configuration of I spins of a classical spin model, |ψ=Σi=0I−1αi|i, where |i is a spin index of spin i in the spin configuration and αi is an amplitude that represents the spin value of spin i in the spin configuration, wherein n<I.
Other aspects include components, devices, systems, improvements, methods, processes, applications, computer readable mediums, and other technologies related to any of the above.
Embodiments of the disclosure have other advantages and features which will be more readily apparent from the following detailed description and the appended claims, when taken in conjunction with the examples in the accompanying drawings, in which:
In
The figures and the following description relate to embodiments by way of illustration only. It should be noted that from the following discussion, alternative embodiments of the structures and methods disclosed herein will be readily recognized as viable alternatives that may be employed without departing from the principles of what is disclosed.
Some embodiments relate to a method to embed a classical spin model with exponentially many spins in a quantum computer. Related aspects of the disclosure are methods for initializing and manipulating spin configurations in a quantum computer; hybrid quantum-classical algorithms for evolving the spin configuration in time in a way that mimics the spin system being coupled with a thermal bath at a particular temperature T; methods for reading properties of the spin configuration (such as energy, magnetization, and correlation functions) out from the quantum computer; and methods to use the disclosed quantum embedding of a classical spin model to implement neural networks that have exponentially many neurons as a function of the number of qubits in the quantum computer.
Two examples of classical spin models that we refer to in this disclosure are the Ising model and the XY model. The Ising model is defined by a set of N spins, {si}, and a Hamiltonian (energy function) H=−Σ1≤i<j≤NJijsisj−Σ1≤i≤Nhisi, where si represents the value of the ith spin and takes a value of either −1 or +1, Jij represents the coupling between the ith and jth spins, and hi represents the external field applied to the ith spin. The XY model is defined by a set of N spins, {}, and a Hamiltonian H=−Σ1≤i<j≤NJij·j−Σii·, where represents the value of the ith spin and is a two-dimensional vector with unit length.
In quantum algorithms for performing quantum computations with the classical Ising model, such as the Quantum Approximate Optimization Algorithm (QAOA) and quantum annealing (QA), a quantum computer with n qubits is used to store and manipulate a classical spin model with n spins, where each classical spin is mapped to (or embedded in) a single qubit. The state of the quantum computer in the QAOA represents the spins as follows:
where n is the number of qubits and |i represents a particular spin configuration of the Ising model, and αi represents the weighting of the spin configuration |i. For example, |i=|0010 represents the 4-spin configuration “spin down” (s0=−1), “spin down” (s1=−1), “spin up” (s2=+1), “spin down” (s3=−1), i.e., ↓↓↑↓. This embedding scheme has the notable property that the quantum state can represent a coherent superposition of multiple physical configurations simultaneously,
represents a superposition of |0010=↓↓↑↓ and |0001=↓↓↓↑, both with equal weight and with the same phase. In some cases, there may be advantages to the simultaneous encoding of multiple physical configurations, but one notable disadvantage is this: the encoding of a single physical configuration uses only a tiny fraction of the ½n elements of the quantum state.
In this disclosure, we propose a new embedding, which we shall call the exponential embedding because it allows the embedding into a quantum computer of a spin model with an exponential number of spins. The embedding is as follows:
where n is the number of qubits, |i represents the index of a single spin in the spin model, and αi represents the value of the ith spin. In particular, we use a direct mapping between the classical spin values si (or i) and the amplitudes αi in the n-qubit quantum state. Because there are 2n elements in the sum, and hence 2n different αi values that are stored in the quantum state |ψ, the quantum state, using this embedding, is able to store a classical spin configuration comprising 2n spins. More generally, this embedding is able to store a spin configuration comprising a number of spins that are exponential in the number of qubits n. Note that in contrast to the previous embedding scheme used in QAOA and QA, the exponential embedding uses the quantum state to store a single physical configuration of the system—coherent superpositions over multiple physical configurations are not possible without additional qubits. The gain, however, is that an n qubit quantum system can represent an exponentially large spin system with 2n spins within the exponential embedding. Thus, for example, 10 qubits can be used to embed a spin configuration with 1024 spins, 20 qubits can embed a configuration with over one million spins, and 50 qubits can embed a configuration with 250 spins.
There are multiple possible ways to encode a spin value si (or i) in an amplitude αi. If the classical spin model is an Ising model, two general strategies are:
1.) a sign encoding, where αi is set to be a real number and sign(αi) determines the spin value: if sign(αi)=−1 then the ith spin has value si=−1, and if sign(αi)=+1 then the ith spin has value si=+1. An example of the sign encoding is to constrain the system so that αi is not any real number but a number that takes just one of two values: αi=−1/√{square root over (N)} or αi=+1/√{square root over (N)}. This specific variant of the sign encoding has the convenient feature that sums of αi over i give a quantity that is proportional to a sum over spin values si, which is not the case if the αi are arbitrary real numbers. However, by allowing αi to take arbitrary real values, more complex spin dynamics can be implemented (which in the case of optimization can support a mean-field-annealing implementation).
2.) a magnitude encoding, where αi can be a real or a complex number, and we use the magnitude of αi to encode the spin value. For example, a threshold value t can be chosen that is a real number between 0 and 1, and if |αi|<t then the ith spin has value si=−1, and if |αi|>t then the ith spin has value si=+1.
For Ising spin models, the preferred encoding may be the sign encoding.
If the classical spin model is an XY model, then the classical spin values i are unit vectors, which can each be equivalently represented by a single real number, (e.g., an angle θi). Two strategies for encoding XY spins are then:
1.) a phase encoding, where αi is set to be a complex number and angle (αi) determines the spin value. For example, we can set θi=angle (αi).
2.) a magnitude encoding, where αi is set to be a real or a complex number, and we use the magnitude of αi to encode the spin value. For example, we can set θi=2π|αi|/|αmax|, where |αmax| is the maximum value that |αi| can take.
For XY spin models, the preferred encoding may be the phase encoding.
The spins in the Potts model (the generalization of the Ising model in which the spins can take on one of q possible discrete values, where for q=2 the Potts model reduces to the Ising model) can be encoded in the same way as XY spins, but where the coefficient αi takes on discrete values instead of continuous values.
For spin models whose spins each use (e.g., require) two real numbers to describe, such as the classical Heisenberg model (whose spins are unit vectors in 3-space), a single spin in a single complex amplitude αi can be encoded by taking advantage of the fact that the complex amplitude has real and imaginary parts, or equivalently magnitude and phase. For example, a single spin in the classical Heisenberg model can be described by two angles, and these two angles can be encoded in the complex number αi.
For spin models whose spins each use (e.g., require) more than two real numbers to describe, such as an extension of the classical Heisenberg model to spins as unit vectors in 4 dimensions, the spin may be encoded in more than one complex amplitude αi, since each amplitude αi can encode at most two real numbers.
It is also possible to encode spin models that contain multiple types of spin, e.g., a spin model that comprises a set of Ising spins (which take on binary values from {−1,+1}) and a set of XY spins (which take on real values). This can be done by choosing the encoding for the Ising spins using an encoding for αi described above (where the index i covers the indices of the Ising spins in the model) and choosing the encoding for the XY spins using an encoding for αi described above (where the index i covers the indices of the XY spins in the model).
The choice of encoding has implications for how the state is initialized, manipulated, and read out, with different encodings being more or less effective and/or convenient for different purposes.
Given a choice of spin model (Ising, XY, or another classical spin model type, including hybrid models) and an encoding of spins for that model in the quantum-state amplitudes αi, it's desirable to devise a quantum circuit that can efficiently create quantum states representing spin configurations of interest.
One approach to devising such a circuit is to create a circuit with variational parameters that allow it to create states in many different regions of Hilbert space (e.g., as performed in the development of variational quantum algorithms for quantum chemistry (and quantum simulation of quantum systems more broadly)). This same approach is possible for the method presented in this patent, and in some circumstances it may be desirable. However, in general an arbitrary variational circuit causes a quantum state to be prepared that does abide by the encoding rules. For example, suppose one is working with an Ising model and has chosen the sign encoding where αi=−1/√{square root over (N)} or αi=+1/√{square root over (N)}. An arbitrary variational quantum circuit will prepare a state where the amplitudes αi take on values other than ±1/√{square root over (N)}.
For the Ising model with a sign encoding αi=±1/√{square root over (N)}, a family of circuits that keeps the quantum state in a subspace that doesn't violate the encoding rules is the following: start with all qubits in the state 0 in the computational basis, apply the Hadamard gate to each qubit, and then apply only Z (single-qubit) gates and, Controlled-Z (two-qubit) gates (e.g., see
If a given spin model has constraints on the values that the spins can take (e.g., as may arise when considering a binary-variable optimization problem that has constraints on the binary variables, and where the objective function of the optimization problem is given as an Ising Hamiltonian), it is possible to design the quantum circuit that creates the quantum state representing a spin configuration in such a way that any created spin configuration is guaranteed to satisfy the constraints associated with the Hamiltonian. In the language of the optimization community, the circuit can be designed to keep the spin configuration in the “feasible subspace.”
After a family of circuits for creating spin-configuration quantum states has been decided, there are many different possible choices for how to initialize the spin-configuration quantum state as well as how to manipulate the spin-configuration quantum state. For initialization, an example choice is to create the state where all the spins take the same value (e.g., spin-up). With an Ising model and sign encoding, starting with all the qubits in the state 0 and applying the Hadamard gate to each qubit creates the all-spins-up configuration.
With the spin-configuration initialized, it can be manipulated by applying different circuits from the family of circuits that preserve the encoding rules. The choice of the circuits to manipulate the state is determined by what the ultimate goal of the computation is. Two goals that this disclosure considers are the sampling of thermal states of a spin model, and the heuristic optimization of a spin model (attempting to find a spin configuration that has a low (e.g., the lowest) energy). Possible updates to the circuit for preparing the spin configuration that the classical processor (e.g., Classical Processor 200) may propose include both changes to the structure of the circuit itself (such as inserting or deleting gates) as well as to modifying the parameters of one or more of the gates (such as changing the rotation angle of a single-qubit rotation gate). In general for spin models in which the spins take on discrete values (such as the Ising model or the Potts model), the circuit updates will involve changes to the circuit that are also discrete (such as adding or removing gates from a discrete set of possible gates), whereas for spin models in which the spins take on continuous values (such as the XY model), the circuit updates will involve changes to the circuit that are continuous (such as modifying the continuous parameter of a rotation gate) and also discrete changes (such as adding or removing gates, where the gates may come from a continuous set).
Once a circuit has been used to create a quantum state representing a particular classical-spin configuration (in a particular choice of encoding), it's desirable to be able to read out properties of the spin configuration. The readout circuits are, in general, specific to the spin model, encoding, and property to read out.
An example readout to perform is to read out the value of one particular spin, the ith spin. This particular readout operation is generic across spin models and encoding. To read out the value of the ith spin, it is sufficient to read out the quantity i|ψ, which is equal to αi. An example circuit to perform this readout is to use either the Hadamard Test or the Swap Test to compute the inner product between |ψ and |i.
In spin models, a property that is often of interest is the magnetization of the spin configuration. For an Ising model with sign encoding αi±1/√{square root over (N)}, the magnetization is defined as M=Σi=0N−1 αi. This can be read out by reading out the quantity 0|H⊗n|ψ. By applying Hadamard gates to every qubit of the state |ψ, a state whose all-0's amplitude is equal to Σi=0N−1 αi is created. This can be read out by, for example, using a Hadamard Test or a Swap Test between H⊗n|ω and |0, where H⊗n represents applying a Hadamard gate to each of the n qubits and |0 represents all the qubits in the 0 state.
A property of a spin configuration that is typically of great interest is energy of a spin configuration. The energy depends on the Hamiltonian H of the spin model, so an energy-readout circuit may require knowledge of the Hamiltonian. In some embodiments, the general strategy to perform readout is to use a Hadamard Test or Swap Test where one of the states in the test is the spin-configuration state |ψ, and the other is a carefully constructed state |ϕ that depends on the Hamiltonian.
If the Hamiltonian is that of a 1D Ising model with nearest-neighbor interactions, H=−Σ1≤i≤NJisisi+1, then the choice for |ϕ of |ϕ=C Σi=0N−1Jiαi+1|i results in the Hadamard or Swap Test allowing the calculation of ϕ|ψ=C Σi=0N−1 Jiαi+1αi. This is precisely the Ising energy, up to the known normalization constant C. The normalization constant C may be determined entirely by what the values Ji in the Hamiltonian are, chosen such that |ϕ is a normalized state.
If there are poly(n) distinct values of Ji, then the circuit to create |ϕ can be polynomial in the number of qubits n, making the circuit efficient to perform. As an example, consider the case where the Hamiltonian is H=−Σ1≤i≤NJsisi+1, (there is only one distinct value of Ji that is the same for all neighboring spins in the 1D chain). The circuit to create |ϕ in this case is to do the following: first, use the spin-configuration-preparation circuit to create the spin-configuration state |ψ; second, apply an adder circuit that performs the map |i→|i+1. This creates the desired state. A modification of this allows the read out of the energy of Hamiltonian terms that correspond to next-near-neighbor interactions H=−Σ1≤i≤N Jsisi+2: when creating |ϕ, use an adder that adds 2 instead of 1, (said differently it performs the map |i→|i+2).
If the Hamiltonian is that of a 2D Ising model with nearest-neighbor interactions, the test state |ϕ can be created using a slight modification of the circuit for the 1D Ising model. For a square lattice, each spin has four neighbors, and an example representation of the Hamiltonian is H=−Σ1≤i≤NJsisi+1−Σ1≤i≤NJsisi+√{square root over (N)}. If the state |ϕ is prepared using an adder that adds √{square root over (N)}, i.e., |i|i+√{square root over (N)}), then the Hadamard or Swap Test between |ϕ and |ψ gives the contribution to the Ising energy from the interactions between spins that are one row apart. But doing two Hadamard or Swap Tests, one corresponding to Σ1≤i≤NJsisi+1 and another corresponding to Σ1≤i≤NJsisi+√{square root over (N)}, the Ising energy of this 2D Ising model can be read out. The given approach to reading out energies from square lattices can be readily extended, by a person familiar with the art, to 2D triangular, Kagome, or other lattices, as well as to 3D or even higher-dimensional lattices. Similarly, the approach to performing energy readout can be readily extended to the case of XY or other spin models.
In even more generality, this method of performing readout of energy of the spin configuration can be used for arbitrary Ising or XY-model Hamiltonians. For both the Ising and the XY models, there is a matrix J that defines the energies due to spin-spin interactions. Since both the spin state |ψ and the state |ϕ that the Hadamard or Swap Test are used to compute the dot product between have dimension N, the test can compute an energy for at most N spin-spin-interaction terms. However, the J matrix in general encodes N2 spin-spin interactions, so to perform energy readout for a general J matrix may require the execution of N different Hadamard or Swap Tests, using N different states |ϕ.
For a J matrix that has some elements equal to zero (some spin-spin interactions that may in principle exist have strength zero), the number of Hadamard or Swap Tests to read out the energy is nnz(J)/N, where “nnz(J)” represents the number of elements of J whose values are not zero.
From this we can see that the energy-readout procedure is remarkably efficient, in terms of the number of Hadamard or Swap Tests to be performed. For a 1D Ising model with nearest-neighbor interactions, nnz(J)≈N, so the number of tests used is N/N=1. For a 2D Ising model on a square lattice, nnz(J)≈2N, so the number of tests used is 2N/N=2. For an Ising model whose spin-spin interactions are on a graph with maximum degree d, then nnz(J)≈d N (at most), so the number of tests used is d. So for any spin model with sparse connectivity, energy readout will be efficient in the number of tests used.
As a concrete example of energy readout for an XY model, if the phase encoding is used, then the dot product ϕ|ψ between |ψ and |ϕ=C Σi=0N−1 Jiαi+1|i is a complex number whose real part is Re[ϕ|ω]∝Σi=0N−1 Ji cos(θi+1−θi). This quantity, Re[ϕ|ψ], can be read out using a Hadamard Test. This is precisely the energy of the nearest-neighbor interactions in the XY model. As has been explained for the Ising model, by appropriate choice and construction of |ϕ, different sums of energy terms from the XY model can be read out.
The readout of energy from an Ising or XY model is very related to the problem of reading out correlation functions. The methods described above for reading out energies can be readily adapted to read out either individual spin-spin correlations (such as si=1sj for any choice of spin index j) or sums of spin-spin correlations (such as Σ1≤j≤Nsi=1sj, or even sums over all two-point correlations with all choices of “starting spin”, Σ1≤i<j≤Nsisj). The method to read out individual correlations or sums of correlations is to perform one or more Hadamard or Swap Tests using a state |ϕ that has been prepared as a superposition of one or more basis states with amplitudes corresponding to the spins one wants to compute correlations with, e.g., prepare |ϕ=C Σi=0N−1 αi+1|i (where C is a normalization constant) to compute the sum of correlations of nearest-neighbor spins in a 1D chain.
The ability to read out sums of spin-spin correlations enables thermodynamic properties of spin models to be read out. For example, the suspectibility χ of a 2D Ising model is given by a sum of spin-spin correlation functions:
where M is the magnetization and C(m, n) is the spin-spin correlation function C(m, n):=s0,0sm,n.
Another readout mechanism that can give insight into the correlations between spins is to use the Quantum Fourier Transform and then either directly measure the output state in the computational basis or perform a Hadamard or Swap Test to extract a particular piece of spectral information. As an example, if the state encoding the spin configuration, |ψ, uses the sign encoding αi=±1/√{square root over (N)}, it is transformed by the application of the Quantum Fourier Transform to:
This state encodes information about the periodicity in the spin configuration in |ψ.
If we set ω=e−i, then UQFT|ψ can be written as:
where
S(q) here is the Spin Structure Factor for a 1D spin model. By measuring the UQFT|ψ state in the computational basis, the distribution of the Spin Structure Factor for different q values is determined, and in a few repeated measurements the values of q that give the largest S(q) can be determined with high probability (e.g., probability >90% (the exact number may depend on what is meant by “few” and what the distribution S(q) looks like, which is problem-dependent)), if the distribution of S(q) has prominent peaks. The Spin Structure Factor is a relevant quantity to extract when modeling neutron-scattering experiments. Beyond the 1D case, there is a natural extension to computing the more general Spin Structure Factor for higher-dimensional lattices:
Beyond the two general types of readout we have proposed (Hadamard or Swap Tests to extract information with carefully designed states |ϕ, and periodicity/spectral information from the Quantum Fourier Transform), there is another general type of readout method that may be productively used to efficiently extract useful information from the configuration state |ψ: methods where measurements of |ψ are performed whose outcomes are then postprocessed classically to extract relevant quantities.
Two examples of readout methods that fall into this general type are shadow tomography (as described in arXiv:2002.08953) and neural-network quantum state tomography (as described in arXiv:1703.05334). Both of these readout methods enable the readout of quantities of interest that may be difficult to access via either Hadamard/Swap Tests or Quantum Fourier Transforms without classical postprocessing. The network-network methods are particularly flexible: classical neural networks can be used to extract properties of the spin configuration stored in |ψ that we do not have explicit constructions to extract, where a classical neural network is trained to be able to output the quantity of interest given an input of results of measurements of the state performed in (e.g., random) bases. Here “bases” typically refers to Pauli-X, Y, or Z bases for individual qubit measurements.
By being able to prepare spin configurations efficiently using variational ansaetze with moderate numbers of parameters, and by being able to read out the energy of a prepared spin configuration, one has the ingredients to perform a hybrid algorithm that operates partially on a quantum computer and partially on a classical computer (also referred to as a non-quantum computer or a classical processor) to either optimize (e.g., minimize) the energy of a given spin Hamiltonian, or to compute thermal properties of the Hamiltonian at a particular temperature T. For the optimization use case, updates to the spin configuration can be chosen in a multitude of ways, including using black-box classical optimizers. For the thermal-property-calculation use case, a Markov-Chain-Monte-Carlo-style algorithm (such as Simulated Annealing) may be applied in which an artificial temperature T is introduced and is used to control the probability of accepting a change to the variational ansatz's parameters, which in turn change the spin configuration (e.g., see
Besides the Markov-Chain-Monte-Carlo approaches, thermal properties can be extracted using a Thermofield-Double approach. The Thermofield Double is a pure state that comprises two copies of a system, and when one copy is traced out, results in the other copy being a thermal state at a chosen inverse temperature β. The Thermofield Double can be written as the quantum state
When implementing this disclosure's exponential spin embedding, the state |n is a state representing a classical spin configuration that has energy En. By making two copies of the qubits and following the established procedure for variationally preparing the Thermofield Double state |TFD(β), a thermal mixture of classical spin configurations can be obtained and thermal properties directly read out, without needing to run Markov-Chain-Monte-Carlo iterations. The tradeoff is that the variational optimization may instead be performed to obtain |TFD(β), but this can be beneficial since once the parameters to create |TFD(β) are known, |TFD(β) can be created repeatedly, whereas Markov-Chain-based methods suffer from correlations between samples and long burn-in times.
For the use case where the methods in this disclosure are used to find the optimal spin configuration for a particular Hamiltonian or other objective function, we note that the methods apply to a wide variety of optimization problems, including binary optimization problems (where the optimization variables are binary, e.g., Ising spins); real-valued optimization problems (where the optimization variables are continuous, e.g., as in with XY spins); discrete-valued optimization problems (where the optimization variables are discrete and can take on more than just two values, e.g., spins in the Potts model). For any optimization problem where one wants to know the optimal objective-function value (e.g., the minimum spin-configuration energy), the methods presented in this disclosure give an efficient procedure for reading out the desired answer (e.g., the energy), provided that the optimization procedure converged to the correct variational parameters. Furthermore, if it's desirable to read out the values of just a subset of the total set of spins in a given problem, if one uses a procedure to read out the value of an individual spin that is efficient, then it will be efficient to read out the values of the subset of spins. An example of a particular use case where it is desirable to read out only a subset of spins is in constrained optimization problems (e.g., see
There exists a correspondence between spin models and neural networks, wherein a single spin represents (or maps to) a single neuron. The methods in this disclosure provide a way to represent, in a quantum computer, classical neural networks that have a number of neurons that is given by an exponential function of the number of qubits used. Hopfield neural networks, given their correspondence with the Ising model, can be realized, allowing the simulation of an associative memory. More broadly, going beyond the Ising model, real-valued neurons in XY spins can be encoded and the methods in this disclosure can be used to read out just a subset of neurons, which can be defined as the output of the neural network. The topology of the neural network is encoded in the Hamiltonian, which can be updated at will since it is stored classically. This allows for training to be performed, where, for example, the J matrix in the XY Hamiltonian can be modified in response to the observed outputs from the network.
A spin model is input 100 to the classical processor. This model is typically specified by a Hamiltonian.
The Classical Processor 200 determines an initial spin-configuration-preparation circuit to be executed on the QPU 201.
The QPU 201 executes 220 the circuit to prepare the spin configuration.
The QPU 201 performs 221 a measurement of a macroscopic property of the spin configuration 221. The choice of macroscopic property is to be determined by the user based on their application. Two examples of macroscopic properties that the user might choose are magnetization and energy.
The QPU 201 sends to the measurement result to the Classical Processor 200. The Classical Processor 200 determines 210 how to update the spin-configuration-preparation circuit based on the measurement result.
The Classical Processor 200 updates 211 the circuit and the updated circuit may again be passed to the QPU 201. This cycle repeats until it stops (e.g., due to a stopping condition being met). The stopping condition may, for example, be that a threshold number of iterations has been performed, or that the record of measurement results satisfies some condition (such as that the macroscopic property has not changed substantially over a certain number of iterations, e.g., it has converged).
The output of the method is a record 101 of the measured macroscopic property for each iteration of the method.
A spin model is input 100 to the classical processor. This model is typically be specified by a Hamiltonian.
The Classical Processor 200 determines an initial spin-configuration-preparation circuit to be executed on the QPU 201.
The QPU 201 executes 220 the circuit to prepare the spin configuration.
The QPU 201 performs 222 a measurement of the energy of the spin configuration 222.
The QPU 201 sends to the measurement result to the Classical Processor 200. The Classical Processor 200 determines 212 how to update the spin-configuration-preparation circuit based on the energy. For example, if the new energy is lower than the energy of the previous configuration, then the new spin-configuration-preparation circuit is accepted—meaning that the Classical Processor 200 updates a memory containing the spin-configuration-preparation circuit to store the new circuit. If the new energy is not lower, then the new-spin-configuration-preparation circuit is only accepted with some probability p that is typically be chosen to be <1. This procedure may be analogous to simulated annealing or related Markov Chain Monte Carlo methods used in simulations of spin systems on classical computers, where the choice of probability p can govern the effective temperature that the system is simulated at.
The Classical Processor 200 updates 213 the circuit by proposing a (e.g., random) modification (either to the circuit structure or to one of the parameters of the gates in the circuit) and the updated circuit may again be passed to the QPU 201. This cycle repeats until it stops (e.g., due to a stopping condition being met). The stopping condition may, for example, be that a threshold number of iterations has been performed, or that the record of measured energies satisfies some condition (such as that the energy has not changed substantially over a certain number of iterations, e.g., it has converged).
The output of the method is a record 102 of the energy for each iteration of the method.
A constrained optimization problem is determined 103.
The constrained problem is converted 104 to a spin model that has no constraints. The resulting spin model may generally have more spins (variables) than the original problem, because standard methods for converting constrained to unconstrained problems tend to introduce auxiliary variables. The spin model is input 100 to the classical processor 200.
The Classical Processor 200 determines initial spin-configuration-preparation circuit to be executed on the QPU 201.
The QPU 201 executes 220 the circuit to prepare the spin configuration.
The QPU 201 performs 220 a measurement of the energy of the spin configuration.
The QPU 201 sends to the measurement result to the Classical Processor 200. The Classical Processor 200 determines 212 how to update the spin-configuration-preparation circuit based on the energy. For example, if the new energy is lower than the energy of the previous configuration, then the new spin-configuration-preparation circuit is accepted—meaning that the Classical Processor 200 updates a memory containing the spin-configuration-preparation circuit to store the new circuit. If the new energy is not lower, then the new-spin-configuration-preparation circuit is only accepted with some probability p that is typically be chosen to be <1. This procedure may be analogous to simulated annealing or related Markov Chain Monte Carlo methods used in simulations of spin systems on classical computers.
The Classical Processor 200 updates 213 the circuit by proposing a (e.g., random) modification (either to the circuit structure or to one of the parameters of the gates in the circuit) and the updated circuit may again be passed to the QPU 201. This cycle repeats until it stops. The stopping condition may, for example, be that a threshold number of iterations has been performed, or that the record of measured energies satisfies some condition (such as that the energy has not changed substantially over a certain number of iterations, e.g., it has converged).
The record 102 of the energy for each iteration of the method may be searched for the iteration that had the lowest energy, and the QPU 201 may perform an additional (e.g., final) task (e.g., in response to a request 105): executing the spin-configuration-preparation for this iteration one more time, but instead of measuring the energy, a measurement of the spins corresponding to the variables in the original constrained problem may be performed 223.
The output 106 of the method is the set of solution variable values for the lowest observed energy configuration.
Some embodiments relate to a method for embedding a spin configuration of a classical spin model with N:=2n spins in n qubits of a quantum computer. The method comprises representing the spin configuration in a superposition state of n qubits, |ψ=Σi=02
The spin model may be a Heisenberg model and the encoding of α1 is a combination of phase and magnitude encoding or a combination of real and imaginary encoding. The spin model may represent the Hopfield model and variational optimization results in the generation of an associative memory.
In some embodiments, a classical computer is used to update the variational parameters based on read-out properties of the current spin configuration. The classical computer may use an optimization procedure to modify the variational parameters such that the read-out energy of the spin configuration is reduced (e.g., minimized). The classical computer may use a Markov Chain Monte Carlo (Simulated Annealing) procedure for choosing updated variational parameters. For example, a previous update of the parameters is accepted with probability 1 if the new spin configuration's energy is lower than an artificial temperature T, and the update of parameters is accepted with probability <1 based the difference in energy and the artificial temperature T, if the new spin configuration's energy is higher. A readout circuit may be used to extract thermal properties of the spin model at temperature T.
In some embodiments, a thermofield-double state
is variationally created. Readout may be performed to determine thermal properties of a spin model without running a Markov Chain Monte Carlo approach.
An optimization problem with constraints may be converted to an optimization problem with no constraints but more variables (e.g., spins), and the method may be used to solve the larger unconstrained problem. Readout of individual spins may be used to extract the solution of the original problem efficiently by only reading out the spins that correspond to variables in the original (constrained problem). The additional spins that correspond to constraint-enforcers may not be read out.
The spins may be interpreted as neurons to enable the embedding of neural networks with e.g., N:=2n, neurons inn qubits of a quantum computer.
A quantum processing device (also referred to as a quantum computer, quantum processor, or quantum processing unit) exploits the laws of quantum mechanics in order to perform computations. Quantum processing devices commonly use so-called qubits, or quantum bits. While a classical bit always has a value of either 0 or 1, a qubit is a quantum mechanical system that can have a value of 0, 1, or a superposition of both values. Example physical implementations of qubits include superconducting qubits, spin qubits, trapped ions, arrays of neutral atoms, and photonic systems (e.g., photons in waveguides). For the purposes of this disclosure, a qubit may be realized by a single physical qubit or as an error-protected logical qubit that itself comprises multiple physical qubits. The disclosure is also not specific to qubits. The disclosure may be generalized to apply to quantum processors whose building blocks are qudits (d-level quantum systems, where d>2) or quantum continuous variables, rather than qubits.
A quantum circuit is an ordered collection of one or more gates. A sub-circuit may refer to a circuit that is a part of a larger circuit. A gate represents a unitary operation performed on one or more qubits. Quantum gates may be described using unitary matrices. The depth of a quantum circuit is the least number of steps needed to execute the circuit on a quantum computer. The depth of a quantum circuit may be smaller than the total number of gates because gates acting on non-overlapping subsets of qubits may be executed in parallel. A layer of a quantum circuit may refer to a step of the circuit, during which multiple gates may be executed in parallel. In some embodiments, a quantum circuit is executed by a quantum computer. In this sense a quantum circuit can be thought of as comprising a set of instructions or operations that a quantum computer should execute. In order to execute a quantum circuit on a quantum computer, a user may inform the quantum computer what circuit is to be executed. A quantum computer may include both a core quantum device and a classical peripheral/control device that is used to orchestrate the control of the quantum device. It is to this classical control device that the description of a quantum circuit may be sent when one seeks to have a quantum computer execute a circuit.
A variational quantum circuit may refer to a parameterized quantum circuit that is executed many times, where each time some of the parameter values may be varied. The parameters of a parameterized quantum circuit may refer to parameters of the gate unitary matrices. For example, a gate that performs a rotation about the y axis may be parameterized by a real number that describes the angle of the rotation. Variational quantum algorithms are a class of hybrid quantum-classical algorithm in which a classical computer is used to choose and vary the parameters of a variational quantum circuit. Typically the classical processor updates the variational parameters based on the outcomes of measurements of previous executions of the parameterized circuit.
The description of a quantum circuit to be executed on one or more quantum computers may be stored in a non-transitory computer-readable storage medium. The term “computer-readable storage medium” should be taken to include a single medium or multiple media (e.g., a centralized or distributed database, or associated caches and servers) able to store instructions. The term “computer-readable medium” shall also be taken to include any medium that is capable of storing instructions for execution by the quantum computer and that cause the quantum computer to perform any one or more of the methodologies disclosed herein. The term “computer-readable medium” includes, but is not limited to, data repositories in the form of solid-state memories, optical media, and magnetic media.
The approaches described above may be amenable to a cloud quantum computing system, where quantum computing is provided as a shared service to separate users. One example is described in patent application Ser. No. 15/446,973, “Quantum Computing as a Service,” which is incorporated herein by reference.
Some portions of above description describe the embodiments in terms of algorithmic processes or operations. These algorithmic descriptions and representations are commonly used by those skilled in the computing arts to convey the substance of their work effectively to others skilled in the art. These operations, while described functionally, computationally, or logically, are understood to be implemented by computer programs comprising instructions for execution by a processor or equivalent electrical circuits, microcode, or the like. Furthermore, it has also proven convenient at times, to refer to these arrangements of functional operations as modules, without loss of generality.
As used herein, any reference to “one embodiment” or “an embodiment” means that a particular element, feature, structure, or characteristic described in connection with the embodiment is included in at least one embodiment. The appearances of the phrase “in one embodiment” in various places in the specification are not necessarily all referring to the same embodiment. Similarly, use of “a” or “an” preceding an element or component is done merely for convenience. This description should be understood to mean that one or more of the element or component is present unless it is obvious that it is meant otherwise.
Alternative embodiments are implemented in computer hardware, firmware, software, and/or combinations thereof. Implementations can be implemented in a computer program product tangibly embodied in a machine-readable storage device for execution by a programmable processor; and method steps can be performed by a programmable processor executing a program of instructions to perform functions by operating on input data and generating output. As used herein, ‘processor’ may refer to one or more processors. Embodiments can be implemented advantageously in one or more computer programs that are executable on a programmable system including at least one programmable processor coupled to receive data and instructions from, and to transmit data and instructions to, a data storage system, at least one input device, and at least one output device. Each computer program can be implemented in a high-level procedural or object-oriented programming language, or in assembly or machine language if desired; and in any case, the language can be a compiled or interpreted language. Suitable processors include, by way of example, both general and special purpose microprocessors. Generally, a processor will receive instructions and data from a read-only memory and/or a random-access memory. Generally, a computer will include one or more mass storage devices for storing data files; such devices include magnetic disks, such as internal hard disks and removable disks; magneto-optical disks; and optical disks. Storage devices suitable for tangibly embodying computer program instructions and data include all forms of non-volatile memory, including by way of example semiconductor memory devices, such as EPROM, EEPROM, and flash memory devices; magnetic disks such as internal hard disks and removable disks; magneto-optical disks; and CD-ROM disks. Any of the foregoing can be supplemented by, or incorporated in, ASICs (application-specific integrated circuits) and other forms of hardware.
Although the above description contains many specifics, these should not be construed as limiting the scope of the invention but merely as illustrating different examples. It should be appreciated that the scope of the disclosure includes other embodiments not discussed in detail above. Various other modifications, changes, and variations which will be apparent to those skilled in the art may be made in the arrangement, operation, and details of the methods and apparatuses disclosed herein without departing from the spirit and scope of the invention.
This application claims priority under 35 U.S.C. § 119(e) to U.S. Provisional Patent Application Ser. No. 63/196,597, ‘Exponential Spin Embedding for Quantum Computers,’ filed on Jun. 3, 2021, the content of which is incorporated by reference.
Number | Date | Country | |
---|---|---|---|
63196597 | Jun 2021 | US |