This patent disclosure may contain material that is subject to copyright protection. The copyright owner has no objection to the facsimile reproduction by anyone of the patent document or the patent disclosure as it appears in the U.S. Patent and Trademark Office patent file or records, but otherwise reserves any and all copyright rights.
This patent relates to quantum computing, and more specifically to preparing and evolving an array of atoms for quantum computations.
As quantum simulators, fully controlled, coherent many-body quantum systems can provide unique insights into strongly correlated quantum systems and the role of quantum entanglement and enable realizations and studies of new states of matter, even away from equilibrium. These systems also form the basis for the realization of quantum information processors. While basic building blocks of such processors have been demonstrated in systems of a few coupled qubits, increasing the number of coherently coupled qubits to perform tasks that are beyond the reach of modern classical machines is challenging. Furthermore, existing systems lack coherence and/or quantum nonlinearity for achieving fully quantum dynamics.
Neutral atoms can serve as building blocks for large-scale quantum systems, as described in more detail in PCT Application No. PCT/US18/42080, titled “NEUTRAL ATOM QUANTUM INFORMATION PROCESSOR.” They can be well isolated from the environment, enabling long-lived quantum memories. Initialization, control, and read-out of their internal and motional states is accomplished by resonance methods developed over the past four decades. Arrays with a large number of identical atoms can be rapidly assembled while maintaining single-atom optical control. These bottom-up approaches are complementary to the methods involving optical lattices loaded with ultracold atoms prepared via evaporative cooling, and generally result in atom separations of several micrometers. Controllable interactions between the atoms can be introduced to utilize these arrays for quantum simulation and quantum information processing. This can be achieved by coherent coupling to highly excited Rydberg states, which exhibit strong, long-range interactions. This approach provides a powerful platform for many applications, including fast multi-qubit quantum gates, quantum simulations of Ising-type spin models with up to 250 spins, and the study of collective behavior in mesoscopic ensembles. Short coherence times and relatively low gate fidelities associated with such Rydberg excitations are challenging. This imperfect coherence can limit the quality of quantum simulations and can dim the prospects for neutral atom quantum information processing. The limited coherence becomes apparent even at the level of single isolated atomic qubits.
PCT/US18/42080 describes exemplary methods and systems for quantum computing. These systems and methods can involve first trapping individual atoms and arranging them into particular geometric configurations of multiple atoms, for example, using acousto-optic deflectors. This precise placement of individual atoms assists in encoding a quantum computing problem. Next, one or more of the arranged atoms may be excited into a Rydberg state, which can produce interactions between the atoms in the array. After, the system may be evolved under a controlled environment. Finally, the state of the atoms may be read out in order to observe the solution to the encoded problem. Additional examples include providing a high fidelity and coherent control of the assembled array of atoms.
In one or more embodiments, a method includes selectively arranging a plurality of qubits into a spatial structure to encode a quantum computing problem, wherein each qubit corresponds to a vertex in the quantum computing problem, and wherein spatial proximity of the qubits represents edges in the quantum computing problem; initializing the plurality of qubits into an initial state; driving the plurality of qubits into a final state by applying a sequence of resonant light pulses with a variable duration and a variable optical phase to at least some of the plurality of qubits, wherein the final state comprises a solution to the quantum computing problem; and measuring at least some of the plurality of qubits in the final state.
In one or more embodiments, the spatial structure comprises a one-dimensional, two-dimensional or three-dimensional array of qubits.
In one or more embodiments, the encoded quantum computing problem comprises one or more of an unweighted maximum independent set problem, a maximum-weight independent set problem, a maximum clique problem, and a minimum vertex cover problem.
In one or more embodiments, weights in the maximum-weight independent set problem are encoded by applying light shifts to at least some of the plurality of qubits.
In one or more embodiments, the final state of the plurality of qubits comprises one or more of a solution to the encoded unweighted maximum independent set problem, a solution to the encoded maximum-weight independent set problem, a solution to the encoded maximum clique problem, and a solution to the encoded minimum vertex cover problem.
In one or more embodiments, the solution to the quantum computing problem comprises an approximate solution to the quantum computing problem.
In one or more embodiments, a method includes selectively arranging a plurality of qubits into a spatial structure comprising a plurality of vertex qubits and a plurality of ancillary qubits to encode a quantum computing problem using spatial proximity of the plurality of qubits, wherein each vertex qubit corresponds to a vertex in the quantum computing problem and wherein subsets of the ancillary qubits correspond to edges in the quantum computing problem; initializing the plurality of qubits into an initial state; driving the plurality of qubits into a final state, wherein the final state comprises a solution to the quantum computing problem; and measuring at least some of the plurality of qubits in the final state.
In one or more embodiments, the driving the plurality of qubits into the final state comprises applying light pulses with a constant or variable Rabi frequency Ω and a constant or variable detuning Δ to at least some of the plurality of qubits.
In one or more embodiments, the applying light pulses to the at least some of the plurality of qubits further includes: applying at least one light pulse with a detuning Δ0 to a vertex qubit comprising a corner vertex or a junction vertex; and applying at least one light pulse with a detuning Δi to each of i ancillary qubits adjacent to the vertex qubit on an edge connected to the vertex qubit.
In one or more embodiments, the applying the light pulses to the at least some of the plurality of qubits further comprises applying light shifts to selected qubits of the at least some of the plurality of qubits.
In one or more embodiments, the driving the plurality of qubits into the final state comprises applying a sequence of resonant light pulses with a variable duration and a variable optical phase to at least some of the plurality of qubits.
In one or more embodiments, the arranging the plurality of qubits into the plurality of vertex qubits and the plurality of ancillary qubits comprises arranging the plurality of qubits onto a grid.
In one or more embodiments, the encoded quantum computing problem comprises one or more of an unweighted maximum independent set problem, a maximum-weight independent set problem, a maximum clique problem, and a minimum vertex cover problem.
In one or more embodiments, weights in the maximum-weight independent set problem are encoded by applying light shifts to a plurality of qubits.
In one or more embodiments, the final state of the plurality of qubits comprises one or more of a solution to the encoded unweighted maximum independent set problem, a solution to the encoded maximum-weight independent set problem, a solution to the encoded maximum clique problem, and a solution to the encoded minimum vertex cover problem.
In one or more embodiments, the method further includes renumbering at least two vertices in the quantum computing problem prior to the encoding the quantum computing problem.
In one or more embodiments, the solution to the quantum computing problem comprises an approximate solution to the quantum computing problem.
In one or more embodiments, a method includes: selectively arranging a plurality of qubits into a spatial structure to encode a quantum computing problem, wherein each qubit corresponds to a vertex in the quantum computing problem; initializing the plurality of qubits into an initial state; stroboscopically driving the plurality of qubits into a final state, wherein the final state comprises a solution to the quantum computing problem; and measuring at least some of the plurality of qubits in the final state.
In one or more embodiments, stroboscopically driving the plurality of qubits into a final state comprises applying light pulses sequentially and selectively in an order to subsets of the plurality of qubits, the order of light pulses corresponding to the graph structure of the quantum computing problem.
In one or more embodiments, the driving the plurality of qubits into the final state comprises applying light pulses with a constant or variable Rabi frequency Ω and a constant or variable detuning Δ to at least some of the plurality of qubits.
In one or more embodiments, the driving the plurality of qubits into the final state comprises applying a sequence of resonant light pulses with a variable duration and a variable optical phase to at least some of the plurality of qubits.
In one or more embodiments, the encoded quantum computing problem comprises one or more of an unweighted maximum independent set problem, a maximum-weight independent set problem, a maximum clique problem, and a minimum vertex cover problem.
In one or more embodiments, weights in the maximum-weight independent set problem are encoded by applying light shifts to a plurality of qubits.
In one or more embodiments, the final state of the plurality of qubits comprises one or more of a solution to the encoded unweighted maximum independent set problem, a solution to the encoded maximum-weight independent set problem, a solution to the encoded maximum clique problem, and a solution to the encoded minimum vertex cover.
In one or more embodiments, the method further includes renumbering at least two vertices in the quantum computing problem prior to the encoding the quantum computing problem.
In one or more embodiments, the solution to the quantum computing problem comprises an approximate solution to the quantum computing problem.
In one or more embodiments, a method includes: arranging a plurality of qubits to encode a quantum computing problem; applying a sequence of q levels of light pulses to the plurality of qubits, wherein the q levels of light pulses comprises at least a first set of q variational parameters and a second set of q variational parameters; measuring the state of one or more of the plurality of qubits; optimizing, based on the measured state of at least some of the one or more of the plurality of qubits, the first set of q variational parameters and the second set of q variational parameters of the q levels of light pulses; optimizing, based at least on the first set of q optimized variational parameters and the second set of q optimized variational parameters of q levels of light pulses, a first set of p variational parameters and a second set of p variational parameters of p levels of light pulses, wherein q<p; and measuring at least some of the plurality of qubits in a final state.
In one or more embodiments, optimizing the first set of p variational parameters and the second set of p variational parameters of p levels of light pulses further comprises computing a first set of p variational parameter starting values and a second set of p variational parameter starting values of the p levels of light pulses.
In one or more embodiments, computing of the first set of p variational parameter starting values of the p levels of light pulses, wherein p>1, comprises: performing a Fourier transform on the first set of q variational parameters of the q levels of light pulses, into a plurality of k frequency components, each of the k frequency components having an amplitude uk; and computing the first set of p variational parameter starting values of the p levels of light pulses based on the amplitudes uk;
In one or more embodiments, computing of the second set of p variational parameter starting values of the p levels of light pulses, wherein p>1, comprises: performing a Fourier transform on the second set of q variational parameters of the q levels of light pulses, into a plurality of k frequency components, each of the k frequency components having an amplitude vk; and computing the second set of p variational parameter starting values of the p levels of light pulses based on the amplitudes vk.
In one or more embodiments, computing of the first set of p variational parameter starting values and computing of the second set of p variational parameter starting values of the p levels of light pulses, comprises: extrapolating the first set of p variational parameter starting values of the p levels of light pulses based on the first set of q variational parameters of the q levels of light pulses; and extrapolating the second set of p variational parameter starting values of the p levels of light pulses based on the second set of q variational parameters of the q levels of light pulses.
In one or more embodiments, the method further includes applying a sequence of p levels of light pulses to the plurality of qubits with a first set of p optimized variational parameters and a second set of p optimized variational parameters, wherein the measuring the at least some of the plurality of qubits in the final state comprises measuring the at least some of the plurality of qubits after the applying the sequence of p levels of light pulses to the plurality of qubits.
In one or more embodiments, the encoded quantum computing problem comprises a MaxCut problem, and wherein the final state of the plurality of qubits comprises a solution to the MaxCut problem.
In one or more embodiments, the encoded quantum computing problem comprises a maximum independent set problem, and wherein the final state of the plurality of qubits comprises a solution to the maximum independent set problem.
Various objectives, features, and advantages of the disclosed subject matter can be more fully appreciated with reference to the following detailed description of the disclosed subject matter when considered in connection with the following drawings, in which like reference numerals identify like elements.
Optimization algorithms are used for finding the best solution, given a specified criterion, for a specified problem. Combinatorial optimization involves identifying an optimal solution to a problem given a finite set of solutions. Quantum optimization is a technique for solving combinatorial optimization problems by utilizing controlled dynamics of quantum many-body systems, such as a 2D array of individual atoms, each of which can be referred to as a “qubit” or “spin.” Quantum algorithms can solve combinatorially hard optimization problems by encoding such problems in the classical ground state of a programmable quantum system, such as spin models. Quantum algorithms are then designed to utilize quantum evolution in order to drive the system into this ground state, such that a subsequent measurement reveals the solution. In other words, a problem can be encoded by placing qubits in a desired arrangement with desired interactions that encode constraints set forth by the optimization problem. When properly encoded, the ground state of the many-body system comprises the solution to the optimization problem. The problem can therefore be solved by driving the many-body system through an evolutionary process into its ground state.
Without being bound by theory, assuming complete control of the interactions between the qubits, it is possible to encode nondeterministic polynomial (“NP”)-complete optimization problems into the ground states of such systems. In most realizations, however, not all interactions are fully programmable. Instead, such interactions are determined by properties of specific physical realizations, such as, but not limited to locality, geometric connectivity, or controllability, which either constrain the class of problems that can be efficiently realized or imply that substantial overhead is required for their realization. Thus, one of the challenges in understanding and assessing quantum optimization algorithms involves designing methods to encode specific and larger classes of combinatorial problems in physical systems in an efficient and natural way.
In some implementations, quantum optimization can involve: (1) encoding a problem by controlling the positions of individual qubits in a quantum system given a particular type and strength of interaction between pairs of qubits and (2) steering the dynamics of the qubits in the quantum system through an evolutionary process such that their evolved final states provide solutions to optimization problems. The steering of the dynamics of the qubits into the ground state solution to the optimization problem can be achieved via multiple different processes, such as, but not limited to the adiabatic principle in quantum annealing algorithms (QAA), or more general variational approaches, such as, but not limited to quantum approximate optimization algorithms (QAOA). Such algorithms may tackle computationally difficult problems beyond the capabilities of classical computers. However, the heuristic nature of these algorithms poses a challenge to predicting their practical performance and calls for experimental tests. In addition, such systems, in their full generality, are inefficient and difficult to implement owing to practical constraints as described above, and can only be used on a subset of optimization problems.
Some aspects of the present disclosure relate to systems and methods for arranging qubits in programmable arrays that can encode or approximately encode in an efficient way a broader set of optimization problems. In some embodiments, chains of even numbers of adjacent “ancillary” qubits are used to encode interactions between distant qubits by connecting such distant qubits with chains of ancillary qubits, for example as described in more detail with reference to
Some additional or alternative aspects of the present disclosure relate to systems and methods for coherently manipulating the internal states of qubits, including excitation. In some embodiments, techniques are disclosed that can be used to evolve an encoded problem to find an optimal (or an approximately optimal) solution. For example, embodiments of the present disclosure relate to optimal variational parameters and strategies for performing the Quantum Approximate Optimization Algorithm (“QAOA”), some embodiments of which are described, for example, with reference to
In some embodiments, particular types of optimization problems can be encoded with an arrangement of qubits. For example,
Without being bound by theory, the embodiment of
In some embodiments, a M
where a spin-½ system is assigned with states |0 and |1 to each vertex, nv=|1υ1|, Δ is the detuning on the spin, and U is the energy penalty when two spins (v, w) connected by an edge (E) are both in state |1. Initially, all vertices can be prepared in the |0 state. Driving causes at least some of the vertices to transition to the |1 state. For Δ>0, HP favors qubits to be in state |1. However, if U>Δ, it is energetically unfavorable for two qubits, u and v, to be simultaneously in state |1 if they are connected by an edge u, v∈E. Thus, each ground state of HP represents a configuration where the qubits that correspond to vertices in the maximum independent set are in state |1, and all other qubits are in state |0.
In some embodiments, a quantum annealing algorithm (“QAA”) can be used to evolve the quantum state from the initial state to the final state, which encodes the solution of the optimization problem. For example, a simple QAA can be implemented by adding a transverse field HD=EvΩσvx with σx=|01|+|10|, that induces quantum tunneling between different spin configurations.
In some embodiments, MIS problems can be implemented using Rydberg interactions between individual atoms. For example, as discussed in more detail in PCT/US18/42080, graphs like that shown in
Without being limited by theory, the Hamiltonian governing the evolution of embodiments of such a system can be represented as follows:
where Ωυ and Δυ are the Rabi frequency and laser detuning at the position {right arrow over (x)}υ of qubit v. While individual manipulation is feasible, such a system can also be implemented with a homogeneous driving laser, for example, where Ωυ=Ω and Δυ=Δ. The operator σvx=|0v1+|1v0| can give rise to coherent spin flips of qubit v and nv=|1v1| counts if the qubit v is in the Rydberg state. In some embodiments, for isotropic Rydberg states, the interatomic interaction strength depends only on the relative atomic distance, x, and is given by VRyd(x)=C/x6, where C is a constant. The strong interactions at short distances energetically prevent two qubits from being simultaneously in state |1 if they are within the Rydberg blockade radius rB=(C/√{square root over ((2Ω)2+Δ2))}1/6, as shown in
The M
Without being bound by theory, the maximum-weight independent set problem is a M
Although Rydberg interactions decay significantly beyond the Rydberg radius, there are still long-range interaction tails between distant qubits, such as 102 and 108, shown in
In some embodiments, one or more of these problems can be solved by choosing atom positions in two dimensions and laser parameters such that the low energy sector of the Rydberg Hamiltonian HRyd reduces to the (NP-complete) M
Exemplary Qubit Arrangements with Ancillary Qubits
As described with reference to
can be determined by a parameter k, proportional to the linear density of ancillary vertices along each edge. According to some embodiments, it is desirable to implement approximately the same density of ancillary qubits along any given edge to ensure that the interactions that form each vertex are roughly equal in strength.
In the example of vertex qubits 202 and 204, vertex qubit 202 can interact with the leftmost ancillary qubit 206 as if it were vertex qubit 204, and vertex qubit 204 can interact with the rightmost ancillary qubit 206 as if it were vertex qubit 202. In this way, edges can be implemented between vertex qubits outside the Rydberg radius, and in ways that cannot be implemented purely as a unit disk graph. Furthermore, whereas in unit disk graphs like that discussed with reference to
In some embodiments, when a M
In some embodiments, implementations can be expanded beyond unit disk graphs to more general graphs. Such implementations can be referred to as “stroboscopic” implementations, which can involve an arbitrary graph implemented without ancillary qubits described with reference to
According to some embodiments, it is desirable to identify the types of UD-M
Δ(t)=Δ0(2t/T−1), Equation 3:
Ω(t)=Ω0 sin2(πt/T) Equation 4:
The limit U/Ω0→∞ can be observed, where the dynamics are restricted to the independent set subspace, allowing numerical simulation of system sizes up to N˜40 qubits. Sloped dashed lines parallel to the stripe pattern correspond to optimal disk packing. The fully connected region has trivially |MIS|=1. The Landau-Zener time scale, TLZ, required for adiabaticity can be extracted by fitting numerical results to the expected long-time behavior of the ground state probability PMIS=1−ea−T/T
Several approaches can be implemented to try to overcome these potential limitations. Such approaches include heuristics to open up the gap, the use of diabatic (non-adiabatic) transitions in QAA, and variational quantum algorithms such as QAOA studied below.
As described throughout the present disclosure, it is possible to take advantage of a direct connection between the many-body problem of spins interacting via van der Waals interactions and computational complexity theory. Individual control over the positions of such spins allows for NP-complete optimization problems to be directly encoded into such quantum systems. This result can be obtained from a reduction from MIS on planar graphs with maximum degree of 3. Quantum optimizers based on the techniques described in the present disclosure, in combination with techniques to trap and manipulate neutral atoms, can address NP-hard optimization problems as an improvement over traditional computing techniques.
As discussed above, after encoding a combinatorial problem using the position of a quantum system, whether using the “ancillary” qubit technique described above or not, the next step is to evolve the system in a way that produces a ground state that is a solution to the combinatorial problem. Some examples include QAA and QAOA.
In some embodiments, QAOA can be applied to quantum optimization problems like those described in the present disclosure. For example, a QAOA (of level p) can consist of applying a sequence of p resonant pulses to all qubits (or with some detuning and energy adjustments for specific qubits as described in more detail in the present disclosure) of varying duration, tk, and optical phase, yok on an initial prepared state. This can generate a variational wavefunction ψ({tk, ϕk})=Πk=1pUk|0⊗N, where Uk=e−iH
The resulting quantum state can be measured in the computational basis and then used for optimization. For example, the variational parameters tk and φk can be optimized in a classical feedback loop from measurements of Hp in this quantum state. Examples of this optimization are described in more detail below.
The performance of QAOA depends in part on the chosen classical optimization routine. Before explaining exemplary implementations of such techniques, the performance of QAOA can be shown, which marks an improvement upon classical computation techniques in some embodiments. For example,
As explained above, QAOA is a quantum processor that prepares a quantum state according to a set of variational parameters. Using measurement outputs (such as the measured qubit state (|0) or |1) of each qubit), the parameters can then be optimized, for example via a classical computer and fed back to the quantum machine in a closed loop. In QAOA, the state is prepared by a p-level circuit specified by 2p variational parameters. In other words, each pulse has two parameters (tk, ϕk), for k from 1 to p. Even at the lowest circuit depth (p=1), QAOA has non-trivial provable performance benefits (for example, better performance than a simple classical algorithm) but cannot be efficiently simulated by classical computers.
However, very little is known about QAOA with p>1. One hurdle lies in the difficulty to efficiently optimize in the non-convex, high-dimensional parameter landscape. Without constructive approaches to perform the parameter optimization, any potential advantages of the hybrid algorithms could be lost. Furthermore, although QAOA can be shown to succeed in the p→∞ limit due to its ability to approximate adiabatic quantum annealing (i.e., the adiabatic algorithm), its performance when 1<p<<∞ is largely unexplored.
Aspects of the present disclosure detail techniques to efficiently optimize QAOA variational parameters. In some examples, given a set of qubits in a particular arrangement, QAOA proceeds by applying a series of operations to the qubits, each operation having at least two variational parameters. The evolved state of the qubits is measured, which is fed back into an optimization routine (such as a classical algorithm) to adjust the variational parameters. The process then repeats on the qubits until it is determined that the measured state of the qubits is a solution to the encoded problem or an approximation thereof. In some embodiments, techniques for classical optimization routines are disclosed. These techniques are quasi-optimal in the sense that they typically produce known global optima, and generically require 2O(p) brute-force optimization runs to surpass performance. Aspects of the present disclosure also disclose implementations of QAOA with such optimization techniques, such as an example involving a 2D physical array of a few hundred Rydberg-interacting atoms that presents potential advantages over classical algorithms for solving MaxCut problems.
As discussed above, many real-world problems can be framed as combinatorial optimization problems. In some embodiments, these are problems that can be generally defined on N-bit binary strings z=z1 . . . zN, where the goal is to determine a string that maximizes a given classical objective function C(z): {+1, −1}N≥0. An approximate optimization algorithm aims to find a string z that achieves a desired approximation ratio
where Cmax=maxz C(z).
The Quantum Approximate Optimization Algorithm (QAOA) is a quantum algorithm that can tackle these combinatorial optimization problems. To encode the problem, the classical objective function can be converted into a quantum problem Hamiltonian by promoting each binary variable zi into a quantum spin σiz:
H
C
=C(σ1z,σ2z, . . . ,σNk) Equation 5B:
For the p-level QAOA described in
|ψp({right arrow over (γ)},{right arrow over (β)})=e−iβ
which is parameterized by 2p parameters γi and βi (i=1, 2, . . . p) (one for each level in the set of p level). The expectation value HC in this variational state can be determined as follows:
H
C
=F
p({right arrow over (γ)},{right arrow over (β)})=ψp({right arrow over (γ)},{right arrow over (β)})|HC|ψp({right arrow over (γ)},{right arrow over (β)}), Equation 6B:
which can be identified by repeated measurements 1460 of the quantum system in the computational basis and taking the average (e.g., with two orthogonal spin configurations |0 and |1, which can be measured using existing measurement techniques).
Once the measurements 1460 are performed, the results (e.g., the calculated Fp determined by taking the average over many HC) can be fed back to an optimizer 1470, such as a classical computer, to search for the optimal parameters ({right arrow over (γ)}*{right arrow over (β)}*) so as to maximize Fp({right arrow over (γ)}, {right arrow over (β)}),
The performance of QAOA can, in some embodiments, be benchmarked based on the approximation ratio:
In some embodiments, r characterizes how good the solution provided by QAOA is. The higher the r value, the better the solution.
In some embodiments, without being bound by theory, this QAOA framework can be applied to general combinatorial optimization problems. In one example, an archetypical problem called MaxCut can be considered.
where an edge i, jcontributes with weight wij if and only if spins σiz and σjz are anti-aligned. For example, the curved dashed line shows a cut through all edges of spins that are anti-aligned, excluding edges w13 and w25, which are aligned.
While embodiments of the present disclosure discuss MaxCut on d-regular graphs, where every vertex is connected to exactly d other vertices, based on the present disclosure a person of skill in the art would understand that the aspects of QAOA described herein would be applicable to other types of combinatorial problems such as, but not limited to MaxCut problems on other types of graphs, Maximum Independent Set problems, and others. Two types of d-regular MaxCut graphs are considered: (1) unweighted d-regular graphs (udR), where all edges have equal weight wij=1; and (2) weighted d-regular graphs (wdR), where the weights wij are chosen uniformly at random from [0,1] (though other weights other than the interval [0,1] can be selected in other embodiments).
It is NP-hard to design an algorithm that guarantees a minimum approximation ratio of r*≥16/17 for MaxCut on all graphs, or r*≥331/332 when restricted to unweighted 3 regular graphs (“u3R”) discussed above.
According to some embodiments, QAOA presents several benefits. For certain cases, it achieves a guaranteed minimum approximation ratio when p=1. Additionally, under some reasonable complexity-theoretic assumptions, QAOA may not be efficiently simulated by any classical computer even when p=1, making a candidate algorithm for “quantum supremacy,” or the ability of a quantum computer to perform calculations that a traditional computer cannot. The square-pulse (“bang-bang”) ansatz of dynamical evolution, of which QAOA can be one example, can be optimal given a fixed quantum computation time. In general, the performance of QAOA can improve with increasing p, achieving r→1 as p→∞ since it can approximate adiabatic quantum annealing via Trotterization. This monotonicity makes it more attractive than quantum annealing, whose performance may decrease with increased run time.
While some embodiments of QAOA have a simple description, not much is currently understood beyond p=1. For the example problem of MaxCut on u2R graphs, (such as 1D antiferromagnetic rings,) QAOA may yield r≥(2p+1)/(2p+2) as determined by numerical evidence. In another example, such as Grover's unstructured search problem among n items, QAOA can find the target state with p=Θ(√{square root over (n)}), achieving the optimal query complexity within a constant factor. In some embodiments, for more general problems, a simple brute-force approach can be used by discretizing each parameter into O(poly(N)) grid points. However, this technique requires examining NO(p) possibilities at level p, which can become impractical to calculate using typical computers asp grows. Embodiments of the present disclosure therefore address efficient optimization of QAOA parameters and understanding of the algorithm for 1<<p<∞.
Some embodiments of the present disclosure relate to techniques for optimizing variational parameters. As described in more detail below, patterns in optimal parameters can be exploited to develop a heuristic optimization strategy for more quickly identifying the optimal variational parameters. In some examples described below, parameters identified for level-p QAOA can be used to more quickly optimize parameters for level-(p+1) QAOA, thereby producing a good starting point for optimization. These techniques provide improvements over brute-force techniques. In some embodiments, parameters identified for level-q QAOA, for any q<p, can be used to more quickly optimize parameters for level-p QAOA. Further, while some examples discuss randomly generated instances of u3R and w3R, similar results can be found when applying these techniques to u4R and w4R graphs, as well as complete graphs with random weights (or the Sherington-Kirkpatrick spin glass problem. These other exemplary u3R, w3R, u4R, and w4R graphs are regular graphs, meaning, for example, that each vertex has the same number of neighbors (3 or 4 respectively). The letters u and the w can refer to whether one considers unweighted or weighted graphs, respectively. In some embodiments, these graphs are useful as test samples. The patterns in the optimal parameters identified herein are used to develop example heuristic strategies that can efficiently find quasi-optimal solutions in O(poly(p)) time.
In some embodiments it is possible to eliminate degeneracies in the parameter space due to symmetries. For example, generally, QAOA can have a time-reversal symmetry, Fp({right arrow over (γ)}, {right arrow over (β)})=Fp(−{right arrow over (γ)}, −{right arrow over (β)}), since both HB and HC are real-valued. For QAOA applied to MaxCut, there can be an additional Z2 symmetry, as e−i(π/2)H
in general, and
for udR graphs.
In some embodiments, it is possible to numerically investigate the optimal QAOA parameters for MaxCut on random u3R and w3R graphs with vertex number 8≤N≤22, using a brute-force approach. For each graph instance and a given level p, a random starting point (seed) in the parameter space can be chosen and a gradient-based optimization algorithm such as Broyden-Fletcher-Goldfarb-Shanno (“BFGS”) can be used to find a local optimum ({right arrow over (γ)}L, {right arrow over (β)}L) starting with this seed. In some embodiments, a local optimum can refer to a case where for a given choice of parameters β and γ, the result always gets worse if the values of the parameters β and γ are changed slightly. However, for such local optima, the result might get better if the values of these parameters are instead changed drastically. Thus the optimum can be referred to as local optimum, not global optimum. In some embodiments, for each graph, it is possible to optimize the variational parameters to cause the solution to go to a local optimum by a local optimization method, such as one where the optimization only searches parameters close to the initial starting parameters. This local optimization can be repeated with sufficiently many different seeds to find the global optimum ({right arrow over (γ)}*, {right arrow over (β)}*). In some embodiments, a global optimum may refer to the case where there is not a better choice of parameters. The global optimum can change from graph to graph. The degeneracies of the optimal parameters ({right arrow over (γ)}*, {right arrow over (β)}*) can be reduced using the symmetries discussed above (e.g., by finding some (distinct) values of γ and β that are equivalent because they lead to exactly the same result and thus do not need to be considered individually). In some illustrative examples, the global optimum can be non-degenerate up to these symmetries.
In some embodiments, the process of identifying optimal parameters ({right arrow over (γ)}*, {right arrow over (β)}*) can be repeated for additional random graphs, such as 100 u3R and w3R graphs with various vertex numbers N, which can draw out one or more patterns in the optimal parameters ({right arrow over (γ)}*, {right arrow over (β)}*). In one example, the optimal γi* can tend to increase smoothly with each iteration i=1, 2, . . . , p, while βi* can tend to decrease smoothly. For example,
Notably, even at small depth, this parameter pattern can be reminiscent of adiabatic quantum annealing where HC is gradually turned on while HB is gradually turned off, in some embodiments. However, the mechanism of QAOA can be shown to go beyond the adiabatic principle, as discussed in more detail below. In addition, in some embodiments, the optimal parameters can have a small spread over many different instances. This can be because the objective function Fp({right arrow over (γ)}, {right arrow over (β)}) can be a sum of terms corresponding to subgraphs involving vertices that are a distance ≤p away from every edge. At small p, there are only a few relevant subgraph types that enter into Fp and can effectively determine the optimal parameters. As N→∞ and at a fixed finite p, the probability of a relevant subgraph type appearing in a random graph can approach a fixed fraction. This implies that the distribution of optimal parameters ({right arrow over (γ)}*, {right arrow over (β)}*) can converge to a fixed set of values in this limit.
In some embodiments, the optimal parameter patterns observed above can indicate that generically, there is a slowly varying continuous curve that underlies the parameters γi* and βi*. In some embodiments, this curve changes only slightly from each level p to p+1. Based on these observations, a new parameterization of QAOA can be used, as well as a heuristic optimization strategy that can, without limitation, be referred to as “FOURIER.” In some embodiments, the heuristic strategy uses information from the optimal parameters at level p to help optimization at level p+1 by producing good starting points. While this heuristic does not necessarily find the global optimum of QAOA parameters, it can produce, in O(poly(p)) time, quasi-optima that can only be surpassed with 2O(p) number of brute-force runs. In some beneficial embodiments, this facilitates evaluation of the performance and mechanism of QAOA beyond p=1.
In some embodiments, QAOA can be re-parameterized based on the observation that the optimal QAOA parameters γi* and βi* appear to be smooth functions of their index i. An alternative representation of the QAOA parameters can be implemented using Fourier-related transforms of real numbers, which permits reductions in the number of necessary parameters. Instead of using the 2p parameters ({right arrow over (γ)}*, {right arrow over (β)}*)∈2p, FOURIER can instead use 2q parameters ({right arrow over (u)}, {right arrow over (v)})∈2q, where the individual elements γi and βi are written as functions of ({right arrow over (u)}, {right arrow over (v)}) through the following transformation:
In some embodiments, these transformations can be referred to as Discrete Sine/Cosine Transforms, where uk and vk can be interpreted as the amplitude of k-th frequency component for {right arrow over (γ)} and {right arrow over (β)}, respectively. When q≥p this new parametrization can describe all possible QAOA protocols at level p.
Embodiments of the FOURIER strategy work by starting with level p=1, optimizing using an optimization function such as brute force for level p=1, and then using the optimum at level p to determine a starting point for level p+1. The starting points can be generated by re-using the optimized amplitudes ({right arrow over (u)}*, {right arrow over (v)}*) of frequency components from level p extrapolated from the optimized parameters ({right arrow over (γ)}*, {right arrow over (β)}*) to identify the parameters ({right arrow over (γ)}*, {right arrow over (β)}*) for the level p+1. This can be repeated for increasing p.
Some embodiments include several variants of this strategy, examples of which are referred to as FOURIER[q, R] and INTERP, for optimizing p-level QAOA. Without limitation, embodiments of one of variants can be referred to as FOURIER[q, R], characterized by two integer parameters q and R. The first integer q can refer to the maximum frequency component allowed in the parameters ({right arrow over (u)}, {right arrow over (v)}), which can be the maximum value of k. If q=p, the full power of p-level QAOA can be utilized. However, since the smoothness of the optimal parameters ({right arrow over (γ)}*, {right arrow over (β)}*) implies that only the low-frequency components are important, it is also possible to consider the case where q is a fixed constant independent of, but smaller than p, so the number of parameters is bounded even as the QAOA circuit depth increases.
In some embodiments, the second integer R can refer to the number of controlled random perturbations added to the parameters to escape a local optimum towards a better one. For example, where the optimization parameters ({right arrow over (γ)}*, {right arrow over (β)}*) were identified at a local but not global optimum for the initial value p=1, perturbations can be introduced to avoid focusing only on that local optimum for increased values of p. Exemplary results discussed in the present disclosure implement the FOURIER[q, R] strategy with q=p and R=10 unless otherwise stated, but such as selection is not limiting.
In embodiments where q is chosen such that q=p, the strategy can be denoted as FOURIER[∞, R], since q grows unbounded with p. In embodiments of the FOURIER[∞, 0] variant of this strategy, a starting point is generated for level p+1 by adding a higher frequency component, initialized at zero amplitude, to the optimum at level p. For example, as shown in
{right arrow over (u)}
(p)
0
={right arrow over (u)}
(p−1)
L,0), {right arrow over (v)}(p)0={right arrow over (v)}(p−)L,0). Equation 12:
Using ({right arrow over (u)}(p)0, {right arrow over (v)}(p)0) 2222A as a starting point, a BFGS optimization routine can be performed to obtain a local optimum 2224B ({right arrow over (u)}(p)L, {right arrow over (v)}(p)L) for the level p. This is output to the next level of p, as the process continues.
In some embodiments, improvements over this technique can be gained with the strategy, FOURIER[∞, R>0], which is also shown in
As shown in
where {right arrow over (u)}(p−1)P,r and {right arrow over (v)}(p−1)P,r contain random numbers drawn from normal distributions with mean 0 and variance given by {right arrow over (u)}(p−1)B and {right arrow over (v)}(p−1)B:
[{right arrow over (u)}(p−1)P,r]k˜Normal(0,[{right arrow over (u)}(p−1)B]k2), Equation 15:
[{right arrow over (v)}(p−1)P,r]k˜Normal(0,[{right arrow over (v)}(p−1)B]k2), Equation 16:
In such embodiments, there is a free parameter a corresponding to the strength of the perturbation. In some non-limiting examples derived from trial and error, setting α=0.6 can yield good results. This choice of α is assumed in the present disclosure unless otherwise stated. However, a person of skill in the art would understand that other values of α can be used, including dynamic values of α as needed depending on particular implementations.
In some embodiments, additional strategies can be used to take advantages of the parameter pattern indicated above. One exemplary strategy can use linear interpolation of optimal parameters at lower level QAOA to generate starting points for higher levels, which can be referred to without limitation as “INTERP.” Both INTERP and FOURIER strategies are effective for the instances discussed throughout the present disclosure and are applicable to others as well. While FOURIER has demonstrated a slight edge in its performance in finding better optima when random perturbations are introduced, a person of skill in the art would understand from the present disclosure that INTERP is also an efficient way of improving QAOA and can present additional benefits. Embodiments of FOURIER[q, R] and INTERP are described below in more detail. However, additional techniques are contemplated by the present disclosure, such as the use of machine learning. Furthermore, although in aspects of the present disclosure, the heuristic strategies makes use of optimal variational parameters found at level-(p−1) QAOA to find initial variational parameters at level-p QAOA, a person of skill in the art would understand that optimal variational parameters found at level-m, for any map, can be used to design initial variational parameters at level-p QAOA.
In some variants of the FOURIER strategy, the number of frequency components q is fixed. These variants can be treated similarly to the strategies where q=p discussed above, except all {right arrow over (u)} and {right arrow over (v)} parameters can be truncated at the first q components. For example, when optimizing QAOA at level p≥q with the FOURIER[q,0] strategy, no further higher frequency components are added, and the starting point begins at {right arrow over (u)}(p)0={right arrow over (u)}(p−1)L∈q.
In some embodiments of the optimization strategy referred to as INTERP, linear interpolation can be used to produce a starting point for optimizing QAOA and an optimization routine can iteratively increase the level p. However, for purposes of the present discussion, p should be considered the same as p−1 in the discussion for the FOURIER strategy, as this is simply a matter of semantics for where to begin the algorithm. In some embodiments, this is based on the observation that the shape of parameters ({right arrow over (γ)}(p+1)*, {right arrow over (β)}(p+1)*) closely resembles that of ({right arrow over (γ)}(p)*, {right arrow over (β)}(p)*). For a given instance, QAOA is iteratively optimized by starting from p=1, obtaining local optimum parameters {right arrow over (γ)}(p)L, {right arrow over (β)}(p)L), and incrementing p to p+1. To optimize parameters for level p+1, optimized parameters from level p are used to produce starting points ({right arrow over (γ)}(p+1)0, {right arrow over (β)}(p+1)0) according to:
for i=1, 2, . . . , p+1, where i denotes the i-th element of the vector. Here, [{right arrow over (γ)}i]≡γi denotes the i-th element of the parameter vector {right arrow over (γ)}, and [{right arrow over (γ)}(p)L]0≡[{right arrow over (γ)}(p)L]p+1≡0* The expression for {right arrow over (β)}(p+1)0 can be the same as above after swapping γ↔β. Starting from ({right arrow over (γ)}(p+1)0, {right arrow over (β)}(p+1)0), the BFGS optimization routine (or any other optimization routine) can be performed to obtain a local optimum ({right arrow over (γ)}(p+1)L, {right arrow over (β)}(p+1)L) for the (p+1)-level QAOA. Finally, p can be incremented by one and the same process can be repeated until a target level is reached.
The INTERP strategy can also get stuck in a local optimum in some embodiments. Adding perturbations to INTERP can help but may not be as effective in some embodiments as with FOURIER This may occur because the optimal parameters are smooth, and adding perturbations in the ({right arrow over (u)}, {right arrow over (v)})-space modify ({right arrow over (γ)}, {right arrow over (β)}) in a correlated way, which can enable the optimization to escape local optima more easily. However, a similar perturbation routine is contemplated.
As discussed above, the heuristic approaches described in the present disclosure constitute a significant improvement over brute force QAOA techniques. Non-limiting comparisons of example implementations are discussed below in the sections below.
Based on the present disclosure, a person of skill in the art would understand that the disclosed heuristic strategies could be implemented on a number of technical platforms. In the section below titled “Example QAOA Implementations” the MaxCut problem is considered as an example, although it can also be applied to solve other interesting problems.
Example Implementations with Quantum Systems
In some embodiments, large-size problems are suitable for implementation on quantum systems. Two aspects of such implementations (reducing the interaction range and examples with Rydberg atoms) are discussed in more detail below.
First, with regard to reducing the interaction range, in some quantum implementations, as discussed above, each vertex can be represented by a qubit. For a large problem size, a major challenge to encode general graphs is the necessary range and versatility of the interaction patterns (between qubits). The embedding of a random graph into a physical implementation with a 1D or 2D geometry may require very long-range interactions. By re-labelling the graph vertices, it is possible reduce the required range of interactions. Without being bound by theory, this can be formulated as the graph bandwidth problem: Given a graph G=(V, E) with N vertices, a vertex numbering is a bijective map from vertices to distinct integers, f:V→{1, 2, . . . , N}. The bandwidth of a vertex numbering f is, Bf (G)=max{|f(u)−f(v)|: (u, v)∈E (G)} which can be understood as the length of the longest edge (in 1D). The graph bandwidth problem is then to find the minimum bandwidth among all vertex numberings, i.e., B(G)=minfBf (G); namely, it is to minimize the length of the longest edge by vertex renumbering.
In general, finding the minimum graph bandwidth is NP-hard, but good heuristic algorithms have been developed to reduce the graph bandwidth.
In some embodiments, a general construction can be used to encode any long-range interactions to local fields by including additional physical qubits and gauge constraints. It is also possible to restrict to special graphs that exhibit some geometric structures. For example, unit disk graphs are geometric graphs in the 2D plane, where vertices are connected by an edge only if they are within a unit distance. These graphs can be encoded into 2D physical implementations, and the MaxCut problem is still NP-hard on unit disk graphs.
In some embodiments, the above discussion of QAOA has been platform independent, and is applicable to any state-of-the-art platforms. Exemplary platforms include neutral Rydberg atoms, trapped ions, and superconducting qubits. Although the following discussion focuses on an implementation of QAOA with neutral atoms interacting via Rydberg excitations, where high-fidelity entanglement has been recently demonstrated, other implementations are contemplated.
In some embodiments, it is possible to implement control over the interaction between individual atoms according to a given graph. In some embodiments, in this way it is possible to control which types of problems are to be solved. Since the interactions can specify the problem, controlling the interactions is one way to control the problem. As discussed in more detail above, in an exemplary Rydberg implementation, the hyperfine ground states in each atom can be used to encode the qubit states |0 and |1, and the |1 state can be excited to the Rydberg level |r to induce interactions. The qubit rotating term, exp (−iβΣj=1Nσjx) can be implemented by a global driving beam with tunable durations. The interaction terms σizσjz; can be implemented stroboscopically for general graphs; this can be realized by a Rydberg-blockade controlled gate, as illustrated in
By controlling the coupling strength Ω, detuning Δ, and the gate time, together with single-qubit rotations, it is possible to implement exp(−iγΓizσjz), which can then be repeated for each interacting pair. In this way, it is possible to choose which pairs should interact, and thus control which problem to solve. In some embodiments, an additional advantage of the Rydberg-blockade mechanism is its ability to perform multi-qubit collective gates in parallel. This can reduce the number of two-qubit operation steps from the number of edges to the number of vertices, N, which means a factor of N reduction for dense graphs with ˜N2 edges. While the falloff of Rydberg interactions may limit the distance two qubits can interact, MaxCut problems of interesting sizes can still be implemented by vertex renumbering or focusing on unit disk graphs, as discussed above. Furthermore, implementing ancillary vertices discussed in the present disclosure can be used to increase the length of interactions.
Without being bound by theory, in some embodiments, for generic problems of 400-vertex regular graphs, the interaction range can be on the order of 5 atoms in 2D. This can be determined by assuming a minimum inter-atom separation of 2 μm, which means an interaction radius of 10 μm, which is realizable with high Rydberg levels. Given examples of coupling strength Ω˜2π×10-100 MHz and single-qubit coherence time τ˜200 μs (limited by Rydberg level lifetime), with high-fidelity control, the error per two-qubit gate can be made roughly (Ωτ)−1˜10−3-10−4. For 400-vertex 3-regular graphs, QAOA of level p≅Ωτ/N˜25 can be implemented with a 2D array of neutral atoms. Advanced control techniques such as pulse-shaping would increase the capabilities of QAOA in such systems. Furthermore, QAOA may not be sensitive to some of the imperfections present in existing implementations with Rydberg atoms.
The following sections explore additional examples and embodiments of the present disclosure. The present disclosure is not limited by the theory described herein, which is merely meant for illustration of some aspects of operational principles that underly some embodiments of the present disclosure.
In some embodiments, quantum optimization algorithms such as quantum annealing algorithm (QAA) can be used for random UD-M
As discussed above, a QAA for MIS can be performed using the following Hamiltonian:
The QAA can be designed by first initializing all qubits in |0 at time t=0, which is the ground state of HQA(t=0) when Δ(t=0)<0 and Ω(t=0)=0 (with U>0). The parameters can then be changed, for example by first turning on Ω(t) to a non-zero value, sweeping Δ(t) to a positive value, and finally turning off Ω(t) again. Without being bound by theory, the exemplary annealing protocol discussed throughout the present disclosure can be specified by
Δ(t)=(2s−1)Δ0, Ω(t)=Ω0 sin2(πs) with s=t/T. Equation 19:
If the time evolution is sufficiently slow, then by the adiabatic theorem, the system can follow the instantaneous ground state, ending up in the solution to the MIS problem. Ω0=1 can be considered the unit of energy, and it is possible to fix Δ0/Ω0=6, which in non-limiting examples has been identified as a good ratio to minimize nonadiabatic transitions.
In some embodiments, quantum annealing can be explored on random unit disk graphs, with N vertices and density ρ. In some embodiments, in the limit of Δ0, Ω0<<U, the non-independent sets are pushed away by large energy penalties and can be neglected. In some implementations, this can correspond to the limit where the Rydberg interaction energy is much stronger than other energy scales. Without being bound by theory, in some examples in this limit, the wavefunction can be restricted to the subspace of all independent sets, such as:
IS
={|
:n
v
n
w|ψ=0 for any (v,w)∈E} Equation 20:
in the exemplary numerical simulation discussed herein, which allows for access to a much bigger system size up to N˜50 since dim(HIS)<<2N. First, in an example, the subspace of all independent sets can be found by a classical algorithm, the Bron-Kerbosch algorithm, and the Hamiltonian in equation 18 can then be projected into the subspace of all independent sets. The dynamics with the time-dependent Hamiltonian can be simulated by dividing the total simulation time t into sufficiently small discrete time steps τ and at each small-time step, a scaling and squaring method with a truncated Taylor series approximation can be used to perform the time evolution without forming the full evolution operators.
In some embodiments, exemplary time scales for adiabatic quantum annealing to perform well can be explored. In some examples, this time scale can be governed by the minimum spectral gap, ∈gap, where the runtime required can be T=O(1/ϵgap2). However, the minimum spectral gap can be considered to be ambiguous when the final ground state is highly degenerate, since it is possible for the state to couple to an instantaneous excited state as long as it comes down to the ground state in the end. For an example generic graph, there can be many distinct maximum independent sets (the ground state of HP can be highly degenerate). So instead of finding the minimum gap, a different approach can be taken to extract the adiabatic time scale.
In some embodiments, in the adiabatic limit, the final ground state population (including degeneracy) can take the form of the Landau-Zener formula PMIS≈1−ea−T/T
The fitting has been shown in some examples to be effective for most instances. For example,
While some embodiments discussed above focused mainly on the capacity of exemplary algorithms to solve MIS exactly, it is also possible to identify whether the algorithms can solve MIS approximately, for example in the sense of finding an independent set as large as possible. For some exemplary quantum algorithms, without being bound by theory, the approximation ratio r can be used to gauge performance for approximations. For example, for a quantum algorithm (such as a quantum annealer) that outputs a state |ψf, it is possible to write r=Σiψf|ni|ψf/|MIS|, where |MIS| is the size of the MIS. In other words, r can quantify the ratio of the average independent-set size found by measuring the output quantum state to the maximum independent-set size.
As discussed above, it is possible to generalize the exemplary implementation discussed above to address MIS problems on graphs, G=(V, E) that are beyond the UD paradigm.
In some embodiments, the quantum algorithms discussed above with reference to the UD paradigm involve evolution with a Hamiltonian H(t)=ΣvΩv(t)σvx−Δ(t)nv+Σ(u,v)ϵEUnunv. In exemplary embodiments where U>>|Ω|,|Δ|, the dynamics can be effectively restricted to the independent set space HIS. Without being bound by theory, in some embodiments to generate such evolution with a Hamiltonian corresponding to a general graph structure, a Trotterized version of the time evolution operator can be considered:
where the time interval [0, T] is sliced defining times t1 such that Σjtj=T and tj+1−tj<<√{square root over (Dmax)}Ω(tj),|Δ(tj)|. Here Dmax can denote the maximum degree of the graph. Each U(tj) can be further Trotterized as follows
In other words, it can be split into a product of terms v that each are associated with the evolution of one spin, v. Here (v) can denote the neighbors of v on the graph. Note that in the U→∞ limit this can be written as
This is a simple single qubit rotation of atom v, conditioned in some embodiments on the state of the atoms corresponding to neighbors of v being |0. If at least one of the neighbors is in state |1, atom v may not evolve.
In some embodiments it is desirable to add further control of a quantum system in order to specify interactions between particular pairs or subsets of qubits. One exemplary approach to implement the corresponding dynamics with individually controlled neutral atoms can take advantage of qubit states |0 and |1 encoded in two (non-interacting) hyperfine states, in the internal atomic ground state manifold. In other words, when implementing other forms of graphs, it is possible to iterate through desired interactions between individual pairs (or sets) of qubits.
In some embodiments, the atoms can be positioned on the points of a 2D square lattice with distance g. To realize a single step, Uv(tj), the atoms, u, can be excited that correspond to neighbors of v on the graph (u∈(v)), selectively from the state |1 to a Rydberg S-state |1′. In some embodiments, a grid length g<<rB can be chosen such that none of the atoms u∈(v) interact during this process. Atom v can then be driven to realize the single qubit rotation in the hyperfine manifold, for example with a unitary, such as, but not limited to, a physical operation that can be performed deterministically such as driving the atom with a laser to cause a quantum state evolution corresponding to an evolution with Ωv(tj)σvx−Δv(tj)nv, where σvx couples the two hyperfine qubit states of atom v, and nv counts if atom v is in hyperfine state |1. In some embodiments, to realize this rotation, an individually addressed two-step excitation can be used that can couple the two hyperfine states |0 and |1 of atom v via a transition through a Rydberg P-state (for example, by first exciting the atom from the state |0 to a Rydberg P state with a first laser pulse, and then changing its state from the Rydberg P state to the state |1 with a second laser pulse). In some embodiments, if all atoms u are in the state |0, then this process may not be disturbed, but if at least one of the neighbors is the Rydberg S state, the strong S-P interaction can give rise to a blockade mechanism that prevents the rotation of the qubit v, thus realizing exactly equation 23. Note that in some embodiments this may require a different scale of interaction length of the blockade radius for two atoms in the Rydberg S-states on one hand, and the S-P blockade radius on the other hand because these two interactions scale differently with separation of the atoms: the S-P interactions decay as 1/x3 (much slower than the S-S interactions that scale like 1/x6), which makes it possible to implement collective gate with high fidelity. Accordingly, it is possible to apply quantum optimization to MIS problems on graphs that are more general than the unit disk graphs.
Without being bound by theory, this section addresses some aspects of M
which is a subclass of HP(2). This problem can be proved to be NP-complete by reducing it from M
Without being bound by theory, this theorem can be proven as follows: (1) M
In some embodiments, this theorem shows that it is NP-complete to decide whether the ground state energy of HUD is lower than −a′Δ. In some embodiments, the transformation in the proof of this theorem does not fully determine the actual positions of the ancillary vertices in the 2D plane. In some embodiments, a particular arrangement can be specified consistent with the requirements of this transformation. Once Rydberg interactions are considered, the interaction strength between each pair of qubits can be fixed in a way that takes into account the distance of the atoms.
Given an edge {u, v} of the graph embedded on a grid, the length of this edge can be denoted by g×u,v with u,v an integer, and g the grid unit length, in some embodiments. First, an ancillary vertex can be placed on the u,v−1 grid point along the edge, separating the edge in u,v segments of length g. An integer k≥3 can be selected and equally spaced 2 k ancillary vertices can be placed along each segment, dividing it into 2 k+1 pieces of equal length d=g/(2 k+1). In some embodiments, irregular vertices can be used to ensure that the number of ancillary spins on each edge is even. In some embodiments, however, the total number of spins on each edge might be different. If u,v is even, one segment can be chosen and the 4ϕ+2 ancillary vertices close to the center of this segment can be replaced with 4ϕ+1 ancillary vertices, that are equally spaced by a distance
for some integer ϕ to be determined. Such segments can be referred to as “irregular segments”, and to the vertex at the center of the irregular segment can be referred to as an irregular vertex. In some embodiments, these exceptions are made to ensure that the total number of ancillary vertices along each edge {u, v}ϵε, 2ku,v, is even in order to ensure that the ancillary vertices transfer the independent set constraint without changing the nature of the problem to be solved. Following this arrangement, the nearest-neighbor distance of the ancillary vertices can be either d or D. Setting the unit disk radius to r=D+0+ can produce the unit disk graph G. The positions of the vertices can be labelled by √{square root over (x)}v. In some embodiments, arrangement depends on the freely chosen parameters k and ϕ. Accordingly, as described throughout the present disclosure, a hard problem can be transformed into an MIS problem on an arrangement of vertices that form a unit disk graph.
Without being bound by theory, a few properties of the maximum independent set on the types of unit disk graphs constructed in this way can be noted, according to some embodiments. First, in some embodiments, the maximum independent set on G is in general degenerate, even if the maximum independent set on is unique. The properties of a particular set of MIS-states on G can be considered, such as MIS-states on G, , that coincide with MIS-states on , , on the vertices V. Without being bound by theory, to see that such states can always exist, can be constructed from as follows: given the state of two qubits corresponding to two vertices, u, vϵV, if u is in the state |0 and v is in state |1 (or vice versa) then the 2ku,v ancillary vertices can be placed on the edge connecting u and v in an (antiferromagnetically) ordered state, where the qubits are alternating in states |1 and |0. In the other case, if u and v are both in states |0, the ancillary vertices can be placed in an analogously ordered state. In embodiments of this latter case, a domain wall, such as an instance where two neighboring qubits are both in the state |0 can be introduced. The position of this domain wall along the edge is irrelevant. In both cases half of the 2ku,v ancillary vertices along the edge are in state |1. Therefore, by the above theorem, the state constructed from by applying this process on all edges of , is a MIS-state on G.
Without being bound by theory, the structure of these MIS-states around points where edges meet under a 90° angle can be further specified, according to some embodiments. Such points can be either junctions, where 3 edges meet at a vertex, such as point 216 in
Without being bound by theory, to illustrate applications of the above reduction to implementations that employ Rydberg interactions, a simple model can be implemented to explain aspects of some implementations. This model can be used to show some aspects and benefits of embodiments of the present disclosure, including, but not limited to treatment of special vertices. For example, a Hamiltonian similar to HUD, the MIS-Hamiltonian for UD graphs, can be considered with the introduction of interactions beyond the unit disk radius. Similar to the situation in the Rydberg system, these additional interactions can cause energy shifts that can result in a change of the ground state, thus invalidating the encoding of the M
Without being bound by theory, embodiments of the model can be expressed as:
with interactions given by:
where W<U. For W=0 and Δv=Δ, Hmodel can reduce to the Hamiltonian HUD described in Equation 25. For W>0 it includes interactions beyond the unit disk radius, r, and can thus be considered as a first approximation to the Rydberg Hamiltonian.
In some embodiments, in the case of a qubit arrangement described in the section titled “Maximum Independent Sets for Unit Disk Graphs” corresponding to a unit disk graph, G, and the case √{square root over (2r)}<R<2r, most qubits interact only with their neighbors on G, with an exception being qubits that are close to corners (such as qubit 202 in
In some embodiments, it is desirable to find a detuning pattern, Δv, such that the MIS-state of HUD remains the ground state of Hmodel, even at finite W. In other words, it is desirable to find a detuning pattern that renders the MIS solution of the graph more energetically favorable (i.e., coupling it to the ground state of the system) than other solution of the graph. For the relevant qubit arrangements, the interactions of the HUD and Hmodel differ only around corners and junctions, in some embodiments. Thus, Δv can be set to Δv=Δ everywhere (e.g., for all qubits) except at these structures, which can be considered individually and separately.
In some embodiments, with reference to corners as shown in
In some embodiments, with reference to
In some embodiments, by using the detuning patterns described above, the actions of HUD and Hmodel are, for non-limiting theoretical purposes, identical for at least one MIS-state. In addition, some embodiments ensure that the chosen detunings do not lower the energy of any other configurations. Therefore, a ground state of Hmodel is a ground state of HUD, encoding an MIS problem on the corresponding unit disk graph such that the ground state is a solution to the MIS problem.
Without being bound by theory, in some embodiments the detuning model described above can be applied to the case of the Rydberg Hamiltonian, thereby showing that it is NP-complete to decide whether the ground state energy of HRyd is below a given threshold, when Ωv=0, where the atoms can be positioned arbitrarily in at least two dimensions. While the main idea is similar, the infinite ranged Rydberg interactions warrant more explanation.
As described above, it is possible to apply a detuning pattern that separates length and interaction scales, in some embodiments. This is possible in part because the Rydberg interactions decay fast, such that interactions between qubits that are close can be separated from the interactions between qubits that are far apart on the graph G. For example, the interactions between distant qubits can be neglected, if k is chosen large enough. In such embodiments, individual substructures of the system can be treated separately for purposes of theory.
In some embodiments, the low energy sector of the Rydberg Hamiltonian can be mapped to a much simpler effective spin model. For example, clusters of qubits can be addressed with specific detuning patterns such that only two configurations are relevant for each cluster. In some embodiments, the resulting, effective pseudo-spins are described by a MIS Hamiltonian. This makes it possible to encode M
First, with regard to separation of length scales, the Rydberg interactions can decay as a power law with distance, and thus do not define a length scale where the interactions cease to exist. Nevertheless, in some embodiments, an exemplary qubit arrangement introduces two length scales: (1) the closest distance between two qubits, d, and (2) the grid length, g. These length scales are separated by a factor, g=d (2 k+1), that can be chosen arbitrarily in the transformation of the planar graph, , to the unit disk graph, G. In some embodiments, with such implementations, interactions between qubits that are “close” can be separated from interactions between qubits that are “distant”, for example when k is selected to be sufficiently large, as described in the present disclosure.
The closest qubits in the system are separated by a distance d and thus interact with a strength of =C/d6. This defines a convenient unit of energy, , which depends on the choice of the parameter k specifying the transformation from to G.
Without being bound by theory, two qubits can be described as “distant” if their x or y coordinate differs by at least g (the grid length). The interaction energy of a single qubit with all distant qubits can be upper bounded by:
In some embodiments, this bound can be derived by considering a system where a qubit can be placed in state |1 on each possible qubit position in the 2D plane (i.e. along all grid lines). Edist is the interaction energy of a single qubit in such a system with all other qubits that are distant in the above sense. This is an upper bound to the maximum interaction energy of a single qubit with all distant qubits on arbitrary graphs, that itself can be upper bounded by
where ζ(x) is the Riemann zeta function. Since in some embodiments, d/g=1/(2 k+1), the interaction energy of a spin with all distant spins can scale as Edist=(/k5). The total number of spins can scale as |V|˜(k|V|), as the grid representation of the original planar graph in the plane can be done with (|V|) area. Thus, in some embodiments, the contribution of interactions between all pairs of spins that are distant with respect to each other can be bounded by Edist|V|˜(V|/k4). In some embodiments, k can be chosen large enough such that these contributions can be neglected. In such embodiments, the ground state does not change due to the long range interactions. Note that the required k can scale polynomially with the problem size |V|.
As described in more detail below, without being bound by theory, an effective spin model can be developed in order to show that for each planar graph =(V, ε) of maximum degree 3, an arrangement of (k|V)=(|V|5/4) atoms can be identified, and detuning patterns applied such that the ground state of the corresponding Rydberg Hamiltonian directly reveals a maximum independent set of . Furthermore, without being bound by theory, it can be seen that it is NP-complete to decide whether the ground state energy of HRyd(Ωv=0) is lower than some threshold. In addition, without being bound by theory, it is possible to treat certain arrangements as pseudo-spins to simplify nonlimiting discussion of aspects of some embodiments.
In some embodiments, similar to the model discussed above, interactions beyond the unit disk radius (e.g., interaction tails) can be problematic. For example, longer range interactions can cause the ground state of systems with encoded MIS problems described above to differ from MIS solutions. These interactions can be more prominent close to vertices of degree 3, such as at vertex qubit 216 in
In some embodiments, as discussed throughout the present disclosure, to address these complications length and interaction scales can be separated to isolate these structures and control them individually. For example, for some graphs discussed in the present disclosure, various “special” vertices can be defined. These are special vertices can correspond to those that that either have integer grid coordinates or are irregular vertices. Special vertices thus include all vertices V of , but also a subset of vertices introduced to transform to the UD graph G.
Without being bound by theory, due to the separation of length and interaction scales described throughout the present disclosure, interactions between two spins, u and v, are only relevant if they belong to the same set, u, v∈Ai (or u,v∈Bi,j, or if they belong to adjacent sets, uϵAi and v∈Bi,j. All other interaction terms will give contributions that vanish like (|V|/k4). Without being bound by theory, the total energy can be written as
Where, EX=Ev∈X−Δvnv+Σw>v∈XVRyd(|{right arrow over (x)}w−{right arrow over (x)}v|)nwnv gives the energy of the system when only spins in X are taken into account, and EX,Y=Σu∈XΣv∈Y VRyd(|{right arrow over (x)}u−{right arrow over (x)}v|)nunv quantifies the interaction energy between spins in two regions X and Y. The sum =Σi runs over i and j, such that Ai is connected to Aj, by a non-empty set Bi,j, and the factor ½ compensates for double counting.
It is possible to apply an appropriate choice of the detunings, Δv, such that only a few configurations of spins in regions Ai and Bi,j are relevant to construct the ground state of the entire system, in some embodiments. In some such configurations it is possible to consider only a few configurations as candidates for the ground state, rather than exponentially many. For example, as discussed in more detail below, in such embodiments only two configurations of spins in the regions Ai shown in
which yields:
E
B
=(si(1−sj)+(1−si)sj)EB
where note EB
In some embodiments, the total energy in the relevant configuration sector containing the ground state of HRyd can be given by an effective spin model for the pseudo-spins:
where the sum runs over neighboring pairs of pseudospins (without double-counting). Effective detunings Δieff (e.g., detunings that have a similar effect on pseudospins as detunings for normal spins) can be introduced and pseudo-spin interactions Ui,jeff can be given by:
In some embodiments, detuning patterns, Δv, can be chosen such that the effective pseudo-spin detunings and interactions are homogeneous, Δieff=Δeff and Ui,jeff=Ueff, and satisfy 0<ΔeffUeff. In some embodiments, when these parameters are chosen such that the above equations are satisfied the problem of the Rydberg Hamiltonian can be treated as equivalent to the MIS problem. To ensure that the long-range correction is bounded by some small constant n<<Δeff, k can be selected such that k≥((|V|/η1/4)). ξ can be computed based on the chosen detuning pattern.
This effective spin model in equation 32 can be related back to the original MIS problem on g. Without being bound by theory, embodiments of this effective spin model corresponds exactly to a MIS problem on a graph eff obtained from =(V, ε) by replacing each edge {u,v}ϵε by a string of an even number 2Ku,v of vertices as discussed in more detail with reference to
Therefore, for each planar graph =(V, ε) of maximum degree 3, an arrangement of (k∥V∥)=(|V|5/4) atoms can be identified, and detuning patterns applied such that the ground state of the corresponding Rydberg Hamiltonian directly reveals a maximum independent set of g. Thus, without being bound by theory, it can be seen that it is NP-complete to decide whether the ground state energy of HRyd(Ωv=0) is lower than some threshold. First, this question is in NP since the ground state of HRyd(Ωv=0) has a classical description. In addition, for reduction from the NP-complete problem of deciding whether the size of MIS on , a planar graph of maximum degree 3, is ≥a. Let a′=a+Σ{u,v}∈εKu,v. If the size of MIS of is ≥a, then the ground state energy of HRyd is ≤−a′Δeff+ξ+Θ. If the size of MIS of is ≤a−1, then the ground state energy of HRyd is ≥(a′−1)Δeff+ξ−η. In some embodiments, by choosing η<Δeff/2, the MIS of has size ≥a if and only if the ground state energy of HRyd is ≤−(a′−½)Δeff+ξ.
Without being bound by theory, embodiments of aspects of the ground state of the Rydberg Hamiltonian are described below. Example aspects of the Rydberg Hamiltonian show that at lower energy, the ground state corresponds to an MIS solution to an encoded problem. Thus, in example implementations, if a problem is properly encoded, transitioning the corresponding qubits into lower energy states can result in the system in the ground state which corresponds to the solution to the encoded problem. Without being bound by theory, discussion of such transitions can be aided by referencing the “pseudo-spins” discussed in the previous section.
In some embodiments, the ground state of HRyd corresponds to a maximal independent set on the associated UD graph if the detuning of each spin is selected according to the techniques described herein. Maximal independent sets refer to independent sets where no vertex can be added without violating the independence property (e.g., that no two neighboring vertices can both be in the state |1). The largest maximal independent set is the maximum independent set. For configurations corresponding to such sets, no two neighboring spins can be in state |1 (independence), and no spin can be in state |0 if all its neighbors are in state |0 (maximality).
In some embodiments of the Rydberg Hamiltonian, no two neighboring vertices on G can be simultaneously in state |1 if the system is in the ground state of HRyd as long as the detuning Δv on all vertices obeys:
where D is the maximal Euclidean distance between two vertices that are neighboring on G.
The first term corresponds to the maximum interaction of spin v with spins on the same grid line (such as qubits 1304, 1310 in
The configuration shown in
Therefore, in some embodiments, the configuration of spins in the ground state of the Rydberg Hamiltonian corresponds to a maximal independent set on the associated UD graph (edge between nearest neighbor) if, C/D6≥Δv≥0.268031×C/d6.
The spins on each edge ε of the graph are in an ordered configuration, such as a configuration where spins are alternating in state |0 and |1, up to so called domain walls, where two (but not more) neighboring spins are in state |0. A MIS configuration on G has at most one such domain wall on any array of spins connecting two special vertices i and j such as 1314, 1324 in
This bound can be understood with reference to
Δv≥Δmin≡0.490084× Equation 38:
Therefore, for Δmin<Δv<Δmax, the ground state of the Rydberg Hamiltonian is ordered on all segments connecting two special vertices, with at most one domain wall per segment. This justifies the assumption made with reference to
Below, embodiments of special structures Ai consisting of a vertex i and its 2q neighbors on each leg are described in more detail. The discussion can be restricted to states that are ordered in Ai with at most one domain wall per leg. Detuning patterns can be selected for the spins in Ai such that any such domain wall is energetically pushed away from spin, i, out of the structure Ai. Without being bound by theory, with such a detuning only the two ordered states on Ai have to be considered for the ground state.
A corner vertex and the spins on the two legs, which can be collectively treated as a pseudo-spin, are considered first, such as those shown in embodiments of
In some embodiments, if the corner spin is in state |0, a domain wall can be formed where on one of the legs the two spins at distance 2p−2 and 2p−1 from the corner are in state |0. The domain wall can be moved by one unit (i.e. p→p+1) by flipping the state of the spins 2p−1 and 2p. The interaction energy increases in this process. Its amount can be bounded by F0(p)+(/k5) with
Without being bound by theory, this bound can be understood as follows. Effectively, the one spin excitation is moved by one site towards the corner. While the interaction energy of this excitation with all spins on the same leg is reduced (such as where 2p<k), and thus is upper bounded by zero, the interaction energy with all spins on the other leg increases. Since any defect on this leg would decrease this interaction energy, the energy can be maximized if all spins on the other leg are in the perfectly ordered state, which gives the bound in equation 39.
While the interaction energy can increase in this process, the contribution from the single particle term to the total energy changes by Δ2p−1−Δ2p. Depending on the choice of the detuning, this can lead to an energy gain such that it is energetically favorable to move the domain wall by one unit away from the corner spin. Thus, the energy is minimized if the first 2q spins on each of the legs are in the perfectly ordered state if the corner spin is in state |0 and the detuning for spins satisfy for p=1, 2 . . . , q:
Δ2p−1−Δ2p≥F0(p), Equation 40:
Similarly, if the corner spin is in state |1, the energy is minimized if the first 2q+1 spins on each of the leg are in the perfectly ordered state if detunings satisfy:
The conditions in equations 40, 41 can be satisfied by the choice (for p=1, . . . , q):
In some embodiments, these sums are convergent and can be efficiently evaluated numerically. Δx is monotonically decreasing with x and for large x approaches Δ∞+(/x5). Moreover, the maximum value of Δx in this sequence can be evaluated to Δ1=Δ∞+0.134682×. It is therefore possible to choose Δ∞ such that the detuning along the legs of the corner are within the required range.
With the above detuning pattern and the choice Δ∞≥ΔB, the ground state configuration is necessarily such that the 4q+1 spins forming a corner structure are in one of the two ordered states. These states can be labeled as s=0 and s=1 according to the corresponding state of the corner spin (|0 and |1).
Note that the choice in equations 43, 44 fixes the detuning on all spins except the detuning of the corner spin, denoted Δc. This makes it possible to tune the relative energy between the two relevant configurations, EA
Similarly, without being bound by theory, the difference in energy (DC) of the two configurations due to the single particle term can be calculated as:
For the corner structures, EA
E
A
1
−E
A
0=−ΔC+0.143797×+(/q5) Equation 47:
which can be fully tuned by the detuning of the corner spin. Thus, in some embodiments, corner structures can be treated as having only two configurations.
In some embodiments, junctions, such as that shown in
First, with reference to central spin, such as the degree-3 vertex 1304 in
In the other case, i.e. if the central vertex is in state |1, similar bounds can be found. If the two vertices on leg X at a distance 2p−1 and 2p from the central vertex are in state |0, the interaction energy will be increased when the states of vertices 2p and 2p+1 are flipped, i.e. when the domain wall is pushed away from the center by one unit. The interaction energy required to push the domain wall on leg X by one unit can therefore be bounded by F1(p). This bound can be valid independently of the configuration of the spins on legs Y and Z. The interaction energy required to push a domain wall on leg Y by one unit from the corner can be bounded by 2F1(p).
Therefore, in some embodiments, the state that minimizes the energy may not contain any domain wall on the first 2q+1 spins on each leg if the detuning pattern satisfies
Δ2p−1(X)−Δ2p(X)≥F0(p), Equation 48:
Δ2p−1(Y)−Δ2p(Y)≥F0(p), Equation 49:
Δ2p−1(X)−Δ2p(X)≥F1(p), Equation 50:
Δ2p−1(Y)−Δ2p(Y)≥F1(p), Equation 51:
for p=1, 2, . . . q, and Δv(z)=Δv(x). Here Δv(σ) denotes the detuning of the v-th spin on leg σ. Without being bound by theory, this can be achieved by the following:
Where Δv(z)=Δv(x). For the choice above, the maximum value of Δv(σ) in this sequence can evaluate to Δ1(Y)=Δ∞, +0.269364×. Thus, all the detunings on the legs of a junction Ai are within the range [Δ∞, Δ∞+0.269364×]. It is therefore possible to choose Δ∞ such that all of them are in the allowed range, Δmin<Δv<Δmax. With the additional choice Δ∞>ΔB, it can be seen that, in some embodiments, only the two ordered configurations are relevant for the ground state.
Analogous to the situation of corner structures, the detuning of the junction spin, denoted ΔJ can be used to tune the relative energy between the two relevant configurations of the junction. Without being bound by theory, the difference in interaction energies of the two spin configurations in this structure can be given by
which can be evaluated to IJ=0.218387×+(/q5). Similarly, the difference in energy of the two configurations due to the single particle terms can be calculated as
which can be evaluated to J=ΔJ−0.474188×+(/q5). Without being bound by theory, the quantity EA
E
A
1
−E
A
0=−ΔJ+0.255801×+(/q5). Equation 58:
Without being bound by theory, some embodiments include other special vertices described above in addition to vertices at corners and junctions. These are vertices of degree 1 (open ends, such as A12 in
Open ends. In some embodiments, energy can be lowered if a domain wall is moved away from a spin at the end of an open leg if the detuning is constant for the spins on that leg. Therefore, the pseudo-spin states can be restricted to the two ordered configurations on the 2q spins adjacent to the spin at the end of an open leg. Without being bound by theory, the energy difference between the two states of such an open leg can be denoted by:
where ΔO is the detuning for the spin corresponding to the vertex of degree 1. The homogenous detuning on the 2q spins adjacent to the latter can be chosen to be equal to Δ∞.
Straight structures. In some embodiments, for a regular special vertex, the relevant configurations for the ground state can be restricted to two ordered states by choosing a detuning for all 4q spins on the leg as Δ∞>ΔB. With this choice it would be energetically favorable to move a potential domain wall into the adjacent neighboring regions. Without being bound by theory, the energy difference can be denoted by:
where ΔS denotes the detuning of the special vertex.
Irregular structures. In some embodiments, irregular vertices can be treated identically to straight structures. Since the spacing of the spins close to the irregular vertex is slightly larger than elsewhere (e.g., because irregular structures can be defined as structures where the spacing between vertices is larger than in ordinary structures to ensure that the number of ancillary vertices on each edge is even), any domain wall will be pushed away from the irregular structure naturally, if the detuning in the irregular structure is larger than ΔB. Thus, only the two ordered configurations are relevant for the ground state. The corresponding energy difference can be numerically evaluated for every choice of ϕ (which can indicate how ancillary vertices are positioned). In the large ϕ (and q) limit, it is possible to obtain the same analytic expression as in equation 60.
In some embodiments, it is possible to determine the effective detuning Δeff of the pseudo-spins (such as the straight segments, corners, junctions, and other special vertices described above that can be described as having a limited set of states in the ground state) and their effective interaction energies Ui,jeff. In some embodiments, knowing the effective detunings and effective interactions of pseudospins allows for encoding of the problem to be effectively solved.
In some embodiments and without being bound by theory, the discussion above with reference to straight segments, corners, junctions, and other special vertices can rely on the concept that only four spin configurations are relevant in each region Bi,j to describe the ground state. These correspond to the four possible configurations of the spins in the adjacent regions Ai and Aj. For example, if the detuning in Bi,j is set to be homogeneous, ΔB, if si=1 and sj=0 (or vice versa), the lowest energy configuration should correspond to the perfectly ordered state on Bi,j (with bi,j spins in state |1, with b vertices in the set B), if Δmax>ΔB>Δmin. Any other configuration would require at least two domain walls. Second, if si=sj=0 then energy can be lowered by arranging the spins in Bi,j in an ordered configuration with bi,j spins in state |1, and one domain wall. The position of this domain wall does not change the energy up to (/k5), such that there is no need to distinguish between the different domain wall configurations. Finally, if si=sj=1 the lowest energy configuration is similarly achieved by an ordered configuration with one domain wall, if Δmax>ΔB>Δmin. While the position of the domain wall can be irrelevant, only bi,j−1 spins are in state |1 in this state.
Without being bound by theory, in some embodiments, the relevant energy differences between these different relevant configurations can be readily calculated, as
If Ai and Aj are not connected, this can be zero. Else, this term becomes independent of i and j in the large q and k limit, since bi,j=(k), and evaluates to
E
B
1,0
−E
B
0,0=0.0146637×+(/k5) Equation 62:
Similarly, EB1,1−EB1,0 can be calculated as
Thus, in some embodiments the effective interaction between pseudo-spins Ui,jeff=Ueff=EB1,1+EB0,0 can be written as
U
eff=ΔB−0.0134313×+(/k5). Equation 64:
In such embodiments, the effective interaction depends only on the choice of the detuning in the connecting structures, ΔB, thereby allowing for control of the effective interactions for specifying problems.
In some embodiments, the effective detuning for a pseudo spin can be given by Δieff=EA
ΔC=Δeff+0.173124× Equation 65:
ΔJ=Δeff+0.299792× Equation 66:
ΔO=Δeff+0.0305597× Equation 67:
ΔS=Δeff+0.0452234×. Equation 68:
This is compatible with Δmax≥Δv≥≤Δmin and the realization of an effective spin model with 0<Δeff<Ueff. In some embodiments, where this inequality is satisfied, the solution of the effective model can coincide with the solution of the hard problem to be solved.
In some embodiments, additional considerations are relevant to implementing the QAOA techniques described in the present disclosure. The framework of QAOA is general, however, and can be applied to various technical platforms to solve combinatorial optimization problems.
In some embodiments, quantum fluctuations (such as projection noise) can result in finite precision since the precision can be obtained via averaging over finitely many measurement outcomes that can only take on discrete values. Hence, there can be a trade-off between measurement cost and optimization quality: finding a good optimum can require good precision at the cost of a large number of measurements. Additionally, large variance in the objective function value can demand more measurements but may help improve the chances of finding near-optimal MaxCut configurations.
Without being bound by theory, as an example, the effect of measurement projection noise can be demonstrated with a full Monte-Carlo simulation of QAOA on some example graphs, where an objective function is evaluated by repeated projective measurements until its error is below a threshold. Exemplary implementation details of this numerical simulation are discussed in the section titled “Exemplary simulation with measurement projection noise.”
As shown in
While this exemplary simulation is limited to small-size instances, the good performance of QAOA and QA can be complicated by the small but significant ground state population from generic annealing schedules. Since it can take 102 measurements to obtain a sufficiently precise estimate of the objective function, a ground state probability of {tilde under (≤)}10−2 would mean that one can find the ground state without much parameter optimization. In some embodiments, for larger problem sizes with 102˜103 qubits, the ground state probability from generic QAOA/QA protocol without optimization can decrease exponentially with system size, whereas the measurement cost of optimization grows merely polynomially with the problem size. The results here indicate that the parameter pattern and the disclosed heuristic strategies are practically useful guidelines in realistic implementation of QAOA that leverages optimization to significantly increase the probability of finding the ground state.
Implementing a solution to the MaxCut problems with quantum machines can be limited by quantum coherence time and graph connectivity. In some embodiments, in terms of coherence time, QAOA is highly advantageous: the hybrid nature of QAOA as well as its short- and intermediate-depth circuit parametrization makes it useful for quantum devices. In addition, QAOA is not generally limited by the small spectral gaps, which demonstrates that interesting problems can be solved (or at least approximated) within the coherence time.
According to some embodiments, non-limiting comparisons of exemplary heuristic strategies to exemplary brute-force approaches can be evaluated. For example,
To estimate the number of brute-force runs needed to find an optimum with the same or better approximation ratio as the exemplary FOURIER heuristics, brute-force optimization was performed on 40 instances of 16-vertex u3R and w3R graphs with up to 40000 runs, as shown in
Although most examples discussed in the present disclosure use gradient-based methods (BFGS) in numerical simulations, non-gradient based approaches, such as the Nelder-Mead method, can be used with the disclosed heuristic strategies. The choice to use gradient-based optimization can be motivated by the simulation speed, which in some implementations is faster with gradient-based optimization. In other embodiments, other procedures can be used.
Using embodiments of the disclosed heuristic optimization strategies in hand, the performance of intermediate p-level QAOA on many graph instances can be examined. For example, it is possible to consider many randomly generated instances u3R and w3R graphs with vertex number 8≤N≤22 and use embodiments of the disclosed FOURIER strategy to find optimal QAOA parameters for level p≤20. In the following discussion, the fractional error 1-r is used to assess the performance of QAOA.
In one example, the case of unweighted graphs, specifically u3R graphs can be considered. For example,
for weighted graphs. Insets show the dependence of the fit parameters p0 on the system size N.
As shown in
As shown in
according to some embodiments. In some embodiments, the stretched-exponential scaling is true in the average sense, while individual instances have very different behavior. For easy instances whose corresponding minimum gaps Δmin are large, exponential scaling of the fractional error can be found. For more difficult instances whose minimum gaps are small, fractional errors reach plateaus at intermediate p, before decreasing further when p is increased. These hard instances are discussed in more detail in the section below titled “Adiabatic Mechanisms, Quantum Annealing, and QAOA.” Notably, when averaged over randomly generated instances, the fractional error is fitted remarkably well by a stretched-exponential function.
These results of average performance of embodiments of QAOA are notable despite considerations of finite-size effect. While the decay constant p0 does appear to depend on the system size N as shown in the insets of
The difference in the performance among embodiments of the various heuristic strategies proposed in the present disclosure can be examined on an example instance of 14-vertex w3R graph. As shown in
The previous section discussed the performance of embodiments of QAOA for MaxCut on random graph instances in terms of the approximation ratio r. Although useful for approximate optimization, QAOA is often able to find the MaxCut configuration—the global optimum of the problem—with a high probability as level p increases. In this section, the efficiencies of example embodiments of the disclosed algorithms are shown to find the MaxCut configuration and compare it with quantum annealing. In some embodiments, QAOA is not necessarily limited by the minimum gap in the quantum annealing and explain a mechanism at work that allows it to overcome the adiabatic limitations.
Comparing QAOA with Quantum Annealing
A predecessor of QAOA, quantum annealing (QA) can be used for solving combinatorial optimization problems. Without being bound by theory, to find the MaxCut configuration that maximizes HC, the following simple QA protocol can be considered:
H
QA(s)=−[sHC+(1−s)HB], s=t/T, Equation 69:
where t∈[0, T] and T is the total annealing time. The initial state can be prepared to be the ground state of HQA(s=0), i.e., |ψ(0)=|+⊗N. The ground state of the final Hamiltonian, HQA(s=1), can correspond to the solution of the MaxCut problem encoded in HC. In adiabatic QA, the algorithm can rely on the adiabatic theorem to remain in the instantaneous ground state along the annealing path and solves the computational problem by finding the ground state at the end. To ensure a high probability of success, the run time of the algorithm typically scales as T=O(1/Δmin2), where Δmin is the minimum spectral gap. Consequently, adiabatic QA becomes inefficient for instances where Δmin is extremely small. These graph instances can be referred to as hard instances for adiabatic QA.
In some embodiments beyond the completely adiabatic regime, there is often a tradeoff between the success probability (ground state population pGS(T)) and the run time (annealing time T): either algorithm can be run with a long annealing time to obtain a high success probability, or it can be run multiple times at a shorter time to find the solution at least once. One metric that can be used to determine the best balance can be referred to as the time-to-solution (TTS):
TTSQA(T) can measure the time required to find the ground state at least once with the target probability pd (taken to be 99% in the present disclosure), neglecting non-algorithmic overheads such as state-preparation and measurement time. The adiabatic regime where ln[1−pGS(T)]∝TΔmin2 per Landau-Zener formula can yield TTSQA∝1/Δmin2 which is independent of T In some cases, it can be better to run QA non-adiabatically to obtain a shorter TTS. By choosing the best annealing time T, regardless of adiabaticity, TTSQAopt can be determined as the minimum algorithmic run time of QA. Without being bound by theory, a similar non-limiting metric can be defined for QAOA for purposes of benchmarking. The variational parameters γi* and βi* can be regarded as the time evolved under the Hamiltonians HC and HB, respectively. The sum of the variational parameters can be interpreted to be the total “annealing” time such that Tp=Σi=1p(|γi*|+|βi*|), and TTSQAOA(p) and TTSQAOAopt can be defined as:
where pGS(p) is the ground state population after the optimal p-level QAOA protocol, in some embodiments. This quantity need not take into account the overhead in finding the optimal parameters. TTSQAOA(p) can be used to benchmark the algorithm but should not be taken directly to be the actual experimental run time.
To compare examples of the algorithms, TTSQAopt and TTSQAOAopt can be computed for many random graph instances. For each even vertex number from N=10 to N=18, 1000 instances of w3R graphs are generated.
Similarly, for QA, the optimal annealing time T need not be in the adiabatic limit for small-gap graphs.
To understand the behavior of QAOA, graph instances that are hard for adiabatic QA can be addressed in more detail. For example, a representative instance is used to explain how embodiments of QAOA as well as diabatic QA can overcome the adiabatic limitations. As illustrated in 18A, QAOA can learn to utilize diabatic transitions at anti-crossings to circumvent difficulties caused by very small gaps in QA.
In some embodiments, it is also possible to simulate QAOA on this hard instance. As mentioned above, although QAOA can optimizes energy instead of ground state overlap, substantial ground state population can still be obtained even for many hard graphs. Using an exemplary embodiment of the disclosed FOURIER heuristic strategy, various low-energy state populations of the output state are shown for different levels p shown in
Without being bound by theory, to better understand the mechanism of embodiments of QAOA and make a comparison with QA, the QAOA parameters can be interpreted as a smooth annealing path. The sum of the variational parameters can be interpreted to be the total annealing time, i.e., Tp=Σi=1p(|γi|+|βi|), as discussed above. A smooth annealing path can be constructed from QAOA optimal parameters as
where ti can be chosen to be at the mid-point of each time interval (γi*+βi*). With the boundary conditions f(t=0)=0, f(t=Tp)=1 and linear interpolation at other intermediate time t, QAOA parameters can be converted to a well-defined annealing path. This conversion can be applied to the QAOA optimal parameters at p=40, as shown in
Without being bound by theory, these results indicate that QAOA is closely related to a cleverly optimized diabatic QA path that can overcome limitations set by the adiabatic theorem. Through optimization, QAOA can find a good annealing path and exploit diabatic transitions to enhance ground state population. This explains the observation in
The effective dynamics of QAOA for these exemplary specific instances, as shown in
Without being bound by theory, this section elucidates the mechanism of the diabatic bump discussed above with reference to
H
QA(t)|∈l(t)=∈l(t)|∈l(t), Equation 76:
Expanding the time-evolved state in this basis as |ψ(t)=Σlαl (t)|∈l (t), the Schrodinger equation can be written as
where ℏ=1 and the time dependence in the notations is dropped for convenience. Multiplying the equation by ∈k|, the Schrodinger equation becomes
where ∈k|{dot over (∈)}k)=0 is taken by absorbing the phase into the eigenvector |∈k. Written in a matrix form, it can be seen that
where the ground state energy is ∈0=0 (by absorbing it into the phase of the coefficients) and Δi0=∈i−∈0 is the instantaneous energy gap from the ith excited state to the ground state. The time evolution starts from the initial ground state with α0=1 and αi=0 for i≠0, and the adiabatic condition to prevent coupling to excited states is
The first equality can be derived from equation 76. This can produce the standard adiabatic condition T=O(1/Δmin2). As discussed above, the minimum gap for some graphs can be exceedingly small, so the adiabatic limit may not be practical. However, is possible to choose an appropriate run time T, which breaks adiabaticity, but is long enough such that only few excited states are effectively involved in the dynamics. This is the regime where the diabatic bump operates and one can understand the dynamics by truncating equation 79 to the first few basis states.
As an example,
In the previous sections, an example representative graph instance where the adiabatic minimum gap is small was considered with reference to
Details of Simulation with Measurement Projection Noise
When running QAOA on actual quantum devices, the objective function can be evaluated by averaging over many measurement outcomes, and consequently its precision can be limited by the so-called measurement projection noise from quantum fluctuations, in some embodiments. This effect can be accounted for by performing full Monte-Carlo simulations of actual measurements, where the simulated quantum processor only outputs approximate values of the objective function obtained by averaging M measurements:
where zp,i is a random variable corresponding to the ith measurement outcome obtained by measuring |ψp({right arrow over (γ)}, {right arrow over (β)}) in the computational basis, and C(z) is the classical objective function. Note when M→∞,
In some embodiments, it can be expected that M≈Var(Fp)ξ2. To address issues that can appear with finite sample sizes, at least 10 measurements are performed (M≥10) for each objective function evaluation.
Regarding the classical optimization algorithm used to optimize QAOA parameters, generally, classical optimization algorithms iteratively use information from some given parameter point ({right arrow over (γ)}, {right arrow over (β)}) to find a new parameter point ({right arrow over (γ)}′, {right arrow over (β)}′) that produces a larger value of the objective function Fp({right arrow over (γ)}′, {right arrow over (β)}′)≥Fp({right arrow over (γ)}, {right arrow over (β)}). In order for the algorithm to terminate, some stopping criteria can be set. In some embodiments, up to two can be used: First, an objective function tolerance ∈ can be set, such that if the change in objective function |
Using the approach described above, it is possible to simulate experiments of optimizing QAOA with measurement projection noise for a few example instances, with various choices of precision parameters (∈, ξ, δ) and starting points. For the example representative instance studied in
{right arrow over (u)}
0=(1.9212,0.2891,0.1601,0.0564,0.0292) Equation 83:
{right arrow over (v)}
0=(0.6055,−0.0178,0.0431,−0.0061,0.0141) Equation 84:
at level p=5. For each such run, the history of all the measurements can be tracked so that the largest cut Cuti found after the i-th measurement can be calculated. Each experiment is repeated 500 times with different pseudo-random number generation seeds, and an average over their histories is taken.
In some embodiments, a number of techniques can be exploited to speed up the numerical simulation for both QAOA and QA.
For example, first, the symmetries present in the Hamiltonian can be used. For MaxCut on general graphs, the only symmetry operator that commutes with both HC and HB is the parity operator P=Πi=1Nσix:[HC, P]=[HB, P]=0, and so does [HQA(s), P]=0, where HQA(s) is the quantum annealing Hamiltonian equation 69. The parity operator can have two eigenvalues, +1 and −1, each with half of the entire Hilbert space. The initial state for both QAOA and QA are in the positive sector, i.e., P|+⊗N=|+⊗N. Thus, any dynamics must remain in the positive parity sector. HC and HB can be rewritten in the basis of the eigenvectors of P, and the Hilbert space reduced from 2N to 2N−1 by working in the positive parity sector.
For QA, dynamics involving the time-dependent Hamiltonian can be simulated by dividing the total simulation time T into sufficiently small discrete time T and implementing each time step sequentially. At each small step, it is possible to evolve the state without forming the full evolution operator, either using the Krylov subspace projection method or a truncated Taylor series approximation. In the simulations discussed herein, a scaling and squaring method is used with a truncated Taylor series approximation as it appears to run slightly faster than the Krylov subspace method for small time steps.
For QAOA, the dynamics can be implemented in a more efficient way due to the special form of the operators HC and HB, in some embodiments. Work can be performed in the standard σz basis. Thus,
can be written as a diagonal matrix and the action of e−i
Therefore, the action of e−iβH
This section discusses exemplary techniques to simulate the Quantum Approximate Optimization Algorithm to solve Maximum Independent Set Problems, according to some embodiments.
Without being bound by theory, in some embodiments, given a problem to find MIS on a given a graph G=(V, E), the p-level QAOA for MIS can be a variational algorithm consisting of the following steps:
(i) Initialization of the quantum state in |ψ0=|0⊗N.
(ii) Preparation of variational wavefunction:
where HP=Σv∈V−Δnv+Σ(v,w)∈EUnvnw, and HQ=Σv∈VΩσvx+Σ(v,w)∈EUnvnw. The parameters {right arrow over (γ)}∈p-1 and β∈p, specify the variational state.
(iii) Measurement of HP.
The three steps (i)-(iii) can be iterated and combined with a classical optimization of the variational parameters in order to minimize ψp({right arrow over (γ)}, {right arrow over (β)})|HP|ωp({right arrow over (γ)}, {right arrow over (β)}).
In some embodiments, where U>>|Ω|, |Δ|, the variational search can be restricted to the subspace IS spanned by independent sets, such that the algorithm does not need to explore states that can be directly excluded as MIS candidates. In this limit, it is possible to write:
where PIS is a projector onto the independent set subspace IS. Evolution with HP can reduce to simple rotation of individual spins around the z axis. Since
it is possible to commute all the unitaries generated by HP in Equation 86 to the rightmost side until they act (trivially) on the initial state. Thus, it is possible to rewrite the state |ψp ({right arrow over (γ)}, {right arrow over (β)}) as
|ψp({right arrow over (γ)},{right arrow over (β)})=Πk=1pexp(−itkΩΣvPIS(|0v1|e−iϕk+h.c.)PIS)|ψ0, Equation 90:
where the following can be identified
Thus, some embodiments of the formulation of QAOA given above can be equivalent to equation 90 for U>>Ω.
Aspects of the present disclosure show that quantum algorithms can be implemented for solving computationally hard problems with coherent quantum optimizers with minimal resources and implementation overhead, which can include, but is not limited to the number of ancillary qubits needed, additional depth of the quantum circuits needed, etc.
Aspects of the present disclosure show that NP-complete combinatorial optimization problems can be encoded exactly in quantum systems even considering the unwanted interactions between the qubits.
Aspects of the present disclosure show that quantum algorithms can be implemented by applying light pulses with a variable duration and a variable optical phase to at least some of the plurality of qubits.
Aspects of the present disclosure show that heuristic optimization strategies can find quasi-optimal variational parameters in variational quantum algorithms in O(poly(p)) time without the 2O(p) resources required by brute force approaches using many initial guesses of the parameters.
Aspects of the present disclosure show that quantum approximate optimization algorithms utilize non-adiabatic mechanisms to overcome the challenges associated with vanishing spectral gaps.
Aspects of the present disclosure show that vertex renumbering of the combinatorial optimization problem permits quantum systems to implement a broader class of problem instances.
This application claims the benefit of priority to U.S. Provisional Application No. 62/725,874, entitled “QUANTUM OPTIMIZATION FOR MAXIMUM INDEPENDENT SET USING RYDBERG ATOM ARRAYS,” filed on Aug. 31, 2018, the disclosure of which is hereby incorporated by reference in its entirety.
This invention was made with government support under Grant Nos. 1506284, PHY-1125846, and PHY-1521560 awarded by the National Science Foundation; FA9550-17-1-0002 awarded by the U.S. Air Force Office of Scientific Research; and N00014-15-1-2846 awarded by the U.S. Department of Defense/Office of Navy Research. The government has certain rights in the invention.
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/US19/49115 | 8/30/2019 | WO | 00 |
Number | Date | Country | |
---|---|---|---|
62725874 | Aug 2018 | US |