In this disclosure, quantum algorithms are presented for simulating Hamiltonian time-evolution e−i(A+B)t in the interaction picture of quantum mechanics on a quantum computer. The interaction picture is a known analytical tool for separating dynamical effects due to trivial free-evolution A from those due to interactions B. This is especially useful when the energy-scale of the trivial component is dominant, but of little interest. Whereas state-of-art simulation algorithms scale with the energy ∥A∥+∥B∥ of the full Hamiltonian, embodiments of the disclosed approach generally scale with only the low-energy component ∥B∥. Applied to simulating a periodic N-site Hubbard model with arbitrary long-ranged density-density interactions, the disclosed gate complexity (N2t) is a quadratic improvement in N over prior art for electronic structure simulation in the plane-wave basis. More generally in the abstract query model, diagonally-dominant Hamiltonians with sparsity d can be simulated with cost independent of the diagonal component. The various contributions disclosed herein include a rigorous analysis of a low-space overhead simulation algorithm based on the Dyson series for general time-dependent Hamiltonians, which also enables a quadratic improvement in sparsity for time-dependent simulation with
(td) queries.
In some embodiments of the disclosed technology, a quantum computer is configured to simulate a quantum system, and a Hamiltonian in the simulation is represented in the interaction picture. The simulation of the quantum system is then performed using the quantum computer. In certain implementations, the simulation of the quantum system is a subroutine that is repeated two or more times. In some implementations, the simulation is performed using linear combinations of unitaries. For instance, in some cases, the simulation uses linear combinations of unitaries performed on a diagonally dominant matrix (e.g., using linear combinations of unitaries performed on the diagonally dominant components of the diagonally dominant matrix). In certain implementations, the quantum system is modelled by a Hubbard model. In particular implementations, the quantum system describes a physical chemical system or molecule. In some implementations, the Hamiltonian is sparse and the simulation uses the a state of an auxiliary qubit to encode the matrix elements of the Hamiltonian instead of using graph decomposition techniques. In certain implementations, the simulation is performed by compressing ancillas for quantum simulation of a time-dependent Hamiltonian.
In further embodiments, a quantum algorithm is implemented on a quantum computer for simulating a general sparse time-dependent quantum system. In this embodiment, the quantum algorithm does not use graph decomposition techniques. In certain implementations, a quantum Hamiltonian used in the simulation is represented in the interaction picture. In some implementations, the simulation uses linear combinations of unitaries performed on a diagonally dominant matrix. In particular implementations, an interaction Hamiltonian is chosen to be a diagonal matrix. In certain implementations, the quantum system is modelled by a Hubbard model, a physical chemical system, or a molecule. In further implementations, the simulation includes compressing ancillas use to index the time for the simulation.
The foregoing and other objects, features, and advantages of the disclosed technology will become more apparent from the following detailed description, which proceeds with reference to the accompanying figures.
0|α⊗Is)Uk(|0
α⊗Is)=Hk, controlled on number state |k
b, k∈{0,1, . . . , K}.
As used in this application, the singular forms “a,” “an,” and “the” include the plural forms unless the context clearly dictates otherwise. Additionally, the term “includes” means “comprises.” Further, the term “coupled” does not exclude the presence of intermediate elements between the coupled items. Further, as used herein, the term “and/or” means any one item or combination of any items in the phrase.
Although the operations of some of the disclosed methods are described in a particular, sequential order for convenient presentation, it should be understood that this manner of description encompasses rearrangement, unless a particular ordering is required by specific language set forth below. For example, operations described sequentially may in some cases be rearranged or performed concurrently. Moreover, for the sake of simplicity, the attached figures may not show the various ways in which the disclosed systems, methods, and apparatus can be used in conjunction with other systems, methods, and apparatus. Additionally, the description sometimes uses terms like “produce” and “provide” to describe the disclosed methods. These terms are high-level abstractions of the actual operations that are performed. The actual operations that correspond to these terms will vary depending on the particular implementation and are readily discernible by one of ordinary skill in the art.
Many-body dynamics are often described by a Hamiltonian H=A+B with two non-commuting parts: a well-understood but trivial free-particle theory A, and a more interesting component B describing interactions. One often finds a large separation of energy scales ∥A∥»∥B∥ in nature. For instance, the ultra-violet cutoff of a free scalar field is usually assumed to be significantly higher than the energy scale of its interactions. In practical applications, the energy contribution of electron-electron correlations in molecules and materials are typically larger than chemical accuracy, at 4 kJ mol−1, but still much smaller than the Hartree-Fock component.
Dynamical effects arising from the interaction are often the subject of interest. Whereas the quantum state-vector |φI(t) evolves under the full Hamiltonian by the Schrödinger equation i∂t|φ(t)
=(A+B)|φ(t)
, the interaction picture moves to the rotating frame of the trivial dynamics. This frame allows one to focus on effects of the interaction, and is particularly fruitful when interactions are perturbative corrections to the free-theory. For instance, an analytic evaluation of the time-ordered propagator of the interaction picture Hamiltonian HI(t)=eiAtBe−iAt is possible by perturbative expansions based on Green's functions and Feynman diagrams. However, such techniques fail for interactions that are non-perturbative, even if they are weak relative to the free theory. In such situations, one resorts to numerical simulations.
Numerical simulations of quantum dynamics scale exponentially with particle number, so the interaction picture provides little advantage here. One then relies on non-perturbative numerical approximations such as density-functional theory or Monte Carlo path-integral evaluation. Though highly successful in elucidating qualitative behavior, obtaining accurate quantitative predictions to high precision is difficult. For instance, achieving chemical accuracy in ab-initio quantum chemistry calculations of reaction rates often only possible by exact diagonalization of the Hamiltonian. This has an exponentially growing cost (n) with the system size n. (The standard big-
notation defines f(n)=
(g(n)) for positive functions f(n),g(n)>0 as the existence of absolute constants α>0,b>0 such that for any n>α, f(n)≤bg(n). One can also use f(n)=
(g(n)) when f(n)≤bg(n) polylog(g(n)), and f(n)=Ω(g(n)) when f(n)≥bg(n), and f(n)=⊖(g(n)) when both f(n)=
(g(n)) and f(n)=Ω(g(n)) are true.)
One approach that is tractable and non-perturbative is simulating quantum dynamics on a quantum computer. Given a (poly(n))-sized description of a time-independent Hamiltonian H, one devises a sequence of
(poly(n)) primitive quantum gates that approximates the unitary time-evolution operator e−iHt with error ϵ. (Primitive gate costs are defined as the number N of arbitrary single- and two-qubit gates. In fault-tolerant architectures, one multiples by an additional overhead
(log (N/ϵ)) for approximating N such gates to total error ϵ using a universal discrete gate set, e.g. CLIFFORD+T.) This is then applied to the system state |φ(0)
encoded in
(poly(n)) logical quantum bits to obtain the time-evolved state |φ(t)
=e−iHt|φ(0)
. Algorithms for Hamiltonian simulation have progressed rapidly culminating recently in schemes with optimal query complexity
(nt+log (1/ϵ)) with respect to all parameters for generic sparse Hamiltonians, and algorithms for geometrically-local exponentially-decaying interactions, with gate complexity
(nt polylog(nt/ϵ)) that is optimal up to poly-logarithmic factors.
It would be desirable if the cost of simulating time-evolution scaled with energy of the smaller interaction term, say (t∥B∥). As the interaction picture Hamiltonian has norm ∥HI(t)∥=∥B∥, this possibility is highly suggestive. Nevertheless, state-of-art performs simulation in the Schrödinger picture, where the gate cost of simulating e−iHt generally scales at least like Ω(t(∥A∥+∥B∥)log(1/ϵ)), even if the trivial dynamics e−iAt of term A may be evaluated in closed-form. Even with specialized techniques that scale with a geometric mean
(
(k)t∥A∥(t∥B∥/ϵ)1/k), ∀k∈2
+. the uninteresting component
(t∥A∥) is the dominant pre-factor.
Unfortunately, simulation in the interaction picture is difficult. There, evolution by a time-independent Hamiltonian H is transformed in the rotating frame |φI(t)=eiAt|φ(t)
to evolution by a time-dependent Hamiltonian HI(t)=eiAtBe−iAt. This follows from an elementary manipulation of the Schrödinger equation:
i∂
t|φ(t)=(A+B)|φ(t)
→i∂t|φI(t)
=eiAtBe−iAt|φI(t)
. (1)
Implementing the time-ordered propagator [exp(−i∫0tH(s)ds)] that solves Eq. (1) on a quantum computer requires time-dependent simulation algorithms. These are generally more complicated than time-independent algorithms, and exhibit different cost trade-offs that do not appear favorable. For instance, an order-k time-dependent Trotter-Suzuki product formula has a cost that scales with the rate of change of H(t) like
(
(k)(t∧)1+1/(2k)), where ∧=maxs∥{dot over (H)}(s)∥1/2=
(∥[A,B]∥1/2). More advanced techniques based on compressed fractional queries appear to scale better like ˜
but in terms of queries to a unitary oracle that obscures the gate complexity as it expresses Hamiltonian matrix elements at different times in binary, and may be difficult to implement in practice. One proposed technique directly implements a truncated Dyson series of a the time-ordered propagator and argues, though without proof, a similar scaling in terms of queries to a different type of oracle.
Here, it is shown that simulation in the interaction picture can substantially improve the efficiency of simulation. In Section III, the general time-dependent simulation algorithm is completed by a truncated Dyson series by providing a rigorous analysis of the approximation and explicit circuit constructions, with improvements in gate and space complexity over previously expected costs. In Section IV, situations are identified where the gate complexity of implementing these queries scale with the interaction strength (∥B∥), and not the larger uninteresting component
(∥A∥). Such are the cases where simulation in the interaction picture is advantageous. In Section V, the potential of interaction-picture simulation by an electronic structure application in the plane-wave basis is discussed. Further, the cost of simulating the time-evolution of N spin-orbitals subject to long-range electron-electron interactions to
(N2t) gates is discussed, which is close to a quadratic improvement over prior art of
(N11/3t). In Section VI, a complexity theoretic perspective of the disclosure is provided by considering the abstract problem of simulating time-dependent sparse Hamiltonians in the standard query model. A quadratic improvement in sparsity scaling is shown, and optimized algorithms for simulating diagonally dominant Hamiltonians are provided. A more detailed summary of main results in each section is as follows.
In Section III (“Time-dependent Hamiltonian simulation by a truncated Dyson series”), an example algorithm for a general time-dependent simulation algorithm Theorem 1 is provided. The example algorithm is based on approximating a truncation and discretization of the Dyson series for general time-dependent Hamiltonians H(t) characterized by spectral-norm α≥maxt∥H(t)∥ and average rate-of-change
A rigorous analysis of its performance is provided in together with explicit circuit constructions. Bounds on the approximation error is evaluated in Section III A, which is used to obtain the cost of simulating the time-ordered evolution operator. This cost is found to be
queries to a certain unitary oracle HAM-T that encodes H(t) evaluated
points. Example implementations as disclosed in Section III B, characterized by Theorem 2, completes the scheme proposed by D. W. Berry, A. M, Childs, and R. Kothari, “Hamiltonian simulation with nearly optimal dependence on all parameters,” in Proceedings of the 56th Annual IEEE Symposium on Foundations of Computer Science (FOCS), pages 792-809 (October 2015), with some improvements in gate complexity. One obtains scaling with the average rate-of-change (log(
∥H∥
)), instead of the worst case
(log(maxt∥H(t)∥)). This allows one to simulate Hamiltonians with arbitrary elementwise slew-rates, so long as ∥H(t)∥ remains bounded. One can then devise a compression gadget in Section III C that reduces the qubit overhead complexity of this implementation to
(log(M)), compared to
of the original. This gadget is of independent interest and also applicable to the time-independent truncated Taylor series algorithm, discussed in Section XA for completeness.
In Section IV (“Accelerated interaction picture simulation”), one can apply this truncated Dyson series algorithm to simulate time-evolution by e−i(A+B)t in the interaction picture. One can evaluate the gate complexity in Section IV A for one possible construction of these queries HAM-T for the interaction picture Hamiltonian HI(t)=eiAtBe−iAt. This leads to the simulation algorithm Theorem 3 for time-evolution by e−i(A+B)t with gate complexity
((CB+Ce
where CB is the gate complexity a unitary oracle OB such that
and Ce(αB−1). This may be compared to state-of-art Schrödinger picture simulation algorithms for time-independent Hamiltonians, which have gate complexity
((CB+CA)(αB+αA)tpolylog(∥A∥,∥B∥,t,ϵ)), (3)
where CA is analogous to CB but for the Hamiltonian term A. The disclosed result Eq. (2) is then advantageous in cases where ∥A∥»∥B∥ and Ce(CB). There, the gate complexity scales with the interaction strength ∥B∥, and not the larger uninteresting component ∥A∥, up to polylogarithmic factors.
In Section V (“Application to the Hubbard model with long-ranged interactions”),
One can demonstrate a concrete advantage for Hamiltonian simulation in the interaction picture over time-independent simulation algorithms. One can consider a general Hubbard periodic model on N lattice sites in an arbitrary number of dimensions, which has Hamiltonian
where α{right arrow over (x)}σ†, α{right arrow over (x)}σ are Fermionic creation and annihilation operators indexed by lattice position {right arrow over (x)} and spin σ, and n{right arrow over (x)}σ is a number operator. Note that one can allow for arbitrary translationally invariant kinetic T(⋅) hopping terms and long-range density-density V(⋅) interactions, subject to arbitrary local disorder U(⋅). Provided that the kinetic term is extensive, embodiments of the disclosed interaction-picture algorithm have gate complexity (N2t), independent of U and V up to poly-logarithmic factors. This model generalizes electronic structure simulations in the plane-wave basis, considered in Section V A, where one obtains almost a quadratic improvement over the prior art of
(N11/3t) gates for this special case. This is complementary to recent work by Jeongwan Haah, Matthew B Hastings, Robin Kothari, and Guang Hao Low, Quan- tum algorithm for simulating real time evolution of lattice Hamiltonians. arXiv preprint arXiv:1801.03922, 2018, which achieves
(Nt) scaling, but under the assumption of short-range exponentially decaying coefficients.
In Section VI (“Application to sparse Hamiltonian simulation”), the application to the Hubbard model with long-ranged interactions is explained.
Consider next a complexity-theoretic generalization of the technology that has been developed for time-dependent simulation and simulation in the interaction picture. The standard query model for black-box d-sparse Hamiltonian simulation assumes access to a unitary oracle that provides the positions and values of non-zero entries of the Hamiltonian. Each row has at most d non-zero entries, and the maximum absolute value of any entry is ∥H∥max. This infortmation is also provided as a function of a time index for time-dependent Hamiltonians, In Section VI A, one considers this time-dependent case and show with Theorem 4 that the time-ordered evolution operator may be simulated using
queries. Though linear scaling with respect to d is well-known in the time-independent case, this is a quadratic improvement in sparsity scaling over prior art for the time-dependent case. An analogous treatment for simulating sparse time-independent Hamiltonian in the interaction picture in Section VI B, Theorem 5 has query complexity that scales only with ∥Hod∥max, which is the maximum absolute value of any off-diagonal entry. This improved cost
is particularly advantageous for the common situation of diagonally dominant Hamiltonians.
The time-dependent simulation technique applied throughout this work is motivated by the Dyson series which solves the time-dependent Schrödinger equation. The Dyson series is a power-series expansion of the unitary propagator that involves integrals of products of H(t) at different times. The technique implements a truncation of this power-series, where integrals in each term is discretized in time.
This technology is presented with a rigorous analysis of the truncation and discretization error in Section III A, followed by an explicit circuit construction that is in the spirit of earlier work in Section III B. Two improvements are of note. First, the gate complexity of the algorithm scales with the average rate-of-change (log(
∥H∥
)), instead of the worst case
(log (maxt∥H(t)∥). This allows one to simulate Hamiltonians with arbitrary element-wise slew-rates, so long as ∥H(t)∥ remains bounded. Second, in Section III C, an alternate implementation of the technique is introduced that reduces the space overhead of the original proposal substantially, and has the same query and gate complexity.
The complexity of any simulation algorithm depends on assumptions made about how information on the time-dependent Hamiltonian H(t) is accessed by the quantum computer. This section begins by defining an input model for the unitary quantum oracle HAM-T that encodes H(t). The complexity of time-dependent simulation is then expressed in terms of queries to HAM-T (It is assumed that the query complexity to a controlled-unitary black-box is the same as that to the original black-box. In general, this will not affect the gate complexity. Though there are often cleverer ways to implement an arbitrary controlled-unitary, in the worst-case, all quantum gates may be replaced by their controlled versions with constant overhead.), any additional primitive gates required beyond HAM-T, and the total number of qubits required including that for HAM-T. In later sections where the time-dependent simulation algorithm is applied to the interaction picture, this oracle is “opened up” and and the gate complexity of its implementation is discussed.
A common assumption in the time-independent situation is that H=Σj=1lαjHj is a sum of l local Hermitian terms with positive coefficients αj, and that each term e−iH(l) quantum gates for any t. In abstract models for sparse Hamiltonians, a binary representation of matrix entries Hjk and positions are returned in superposition by querying a certain unitary quantum oracle that is a natural generalization of classical Boolean circuits, Here, Hamiltonians H is considered encoded in a so-called standard-form. For any time-independent Hamiltonian H, it is assumed that there exists a unitary oracle HAM such that
(0|a⊗Is)HAM(|0
a⊗Is)=H/α, (5)
where HAM acts jointly on registers α, s consisting of nα, ns qubits respectively, and α≥∥H∥ is a normalization constant that should be made small for best performance. In other words, post-selected on the nα-qubit measurement outcome |0α the Hamiltonian H is applied on any arbitrary state in the system register s. Use of this is justified as it generalizes a variety of different input models. Thus any simulation algorithm that queries Eq. (5) for this general case readily specializes to more structured inputs. As an example, H=Σj=12
These unitaries each cost (l) gates—PREP is implemented by arbitrary quantum state preparation on l dimensions, and SEL is implemented by binary-tree control logic for l different inputs. Typically, the number of terms in the Hamiltonian scales like l=
(poly(ns)), so nα=
(log(l)). However, the number of ancilla qubits can also be polynomial in ns, such as in the simulation of sparse Hamiltonians where nα≥ns+2, or where a space-time complexity trade-off is possible.
A number of techniques may be applied to approximate the unitary time-evolution operator e−iHt to error ϵ by querying HAM. The combined Qubitization and quantum signal processing approach is one such algorithm with query complexity (αt+log(1/ϵ)) that is optimal for sparse Hamiltonians. Alternatively, one could apply a truncated Taylor series of e−iHt, which has a slightly worse query and space complexity, but benefits from being conceptually simpler. As the Dyson series algorithm is time-dependent generalization of the truncated Taylor series algorithm, a review and some useful background is provided before in Section X A for completeness.
A natural time-dependent generalization of Eq. (6) is the unitary oracle HAM-T encoding the Hamiltonian H(s) defined over time s ∈[0,t], with t>0. The continuous parameter s is discretized into an integer number of M>0 time bins which are indexed by m ∈ {0, 1, . . . , M−1}. Here, it is assumed that H(s) is encoded in an analogous format
Given black-box access to HAM-T, details of a quantum algorithm that approximates the Dyson series Eq. (8) are provided. The overall algorithm is efficient so long as HAM-T has an efficient implementation in terms of primitive quantum gates. More formally, the performance of the algorithm is captured by the following theorem, proven in the remainder of this section.
Theorem 1 (Hamiltonian simulation by a truncated Dyson series). Let a time-dependent Hamiltonian H(s) be characterized by spectral norm α≥maxs ∥H(s)∥, and average rate-of-change
ds. This Hamiltonian is defined for s∈[0, t], where |αt|≤½, and is encoded in an oracle HAM-T acting on ns+nα+nd qubits, as per Eq. (7), evaluated at
points, where nd=(log (M)). Then the time-ordered evolution operator
[exp(−i∫0tH(s)ds)] may be approximated to error ϵ with the following cost.
Note that, this algorithm performs time-evolution for a relatively short duration of t≤α−1. Time-evolution for longer durations is simulated by breaking the time-ordered evolution operator [exp(−i∫0tH(s)ds)] into L=
(αt) time-steps 0=t0<t1 . . . <tL=t of size
(α−1), and each step is implemented by the quantum circuit underlying Theorem 1 that queries a different HAM-T now defined for a Hamiltonian H(s) for s∈[tj,tj−1]. As the final error is Lϵ by a simple triangle inequality, one can rescale ϵ→ϵ/L. Thus simulation for the full duration to error ϵ has query complexity
A. Dyson Series Truncation Error
The choice of parameters in Theorem 1 depends on the scaling of error from a finite and discrete approximation of the Dyson series. Consider a unitary propagator U(t) defined to solve the time-dependent Schrödinger equation i∂t|φ(t)=H(t)|φ(t)
by providing the state |φ(t)
=U(t)|φ(0)
given the initial state |φ(0)
at time t=0. For any t>0 and bounded ∥H(t)∥, U(t) has an absolutely convergent infinite expansion known as the Dyson series
U(t)=I−i∫0tH(t1)dt1−∫t
This may be compactly represented using the time-ordering operator z,99 which sorts any sequence of k operators according to the times tj of their evaluation, that is, [H(tk) . . . H(t2)H(t1)]=H(tσ(k)) . . . H(tσ(2))H(tσ(1)), where σ is a permutation such that tσ(1)≤tσ(2)≤ . . . ≤tσ(k). For instance,
[H(t2)H(t1)]=θ(t2−t1)H(t2)H(t1)+θ(t1−t2)H(t1)H(t2) using the Heaviside step function θ. With this notation, the propagator is commonly expressed as a time-ordered evolution operator U(t)=
[e−i∫
The Dyson series of Eq. (9) may be approximated on a computer by truncating the infinite expansion at some finite order k=K≥0, and evaluating H(tj) at some finite number of M time-steps of size Δ=t/M. Thus, one has the approximation
which converges to U(t) in the limit K, M→∞ if H(t) is Riemann integrable. One can obtain some intuition on how the error of this approximation depends on choices of K, M, H(t). As ∥H(t)∥ is bounded,
Using Taylor's remainder theorem for the exponential function, the error of truncation for |t| maxs ∥H(s)∥=(1) is at most ϵ=
(1/K!). Thus the order of truncation is at most
As {tilde over (B)}k is a left Riemann sum of Dk, one expects that the error of discretization should scale like
These two errors may then be added by a triangle inequality to obtain the overall error from approximating [e−i∫
The explicit time-ordering in {tilde over (B)}k may be removed with a slightly different approximation
where Ck captures terms where at least one pair of indices mj=mk collide for j≠k. A rigorous statement of this intuition is given by the following Lemma 1, with a detailed proof in Section XI.
Lemma 1 (Error from truncating and discretizing the Dyson series). Let H(s): →
N×N be differentiable on the domain [0, t], and let
For any ϵ∈[0, 21-e], an approximation to the time ordered operator exponential of −iH(s) can be constructed such that
if one takes all the following are true.
On a quantum computer, it becomes possible to compute the Bk exactly and efficiently even it they sum over exponentially many points M. In contrast, computing these Riemann sums on classical computer would be prohibitive, even by approximate Monte-Carlo sampling, which is exacerbated by the sign problem. However, this efficient quantum computation crucially assumes that information on terms of the Hamiltonian are made accessible in a certain coherent manner. In the case discussed here, this is information is accessed by querying the black-box unitary oracle HAM-T in Eq. (7).
B. Algorithm by Duplicating Control Registers
A quantum algorithm is presented that applies the ϵ-approximation
to the time-ordered evolution operator [e−i∫
(K) queries to HAM-T. Second, since Ũ is ϵ-close to unitary, one applies one round of oblivious amplitude amplification to boost this probability to 1−
(ϵ). Among the contributions here are rigorous bounds on K and M, and the implementation of a step not discussed previously—the efficient preparation of a particular quantuni state that correctly selects a desired linear combination of time-ordered products of Hamiltonians. This step is non-obvious as the state has
(M!) different amplitudes, and in the worst-case would take
(M!) gates to create by arbitrary state preparation techniques. The cost of this implementation is captured by the following theorem.
Theorem 2 (Hamiltonian simulation by a truncated Dyson series with duplicated registers). Let a time-dependent Hamiltonian H(s) be characterized by spectral norm α≥maxs ∥H(s)∥, and average rate-of-change
This Hamiltonian is defined for s ∈ [0,t], where |αt|≤½, and is encoded in an oracle HAM-T acting on ns+nα+nd qubits, as per Eq. (7), evaluated at
points, where nd=(log (M)). Then the time-ordered evolution operator
[exp (−i∫0tH(s)ds)] may be approximated to error ϵ with the following cost.
Proof Let HAM-TK be a unitary that acts jointly on registers s, {right arrow over (α)}, {right arrow over (b)}, c, {right arrow over (d)}. This unitary is depicted in
were SWAPc swaps the two qubits of register c. Note that the action of HAM-TK is only defined for input states to register {right arrow over (b)} that are spanned by basis states of the unary encoding |k{right arrow over (b)}=|0
⊗k|1
⊗K-k, which determines the number of terms in the product. As seen in the figure, HAM-TK makes K queries to HAM-T and copies the α, b, and d registers K times.
Now, consider a state |sk{right arrow over (d)}, which is a uniform superposition of time-index states under the dimension-k unit simplex
This state is easy to prepare when k=1—there, it is simply a uniform superposition over M number states, and costs (log M) gates. Otherwise, naive methods based on rejection sampling have some success probability |γk|2 that decreases exponentially with large k. Let PREPK be one such unitary that prepares |sk
{right arrow over (d)} on measurement outcome |00
c.
PREPK|k{right arrow over (b)}|0
c{right arrow over (d)}:=|k
{right arrow over (b)}(γk|00
c|sk
{right arrow over (d)}+√{square root over (1−|γk|2)}|01
c . . . ). (14)
For each order k, the Riemann sum Bk may be implemented by DYSK:=(PREPK†⊗Iαs). HAM-TK·(PREPK⊗Iαs), as depicted in
Now, a linear combination of Dyson series terms is implemented by preparing a state with the appropriate amplitudes in the basis |k{right arrow over (b)}. The required state preparation operators are
and may be implemented using (K) primitive gates. Up to a proportionality factor β, one can obtain the desired linear combination for simulating time-evohnion.
If one chooses t to be sufficiently small such that β=2, one can then obtain a single time-step of the truncated Dyson series algorithm TDS in {right arrow over (d)} with an amplitude that |γk| that is sufficiently large so that t=⊖(1).
The state |sk{right arrow over (b)}c{right arrow over (d)} can be prepared in a number of ways. The most straightforward approach creates a uniform superposition of states over the dimension-k hypercube using nd×k Hadamard gates HAD, then uses k reversible adders to flag states |{right arrow over (m)}
d
produces |sk
{right arrow over (d)} with amplitude
is then obtained by controlling on input state |k
{right arrow over (b)}. Thus
Thus by choosing t=⊖(1)≈½, s=2 and a single round of oblivious amplitude amplification suffices. Notably, even though the success probability of naive state preparation |γk|2 decays rapidly, this only amounts to a constant factor slowdown compared to more sophisticated techniques that effectively prepare |sk{right arrow over (d)} with success probability≈1. For example, rather than rejection sampling, one may perform a reversible sort on on uniform superposition of states
such as with the quanthm bitonic sorting network. This effectively increases γk2 by a factor of k!, and uses significantly more ancilla qubits, but ultimately allows us to implement time steps t≈ln2≈0.693 larger by a constant factor. □
C. Compression of Control Registers
The dominant contribution to the space overhead of the truncated Dyson series algorithm in Section III B is the K-fold duplication of registers {right arrow over (α)}, {right arrow over (d)} in the oracle DYSK. A general technique is now presented that when applied to DYSK avoids this duplication and completes the proof of Theorem 1. Suppose one has a sequence of K unitary oracles U1, U2, . . . , UK, defined as
(0|α⊗Is)Uk(|0
α⊗Is)=Hk, (19)
where Hj is some arbitrary matrix with bounded spectral norm ∥Hj∥≤1. In the general problem, one can construct a quantum circuit V that when controlled on index k in register b, applies the sequence Uk . . . U2U1, that is
Though binary control logic for this sequence is trivial when Hk is unitary, the complication here is that Hk is in general non-unitary and so the probability of successfully measuring |0α is less than one. Any other measurement outcome corresponds failure as it applies on register α an operator that is not Hk. This complication is overcome by introducing two more registers b, c of size
(log(K)) gubits that count the number of successful measurements.
Lemma 2 (Compression gadget). Let {Uk,k∈[K]} be a set of K unitaries that encode matrices Hk as defined in Eq. (19). Then there exists a quantum circuit V satisfying Eq. (20) such that the number of qubits nb=(log (K)), nc=
(log (K)). The cost of V is one query to each of controlled-controlled-Uk, and
(K (nα+log(K))) additional primitive quantum gates.
Proof. Let b, c be counter registers. Using notation where commas in the subscript indicates respective assignments, these represent an nb,c-bit integer lb,c=Σr=0nb,c:=|q0q1 . . . qn
b,c, where qr ∈{0, 1}. The size of these integers are determined by nb=nc+1=┌log2(K+1)┐+1. The unitaries Uj will be applied conditional on the trailing bit q0=0 in the c register, and the leading bit qn
CC−U
k
:=I
⊗n
n
−2⊗(|00|b
0|c
Consider the circuit in α state, and decrement lb by one conditional on the α register being in the |0
α state. This is accomplished by multiply-controlled modular addition
As one adds integers of size (K), each application of modular addition costs
(log(K)) primitive gates and requires
(log(K)) qubits. Implementing the multiple controls costs
(nα) primitive gates and up to nα extra qubits.
0|α⊗Is)Uk(|0
α⊗Is)=Hk, controlled on number state |k
b, k∈{0,1, . . . , K}.
Restricted to input states |0α|l
b|0
c, where l ∈{2n
α|2
b|0
c|φ
s for K=3.
In the above, negative integers correspond to the trailing bit qnb, so one can simply relabel k=lb+1 mod 2n
Proof of Theorem 1. Thus DYSK may be implemented with reduced ancilla overhead through Lemma 2 provided that one finds a sequence {Uk} such that Hk . . . H2H1∝Bk—in other words,
where ‘others’ represent registers with size independent of K, and γk is a scaling factor depends on the choice of Uk. This sequence is obtained by combining three matrices. First, a unitary matrix U that prepares a uniform superposition
Second, the block-diagonal matrix D=Σm=0M−1|mm|⊗H(Δm) implemented by HAM-T. Third, a strictly upper-triangular matrix G ∈
M×M with elements
The non-unitary triangular operator G is implemented by using an integer comparator COMP acting on registers d, e, f consisting of nd=ne and nf=1 qubits. For any input number state index |jd, let one compare j with a uniform superposition state
Conditional on i≥j, perform a NOT gate on register f. One can then swap registers d,e, and unprepare the uniform superposition. On input |jd|0
e|0
f, this implements the sequence.
where |i≥jf=|1
f if i≥j and is |0
f if i<j. This defines the following circuit that encodes G.
The gate complexity of G is dominated by the comparator COMP, which costs (nd) primitive quantum gates.
One may then verify that the terms Bk are generated by the following sequence
Thus one can make the choice
Combined with Lemma 2, this leads to the circuit of
One can then select the desired linear combination of different orders in the Dyson series with the state
where one assumes that t<1 for convergence. Thus by choosing t=⊖(1)≈½, the success probability of approximating e−1∫
(ϵ) using a single round of oblivious amplitude amplification. □
Time-independent Hamiltonians H become time-dependent HI(t) in the interaction picture. This requires the use of time-dependent Hamiltonian simulation algorithm, which scale with parameters of HI(t) that differ from those for the time-independent case. For certain broad classes of Hamiltonian identified in Section IV A, these different dependencies allow us to improve the gate complexity of approximating the time-evolution operator e−iHt by instead performing simulation in the interaction picture using the truncated Dyson series algorithm Theorem 1.
The interaction picture can be viewed as an intermediate between the Schrödinger and Heisenberg pictures wherein some of the dynamics is absorbed into the state and the remainder is absorbed into the dynamics of the operators. If the Hamiltonian in the Schrödinger picture is H=A+B and |φ(t)=e−iHt|φ(0)
then the Hamiltonian in the interaction picture is HI(t)=eiAtBe−iAt and i∂t|φI
(t)=HI(t)|φI(t)
with |φI(t)
=eiAt|φ(t)
for all t. These relations can easily be seen by substituting into the Schrödinger equation:
Note that if one started with time-dependent B(t), that is H(t)=A+B(t), the interaction picture Hamiltonian is HI(t)=eiAtB(t)e−iAt. The following results generalize easily to this situation, and so one can consider time-independent B for simplicity.
The advantage of this representation is a Hanmiltonian with a smaller norm ∥H(t)∥=∥B∥≤∥A∥+∥B∥, but at the price of introducing time-dependence. In general, one cannot write a closed form expression for the time-evolution operator. The following notation is commonly used to express the time evolution operator [e−i∫
, the state after evolution for t>0 may thus be written as
and evolution by the full duration is decomposed into evolution by L shorter segments of duration .
Using the simulation algorithm Theorem 1 to simulate each segment in Eq (34) leads to the following result
Lemma 3 (Query complexity of Hamiltonian simulation in the interaction picture). For any Hamiltonian H=A+B, let HAM-T be a unitary oracle that encodes HI(t)=eiAtBe−iAt at
uniformly spaced values of t ∈[0,=
(αB−1)].
Then e−iHt may be approximated to error ϵ using
Proof. The number of segments L=(αBt) in Eq. (34) is determined by a normalization constant αB≥maxt∥HI(t)|=∥B∥, and so each segment is of size
=
(αB−1). After rescaling ϵ→ϵ/L by the number of segments, the total query complexity is obtained from the truncated Dyson series algorithm Theorem 1. Using the facts maxs ∥HI(s)∥≤∥B∥,
∥H∥
=∥[A,B]∥≤2∥A∥∥B∥, and
=
(αB−1), it suffices to choose
Each segment is simulated
queries to HAM-T, thus simulation for the full duration has query complexity
A. Comparison with Simulation of Time-Independent Hamiltonians in the Schrödinger Picture
The cost of simulation in the interaction picture using the truncated Dyson series is now compared with state-of-art simulation in the Schrödinger picture with time-independent Hamiltonians using the truncated Taylor series approach outlined in section X A. Up to logarithmic factors, this comparison is valid as the truncated Taylor series algorithm cost differs from optimal algorithms by only logarithmic factors.
For any Hamiltonian H=A+B, let us assume access to the oracles
which have gate complexity CA,CB respectively. The gate complexity of time-independent simulation e−i(A+B)t is then
In the interaction-picture, one can prove the following theorem
Theorem 3 (Gate complexity of Hamiltonian simulation in the interaction picture). For any Hamiltonian H=A+B, let
C
TDS=((CB+Ce
Proof. let CHAM-T|ϵ| be the gate complexity of approximating HAM-T in Eq. (35) to error ϵ. Then from Lemma 3, the total gate complexity of simulating e−iHt=(e−iAt[e−i∫
(αB−1), is
One possible decomposition of HAM-T is
Using Ce(|t| polylog(t, ϵ)), synthesis of Σm=0M−1|m
m|d⊗eiArm/M by exponentiated power of controlled-eiAt/M,eiA2t/M,eiA4t/M, . . . , eiA2
Ce
(Ce
(CB+Ce
From comparing Eqs. (38) and (39), one may iminediately state sufficient criteria for when simulation in the interaction picture is advantageous over simulation in the Schrödinger picture.
One can now apply the technology developed in Section III and Section IV for Hamiltonian simulation in the interaction picture to physical problems of practical interest. Here, the focus is on the Hubbbard model in d-dimensions with N lattice sites subject to local disorder and translational-invariant two-body couplings that may be long-ranged in general. One can perform a gate complexity comparison with simulation by time-independent techniques, and later in Section V A, this model is specialized to that of quantum chemistry simulations in the plane-wave and dual basis.
The Hubbard Hamiltonian considered has the form H=T+U+V, where T is the kinetic energy hopping operator, U is the local single-site potential, and V is a symmetric translationally-invariant two-body density coupling term between opposite spins. In the dual basis, H is expressed in terms of single-site Fermionic creation and annihilation operators {α{right arrow over (x)}σ, α{right arrow over (y)}σ′}={α{right arrow over (x)}σ554 , α{right arrow over (y)}σ′554 }=0, {α{right arrow over (x)}σ, α{right arrow over (y)}σ′554 }=δ{right arrow over (x)}{right arrow over (y)}δσσ′, and the number operator n{right arrow over (x)}σ=α{right arrow over (x)}σ554 α{right arrow over (x)}σ. The subscript {right arrow over (x)} ∈ [−N1/d,N1/d]d indexes one of N lattice sites in d dimensions, and σ ∈ {−1,1} is a spin-½ index. Explicitly,
where the coefficients T({right arrow over (s)}), U({right arrow over (s)},σ), V({right arrow over (s)}) are real functions of the lattice index {right arrow over (s)} ∈ [−N1/d,N1/d]d.
Further simplification of Eq. (43) is possible as the kinetic energy operator is diagonal in the plane-wave basis. This basis related to the dual basis by a unitary rotation FFFT, an acronym for ‘Fast-Fermionic-Fourier-Transform’ that implements a Fourier transform over the lattice site indices, resulting in Fermionic creation and annihilation operators c{right arrow over (p)}σ554 , c{right arrow over (p)}σ.
By substituting the Fourier transform of the kinetic term
an equivalent expression for the Hubbard Hamiltonian is
where each term is now diagonal in their respective bases.
A simulation of this Hamiltonian on a qubit quantum computer requires a map from its Fermionic operators to spin operators. One possibility is the Jordan-Wigner transformation, which requires some map from Fermionic indices to spin indices, such as
Subsequently, one can replace
Note the very useful property where number operators map to single-site spin operators n{right arrow over (x)}σ→½(I−Z∫({right arrow over (x)},σ)) (under this encoding. Moreover, FFFT, can be implemented using (N log (N)) primitive quantum gates in the Jordan-Wigner representation.
Now evaluate the worst-case gate-complexity for time-evolution by H in the Schrödinger picture. As an example of state-of-art using the truncated Taylor series approach in Section X A, e−i(T+U+V)t may be simulated using ((αT+αU+αV)t log (1/ϵ)) queries to oracles that encode T, U, and V.
The cost of simulation depends strongly on the coefficients {tilde over (T)} ({right arrow over (p)}), U ({right arrow over (x)}), V ({right arrow over (x)}). The most straightforward approach synthesizes these oracles using the linear-combination of unitaries outline in Eq. (6). For instance, OT=(PREPT†⊗FFFT†)·SELT·(PREPT⊗FFFT), where
and similarly for U and V. As there are (N) distinct coefficients in the worst-case, each of PREPT,U,V costs
(N). As V has
(N2) terms, SELV has the largest cost of
(N2). Thus overall gate complexity is
(N2(αT+αU+αV)t log (1/ϵ)). As there are
(N2) coefficients, max{αT, αU, αV}=
(αT+N2), and so the cost of simulation is
(N2(αT+N2)t log (1/ϵ)). (50)
The worst-case gate-complexity may be substantially improved by instead simulating H in the interaction picture using the truncated Dyson series algorithm in Section III. The key idea is to simulate in the rotating frame of the interactions e−i(U+V)t, where the Hamiltonian becomes a time-dependent HI(t)=ei(U+V)tTe−i(U+V)t. Using the same oracle OT in Eq. (48) for the kinetic term, the cost of time-evolution e−i(T+U+V)t by this technique is given by Eq. (39):
All that remains is to bound the cost of time-evolution by the term Ce
There are (N2) exponentials, and so Ce
(N2). Compared to Eq. (50), one can already see an improvement by a factor
(N) in cases where the kinetic energy is extensive, that is αT=
(N), so CTDS=
(N2αTtpolylog(N, αT, t, ϵ)).
A further improvement to
C
TDS=(NαTtpolylog(N, αT, t, ϵ)) (53)
is possible by a more creative evaluation of the gate complexity of ei(U+V)t to reduce its cost from (N2) to
(N log (N)). Clearly, Ce
(N) with N commuting terms poses no problem. The difficulty lies in constructing time-evolution by the two-body term eiVt such that Ce
(N log N). As V is a sum of
(N2) commuting terms, a gate cost
(N2) appear unavoidable. However, this may be reduced by exploiting the translation symmetry of its coefficients with a discrete Fourier transform. As V({right arrow over (x)})=V(−{right arrow over (x)}) is real and symmetric, its discrete Fourier transform {tilde over (V)}({right arrow over (k)})=Σ{right arrow over (x)}V({right arrow over (x)})ei2π{right arrow over (x)}·{right arrow over (k)}/N
The strategy for implementing e−iVt is based on the following observation: Suppose one had a unitary oracle O{right arrow over (A)}|j|0
o|0
garb=|j
|Aj
o|g(j)
garbage that on input |j
∈
dim[{right arrow over (A)}], outputs on the l quoit o register, the value of the jth element of some complex vector {right arrow over (A)}, together with some garbage state |g(j)
garb of lesser interest required to make the operation reversible. One may then perform a phase rotation that depends on Aj as follows:
If Aj were represented in binary, say, Aj=Σk=0l−1qk2−k, PHASE could be implemented using (l) controlled-phase |0
0|⊗I+|1
1|⊗e−it2
Thus, one can construct a unitary OV,binary with the property that
where the value ƒ({right arrow over (n)}) is encoded in l=(log(1/ϵ)) bits. This is implemented by the following sequence, where one has omitted the garbage register for clarity.
The cost of OV,binary may be expressed in term of the four standard reversible arithmetical operations, addition, subtraction, division, and multiplication, which each cost (poly(l)) primitive gates. The first steps ADD adds
(N) pairs of two bits n{right arrow over (x)},σ=1 +n{right arrow over (x)},σ=1 and costs
(N) arithmetic operations. The second step FFT is a d-dimensional Fast-Fourier-Transform on
(N) binary numbers and requires
(N log (N)) arithmetic operations. The third step computes the absolute-value-squared of
(N) binary numbers, and uses
(N) arithmetic operations. The last step multiplies each |{tilde over (x)}k|2 with the corresponding Vk, and costs
(N) arithmetic operations. This last step may actually be avoided by rescaling the time parameter e−iA
(N log (N) poly(l))=
(N log(1/ϵ)). Using one query to OV,binary, OV,binary†, and
(N log(1/ϵ)) primitive quantum gates, one may thus apply e−iVt with a phase error
(ϵ) for a fixed value of t.
The Hamiltonian that generates time-evolution for a state |φ(t) of interacting electrons in d=3 dimension consists of three operators: the electron kinetic energy T, the electron-nuclei potential energy U, and the electron-electron potential energy V. It has been demonstrated that this electronic structure Hamiltonian is a special case of the general Hubbard Hamiltonian of Eq. (46). In the plane-wave basis,
where k{right arrow over (p)}=2π{right arrow over (p)}/Ω1/3, {right arrow over (p)} ∈ [−N1/3,N1/3]3, r{right arrow over (p)}={right arrow over (p)}(Ω/N)1/3, Ω represents the volume of the simulation, and nj is the nuclear charge of the jth nucleus. Whereas T is diagonal here, one may find an alternate basis where U and V are diagonal. This is the dual basis, defined through the unitary transform FFFT of Eq. (44). In this basis, let us define the state |φD(t)=FFFT†|φ(t)
, which evolves under the Hamiltonian H, which is of exactly that of Eq. (46), with coefficients
Thus the cost of time-evolution by the electronic structure Hamiltonian e−iHt using the interaction picture is given by Eq. (53). The only parameter that depends on the problem is the normalization factor
Thus the total gate complexity of time-evolution under the assumption of constant density (i.e. N/Ω∈O(1)) is
In contrast, the cost of simulation in the plane-wave dual basis by prior art Ryan Babbush, Nathan Wiebe, Jarrod McClean, James McClain, Hartmut Neven, and Garnet Kin Chan, “Low depth quantum simulation of electronic structure,” arXiv preprint arXiv:1706.00023, 2017, applies the ‘Qubitization’ technique (see Guang Hao Low and Isaac L Chuang, “Hamiltonian simulation by qubitization,” arXiv preprint arXiv:1610.06546, 2016) and has gate complexity that scales like Õ(N11/3t), and also has a polynomial dependence on the nuclear charge nj. Embodiments of the disclosed method outperform this, and notably depends only poly-logarithmically on the nuclear charges within the problem.
In this section, a complexity-theoretic perspective of the improvements that are enabled by simulation with the truncated Dyson series and simulation in the interaction picture are submitted. One can do so by evaluating the query complexity for the simulation of sparse Hamiltonians H. Such Hamiltonians of dimension N are called d-sparse if there are at most d=(polylog(N)) non-zero entries in every row, and the position and values of these entries may be efficiently output, in say a binary representation, by some classical circuit of size
(polylog(N)). This abstract model is useful in quantum complexity theory as a natural generalization these classical circuits leads to unitary quantum oracles that can be queried to access the same information, but now in superposition. With this model, one can achieve inSection VIA time-dependent simulation with a square-root improvement with respect to the sparsity parameter, and gate complexity scaling with the average instead of worst-case rate-of-change ∥H∥. By moving to the interaction picture in Section VI B, one can find more efficient time-independent simulation algorithms for diagonally-dominant Hamiltonians.
This model assumes that the Hamiltonian is input to the simulation routine through two oracles: Of and OH. OH is straight forward; it provides the values of the matrix elements of the Hamiltonian given a time index |t and indices |x
, |y
to for the row and column of H as follows
O
H
|t,x,y,0)=|t,x,y,Hxy(t) (62)
Of provides the locations of the non-zero matrix elements in any given row or column of H. Specifically, let ƒ(x,j) give the column index of the jth non-zero matrix element in row x if it exists and an appropriately chosen zero element if it does not. In particular, let rt,j be the list of column indices of these non-zero matrix elements in row J. One can then define, with a time-index t,
O
ƒ
|t,x,j
=|t,x,ƒt(x,j)
. (63)
A. Simulation of Sparse Time-Dependent Hamiltonians
Applying the truncated Dyson series simulation algorithm to sparse Hamiltonians requires us to synthesize HAM-T in Eq. (7) from these oracles OH, OF. This is possible by a straightforward construction.
Lemma 4 (Synthesis of HAM-T from sparse Hamiltonian oracles). Let an N×N time-dependent d-sparse Hamiltonian H(s) with max-norm ∥H∥max:=maxs∥H(s)∥max be defined on the interval s ∈[0, t] to np bits of precision. Then
can be implemented with O(1) queries to Of and OH, and (poly(np)+log (N)) primitive gates.
Proof. Let Ucol, Urow be the following unitary transformations
Let |φ=Σt,kαt,k|t
d|k)s. One then has that
As this result holds for any input state |φ, the choice HAM-T=Urow†·Ucol satisfies Eq. (64).
The query cost then follows from the fact that Ucol and Urow†can be implemented using O(1) calls to Oƒ. In particular, Ucol can be prepared in the following steps:
Therefore accessing Ucol unitaries requires O(1) queries to the fundamental oracles as claimed, along with a arithmetic circuit, of size polynomial in the number of bits used to represent |Hk,p(t), for computing trigonometric functions of the magnitudes of the complex-valued matrix elements as well as their arguments. The argument that Urow†requires O(1) queries follows in exactly the same manner, but with an additional final step that swaps the s and α1 registers.□
Once HAM-T, is obtained, the complexity of simulation follows directly from previous results.
Theorem 4 (Simulation of sparse time-dependent Hamiltonians). Let an N×N time-dependent Hamiltonian H(s) for s ∈ [0, t] with max-norm, ∥H∥max:=maxs∥H(s)∥max and average rate-of-change
ds be encoded in the oracles OH and Oƒ from Eqs. (62) and (63) to np bits of precision. Let α=d∥H∥max and =tα. Then the time-ordered evolution, operator
[exp(−iƒ0tH(s)ds)] may be approximated with error ϵ using
Proof. From Theorem 1, the number of qubits requried is (ns+na+nd+log log (1/ϵ)). Values for these parameters are obtained from the construction of HAM-T in Lemma 4, which also requires an additional np qubits for the bits of precision to which H is encoded. In this construction, nα=
(ns)=
(log (N)). nd is obtained from the number of time-discretization points required by Theorem 1. Simulation for time t is implemented by simulating segments of duration ⊖(α−1). As there are ⊖(tα) segments, one can rescale ϵ→⊖(ϵ/(tα)).□
This is a quadratic improvement in sparsity d. Furthermore, instead of scaling with the worst-case (log (maxs∥{dot over (H)}(s)∥)), one can obtain scaling with average rate-of-change
∥{dot over (H)}∥
. If one further assumes that the computed matrix elements Hj,k are not exact, the number of bit of precision scales like np=
(log (∥H∥t/ϵ)). Note several generic improvements to Theorem 4 are possible, but will not be pursued further as they are straightforward. For instance, if ∥H(t)∥max as a function of time is known, one may use step sizes of varying size by encoding each segment t ∈[tj,tj+1] with the largest maxt∈[t
B. Simulation of Sparse Time-Independent Hamiltonians in the Interaction Picture
This section concerns time-independent d-sparse Hamiltonians H=A +B where A is diagonal and Bkk=0 for all k. In particular, consider the case of diagonally dominant Hamiltonians, where ∥A∥≥d∥B∥max. Given norms for each of these terms ∥A∥ and ∥B∥max, it is straightforward to simulate time-evolution e−iHt in the Schrödinger picture. For instance, using the truncated Taylor series approach in Eq. (38), one obtains a query complexity of (t(d∥B∥max+∥A∥) polylog(t,d,∥A∥,∥B∥max, ϵ)). By instead simulating HI(t)=eiAtBe−iAt in the interaction picture, the dependence on ∥A∥ can be removed, which is particularly useful in cases of strong diagonal dominance ∥A∥≥d∥B∥max, of which the Hubbard model with long-ranged interactions in Section V is an example. Similar to the disclosed results for time-dependent sparse Hamiltonian simulation in Section VIA, this is easily proven by mapping the input oracles OH and Oƒfor matrix values and positions to the oracles of Theorem 3 for the more general result.
Theorem 5 (Simulation of sparse diagonally dominant Hamiltonians). Let an N×N time-independent Hamiltonian H for s ∈[0,t] with max-norm ∥A∥ for the diagonal component and max-norm ∥B∥max for the off-diagonal component be encoded in the oracles OH and Oƒ from Eqs. (62) and (63) to np bits of precision. Let αb=d∥B∥max. Then the time-evolution operator e−iHt may be approximated with error ϵ using
Proof. This follows immediately by combining the query complexity of Lemma 3 to e−iAt and HAM-T that encodes the Hamiltonian HI(t)=eiAtBe−iAt in the rotating frame, with the query complexity of the approach in Theorem 3 for synthesizing these oracles using the input oracles OH and Oƒ. One possible decomposition of HAM-T is
where αB=d∥B∥max, =
(αB−1) and
Note that with this construction,
First, synthesize (Σm=0M−1∥mm|d⊗eiAτm/M) using
(1) queries to the input oracles OH, and
(log(N)+nplog(M)) primitive gates. Since A is diagonal, e−iAt can be simulated for any t>0 using only two queries. This is implemented by the following steps:
Step one uses ns=(log (N)) CNOT gates to copy the computational basis state |k
. Step three applies
(np) phase rotation with angle controlled by the bits of |Hk,k
, and the value of t, which is given beforehand. Subsequently, (Σm=0M−1|m
m|d⊗Iα⊗eiAtm/M) may be implemented by a sequence of rotations with angles increasing in a geometric series, and each controlled by a different qubit in the d register, e.g. controlled−eiAτ2
(log (M)) queries. However, it is only necessary to compute |Hk,k
once as the entire sequence of controlled-phases may be applied after step three. Similarly, it is only necessary to copy the computational basis state
(1) times.
Second, synthesize OB using (1) queries to the input oracles OH and Oƒ. How this is done should be clear from Lemma 4, by omitting the time-index, and preparing the state 0|0
α
α
(poly(np)+log (N)). Thus e−iAτ and HAM-T combined have query complexity
(1) to OH and Oƒ, and gate complexity
(poly(np) log (M)+log (N)). By substituting into Lemma 3, one can obtain the stated results.□
This provides a formal proof that the query complexity of simulating a Hamiltonian, within the interaction picture, is independent of the magnitude of the diagonal elements of the Hamiltonian.
In this section, example methods for performing aspects of the disclosed embodiments are disclosed. The particular embodiments described should not be construed as limiting, as the disclosed method acts can be performed alone, in different orders, or at least partially simultaneously with one another. Further, any of the disclosed methods or method acts can be performed with any other methods or method acts disclosed herein.
In
At 710, a quantum state, Hamiltonian function, and evolution time are input. At 712, a classical computer is used to construct an interaction picture Hamiltonian by conjugating each term in the off-diagonal Hamiltonian by the evolution according to the diagonal terms. At 714, the interaction picture version of the Hamiltonian is simulated on a quantum computer using a time-dependent Hamiltonian simulation method (e.g., such as that of
At 1410, a quantum computer is configured to simulate a quantum system, wherein a Hamiltonian in the simulation is represented in the interaction picture. At 1412, a simulation of the quantum system is performed using the quantum computer.
In certain implementations, the simulation of the quantum system is a subroutine that is repeated two or more times. In some implementations, the simulation is performed using linear combinations of matrices. For instance, in some cases, the simulation uses linear combinations of unitaries performed on a diagonally dominant matrix (e.g., using linear combinations of unitaries performed on the diagonally dominant components of the diagonally dominant matrix). In certain implementations, the quantum system is modelled by a Hubbard model. In particular implementations, the quantum system describes a physical chemical system or molecule. In some implementations, the Hamiltonian is sparse and the simulation uses a state of an auxillary qubit to encode matrix elements of the Hamiltonian instead of using graph decomposition techniques. In certain implementations, the simulation is performed by compressing ancillas for quantum simulation of a time-dependent Hamiltonian.
At 1510, a quantum algorithm is implemented on a quantum computer for simulating a general sparse time-dependent quantum system. In this embodiment, the quantum algorithm does not use graph decomposition techniques.
In certain implementations, a Hamiltonian used in the simulation is represented in the interaction picture. In some implementations, the simulation uses linear combinations of unitaries performed on a diagonally dominant matrix. In particular implementations, a Hamilltonian used in the simulation is represented in the interaction picture and is chosen to be a diagonal matrix. In certain implementations the quantum system is modelled by a Hubbard model, a physical chemical system, or a molecule. In further implementations, the simulation includes compressing ancillas used to index a time for the simulation.
With reference to
The computing environment can have additional features. For example, the computing environment 1000 includes storage 1040, one or more input devices 1050, one or more output devices 1060, and one or more communication connections 1070. An interconnection mechanism (not shown), such as a bus, controller, or network, interconnects the components of the computing environment 1000. Typically, operating system software (not shown) provides an operating environment for other software executing in the computing environment 1000, and coordinates activities of the components of the computing environment 1000.
The storage 1040 can be removable or non-removable, and includes one or more magnetic disks (e.g., hard drives), solid state drives (e.g. flash drives), magnetic tapes or cassettes, CD-ROMs, DVDs, or any other tangible non-volatile storage medium which can be used to store information and which can be accessed within the computing environment 1000. The storage 1040 can also store instructions for the software 1080 implementing any of the disclosed simulation techniques in a quantum computing device. The storage 1040 can also store instructions for the software 1080 for generating and/or synthesizing any of the described techniques, systems, or quantum circuits.
The input device(s) 1050 can be a touch input device such as a keyboard, touchscreen, mouse, pen, trackball, a voice input device, a scanning device, or another device that provides input to the computing environment 1000. The output device(s) 1060 can be a display device (e.g., a computer monitor, laptop display, smartphone display, tablet display, netbook display, or touchscreen), printer, speaker, or another device that provides output from the computing environment 1000.
The communication connection(s) 1070 enable communication over a communication medium to another computing entity. The communication medium conveys information such as computer-executable instructions or other data in a modulated data signal. A modulated data signal is a signal that has one or more of its characteristics set or changed in such a manner as to encode information in the signal. By way of example, and not limitation, communication media include wired or wireless techniques implemented with an electrical, optical, RF, infrared, acoustic, or other carrier.
As noted, the various methods, techniques for controlling a quantum computing device, circuit design techniques, or compilation/synthesis techniques can be described in the general context of computer-readable instructions stored on one or more computer-readable media. Computer-readable media are any available media (e.g., memory or storage device) that can be accessed within or by a computing environment. Computer-readable media include tangible computer-readable memory or storage devices, such as memory 1020 and/or storage 1040, and do not include propagating carrier waves or signals per se (tangible computer-readable memory or storage devices do not include propagating carrier waves or signals per se).
Various embodiments of the methods disclosed herein can also be described in the general context of computer-executable instructions (such as those included in program modules) being executed in a computing environment by a processor. Generally, program modules include routines, programs, libraries, objects, classes, components, data structures, and so on, that perform particular tasks or implement particular abstract data types. The functionality of the program modules may be combined or split between program modules as desired in various embodiments. Computer-executable instructions for program modules may be executed within a local or distributed computing environment.
An example of a possible network topology 1100 (e.g., a client-server network) for implementing a system according to the disclosed technology is depicted in
Another example of a possible network topology 1200 (e.g., a distributed computing environment) for implementing a system according to the disclosed technology is depicted in
With reference to
The environment 1300 includes one or more quantum processing units 1302 and one or more readout device(s) 1308. The quantum processing unit(s) execute quantum circuits that are precompiled and described by the quantum computer circuit description. The quantum processing unit(s) can be one or more of, but not limited to: (a) a superconducting quantum computer; (b) an ion trap quantum computer; (c) a fault-tolerant architecture for quantum computing; and/or (d) a topological quantum architecture (e.g., a topological quantum computing device using Majorana zero modes). The precompiled quantum circuits, including any of the disclosed circuits, can be sent into (or otherwise applied to) the quantum processing unit(s) via control lines 1306 at the control of quantum processor controller 520. The quantum processor controller (QP controller) 1320 can operate in conjunction with a classical processor 1310 (e.g., having an architecture as described above with respect to
With reference to
In other embodiments, compilation and/or verification can be performed remotely by a remote computer 1360 (e.g., a computer having a computing environment as described above with respect to
In particular embodiments, the environment 1300 can be a cloud computing environment, which provides the quantum processing resources of the environment 1300 to one or more remote computers (such as remote computer 1360) over a suitable network (which can include the internet).
The dynamics of interest in many condensed matter systems occur in the low-energy subspace of the Hamiltonian. This work demonstrates that by simulating quantum dynamics in the interaction picture, the cost of simulation on a quantum computer for a large class of Hamiltonians can be made to scale with the low-energy component. This is especially useful when the dynamics at high energy scales are, in a certain sense, simple, but nevertheless are necessary as they couple to low energy scales. This represents a significant advance compared to state-of-art time-independent quantum simulation algorithms that generally scales with the spectral norm of the full Hamiltonian. Finding further improvements in simulation algorithms specialized for low-energy subspaces remains a major open problem. Embodiments of the disclosed approach are complementary to alternatives based on spectral gap amplification, and the possibility of combining these methods is an interesting future direction.
More generally, the complexity of time-independent quantum simulation for generic Hamiltonians, given minimal information, appears to have been resolved. Thus future advancements, such as this work, will likely focus on exploiting the detailed structure of Hamiltonians of interest. For example, the result for time-dependent sparse Hamiltonian simulation has scaling (d∥H∥maxt), but improvement to
(√{square root over (d∥H∥max∥H∥1t)}) is straight-forward, if the induced one-norm ∥H∥1 of the Hamiltonian is also known beforehand. The promise of results in similar directions is exemplified by recent work that exploit the geometric locality of interactions, or the sizes of different terms in a Hamiltonian. The challenge will be finding characterizations of Hamiltonians that are sufficiently specific so as to enable a speedup, yet sufficiently general so as to include problems of practical and scientific value.
A. The Truncated Taylor Series Algorithm
The truncated Taylor series simulation algorithm was a major advance in quantum simulation for its conceptual simplicity and computational efficiency. The original algorithm is motivated by truncating the Taylor expansion of the time-evolution operator at degree K.
Assuming that t>0 and that the truncation order K≥2∥H∥t, the norms of
Thus any unitary quantum circuit TTS that acts jointly on registers α,b,s and applies the non-unitary operator (00|ab⊗Is)TTS(|00
ab⊗Is)≈
Solving Eq. (74) for ∥H∥t=(1) gives the required truncation order
The simulation algorithm TTS in
state.
The original algorithm implements HAMK using K queries to controlled-HAM
C-HAM:=|11|b⊗Ias+|0
0|b⊗HAM (76)
with K copies of registers α and b. The state |k{right arrow over (b)}=|0
⊗k|1
⊗K−k that selects desired powers of H is encoded in unary, and so COEFK may be implemented using
(K) primitive gates. Up to a proportionality factor β, the unitaries of Eq. (75) allow us to implement the desired linear combination
As (δ). Thus, one can choose ln 2≤t=
(1) such that β=2. If one desires |t|<ln 2, β may be decreased by appending a single-qubit ancilla and noting that |
0|eiθX|0
|=|cos θ|≤1. Thus simulation is accomplished with the circuit
TTS=TTS
β=2(REF{right arrow over (α)}b⊗Is)TTSβ=2†(REF{right arrow over (α)}b⊗Is(TTSβ=2,REF{right arrow over (α)}b=I{right arrow over (α)}b−2|00|{right arrow over (α)}b. (78)
This approximates time-evolution by e−iHt with error ∥(0|{right arrow over (α)}b⊗Is)TTS(|0
{right arrow over (α)}b⊗Is)−e−iHt∥=
(δ). In order to simulate evolution e−iHT by longer times T>t, one can apply TTST/t− here t=⊖(1) is chosen such that T/t is an integer. The overall error
ϵ=∥(0|{right arrow over (α)}b⊗Is)TTST/t(|0
{right arrow over (α)}b⊗Is)−e−iHT∥=
(Tδ), (79)
and success probability 1−(ϵ) may thus be controlled by choosing the error of each segment to
This requires a truncation order of
One may drop the implicit assumption that ∥H∥≤1, by rescaling H→H/α, for some normalization constant α≥∥H∥. Thus simulation of e−iHt requires
queries to C-HAM. Note that the gate cost of all queries to COEFK at
and that of REF{right arrow over (α)}b at
is typically dominated by the gate cost of all applications of C-HAM.
The ancilla overhead of the truncated Taylor series algorithm, at
qubits, may be significantly improved by choosing the sequence of unitaries in the compression gadget Lemma 2 of Section III C to be Uj=−iHAM. This straightforwardly furnishes the following result.
Corollary 1 (Hamiltonian simulation by a compressed truncated Taylor series). Let a time-independent Hamiltonian H be encoded in standard-form with normalization α and ns+nα qubits, as per Eq. (5). Then the truncated Taylor series algorithm approximates the time-evolution operator e−iHt for any |αt|≤ln2 to error ϵ using
For longer-time simulations e−iHT of duration T>t, Corollary l is applied αT/ln (2) times, each with error
This leads a query complexity
Though the compressed algorithm is still worse than the quantum signal processing approach, which uses ns+(nα) qubits, the technique is applicable to simulating time-dependent Hamiltonians, as demonstrated in Section III.
In this section, the proof of Lemma 1 is completed for the error from truncating the Dyson series at order K, and the error from approximating its terms, which are time-ordered integrals, with Riemann sums. These results provide a rigorous upper bound on the error of time-dependent Hamiltonian simulation. Let Dk be the kth term in the Dyson expansion, and let Bk be the Riemann sum of Dk with each dimension discretized into M=t/Δ segments.
It is now proven that the bounds on the ϵ1 term, which is the error due to truncating the Dyson series at order K.
Lemma 5. Let H(s): →
N×N be differentiable on the domain [0,t]. For any ϵ1 ∈ [0, 2−e], an approximation to the time ordered operator exponential of −iH(s) can be constructed such that
if one takes
Proof of Lemma 5. We start by bounding |Dk∥.
At this point, the proof is identical to the time-independent case as maxs ∥H(s)∥ is independent of time. Thus using Stirling's approximation and assuming K≥2 maxs ∥H(s)∥|t|,
Now one can find that this in turn is less than ϵ1 if maxs ∥H(s)∥te<min{ln(1/ϵ1),eln2}≤1 given that, ϵ1≤2−e and
where W is the Lambert-W function. Using the fact that for x≥1, W(x)≥(ln(x)+1)/2 and ln(eln2)<1 one can obtain the simpler bound
It is now proven that the bounds on the ϵ2 term for the error from approximating the Dyson series with its Riemann sum.
Lemma 6. Let H(s): →
N×N be differentiable on the domain [0,t]. Let one also define the quantities
ds. For integer K≥0 and ϵ2>0,
by choosing any M such that
Proof of Lemma 6. One first expands the time-ordered evolution operator using the Dyson series. One can then examine the error incurred in evaluating a given order of the Dyson series for a small hypercubic region of sidelength Δ=t/M. One can then upper bound the maximum number of such hypercubes within the allowed volume and use the triangle inequality to argue that the error is the product of the number of such hypercubes and the maximum error per hypercube.
Since H(s) is a differentiable function it holds from Taylor's theorem that for any δ«1 and computational basis states |x,|y
,
Since computational basis states form a complete orthonormal basis it follows through norm inequalities that
H(s+δ)=H(s)+δH(s)+o(∥H(s)∥max N2δ). (86)
One then has from Taylor's theorem and the triangle inequality that
Since this equation holds for all r, it also holds in the limit as r approaches infinity. Therefore
∥H(s+δ)−H(s)∥≤∫0δ∥{dot over (H)}(s)∥ds. (88)
Next, let one consider the error in approximating the integral over a hypercube to lowest order and let one define the hypercube to be C with x1, . . . , xq being the corner of the hypercube with smallest norm. First note that in general if Aj and Bj are a sequence of bounded operators and ∥·∥ is a sub-multiplicative norm then it is straight forward to show using an inductive argument that for all positive integer q.
By applying this in combination with Eq. (88) to region C, the error induced is
There are two regions in the problem. The first region, which one calls the bulk, is the region that satisfies all the constraints of the problem namely bulk :={(t1, . . . , tq):└t1/Δ┘>. . . >└tq/Δ┘}. Thus for any index x1, . . . , xq to a hypercube in the bulk, the ordering of terms H(x1+t1) . . . H(xq+tq) in the integrand of Eq. (90) is fixed. The second region is called the boundary which is the region in which the hypercubes used in the Riemann sum would stretch outside the allowed region for the integral. Since one can approximate the integral to be zero on all hypercubes that intersect the boundary, the maximum error in the approximation is the maximum error that the discrete approximation to the integrand can take within the region scaled to the volume of the corresponding region.
Finally one has from Eq. (90) that the contribution to the error from integration over the bulk of the simplex is
In order to understand how the error scales let one examine the partial sum over x1 for fixed k>1 is
The integral then takes exactly the same form as the original integral and so by repeating the argument q−1 times it is easy to see, even in the case where k=1, that
where the definition that xq+1:=M=t/Δ has been used and the fact that ƒ0x∥H(s)∥ds is a monotonically increasing function of x. Thus the contribution to the error from the boundary is at most
Next one desirably estimate the volume of the boundary. If a point is on the boundary then by definition there exists at least one tj such that └tj/Δ┘=└tj+1/Δ┘ taking t0=t. All other values are consistent with points within the bulk. It is then straight forward to see that (after relabeling the indexes for the summation) that the volume can be expressed as the sum over all possiole choices of such sums with at least one matched index. If one further assumes that q2Δmaxs∥H(s)∥/[αt]≤ln(2) then one has the following upper bound on boundary contribution to the error in the Dyson series:
Note that when q=1, there is no boundary contribution. Using the fact that q/(q−1)≤2, ∀q≥2, this upper bound
It then follows from summing Eq. (96) that the error is
By adding this result to that of Eq. (94)
It follows from elementary algebra that this total error is at most ϵ2 if
Expressed in terms of the number of points
the total error is at most ϵ2 if choose any M such that
The final bound on M quoted immediately follows from
Now that it has been proven that the necessary results regarding the error in the truncated Dyson series simulation, the following proof of Lemma 1 can be provided:
Proof of Lemma 1. This is proven by combining two intermediate results using a triangle inequality. The approximation error is upper-bounded by
One can take in both cases the errors to obey ϵ1=ϵ/2 and ϵ2=ϵ/2. The result then follows by taking the most restrictive of the assumptions of Lemma 5 and Lemma 6. □
Having described and illustrated the principles of the disclosed technology with reference to the illustrated embodiments, it will be recognized that the illustrated embodiments can be modified in arrangement and detail without departing from such principles. For instance, elements of the illustrated embodiments shown in software may be implemented in hardware and vice-versa. Also, the technologies from any example can be combined with the technologies described in any one or more of the other examples. It will be appreciated that procedures and functions such as those described with reference to the illustrated examples can be implemented in a single hardware or software module, or separate modules can be provided. The particular arrangements above are provided for convenient illustration, and other arrangements can be used.
This application claims the benefit of U.S. Provisional Application No. 62/656,517 entitled “HAMILTONIAN SIMULATION IN THE INTERACTION PICTURE” and filed on Apr. 12, 2018, which is hereby incorporated herein in its entirety.
Number | Date | Country | |
---|---|---|---|
62656517 | Apr 2018 | US |