This invention relates to classical and quantum computations, specifically to implementations of quantum computing and simulations via Monte Carlo on classical computers.
Quantum mechanics and quantum field theories provide the most accurate description of almost everything in the known physical world, with the only exception of extremely strong gravitation. Vastly many questions in physics, chemistry, materials science, and even molecular biology could be answered definitively by solving a set of well established quantum equations governing a quantum system, which refers to a generic physical system consisting of particles and fields that mediate electromagnetic, weak and strong interactions, even non-extreme gravitational forces, so long as said particles and fields conform to the laws of quantum mechanics. Such a quantum system usually involves a large number of particles and modes of fields, and the noun quantum system is often premodified by an adjective many-particle or many-body [1-3] to stress that the sum of the number of particles and the number of modes of fields is large, usually much greater than 1, although such adjective many-particle or many-body is sometimes omitted but implied and understood from the context. A formal definition of quantum physics or system will be given in the detailed description, whose mathematical rigor is needed to clearly present and discuss the problem of solving or simulating a given quantum system, as well as to measure the computational complexity incurred by a solution. Computational applications solving quantum equations include but are not limited to two general classes of problems, where one class consists of static problems that need to find the eigenvalues and eigenstates or the stationary distribution or the expectation value of a quantum observable, such as a time-independent Hamiltonian or the associated (thermal) density matrix, which is also known as the (quantum) statistical operator or the Gibbs operator [4-8], whose position (that is, coordinate in a configuration space [1]) representation is called the Gibbs kernel, whereas the other class consists of dynamic problems that need to solve for the time evolution of a quantum state in the Schrödinger representation or a quantum observable in the Heisenberg representation [1, 9] when dealing with a closed quantum system, or of a reduced density matrix governed by the Liouville-von Neumann equation or the so-called master equations [10, 11] for an open quantum system. However, the categorization of computational problems into static and dynamic classes is only relative, and the two classes of problems are often interchanged during formulations and solutions. Examples for solving a static problem using dynamic evolution include the homotopy method for eigenvalue problems [12, 13], the equation-of-motion method for computing the eigenvalue distribution of large matrices on a classical computer [14, 15], and Kitaev's quantum phase (eigenvalue) estimation algorithm as a quantum counterpart [16]. Conversely, for examples of mapping the dynamic execution of a computational algorithm into finding the minimum energy and state of a static system, there are the so-called quantum-dot cellular automata [17, 18] still in the realm of classical computing, and the celebrated paradigm of ground state quantum computation (GSQC) [19-22], where a given quantum algorithm is mapped into a designer Hamiltonian and a simulation of its ground state, which encodes the history of the quantum evolution prescribed by said quantum algorithm.
Ostensibly due to the exponentially exploding dimension of the Hilbert space that is needed to describe the state of a quantum system, it can be exceedingly hard to solve the quantum equations or simulate an eigenstate or dynamics of even a moderately sized quantum system on a classical computer [23]. Various quantum Monte Carlo (QMC) approaches [24-28] have the potential to break the curse of dimensionality, by mapping a non-negative ground state wavefunction or Gibbs kernel of a quantum system into a classical probability density, and simulating a random walk that embodies importance sampling of such a probability distribution. Given positivity of a concerned wavefunction or Gibbs kernel, QMC is arguably the only general and exact numerical method that is free of uncontrollable systematic errors due to modeling approximations, providing reliable and rigorous simulation results upon numerical convergence. Unfortunately, previous QMC procedures for many quantum systems, especially those involving multiple indistinguishable fermions which represent the vast majority of atomic, molecular, condensed matter, and nuclear systems, suffer from the notorious sign problem [23, 28-31] that leads to an exponential slowdown of numerical convergence, due to the presence and cancellation of positive and negative amplitudes, when no computational basis is known to represent the concerned ground or thermal state by a non-negative wavefunction or Gibbs kernel. At the fundamental and theoretical level, as Feynman keenly noted [32], a defining characteristic, possibly the single most important aspect, setting quantum mechanics and computing apart from classical mechanics and computing, seemed to be the presence and necessity of a sort of “negative probability” in the quantum universe, endowing the power to represent and manipulate “negative probabilities”. Indeed, their perceived inability to deal with “negative probabilities” efficiently was believed to fundamentally limit the power of classical mechanics and computers in terms of simulating their quantum counterparts and solving computational problems that quantum computers are predicted and believed to excel. The persistently unsolved status of the sign problem in the past, compounded by the piling of “evidence problems” that had an efficient quantum solution but no good classical algorithm being known or even thought possible [33-39], had fueled a pervasive belief that quantum computers were inherently more powerful than classical machines, and there existed certain hard computational problems which were amenable to polynomial quantum algorithms but could not be solved efficiently on classical computers, or in the terminology of quantum complexity theory [40-42], that the computational complexity classes of bounded-error quantum polynomial time (BQP) and quantum Merlin Arthur (QMA) are strictly proper (i.e., larger) supersets of the classical classes of bounded-error probabilistic polynomial time (BPP) and 1-message Arthur-Merlin interactive proof
Bucking the common and popular belief, here I shall disclose general and systematic solutions to the dreaded sign problem, thus establish BPP=BQP, by identifying and characterizing a class of quantum Hamiltonians/systems called separately frustration-free (SFF), whose Gibbs wavefunctions have a nodal structure that can be efficiently computed by solving a small subsystem involving a small number of dynamical variables associated with a constituent few-body interaction. When an SFF Hamiltonian is also fermionic stoquastic and spatially local, it is dubbed SFF fermionic Schrödinger (SFF-FS). Alternatively, when a general SFF Hamiltonian has each of its constituent few-body interactions essentially bounded, it is called SFF essentially bounded (SFF-EB). As a first Major Utility, it will be demonstrated that any SFF-FS or SFF-EB Hamiltonian can be efficiently simulated on a classical probabilistic machine, in the sense that a Gibbs wavefunction of such a Hamiltonian can be efficiently sampled via a Monte Carlo procedure on a classical probabilistic machine, subject to a polynomially small error in a properly defined sense. Next, I will construct a special class of Hamiltonians/systems called SFF doubly universal (SFF-DU), that are both SFF-FS and SFF-EB consisting of distinguishable and interacting bi-fermions, each of which comprises two non-interacting identical spinless fermions moving in a three-well potential on a circle. Then as a second Major Utility, it will be demonstrated that SFF-DU Hamiltonians/systems consisting of only bi-fermions are universal for quantum circuits and computations, as such, any many-body quantum system can be mapped onto and simulated by an SFF-DU system of bi-fermions.
The combination of the two major utilities effectively asserts a third Major Utility that the two computational complexity classes, bounded-error probabilistic polynomial time (BPP) [45] and bounded-error quantum polynomial time (BQP) [45], are actually one and the same. That quantum computing and mechanics are just classical computing and probability up to polynomial reduction, is too good and true. Any quantum algorithm/circuit in BQP can be efficiently simulated by a Monte Carlo algorithm running on a classical computer. Such simulation, indeed implementation of quantum computing, is called Monte Carlo quantum computing (MCQC), which is not to be con-fused with the still sign-problem-prone, conventional quantum Monte Carlo simulation of quantum computation [46]. The methods and algorithms presented here not only solve the sign problem that has plagued Monte Carlo simulations in many areas of science and technology, but also open up new avenues for developing and identifying efficient probabilistic algorithms from the vantage point of quantum computing. All known and to be discovered BQP algorithms reduce to BPP solutions. It should be noted that the BPP or BQP class of computational problems as referenced here is not to be understood as limited strictly to the family of decision or promise problems [47, 48] on a classical or quantum computer. Rather, the BPP or BQP class should be broadly interpreted as to represent general types of computational problems that are efficiently solvable on a classical or quantum machine. Indeed, it has been well established and widely known that a great number of computational problems for function evaluation, objective optimization, and matching or solution search, etc., are reducible or polynomially equivalent to BPP or BQP problems, in that, the answer to a function/optimization/search problem can be obtained efficiently by solving one or a polynomially bounded number of BPP or BQP problem(s). Moreover, it is often quite straightforward in practice to modify and adapt only slightly a BPP or BQP algorithm to an efficient procedure for solving a function/optimization/search problem.
Methods, software systems and related computer program products, involved computer-readable source and machine codes as well as numerical data, hardware systems and related physical devices storing, processing, and executing involved computer-readable source and machine codes as well as numerical data, are described in relation to processes of simulating a multivariable system on a classical computer, constructing a multivariable system for quantum computing, and solving a computational problem on a classical computer.
A first process of simulating a multivariable system comprises a step of receiving a multivariable system and a step of generating a plurality of result sampling points. Said multivariable system in turn comprises a configuration space containing configuration points, and a separately frustration-free (SFF) partial Hamiltonian as a combination of a plurality of directly frustration-free (DFF) or ground state frustration-free (GFF) partial Hamiltonians, as well as an objective operator whose expectation value is of interest. Said step of generating a plurality of result sampling points further comprises one step of initializing a current sampling point (CSP) variable, and another step of repeating a first iterative step and a second iterative step for a predetermined number of times, where the first iterative step executes a transition from a CSP to a new sampling point, while the second iterative step records and resets the CSP variable.
A second process of constructing a multivariable system comprises a step of receiving a description of a quantum algorithm, a step of constructing a Feynman-Kitaev (FK) register, and a step of constructing an SFF partial Hamiltonian. Said description of a quantum algorithm in turn comprises a plurality of qubits, an initial state, a linearly ordered sequence of a plurality of quantum gates, and a quantum measurement operator. Said step of constructing a Feynman-Kitaev (FK) register further comprises a sub-step of allocating a plurality of FK logic bits and a plurality of FK clock bits, a sub-step of creating a plurality of FK time projectors, and a sub-step of creating a plurality of FK time propagators, where said plurality of FK time propagators are divided into a first group of first FK time propagators and a second group of second FK time propagators. Said step of constructing an SFF partial Hamiltonian further comprises a sub-step of creating an FK state initializer, a sub-step of creating a first FK state operator as a direct sum of a plurality of first FK state propagators, with each said first FK state propagator being a tensor product between a first FK time propagator and a quantum gate acting on said FK logic bits, and a sub-step of creating a second FK state operator as a direct sum of a plurality of second FK state propagators, with each said second FK state propagator being a tensor product between a second FK time propagator and a quantum gate acting on said FK logic bits.
A third process of solving a computational problem comprises a step of generating data or signals describing a multivariable system and a step of simulating said multivariable system. Said step of generating data or signals describing a multivariable system further comprises a sub-step of allocating a configuration space, a sub-step of creating an SFF partial Hamiltonian, and a sub-step of generating an objective operator. Said step of simulating the multivariable system uses said first process of simulating a multivariable system.
Separately Frustration-Free Partial Hamiltonians and Systems
Although not being stated explicitly, it should be understood that the physics and properties of any quantum system involve such physical constants as the Planck constant, the speed of light in vacuum, the elementary electric charge, and the masses of elementary particles. For succinctness of mathematical expressions, it is good to work with a “natural” unit system where the relevant physical constants take the numerical value 1, or to have the Schrödinger and other physical equations nondimensionalized, so that the physical constants do not manifest in the Hamiltonians and other physical quantities (operators).
A theory on a physical system, whether classical or quantum, needs to start with identifying a list of dynamical variables [49, 50] associated with the particles and/or fields comprising the physical system. Commonly used dynamical variables include spatial positions and/or spins of particles and spatial distributions of field intensities or potentials, although other physical quantities may be used as well, and could be particularly convenient for specific problems of interest. Different choices of a list of dynamical variables associated with the same physical system are mathematically and physically equivalent, when they are related through either a canonical transformation [50, 51] for a classical theory or a unitary transformation [1, 49, 52] for a quantum theory, and in both the classical and quantum cases an extension by/to the so-called gauge transformations [1, 53-57].
In the non-relativistic regime, a general many-body quantum system may consist of a fixed number of particles belonging to multiple species of boltzmannons, bosons, and fermions, that interact with and/or through certain classical fields as well as quantized modes of fields that are bosonic in nature, whereas in the relativistic regime, i.e., the realm of quantum field theory, both particles and fields could move in and out of quantized modes, and the number of particles or field quanta in any particular quantized mode may not be conserved. For considerations of QMC and quantum computing, if quantum systems consisting of many fermionic particles could be well handled, then bosonic particles and field modes would not impose additional challenges, because indistinguishable bosons can be artificially labeled and a bosonic system/subsystem can be simulated by a boltzmannonic random walk in conjunction with Monte Carlo sampling of particle label permutations. Permutation-averaging N-particle Gibbs wavefunctions, N∈ amounts to calculating the permanent of an N×N non-negative matrix, which is known to admit a fully polynomial randomized approximation scheme [58, 59], that, with a specified approximation error that is bounded by a polynomial of the problem input size, can be cast into a decision version that belongs to the class of BPP promise problems [47,48]. Another excuse for not having to emphasize and work explicitly on bosons in the present considerations of QMC and quantum computing is that a quantum system consists only of fermions can be configured into a universal quantum computer/simulator, which can be so programmed that its effective physics is a homomorphic image, or called a homophysical image (the notion of homophysics will be precisely defined shortly), of any general quantum system containing particles of any type of statistics. Furthermore, an individual boltzmannon can be regarded as a single fermion of a unique species. Therefore, it is without loss of generality to focus on many-body quantum systems consisting of multiple species of fermions. A quantum system and its Hamiltonian are said to be many-fermion when they involve at least two identical fermions of the same particle species.
In a canonically quantized theory [49,54,60] with the position (that is, the canonical coordinate [49]) representation, a quantum system could involve both continuous and discrete dynamical variables being represented by continuous and discrete canonical coordinates [49], thus move in a Cartesian product between a connected continuous topological space [61] and a finite discrete point set. Physical motions and quantum mechanics in a finite discrete space can be treated nicely and simply using the theory of finite-dimensional vector spaces [62] and matrices, as epitomized by the familiar physics and theory of the electron spin and Pauli matrices. By contrast, quite a bit of advanced and sophisticated mathematical machinery is required to render a quantum theory of continuous motions in a topological space with sufficient generality and necessary rigor, that involves topological manifolds, Riemannian differential geometry, and functional analysis, in particular, Wiener measures and path integrals, operator analysis, and spectral theory.
Definition 1. (Configuration Space [1])
In general, a configuration space refers to a Cartesian product
between a connected continuous topological space and a finite discrete (topological) space
, that is a finite point set
endowed with the discrete topology. A point in such a configuration space
is meant to represent the values of a chosen list of dynamical variables associated with a plurality of constituent particles and/or fields comprising a quantum system.
A configuration space defined as such includes two special cases, one of which being with
=θ or
containing but a single isolated point, when it reduces to
≅
, while the other with
=θ or
containing just one discrete point, when it reduces to
=
. Notwithstanding the adoption of a generic Cartesian product
representing an all-encompassing continuous-discrete product configuration space, and accordingly, our generality-striving mathematical formulations in this specification, it is worth noting that an actual or model physical system with either all discrete or all continuous dynamical variables can be made universal, in terms of both numerical simulations for or based on it and the computational power of it as a computing machine.
In practices of numerical computations as well as complexity analyses, every continuous coordinate can be discretized into a lattice at a finite resolution, with the entailed discretization error being controlled to decrease at least polynomially as the resolution gets finer. Certain required accuracy in approximating the quantum wave distribution or the expectation value of an observable sets an upper bound for the grid size of discretization. Such discretization is widely adopted in various fields like digital signal processing, computational physics, and lattice (quantum) field theory, where a finite grid resolution effects an upper limit to the frequency or energy scale, that, if set sufficiently high, should not affect the low-frequency dynamics or low-lying energy states of interests.
Conversely, any discrete dynamical variable, such as a spin-½, can be homophysically simulated (in a sense to be precisely defined shortly) by a subspace of low-energy states of a particle moving in a continuous space with a multi-well potential, where the well depths and tunneling strengths between wells are suitably designed, such that well-localized wave packets resemble discretely valued states of the discrete dynamical variable, and tunnelings of wave packets between potential wells simulate couplings between the discretely valued states of the discrete dynamical variable in question. Indeed, one interesting development later in this specification will be to construct a system of two fermions moving continuously on a circle that simulates/realizes a qubit, or to be exact, a rebit, which is a quantum two-state system with the wavefunction restricted to be real-valued.
Consider a physical system consisting of S∈ fermion species, each of which indexed by s∈[1, S] has ns∈
identical particles moving in a configuration space
with s being a finite set of discrete points having a cardinality
and
s) being a (dcS∈)-dimensional Ck-differentiable manifold, k≥1, that is a connected second-countable Hausdorff topological space of a set Ms with a topology Ts, equipped additionally with a Ck-smooth atlas
s, which is a collection of charts (also called coordinate patches)
where As is an index set, {Uα}α∈A(Uα)⊆
d
φβ(Uα∩Uβ) is Ck-smooth; still further, the atlas
s is assumed maximal, in the sense that, if (U, φ) is any chart Ck-compatible with all (Uα, φα), α∈As, then (U, φ)∈
s. The topological space (Ms,
) is called locally Euclidean by the equipment of an atlas. Being both locally Euclidean and second-countable implies that the manifold
s as a topological space is locally compact, normal, metrizable, paracompact, path-connected, separable, and admits a countable partition of unity {ρi
subordinate to a predetermined open cover of Ms, such that ∀i∈
, ρi≥0, ∀p∈Ms, Σi=1∞ρi(p)=1, where the functions {ρi
are Ck-smooth, the collection of supports {supp(ρi)
is locally finite [63,64]; either the predetermined open cover is also countable as {Ui}
, and supp(ρi)⊆Ui holds ∀i∈
, or else, the {ρi
functions and the open cover sets are not necessarily in one-to-one correspondence, but then {supp(ρi)
can be assumed all compact [63]. The configuration space
is considered to possess a product topology, with s being interpreted as a compact discrete topological space.
To better facilitate analysis and physics therein, the differentiable manifold s is further assumed to be equipped with a smooth metric {gp}p∈M
s, g) a Riemannian manifold, with the metric g defining an inner product on each tangent space TpMs, p∈Ms, and inducing naturally a length measure
dt along any piecewise C1-smooth path γ: ⊇[a, b]
Ms [64, 65], which in turn defines a distance function distg(q, r), or just dist(q, r) when there is no risk of ambiguity, as dist
∀q, r∈Ms, with
being the set of all piecewise C1-smooth paths on Ms connecting q and r, and arg inf {∫γdLg (γ): γ∈Γ[q,r](Ms)} being the minimal geodesic on Ms between q and r. The distance function distg(⋅, ⋅) metricizes Ms and the metric topology (Ms, distg) coincides with the atlas topology s. Also, the metric g induces a volume measure Vg on a suitable measurable space (Ms, Σs) with Σs being a certain σ-algebra of subsets of Ms that contains the topology
s, where for each U∈
s, the volume
with {ρi being a countable partition of unity subordinate to the atlas {(Uα, φα)}α∈A
, there exists an α(i)∈As such that supp(ρi)⊆Uα(i). It is further assumed that the measure space (Ms, Σs, Vg) is σ-finite. The smallest Σs possible is the Borel σ-algebra of
, denoted by ΣsB, on which Vg defines a Borel measure. The Borel measure space (Ms, ΣsB, Vg) is separable [66, 67] in the sense that, the quotient space of ΣsB modulo sets of measure zero is separable as a topological space, induced by the metric
called the measure of symmetric difference, ∀E, F∈ΣsB, where [X]∈ΣsB/Vg represents the equivalence class of X∈EsB modulo sets of measure zero, that is,
A drawback of the Borel measure is incompleteness. Fortunately, the Lebesgue measure, being separable and complete as desired, can be constructed by defining an outer measure V*g for any subset X⊆Ms as
calling any subset Y⊆Ms Lebesgue measurable if V*g (X)=V*g (X∩Y)+V*g(X∩Yc) holds ∀X⊆Ms, and identifying V*g on the σ-algebra ΣsL of Lebesgue measurable sets as the Lebesgue measure Vg [66, 67].
In a canonically first-quantized theory with the coordinate representation, the ns indistinguishable particles of each fermion species s∈S may be artificially labeled, so that their spatial configuration, namely, the collective local coordinates of the ns identical fermions, can be represented by a point on a product space
with sn
as a spatial substrate that hosts a number of identical particles of species s is called the substrate space, and
is called the substrate space dimension, where dim(s) is given by the standard definition of dimension, or called topological dimension, of a connected topological space [61, 69], which for a well-behaved topological space, particularly a differentiable manifold
s, coincides with the minimum number of independent numerical values, also called scalar components, that are needed to label and distinguish all of the points in
s, while |
s| denotes the cardinality of
s, namely, the number of discrete points in the set
s, and log2|
47s| characterizes the minimum number of bits needed in a binary string used to label and distinguish all of the discrete points in
47s. By contrast, the product manifold Csn
particles is called the configuration space, or to emphasize, the many-body configuration space, which is said to have a configuration space dim
Naturally, the interactions among particles and the wave function of the whole system consisting of S fermion species can be treated mathematically under the auspices of a (product) configuration space
which is endowed with the product topology, the product atlas, and the product Riemannian metric still denoted by g, together with the product length measure for paths and the product volume measure still being denoted by Lg and Vg respectively. A note is in order on the notation of a configuration point [1] or a coordinate variable on a substrate space
or a configuration space
In many situations, a single variable like q∈s or q∈
would be sufficient and self-explanatory. But when it is necessary or otherwise useful, an ordered pair such as (qC, qD)∈
may be used to coordinate a point in a substrate or configuration space, with the continuous and discrete canonical coordinates written down explicitly and separately.
On a many-body configuration space , the above-named product length measure Lg induces a product distance function dist
∀q, r∈, which respects the product topology on
. The overall configuration space dimension is obviously dim
The redundancy due to the artificial labeling of identical particles can and has to be removed by requiring the wavefunction to satisfy the exchange symmetry under permutations of identical particles. Specifically, for the system of S fermion species, let (q1, . . . , qs)∈Πs=1S sn
, with
1≤s≤S denoting the configuration of the ns artificially labeled identical fermions of the s-th species, where ∀s∈[1, S], ∀n∈[1, ns],
with vsn∈s,
being a chart (coordinate patch) on an s manifold, then any
-valued wavefunction
must be in the Hilbert space
with a scalar field ∈{
,
}, and a group
called the exchange symmetry group, where ∀s∈[1, S], Gs is the symmetry group of ns labels [70], i.e., the symmetry group of exchanging the ns identical fermions of the s-th species, so that for each (π∈Gex)=Πs=1S(πs∈Gs), it holds
with sign
sign(πs), where
and sign(πs)=±1 denotes the sign of the permutation πs∈Gs [70]. Alternatively, the redundancy in the coordinate representation for the system of S fermion species can be removed by using the quotient manifold /Gex of the configuration space
modulo the exchange symmetry group Gex, whose action on
is obviously homeomorphic [71-76]. More formally, the quotient manifold
/Gex forms an orbifold [77, 78], on which a typical point [q]∈
/Gex is actually an orbit
of a certain q∈, and a
-valued function can be defined and take value on each [q]∈
/Gex independently. The set of square Lebesgue-integral functions on
/Gex, called wavefunctions on
/Gex, form a Hilbert space L2(
/Gex), which is isomorphic to LF2(
), with both being regarded as one and the same. It is worth noting that the Lebesgue spaces L2(
) and LF2(
) are necessarily separable Hilbert spaces due to the separability of the Lebesgue measure space (C, ΣL(
), Vg), with ΣL(
)=Πs=1S(ΣsL)n
Many-body systems considered in this specification are always assumed to consist of a potentially large number S∈ of fermion species, where each species s∈[1, S] has at most a fixed number nmax∈
of identical fermions that move on a substrate space
whose substrate space dimension ds does not exceed a fixed number dmax∈. As the number S of species and the overall configuration space dimension Σs=1S nsds increase indefinitely, of great interest is the asymptotic scaling of the computational complexity for simulating such many-body systems.
Since Dirac and von Neumann [49, 79] laid down the foundations, a quantum theory is custom-arily formulated through a Hilbert space of state vectors and a space
of linear operators on
for physical observables. With a configuration space
being chosen,
are among the most often used Hilbert spaces, although other convenient choices, such as the Sobolev spaces [80-82]
are employed occasionally, where ∀ψ∈L2(), ∂ψ denotes the vector of first-order weak partial derivatives of as distributions. Having a Hilbert space
chosen and understood, let
denote the set of linear operators from to
that are possibly unbounded but must be densely defined and closable,
⊆
denote the subset consisting of self-adjoint operators, and
denote the Banach space of bounded linear operators [83-85]. A general Hilbert space and the sets of operators
as well as the quantum theory based on them are said to be supported by the configuration space
. The quantum physics of such a physical system is governed by one particular self-adjoint operator H∈
, called the Hamiltonian, that can be defined through a semibounded symmetric quadratic form QH(ϕ, ψ) over a dense subset dom(H) of the chosen and understood Hilbert space
[84, 85], such that, formally, the Hamiltonian H acts on Dirac bra and ket vectors to make
It is often necessary and/or convenient to consider a restriction of H to a closed Hilbert subspace ⊆
, denoted as
which is a self-adjoint operator defined on such that
ϕ|H
)|ψ
=
ϕ|H|ψ
, ∀ϕ, ψ∈
, in other words,
now as an operator defined over the entire , where P
represents the orthogonal projection onto the closed Hilbert subspace
. Conversely, the Hamiltonian H is called an extension of H
from the Hilbert space
to a Hilbert superspace
.
Let be an open subset of the configuration space
, and
{q∈\
}, p∈
, p≥1 be the closed subspace of Lp-integrable and Dirichlet boundary-conditioned functions. In particular, with p defaulting to the most common value 2, let
denote the closed Hilbert subspace of Dirichlet boundary-conditioned functions, called the Dirichlet space on . One particularly useful restriction of a Hamiltonian H∈
is H
=PDiri(
)HPDiri(
), or written as H|
=P
HP
in a shorthand manner, where
denote respectively a restriction and the orthogonal projection to the Dirichlet space Diri(). Formally, H|
can be represented as H|
=H+
(q), q∈
, interpreted as an extended operator form sum [84, 85], where V
(q)=0, ∀q∈
, V
(q)=+∞, ∀q∉
confines the quantum wave to within the domain
. A general operator O∈
is said to satisfy the Hopf lemma (in
), also known as the Hopf-Oleinik lemma, as well as the strong version of the Hopf extremum principle [86-89], if for any given open subset
⊆
, and any Dirichlet eigenfunction ψ∈Diri(
, p) of O such that Oψ=λψ in
, λ∈
∈{
}, it holds true that ∥∂ψ(q)∥≠0 for all q∈∂
, where ∂ψ represents the vector of first-order partial derivatives of ψ, ∥⋅∥ is a certain norm for such vectors, while ∂
denotes the boundary of
; either p=2 is implied as conventional, or otherwise, a value for p∈
, p>1 will be given explicitly. Conversely, an operator O∈
(
) is said to satisfy the anti-Hopf lemma on a specific open subset
⊆
, when each Dirichlet eigenfunction ψ∈Diri(
, p), p∈
, p>1 of O such that Oψ=λψ in
, λ∈
∈{
, C} also satisfies the Neumann boundary condition, namely, ∂ψ(q)=0 for all q∈∂D.
Another useful Hamiltonian restriction has to do with the mathematical description for a system of many identical fermions associated with a configuration space , where the governing fermionic Hamiltonian HF is a self-adjoint operator on the Hilbert subspace LF2(
)⊆L2(
), that is always assumed to be derived from
where HB∈0(
) is a boltzmannonic Hamiltonian, i.e., a Hamiltonian associated with a system of all distinguishable particles, that is exchange-symmetric as [HB, PF]=0 on L2(
), while
is the orthogonal projection from L2() onto LF2(
), such that ∀ψ∈L2(
),
Such exchange-symmetric HB associated with each fermionic Hamiltonian HF is called the base boltzmannonic Hamiltonian of HF, written symbolically as HB=Base(HF). In the following, some Hamiltonians may have the subscript “B” or “F” omitted, when it is unnecessary to make the distinction, or there is no ambiguity in the context as to which is the case. By a slight abuse of notation, let Base(HB)=HB when the operand HB is already a boltzmannonic Hamiltonian, so that the operator Base(⋅) may be applied generally to any exchange-symmetric Hamiltonian, either fermionic or boltzmannonic. Although it may not be used frequently in this specification, the notion of base boltzmannonic Hamiltonian can be straightforwardly generalized to a bosonic Hamiltonian describing a system of multiple identical bosons, as well as a Hamiltonian representing a mixed system consisting of a multitude of particle species, obeying possibly different types of statistics: boltzmannonic, bosonic, or fermionic.
Definition 2. (Quantum Physics and Systems)
A quantum physics/system, or called a physics/system in short, is an ordered triple (), with
being a generic continuous-discrete product configuration space,
being a Hilbert space of state vectors supported by , and
being a Banach algebra of bounded linear operators on , which contains a strongly continuous semigroup of Gibbs operators {exp(−T
H)
parametrized by an imaginary time
∈[0, ∞)} and generated by a lower-bounded and self-adjoint operator H, thereby designated as the Hamiltonian measuring the total energy of said quantum system.
A lower-bounded and self-adjoint operator will be called a partial Hamiltonian for brevity. For any partial Hamiltonian h∈0(
), a shorthand notation h∈ log
will be used to signify the fact a certain constant
∈
,
>0 exists such that exp(−
h)∈
.
Definition 3. (Dimension and Computational Size of a Quantum Physics/System)
For a given quantum physics/system () associated with a generic continuous-discrete product configuration space
, with
the dimension of the quantum physics/system is identified with dim()=Σs=1S ns[dim
while the computational size of the quantum physics/system is identified with the computational size of defined as size
where
is called the diameter of , with
being called the diameters of and
respectively. The designated Hamiltonian H∈ log
measuring the total energy of the quantum physics/system (
) is also said to have a computational size size
Definition 4. (Homophysical and Isophysical Mapping between Quantum Physics/Systems)
Given two quantum physics/systems () and (
), with each configuration space
i, i∈{1, 2} being associated with an implied measure space (
i,
i, Vg
i being a σ-algebra of measurable subsets of
i and containing the supports of all of the wavefunctions in
i, i∈{1, 2}, a homophysical mapping
: (
)
(
), also called a homophysics from (
) to (
2,
2,
2), is an injective mapping that sends any measurable set D1∈
1 to a measurable set
∈2 which is unique modulo a set of Vg
1 to a unique
and sends any O1∈1 to a unique
such that 1) the correspondence 1
(D)∈
2 embeds the measure space (
1,
1, Vg
, Vg
1
ψ
(ψ)∈
2 embeds the Hilbert space
1 into
2; 3)
1
O
(O)∈
2 embeds the Banach algebra
1 into
2; 4) there exists a constant c∈
, c>0, called the homophysical constant, with which
(ψ)|
(O)|
(ϕ)
=c
ψ|O|ϕ
holds ∀ψ, ϕ∈
1, ∀O∈
1; 5) in case the physics/systems (
) are variable and supported by configuration spaces
i, i∈{1, 2} of varying computational sizes, it is further required that size(
2) should be upper-bounded by a fixed polynomial of size(
1), and the homophysical constant c satisfy the condition of c+c−1 being upper-bounded by another fixed polynomial of size(
1), where size(
1) and size(
2) are the computational sizes of the configuration spaces (as well as of the quantum physics/systems).
A homophysical mapping : (
)
(
) is called isophysical and referred to as an isophysics between (
) and (
), when the mapping is also surjective, namely, the three embeddings
1
∈
ψ
(ψ)∈
2, and
1
O
(O)∈
2 are all isomorphisms. In general, any homophysics
: (
)
(
) induces and can be identified with an isophysics between (
) and (
(
1),
(
1),
(
1)), with the latter called a subphysics of (
), when
Under a homophysics (
, the image
is said to represent or implement the preimage (
) through
. The symbols
and
will be used such that, ψ1
ψ2 or ψ2
ψ1 and O1
O2 or O2
O1 indicate that a ψ1∈
1 and an O1∈
1 are homophysically mapped to ψ2=
(ψ1)∈
2 and O2=
(O1)∈
2 through the homophysics
, or in other words, that ψ2=
(ψ1) and O2=
(O1) implement ψ1∈
1 and O1∈
1 through the homophysics
. When said
is an isophysics, the two physics/systems are said to be isophysical to each other, and represent or implement each other via the isophysics
or
−1, and the symbol
will be used such that, ψ1
ψ2 and O1
O2 state that ψ1∈
1 and O1∈
1 are isophysically equivalent to ψ2=
(ψ1)∈
2 and O2=
(O1)∈
2 via the isophysics
.
For any partial Hamiltonian H∈, the operators {exp(−τH)} are well defined Vτ∈[0, ∞) and compact ∀τ∈(0, ∞), which form a strongly continuous semigroup indexed by τ∈[0, ∞) [92, 93]. Such {exp(−τH)}τ≥0 are called the Gibbs operators or thermal density matrices generated by H. For any fermionic partial Hamiltonian HF=PFHBPF, the corresponding Gibbs operator is simply exp(−τHF)=exp(−τPFHBPF)=PF exp(−τHB) PF. Therefore, it is without loss of generality to focus on Gibbs operators generated by boltzmannonic partial Hamiltonians.
Definition 5. (Gibbs Transition Amplitudes, Gibbs Kernels, Gibbs Wavefunctions)
For any given (r, q)∈2, the point value
r|exp(−τH)|q
of a Gibbs operator exp(−τH), τ>0 is called the Gibbs transition amplitude between the configuration points q and r. The bivariate function
⋅|exp(−τH)|⋅
: (r, q)∈
2
|exp(−τH)|q)∈
is called a Gibbs kernel, also known as a thermal Green's function, or a thermal density matrix. Acting on a prescribed initial state ϕ∈
as a ket vector, the Gibbs operator exp(−
H) produces another wavefunction ψ=exp(−τH)ϕ∈
, such that
which is called a forward Gibbs wavefunction, with H, τ, ϕ being implied and understood. By the self-adjointness of H and exp(−τH), the Gibbs operator acting on ϕ* as a bra vector produces
which is called a backward Gibbs wavefunction, specifically the backward counterpart of the forward Gibbs wavefunction ⋅|exp(−τH)|ϕ
, with H, τ, ϕ being implied and understood.
In particular, if λ0∈ is the smallest eigenvalue of a self-adjoint operator H∈
, then the ground state energy of the shifted partial Hamiltonian
is zero, and the large-τ limit of the forward Gibbs wavefunction exp(−τH)ϕ, that is limτ→+∞ exp(−τH)ϕ, coincides with the ground state of H′ as well as H, so long as the initial state ϕ∈ has a non-vanishing overlap with said ground state. It is for this reason that the ground state of a partial Hamiltonian is considered as a special case, or more precisely, the large-τ limiting case, of a forward Gibbs wavefunction. Similarly, the restriction of a Gibbs kernel
⋅|exp(−τH)|q
: r∈
r|exp(−τH)|q
∈
for any fixed q∈
is considered to be a forward Gibbs wavefunction, as the limit of a sequence {
r|exp(−τH)|ϕ
} associated with a sequence {ϕ}⊆
of wavefunctions, whose supports approach the point set {q}⊆
. The self-adjointness of H and exp(−τH) puts backward and forward Gibbs wavefunctions on the equal footing, in that every property or statement for a forward Gibbs wavefunction holds true for its backward counterpart, moreover, when the scalar field
contains real numbers only, each backward Gibbs wavefunction
ϕ|exp(−τH)|⋅
as an element in
becomes the same as its forward counterpart
⋅|exp(−τH)|ϕ
. In the following, the adjective “forward” or “backward” is mostly omitted when there is no risk of ambiguity or no need to differentiate between the two cases, and the generic noun “Gibbs wavefunction” will be used to refer to either a ground state wavefunction as a limiting case, or a general Gibbs wavefunction at a finite imaginary time
∈[0, ∞), or a restriction of a Gibbs kernel having one of the two configuration coordinates fixed, when there is no risk of ambiguity or it is clear from the context which is the case.
For linear operators acting on a finite-dimensional vector space L2(), the mathematical formulations and analyses are more or less straightforward. In contrast, significantly more mathematical sophistication is needed to treat quantum motions in a continuous space up to satisfying generality and rigor. In particular, Gibbs operators generated by partial Hamiltonians associated with continuous dynamical variables are best represented using the formulation of path integral [94], also known as functional integral [95-97], based on a Gibbs measure μG with respect to a reference measure μR, on a σ-algebra
of the Wiener space [98-101] of continuous functions/paths
γ is continuous}, ≥0, where
contains the uniform topology induced by the distance function dist∞(⋅, ⋅): W
×W
, such that ∀γ1, γ2∈W
,
and the reference measure μR is strictly positive, with respect to which the Gibbs measure μG is absolutely continuous, so that the Radon-Nikodym derivative dμG/dμR exists and is often written as dμG/dμR=exp{−UG[γ(⋅)]}, where UG[γ(⋅)] is a possibly complex-valued Gibbs energy functional depending on γ(⋅)∈W.
When the discrete space contains more than one points, the configuration space
comprises mutually disconnected components {×{v}: v∈
}, such that
and each component ×{v}, v∈
is a connected continuous submanifold that is isomorphic to
. Given a boltzmannonic partial Hamiltonian HB∈
0(
×{v}) supported by any component
×{v}, v∈
as a connected submanifold itself, the associated Gibbs measure facilitates a representation of the Gibbs operator exp(−
HB),
≥0 via path integral, such that
where (q, v) and (r, v) denote typical points on ×{v}, with q, r∈
, μG[q,r] is a Gibbs measure, with the reference measure μR[q,r] being a conditional Wiener measure, closely related to the so-called Brownian bridge measure [97, 100, 101], on the subset of constrained continuous paths {γ(⋅)∈
and the boltzmannonic Hamiltonian HB being sufficiently regular to guarantee the existence of the UG[γ(⋅)] functional whose real part is lower-bounded. It is clear that exp(−HB),
>0 is an integral operator, whose integral kernel is precisely the Gibbs kernel, defined as
such that a typical Gibbs wavefunction reads
Given a connected open subset ⊆(
×{v})⊆
, a partial Hamiltonian H∈
0(
), being boltzmannonic or otherwise, and an associated Gibbs operator exp(−
H),
>0 can be restricted to
and subject to the Dirichlet boundary condition on the boundary ∂
, denoted as H
and exp[−τH
] respectively, by defining and evaluating the Gibbs kernel as
∀q, r∈, where [γ⊆
]Iver is an Iverson bracket [102] which returns a number valued to 1 if the path γ(⋅)⊂Wτ is fully contained in
, and 0 otherwise. Evidently, K(τH
; r, q)=0 whenever (q, v)∉
or (r, v)∉
, ∀
>0. More generally, for any partial Hamiltonian H∈
and any topologically open submanifold ε⊆, the restriction of H to ε is denoted and defined as H|ε=PL
)
L2(ε) being an orthogonal projection such that
Definition 6. (Irreducible Partial Hamiltonians and H-Connected, H-Closed, and H-Complete Submanifolds)
Given a submanifold ⊆
of a configuration space
=
×P, where
is a differentiable manifold and P is a finite discrete space, a partial Hamiltonian H∈
0(
) is called irreducible in
, when its base boltzmannonic Hamiltonian Base(H) is irreducible in
, when in turn the semigroup {exp(−τBase(H)):
≥0} is irreducible in
, in the sense that, for any Vg-measurable subsets
1⊆
,
2⊆
such that Vg(
1)>0, Vg(
2)>0, there always exist wavefunctions ψ1, ψ2∈L2(
) and a constant τ>0 such that
in which case, the submanifold is said to be H-connected. On the other hand, given a partial Hamiltonian H∈
0(
), a submanifold
⊆
is called H-closed if, for any Vg-measurable subsets
1⊆
,
2⊆(
\
), and for any ψ1∈L2(
1), ψ2∈L2(
2), ∀τ>0, it holds identically that
A submanifold ⊆
is called H-complete when it is both H-connected and H-closed.
It is almost a trivial remark that the above-defined notions of H-connectedness, H-closedness, and H-completeness apply equally well when the concerned partial Hamiltonian H is restricted to and involves only a small number of particles or dynamical variables constituting a subsystem of a large physical system in the background. Such situation arises naturally, for example, when H is one of many additive partial Hamiltonians comprising a total Hamiltonian that governs a large physical system associated with a large-sized configuration space .
Definition 7. (Lie-Trotter-Kato Decomposition of Partial Hamiltonians)
A partial Hamiltonian H is called Lie-Trotter-Kato decomposable (or Lie-Trotter-Kato decomposed) [103-108] into H=Σk=1KHk, K∈, either as a conventional algebraic operator addition, or more generally, as an extended operator form sum, with each Hk, k∈[1, K] being a partial Hamiltonian, such that, as n→+∞, (Πk=1K exp(−Hk/n))n converges to exp(−H) in the strong operator topology. The corresponding expression H=Σk=1KHk, K∈
is called a Lie-Trotter-Kato decomposition, and exp(−H)=s-limn→+∞ (Πk=1K exp(−Hk/n))n is called a Lie-Trotter-Kato product formula, with “s-lim” indicating limit in the strong operator topology.
Definition 8. (Multiplicatively Coupled Functions or Operators)
Given an algebra of functions or operators, and two subalgebras
1⊆
and
2⊆
, an element g∈
is called
1
2-multiplicatively coupled when it can be represented as
with {ƒ1i: i∈}⊆
1, {ƒ2i: i∈
}⊆
2, where
is an index set, which can be countably infinite, with a suitable topology defined on
to make sense of the infinite sum.
Let and
be two algebras of linear operators supported by
and
respectively. Use CD-multiplicatively coupled as a shorthand abbreviation for an operator O∈
(
×
) that is
-multiplicatively coupled. Armed with Lie-Trotter-Kato product formulas and well known techniques of perturbation theory, it is without loss of generality for us to focus considerations on a particular class of Hamiltonians that are CD-separately moving.
Definition 9. (CD-Separately Moving and Irreducible Partial Hamiltonians)
A CD-multiplicatively coupled partial Hamiltonian H∈0(
×
) is called CD-separately moving if it can be written as H=HC+HD, with
being called the continuous and the discrete parts of H respectively, where ∀i∈ and ∀j∈
, with the index sets
and
being either finite or countably infinite, it holds that HiC, VjC∈
0(
) and ViD, HjD∈
0(
), while ViC is
-diagonal and VjD is
-diagonal. A partial Hamiltonian H∈
0(
×
) is called CD-separately irreducible if it is both CD-separately moving and irreducible in the configuration space
×
.
A discrete part HD of a partial Hamiltonian H occurs naturally, for example, in dealing with a physical system consisting of particles with spins, or in describing quantum tunneling of electrons between nearby nano-structures such as quantum wells, quantum wires, and quantum dots, or in a model of computational physics that involves a discrete dynamical variable, such as a Hubbard model or a lattice field theory. Particularly, in the context of GSQC [19-22], where a quantum computational algorithm is reduced to solving for the ground state of a designer Hamiltonian H that involves more than one identical particles of a certain species, the bosonic or fermionic exchange symmetry, as a rigid and nonnegotiable requirement imposed on the wavefunction describing the collective state of said identical particles, can be enforced equivalently by superimposing a discrete part HD=Σπ∈G being the total number of identical particles of the concerned species. Since Gex
π
sign(π)∈{+1, −1} is a group homomorphism between Gex and the group ({+1, −1}, *), consistency with the exchange symmetry conditions for permutations in the generating group G0 guarantees that the ground state of HD satisfies the requirement of exchange symmetry for all permutations π∈Gex. The benefit of having π to traverse a generating set G0 instead of the full set Gex is to reduce the number of additive terms in HD from |Gex|=n! to a polynomial of n, and specifically n−1 when G0 is the set of adjacent transpositions. One advantage of enforcing exchange symmetries via discrete parts of Hamiltonians like HD=Σπ∈G
from the fermionic Hilbert space LF2(
) to the boltzmannonic Hilbert space L2(
), knowing that the ground state of HD must fulfill the symmetry requirements.
Using the so-called perturbation gadgets [22, 109-112], any CD-multiplicatively coupled partial Hamiltonian can be Lie-Trotter-Kato decomposed into a sum of CD-separately moving partial Hamiltonians, since, any monomial AB with A∈, B∈
as an additive term of a CD-multiplicatively coupled partial Hamiltonian has a Lie-Trotter-Kato decomposition
provided that all of the relevant Gibbs operators are well defined, especially when the operator A∈ is unbounded. CD-multiplicatively coupled operators form a large class, that includes most of the naturally occurring and practically relevant physical interactions in real or model systems. More importantly, as will be clearly seen later, a physical system governed by a CD-separately moving Hamiltonian can already be made homophysically universal, implementing a quantum computer that can be programmed to simulate any given physical system. It is for these reasons that focusing considerations on CD-separately moving partial Hamiltonians entails no loss of generality. In this specification, any Hamiltonian H∈
supported by any continuous-discrete product configuration space
is always assumed to be CD-separately irreducible.
For all known naturally occurring physics systems, as well as most theoretical model systems that are employed to implement or represent quantum computing [20, 22, 113], it is usually the case that a many-body Hamiltonian is computationally local and of an additive form as H=Σk=1K hk, K∈, where each partial Hamiltonian hk, k∈[1, K] acts nontrivially only on a small number of artificially labeled particles, that belong to a necessarily small number of fermion species, and are said to be moved by the corresponding hk, while the other particles of the whole system are said to be fixed by hk. More specifically, the following definitions are in order.
Definition 10. (h-Moved and h-Fixed Particle Species)
With respect to a quantum physics/system () associated with a generic continuous-discrete product configuration space
S∈, ns∈
for all s∈[1, S], a partial Hamiltonian h∈
is said to move particles of the first species, if a tensor product state
exists, with ψ1∈L2(1n
for which no wavefunction ϕ′1∈L2(′1) exists to satisfy hψ=ψ1⊗ϕ′1, ∥ϕ′1∥≠0. More generally, ∀s∈[1, S], a partial Hamiltonian h∈
is said to move particles of the s-th species, if by swapping the indices 1 and s, the partial Hamiltonian h is turned into h′∈
which moves particles of the first species, with respect to the new quantum physics/system (
) that has the particle species relabeled. A partial Hamiltonian h∈
is said to fix particles of the s-th species, s∈[1, S], if it does not move particles of the s-th species, with respect to said quantum physics/system (
).
Definition 11. (Few-Body-Moving Interactions)
With respect to the same quantum physics/system () as above, a partial Hamiltonian h∈ log
is called an (sm, nm, dm)-few-body-moving (FBM) interaction, (sm, nm, dm)∈
3, when it moves a number sm of particle species and fix all of the other (S−sm) species, where each moved species labeled by an s∈[1, 5] has no more than nm identical particles that live in a substrate space
s with a substrate space dimension dim(
s)≤dm. When the quantum physics/system (
) is variable in size due to a varying S∈
number of species, a partial Hamiltonian h∈ log
is called a logarithmically FBM interaction if it is an (sm, nm, dm)-FBM interaction with dm=O(1) and nm=O(1) being fixed while sm=O(log(size(
))).
In this specification, O(⋅), Ω(⋅), and Θ(⋅) are the traditional notations of asymptotics in the Knuth convention, representing an upper bound, a lower bound, and a simultaneous upper and lower bound, respectively [114]. An (sm, nm, dm)-FBM interaction may be simply referred to as an FBM interaction in short, with the triple of bounds (sm, nm, dm) implied and clearly understood. Almost all FBM interactions in this specification are at most logarithmically FBM, namely, (sm, nm, dm)-FBM with sm, =O(log(size())) at the most, nm and dm being fixed and small. It is obvious that a logarithmically FBM interaction involves no more than O(log(size(
))) dynamical variables.
Given any FBM interaction h∈ log , let
be the set of indices labeling particle species that are moved by h, and the complement
be the set of indices labeling particle species that are fixed by h. Clearly, h and
h⊥ bi-partition the whole set of indices [1, S], in the sense that
h∪
h⊥=[1, S] and
h∪
h⊥=θ. Define product manifolds
The FBM interaction h is said to induce an orthogonal direct sum decomposition (ODSD) =εh ⊕εh⊥, in the sense that there exist atlases {(Uα, φα)}α∈A for εh and {(Uβ⊥, φβ⊥)}β∈B for εh⊥, A and B being index sets indexing coordinate patches, such that the Cartesian product
serves as an atlas for ; in particular, any q∈
has a unique ODSD q=u⊕u⊥, with u∈εh, u⊥∈εh⊥, such that for any open neighborhood U, q∈U⊆
, there exist open neighborhoods Uα, α∈A, u∈Uα⊆εh and Uβ⊥, β∈B, u⊥∈Uβ⊥⊆εh⊥ to make U⊆Uα×Uβ⊥ true. For any u⊥∈εh⊥, the subset
such that q=u⊕u⊥} is called an εh-coset at u⊥, which is a submanifold of isomorphic to εh in the sense that the two not only are diffeomorphic as smooth manifolds but also endowed with isometric Riemannian metrics. Similarly, ∀u∈εh, the subset
such that q=u⊕u⊥} is called an εh⊥-cosec at u∈εh, which is a submanifold of isomorphic to εh⊥.
Definition 12. (h-Moved and h-Fixed Factor Spaces and Cosets)
are called the h-moved and h-fixed factor (configuration) spaces respectively. An εh-coset εh⊕u⊥∈εh⊥is called h-moved if a ϕ∈L2(εh⊕u⊥) exists such that ϕ is not an eigenvector of h. Conversely, an εh-coset εh ⊕u⊥, u⊥∈εh⊥ is called h-fixed if any ϕ∈L2(εh ⊕u⊥) is an eigenvector of h.
It is clear that for any FBM interaction h, on any h-fixed εh-coset εh⊕u⊥, u⊥∈εh⊥, h is (εh⊕u⊥)-diagonal whose eigenvectors are trivially computable. Two εh-cosets εh⊕u⊥, εh ⊕v⊥ with u⊥, v⊥∈εh⊥ are said to be h-isomorphic when the effects of h on the two εh-cosets are the same up to a scalar constant, that is, there exist τ1, τ2∈(0, ∞) such that u⊕u⊥|exp(−τ1h)|v⊕u⊥
=
u⊕v⊥|exp(−τ2h)|v⊕v⊥
holds true ∀u, v∈εh. Obviously, h-isomorphism is an equivalence relation among the εh-cosets {εh⊕u⊥: u⊥∈εh⊥{, which induces a partition of the collection of all εh-cosets into equivalence classes
is h-isomorphic to εh⊕u⊥}}.
When an FBM interaction hk∈ log , k∈
is an additive term in a sum Σk=1K hk, or a tensor-multiplicative factor in a product ⊗k=1Khk, K∈
, the sets of indices
h
h
k∈[1, K], just for convenience.
Definition 13. (FBM Tensor Monomials and Polynomials)
Given a quantum physics/system (), a partial Hamiltonian M∈ log
is called an FBM tensor monomial, when M is in a form of a tensor product M=I0⊗(⊗i=1n hi), n∈
, n≤size(
), where ∀i∈[1, n], hi∈ log
is an FBM interaction moving a factor space
i with dim(
i)=O(log(size(
))), while {
i}i=1n Cartesian-factorizes the whole configuration space as
and I0 is the identity operator supported by the factor space
although 0 may reduce to an empty set. The number
dim(Ci) is called the degree of the FBM tensor monomial M. A partial Hamiltonian W∈ log is called an FBM tensor polynomial, when W can be written as W=Σk=1K Mk, K∈N, K=O(poly(size(C))), with each Mk, k∈[1, K] being an FBM tensor monomial. The number
is called the degree of the FBM tensor polynomial W.
When useful and necessary, the notions of few-body-moving operators, moved or fixed particle species, and moved or fixed factor spaces and cosets can all be straightforwardly generalized to generic operators in a Banach algebra with respect to a given quantum physics/system (
). For a general operator Q∈
having an adjoint operator (or Hermitian conjugate) Q+∈
, the associated operators
are called the Hermitian part and the skew-Hermitian part of Q respectively. Let i∈ denote an operator in
such that i2+I≡0, with I∈
being the identity operator, then iA∈
is self-adjoint or Hermitian, called a Hermitianization of A, for any given skew-Hermitian operator A∈
.
Definition 14. (General Operators being Few-Body-Moving)
Given a quantum physics/system (), a general operator Q∈
is called few-body-moving, when both its Hermitian part Q+=(Q+Q+)/2∈
and the Hermitianization iQ− of its skew-Hermitian part Q−=(Q−Q+)/2∈
are few-body-moving. A particle species, or a factor space, or a coset is called Q-moved (conversely, Q-fixed), when such a particle species, or a factor space, or a coset is either Q+-moved or iQ−-moved (both Q+-fixed and Q−-fixed, respectively).
For any given partial Hamiltonian H with respect to a quantum physics/system (), let λ0(H) denote the lowest eigenvalue of H (namely, the ground state energy), and ψ0(H) represent either a unique eigenvector (that is, the ground state wavefunction) when there is no degeneracy, or an orthonormal basis of the ground eigenspace
where gmul(H, 0) denotes the geometric multiplicity of λ0(H). Let ψ0(H; q) specify the point value(s) of the ground state(s) at q∈. When H is well understood, it may be omitted from the notations) λ0(H), ψ0(H), and ψ0(H; q) without ambiguity. By the same token, let {λn(H)}n≥1 denote the higher eigenvalues/eigenspaces of H in a non-decreasing order, {ψn(H)}n≥1 and {ψn(H; q)}n≥1 represent the corresponding eigenstates and their point values, provided that degeneracy of states does not create confusion, or it is properly resolved by using additional indexing and ordering in accordance with additional auxiliary quantum numbers.
Definition 15. (Polynomially/Efficiently Computable Partial Hamiltonians)
Given a quantum physics/system (), a partial Hamiltonian H∈ log
is said to be polynomially computable (or efficiently computable, interchangeably), when the Gibbs operators generated by Base(H) is polynomially computable (or efficiently computable), in the sense that, there exists an ϵ1=Ω(1/poly(size(
)))>0, such that, for any given ϵ2=Ω(1/poly(size(
)))>0, the Gibbs transition amplitude K(τBase(H); r, q)=
r|exp[−τBase(H)]|q
can be computed to within a relative error no more than ϵ2, ∀τ∈(0, ϵ1], ∀(r, q)∈
2 such that distg(r, q)≤ϵ1, at a computational cost that is upper-bounded by O(poly(size(
))).
In the context of GSQC, whenever a computational problem is reduced to determining the ground state energy or sampling from the ground state wavefunction of a Hamiltonian H on a subset of a configuration space , or in the context of quantum statistical mechanics, whenever a computational problem is reduced to computing a partition function ∫
∈
q|exp(−
H)|q
dq for a fixed
>0 or sampling from a Gibbs kernel
r|exp(−
H)|q
, (r, q)∈
2, the Hamiltonian H has to be polynomially computable in order for any hope of solving the problem efficiently. In particular, whenever a many-body Hamiltonian is given as an FBM tensor polynomial H=Σk=1K Hk, Hk=⊗i=1n
, ∀k∈[1, K], K∈
, it is always assumed that all of the FBM tensor monomials Hk, k∈[1, K] and all of the FBM interactions hki, i∈[1, nk], k∈[1, K] are polynomially computable. Furthermore, in the context of GSQC, the total Hamiltonian H should have a unique ground state and needs to be polynomially gapped, namely, the ground state should be separated from all excited states by an energy gap whose reciprocal is upper-bounded by a polynomial of the problem input size, in order for the problem to be efficiently solvable.
Given a quantum physics/system () of a variable size, and a typical FBM tensor monomial M∈ log
, a sufficient (though not necessary) condition guaranteeing efficient computability of M is that, either deg(M)=O(1) is a fixed number independent of the varying size(
), or more generally, M=P⊗Q is a tensor product with both P and Q being efficiently computable, while P is an projection operator such that P2=P, although deg(P) could be as large as O(poly(size(
))), in which case, it follows from the identity
exp(−M)=exp(−
P⊗Q)=I+P⊗[exp(−
Q)−I],
∈(0,∞) (16)
that M is efficiently computable, because both exp(−P)=(I−P) e−
P and exp(−
Q) are polynomially computable, ∀
=O(poly(size(
))),
>0. Also, if M∈ log
is a tensor product of the form M=R⊗Q, with both R and Q being efficiently computable, where R is a self-inverse operator such that R2=I, then both M and I−M are efficiently computable, because
where R± are projection operators such that R+R−=R−R+=0, hence R+⊗Q− and R−⊗Q+ commute. Furthermore, if both R and Q are self-inverse operators, then both ½(I+R⊗Q) and ½(I−R⊗Q) are efficiently computable projection operators.
Definition 16. (Polynomial Lie-Trotter-Kato Decomposition of Partial Hamiltonians)
A Lie-Trotter-Kato decomposed partial Hamiltonian
with K∈, K=O(poly(size(H))), nk∈
, nk=O(poly(size(H))), ∀k∈[1, K] is called polynomially Lie-Trotter-Kato decomposable (PLTKD), when H is also an FBM tensor polynomial with each hki, i∈[1, nk], nk∈
, k∈[1, K] being an efficiently computable FBM interaction, and for each k∈[1, K], all but an mk=O(1) number of the FBM interactions {hki: i∈[1, nk]} are projection operators, wherein limn→∞ (Πk=1K exp(−Hk/n))n converges to exp(−H) polynomially fast with respect to a suitable operator norm, that is, there exist a fixed integer b∈
and some positive constants c1, c2, c3∈
, all of which being independent of K, such that
holds ∀K∈, K=O(poly(size(H))), and ∀m∈
that satisfies m≥c3 and mKb≥
k−1 for all k∈[1, K], where λ0(H) and λ1(H) denote the lowest and second lowest eigenvalues of H, ∥⋅∥ can be either the standard operator norm, or more strongly, the Hilbert-Schmidt norm,
k∈
is a constant associated with Hk such that the Gibbs operator exp(−
Hk) is efficiently computable, ∀T∈(0,
k), ∀k∈[1, K], ∀K∈
. Naturally, the corresponding expression H=Σk=1K Hk is called a polynomial Lie-Trotter-Kato decomposition of H.
The class of PLTKD partial Hamiltonians contains a subclass of computationally local Hamiltonians, that are FBM tensor polynomials of the form H=Σk=1K hk, with each hk, k∈[1, K] being an FBM interaction that moves no more than O(log(size(H))) particles or dynamic variables. As will be seen later in this specification, the subclass of computationally local PLTKD Hamiltonians is already complete and universal for GSQC, and it defines the same quantum computational complexity classes as does the superclass of PLTKD Hamiltonians in general, as far as polynomial reduction is concerned. Still, it is advantageous to consider, analyze, and employ the superclass of PLTKD Hamiltonians, not only because they form a larger set and afford wider generality, but also due to the fact that their employment in certain applications leads to faster algorithms, even though the improvement is only polynomial.
When the partial Hamiltonians H and {Hk}k∈[1,K] are all bounded, the polynomial Lie-Trotter-Kato decomposability and the asymptotic bound of (18) can be easily established using power series expansions of matrix/operator functions and matrix/operator norm analyses, which reduces to the well-known Lie product formula [115-118], as a special case of the Baker-Campbell-Hausdorff formula [119-122], or the Zassenhaus formula [123]. However, when some of the operators are unbounded, it is more intricate to formulate a Lie-Trotter-Kato product and prove an asymptotic bound of (18), due to subtleties in applicable domains as well as additivity of operators. Nevertheless, a significant body of theorems has been proved to establish the Lie-Trotter-Kato product formula for a wide variety of operators [103-108], mostly in the sense of strong operator topology. Recently, the Lie-Trotter-Kato product formula has been proved to converge in the operator norm, and indeed with the asymptotic error polynomially bounded as in (18) [124-132], especially for the all-important Schrödinger operators H=−Δg+V(q) on a Riemannian manifold (, g), with the potential V(q∈
) subject to certain conditions of regularity. Here we do not limit our Hamiltonians to any explicit and specific form, but just posit that they should have a known polynomial Lie-Trotter-Kato decomposition.
Let (Ω, , P) be a base probability space. For any random variable X: (Ω,
, P)
(Ω′,
), let PX denote the induced probability on the measurable space (Ω′,
) such that
∀A′∈, and call PX the probability law of X. For any two sub-σ-algebras
⊆
,
⊆
, define
as a measure of dependence between and
[133]. Let
be a totally ordered monoid as an index set with min(
)=0, e.g.,
=
≥0 or
be a sequence of random variables defined on (Ω, , P). For any j, k∈
, j<k, let
be the σ-algebra generated by the random variables {Xi}j≤i≤k [134, 135]. For a sequence of random variables , call
k∈ its α-mixing coefficient [134-137]. For two measures μ and v on a common measurable space (Ω,
),
define the total variation distance
A sequence of random variables
is called α-strongly mixing when limk→∞α(, k)=0 [134-137].
Definition 17. (Strongly Mixing Sequence of Random Variables)
A sequence of random variables
is called rapidly mixing toward a random variable Y [138], with a mixing time k0>0, k0∈, when both α(
, k)≤exp(−k/k0) and ∥PX
. A sequence of random variables
is said to be well mixed if α(, k)≤exp(−k), ∀k≥1, k∈
.
Let () be a many-body quantum system with a PLTKD Hamiltonian H=Ek=1K Hk∈ log
that is supported by a continuous-discrete product configuration space
equipped with a Riemannian metric g, where S∈ is the number of particle species, each species indexed by s∈[1,
] has ns∈
identical particles moving on a substrate space
s×
s.
Definition 18. (Computational Problem of Simulating Many-Body Hamiltonians)
A computational problem of simulating H is to generate a sequence of a predetermined polynomial number of well mixed random samples of: either 1) configuration points {q∈} according to the probability density function |ψ0(H; q)|2dVg(q) in the context of GSQC, when H is guaranteed to be polynomially gapped with a unique ground state ψ0(H); or 2) configuration points {q∈
} according to a probability density function |ψ(H; q; q0,
)|2dVg(q) in the context of quantum statistical mechanics, where
q∈ is a Gibbs wavefunction with fixed (q0,
)∈
×(0, ∞); or 3) pairs of configuration points {(r, q)∈
2} according to the absolute value of a Gibbs kernel
r|exp(−
H)|q
in the context of quantum statistical mechanics, with
>0 being a predetermined constant; where all said polynomials are in terms of size(H).
Definition 19. (Spatially Local Partial Hamiltonians)
A boltzmannonic partial Hamiltonian HB∈0(
) for a quantum multi-species multi-particle system moving on a many-body configuration space
is called spatially local, if for any open subset
⊆
and any ψ∈dom(HB)⊆L2(
) such that ψ
=0, the overlap integral
ϕ* HBψdVg=0 identically for any ϕ∈L2(
). A fermionic partial Hamiltonian H∈
0(LF2(
)) is called spatially local when its base boltzmannonic Hamiltonian Base(H) is spatially local.
Spatially local boltzmannonic or fermionic partial Hamiltonians are epitomized by, but not limited to, the conventional boltzmannonic or fermionic Schrödinger operators of the familiar form H=−Δg+V(q), with
being the Laplace-Beltrami operator on a many-body configuration space endowed with a Riemannian metric g, V(q∈
) representing a local potential, which as an operator is obviously a
-diagonal polynomial, and will be referred to as a
-diagonal potential hereafter. An important property of a spatially local boltzmannonic partial Hamiltonian HB is wavefunction support nonexpanding, that is, ∀ψ∈dom(HB)⊆L2(
), let
⊆
be the support of ψ and cl(
) be the closure of
, such that
ϕ*ψdVg=0, ∀ϕ∈L2(
), then the support of the wavefunction HBψ modulo null subsets of zero measure is necessarily a subset of cl(
), because it also holds that
ϕ*HBψdVg=0, ∀ϕ∈L2(
). For a general CD-separately moving partial Hamiltonian H=HC+HD supported by a continuous-discrete product configuration space
the property of spatial locality is a useful characterization mostly for the continuous component HC, while the discrete component HD can be spatially local if and only if it is -diagonal, hence C-diagonal.
Definition 20. (Boltzmannonic and Fermionic Schrödinger Partial Hamiltonians)
Supported by a continuous-discrete product manifold as a many-body configuration space, a CD-separately moving boltzmannonic partial Hamiltonian HB=HBC+HBD∈
0(
) is called spatially local and stoquastic, or boltzmannonic Schrödinger in short, when HBC is spatially local and HB is stoquastic in the sense that, ∀
>0, the Gibbs operator exp(−
HB) has a real- and non-negative-valued Gibbs kernel K(
HB; r, q)=
r|exp(−
HB(|q
, ∀(r, q)∈
×
. A fermionic partial Hamiltonian H∈⋄0(LF2(
)) is called spatially local and fermionic stoquastic, or fermionic Schrödinger in short, when its base boltzmannonic partial Hamiltonian Base(H) is boltzmannonic Schrödinger.
Auxiliary Utility 1. (Tensor Product of Fermionic Schrödinger and C-Diagonal Operators)
If H1=H1C+H1D∈⋄0(LF2(1)) is a CD-separated fermionic Schrödinger partial Hamiltonian that is supported by a continuous-discrete product configuration space
and V∈⋄0(LF2()) is a non-negative C-diagonal potential supported by a configuration space
, then the tensor product operator
is a fermionic Schrödinger partial Hamiltonian supported by the product configuration space
Demonstration. H2=H1⊗V=H1C⊗V+HD⊗V. It is obvious that H1C⊗V is spatially local as long as H1C is so. Also clear is that Base(H2)=Base(H1)⊗Base(V) is stoquastic, as Base(V) is -diagonal and non-negative. It follows that H2 is fermionic Schrödinger.
Given any open submanifold ⊆
and any boltzmannonic Schrödinger partial Hamiltonian HB supported by it, an eigenvalue problem arises naturally for HB on
, subject to a suitable boundary condition of the Dirichlet, Neumann, or Robin type, when the boundary ∂
is nonempty. The same is true for a fermionic Schrödinger partial Hamiltonian H supported by
as well, as long as the submanifold
is large enough to contain all of the orbits, so to stay invariant, under the action of the exchange symmetry subgroup GexH, which is defined as the largest subgroup comprising permutations that exchange only among the identical particles being actually moved by H. Evidently, all eigenvectors of a boltzmannonic or fermionic Schrödinger partial Hamiltonian HB or H supported by
can be made real-valued, and the ground state of HB made nowhere negative. Moreover, with the submanifold
being Ck-smooth, k≥1 and HB=Base(H) being sufficiently regular, it can be assumed that the Gibbs operators exp(−
HB) and exp(−
H) for some
>0 are compact self-adjoint on the separable Hilbert spaces L2(
) and LF2(
) respectively, and the Gibbs kernel K(
HB; r, q)=
r|exp(−
HB)|q
∈L2(
×
) is jointly continuous in r and q, so is
It follows from the spectral theorem for compact self-adjoint operators [83, 85] that the countable set of eigenvectors of exp(−H),
>0 form an orthonormal basis for the Hilbert space LF2(
), and the Gibbs operator has the spectral resolution
where {e−λ
H) in a decreasing order, and {ψn(H)}n=0∞ lists the corresponding normalized eigenvectors, all of which are necessarily continuous in
by the continuity of K(
H; r, q) and the L2-integrability of the eigenvectors in the eigen equations
For a conventional boltzmannonic Schrödinger operator HB=−Δg+V(q) on any connected Riemannian manifold (, g), with Δg being the Laplace-Beltrami operator, V(q∈
) being
-diagonal and real-valued, the conditional Wiener measure can be taken as the reference measure, with respect to which the Gibbs energy functional reads UG[γ(⋅)]=∫0
V[γ(
′)]d
′ in equation (6), which then reduces to the celebrated Feynman-Kac formula. It can be shown that the operator exp(−
HB) is trace class, ∀
>0, the Gibbs kernel K(
HB; r, q)∈L2(
×
;
) is non-negative and continuous in (
, r, q)∈(0, ∞)×
×
, so long as the potential V (q) is sufficiently regular, e.g., Kato-decomposable, that is, V=V+−V−, with V+∈Lloc1(
) (locally L1-integrable), and V− being a Kato-class potential having mild singularities [84, 97, 139]. Moreover, ∀
>0, the Gibbs kernel K(
HB; r, q) is strictly positive almost everywhere, thus positivity improving [97] in the sense that, ∀ψ∈L2(
;
) such that ψ(q)≥0, ∀q∈
and ψ≢0,
K(
HB; r, q)ψ(q)dVg(g) becomes strictly positive for Vg-almost every r∈
, provided that the potential V(q) is Klauder regular, i.e., fulfilling the condition below to avoid the so-called Klauder phenomenon [95, 97],
with μW[q,⋅] being the standard Wiener measure on the subset of paths in Wthat start from q and end anywhere in
. Any Kato-decomposable potentials V(q) is Klauder regular, so the associated conventional boltzmannonic Schrödinger operator HB=−Δg+V(q) is positivity improving. When HB is positivity improving, ∀
>0, the Gibbs operator exp(−
HB) has a unique positive eigenvector as the ground state of HB [97], by the celebrated Perron-Frobenius theorem for non-negative matrices [140-142] and the generalizing Krein-Rutman and de Pagter's theorems for compact operators that are positive with respect to a convex cone [143-145]. Besides, when necessary, better analytical properties of K(
HB; r, q) can be established and used when q and r are kept out of a submanifold of singularity
sin, which has a lower dimension than
. A case in point is when the potential V(q), q∈
is a sum of a bounded smooth function and Coulomb interactions between multiple electrons and multiple nuclei as well as among the multiple electrons and among the multiple nuclei themselves, where
sin is the union of collision planes, each of which is defined by a coordinate coincidence between two particles. The potential V(q) is analytic [146] at any q∈
\
sin, so the eigenfunctions of HB are analytic in
\
sin, by the standard theory of analytic regularity for solutions of partial differential equations [147-149]. Thus, the Gibbs kernel K(
HB; r, q) is analytic in (
\
sin)×(
\
sin). Even at points on the submanifold of singularity
sin, the singular behaviors of the eigenfunctions of HB are mild [150-152].
In this specification, we do not limit boltzmannonic Hamiltonians to the conventional Schrödinger operators, instead assume that any boltzmannonic Hamiltonian HB should be spatially local and sufficiently regular over a continuous-discrete product Riemannian manifold (, g) excluding a lower-dimensional submanifold
sin⊂
, such that ∀
>0, the Gibbs operator exp(−
HB) is trace class, hence Hilbert-Schmidt with a Gibbs kernel K(
HB; r, q) belonging to the class L2(
×
;
) for any fixed
∈(0, ∞), as well as being positivity improving and (
, r, q)-jointly C∞,2,2 over the product topological space (0, ∞)×(
\
sin)×(
\
sin), namely, ∀(n
, ni, nj, nk, nl)∈(
∪{0})5 such that ni+nj≤2, nk+nl≤2, with (i, j, k, l)⊆
4 indexing continuous coordinate components, the partial derivative ∂τn
HB; r, q) exists and is continuous in (0, ∞)×(
\
sin)×(
\
sin). Moreover, the Hamiltonian HB ought to behave like a uniformly elliptic differential operator and substantiate the so-called Hopf lemma [86-89], in that, for any bounded connected open subset D⊆
of a submanifold
⊆(
\
sin), possibly with dim(
′)<dim(
) when considering restricted motions on a lower-dimensional submanifold of
, there exists a constant E0(HB, D)∈
, called a Hopf bias for convenience, such that if ψ∈C2(D)∩C1(cl(D)) satisfies the Dirichlet differential inequality [HB|D+E0(HB, D)]ψ(q)≥0, ∀q∈D, then 1) ψ must have a strictly negative outer normal derivative at any boundary point r∈∂D∩cl(B) with an open ball B⊆D such that ψ(r)<(q), ∀q∈B; 2) ψ can only be a constant if it attains a non-positive minimum over cl(D) at an interior point r∈D. In particular, ∀ψ∈Diri(D), λ∈
such that HB|D ψ(q)=λψ(q), ψ(q)>0, ∀q∈D, it is always true that the gradient vector
for almost every r∈∂D, so long as ∂D is piecewise C2-smooth.
Still further, when considering a varying series of fermionic or boltzmannonic Schrödinger partial Hamiltonians {H(t): t∈} or {HB(t)=Base(H(t)): t∈
}, with the index set
being an interval on the real axis, and {H(t)} or {HB(t)} being continuously differentiable or even analytic in t∈
[146, 153, 154], it is always assumed that a suitable Wiener measure on the Riemannian manifold
exists, and the partial Hamiltonian HB(t) behaves sufficiently well on
excluding a lower-dimensioned submanifold
sin(t), ∀t∈
, such that the Feynman-Kac integral produces a Gibbs kernel K(
HB(t); r, q) that is in L2(
×
;
) and positivity improving ∀(
, t)∈(0, ∞)×
, which is furthermore (
, t, r, q)-jointly C∞,1,0,0 in
and (, r, q)-jointly C∞,2,2 in (0, ∞)×(
\
sin(t))×(
\
sin(t)), ∀t∈
. Still further, the operator HB(t) must substantiate the Hopf lemma and extremum principle, ∀t∈
. Antisymmetrization yields the fermionic counterparts H(t)=PFHB(t)PF, t∈
, and K(
H(t); r, q)=(Πs=1S ns!) Σπ,π′∈G
HB(t); πr, π′q), which is also (
, t, r, q)-jointly C∞,1,0,0 in the product space (0, ∞)×
×
×
, and (
, r, q)-jointly C∞,2,2 in (0, ∞)×(
\
sin(t))×(
\
sin(t)), ∀t∈
. Let {ψn(t; q)}n=0∞ denote the set of eigenvectors of exp(−H(t)) corresponding to the eigenvalues {λn(H(t))}n=0∞ in an increasing order, ∀t∈
. It is well known that K(
H(t); r, q)=Σn=0∞e−
λ
, t, r, q)∈(0, ∞)×
×
×
, which converges absolutely and uniformly as n→∞, as well as uniformly when
→∞. Then it follows straightforwardly from
that the t-series of eigenstate wavefunctions {ψn(t; q): (t, q)∈×
} is (t, q)-jointly C1,0 in
×
, ∀n≥0, and C2 in
\
sin(t), ∀t∈
, ∀n≥0.
Definition 21. (Densities and Signed Densities)
Given a measurable space (), where
is a configuration space and
is a σ-algebra of subsets of , a scalar density on
is a (
)-measurable function from
to a field
. A density on
is a tuple- or vector-valued function having either just one or a plurality of scalar densities as components. In particular, a scalar density on
is a density on
that has just one component. A density ƒ on
is said to be signed, when ƒ has two components ƒ1 and ƒ2 as scalar densities that are not necessarily different, and two points q1∈
and q2∈
exist which need not to differ, such that the value of the quotient ƒ1(q1)/ƒ2(q2) is different from zero and a positive real number.
Definition 22. (Minimally Entangled Densities)
A density on a configuration space of a variable size is said to be minimally entangled, when it can be written as
with n=O(poly(size())), ƒ being a Borel measurable function defined on
being a field, each gi, i∈[1, n] being a
-valued scalar density on a submanifold
i as a tensor factor of
such that size(
i)=O(log(size(
))), wherein all of the functions ƒ and {gi}i=1n have a representation in a closed mathematical form that is efficiently computable, such that ∀q∈
, the point-value ƒ({gi(gi(q|
i)}i=1n), with q|
denoting the coordinate restriction of q to the submanifold
i, ∀i∈[1, n], can be computed to within any predetermined relative error ϵ>0, at the cost of an O(poly(size(
) 1 log ϵ|)) computational complexity.
Definition 23. (Substantially Entangled Densities, Practically Substantially Entangled Densities)
A density on a configuration space of a variable size is said to be substantially entangled when it is signed and can not be represented in the form of a minimally entangled density on
. A density on the same
is said to be practically substantially entangled when it is signed and has no known representation in the form of a minimally entangled density on
.
The ground state ψ0(H; ⋅) or the Gibbs kernel function
>u of a many-fermion Hamiltonian H is a signed density on the corresponding configuration space
or the product space
In the past, great computational difficulties have been encountered in sampling from |ψ0 (H; ⋅)|2 or K(H;
),
>0 to derive a numerical estimate for an expectation value of an interested observable, which seem to stem from the fact that the density ψ0(H; ⋅) or K(
H;
),
>0 is substantially entangled, or at least practically substantially entangled.
To tackle a computational problem of simulating a many-fermion Hamiltonian H via Monte Carlo, a mathematical correspondence is needed to connect a density associated with H to a classical probability. Frequently used densities include the ground state ψ0(H; q), q∈1 and a Gibbs kernel
>0. Let
⊆L2(
k;
), k∈
be a Hilbert space of functions supported by a configuration or product
and over a scalar field ∈{
}, and let T∈
(
) be a bounded and normal linear operator, defined through an integral kernel
⋅|T|⋅
∈L2(
k×
k;
), such that [Tϕ](r)=
r|T|q
ϕ(q) dVg(q), ∀ϕ∈
, ∀r∈
k. Let ψ be an eigenvector such that Tψ=λψ, λ∈
. ψ is necessarily continuous in
k. Define a new operator
on the L1 space Diri(, 1), with
being the open subset of k excluding the nodal points of ψ, the superscript * denoting complex conjugation when applicable and needed, and ∀ϕ∈L2(
), [ϕ]: L2(
)
L2(
) representing a multiplication operator induced by the function ϕ(⋅), which is formally a diagonal operator listing the values {ϕ(q): q∈
} as its diagonal elements, namely
such that [ϕ]ξ(q)=ϕ(q)ξ(q) holds true, ∀ξ∈L2(), ∀q∈
. The operator Tψ has an integral kernel
(r, q)∈×
.
Auxiliary Utility 2. (Quasi-stochastic Operators and Integral Kernels)
The operator Tψ and the associated integral kernel ⋅|Tψ|⋅
are quasi-stochastic, in the sense that the right marginal distribution
r|Tψ|q
dVg(r) reduces to the constant λ, ∀q∈
, moreover, for any eigenvector ψ′ of T such that T|
ψ′=λ′ψ′, ψ′∈Diri(
), the function ψ*ψ′∈Diri(
, 1) is an eigenvector of Tψ with the same eigenvalue λ′. Conversely, when T satisfies the Hopf lemma in
⊇
while Tψ satisfies the anti-Hopf lemma on
, then {ψ*ψk}k≥0 lists all of the Dirichlet eigenfunctions of Tψ such that Tψ(ψ*ψk)=λkψ*ψk, ψ*ψk∈Diri(
, 1), ∀k≥0, when {ψk}k≥0 lists all of the Dirichlet eigenfunctions of T such that Tψk=λkψk, ψk∈Diri(
), λk∈
, ∀k>0.
Demonstration. Let T+ denote the adjoint operator of T on L2(k), k∈
. The operator T−λ is normal. It follows from ∥(T−λ)ψ∥2=0 that ∥(T+−λ*)ψ∥2=0, namely, T+ψ=λ*ψ. Therefore,
ψ*(r)
r|T|q
dVg(r)=
ψ|r
r|T|q
dVg(r)=
ψ|T|q
=
T+ψ|q
=λψ*(q), and consequently,
r|Tψ|q
dVg(r)=λ, ∀q∈
. On the other hand, for any ψ′∈Diri(
) such that T|
ψ′=λ′ψ′, λ′∈
, it holds true that Tψ(ψ*ψ′)=[ψ*]T[ψ*]−1(ψ*ψ′)=[ψ*]Tψ′=λ′(ψ*ψ′). In particular, the operator Tψ fixes the non-negative density |ψ(q)|2=ψ*(q)ψ(q), q∈
, when the operator T is affine-transformed to make λ=1. Such a non-negative density |ψ(⋅)|2, when normalized, is called a stationary distribution or stationary probability vector of (or associated with) the quasi-stochastic operator Tψ. Conversely, when T satisfies the Hopf lemma in
⊇
while Tψ satisfies the anti-Hopf lemma on
, then for any Dirichlet eigenfunction ψ″∈Diri(
, 1) of Tψ such that Tψψ″)=λ″ψ″, λ″∈
, it holds that
is always well-defined in and continuous in cl(
), where
denotes the topological closure of , further, ψ′ vanishes on ∂
. It follows from Tψψ″=[ψ*]T[ψ*]−1ψ″=λ″ψ″ that T(ψ″/ψ*)=λ″(ψ″/ψ*), namely, ψ′=ψ″/ψ* is necessarily a Dirichlet eigenfunction of T in
which is also continuous in cl(
).
Clearly, a quasi-stochastic operator Tψ becomes bona fide stochastic when the associated integral kernel r|T|q
, (r, q)∈
k×
k, k∈
is real-valued and nowhere negative, and the operator is scaled properly to have a unit spectral radius, in which case, the meaning of stationary distribution or stationary probability vector coincides precisely with the standard and well-known definition in association with Markov chains and stochastic matrices.
Specifically, for any conventional boltzmannonic Schrödinger operator HB that is irreducible on a connected open subset of a many-body configuration space
, the Gibbs kernel
r|exp(−
HB)|q
is positive real-valued, ∀(r, q)∈
×
, ∀
>0, and the ground state ψ0 of HB is necessarily positive everywhere on
. Thus, the operator [ψ]exp(−
HB)[ψ]−1,
>0 and its integral kernel ψ(r)
r|exp(−
HB)|q
ψ(q)−1 are indeed stochastic and can be interpreted as a Markov operator and a Markov kernel. Such boltzmannonic Schrödinger Hamiltonians are known as stoquastic and of great importance for Monte Carlo simulations of physical systems [155,156], where the positivity of the associated Gibbs kernels and the ground state wavefunctions makes it possible to map a quantum density into a classical probability density, which can be sampled by a random walk without encountering the sign problem. Here in this specification, the notion of stoquasticity will be extended to fermionic systems, which require a resolution of the numerical sign problem.
Despite being based on a stoquastic boltzmannonic Hamiltonian, fermionic Schrödinger Hamiltonians have in the past been considered archetypical hosts and victims of the infamous sign problem, due to the ostensible necessity of both positive and negative signs to satisfy the fermion-exchange symmetry of any legitimate wavefunction. Feynman [23,32] was one of the pioneers recognizing the need and use of “negative probability” as the fundamental and differentiating aspect of quantum mechanics from classical physics and probability, even above the “spooky” quantum entanglement. Indeed, Feynman showed explicitly [32] that when formulated in terms of density matrices, quantum physics paralleled classical probability all but identically, with the only exception of admit-ting “probability” values out of the interval [0, 1], which echoed Wigner [157] and Moyal's [158] earlier analyses using the density-matrix-equivalent Wigner functions. When a quantum system undergoes a restricted form of dynamics such that a suitable representation of the density matrix or Wigner function can be chosen to avoid “negative probability”, then one may reasonably ex-pect that the quantum dynamics can be efficiently simulated on a classical probabilistic machine. Such intuition/conjecture has been attested empirically by the vast and successful QMC simulations of boltzmannonic and bosonic systems in the absence of magnetic fields, and recently proved in full mathematical rigor for restricted quantum dynamics involving superoperators represented by positive-definite Wigner functions [159, 160]. On the other hand, the apparently unavoidable “negative probability” makes it extremely hard, seemingly impossible, to simulate a general large quantum system on a classical probabilistic computer. Limited to classical physics and probability, unable to actualize and transfer “negative probability” as a quantum system can do, a BPP machine simulating a signed measure or density υ describing a quantum system would have to sample from a classical probability, that is, a non-negative-definite measure and compute expectation values of/involving a signed function σ=dυ/dμ, where catastrophic cancellation could have statistical noise inundating and obscuring the desired averages, when the υ-described system is large and near quantum degeneracy. The seemingly fundamental limitation prompted Feynman to turn the difficulty of simulating quantum mechanics on its head, by proposing to build quantum computers and simulate quantum mechanics on quantum machines, thus ushering in the field of quantum computing, particularly quantum simulations [161]. Quantum computers, empowered by the capability to actualize and transfer “negative probability”, seem to hold a potential to efficiently solve a possibly larger set of computational problems known as BQP. There is a widely held belief that BQP\BPP≠θ, which shall turn out to be false.
Let H be a general and usually fermionic partial Hamiltonian on a Riemannian manifold (, g) as a configuration space, q0∈
be a fixed configuration point called the reference point [5],
∈(0, ∞] be a fixed imaginary time. It is always assumed that H behaves sufficiently well analytically, such that the Gibbs wavefunction
is continuous in q∈, ∀q0∈
, ∀
∈(0,
]. In particular, the limit
q∈ becomes substantially independent of the reference point q0 and coincides with the ground state of H up to a global scaling factor, namely, ψ(H; q; q0, ∞)/∥ψ(H; q; q0, ∞)∥=ψ0(H; q), q∈
, so long as ψ(H; q0; q0, ∞)≠0 and the ground state is non-degenerate. In the following of this specification, when applicable, we will take advantage of such generality of the Gibbs wavefunction to treat two typical application scenarios on the equal footing, where in one application scenario a Gibbs wavefunction ψ(H; ⋅; q0,
→∞), q0∈
in the large-
limit represents the ground state of H, while in the other a Gibbs wavefunction ψ(H; ⋅; q0,
), q0∈
at a finite
∈(0, ∞) stands for the Gibbs transition amplitude associated with a Gibbs operator exp(−
H).
Definition 24. (Nodal Cell and Nodal Surface)
For any given Gibbs wavefunction ψ(H; ⋅; q0, ), q0∈
,
∈(0, ∞], the preimage ψ−1(H;
\
is necessarily an open subset of , one connected component of which containing a given point q∈
such that ψ(H; q; q0,
)≠0 is called the nodal cell of ψ(H; ⋅; q0,
) containing q, denoted as
(H; q; q0,
), whose boundary ∂
(H; q; q0,
) is called the nodal surface of ψ(H; ⋅; q0,
) enclosing q.
Multiple decades worth of research and development in QMC of fermionic Hamiltonians has signified the central importance of nodal cells and surfaces associated with Gibbs wavefunctions, which can be said to condense all of the computational hardness. In particular, for a fermionic Schrödinger partial Hamiltonian H on a connected Riemannian manifold , if the location of the nodal surfaces were known or otherwise efficiently computable for the ground state ψ0(H) or a Gibbs kernel function associated with H, then solving for the ground state wavefunction everywhere on
would be equivalent to solving the ground state of a Dirichlet eigenvalue problem for Base(H) on a single nodal cell [5, 162, 163], or computing the Gibbs kernel function would be equivalent to computing a boltzmannonic path integral over all of the Feynman paths that never cross any nodal surface. That is the essential observation behind the fixed-node diffusion [5, 163, 166, 167, 188-191] or constrained path integral [5, 168, 169, 173, 174] methods of QMC.
Given a fermionic partial Schrödinger Hamiltonian H=HC+HD on a continuous-discrete product configuration space
which is CD-separately irreducible and has a non-degenerate ground state ψ0(H). Let ⊆
×{v}, v∈
be a nodal cell of ψ0(H). The continuous component Base(HC) of the base boltzmannonic Hamiltonian Base(H) is necessarily irreducible in
, and straightforwardly, the nodal cell
is (Base(HC)|
)-complete. However,
might not be Base(HD)-closed in general. Rather, there could be another nodal cell
⊆
×{v′} with v′∈
, v′≠v that is connected to
through Base(HD).
Definition 25. ((Positively) HD-Connected Configuration Points and Nodal Cells)
Let
H=HC+HD, ψ0(H), and nodal cells ⊆
×{v},
⊆
×{v′}, v, v′∈
of ψ0(H) be given as in the immediate above. Two configuration points (q, v) and (q, v′) with q∈
and v, v′∈
are said to be HD-connected when
(q, v)|I+Base(HD)|(q, v′)
≠0. Such a pair of HD-connected configuration points (q, v) and (q, v′) are called positively HD-connected if additionally ψ0(H; (q, v))ψ0(H; (q, v′))>0. Two nodal cells
⊆
×{v} and
⊆
×{v′} with v, v′∈
are said to be directly (and positively) HD-connected when q∈
exists such that (q, v)∈
, (q, v′)∈
, (q, v) and (q, v′) are (positively) HD-connected.
Two nodal cells ⊆
×{v} and
′⊆
×{v′} with v, v′∈
are said to be (positively) HD-connected, when a sequence of nodal cells {
i}i=0n, n∈
exists such that
and i are directly (and positively) HD-connected, ∀i∈[1, n] where each such sequence of nodal cells is called a (positive) HD-path between
and
′. A non-negative integer-valued (positive) HD-distance between such pair of nodal cells (
,
′), denoted by dist(
), can be defined as dist(
)=0 if
else dist
if n∈ is the least number such that an HD-path {
i}i=0n exists to start and end at
respectively.
Definition 26. (Nodal Groupoid)
Let H=HC+HD be a CD-separately irreducible fermionic Schrödinger Hamiltonian supported by a continuous-discrete product configuration space
which has a non-degenerate ground state ψ0(H). Let (q, v)∈ be a configuration point such that ψ0(H; (q, v))≠0. The set of all nodal cells of ψ0(H) that are positively HD-connected to
(H; (q, v)) is called a nodal groupoid of ψ0(H), which is denoted as
(H; (q, v)). The same name of nodal groupoid may also refer to the set union of nodal cells
when there is no risk of ambiguity in a context, or the mathematical notation makes the specific meaning clear.
For any nodal groupoid (H; (q, v)), its continuous boundary denoted by ∂
(H; (q, v)) is defined to be the union of the usual nodal surfaces,
while its discrete boundary denoted by δ(H; (q, v)) is defined to be the set of ordered pairs of configuration points ((r, v), (r, v′)) such that r∈
, v, v′∈
, (r, v)∈∪
(H; (q, v)), (r, v) and (r, v′)) are HD-connected but not positively HD-connected. The boundary of
(H; (q, v)), called the nodal boundary in short, is denoted and defined as
Also, the closure of (H; (q, v)) is denoted and defined as
A wavefunction ψ∈L2() is said to satisfy the Dirichlet boundary conditions on ∇
(H; (q, v)), when ψ(r, v)=0 for any (r, v)∈∂
(H; (q, v)), and ψ(r, v′)/(r, v)=ψ0(H; (r, v′))/ψ0(H; (r, v)) for any ((r, v), (r, v′))∈δ
(H; (q, v)).
The term nodal groupoid suggests an interpretation of (H; (q, v)) forming a small category, where the collection of concerned nodal cells serves as the set of objects, and from each
∈(H; (q, v)) to each
∈
(H; (q, v)), the set of morphisms hom(
) consists of all positive HD-paths (considered as directed) between
and
, so consequently, every morphism is invertible.
The required equation of wavefunction proportionality for the Dirichlet boundary condition on the discrete boundary δ(H; (q, v)) is a generalization of the so-called lever rule in fixed-node diffusion QMC for lattice fermions [192,193], which makes it possible to recover fully and identically the ground state wavefunction ψ0(H) restricted to the nodal groupoid
(H; (q, v)), by solving the ground state of a Dirichlet eigenvalue problem HBeffψ=λψ, ψ∈Diri(
(H; (q, v)), 2), with an effective Hamiltonian
representing a truncation of the boltzmannonic Hamiltonian Base(H) to within (H; (q, v)), where
and Base(H)H; (q,v)) denotes the restriction of Base(H) to the submanifold ∪
(H; (q, v)) as specified in equation (10), which discards all of the Base(H)-induced interactions between any (r, v)∈
and (v, v′)∈
such that ((r, v), (r, v′))∈δ
(H; (q, v)), henceforth referred to as Base(H)-couplings across the discrete boundary δ
(H; (q, v)), whose collective effect is replaced by the effective
-diagonal potential VBeff(⋅, v). As mentioned previously, an alternative interpretation of Base(H)
(H; (q,v)) is to view it as a superimposition of Base(H) and an infinite potential barrier out of ∪
(H; (q, v)), namely
(r, v), where (H; (q,v)) (⋅, v) is a
-diagonal potential such that
(q, v)) and
That the Base(H)-couplings across δ(H; (q, v)) can be equivalently substituted by the effective
-diagonal potential VBeff has been proved rigorously in [192], noting that for any function ψ∈L2(
) satisfying the Dirichlet boundary conditions on ∇
(H; (q, v)), any occurrence of a wave amplitude ψ(r, v′) in any equation can be substituted identically by [ψ0(H; (r, v′))/ψ0(H; (r, v))]ψ(r, v), so long as ((r, v), (r, v′))∈δ
(H; (q, v)).
Definition 27. (Fundamental Domain or G-Tile)
Let (X, ) be a topological space, on which a group G acts by homeomorphisms [73, 74]. An open set D⊆X is called a fundamental domain under G, or simply a G-tile, if 1) ∀g1, g2∈G, either g1D=g2D or g1D∩g2D=θ, and 2) the orbit of D, orb
upon topological closure, contains X, that is, cl[orb(D)]=X. An open set D⊆X is called a G-pretile when only condition 1) is satisfied.
Auxiliary Utility 3. (Tiling Property of Nodal Groupoids [5-163])
Let
and Gex denote the configuration space, its defining topology, and the exchange symmetry group of a CD-separately irreducible fermionic Schrödinger Hamiltonian H=HC+HD, whose ground state is non-degenerate, then each nodal groupoid (H; q), q∈C, ψ0(H; q)≠0 is a Gex-tile, with respect to the topological subspace
Demonstration. Firstly, define a binary relation between any q, r∈
such that q
r if r∈
(H; q), that is, there exists a sequence of nodal cells {
i}i=0n, n∈
such that
0=
(H; q),
n=
(H; r),
i−1 and
i are directly and positively HD-connected, ∀i∈[1, n]. It is easy to verify that
is an equivalence relation, which partitions the set
=
\ψ0−1(H; {0}) into mutually disjoint equivalence classes
that are exactly the nodal groupoids, namely, [q]H=(H; q), ∀q∈
. It is also clear that ∀q, r∈
, ∀π∈Gex, q
r if and only if πq
πr, as a consequence of ψ0(H; πq)=sign(π) ψ0(H; q), ∀q∈
, ∀π∈Gex, as well as π∈Gex being an autohomeomorphism. Therefore, for each nodal groupoid
(H; q)=[q]H, q∈
and ∀π∈Gex, the transformed open set π
(H; q)=
(H; πq)=[πq]H is necessarily a nodal groupoid, which is either identical to or disjoint with π
(H; q)=[q]H. So, every groupoid is a G-pretile. The subgroup of Gex under which a nodal groupoid
(H; q) is invariant is called the stabilizer of
(H; q), denoted and specified as stab
q∈ groupoid viewed as a Gex-pretile, then, as a consequence of the spatial locality of Base(HC), the restriction of ψ0(H) to
constitutes an eigen solution to a Dirichlet problem on
, [HBeff|
][ψ0(H)|
]=λ0(H)[ψ0 (H)
], ψ0(H)|
>0, subject to the Dirichlet boundary condition on
where HBeff=Base(H)|+VBeff denotes the effective boltzmannonic Hamiltonian in association with the nodal groupoid
, with VBeff being specified in equations (23). Conversely, for any Dirichlet eigen solution ϕ0∈Diri(
)⊆L2(
) of HBeff associated with an eigenvalue κ0∈
, κ0≤λ0 (H) such that (HBeff|
)(ϕ0|
)=κ0ϕ0|
, ϕ0|
>0, ϕ0
=0, the wavefunction ϕ0 can be extended into ϕ1∈L2
⊆L2() such that ϕ1|
=ϕ0|
, ϕ1
=0, and ϕ1(q′)/ϕ1(q)=ψ0(H; q′)/ψ0(H; q) for all (q, q′)∈
×
satisfying (q, q′)∈δ
and q∈
. Consequently, {[Base (H)
][ϕ1
]}(q)=κ0ϕ0(q) holds for all q∈
∪∂
). Moreover, it is without loss of generality to assume that ϕ0 is fixed by stab(
(H; q)), meaning ϕ0∘π(q)=ϕ0(q), ∀q∈∪
(H; q), ∀π∈stab(
(H; q)), because, otherwise, ϕ0 can be replaced by its symmetrized version
Now consider the homeomorphic group action [71-76] of Gex on and the induced transformation of quantum amplitude distributions, i.e., wavefunctions. For any π∈Gex acting by homeomorphism,
is still a nodal groupoid, π(∂)=∂(π
), π(δ
)=δ(π
), π(∇
)=∇(π
) are still the boundaries of π
, and the transformed wavefunctions ϕ0∘π−1, ϕ1∘π−1 still solve a Dirichlet eigenvalue problem on π
, in particular, {[Base(H)
][ϕ1∘π−1
]}(r)=κ0ϕ0∘π−1(r) holds ∀r∈π
∪∂(π
), since Base(H) is invariant under π, ϕ0 is fixed by stab(
(H; q)), while ψ0(H) is antisymmetric as ψ0(H; πq)=sign(π)×ψ0(H; q), ∀q∈
. Consequently, the wavefunctions ϕ0 and ϕ1 can be antisymmetrized and extended into
i∈{0, 1} respectively, satisfying
subject to the boundary condition [PFϕ1](r′)/[PFϕ1](r)=ψ0(H; r′)/ψ0(H; r), ∀(r, r′)∈×
such that r∈orb(
), r′∈orb(δ
)\orb(
), and
r|Base(HD)|r′
=0, where orb
and orb
are the orbits of and δ
under the action of Gex respectively. Using the lever rule trick again, a similar change of variables with
transforms the homogeneous linear equation (24) into a Dirichlet eigenvalue equation
But it must hold that κ0=λ0(H). Otherwise, if κ0<λ0(H), then according to equation (26), the wavefunction
with the indicator function [194] for the set orb(
), would produce a Rayleigh-Ritz ratio
ψ|H|ψ
/
ψ|ψ
=κ0<λ0(H), which, by the Courant-Fischer-Weyl min-max principle [195, 196], is impossible unless PFϕ0 vanishes almost everywhere on orb(
). But PFϕ0 cannot vanish, even just in
, because, ∀π∈Gex such that π
∩
≠θ, thus
it follows from (ϕ0∘π±1)(r)=ϕ0(π±1r)>0, ∀r∈ that sign(π)=+1, hence ψ(r)=[PFϕ0](r)>0, ∀r∈
. Furthermore, orb(
) must be the whole of
. Otherwise, if there is a q′∈
\orb(
) such that ψ0(H; q′)≠0, then the nodal groupoid
is nonempty, orb()∩orb(
′)=θ, and a wavefunction
constitutes an eigenfunction of a Dirichlet boundary value problem on ′, associated with an eigenvalue to κ′0<λ0(H). Then the previous steps of wavefunction antisymmetrization and extension can be repeated to instantiate another wavefunction
besides
such that ψ|H|ψ
/
ψ|ψ
=λ0(H),
ψ′|H|ψ′
/
ψ′|ψ′
=κ′0≤λ0(H),
ψ|ψ
≠0,
ψ′|ψ′
≠0,
ψ′|ψ′
=0. That contradicts the premise of H having a non-degenerate ground state with the eigenvalue λ0(H). Therefore,
with respect to the topological space ().
Derived Utility 1. (Fixed-Node Equivalence)
Simulating a promised unique ground state ψ0(H; r∈) of a CD-separately irreducible fermionic Schrödinger Hamiltonian H=HC+HD supported by a continuous-discrete product configuration space
is equivalent to lowest Dirichlet eigenvector of Base(H) on a nodal groupoid
q∈, ψ0(H, q)≠0, provided though that the boundary ∇
and the Dirichlet boundary conditions on ∇
are a priori known or efficiently computable, in the sense that for any given continuous path γ: [0, 1]
of a polynomially bounded length, it can be efficiently decided whether γ intersects ∂
, and ∀r=(rC, rD)∈
, it is efficient to compute and enumerate all of the configuration points {r′=(r′C=rC, r′D)} such that (r, r′)∈δ
, together with the ratios of wavefunction values {ψ0(H; r′)/ψ0(H; r)}, ∀(r, r′)∈δ
.
This Derived Utility signifies the central importance of nodal groupoids and boundaries in QMC, which can be said to condense all of the computational hardness of simulating quantum systems on a classical probabilistic computer. A means to decide quickly the boundary and the Dirichlet boundary conditions of a nodal groupoid translates immediately to an efficient BPP algorithm for solving the ground state and energy of the concerned quantum system. Unfortunately, computing the nodal surface(s) of a general fermionic many-body system has proven to be exceedingly hard, except for the celebrated Lieb-Mattis solution [162] for many fermions moving on a one-dimensional manifold diffeomorphic to one of the intervals [0, 1], [0, 1), (0, 1) on the real line, where the two-fermion-coincidence hyperplanes in the many-body configuration space are precisely the only nodal surfaces, because, within any interval on the real line, particle exchanges can only happen with two particles passing through each other on the line. With more spatial dimensions, however, two-fermion-coincidence hyperplanes in the many-body configuration space have less dimensions than, therefore cannot fix, the nodal surfaces.
The celebrated Feynman path integral and the corresponding path integral Monte Carlo (PIMC) provide general and powerful methods for computing and simulating Gibbs operators and their associated Gibbs wavefunctions and kernels [4-8, 94-97, 164-176]. Generally, it may be desired to compute a Gibbs operator, specifically its associated Gibbs wavefunctions and kernels, corresponding to a piecewise constant sequence of partial Hamiltonians {Hm: m∈[1, M]}, M∈, with each Hm, m∈[1, M] being a constant operator applied to a many-fermion quantum system during an interval (
m−1,
m] of (imaginary) time, where {
m: m∈[0, M]}⊆[0, ∞)M+1 is a predetermined sequence of time instants, with
0=0,
m>
m−1, ∀m∈[1, M]. Within each time interval (
m−1,
m], the partial Hamiltonian Hm, m∈[1, M] effects a Gibbs operator
whose integral kernel K(δmHm; ⋅, ⋅), namely, the Gibbs kernel, is analytically expressed as
with
denoting the base boltzmannonic Hamiltonian of Hm. Numerically, in order to compute or approximate a Gibbs transition amplitude K(δmHm; r, q), ∀m∈
, ∀(r, q)∈
2, a PIMC procedure divides the interval (
m−1, Tm] into Nm∈
smaller steps by inserting Nm−1 intermediate instants of time {
mn}n∈[1,N
such that each time step
is positive and sufficiently small, that the corresponding Gibbs operator exp(−δmnHm) can be efficiently computed or approximated straightforwardly using a small-time approximation such as exp(−δ
mnHm)=I−δ
mnHm+O (δ
mn2), or exp(−δ
mnHm)=exp(−δ
mnUm) exp(−δ
mnVm)+O(δ
mn2), or a higher-order formula of operator decomposition or splitting [177-187], where Um+Vm=Hm is an exemplary operator decomposition, such that both exp(−δ
mnUm) and exp(−δ
mnVm) have a closed-form analytical kernel.
Having the time interval (0, M] partitioned into a number
of small steps with the ordered sequence of time instants
the essence of PIMC in the configuration coordinate representation is to discretize the Wiener space of continuous curves {γ: [0, M]
, γ∈C0} into a cylinder set
{t∈[0, Ntot]}≅N
t≅
: t∈[0, Ntot]}, with each element {qt
∈Πt=0N
t representing a zigzag path made of consecutive segments of straight lines, then to approximate a path or functional integral by an (Ntot+1)-fold multiple integral of elementary calculus [94-101]. Let
m∈[0, M] be an integer-valued function, and ∀t∈[0, Ntot], let m(t)∈ be the smallest integer m such that Ntot(m)≥t.
Definition 28. (Feynman Slices, Feynman Slabs, Feynman Stacks, Feynman Substacks, Feynman Paths, Feynman Flights, Feynman Bundles, Feynman Chains, Rectified Feynman Chains)
With the same conventions and notations as the immediate above, for a given time instant t∈[0, Ntot], a Feynman slice denoted as (t, H(t)) is a couple formed by the configuration space
t∈
and the instantaneous partial Hamiltonian
t∈[1, Ntot]; ∀m∈[1, M],
the (Nm+1)-tuple of Feynman slices {(t, H(t)): t∈
associated with a constant partial Hamiltonian H(t)=Hm for t∈(tm−1, tm] form a Feynman slab; the M-tuple of all of the Feynman slabs
form a Feynman stack; a single Feynman slab or a sequence of consecutive Feynman slabs form(s) a Feynman substack, which is a Feynman stack in its own right with respect to the corresponding start and end time instants.
With respect to a Feynman stack or substack, a couple formed by a zigzag path {qt}t∈[t′,t″]∈t, 0≤t′<t″≤Ntot and the sequence of partial Hamiltonians {H(t)}t∈[t′,t″] is called a Feynman path from t=t′ to t=t″ that connects qt′∈
t′ and qt″∈
t″; a triple (qt−1, qt, H(t)) with t∈[1, Ntot], qt−1∈
t−1, qt∈
t is called a Feynman flight and considered as a portion of a Feynman path; for each Feynman slab {(
t, H(t)): t∈[tm−1,tm]}=Slab(
t
t
With respect to a Feynman stack or substack that starts at tm
(m) for any m∈[m1, m2], ∀(m2−m1+1)-tuple {rm}m∈[mt
⋅|exp[−(tm−tm−1)Hm]|rm−1
; a rectified Feynman chain {Bun(tm−1, rm−1; tm, rm)}m∈[m
t
A specific PIMC method assigns to each Feynman flight a Gibbs transition amplitude through a small-time approximation, and assigns to each Feynman path a product of Gibbs transition amplitudes due to its constituent Feynman flights, then computes any Gibbs wavefunction or kernel by summing up (integrating over) the Gibbs transition amplitudes of all of the permitted Feynman paths. In particular, for each Feynman bundle Bun(tm−1, rm−1; tm, rm) with tm−1=Ntot(m−1), tm=Ntot(m), m∈[1, M], qm−1∈t
t
qm|exp[−(tm−tm−1)HmB]|rm−1
associated with
is obtained by integrating the corresponding Gibbs transition amplitudes of all of the paths in the Feynman bundle, which produces the boltzmannonic Gibbs wavefunction or kernel when rm or both rm−1 and rm traverse(s) the respective configuration space(s). By equation (27), the corresponding fermionic Gibbs wavefunctions or kernels can be obtained by symmetrizing the rm coordinate, that is, rm|exp[−(tm−tm−1)Hm]|rm−1
=
rm|PF exp[−(tm−tm−1)HmB]|rm−1
, ∀rm−1∈
t
t
for the boltzmannonic Gibbs wavefunctions or kernels, and
for the fermionic counterparts. It is noted that, in equation (29), suffice it to keep only one of the fermionic symmetrization projection operators acting on one of the configuration spaces {t
Context Switch Notice: In the following plurality of paragraphs until another reminder of context switch like this one, many-body physical systems are always assumed to be moving on a connected continuous Riemannian manifold, involving no discrete dynamical variables or having all discrete dynamical variables frozen.
Derived Utility 2. (Maximal Property of Nodal Cells for Two-Fermion Systems)
The ground state of any fermionic Schrödinger system consisting of two identical fermions moving on a connected Riemannian manifold, when non-degenerate, has exactly one connected positive nodal cell.
Demonstration. When limited to systems consisting of two identical fermions moving on a connected Riemannian manifold, the exchange symmetry group has only two elements, one being even, the other being odd. Then it follows from Auxiliary Utility 3 that the ground state 2i)0 can only have two nodal cells, one being positive, the other negative, denoted by +(ψ0) and
−(ψ0) respectively.
For a general single-species fermionic Schrödinger system moving on a connected Riemannian manifold, the maximal property of ground state nodal cells, i.e., there is but one positive nodal cell, and consequently only one negative nodal cell, has been conjectured by Ceperley, who was also able to confirm the property numerically for systems consisting of up to 200 identical fermions moving on a two- or three-dimensional torus [5]. Later, Mitas provided a rigorous proof for certain classes of single-species fermionic systems [197, 198], particularly those of spin-polarized non-interacting fermions moving in a quadratic potential well or on a potential-flat torus, whose wavefunctions can be factored as a nodeless function multiplying an alternant [199], that is, the determinant of an alternant matrix of particle coordinate variables, thus enabling an analytical representation and rigorous determination of the nodal surface. It has remained open whether or when the maximal property of nodal cells holds for a general interacting single-species fermionic Schrödinger system, until now. As will be asserted in Auxiliary Utility 4 below, the maximal property of nodal cells does hold for the ground state of a single-species fermionic Hamiltonian H(1), if there exists a C1 curve {H(t)∈: t∈[0, 1]} in the vector space
, such that the ground state of H(t) is non-degenerate, ∀t∈[0, 1], and that of H(t)) satisfies the maximal property of nodal cells.
On the other hand, it can be shown that a system of multi-particle multi-species has more than one disconnected positive nodal cells in general. To that end, just consider a system consisting of S≥2 fermion species, with at least two species indexed by s=1, 2 having at least two identical fermions, where different species do not interact, so that the Hamiltonian is of a form H(t))=Σs=1S Hs, with each Hs, s∈[1, S] representing the total energy of the subsystem of all of the fermions of the s-th species. Furthermore, the ground state of each Hs, s∈[1, S] is assumed non-degenerate, so that ψ0(Hs) is a unique ray in the corresponding projective Hilbert space, and the unique ground state of the overall system of S-specie is a separated product ψ0(H(0))=Πs=1Sψ0(Hs). Pick one positive nodal cell for each ψ0(Hs′), and denote it by s+, ∀s′∈[1, S]. For each s∈{1, 2}, there is at least one nontrivial element, denoted as πs≠1, in the symmetry group Gs of exchanging ns≥2 identical fermions, ns∈
, so there must be at least one positive and one negative nodal cells, denoted as
s+ and
s− respectively, such that
and
Clearly, the product subspace Πs=1Ss+ is one positive nodal cell for the overall ground state ψ0(H0), and so is
such that
Let (, Vg) denote a σ-finite Lebesgue measure space, with
being a connected, locally compact Riemannian manifold, and
being a σ-algebra of Lebesgue measurable subsets of
, and Vg a Lebesgue measure associated with the Riemannian metric g on the manifold
. Let [θ]⊂
be the equivalence class of null sets with Vg-measure zero, and
be the quotient space of
modulo [θ]. For any pair of measurable sets E, F∈
, let [E], [F]∈
, denote the corresponding equivalence classes of sets, and d([E], [F])=Vg((E∩Fc)∪(Ec∩F)) denote the measure of symmetric difference, then (
, d) is a metric space, which is a separable topological space [66,67], and actually second-countable.
Definition 29. (σ-Continuity of Set-Valued Functions)
Let be a topological space. A set-valued function ƒ:
is called σ-continuous, if there exist a countable number of continuous set-valued functions ƒn:
, n∈N, such that ƒn(x) is a compact subset of
for each (n, r)∈(N×
),
ƒn(r)=
for each x∈
, and ƒ∩ƒn:
is continuous for each n∈N, where ƒ∩ƒn denotes a range-restricted function such that ∀x∈
,
Definition 30. (C1,2 Path and Piecewise C1,2 Path of Hamiltonians)
Let be a connected subset of
. A C1,2 path of Hamiltonians is a fermionic Schrödinger operator-valued curve {H(t)∈
: t∈
} in the vector space
over a connected Riemannian manifold
, possibly accompanied by a σ-continuous set-valued function
sin:
, such that the Gibbs kernel function
r|exp(−Base(H(t)))|q
is (t, r, q)-jointly C1,0,0 in
×
2, and ∀t∈
,
sin(t)⊂
is a submanifold with dim(
sin(t))<dim(
),
r|exp(−Base(H(t)))|q
is positivity improving and in L2(
2)∩C2,2((
\
sin(t))2), the ground state of H(t) is non-degenerate and placed at the energy level 0, while the operator Base(H(t)) substantiates the Hopf lemma, ∀t∈
. A piecewise C1,2 path of Hamiltonians is a concatenation (also known as product) of a finite number of C1,2 paths of Hamiltonians.
Let the index subset =∪k=1K
k be a union of consecutive intervals {
k}k=1K, K∈N, where the closures of adjacent intervals intersect at a single point on the real line, cl(
k)∩cl(
k+1)={ak}, ak∈
, ∀k∈[1, K−1]. For a piecewise C1,2 path of Hamiltonians {H(t)∈
: t∈
}, the Gibbs kernel function
r|exp(−H(t))|q
as a linear combination of exchange-symmetric versions of
r|exp(−Base(H(t)))|q
is by definition (t, r, q)-jointly C1,0,0 in each
k×
2, k∈[1, K], and (r, q)-jointly C2,2 in (
\
sin(t))2, ∀t∈
, ∀k∈[1, K]. It follows from equation (22) that the t-dependent, non-degenerate ground state ψ0(t; q)=ψ0(H(t); q) is in C1,0(
k×
), ∀k∈[1, K] and in C2(
\
sin(t)), ∀t∈
, ∀k∈[1, K].
Therefore, the preimage
is an open subset of ×
, a connected component of which containing a given point (t′, q′)∈
×
, denoted by
, is called a nodal tube containing the point (t′, q′). Let
(t′, q′)∈×
be a typical nodal cell of the ground state ψ0(H(t′)) of the instantaneous Hamiltonian H(t′) at the instant t=t′. Clearly,
t′q′ is contained in the nodal tube
, and is actually the intersection of the two submanifolds
⊆(
×
) and
It is said that the nodal cell t′q′ grows into the nodal tube
, or the nodal tube
is grown from the nodal cell
t′q′.
Similar to Auxiliary Utility 3, by considering the group action of Gex on nodal tubes, it can be shown that any nodal tube , (t′, q′)∈(
×
) is a Gex-tile of the product manifold
×
, namely, ∀π1, π2∈Gex, either π1
=π2
or π1
∩π2
=θ, what is more, ∪π∈G
=(
×
), Having a point (t′, q′)∈(
×
) fixed, consider a set-valued function
(⋅):
which sends each t∈
to
then the nodal tube is the graph of the function
(⋅), modulo [θ].
Auxiliary Utility 4. (One-to-One Correspondence between Nodal Cells and Nodal Tubes)
Given a piecewise C1,2 path of Hamiltonians
and an instant t′∈,
N∈ denote the set of all nodal cells associated with the instantaneous Hamiltonian H(t′), with each q(n)∈
, n∈[1, N] being contained in and indexing one unique nodal cell, then
lists all of the nodal tubes associated with the piecewise C1,2 path, and there exists a bijection g: {t′}
{
}, which commutes with all of the elements of the exchange symmetry group Gex, such that
and gπ=πg, ∀ρ∈Gex. Furthermore, for any fixed point (t′, q′)∈(×
), the set-valued function
(⋅) from
to the metric space (
, d) is σ-continuous, and
|ψ0(H(t); q)|2dVg(q) remains a constant fraction of
|ψ0(H(t); q)|2dVg(q), ∀t∈
.
Demonstration. Suffice it to show that all of the assertions hold true for a single piece of C1 path {H(t)∈: t∈[0, 1]} with H(⋅) being C1([0, 1]), such that the Gibbs kernel function
r|exp(−H(t))|q
is (t, r, q)-jointly C1,0,0 in [0, 1]×
2 and (r, q)-jointly C2,2 in (
\
sin(t))2, ∀t∈[0, 1], thus the never degenerate ground state ψ0(H(t); q) is (t, q)-jointly C1,0 in [0, 1]×
and C2 in
\
sin(t), ∀t∈[0, 1]. Given any t′∈[0, 1] and the accordingly defined {
t′,} and {
}, it is obvious that each nodal cell
t′q′∈{
t′} with q′∈
such that ψ0(H(t′); q′)≠0 corresponds to a nodal tube Tt′q′∈{
} through their common reference to the point (t′, q′) ∈[0, 1]×
, which defines a mapping g: {
t′}
. By the Gex-invariance of H(t) and |ψ0(H(t); ⋅)|, ∀t∈[0, 1], it is obvious that g commutes with any π∈Gex, namely, for each nodal cell
t′q′∈{
t′} that grows into a nodal tube g(
t′q′)=Tt′q′∈{
}, the π-transformed nodal cell π
t′q′ grows into the π-transformed nodal tube π
. To prove that g is bijective, it is sufficient to show that for any fixed point (t′, q′)∈(
×
), the set-valued function
(⋅): [0, 1]
, d) is σ-continuous and never vanishes over its entire domain, namely, Vg(
(t))>0, ∀t∈[0, 1].
For any δ>0, for any t∈[0,1], let
be a slightly blown-up open set enclosing the submanifold of singularity sin(t), with
Pick one point O∈ and call it an origin. Given any ϵ∈(0, 1), and for any t∈[0, 1], let δ(ϵ)>0 be chosen such that
let
be the smallest closed ball centered at the origin O with a radius R(ϵ)>0, such that, with
it holds that
Then the mapping ϵ(⋅): t∈[0, 1]
(t)∈
is a continuous function, and the union of
ϵ(t)-restricted nodal surfaces
is a compact submanifold, which is C2-smooth by the implicit function theorem and the C2-continuity of ψ0(H(t); ⋅). Therefore, the norm of the gradient, ∥∂0ψ0(H(t), q)∥, does not vanish on ∂(t) and attains a minimum
∀t∈[0, 1] by the premise of the operator Base(H(t)) substantiating the Hopf lemma. Moreover, the function ψ′min(⋅): [0, 1] is C1 ([0, 1]) and strictly positive, so ψ*ϵ=min{ψ′ϵ(t): t∈[0, 1]} exists and is positive.
For any fixed point (t′, q′)∈(×
), and with respect to a fixed ϵ∈(0, 1), the
ϵ(t)-restricted nodal tube
(t)∩
ϵ(t) of ψ0(H(t); ⋅) is a continuous set-valued function in t∈[0, 1], because its boundary ∂(
(t)∩
ϵ(t))⊆((∂
(t))∩
ϵ(t))∪(∂
ϵ(t)) varies continuously in t∈[0,1]. That ∂
ϵ(⋅) is in C1([0, 1]) follows straightforwardly from the C1,2-continuity and L2-integrability of ψ0(H(⋅), ⋅), while the coordinate q∈
of points on (∂
(t))∩
ϵ(t), t∈[0, 1] satisfies the level-set equation [200] ψ0(H(t), q(t))=0, and the differential version (∂ψ0/∂t)+(∂qψ0)·(dq/dt)=0, which yields an evolution equation for the t-dependence of the nodal surface [200],
and guarantees the C1-continuity of (∂(t))∩
ϵ(t), by the C1,2-continuity of ψ0(H(t); q) and the ψ*ϵ>0 lower-boundedness of ∥∂qψ0(H(t); q)∥. Therefore, the set-valued function (
∩
E) (⋅): t∈[0, 1]
(t)∩
ϵ(t)∈
is continuous, so are the integrals
|ψ0(H(t); q)|2 dVg(q) and
|ψ0(H(t); q)|2 dVg(q) continuous in t, ∀t∈[0, 1].
By Auxiliary Utility 3, for any fixed t∈[0, 1], the subset (t)∩
ϵ(t) tessellates the whole compact space
ϵ(t), under the homeomorphism action of the exchange symmetry group Gex, hence there exists an integer N(ϵ; t′, q′; t) ∈[1, |Gex|] such that
since |ψ0(H(t); ⋅)|2 is invariant under any particle exchange operation in Gex. To be simultaneously continuous and discrete-valued, the fraction in equation (31) can only be a constant 1/N0, with N(ϵ; t′, q′; t)=N0∈, ∀t∈[0, 1]. In the limit ϵ→0, it is necessary that
ϵ(t)→H
, in the sense that Vg(
\
ϵ(t))→0, ∀t∈[0, 1], otherwise, there would exist a t∈[0, 1] such that the Gibbs kernel function
r|exp(−Base(H(t)))|q
is not positivity improving. As ϵ approaches 0, the fraction in equation (31) remains the same constant, and the set-valued function
(⋅)∪
ϵ(⋅): [0, 1]
d) is always continuous. That establishes the bijectivity of the function g, for any predetermined t′∈[0, 1].
Derived Utility 3. (Nodal Maximality for Conventional Fermionic Schrödinger Systems)
Ceperley's conjecture on the nodal maximality, positing that any non-degenerate ground state has exactly one positive nodal cell, is true for a single-spices system consisting of n∈ identical spinless fermions moving in a smooth (d≥2)-dimensional Riemannian substrate space (
0, g0), where the physics is governed by a conventional Schrödinger operator H1=−Δg+V1(q∈
) on the configuration space
with the potential V1 being Kato-decomposable [84, 97, 139] and C2-smooth in \
sin, where
sin ⊂
is a submanifold with dim(
sin)<dim(
).
Demonstration. Construct a path of Hamiltonians {H(t): t∈[0, 1]} that is not only piecewise C1-continuous but actually piecewise analytic [146,153,154] with
where H0=−Δg+V0, with V0 being a bounded, C2()-smooth, and quasiconvex potential having a deep global minimum, such that H0=−Σi=1nd gii (∂/∂xi)2−Σi=1nd hi(∂/∂xi)+Σi=1nd kigiixixi+min(V0), in a small neighborhood covered by a local coordinate (x1, . . . , xnd) around the global potential minimum, where (gii)i=1nd and (gii)i=1nd are diagonalized representations of the Riemannian metric g and g−1 respectively, hi is a smooth function related to the Riemannian metric, ki>0 is a constant, while U0 is a t-analytic,
-diagonal, bounded, and C2(
)-smooth potential that vanishes at t=0 and t=½, and U1 is t-independent,
-diagonal, bounded, and C2(
)-smooth. The V0 potential can be designed to have the constants {ki: i∈[1, nd]} in the Hamiltonian H0 so large that all the other coefficients {gii, gii, hi: i∈[1, nd]} can be taken as constants in the neighborhood of arg min(V0), without any substantial change in the relevant low-lying eigenstates of H0, which essentially represents an ideal harmonic oscillator, whose fermionic Schrödinger ground state is non-degenerate. More specifically, with a local coordinate (x1, . . . , xnd)=((xυδ)δ=1d)υ=1n, for said neighborhood of arg min(V0)∈
explicitly manifesting its structure of Cartesian product, in that for each υ∈[1, n], the d-tuple (xυδ)δ=1d∈
0 represents the position of the υ-th artificially labeled identical fermion in the substrate space
0, one good choice of the parabolic potential is
with (g0,δδ)δ=1d being a diagonalized representation of the Riemannian metric g0 for the substrate space 0, and the positive constants (kδ)δ=1d satisfying n max(k11/2 g0,11, k21/2 g0,22)<kδ1/2g0,δδ, ∀δ∈[3, d], such that, among the d decoupled one-dimensional modes of harmonic oscillations, each along one orthogonal axis in the neighborhood on the substrate space
0, only two can be excited and occupied by the identical fermions, while the rest (d−2) modes are all frozen to their ground states. Furthermore, the constants are chosen to make sure that no integers n1 ∈[1, n], n2 ∈[1, n] exist to satisfy n1k11/2g0,11=n2k21/2g0,22, so the ground state of the n-fermion system cannot be degenerate. By definition, the ground state of H(1) is also non-degenerate. If suitable potentials U0 and U1 can be chosen such that the ground state of H(t) is never degenerate, ∀t∈[0, 1], then all of the premises in Auxiliary Utility 4 are fulfilled to assert that the ground state of H(1)=H1 has exactly the same number of positive nodal cells as that of H(0)=H0, which as a system of non-interacting identical spinless fermions moving in a parabolic potential is known to have Ceperley's conjecture hold true [197].
It is without loss of generality to assume being compact and V1 being in C2(
)∩L∞(
), because, otherwise, compact sets
as defined in the demonstration of Auxiliary Utility 4 can be identified with ϵ>0, δ(ϵ)>0, and a bounded C2()-smooth potential V′1 can be defined, which coincides with V1 in
ϵ(1) and arises quickly to a sufficiently large positive constant outside
ϵ(1), so to provide a potential barrier trapping a sufficient number of bound states. Let
As multiplicative operators, V′1 is bounded and V1 is (−Δg)-bounded with relative bound zero [84, 97, 201]. Therefore, U1 is relatively bounded and infinitesimally small with respect to both H0 and H1, as well as H′1. Since both H0 and H1 have a non-degenerate ground state as a discrete eigenvector, a sufficiently large radius of ϵ(1) can be chosen such that the original potentials V0 and V1 have exponentially attenuated the ground state wavefunctions [201, 202] effectively to below a sufficiently small ϵ>0 that restriction of the configuration space to the compact subspace
ϵ(1) does not change the physics as far as the low-lying energy states are concerned, at the same time, the perturbation due to
is so small that the ground state of the analytic path of Hamiltonians {H(t): t∈[½, 1]} stays isolated and non-degenerate throughout. Alternatively and specifically, while always enforcing wavefunction continuity in
ϵ(1)\∂
ϵ(1), a Dirichlet boundary condition on ∂
ϵ(1) can be suddenly turned on at t=0 and abruptly removed at t=1 without significantly affecting the low-lying energy states and their nodal structures, as guaranteed by the Courant-Fischer-Weyl min-max principle [195,196], since the low-lying eigenstates have their wavefunctions exponentially decayed and decaying outside
ϵ(1).
Therefore, it is only necessary to consider an analytic path of Hamiltonian
on a presumably compact configuration space , with
being an -diagonal potential, ∀t∈[0, ½]. The eigenvalues and eigenvectors of H′(t) can be parametrized into t-analytic curves, by a well-known theorem of Rellich [84, 203-205]. Each t-analytic curve of eigenvalues or eigenvectors will be referred to as a λ-curve or ψ-curve respectively. A t-continuous curve of ground state energy {E0(H′(t)): t∈[0, ½]} is obtained by taking the minimum among all of the λ-curves. Since V(t), t∈[0, ½] is bounded, the growth of |λ(t)−λ(0)| is bounded by a constant [84] for each λ-curve {λ(t): t∈[0, ½]}, and there can be only a finite number of λ-curves that ever come close to {E0(H′(t)): t∈[0, ½]}. Consequently, there can be no more than a finite number of occurrences of ground state degeneracy along the {E0(H′(t)): t∈[0, ½]} curve, because, otherwise, there would be two λ-curves whose difference, as another t-analytic function, has an infinite number of zeros within the bounded domain [0, ½], then, by the Bolzano-Weierstrass theorem, the sequence of zeros has a limit point, contradicting analyticity of the λ-curves.
Now it only remains to demonstrate that each of the finite number of isolated points of degeneracy along E0(H′(t)), t∈[0, ½] can be removed by an above-mentioned U0(t, ⋅), t∈[0, ½] potential on the compact Riemannian manifold . Around each point of degeneracy at t=t1∈(0, ½), within each interval (t1−δ1, t1+δ1)⊂(0, ½) for a sufficiently small δ1>0, there can be only a finite number of λ-curves {λi(t): i∈[0, m]}, m∈
intersecting at t=t1 and nowhere else, while the eigenvalues and associated eigenvectors can be represented as λi(t)=λi(t1)+O(|t−t1|), ψi(t)=ψi(t1)+O(|t−t1|), ∀i∈[0, m]. Since the nodal surface of each eigenvector ψi(t1), i∈[0, m] is compact and smooth, on which the continuous function ∥∂gψi(t1, q)∥ is strictly positive by the Hopf lemma [86-89], the union of all these nodal surfaces has a Vg-measure zero, and there must be an open ball B(q1, ϵ1)⊂
of radius ϵ1>0 about a certain q1∈
, within which each eigenvector ψi(t1), i∈[0, m] does not vanish, and can be made positive by adjusting its global numerical sign. It can be also assumed that B(q1, ϵ1) be covered by a chart with a local coordinate (x1, . . . , xnd) such that the Laplace-Beltrami operator reads Δg=Σi=1ndgii(∂/∂xi)2+Σi=1nd hi(∂/∂xi). Next, sufficiently small δ2∈(0, δ1) and ϵ2∈(0, ϵ1) can be found such that δ2+ϵ2<<min{ψi(t1; q1): i∈[0, m]}, and ψi(t; q)=ψi(t1; q1)+O(|t=t1|+∥q−q1∥) in
2×B (q1, ϵ2),
∀i∈[0, m]. Install a perturbative potential U0(t; q=q1+(xi))=C exp[−D(t−t1)2]exp(−Ex12) sin(Fx1), with positive real-valued constants C, D, E, F, both D and E being so large that U0(t; q1+(xi)) is essentially localized in 2×B(q1, ϵ2), while |F|/E being so large that U0(t; q) induces virtually no direct, first-order mixing among the ψ-curves {ψi(t): i∈[0, m], t∈
2}, but couples each of them, in essentially the same manner when δ2 and ϵ2 are sufficiently small, to a collection of highly excited Dirichlet eigenstates of H′(t), denoted as {ψi(t): j∈
⊆
}⊆Diri(
), where the coupling matrix element
ψj(t)|U0(t)|ψi(t)
˜
ψjt1)|U0(ti)
ψi(t1; q1) is mostly independent of i∈[0, m] and t∈
2, while the energy difference λj(t)−λi(t)˜λj(t1)˜g11F2 is largely a constant independent of i∈[0, m], j∈
, and t∈
2. The
-indexed high-energy states mediate interactions among the low-energy {ψi: i∈[0, m]} states through higher-order perturbations.
A standard series expansion up to the second-order perturbations [53, 57, 206] indicates that the perturbative coupling effect of all of the -indexed high-energy states combined is equivalent to that of a single effective state Ψ(q=q1+(xi))˜−sin(Fx1), q∈
, which may not be a strict eigenvector of H′(t), but behaves much like an energy eigenstate in that H′(t)Ψ˜g11F2Ψ, ∀t∈
2. Note that the effective state Ψ is taken to be independent of t, ignoring any small perturbative change, by virtue of the largeness of g11F2 and smallness of δ2. Restricted to the (m+2)-dimensional subspace spanned by {ψi(t): i∈[0, m]} and Ψ, ∀t∈
2, the total Hamiltonian H(t)=H′(t)+U0(t) is effectively H(t)˜H″(t) as
with
The Hamiltonian H″(t) is stoquastic, irreducible, and aperiodic, ∀t∈2, whose ground state is guaranteed non-degenerate by the Perron-Frobenius theorem [140-142]. Alternatively, a Schrieffer-Wolff transformation [207, 208] can be employed to project out the
-indexed high-energy states and obtain an effective Hamiltonian for the low-energy subspace spanned by the {ψi: i∈[0, m]} states, which should again manifest the non-degeneracy of the ground state of the effective Hamiltonian. In any case, it is demonstrated that the ground state of H(t) has avoided level crossing by the gradual onset of U0(t) in the concerned interval
2. Importantly, toward the end of
2, both λ0(H(t)) and ψ0(H(t)) return back to λ0(H′(t)) and ψ0(H′(t)) respectively, so that the same procedure can be repeated for the other isolated points {ti∈(0, ½): i∈[2, n′]}, m′∈
of degeneracy along {E0(H′(t)): t∈[0, ½]}, to have all of them removed eventually, rest assured that the overlap among the Gaussian-shaped impulses of the form C exp[−D(t−ti)2], i∈[1, m′] in the U0(⋅, ⋅) potential can be made negligibly small for all of the relevant considerations by making D>0 sufficiently large.
Definition 31. (Imaginary-Time Cω Path of Hamiltonians)
Let (, g) be a connected, locally compact Riemannian manifold as a many-body configuration space. Use the Sobolev space
as a Hilbert space of fermionic wavefunctions. An imaginary-time Cω path of Hamiltonians is a fermionic Schrödinger operator-valued curve {H()∈
L
∈
in the vector space , which generates an imaginary-time-inhomogeneous (ITI) Gibbs kernel function K (
; r, q) that is (
, r, q)-jointly analytic over the domain
×
2, denoted as K(
; r, q)∈Cω(
×
2), such that
where the Hamiltonian H(),
∈
is applied with respect to the r∈
spatial variable in equation (35), and ∀
∈
, the function K(
; ⋅, ⋅)∈L2(
2) is positivity improving, the operator Base(H(
)) substantiates the Hopf lemma and the strong extremum principle on any bounded and connected submanifold of M. An imaginary-time piecewise Cω path of Hamiltonians is a concatenation of a finite number of imaginary-time Cω paths of Hamiltonians.
Much of the concepts of nodal cells and the tiling and maximal properties of nodal cells apply to an imaginary-time Cω path of Hamiltonians, particularly one that is associated with a constant Hamiltonian H()=H0∈
, ∀
∈
[5], when the ITI Gibbs kernel function is simply the conventional K(
; r, q)=
r|exp(−
H0)|q
, ∀
∈
.
Definition 32. (Ceperley Reaches and Nodal Cells for Gibbs Kernel Functions)
With a reference point (0, q0)∈{
0}×
fixed, the Ceperley reach of (
0, q0), denoted by
0, q0), is the set of points (
1, q1)∈
0>0 having a reaching path (i.e., a continuous curve) γ: [0,1]×
such that γ(0)=(
0; q0), γ(1)=(
1; q1), and
for all s∈[0, 1] [5]. A slice of the Ceperley reach ×
at
∈
is denoted and defined as
while a segment of the Ceperley reach 1∈
and
2∈[
1, ∞) is denoted and defined as
For each ∈
, the preimage K−1(
:
\
is necessarily an open subset of , one connected component of which containing a given point q1∈
is called the nodal cell of K(
; ⋅, q0) around q1, denoted as
(K;
; q1). A nodal cell
(K;
; q1) is called positive when K(
; q1, q0)>0.
Clearly, each slice ) is a union of some positive nodal cells, for each
∈
. For a conventional fermionic Schrödinger Hamiltonian H=−Δg+V, Ceperley has proved that the Ceperley reach
, ∀q0∈
, using the analytical solvability of K(
; ⋅, q0) in the classical limit of
→0+, when the effect of potential energy becomes immaterial, in conjunction with the differential evolution of K(
; ⋅, q0) in
∈(0, ∞). His proof can be generalized straightforwardly to an imaginary-time Cω path of Hamiltonians H(
)=−Δ9+V(
),
∈(0, ∞), in association with a
-dependent potential V(
) that is lower bounded, as such, any slice
) must be the union of all positive nodal cells at
, ∀
∈(0, ∞). However, when it comes to the maximality of a nodal cell
(K;
; q1), K(
; q1, q0)≠0,
∈[
0, ∞),
0>0, a rigorous proof is still missing. The following Derived Utility 5 shall fill the gap.
Before that, it is well worth mentioning that because of the analyticity of K(⋅; ⋅, q0)∈Cω(×
), the closure of the Ceperley reach
),
∈
and its boundary δ
), as well as the closure of any nodal cell
(K;
; q1),
∈
, q1∈
(K;
; q1), are all examples of a semianalytic set
, which is amenable to a Whitney stratification [209-220], meaning that a filtration of closed subsets
exists, such that
either empty or an i-dimensional analytic submanifold for each i∈[0, m], where I(i) is an index set labeling the disjoint and connected components {Sij: j∈I(i)} of i\
i−1, each of which is called a strata of dimension i, the decomposition of
into strata
is locally finite and satisfies the so-called frontier condition, namely, any point in has a neighborhood intersecting no more than a finite number of strata, and cl(Sij)⊇dim(Sij)≥dim(Si′j′) must hold for any pair of strata Sij, Si′j′ such that cl(Sij)∩Si′j′=θ, moreover, for any triple (Sij, S′i′j′, x*∈
) such that x*∈Si′j′ ⊂cl(Sij), dim(Sij)=k∈
, with a sequence of points {xα: α∈
}⊆Sij converging to x*, and the associated sequence of tangent spaces {Tx
)} converging to T⊆
m, where Gr(k, Tx
)} represents the Grassmannian bundle of k-dimensional subspaces in Tx
, and the convergence is in the standard topology of said Grassmannian bundle, one of the two Whitney conditions must be satisfied: A) Tx
} in Si′j′ also converges to x*, with x′α≠xα, ∀α∈
, and the secants
, converging in the projective space
to a line
⊆
m, then
⊆T. It is known that the condition B implies A [213].
In particular, the topological boundaries ∂) and ∂
(K;
; q1),
∈
, q1∈
⊆
s which, according to Lojasiewicz's structure theorem for varieties [146, 211], has a Whitney stratification
=∪i∈[0,m]∪j∈I(i)
ij, m=dim(
) in a neighborhood Q around any point x*∈
⊆
, where, in a suitable chart, x* can be identified with the origin (0, . . . , 0)∈
m, the neighborhood can be appointed as Q=Πi=1m(−δi, δi), δi>0, ∀i∈[1, m], and a finite set of polynomials {Hik(xi, . . . , xi; xk): i∈[1, m], k∈[i+1, m]} can be chosen, with (x1, . . . , xi; xk)∈
i+1, the discriminant Dik(x1, . . . , xi) for each Hik(x1, . . . , xk) viewed as a univariate polynomial of xk being non-vanishing in
each root xk=ξ of the univariate polynomial Hik(x1, . . . , xi; ⋅) satisfying |ξ|<δk, ∀(x1, . . . , xi)∈Qi, such that, each stratum ij is an algebraic variety that in an open subset Qij ⊆Qi ⊆
i can be analytically parametrized by a system of (m−i) equations as
where each χijk is an analytic function in Qij, the subsets {Qij} and functions {χijk} are non-redundant, in the sense that, for any triple of indices (i, j, j′), either Qij ∩Qij′=θ or Qij=Qij′, and in the latter case, for any k∈[i+1, m], either χijk≡χij′k on Qij or χijk(x1, . . . , xi)≠xij′k (x1, . . . , xi) for all (x1, . . . , xi)∈Qij.
In any case, each of the submanifolds ,
(
),
(K;
, q1) or the boundaries ∂
, ∂
(
), ∂
(K;
, q1),
∈
, q1∈
is guaranteed to behave sufficiently regular and much like a smooth manifold with smooth boundaries in a sufficiently small neighborhood of any q∈
, so that the classical Morse theory [221-226], in particular, the stratified Morse theory [223] and the Morse theory for manifolds with boundaries [226], can be employed to track topological changes of a (sub)manifold by analyzing a differentiable scalar function on the (sub)manifold. Our proof of Derived Utility 5 below may not necessarily invoke any deep results of Morse theory directly and explicitly, but the basic idea and methodology being used have much in common.
Auxiliary Utility 5. (Nodal Maximality for Fermionic Density Matrices)
With respect to a fixed reference point (0, q0)∈
Cepesley's conjecture on the maximality of any nodal cell (K;
i; q1), K(
1; q1, q0)≠0, q1∈
holds true for the ITI Gibbs kernel function K(⋅; ⋅, q0) associated with an imaginary-time Cω path of Hamiltonians {H(
):
∈
} as defined in Definition 31, once it starts at
=
0 with the nodal cell
(K;
0; q0) being maximal.
Demonstration. Since the Ceperley reach ⊆
×
is maximal, each slice
(
) is the union of all positive nodal cells of the ITI Gibbs kernel function K(
; ⋅, q0)∈L2(
)∩Cω(
), ∀
∈
. The subset
has only 1 positive nodal cell, ∀∈[
0,
*)} is nonempty because
(K;
0; q0) is maximal. Assume that
is upper-bounded to use reductio ad absurdum. Then
<∞ exists, and there must be a point (1, q1)∈{
1}×
(
1) at which a negative nodal cell starts to nucleate within a surrounding positive nodal cell, namely, on the one hand, ∃δ>0 such that no open set V exists to satisfy q∈V⊆
, K(
1−δ; q, q0)<0 for all q∈V, on the other hand, ∀δ>0, there always exists an open set V⊆
, V
q1 such that K(
1+δ; q, q0)<0 for all q∈V. Consider the analytic function K(⋅; ⋅, q0)∈Cω(cl(U)) in a small open set U⊆
×
, U
(
1, q1), and an analytic variety
=U∩∂
as a collection of strata. Clearly, U can be chosen sufficiently small, such that
∩{
=
1}θ, and Vg (
(
1)∩
)=0, otherwise, the absurdity of K(
1; ⋅, q0)≡0, thus K(⋅; ⋅, q0)=0, would follow.
Also obviously, K(1; q1, q0)=0, ∂
K(
=
1; q1, q0)≤0. If ∂
K(
=
1; q1, q0)<0, then by the continuity of K(⋅; ⋅, q0), U can be chosen sufficiently small such that OTK(
; q1, q0)<0 for all (
, q)∈U. Otherwise, if ∂
K(
=
1; q1, q0)=0, let k≥1 be the smallest integer such that ∂
kK(
=
1; q1, q0)≠0, then according to the Weierstrass-Malgrange preparation theorem [146, 211, 227, 228], the function K(⋅; ⋅, q0) can be represented as
K(t;x,q0)=c(t,x)[Σi=0k−1tiai(x)+tk], (38)
locally in U, with local coordinates t=−
1∈
, x=(x1, . . . , xm)∈
m, m=dim(
), where c(⋅, ⋅) and {ai(⋅)}i=0k−1 are all analytic functions such that ai(0)=0, ∀i∈[0, k−1], and c(t, x)>0, ∀(t, x)∈(t, x)[cl(U)], with
in
the ∂tK(t=0; x, q0)=[∂tc(t, x)/(c(t, x)]K(0; x, q0), ∀x∈x(U∩). Let
then ∂tK(t=0; x, q0)−E1K(0; x, q0)≤0, ∀x∈x(U∩{=
1}); Otherwise, it follows from
∩{
<
1}=θ and Vg(
(
1)∩
)=0 that a0(⋅) and a1(⋅) must be respectively of the forms
where κ≥1,
is a multi-index,
a0,k≥0 for all k and Σ|k|=κa0,k>0, both b0(⋅) and b1(⋅) are analytic in x(U), b0(x)>0 for all x∈x(U), a1,k must also vanish whenever a0,k=0. Therefore, |a1(x)|/a0(x) is bounded on x[cl(U)]. Let
then it holds true that ∂K(
=
1; q, q0)−E0K(
1; q, q0)≤0, ∀q∈U∩{
=
i}.
Now, consider the ITI Gibbs kernel function
which corresponds to a constant energy-shifted Cω path of Hamiltonians {H()+E0:
∈
}, and shares virtually the same analytical properties as well as exactly the same Ceperley reach and nodal structures with K (
; q, q0), (
, q)∈
×
. In particular, the set
and (1, q1) is exactly the same point at which a negative nodal cell starts to nucleate within a surrounding positive nodal cell of K′(⋅; ⋅, q0). It follows from the analyses above that
[H(1)+E0]K′(
1;q,q0)=−∂
K′(
=
1;q,q0)≥0,∀q∈U∩{
=
1}. (41)
On the other hand, K′(1; ⋅, q0) attains a non-positive minimum over cl(
) with K′(
1; q1, q0)=0 at q1∈U, the strong version of the Hopf extremum principle dictates that K′(
1; ⋅, q0)≡0 over cl(U), which contradicts the premise of Vg(
(
1)∩
)=0. It must be concluded that the set
≡
′ is unbounded, and Ceperley's conjecture is true.
Derived Utility 4. (Nodal Maximality for Conventional Fermionic Density Matrices)
Let K(⋅; ⋅, q0)∈Cω(×
),
be the ITI Gibbs kernel function associated with an imaginary-time Cω path of conventional fermionic Schrödinger operators {H()=−Δg V(t, q∈
):
∈
}, where Δg is the Laplace-Beltrami operator on the configuration space
, V(⋅, ⋅)∈Cω(({0}∪
)×
), and V(
, ⋅)∈L∞(
) is an
-diagonal potential, ∀
∈{0}∪
. Then, Ceperley's conjecture holds true for the maximality of any nodal cell
(K;
; q1), K(
; q1, q0)≠0, q1∈
,
∈
, with respect to a fixed reference point q0∈
.
Demonstration. The proposition follows straightforwardly from Auxiliary Utility 5, when a 0>0 sufficiently and arbitrarily small is chosen such that, based on the standard Feynman-Kac theory of path integrals [95-97], the Cω path of conventional fermionic Schrödinger operators {H(
):
∈[
0, ∞)} generates a (
, q)-jointly analytic ITI Gibbs kernel function K(⋅; ⋅, q0)∈Cω ([
0, ∞)×
), which fulfills all of the conditions in Definition 31 as premises for Auxiliary Utility 5, also ensuring the nodal maximality for K(
0; ⋅, q0).
It is interesting to note that Derived Utility 4 provides an alternative demonstration for Derived Utility 3, asserting the nodal maximality for the promised non-degenerate ground state of a conventional fermionic Schrödinger Hamiltonian H∞=−Δg+V(q∈), with V(⋅) being
-diagonal and Kato-decomposable, because such H∞ can be easily made the limit of an imaginary-time Cω path of conventional fermionic Hamiltonians {H(
)} as detailed in Derived Utility 4.
The essence of Auxiliary Utility 5 and Derived Utility 4 is to affirm that, on any boundary surface of a Ceperley reach or imaginary-time nodal tube, there exists no point on which the normal vector of said boundary surface runs parallel to the imaginary time axis. For a conventional fermionic Schrödinger Hamiltonian on a connected Riemannian manifold , with respect to a fixed reference point (
0, q0)∈{0}×
, Ceperley has posed an interesting question on whether two points (
1, q1) and (
1, r1) in
×
that fall in the same nodal cell of the Gibbs wavefunction K(
1; ⋅, q0) must also lie in the same Ceperley reach of (
0, q0) [5]. The question is now answered in the affirmative by virtue of Auxiliary Utility 5 and Derived Utility 4.
Context Switch Notice: From this point onwards, many-body physical systems are assumed to be moving on a general continuous-discrete product configuration space
as default, unless explicitly stated otherwise.
It is useful to remind that, in the context of a PLTKD Hamiltonian H=Σk=1K⊗i=1n
as a configuration space, where K∈, nk∈
for all k∈[1, K], S∈
, ns∈
for all s∈[1, S], there is an hki-moved and an hki-fixed factor spaces, denoted as
respectively that are associated with each individual FBM interaction hki for a given k∈[1, K] and a given i∈[1, nk]. Consequently, ∀k∈[1, K], there is an Hk-fixed factor space
associated with the FBM tensor monomial
in conjunction with a complementary Hk-moved factor space
such that =εk⊕εk⊥, namely, ∀q∈
, there is a unique ODSD q=uq,k⊕uq,k⊥ with
Definition 33. (Node-Determinate FBM Tensor Polynomials and Monomials) With respect to a Gibbs wavefunction of concern, an FBM tensor polynomial P is called ϵ-almost node-determinate, or said to possess a property of node-determinacy with an error probability upper-bounded by ϵ≥0, when either 1) the concerned Gibbs wavefunction is
the numerical sign of whose point values can be uniquely and efficiently determined on each of the εP-cosets {εP⊕u⊥: u⊥∈εP⊥, P moves εP ⊕u⊥}, except for a subset (P)⊆
of a measure that is less than ϵ times the volume of
; or 2) the concerned Gibbs wavefunction is the ground state ψ0(P) of P, which, on each of the εP-cosets {εP⊕u⊥: u⊥∈εP⊥, P moves εP ⊕u⊥}, either 2.1) is one-dimensional (namely, the ground state is non-degenerate); or else 2.2) has an orthonormal basis of ground state eigenvectors
gmul(P,0)]}, such that the subset
called the set of node-uncertain configuration points, has a |ψ0(P)|2-measure that is upper-bounded by ϵ, namely, Σϕ∈ψ|ϕ(q)|2 dVg(q)≤ϵ.
Specifically, an FBM tensor monomial M=⊗i=1n is called ϵ-almost node-determinate when it is so as an FBM tensor polynomial. Associated with an ϵ-almost node-determinate FBM tensor monomial M, the set
is called the subset of node-certain configuration points. More specifically, a single FBM interaction h is called ϵ-almost node-determinate when it constitutes an ϵ-almost node-determinate FBM tensor monomial M=h.
Obviously, for any finite ∈(0, ∞) and ∀q0∈
, if P is efficiently computable, then the Gibbs wavefunction
q|exp{−
[P−λ0(P)]}|q0
, q∈
is efficiently computable, the numerical sign of whose point values can be uniquely and efficiently determined, ∀q∈
, so P is straightforwardly node-determinate. It is only in the large-
limit, when the Gibbs wavefunction of P reduces to a set of degenerate ground states, that the node-determinacy of P becomes non-trivial.
Cast in the language of statistical decision theory or hypothesis testing [229,230], the set of ground state eigenvectors {ψ0,l(P): l∈[1, gmul(P, 0)]} of a tensor polynomial P constitutes a statistical model comprising a number gmul(P, 0) of hypotheses or candidate probability distributions that are parametrized by an index l∈[1, gmul(P, 0)], each of which corresponds to a probability density function |ψ0,l(P; q)|2 of generating a random q∈εM ⊕M⊥ that is measured or observed. Given an observation or measurement q∈εM ⊕εM⊥, the task of statistical decision making or hypothesis testing is to determine an optimal estimate for the parameter l∈[1, gmul(P, 0)], namely, which of the gmul(P, 0) candidates {ψ0,l(P): l∈[1, gmul(P, 0)]} is most likely the underlying probability distribution, where optimality is with respect to minimization of error (also known as the loss function [229, 230]). A tensor monomial P being ϵ-almost node-determinate means that a decision rule exists, based on which decisions can be made with an expected error bounded by ϵ. The significance of an FBM tensor monomial M being ϵ-almost node-determinate is that, ∀q∈(M), it is polynomially efficient to decide q falls into the support of which of the ground state eigenvectors {ψ0,l(M): 1≤l<gmul(M, 0)}, and further determine what value the ground state eigenvector assumes at q, with a probability of failure upper-bounded by ϵ.
Some illustrative examples may help to gain intuition and appreciation of node-determinate FBM tensor monomials and polynomials. To that end, it might be the simplest, by way of example but no means of limitation, to consider a prototypical system consisting of n∈ mutually distinguishable particles moving in a discrete product space
={0, 1}n, where each particle labeled by an i∈[1, n] amounts to a canonical rebit, with two Pauli matrices (or called operators)
generating a Banach algebra of operators acting on the two-dimensional Hilbert subspace span(|0i, |1
i). Let
1={i1, . . . , im}⊆[1, n] and
be a set partition of the indices of the rebits, and
be the corresponding Cartesian factorization of the configuration space. Let
denote the tensor product of the two-dimensional Banach algebras associated with the rebits indexed by 1. Any pure tensor
with ϵl∈{0, 1}, ∀l∈[1, m], is clearly a node-determinate FBM tensor monomial, because its ground states are all coordinate eigenstates that surely does not overlap in the space. To be utterly specific, for any given configuration point
there is at most one unique ground state in the set ψ0(M(ϵ1, . . . , ϵl)) that does not vanish at q, which can be easily identified by a (Zi, that is, on the coordinate eigenstate |q
, so to obtain a bit array (
q|Zi
, . . . ,
q|Zi
=(si
measures the {0, 1} coordinate of the i-th rebit, such that, for any i∈[1, n], q|Zi+|q
=
s1 . . . sn|Zi+|s1 . . . sn
=
si|Zi+|st
=si∈{0, 1}. Furthermore, if j∈
2 is a typical index, and Uj=Xj sin θ+Zj cos(θ), θ∈
is a general single-rebit operator acting on the j-th rebit, then a
1-diagonally controlled FBM operator M1⊗Uj, with
λ0(±M1)=−1, is also node-determinate, because the operator moves only one rebit, and its ground states in the basis {ψ0(M1)⊗ψ0(−Uj)}∪{ψ0(−M1)⊗ψ0(Uj)} have absolutely non-overlapping supports in the configuration space ={0, 1}n, since ψ0(−Uj) and ψ0(Uj) are each non-degenerate, and every eigenstate in the potentially degenerate manifold ψ0 (M1)∪ψ0(−M1) is a
-coordinate eigenstate that is distinguished from others in the manifold by its different support in
.
Also, let ={1, 2},
2=[1, n]\
1,
be the corresponding Cartesian factorization of the configuration space, consider the FBM projection operators
and a controlled FBM operator
with M2∈
space (Ii, Zi) being a 2-diagonal FBM tensor monomial such that λ0(±M2)=−1. It turns out that both X12+=½(I+X12) and X12−=½(I−X12) are node-determinate, because the operators have the famous Bell states as their ground states, namely,
√{square root over (2)}ψ0(X12+)={|+−+|−+
,|+−
−|−+
}={|00
−|11
,|10
−|01
}, (42)
√{square root over (2)}ψ0(X12−)={|+++|−−
,|++
−|−−
}={|00
+|11
,|10
+|01
}, (43)
such that, for either X12+ or X12−, the two basis ground states in either ψ0(X12+) or ψ0(X12−) are absolutely non-overlapping in the {0, 1}2={00, 01, 10, 11} configuration space, and the two of them can be easily told apart by performing a parity Z12 measurement jointly on the two rebits, with the parity operator
(mod 2), such that s1s2|Z12|s1s2
=s1+s2 (mod 2), ∀(s1, s2)∈{0, 1}2. Moreover, the operator Hxxz=X12⊗M2 is node-determinate, because the ground states in ψ0(Hxxz)={ψ0(X12)⊗ψ0(−M2)}∪{ψ0(−X12)⊗ψ0(M2)} have non-overlapping supports in the
space and can be told apart unequivocally by a (Z12, {Zi+: i∈
2})-joint measurement on any given configuration point q∈
that is within the support of ψ0(Hxxz), namely, the unique member wavefunction in the set ψ0(Hxxz) that does not vanish at a given configuration coordinate q=(s1, s2, {si: i∈
2})∈
can be easily identified by performing a (Z12, {Zi+: i∈
2})-joint measurement on the point q, that is, by measuring the parity Z12 on the rebits 1 and 2 jointly, and Zi+-measuring each of the i-th rebit, ∀i∈
2.
Similarly and straightforwardly, let 1={i1, . . . , im}⊆[1, n] and
be a set partition of the indices of the rebits, and
be the corresponding Cartesian factorization of the configuration space, then within the subsystem supported by the configuration space 1, the multi-rebit-simultaneously-flipping operator
η=±1 is node-determinate, with
lists an orthonormal basis of ground state functions, whose supports are mutually non-overlapping. Given any configuration point q1=(s1, . . . , sm)∈1, there is a unique ground state ψ0, s1 . . . sm (Xi
q1|ψ0, s1 . . . sm (Xi
span(Ii, Zi) with λ0(±M2)=−1 be a general 2-diagonal tensor monomial that is not necessarily FBM, then it is easily verified, and will be rigorously proved shortly in the following, that the operator Xi
More generally, with the same 1={ii, . . . , im}⊆[1, n],
and a general 2-diagonal tensor monomial M2∈
with λ0(±M2)=−1 as introduced immediately above, then any classical reversible computation [231-234] performed on the computational basis states associated with the subsystem supported by
1 amounts to a particular type of self-inverse operator F1, known as a classical reversible gate array [34] (CRGA) operator, for which the tensor monomial F1⊗M2 is also guaranteed to be node-determinate. Specifically, a CGRA operator F1 is associated with a bijective function F: {0,1}m
{0, 1}m, which maps each (s1 . . . sm)∈{0, 1}m to a unique F((s1 . . . sm)∈{0, 1}m. Therefore, F partitions the set {0, 1}m into 2m−1 F-equivalence classes, each F-equivalence class containing exactly two elements: an (s1 . . . sm)∈{0, 1}m, together with its F-image F(s1 . . . sm)∈{0, 1}m. Let {0, 1}m/F denote the quotient space, that is, the set containing the 2m−1 F-equivalence classes. Then the operator F1 can be represented as
with
being two and the only two eigenstates of the self-inverse operator F1, corresponding to each {(s1 . . . sm), F(s1 . . . sm)}∈{0, 1}m/F. Clearly, both F1 and −F1 have a node-determinate manifold of ground states. Consequently, the tensor monomial F1⊗M2 is node-determinate, as will be proven below.
Definition 34. (Doubly Node-Determinate FBM Tensor Monomials)
An FBM tensor monomial M is called doubly ϵ-almost node-determinate or doubly node-determinate up to ϵ>0, when both λ0(M)<0 and λ0(−M)<0, and orthonormal bases
exist on each of the εM-cosets {εM ⊕u⊥: u⊥∈εM⊥, M moves εM⊕u⊥}, such that the subset
Definition 35. (Universally Node-Determinate FBM Tensor Monomials)
An FBM tensor monomial M is called universally ϵ-almost node-determinate, or universally node-determinate up to ϵ>0, when on each of the εM-cosets {εM⊕u⊥: u⊥∈εM⊥, M moves εM ⊕u⊥}, a suitable choice of all of its eigenfunctions always exists and forms an orthonormal basis
such that the subset
The notion of a universally node-determinate FBM tensor monomial may seem a little peculiar, but it encapsulates and is epitomized by a class of FBM polynomials that are built from configuration coordinate measuring/projection operators, which are straightforwardly computable and universally node-determinate, yet endow tremendous computational power.
Auxiliary Utility 6. (Node-Determinate FBM Tensor Monomials of the First Kind)
If an FBM tensor monomial M=U⊗V is a tensor product of two FBM tensor monomials U and V supported by configuration spaces U and
V respectively, where U is doubly ϵ-almost node-determinate, ϵ>0, while V is either 1) doubly non-degenerate in the sense that both λ0(V) and λ0(−V) are negative and non-degenerate eigenvalues, or
) doubly ϵ-almost node-determinate, then, respectively, M is either 1) ϵ-almost node-determinate, or
) 2ϵ-almost node-determinate, and called a node-determinate FBM tensor monomial of the first kind in either case.
Proof. Depending on whether λ0(U)λ0(−V)>λ0(−U)λ0(V), or λ0(U)λ0(−V)<λ0(−U)λ0(V), or λ0(U)λ0(−V)=λ0(−U)λ0(V) holds, the ground eigenspace of M may be ψ0(M)=ψ0(U)⊗ψ0(−V), or ψ0(M)=ψ0(−U)⊗ψ0(V), or ψ0(M)={ψ0(U)⊗ψ0(−V)}∪{ψ0(−U)⊗ψ0(V)} respectively. No matter which is the case, the ground eigenspace has an orthonormal basis ψ0 (M)={ϕU⊗ϕV: ϕU∈ψ0(U)∪ψ0(−U), ϕV∈ψ0 (V)∪ψ0(−V)}, where each member wavefunction ϕU ⊗ϕV does not overlap with others, in the configuration space U×
V excluding a subset with a |ψ0(M)|2-measure no more than ϵ or 2ϵ corresponding to the cases of V being either doubly non-degenerate or doubly ϵ-almost node-determinate.
Auxiliary Utility 7. (Node-Determinate FBM Tensor Monomials of the Second Kind)
An FBM tensor monomial M=⊗i=1mVi, m∈, with each Vi, i∈[1, m] being an FBM tensor monomial, is mϵ-almost node-determinate and non-negative such that M≥0, if all but one of the m FBM tensor monomials {Vi: i∈[1, m]} are universally ϵ-almost node-determinate, ϵ>0, with the only one possible exception being at least ϵ-almost node-determinate, and all of the FBM tensor monomials are non-negative such that Vi≥0, ∀i∈[1, m]. Such an operator M is called a node-determinate FBM tensor monomial of the second kind.
Proof. Suffice it to just demonstrate the basic case of m=2 that M=V1⊗V2 is (ϵ1+ϵ2)-almost node-determinate and non-negative such that M≥0, if V1, V2 are two FBM tensor monomials supported by configuration spaces 1 and
2 respectively, such that V1 is universally ϵ1-almost node-determinate, ϵ1>0, and V2 is ϵ2-almost node-determinate, ϵ2>0, furthermore, V1≥0, V2>0. The ground eigenspace of M has a basis
ψ0(M)⊆{ψn(V1)⊗ψ0(V2): n∈,n>0}∪{ψ0(V1)⊗|q
: q∈
2},
whose member wavefunctions do not overlap with each other over the configuration space 1×
2 excluding a subset with a |ψ0(M)|2-measure less than ϵ1+ϵ2. Non-negativity of M is obvious. Evidently, the same are also true for the FBM tensor monomial M′=V2⊗V1.
Although by no means exhausting all of the ways that FBM tensor monomials can become node-determinate, doubly and universally node-determinate FBM tensor monomials as well as node-determinate FBM tensor monomials of the first and the second kinds do encapsulate a large set of node-determinate FBM tensor monomials. By means of direct sums, they also combine into a large collection of node-determinate FBM tensor polynomials.
Definition 36. (Ground state Frustration-Free Hamiltonian)
A partial Hamiltonian H on a configuration space of size
is called ground state frustration-free (GFF), when it is PLTKD in the form H=Σi=1J Hi, J∈, J=O(poly(N)) and has a non-degenerate ground state ψ0(H), that is separated from all of the excited states by an energy gap sized as Ω(1/poly(N)), where each Hi is O(1/poly(N))-almost node-determinate and has ψ0(H) as its ground state or one of its ground states, ∀i∈[1, J]. Each such additive partial Hamiltonian Hi, i∈[1, J] is called GFF-compatible with respect to H.
The ground state ψ0(H) of any efficiently computable GFF partial Hamiltonian H=Σi=1J Hi can be efficiently simulated through Markov chain Monte Carlo (MCMC) using a Markov transition matrix [ψ0(H; r)]r exp[−τBase(H)]|q
[ψ0(H; q)]−1, (r, q)∈
×
, r and q being restricted to within the same nodal cell
⊆
, where ψ0(H; r)/ψ0(H; q)=ψ0(Hi; r)/ψ0(Hi; q) must hold true for a certain i∈[1, J], and the quotient can be computed efficiently for all (r, q)∈
×
such that
r|exp[−τBase(Hi)]|q
≠0, by the premise of H being GFF.
Definition 37. (Directly Frustration-Free Hamiltonian)
A partial Hamiltonian H on a configuration space of size
is called directly frustration-free (DFF), when it is a direct sum of partial Hamiltonians in the form H=Σi=1J Hi, J∈, J=O(poly(N)), with the energy gap between the ground state and the excited states Ω(1/poly(N)) lower-bounded for H and every Hi, i∈[1, J], where each Hi, i∈[1, J] has 0 as the smallest eigenvalue and moves an O(log(N))-sized configuration subspace
i, which is annihilated by any other Hj, j∈[1, J], j≠i, namely, Hjϕi=0 holds true, ∀ϕi∈L2(
i), so long as j≠i. Each such additive partial Hamiltonian Hi, i∈[1, J] is called DFF-compatible with respect to H.
Any Gibbs operator exp(−τH), τ∈(0, ∞] of any efficiently computable DFF partial Hamiltonian H=Σi=1J can be efficiently simulated by an identity exp(−τH)=I+Σi=1J[exp(−τHi)−I], due to the premise that each Hi, i∈[1, J] only moves an O(log(N))-sized configuration subspace, which is annihilated by any other Hj, j∈[1, J], j≠i, consequently, HiHj=HjHi=0 holds true so long as j≠i. In particular, when τ>0 is sufficiently large, with respect to any fixed q0∈, the Gibbs wavefunction
⋅|exp(−τH)|q0
is essentially a ground state of H, which can be efficiently simulated using either a constrained path integral method [5, 168, 169, 173, 174] or a fixed-node diffusion method [5, 163, 166, 167, 188-191], whose nodal cells can always be efficiently and uniquely determined, even though the partial Hamiltonian H may have degenerate ground states.
In a constrained path integral method, a Feynman slab Slab(0,
L, H, τ), L∈
, also regarded as a Feynman substack consisting of the single Feynman slab, is divided into a number 1+L of Feynman slices, including the first with
0≅
and the last with
L≅
, and a Feynman path {qm}m∈[0,m] undergoes a MCMC random walk in the space {q0}×Πm=1M
m, with
m≅
for all m∈[1, M], subject to the constraint of no crossing nodal surfaces, and following a Metropolis-Hastings [235,236] or Gibbs [237-240] strategy of importance sampling, which uses the product of Gibbs transition amplitudes assigned to said Feynman path as a density proxy to the probability distribution of Feynman paths, where the density proxy needs only to be proportional to the probability density of Feynman paths, with a proportionality constant fixed but unknown.
In a fixed-node diffusion method, with respect to any fixed q0∈0≅
and at a sufficiently large τ>0, the Gibbs transition amplitude
can be efficiently computed for all qL ∈L≅
, L∈
, such computed numerical values for the wavefunction ϕ0(qL), qL∈
L, which is necessarily a ground state of H, can be used to guide an MCMC random walk of qL over
L with Metropolis-Hastings or Gibbs importance sampling using |ϕ0(qL)|2 as a density proxy, so to generate random samples of qL∈
L according to the probability density |ϕ0(qL)|2/|ϕ0(qL)∥2. It is usually desired to keep qL within a fixed nodal cell of ϕ0(⋅), and there is no difficulty to locate the nodal surfaces when the function ϕ0(⋅) itself is efficiently computable. It may be further desired to keep qL within the Ceperley reach with respect to the reference point q0∈
0, whose boundary surface is still not hard to locate for the Feynman slab Slab(
0,
L, H, τ).
Definition 38. (Separately Frustration-Free Hamiltonian)
A partial Hamiltonian H that generates a Gibbs operator exp{−t[H−λ0(H)]} of interest at a fixed t∈(0, ∞] is called separately frustration-free (SFF), when it is a sum of partial Hamiltonians as H=Σk=1K Hk, K∈, K=O(poly(N)),
with each shifted partial Hamiltonian Hk−λ0(Hk), k∈[1, K] being either DFF or GFF, and the Gibbs operator exp{−t[H−λ0(H)]} can be simulated, to within an error no more than O(1/poly(N)), by iterating a sequence of Gibbs operators {exp{−τ[Hk−λ0(Hk)]}: k∈[1, K]}, τ∈(0, ∞] for no more than O(poly(N)) times.
In one exemplary embodiment, where t∈(0, ∞) is finite, the Gibbs operator exp{−t[H−λ0(H)]} is approximated by iterating a Lie-Trotter-Kato product Πk=1K exp{−[τ(Hk−λ0(Hk)]}, τ=t/m for m∈ times, with m=poly(N) being sufficiently large for a sufficiently accurate approximation. In another exemplary embodiment, where t is so large that the Gibbs operator exp{−t[H−λ0(H)]} is essentially the same as the ground state projector |ψ0(H)
ψ0(H)|, τ is also chosen to be large, and each Gibbs operator exp{−τ[Hk−λ0(Hk)]} is applied in turn to effect a ground state projector |ψ0(H)
ψ0(Hk)|, ∀k∈[1, K], and a process of iterating a sequence of ground state projectors {|ψ0(Hk)
ψ0(Hk)}k∈[1, K] mixes rapidly and converges to the projector |ψ0(H)
ψ0(H)|.
Auxiliary Utility 8. (Homophysical Images of DFF, GFF, and SFF Partial Hamiltonians)
Let : (
)
(
) be a homophysics. If an FBM tensor monomial h∈
1 is (doubly/universally) ϵ-almost node-determinate, then
(h)∈
2 is also (doubly/universally) ϵ-almost node-determinate correspondingly. Therefore, if H∈
1 is DFF, GFF, or SFF, then so is
(H)∈
2 respectively.
Demonstration. The Proposition Follows Straightforwardly from Definitions.
Definition 39. (DFF-FS, GFF-FS, SFF-FS Partial Hamiltonians and Systems)
A partial Hamiltonian is called DFF fermionic Schrödinger (DFF-FS), GFF fermionic Schrödinger (GFF-FS), or SFF fermionic Schrödinger (SFF-FS), when it is both fermionic Schrödinger and DFF, GFF, or SFF, respectively. A quantum system is called SFF-FS when its governing Hamiltonian is SFF-FS.
Auxiliary Utility 9. (Path Rectification for Feynman Stacks of SFF-FS Hamiltonians)
For an SFF-FS partial Hamiltonian H=Σk=1K Hk on a configuration space of size
with K∈, K=O(poly(N)), in conjunction with any q0∈
and any
∈(0, ∞], the Gibbs wavefunction
qK|exp{−
[H−λ0(H)]}|q0
, qK∈
K≅
associated with a Feynman stack made of K Feynman slabs {Slab(
k−1,
k, Hk,
)}k∈[1,K] can be computed efficiently, to within an error no more than O(1/poly(N)), by integrating over rectified Feynman chains only, and using a constrained path integral method to simulate the Feynman slabs, when all {Hk}k∈[1,K] are DFF.
In case some Hk, k∈[1, K] are GFF, then the ground state ψ0(H) can be efficiently simulated, to within an error no more than O(1/poly(N)), by integrating over rectified Feynman chains only, using a fixed-node diffusion method to simulate each Feynman slab associated with a GFF partial Hamiltonian, and using either a fixed-node diffusion method or a constrained path integral method to simulate each Feynman slab associated with a DFF partial Hamiltonian.
Proof. It is without loss of generality and does not change physics to have all of the FBM tensor monomials Hk, k∈[1, K] affine transformed so that λ0(Hk)=0, ∀k∈[1, K]. Just as in equation (29), let
and {(Ck≅, Hk): k∈[0, K]} be the boundary and intermediate Feynman slices, such that ∀k∈[1, K], the k-th Feynman slab starts at (
k−1, Hk−1) and ends at (
k, Hk). Then, ∀qK∈
K, the Gibbs wavefunction for the product of Gibbs operators Πk=1K exp(−
Hk) associated with said Feynman stack reads
where
∀k∈[1, K]. The Gibbs wavefunction is computed by integrating contri-butions of Feynman chains {Bun(tk−1, qk−1; tk, qk): k∈[1, K]} over the intermediate configuration coordinates {qk}k∈[1,K).
It turns out sufficient to consider and integrate over rectified Feynman chains only, because ∀k∈[1, K), a fermion exchange operator πk∈Gex acting on a qk∈k introduces a sign(πk) factor to both the Gibbs wavefunction associated with the k-th Feynman slab and that with the (k+1)-th Feynman slab, yielding a sign(πk)2=+1 factor without real effect to equation (45), while for k=K, a fermion exchange operator πK∈Gex acting on a qK∈
k simply reflects qK between the symmetric positive and negative nodal cells, where a restriction of the Gibbs wavefunction to a positive nodal cell is sufficient to determine and derive all of the quantum physics.
If all {Hk}k∈[1,K] are DFF, then every fermionic Gibbs transition amplitude qk|exp(−
Hk)|qk−1
,
∈(0, ∞] associated with any Feynman bundle Bun(tk−1, qk−1; tk, qk) can be computed efficiently by a boltzmannonic path integral for the Gibbs operator exp(−tHkB) over all of the Feynman paths that do not cross any nodal surface of the Gibbs wavefunctions {
q(t)|exp(−tHk)|qk−1
: (t, q(t))∈[0,
]×
}, which is a procedure well known as the constrained path integral method [5, 168, 169, 173, 174]. The Gibbs wavefunctions and their nodal surfaces are efficiently computable, because each Hk is DFF, whose Gibbs wavefunctions with respect to any fixed qk−1∈
k−1 has an O(log(N))-bounded dimensionality.
When some Hk, k∈[1, K] are GFF, the Gibbs operator exp(−H)=PF exp[−
Base(H)] needs to take the large-
limit and reduces to the ground state projector. A
=Θ(poly(N)) can be chosen and be sufficiently large, since the ground states are separated by a polynomial-sized gap from the excited states. For each Feynman slab Slab(
k−1,
k, Hk,
), k∈[1, K], let qk−1∈
k−1 be a fixed configuration point and
⋅|exp(−
Hk)|qk−1
be a Gibbs wavefunction of interest, which is essentially the same as the ground state ψ0(Hk; ⋅), that can be simulated efficiently using the fixed-node diffusion method by Auxiliary Utility 2 and Derived Utility 1, with a Markovian transition matrix [ψ0(Hk; rk)]
rk|exp(−
HkB)|qk
[ψ0(Hk; qk)]−1, where qk∈
k and rk∈
k are in the same nodal cell that contains a point πqk−1, π∈Gex is an even permutation. The quotient ψ0(Hk; rk)/ψ0(Hk; qk) is efficiently computable, V(rk, qk)∈
k×
k, since Hk is either DFF or GFF. Obviously, if Hk, k∈[1, K] is in fact DFF, then the ground state
⋅|exp(−
Hk)|qk−1
can be simulated efficiently using a constrained path integral method as well, since the Gibbs wavefunction
⋅|exp(−tHk)|qk−1
is efficiently computable for all t∈(0,
],
=Θ(poly(N)).
Besides the fermionic Schrödinger Hamiltonians/systems, another class of Hamiltonians/systems that is useful for computations and simulations consists of FBM tensor polynomials as a sum of FBM tensor monomials that are essentially bounded.
Definition 40. (Essentially Bounded Partial Hamiltonian)
Given a partial Hamiltonian H on a Hilbert space and an ϵ>0, an ϵ-essential computational subspace for H is a Hilbert subspace
⊆
, onto which H is restricted to become
such that
|ψ0(H)|ψ0(H′)
|2≥(1−ϵ)
ψ0(H)ψ0(H)
ψ0(H′)|ψ0(H′)
, (46)
(1−ϵ)≤[λ1(H′)−λ0(H′)]/[λ1(H)−λ0(H)]≤(1+ϵ). (47)
A Lie-Trotter-Kato decomposable partial Hamiltonian H=Σk=1K Hk∈), K∈
is said to be ϵ-essentially bounded by Λ, ϵ>0, Λ>0, if an ϵ-essential computational subspace
⊆
exists for H and {Hk}k∈[1,K] such that the restriction of Hk to
is bounded by Λ, namely, the operator norm ∥Hk
∥≤Λ, ∀k∈[1, K].
Definition 41. (DFF-EB, GFF-EB, SFF-EB Partial Hamiltonians and Systems)
A partial Hamiltonian is called DFF essentially bounded (DFF-EB), GFF essentially bounded (GFF-EB), or SFF essentially bounded (SFF-EB), when it is both essentially bounded and DFF, GFF, or SFF, respectively. A quantum system is called SFF-EB when its governing Hamiltonian is SFF-EB.
Auxiliary Utility 2 and Auxiliary Utility 9 imply that the efficient classical Monte Carlo simulata-bility extends to any SFF-EB Hamiltonian H=Σk=1K Hk, because, for any >0, the Gibbs operators
k∈[1, K] can be similarity-transformed into quasi-stochastic operators
k∈[1, K], by the same diagonal operator associated with the same wavefunction ψ0(H), where each operator P(Hk) fixes the probability distribution |ψ0(H)|2, while K−1 Σk=1K P(
Hk) attenuates the excited states of H, therefore, repeated random applications of the operators {P(
Hk)}k∈[1,K] as transition probability matrices, with
>0 taking a value that is sufficiently large, form a Markov chain that projects out |ψ0(H)|2 as the stationary distribution.
More specifically, with +∞ asymptotically, ∀k∈[1, K], the Gibbs operator G(
Hk) approaches a limit
exponentially fast, where Πk is the orthogonal projection onto the null space of Hk, i.e., its ground state Hilbert subspace, which has {ψ0,1(Hk)}l∈gmul(H(H) of node-uncertain configuration points has an asymptotically negligible |ψ0(H)|2-measure, such that ∀k∈[1, K],
where
and (q∈supp(ψ0,l(Hk))) is binary-valued to either 1 or 0 depending on whether or not q falls into the support of ψ0,l(Hk), ∀l∈gmul(Hk). Furthermore, for any excited state of H represented by a wavefunction
n>0, the inequality ψn|exp(−
H)|ψn
/
ψn|ψn
≤exp(−
λ1)<1−
λ1/2 holds, ∀
∈(0, 1/λ1],
Since H=Σk=1K Hk is an essentially bounded Lie-Trotter-Kato decomposition into non-negative operators, ∀∈(0, 1/λ1], for any excited state ψn=ψn(H), n>0, there exists an m∈
such that
[Πk=1K exp(−Hk/m)]m≥[Πk=1K(I−
Hk/m)]m≥(I−Σk=1K
Hk/m)m≥I−Σk=1K
Hk,
ψn|[Πk=1K exp(−
Hk/m)]m|ψn
<(1+
λ1/6)
ψn|exp(−
H)|ψn
<(1−
λ1/3)
ψn|ψn),
consequently, ψn|I−Σk=1K
Hk|ψn
<(1−
λ1/3)
ψn|ψn
, and further
ψn|Σk=1K(I−Πk)|ψn
≥
ψn|η−1Σk=1KHk|ψn
<(λ1/3Λ)
ψn|ψn
,
with
Therefore, for any excited state ψn=ψn(H)>0, the inequality
ψn|K−1Σk=1KP(+∞Hk)|ψn
=
ψn|K−1Σk−1KΠk|ψn
<(1−λ1/3KΛ)
ψn|ψn
(49)
holds, meaning that the transition probability matrix K−1 Σk=1K P(+∞Hk), representing random applications of matrices {P(+∞Hk)}k=1K, attenuates any excited state of H rapidly.
In practical numerical computations as well as complexity analyses, the real or imaginary time axis is often discretized so that physical and Markov chain dynamics are represented discretely, where the discretization error can be made arbitrarily small efficiently by using the Lie-Trotter-Kato product formulas, especially the higher-order symmetrized versions of them [177-187]
In the following I shall present, by ways of example and by no means of limitation, several Monte Carlo algorithms using the Metropolis-Hastings [235, 236] or the Gibbs [237-240] method of importance sampling to simulate the ground states of SFF Hamiltonians, which take advantage of Auxiliary Utility 9 and overcome the dreaded sign problem. Even though the exemplary algorithms are mostly illustrations of simulating ground states, from and by virtue of Auxiliary Utility 9, one skilled in the art will see straightforwardly that the same methods and algorithms can be applied or adapted slightly to simulate any Gibbs kernel at a finite imaginary time, also known as a thermal Green's function or a thermal density matrix, associated with an SFF partial Hamiltonian, so long as the SFF partial Hamiltonian is a sum of all DFF partial Hamiltonians.
Some of the algorithms are exhibited in a framework with the configuration space fully discretized, while some others are presented in the mathematical language of an all-continuous configuration space. As being noted previously, no loss of generality is incurred by choosing any such specific configuration space, because an actual or model physical system with either all-discrete or all-continuous dynamical variables can be made universal, in terms of both numerical simulations for or based on it and the computational power of it as a computing machine. Furthermore, methods and steps as specified in the algorithms can be combined and generalized straightforwardly to handle the case of a continuous-discrete product configuration space with ease.
For any multi-species many-body system of concern in this specification, even if some of the physical dynamical variables are intrinsically continuous and conventionally represented by canonical coordinates on Riemannian manifolds s, s∈[1, S] as substrate spaces, such continuous substrate space can still be approximated by a set of grid/lattice points sampled at a desired resolution, that serve as vertices to form a connected graph
s, s∈[1, S], where each vertex is connected with no more than a fixed number of neighbors, by an edge representing the kinetics of a particle hopping between vertices. Taking advantage of its natural Cartesian product structure, the configuration space of such a multi-species many-body system is approximated and represented nicely by a Cartesian product graph
with which and for any FBM interaction h, the notions of h-moved and h-fixed factor spaces (now subgraphs) εh and εh⊥, h-moved and h-fixed εh-cosets, h-induced ODSDs and its node-determinacy, the notions of a DFF or GFF Hamiltonian and its local node-determinacy, all carry over straightforwardly.
When it is clear that a product graph
as a discrete configuration space is the result of spatial discretization from a bounded Riemannian manifold as a continuous-discrete product configuration space =Πs=1S
sn
can inherit from
the measures of configuration space dimension dim
diameter diam()=diam(
), and computational size size
Otherwise, without any reference to a continuous-discrete product configuration space being made, a finite product graph
can be assigned with estimated measures of configuration space dimension dim
diam-ter diam()=(Σs=1S diam(
s)2)1/2, and computational size size
where for each s∈[1, S], the diameter diam(s) of each
s is defined as diam
having a path formed by n consecutive edges}. Such estimated measures may not be exactly the same as what might be inherited from a continuous-discrete product configuration space, but they are surely within a constant multiplier, and suffice for the purpose of asymptotic analyses, when size() grows as the result of an increasing S∈
.
On a continuous submanifold ⊆
of a general configuration space
, a nodal cell is an open subset of
, for which topological connectedness is equivalent to path-connectedness. When
is discretized, the nodal surfaces are no longer located precisely, rather they fall mostly between pairs of adjacent lattice points at which the ground state wavefunction assumes different signs. As the lattice resolution increases, simulation errors due to the location uncertainty of the nodal surfaces, which does not exceed half of the grid size, can be made arbitrarily small. Furthermore, the so-called “lever rule” for fixed-node diffusion QMC of lattice fermions [192, 193] with a DFF or GFF Hamiltonian H=Σk=1K Hk can pinpoint the nodal surfaces more accurately, and the introduction of a “nodal boundary potential” [192, 193] can represent the effects of ψ0(H; r∈
) nodal surface crossings exactly without error.
With a properly discretized configuration space , the Hilbert space L2(
;
) of quantum states and linear operators on L2(
;
),
∈{
}, all have finite dimensions, so all quantum physics are described and solved by matrix algebra, saving much mathematical technicality of functional analysis. Particularly, in such lattice formulations, all operators as finite matrices are bounded, and their polynomial Lie-Trotter-Kato decompositions follow straightforwardly from Lie's product formula, as a trivial consequence of the Baker-Campbell-Hausdorff formula. Also, a spatially local Hamiltonian becomes a matrix that couples each lattice point with no more than an O(dim(
)) number of nearby neighbors. Recall that the graph-theoretic concept of connectedness is also based on path connections. So path-connectedness is generally applicable to continuous, discrete, as well as continuous-discrete product configuration spaces. Furthermore, a quantitative notion of path-connectedness will prove handy in computational complexity considerations.
Definition 42. A bounded and connected subset of a configuration space
, with
being either continuous as a Riemannian manifold or discrete as a graph, is called polynomially path-connected, if any two points in
is connected by a path in
whose length is O(poly(size(
))), where the length of a path is measured either through the Riemannian metric or in terms of the number of graph edges, respectively.
Polynomial path-connectedness may necessarily hold for any nodal cell of a quantum system on a suitably regular configuration space, especially when the Hamiltonian has a polynomially gapped ground state. But let's save a rigorous treatment for a dedicated mathematical discourse elsewhere, and simply posit the property when necessary in this specification. Let Γ(M) denote the directed graph associated with a finite matrix M [141]. The irreducibility of M is equivalent to the fact that Γ(M) is strongly connected. When M is primitive, i.e., aperiodic irreducible non-negative [141, 142], the polynomial boundedness of its index of primitivity γ(M) is tantamount to the polynomial path-connectedness of Γ(M), namely, that any ordered pair of vertices of Γ(M) can be connected by a directed path whose length does not exceed the polynomially bounded integer γ(M). With a self-adjoint matrix, directedness is immaterial for the associated graph, because any directed arc is always accompanied by another oppositely directed arc between the same pair of vertices.
It may be useful to note that, the term sign graph has been proposed [242] to denote a maximally connected subgraph of vertices on which a discrete eigen wavefunction ψ∈L2() assumes the same numerical sign, as opposed to the term nodal cell (or domain) in the case of a continuous configuration space. And, as far as a conventional boltzmannonic Schrödinger operator is concerned, possibly as the base boltzmannonic Hamiltonian of a fermionic Schrödinger operator, there have been theorems proved regarding the number of sign graphs in relation to the index of an energy eigenstate [242], in close analogy to the famous theorem of Courant [195] for a Schrödinger operator on a continuous configuration space.
On a bounded discrete configuration space , any DFF or GFF Hamiltonian H=Σk=1K Hk is discretized into a necessarily finite and tensor-sparse matrix, with tensor-sparsity referring to the property of H being the sum of a K=O(poly(size(
))) number of FBM tensor monomials Hk, k∈[1, K], each of which being a tensor product of an O(poly(size(
))) number of FBM interactions, with each FBM interaction coupling any lattice point in g to at most an O(log(size(
))) number of neighbor lattice points. By contrast, the dimensions of H, being a |
|×|
| matrix, can be much larger, as |
| can be an exponential function of size(
). Consider discrete Gibbs operators
with M∈{H, Hk, k∈[1, K]},
∥Base(Hk)∥: k∈[1, K]})−1, β∈[0, 1), and
τ≥0, M∈{H, Hk, k∈[1, K]}. Let
be the graph associated with the Hamiltonian H, and Γ+(H) be the subgraph of Γ(H) with all “ground state sign changing” edges removed, namely, Γ+(H) inherits all vertices of Γ(H) but keeps only those edges of Γ(H) whose two end vertices see the ground state ψ0(H) assuming the same sign. For each lattice point q∈\{r∈
: ψ0(H; r)=0}, let Γ(H; q) be the set of points that are connected from q by an edge in Γ(H), that is
or equivalently, Γ(H; q)={r∈: ∃k∈[1, K],
r|G1(Hk)|q
≠0}. Further, let Γ+(H; q) be the subset of Γ(H; q) containing only the points on which the ground state of H assumes the same sign as on q, namely,
Finally, let Γ*(H; q) be the subset of vertices containing q and all that can be reached from q via a connected path in Γ+(H). Evidently, Γ*(H; q) is exactly the discrete equivalent of the nodal cell (H; q).
Recall that (H) represents the set of node-uncertain configuration points for any DFF or GFF Hamiltonian H. Only it is noted that when H is a finite matrix supported by a finite graph
,
(H) is a subset of the vertices of
. On any subgraph
c(H)∩Γ(H; q0), with H being DFF or GFF,
a classical random walk can be defined in association with H, as represented by a transition probability matrix
with [ψ*0(H)]=diag({ψ*0(H; q): q∈c(H)∩Γ(H; q0)}) being a diagonal matrix,
when H is DFF-FS or GFF-FS, while
when H is SFF-EB, where
∀k∈[1, K], such that ∀q, r∈c(H), the transition probability
in which the Iverson brackets [q∈supp(ψ0(H))]Iver and [q∈supp(ψ0,l(Hk))]Iver are 0 or 1 binary-values to exclude starting points outside of a ground state. In case H is both DFF-FS and DFF-EB, or both GFF-FS and GFF-EB, either G□(H)=G1(H) or G□(H)=K−1 Σk=1K G*(+∞Hk) can be used in conjunction with the corresponding transition probability matrix. Starting from q0∈c(H), ψ0(H; q0)≠0 and following the transition probability matrix QH, a walker traverses a subset of configuration points
that is called the range of QH from q0. It follows from Auxiliary Utility 2 and equation (48) that the matrix QH is bona fide stochastic and defines an irreducible Markov chain on Range(QH; q0)⊆, which shares the same spectrum with the Gibbs operator G□(H), with the stationary distribution (stationary probability vector) of QH being the stationary distribution of G□(H) point-wise squared, much similar to that in the well-known quantum-stochastic mapping for boltzmannonic stoquastic Hamiltonians [156, 243, 244]. The Markov chain defined by the primitive (i.e., aperiodic irreducible non-negative) matrix QH on Range(QH; q0) is reversible and bound to converge to the unique stationary distribution |ψ0(H; q∈Range(QH; q0))|2.
Here goes one Monte Carlo algorithm for sampling the ground state of a discretized DFF-FS or GFF-FS Hamiltonian H=Σk=1K Hk, K∈ defined on a discrete, bounded configuration space
, with each Hk, k∈[1, K] being either DFF or GFF, which solves the computational problem of simulating a many-body Hamiltonian as specified in Definition 18 using a fixed-node diffusion type of approach, when the system is known to be DFF-FS or GFF-FS.
Algorithm 1. (Monte Carlo Simulation of a DFF-FS or GFF-FS Hamiltonian)
Pr(r|q)=Σl=1gmul(Hr|G1(Hk)|q
ψ0,1(Hk;q)−1×[q∈supp(ψ0,l(Hk))]Iver;
Another class of discrete-configuration-space-based algorithms randomize the applications of Gibbs operators
k∈[1, K] in association with the FBM tensor monomials of a Hamiltonian H=Σk=1K Hk according to H being DFF or GFF and FS or EB, while still adopting a fixed-node diffusion strategy.
Algorithm 2. (Monte Carlo Simulation of a DFF or GFF and FS or EB Hamiltonian)
Pr(r|q)=Σi=1gmul(Hr|G1(Hk)|q
ψ0,l(Hk;q)−1×[r∈Γ+(Hk;q)]Iver×[q∈supp(ψ0,1(Hk))]Iver
Pr(r|q)=Σi=1gmul(H
A variation of Algorithm 2 is applicable when the DFF or GFF and FS or EB Hamiltonian H=Σk=1K Hk is such that the FBM tensor monomials Hk, k∈[1, K], up to operator isomorphisms, all belong to a prescribed collection of a polynomially bounded number of FBM Hamiltonians, called template Hamiltonians, ∀q=uq,k ⊕uq,k⊥∈, uq,k∈εk, uq,k⊥∈εk⊥. The variation also illustrates, by way of example but no means of limitation, Monte Carlo samplings without a fixed-node diffusion strategy.
Algorithm 3. (Monte Carlo Simulation of a DFF or GFF and FS or EB Hamiltonian)
Pr(r|q)=Σi=1gmul(Hr|G1(Hk)|q
ψ0,l(Hk;q)−1×[r∈Γ(Hk;q)]Iver×[q∈supp(ψ0,1(Hk))]Iver
Pr(r|q)=Σi=1gmul(H
On a continuous configuration space , a general state space Markov chain [245-247] in association with a DFF or GFF Hamiltonian H=Σk=1K Hk can be defined for a discrete time random walk with a quasi-stochastic operator
constructed according to Auxiliary Utility 2, where
∈c(H)}) is
-diagonal,
with a scaling factor ϵ=Ω(1/poly(size(H))), ϵ>0 or
respectively depending upon H being DFF or GFF and FS or EB, with
k∈[1, K], such that ∀q, r∈c(H), the transition probability
Either G□(H)=G*(ϵH) or G□(H)=K−1Ek=1K G*(+∞Hk) may be used, in conjunction with the corresponding quasi-stochastic operator, when H is both FS and EB. Starting from q0∈c(H) such that ψ0(H; q0)≠0, and following the quasi-stochastic operator QH, a walker traverses a subset of configuration points
that is called the range of QH from q0. It is clear that the operator QH is primitive, i.e., aperiodic irreducible non-negative, and shares the same eigenvalues with G□(H), further has the corresponding eigenvectors {ψ0(H; q)ψn(H; q)}n≥0 simply related to those eigenvectors {ψn (H; q)}n≥0 of G□(H) [156, 243, 244]. In particular, the irreducible Markov chain is also reversible with respect to the ground state probability distribution, thus has |ψ0(H; ⋅)|2 as its unique stationary distribution.
The following is one Monte Carlo algorithm for a DFF-FS or GFF-FS Hamiltonian H=Σk=1K Hk on a continuous configuration space , using a fixed-node diffusion type of approach.
Algorithm 4. (Monte Carlo Simulation of a DFF-FS or GFF-FS Hamiltonian)
Pr(r|q)=Σi=1gmul(Hr|G*(Hk/mKb)|q
ψ0,l(Hk;q)−1×[q∈supp(ψ0,l(Hk))]Iver;
Another class of continuous-configuration-space-based algorithms randomize the applications of Gibbs operators
with a polynomial small scaling factor ϵ>0 or respectively
k∈[1, K] in association with the FBM tensor monomials of a DFF or GFF and FS or EB Hamiltonian H=Σk=1K Hk, and apply the so-called Gibbs sampling method to form a random walk that follows a series of conditional distributions [237-240] corresponding to the individual FBM tensor monomials of H, while still adopting a fixed-node diffusion strategy.
Algorithm 5. (Monte Carlo Simulation of a DFF or GFF and FS or EB Hamiltonian)
Pr(r|q)=Σi=1gmul(Hr|G*(ϵHk)|q
ψ0,l(Hk;q)−1×[r∈
(Hk;q)]Iver×[q∈supp(ψ0,1(Hk))]Iver
Pr(r|q)=Σi=1gmul(H
A variation of Algorithm 5 is useful when the concerned DFF or GFF and FS or EB Hamiltonian is a sum of FBM tensor monomials {Hk}k=1K, all of which belong to a prescribed collection of a polynomially bounded number of template Hamiltonians. The variation also illustrates, by way of example but no means of limitation, Monte Carlo samplings without a fixed-node diffusion strategy.
Algorithm 6. (Monte Carlo Simulation of a DFF or GFF and FS or EB Hamiltonian)
Pr(r|q)=Σi=1gmul(Hr|G*(ϵHk)|q
ψ0,l(Hk;q)−1×[r∈
c(Hk;q)]Iver×[q∈supp(ψ0,1(Hk))]Iver
Pr(r|q)=Σi=1gmul(H
In Algorithms 4 through 6, when dealing with a DFF-FS or GFF-FS Hamiltonian, Gibbs operators of the form
need to be computed, where M=⊗i=1n hi is an FBM monomial which moves a submanifold =Πi=1n εi as a Cartesian product of low-dimensional factor spaces, which, according to equations (16) and (17), requires the computation of W*(h) for a fermionic Schrödinger FBM interaction
that moves a low-dimensional
factor space
i∈[1, m], m=O(1)∈[1, n]. Such a Gibbs operator W*(h) can be computed directly either by solving the eigenvalue problem of h on the domain ε, or via a subroutine evaluating the Feynman-Kac path integral [172, 176] that corresponds to h on ε, subject to the proper boundary condition. Alternatively, each Gibbs operator W*(h) of any FBM interaction h can itself be estimated through an MCMC subroutine, where W* (h) is broken up into many small-step Gibbs operators as W*(h)|=Πj=1N W*(h/N), with N∈sufficiently large though still polynomially bounded, such that each small-step Gibbs operator W*(h/N) is highly localized in space, meaning that there exist positive constants c1, c2, c3 such that
r|W*(h/N)|q
<exp(−c1∥r−q∥c
Yet another alternative, applicable to an FBM interaction with a conventional Schrödinger operator h|Diri(ε)=−Δg+Vε(q) restricted to an open domain ε, where the potential Vε(q) rises up steeply near ∂ε, from being finite-valued within ε to being infinity-valued outside, in order to enforce the Dirichlet boundary condition, implements a single step of random walk described by a transition probability Pr(r|q)=ψ0(r)r|G*(h)|Diri(ε)|q
ψ0(q)−1 by mapping the Schrödinger-Dirichlet eigenvalue problem exp(−h|Diri(ε))ψ(q)=e−λψ(q), ∀q∈ε, λ∈
, while ψ(q)=0 for all q≠ε, to a Fokker-Planck-Kolmogorov eigenvalue problem exp(−L|Diri(ε,1))ψ0(q)ψ(q)=e−(λ−λ
, while ψ0(q)ψ(q)=0 for all q≠ε, with
characterizing the ground state of h|Diri(ε), through a quantum-stochastic operator similarity transformation [166, 167, 248-253] in the same spirit of Auxiliary Utility 2, where
is a Fokker-Planck-Kolmogorov operator generating a pure drift-diffusion process without branching and killing [254], and can be simulated by standard and stable MCMC [166, 167, 252]. It is obvious that the semigroup exp(−L|Diri(ε,1))=[ψ0]× exp[−(h|Diri(ε)−λ0)]×[ψ0]−1 of the Fokker-Planck-Kolmogorov drift-diffusion process does precisely realize the desired quasi-stochastic operator [ψ*0(h)]G*(h)|Diri(ε) [ψ*0(h)]−1 associated with G*(h)|Diri(ε).
Constrained path integral methods [5, 168, 169, 173, 174] provide another class of Monte Carlo algorithms for efficiently simulating DFF or GFF Hamiltonians, and the closely related reputation quantum Monte Carlo [170, 171, 175] is rightly suitable for simulating an adiabatically varying process in association with a sequence of Hamiltonians that change gradually in time. Given a DFF or GFF Hamiltonian H=HC+HD: LF2LF2(
), that is CD-separately irreducible and supported by a continuous-discrete product configuration space
an adiabatic sequence of DFF or GFF Hamiltonians {H′: LF2LF2(
), l∈[0, L], L∈
} may be constructed, in which ψ0(H′0) is easily solved, H′L=H, and the Hamiltonians vary slowly from H′0 to H′L to satisfy conditions of the adiabatic theorem, such that the Gibbs operators {exp(−
H′l): l∈[0, L]}, with a predetermined
>0, progressively and recursively project out a series of ground states {ψ0(H′l): l∈[0, L]} from the starting point ψ0(H′0), in the sense that
ψ0(H′l)|exp(−
H′l)ψ0(H′l−1)
=∥ψ0(H′l−1)∥×[1+O(
2/L2)], much similar to the quantum search algorithm that uses repeated measurements to drive an adiabatic evolution of ground states [261]. For each l∈[0, L], let LTK(H′l) denote a set of FBM tensor monomials that Lie-Trotter-Kato decompose H′l, such that
for an m=O(poly(max{size(H′l): l∈[0, L]})), m∈, and fixed constants c1>0, c2>0. Then let {Ht: t∈[0, T],
denote the ordered sequence of FBM tensor monomials collecting all of the m-fold repeated Lie-Trotter-Kato decompositions of {H′l: l∈[0, L]}, such that the continued products of the Gibbs operators {exp(−H′l): l∈[0, L]} are well approximated by the continued products of the Gibbs operators {exp(−
Ht/m): t∈[0, T]}, which progressively and recursively project out a series of ground states
where ∀t∈[0, T], the function LTK−1(t) returns the unique index l∈[0, L] such that Ht is there as an FBM tensor monomial for the Lie-Trotter-Kato decomposition of H. Given such a projected ground state ψ0(H)=Πt=0T exp(−Ht/m)|ϕ0
, the expected value of any operator O: LF2
LF2(
) can be computed as
The basic idea of PIMC in the configuration coordinate representation is to work with a cylinder set C2T+1={(q2T, . . . , qT, qT−1, . . . , q0): qt∈, ∀t∈[0, 2T]}, insert a resolution of the identity I=∫Cdqt|qt
qt| to the immediate left of either the operator exp(−
Ht/m) for each t∈[0, T] or the operator exp(−
H2T−t+1/m)+ for each t∈[T+1, 2T], such that
where shorthand notations are used with
while ∀q∈, |q
=const×Σπ∈G
) approximating an antisymmetric configuration coordinate eigenvector, and for each πq=r∈
, |r) denotes a Dirac delta distribution as a limit of functions in L2(C) approximating a boltzmannonic state that has artificially labeled particles, of the same or different species, being localized and positioned around the configuration point r∈
.
With each
nt∈, t∈[0, T] being an FBM tensor monomial, and most of the FBM interactions {hti: i∈[1, nt]} being orthogonal projection operators, the fermionic Gibbs transition amplitude
∀(r, q)∈2, ∀
>0,
=O(1) can be computed as in equation (16) from the transition amplitudes associated with the individual FBM interactions. For each i∈[1, nt], it is perfectly feasible to have the FBM interaction hti exactly diagonalized in the hti-moved factor space
from which, either the transition amplitude ri|hi|qi
, when hi is an orthogonal projection operator, or else the Gibbs transition amplitude
ri|exp(−
hi)|qi
, can be easily computed for any (ri, qi)∈
i2. Note that, when hi is a projection operator, computing the Gibbs transition amplitude
ri|exp(−
hi)|qi
for any
>0 such that
+
−1=O(poly(size(H))) is completely equivalent to computing
ri|hi|qi
, by virtue of the operator identity exp(−
P)=(e−
−1)P for all P such that P2=P. Alternatively, each fermionic Gibbs transition amplitude
ri|exp(−τhi)|qi
, with hi being an FBM interaction, can be evaluated by straightforwardly summing up an O(poly(size(H))) number of boltzmannonic transition amplitudes for all of the exchange permutations of the configuration (ri, qi), where, for each such permuted configuration (r′i, q′i)∈
i2, a boltzmannonic transition amplitude can be computed approximately by taking a direct “Rayleigh flight”, either along the geodesic line segment from q′i to r′i, when q′i, r′i∈
×{v} for some v∈
, and
r′i|exp[−τBase(HC)]|q′i
≠0, or through a single HD-hop, when q′i, r′i∈{x}×
for some x∈
, and
r′i|Base(HD)|q′i
≠0, provided that τ/m>0 is sufficiently small. Still alternatively, using the Hubbard-Stratonovich transformation [262, 263], every Gibbs operator exp(−τhti/m) with a small τ/m>0 can be converted into a product of single-particle Gibbs operators, each of which is free of inter-particle interactions, but may involve an individual particle subject to an external auxiliary random field, such that the product of single-particle Gibbs operators under a fixed auxiliary random field configuration is analytically solvable, and the Gibbs operator exp(−τhti/m) can be computed as an expectation value of said product against the probability distribution of the auxiliary random fields [164, 165]. In any case, equation (53) is numerically implemented by approximating both the denominator and the numerator on the right hand side as a sum of quantum amplitudes associated with zigzags henceforth called Feynman paths, each of which corresponds to a unique point in the cylinder set and is of the form
where each qt∈ is called the configuration coordinate of the t-th time slice, ∀t∈[0, 2T]. With respect to each specific Feynman path q, a partial zigzag formed by connecting a consecutive subset of slice coordinates,
with 0≤t1<t2<2T, is called a Feynman segment of q, and the quantum amplitude
is called a partial path integral associated with the Feynman segment q(t2, t1), where
It is well known and recognized that Feynman's method and theory of Euclidean path integrals can be regarded as an isophysical mapping between a quantum system on a configuration space and a classical system on a configuration space
⊗[0, 2T]≅
2T+1, with which a quantum Hamiltonian supported by
is mapped to a classical potential energy function on C2T+1, and a quantum Euclidean theory becomes identical to a classical theory of statistical mechanics, although for a fermionic quantum system, the requirement of exchange antisymmetry, persisting into the classically mapped problem, remains a strong quantum reminder and signature.
A PIMC procedure does importance sampling from such zigzag Feynman paths weighted by their associated quantum amplitudes and produces an approximate fraction to estimate O
. To overcome the sign problem, a constrained PIMC [5, 168, 169, 173, 174] starts with and accepts only positive-definite Feynman paths q=(q2
2T+1 such that ϕ0(q0)>0 and G[q(t, 0)]>0, ∀t∈[1, 2T], or equivalently,
ϕ0(q0)>0 and G[q(t,t−1)]>0,∀t∈[1,2T]. (57)
During a constrained PIMC procedure, when a positive-definite Feynman path q∈2T+1 is being wiggled into q′∈
2T+1, by changing and updating the configuration coordinates for a tuple of time slices {tk, . . . , ti, . . . , t0} from an old tuple of configuration coordinates {qt
both remain positive, ∀i∈[0, k], where the local node-determinacy of the DFF or GFF Hamiltonians {H′l: l∈[0, L]} has been used in equations (58) and (59), which enables efficient local determination of the ground state nodal structures. It is noted that some harmless redundancy may be present when the tuple of time slices {tk, . . . , ti . . . , t0} consists of consecutive time instants, namely, ∃i∈[0, k] such that ti±1=ti or ti−1=ti. Such redundancy is easily identified and avoided. Among the legitimate new Feynman paths q′=(q′2T, . . . , q′T, q′T−1, . . . , q′0)∈2T+1, a Metropolis-Hastings importance sampling strategy can be used to select a transition from q to q′ randomly according to a probability
where it is obvious that many of the multiplying terms in the numerator and denominator of the rightmost fraction are common factors and cancel out, except for those involving a configuration coordinate that is being changed.
The following algorithm is an example of constrained PIMC simulating the ground state of a CD-separately irreducible DFF or GFF Hamiltonian H=HC+HD=H′L supported by a continuous-discrete product configuration space
with H′L terminating an adiabatic sequence of DFF or GFF Hamiltonians {H′l: LF2LF2(
), l∈[0, L]}, L∈
, whose initial Hamiltonian H′0 has a known ground state ψ0(h′0). Let m=O(poly(max{size(H′l): l∈[0, L]})), m∈
be chosen sufficiently large so that each Gibbs operator exp(−
H′l), l∈[0, L] is approximated as in (51) to a predetermined accuracy, and let
denote the ordered sequence of FBM tensor monomials collecting all of the m-fold repeated Lie-Trotter-Kato decompositions of {H′l: l∈[0, L]}, with grounds states
Algorithm 7. (Constrained PIMC Simulation of a DFF or GFF Hamiltonian)
The following lists an alternative method of PIMC for simulating a DFF-FS or GFF-FS system, which does not forbid nodal crossings between two adjacent configuration coordinates of two consecutive time slices, but label each Feynman path q=(q2T, . . . , qT, qT−1, . . . , q0)∈2T+1 by a lab(q)=±1, that counts the total number of nodal crossings incurred by the Feynman path, and is efficiently computed from a
as well as the local nodal structure of the intermediate Hamiltonians {Ht: t∈[0, T], T∈}. It is noted, as been observed by many previous authors [5, 168, 169, 173, 174], that, for fermionic Schrödinger systems, the expectation value of an observable with respect to the ground state can be numerically estimated by discarding the odd Feynman paths labeled by −1 and using only the even Feynman paths labeled by +1, because, for any odd Feynman path, there exists a compensating even Feynman path that is a mirror image of the odd Feynman path, obtained by firstly identifying a time slice with a configuration point q, sitting on a nodal surface, then mirror-reflecting either the Feynman segment from q0 to q* or the Feynman segment from q* to q2T, said mirror reflection being done by applying an odd permutation that exchange identical fermions.
Algorithm 8. (Constrained PIMC Simulation of a DFF or GFF Hamiltonian)
∈2T+1 to qt=r0, ∀t∈[0, 2T]; Label the initial Feynman path by lab(q)=+1; Initialize a time slice index t∈
to 0;
Otherwise, label q′ by
An SFF-FS or SFF-EB partial Hamiltonian H=Σi=1J Hi, J∈, with each partial Hamiltonian Hi, i∈[1, J] being DFF or GFF and FS or EB, can be efficiently simulated via Monte Carlo by iteratively simulating DFF or GFF and FS or EB partial Hamiltonians using one of the Algorithms 1 through 8. It is often the ground state ψ0(H) or the associated probability density |ψ0(H)|2 that is of interest, which can be obtained by running a time-inhomogeneous Markov chain [140, 264, 265] that iterates the ground state projectors {[ψ0(Hi)]exp(−
Hi)[ψ0(Hi)]−1}i∈[1,J]in sequence, as implemented in the following algorithm.
Algorithm 9. (Monte Carlo Simulation of a SFF-FS or SFF-EB Hamiltonian)
Major Utility 1. (A First Major Utility of Monte Carlo Quantum Computing)
With a warm start promised, the computational problem (as specified in Definition 18) of simulating an SFF-FS or SFF-EB Hamiltonian H=Σk=1K Hk is in the class BPP.
Demonstration. According to the analyses above and Algorithm 9, it is only necessary to perform a Θ(poly(size(H))) number of Monte Carlo simulations that sample from ground states of DFF or GFF and FS or EB partial Hamiltonians. Suffice it to prove that any such Monte Carlo simulation for a DFF or GFF and FS or EB partial Hamiltonian can be done with a polynomial-bounded computational complexity.
Firstly, it is without loss of generality to assume that the configuration space is either all-continuous or all-discrete, because any computational problem as specified in Definition 18 with a continuous-discrete product configuration space can be transformed, through a polynomial reduction, into another problem of the same class but based on an all-continuous or all-discrete configuration space. By Auxiliary Utility 2 and Derived Utility 1, suffice it to show that any of the Algorithms 1 through 6 solves in probabilistic polynomial time the equivalent problem of either a Dirichlet boundary-conditioned boltzmannonic system confined in one nodal cell when using a fixed-node diffusion type of approach, or a quantum-stochastic operator similarity-transformed random walk based on equation (48) over the entire configuration space otherwise.
In accordance with the configuration space being discrete or continuous and Hi, i∈[1, J] being FS or EB, a Gibbs operator G□(Hi) is suitably defined, and the unique and polynomially gapped ground state ψ0(G□(Hi)) is sought after. Irrespectively, it always holds true that the associated quasi-stochastic operator QH
c(Hi) ⊆
that are relevant for simulating QH
As will be justified immediately below, it is safe to assume that G□(Hi) is aperiodic, consequently, QH according to the distribution |ψ0(Hi; q)|2. To conclude, the computational problem of simulating an DFF or GFF and FS or EB Hamiltonian Hi as specified in Definition 18 can be solved via MCMC in polynomial time.
In view of the polynomial gappedness of H, it is indeed without loss of generality to assume that exp(−H) be aperiodic and irreducible, therefore,
be primitive, regardless of the configuration space being discrete or continuous, because a Laplace-Beltrami omnidirectional diffusion operator −ϵL, with ϵ=Ω(poly(size(H))), ϵ>0, could always be added to the Hamiltonian H without substantially altering the ground state wavefunction and the energy gap, so to render the Hamiltonian definitely irreducible. More specifically, with a discrete configuration space =Πs=1S
sn
) (|i
j|+|j
i|), where i and j index graph vertices, (i, j) represents a graph edge, and E(
) is the set of all graph edges. With a continuous configuration space, L could be a Laplace-Beltrami operator Δg=|det(g)|−1/2∂i|det(g)|1/2gij∂j associated with a smooth Riemannian metric g on a Riemannian manifold (
, g). The aperiodicity of exp(−H), thus the primitivity of QH, follows immediately from the aperiodicity of the Hamiltonian H. Or even simpler, any Markovian transition matrix QH can be straightforwardly modified into a lazy version ½(I+QH) [138, 140] to ensure aperiodicity.
A warm start is guaranteed for a polynomially gapped, CD-separately irreducible Hamiltonian H=HC+HD supported by a continuous-discrete product configuration space
when two mild conditions are fulfilled: 1) the diagonal elements of H are all polynomially bounded, that is, |q|H|q
|≤poly(size(H)), ∀q∈
; 2) each nodal cell is polynomially path-connected, and the positive HD-distance between any two nodal cells in any nodal groupoid is polynomially bounded. In which case, any location q0 that is at least a distance ϵ>0 away from any nodal point of ψ0(H), with ϵ−1=O(poly(size(H))), is a warm start, because, without loss of generality, again a Laplace-Beltrami omnidirectional diffusion operator −ϵL can be added to the Hamiltonian H without substantially altering the ground state wavefunction and the energy gap, so to establish exp [−O(| log ϵ|+poly(size(H)))Lsup] as a lower bound for |ψ0(H; q)|, ∀q∈
that is a distance of ϵ away from any nodal point of ψ0(H).
Alternatively, it is mathematically less sophisticated and physically more intuitive that, Algorithms 1 through 9 can be used to simulate a time series of SFF-FS or SFF-EB Hamiltonians {H(t)}0≤t≤1, evolving adiabatically from a pre-solved H(0) with a known ψ0(H(0)) to a target Hamiltonian H(1), such that a warm start for H(0) is readily available, and running any Algorithm 1 through 6 for H(t), 0≤t<1 till convergence supplies warm starts for simulating H(t+δt), δt>0, so long as ∥H(t+δt)−H(t)∥ is sufficiently small and H(t), 0≤t<1 is always sufficiently gapped to satisfy conditions of the adiabatic theorem [255-260]. Similar adiabatic techniques have been used in the context of boltzmannonic stoquastic Hamiltonians [156, 243, 244].
Still further, it often happens, and always does in the context of GSQC [19-22], that the sought-after ground state encodes the history of a quantum evolution and has the quantum probability distributed more or less uniformly among a polynomial number of temporal snapshots of the evolution history, in which one or a few temporal snapshot(s) representing the initial condition of the quantum evolution is/are associated with a known distribution in space carrying a polynomially sized portion of the quantum probability, from which a warm start can be picked easily.
One skilled in the art shall have no difficulty to recognize that the presented methods of solving a sign problem in numerical simulations are not limited to solutions of many-body quantum systems. Rather, the essential ideas generalize straightforwardly to applications and problems in other areas of science, technology, and engineering, including but not limited to statistics and optimizations, where a high-dimensional density is involved and/or to be simulated, based on which a linear or quadratic functional or another mathematical form is defined as an expectation value, and a computational task is to derive a good numerical estimate to the expectation value. Familiar examples include the theory and applications of Markov random fields, Gibbs distributions, and Gibbs samplings in probability theory, Bayesian statistics, image processing, and stochastic optimizations [237-240], which have demonstrated and enjoyed tremendous successes in providing incredibly efficient numerical methods and algorithms solving many problems of great practical importance, especially when the underlying density is everywhere non-negative and no sign problem is there to spoil numerical integration.
However, in many other applications, a high-dimensional signed density is involved and a sign problem is there to impose great difficulty. That is where the present methods generalize and help. It is often the case, although not always nor does have to be so, that a Gibbs operator, a transition matrix, or another linear operator exists and is explicitly identified, broadly referred to as a transition density matrix, which has the high-dimensional (signed) density as a stationary density vector (SDV), in the sense that an operation of said transition density matrix on said stationary density vector produces a (signed) density that is substantially the same as the stationary density vector. There may or may not be a Hamiltonian, or an energy operator, or an energy functional explicitly identified as a generator of the transition density matrix. Even the transition density matrix may be implicit. Regardless, the essential ideas of the present methods are to identify and use an SFF property in association with a collection of low-dimensional transition density matrices (LD-TDMs) related to a high-dimensional signed density of interest, where each LD-TDM in turn is associated with a plurality of efficiently computable low-dimensional signed densities as low-dimensional stationary density vectors (LD-SDVs) in the sense that an operation of said LD-TDM on each one of the LD-SDVs associated with it produces a low-dimensional signed density that is substantially the same as the one being operated upon, furthermore, said high-dimensional signed density is one stationary density vector associated with all of the LD-TDMs that are related to it. Once such an SFF property is identified and established, it follows from substantially the same mathematical theory, derivations, and proofs as presented in the above, that the high-dimensional signed density of interest can be efficiently simulated on a classical computer, by way of example but no means of limitation, using an MCMC procedure of Gibbs samplings following Markovian state transition rules related to the LD-TDMs.
Bi-Fermion Rebits
With their BPP solvability established, it is natural to ask about the generality of SFF-FS or SFF-EB systems, whether SFF-FS or SFF-EB Hamiltonians possess sufficient computational power to simulate other quantum systems in general. The answer is yes, as may be greatly surprising to many. To proceed, let's first construct a so-called bi-fermion system as a rebit and devise a universal set of FBM interactions and tensor polynomials, which can be combined to effect a designer Hamiltonian that is either SFF-FS or SFF-EB, or even both SFF-FS and SFF-EB, thus dubbed SFF doubly universal (SFF-DU), which has a unique and polynomially gapped ground state encoding an entire history of quantum state evolution as a result of executing a quantum algorithm. A designer Hamiltonian is so named because it is specially programmed and tailor-made to have a ground state encoding a quantum computation, in particular, the solution to a prescribed computational problem that an accordingly designed quantum algorithm is able to solve.
Let
represent the one-dimensional compact and connected Riemannian Lie group, known as the circle group, which is algebraically a quotient of the additive group of real numbers (, +) modulo the lattice of even integers (2
, +), with the (mod 2) operation producing a real-valued remainder within [−1, 1), and geometrically a Riemannian manifold endowed with a Riemannian metric defining a distance
such that, each point on is indexed by, namely, bijectively mapped to, a number in the interval [−1, 1)⊂
, as will be done throughout this specification, and ∀x, y∈
, d(x, y)=min{|x−y+2n|: n∈
}, with |r| representing the absolute value of r∈
. It is obvious that d(−1, 1)=0, with the consequence of x=±1 being identified to represent the same point on
. This coordinate scheme is very convenient, although it deviates from the standard method of using multiple overlapping charts on open sets. Consider two non-interacting identical spinless fermions moving on
under a three-well potential V(x) as depicted in
where C±>0 are fixed constants, V0>>aV0>>1>>a>0, |v0|<<1, although in the contexts of computations and simulations, V0, aV0, a−1, and |v0|−1 all have to be polynomially bounded in terms of the problem size.
For reasons that may become more convincing below, the potential well in the region {x: d(x, 0)≤a} is referred to as the Pauli well, while the two potential wells at {x: d(x, −½)<½−a} and {x: d(x, ½)<½−a} are called the left and right logic wells respectively. The Pauli well is said to form a π junction, while the potential barrier in the region {x: d(x, 0)≥1−a} is said to form a normal junction, between the two regions of logic wells. In a natural unit system with ℏ=m=1, the single-particle Schrödinger equation reads
−½ψ″(x)+V(x)ψ(x)=Eψ(x), (62)
which is already semi-analytically solvable by matching sinusoidal or hyper-sinusoidal solutions at edges of potential jumps [9, 53, 266], but becomes especially easy to analyze in the asymptotic limit of a→0, V0→∞, while
remains constant, namely, when the potential barrier or well approaches a Dirac delta function γ0δ(x−x0), γ0∈, x0∈
, and the effect of such a Dirac δ potential reduces to a peculiar boundary condition, that is, while the wavefunction remains continuous at x=x0, its derivative undergoes a definitive discontinuity as limϵ→0,ϵ>0[ψ′(x0+ϵ)−ψ′(x0−ϵ)]=2γ0ψ(x0) [266-268].
In the simple case with
the δ potential Pauli well hosts a deeply bound state ψP(x)=γ−1/2e−γ−d(x,0) with energy E0=−γ−2/2, which is tightly localized around the point x=0, while the two logic wells mostly accommodate the first two excited states ψ+(x)=sin k+[d(x, 0)−α+] of even parity, and ψ−(x)=−sin πx of odd parity, corresponding to eigen energies E+=k+2/2 and E−=π2/2 respectively. The ψ− state vanishes at x=0 as well as x=±1, and assumes an eigen energy that is independent of the δ potentials. For the ψ+ state, the peculiar boundary conditions due to the δ potentials determine the parameters as α+=γ−−1+O(γ0−3), k+=π(1+γ−−1−γ+−1)+O(γ0−2), such that E+=π2/2+π2(γ−−1−γ+−1)+O(γ0−2). When γ+=γ−=γ0, the balance between the positive and negative δ potentials leads to energy degeneracy at E+=E−=π2/2. When the δ potentials are off balance, the even and odd states are split by an energy gap E+−E−=π2 (γ−−1−γ+−1)+O (γ0−2). In a unitarily transformed basis consisting of states
with [⋅]Iver being the Iverson bracket, the wave amplitude is mostly localized in the left and right logic wells respectively. The combined effect of the δ potentials may be interpreted as a coupling via tunneling between the ψL and ψR states in conjunction with an overall energy shift, altogether represented by a Hamiltonian Γ+Γ|ψLψR|+Γ|ψR
ψL|, with the coupling strength
whereas a relative potential shift 2v0≠0 between the left and right logic wells can be considered simply as to offset the eigen energies of the ψL and ψR states. By the same token, in the {ψ+, ψ−} basis, the combined effect of the δ potentials may be understood as to offset the eigen energies of the ψ+ and ψ− states, whereas a relative potential shift 2v0≠0 between the left and right logic wells can be regarded as a coupling between the ψ+ and ψ− states. It is fairly clear that, when 0≠|γ+−γ−|<<γ0 and v0=0, the second and third eigen states remain largely between ψ+ and ψ−, with a small deviation in the wavefunctions that is O(|γ+−γ−|/γ02), although the state degeneracy will be lifted with an energy gap that is Θ(|γ+−γ−|/γ0). Similarly, it can be easily verified that, when γ+=γ−=γ0 and 0≠|v0|<<1<<γ0, the second and third eigen states remain largely between ψL and ψR, with a small deviation in the wavefunctions that is O(|v0|/γ0), although the state degeneracy will be lifted with an energy gap that is Θ(|v0|).
With
define a closed circular are
and its open complement
corresponding to the regions that are nearby and far away from the Pauli well respectively. Consider a probability of collision P(ψ±, ψp; α−) between the ψ± and the ψp orbits with respect to
α−, which is defined and bounded as
Also, define a closed semicircle
and its open complement
the correspond to the left and the right logic wells respectively. Consider the overlap integral between ψ+ and ψ− within each of the two semicircles,
2ψ+(x)ψ−(x)dx=−2
ψ+(x)ψ−(x)dx=cos πα+=1−O(γ0−2). (66)
By choosing a sufficiently large value for γ0, the probability of collision P(P±, ψP; α−) can be made O(γ0−3 log3 γ0) infinitesimal, while the overlap integral between ψ+ and ψ− within each of the two logic wells can be made O(γ0−2) close to unity or perfection. Finally, define a closed semicircle
and its open complement
that enclose the Pauli well and the potential barrier respectively. It is interesting and potentially useful to note that, the two eigenstates ψ+ and ψ− have an appreciable difference in their cumulative probability distributions between the two semicircles 0 and
1. While ψ− is exactly equidistributed between the two semicircles, that is,
ψ−(x)|2dx=
|ψ−(x)|2dx=½, the ψ+ state is more concentrated in
1 than in
0, and specifically,
It is most worth noting that the disparity between the cumulative probabilities |ψ+(x)|2dx and
|ψ+(x)|2dx scales asymptotically as Θ(γ0−1) when γ0>>1, as opposed to the O(γ0−3 log3γ0) probability of collision between the ψ± and ψP orbits, and the O(γ0−2) deviation from perfect overlapping between ψ+ and ψ+ within each of the logic wells.
To employ the space spanned by {ψ+, γ−} or {ψL, ψR} for GSQC, two identical fermions are employed to invoke the Pauli exclusion principle, such that one fermion fills and blocks the deeply bound ψP state, while the other particle lives in the two-dimensional space span{ψ+, ψ−} to implement a rebit. The configuration is an example of Pauli blockade. Such construct of two identical fermions moving in a three-well potential on a circle is called a bi-fermion, whose two lowest-lying energy states form one rebit's worth of computational space, in which universal GSQC can be realized by designer Hamiltonians with -diagonal potential perturbations, where
for a single bi-fermion. More specifically, a bi-fermion has the nominal Hamiltonian
with two exchange antisymmetric states
that are degenerate at the lowest (zero-valued) energy and span a rebit manifold. All excited states lie above with an energy gap at least Ω(1). The unitarily transformed states
have one fermion trapped by the Pauli well and tightly localized around x=0, while the other fermion mostly localized in either the left or the right logic well, although the spatial localization is not quite exact. Images of the bi-fermion wavefunctions Φ+(x1, x2), Φ− (x1, x2), ΦL(x1, x2), and ΦR(x1, x2) are shown in +(Φi(x1, x2)) and one negative nodal cell
−(Φi(x1, x2)), i∈{+, −, L, R}, as dictated by Derived Utility 2 of Auxiliary Utility 3.
For the most convenient implementations of logic operations among multiple rebits, it may be preferable to work with basis states that are strictly localized in the logic wells. Again set α=2γ0−1 log γ0, define three well-localized single-particle states
with [⋅]Iver being the Iverson bracket, such that
where the π(2γ0)−1 phase shift in the wavefunction sin π[d(x, 0)−(2γ0)−1] is introduced deliberately to make equation (77) true. Then designate the bi-fermion states
which can be equivalently written in the second-quantized representation as
|↓=aL+aP+|vac
, (80)
|↑=aR+aP+|vac
, (81)
where |vac denotes the vacuum state, aP+, aL+, and aR+, are fermion creation operators for the |P
, |L
, and |R
singe-particle orbits respectively. At times, it is also convenient to use the unitarily transformed basis with states
To make definitive and unambiguous references in the following, the states |↓ and |↑
or |+
and |−
are called the effective computational basis states, and span{|↓
, |↑
}=span{|+
, |−
} is called the computational bi-fermion space. By contrast, the states ΦL and ΦR or Φ+ and Φ− are called the physical bi-fermion basis states, and span{ΦL, ΦR}=span{Φ+, Φ−} is called the physical bi-fermion space. The two spaces and their corresponding basis states are nearly the same, as the overlaps
↓|ΦL
,
↑|ΦR
,
+|Φ+
, and
−|Φ−
are all valued within O(γ0−2) to 1.
Interestingly, just for a little digression, it is noted that in the special case of a balanced three-well potential with v0=0 and γ+=γ−=γ0>>1 on the circle group, apart from the deeply bound state ψP(x)=γ01/2e−γ, n<<γ0 are two-fold degenerate, where the two degenerate states per each energy level En, n≥1 preferably chosen to have a definitive parity, either even or odd, namely, ψn+=sin nπ[d(x, 0)−γ0−1+O(γ0−3)], or γn−=−sin nπx. For all n∈
, n<<γ0, x=0 and x=±1 are the nodal points for the odd-parity states {ψn+}n, while x=±γ0−1 are approximately the nodal points for the even-parity states {ψn+}n, with an extremely small error O(γ0−3) when γ0>>1. A bi-fermion, with two non-interacting identical spinless fermions moving in the balanced three-well potential on the circle group, will have one fermion filling the ψP state and another fermion occupying one of the {ψn±}n orbits, so that the low-lying energy levels E0+En, n∈
, n<<γ0 are all two-fold degenerate, with even-parity and odd-parity eigenstates characterized respectively by the antisymmetric two-fermion wavefunctions
n∈, where all of the even-parity states {Ψn+}n have nodal lines defined by x1−x2=0 and x1+x2=0, x1, x2∈
, as illustrated in
, and another nodal curve that, when γ0>>1, is well approximated by {x1=±1}∪{x2=±1}, as illustrated in
2 is divided by the two straight lines x1−x2=0 and x1+x2=0, x1, x2∈(−1, 1) into four isosceles right triangles, the interiors of which are called the E (east), N (north), W (west), and S (south) quarters respectively. Then the even-parity state Ψ1+ has the E and W quarters as well as the line boundary {x1=±1} between them fused into a connected open set
EW as one nodal cell on
2, and the S and N quarters as well as the line boundary {x2=±1} between them fused into a connected open set
SN as the other nodal cell on
2, with the wavefunction vanishing almost everywhere on the line x1+x2=0 and absolutely everywhere on the line x1−x2=0, x1, x2∈
, while the odd-parity state Ψ1− has the N and W quarters as well as the line boundary {x1+x2=0, x1<0} between them fused into a connected open set
NW as one nodal cell on
2, and the S and E quarters as well as the line boundary {x1±x2=0, x1>0} between them fused into a connected open set
SE as the other nodal cell on
2, with the wavefunction vanishing almost everywhere on the lines {x1=±1}∪{x2=±1} and absolutely everywhere on the line x1−x2=0, x1, x2∈
. Furthermore, all of the even-parity states {Ψn+}n<<γ
EW and HBF,0|
SN, and by exactly the same token, all of the odd-parity states {Ψn−}n<<γ
NW and HBF,0|
SE. In other words, under a balanced three-well potential, the nodal cells
EW and
NW (or
SN and
SE) are isospectral manifolds, thus, one cannot distinguish between the two differently shaped bi-fermion “drums” HBF,0|
EW and HBF,0|
NW (or HBF,0|
SN and HBF,0|
SE), by just “hearing” the spectra of their vibrations [269-272].
Now come back to our main course of discussions regarding the use of bi-fermions for GSQC. The nominal Hamiltonian HBF,0 ensures that a bi-fermion at its lowest energy stays in the physical bi-fermion space, which is substantially the same as the computational bi-fermion space, when both parameters γ0 and α−1=γ0/(2 log γ0) are large yet polynomially bounded. Although containment of the ground state(s) in span{|↓, |↑
} is not absolute, the rate of leakage error as measured by the trace of a so-called leakage error operator
can be made arbitrarily small as Tr(Eleak(γ0))=O(α3)=O(γ0−3 log3 γ0), at the price of proportionally increased peak-to-valley swing of the three-well potential V(x), which may entail higher costs to simulate, either quantum computationally or by means of classical Monte Carlo. Furthermore, there are quantum error suppression techniques known as subsystem, operator, or Hamiltonian encoding [273-276], that map intended Hamiltonians into self-adjoint operators acting on an encoded qubit space, and introduce energy penalty terms to suppress one-local, or k-local with a fixed small k, error transitions affecting the raw qubits before the error suppression encoding. Generally, quantum gate operations on the bi-fermion rebits are not always perfect in terms of the effective computational basis states, but subject to small errors, even in theory. The error rate should and can be made so sufficiently low that there is an appreciable probability, polynomially bounded from below, that a required long sequence of ground state quantum gates on the bi-fermions could be carried through error-free. Also, the well-developed theory and techniques of quantum error correction [277-279] can be employed to counter the detrimental effects of errors, and the celebrated quantum threshold theorem promises that, with concatenated fault-tolerant encoding for quantum error correction, an arbitrarily sized quantum circuit can be realized using error-prone quantum gates with at most polylogarithmic overheads, so long as the rate of error for each error-prone quantum gate is below a certain threshold [280-284].
It may be noted that the nominal Hamiltonian HBF,0 of bi-fermions can be rendered doubly universal, and incorporated into an SFF-DU Hamiltonian for GSQC, which is amenable to probabilistic simulations using either of the two corresponding families of Monte Carlo algorithms, because HBF,0 becomes essentially bounded in a practical implementation, since the Dirac δ potentials in equation (68) are really idealizations that are meant to represent the effects of a bounded potential such as V(⋅) of equation (61), where the parameters V0>0 and a−1>0 are constants, whose values are large but still polynomially bounded. Besides, although it is necessary to keep the Pauli well narrow and deep, approximating a fairly strong Dirac delta trap so to minimize the leakage error of a bi-fermion rebit, there is no practical necessity other than convenience of mathematical analysis, to install literally a steep potential barrier around x=±1 on the circle group of a bi-fermion. Rather, it is perfectly fine to place a relatively wide and low potential barrier, as long as it has the width and height chosen properly to be commensurate with the Delta-like potential well around x=0, such that the nominal Hamiltonian of the bi-fermion system defines a degenerate two-state Hilbert space approximating an effective computational basis. Along another line of reasoning that follows literally the definition of essential boundedness, the quantum physics of any bi-fermion can be described to within any desired and predetermined accuracy by selecting a finite number of continuous basis functions to span a Hilbert subspace, and projecting or restricting all relevant operators to the Hilbert subspace, such that all projected or restricted operators, including Dirac δ potentials, become bounded. In any case, it is concluded that even Hamiltonians containing Dirac δ potentials can be made essentially bounded, a factoid that may be invoked repeatedly in the following without reiterating the justifications.
On top of the nominal Hamiltonian HBF,0, the following -diagonal potential,
with [⋅]Iver being the Iverson bracket, can be added to realize the Pauli ±σx operator, in that
where η is a constant in (−1, 1). It is quite obvious that a nonzero value of v0 in equation (61) induces ±σz interactions. In general, an arbitrary rebit operator σx sin θ+σz cos θ, θ∈[−π, π) can be realized by adding a -diagonal potential VX,η sin θ(x1, x2)+VZ,η cos θ(x1, x2) to the nominal Hamiltonian HBF,0, with
in the sense that
That is so, because, projected onto the two-dimensional manifold spanned by the nearly degenerate two states of concern, the single-particle Hamiltonian
is approximated as
in the basis of single-particle states {ψL, ψR}, or equivalently
in the basis of single-particle states {ψ+, ψ−}. Therefore, universal single-rebit ground state logic can be realized by a bi-fermion restricted to the two-state manifold of lowest energy, with the fermionic Schrödinger Hamiltonian HBF,0+VX,η sin θ+VZ,η cos θ, θ∈[−π, π), which is effectively
in the first-quantized, second-quantized, and rebit representations respectively.
In contour plots, with dashed lines labeled by the numbers 0.1, 0.3, 0.7 indicating where a wavefunction is positive-valued at levels equal to 0.1, 0.3, 0.7 respectively, while dark solid lines labeled by the numbers −0.1, −0.3, −0.7 signifying where a wavefunction is negative-valued at levels equal to −0.1, −0.3, −0.7 respectively, and gray solid lines labeled by the number 0.0 indicating nodal surfaces, 2, the closures of the S and N quarters fused into the closure of another nodal cell on
2, where the two nodal cells are divided by the nodal line x1−x2=0 and another nodal curve that is closely aligned with the line x1+x2=0, x1, x2,∈
, while a peri-odd ground state has, approximately, the closures of the N and W quarters of the open square {(x1, x2): x1, x2∈(−1, 1)} fused into the closure of one nodal cell on
2, the closures of the S and E quarters fused into the closure of another nodal cell on
2, where the two nodal cells are divided by the nodal line x1−x2=0 and another nodal curve that is closely aligned with {x1=±1}∪{x2=±1}, x1, x2, ∈
. In practical implementations and simulations, when a large γ0 is chosen, the nodal curves of the peri-even and peri-odd states can be well approximated by the lines x1+x2=0 and {x1=±1}∪{x2=±1} respectively, in addition to the always exact nodal line x1−x2=0, x1, x2,∈
.
A moment of reflection on ˜|ΨP
, the motion of the other fermion in the three-well potential is isophysical to a quantum computational rebit, whose wavefunction is restricted to be real-valued, but can be signed freely, either positive or negative. Still though, the system of the two fermions as a whole, that is the bi-fermion, can be simulated probabilistically on a classical computer, using MCMC in conjunction with either the fixed-node diffusion method or the quantum-stochastic operator similarity-transformation based on equation (48), where a random walk over a classical probability space, necessarily with a non-negative valued measure, gleans a sufficient statistic of the bi-fermion system in its quantum-mechanical entirety, involving both positive and negative valued quantum amplitudes. In a sense, if the ability of a quantum machine to represent and manipulate “negative probability” is considered a certain computational resource, then a classical probabilistic machine apparently lacks such a computational resource. Nevertheless, the deficiency can be made up for by using an exchange of two identical fermions to represent the sign flip of probability amplitude, which turns the so-called fermion sign problem on its head, in that, instead of being dreaded as a curse that brings trouble to quantum Monte Carlo simulations, the exchange antisymmetry of fermions is embraced as a blessing that endows a classical probabilistic simulator of bi-fermions the ability to represent and manipulate “negative probability”.
More specifically, let
θ∈[−π, π) represent an “R” gate on a bi-fermion rebit. Let
denote the affine-transformed operators, which preserve the two eigen states of R(θ), and send one of them to the ground at zero energy. As will be discussed shortly below, applications of such “R” gates on a specific bi-fermion rebit are often controlled by the state of other rebits, however the quantum physics and Monte Carlo dynamics of the concerned bi-fermion rebit remain essentially the same for the same R-gate, although exactly which R-gate is applied shall depend upon the state of the controlling rebits. With R(θ) being an involution, or a self-inverse operator, R±(θ) are orthogonal projections, or self-adjoint idempotent operators, and the associated Gibbs operators have a rather simple formula,
thus limτ→+∞G[τR∓(θ)]=R∓(θ). The two eigenstates of R(θ), corresponding to the eigenvalues +1 and −1 respectively, and being orthogonal necessarily, read
Φ+(θ)=cos(θ/2)|↓+sin(θ/2)|↑), (93)
Φ−(θ)=cos(θ/2)|↑+sin(θ/2)|↓), (94)
in the computational bi-fermion representation, which are equivalently
Φ+(θ)=[cos(θ/2)|L1+sin(θ/2)|R1
]|P2
−[cos(θ/2)|L2
+sin(θ/2)|R2
]|P1), (95)
Φ−(θ)=[cos(θ/2)|R1−sin(θ/2)|L1
]|P2
−[cos(θ/2)|R2
−sin(θ/2)|L2
]|P1), (96)
in terms of the effective computational basis states in an explicitly exchange-antisymmetric form. It holds true in any representation that
It does not need more than a note in passing that any Hamiltonian R±(θ), θ∈[−π, π) is trivially ϵ-almost node-determinate, with ϵ=O(γ0−2)>0. When Monte Carlo-simulated on a classical computer, the bi-fermion ground state Φ±(θ; x1, x2) of R∓(θ) corresponds to a probability distribution |Φ±(θ; x1, x2)|2, and the quantum statistical mechanics of the bi-fermion under the Gibbs operator G[τR±(θ)], ∀τ>0 maps to a classical random walk with a stochastic operator
on any fixed nodal cell (R±(θ); x″1, x″2) enclosing an (x″1, x″2)∈
2 such that R±(θ; x″1, x″2)≠0, where the operators R∓(θ)=|Φ∓+(θ)
Φ∓(θ) and G[τR±(θ)], as well as MτR
2-coordinate representation. Furthermore, in the limit of τ→+∞, the operator MτR
2\ψ0−1(R∓(θ)).
More formally, a quantum bi-fermion rebit/system lives in the two-dimensional quantum state space
where E.S.T. stands for the exchange-symmetric terms including (−c0)|L2|P1
and (−c1)|R2
|P1
, which are obtained by applying an operator π12 to the wavefunctions c0|L2
|P1
and c1|R2
|P1
respectively, with π12 denoting the well-known fermionic exchange operator that swaps labels for the two identical fermions and induces a negative sign. Any normalized wavefunction ϕ∈
can be made into the ground state of a Hamiltonian
with ϕ⊥∈being normalized and
ϕ|ϕ⊥
=0,
denoting the Banach algebra of operators acting on the Hilbert space
, over the field of real numbers. The quantum bi-fermion rebit can be mapped into a rectified bi-fermion rebit/system on an ostensibly four-dimensional but actually two-dimensional rectified state space
through a one-to-one correspondence : (
))
)), with
being called the rectified bi-fermion configuration space, as opposed to the full bi-fermion configuration space 2, such that
(
2)=
*2⊆
2, and
for the wavefunctions, ∀r∈, ∀θ∈[−π, π] and
for the orthogonal projection-valued Hamiltonians, θ∈[−π, π] where
and ΠR as defined in equation (102) is a one-to-one and self-inverse mapping between and
, which is actually an isomorphism between two Hilbert spaces, as will be shown rigorously below. It is noted that a useful convention, henceforth referred to as the logic-0-positive convention or representation, is adopted in equations (99) and (100), which imposes the condition that the coefficient c0 should be non-negative, without inducing any loss of generality, since a global phase factor (a +/−sign, to be exact) in or out of the wavefunction of a rebit does not change any physics.
Essentially, the bijective mapping embodies a method of fermion sign rectification using isomorphic group representations z and m, where
:({+1,−1},*)
({I,π12},*), such that
(+1)=I,
(−1)=π12, (106)
is an isomorphism between the multiplicative group ({+1, −1}, *) of numerical signs and the multiplicative group ({I, π12}, *) of linear operators, while
is an isomorphism between the multiplicative group of real numbers and the direct product group (>0, *)×({I, π12}, *), where
and ∀(|r|, π12(r<0))∈(′≠0, *), the equation (|r|, π12(r<0))=½(|r|+r) I+½(|r|−r)π12 makes its sense with the group elements {|r|, π12(r<0))
interpreted as linear operators acting on the rectified state space
. (
≠0, *) is of course the multiplicative structure in the field of real numbers (
, +, *). With zero elements added, and the domain of the mapping m extended to
such that
with
then the product set ≥0×{I, π12} can be made into an abelian additive group with a “+” operation defined as
so that the structure (′, +, *) becomes an ordered field, henceforth referred to as the field of rectified numbers, that is isomorphic to the conventional field of real numbers (
, +, *), with an element r′∈(
′, +, *) being positive, denoted as r′>0′ or r′∈
′>0′, if and only if m−1(r′)∈
>0. An alternative construction of the field (
′, +, *) starts with the direct product group
which is overcomplete to serve as a group of multiplicative coefficients for wavefunctions of bi-fermions, but the unwanted redundancy can be removed by imposing an equivalence relation such that (r, π12)
(−r, I), ∀r∈
≠0, which then effects a group endomorphism
such that ƒπ(r, π12s)=(|r|, π12r(−1), ∀s∈{0, 1}. The resulted quotient group (
, *)/
is isomorphic to (
′≠0, *). Then, a zero element 0′ and an additive group structure can be introduced to the direct product set
s∈{0, 1}} in a manner similar to equation (108), with the equivalence relation extended naturally such that (r, I)+(−r, I)
(r, I)+(r, π12)
0′, ∀r∈
. It becomes clear now that the quotient set
/
forms an ordered field (
/
, +, *), which is isomorphic to the field (
′, +, *). Viewing the field (
′, +, *) as (
/
, +, *) affords an advantageous perspective and the algebraic convenience for transforming mathematical formulas involving bi-fermion wavefunctions and operators, by freely inserting an identity factor (−1)π12 anywhere in a multiplicative expression, or a zero term 1+π12 anywhere in an additive expression. In particular, a stoquastic operator on a Hilbert space over the field of rectified numbers (
′, +, *) can be deliberately written as D−A, instead of D+π12A, to signify the stoquastic property and the consequence that exp[−
(D−A)] is non-negative ∀τ>0, when the operator D is represented by a diagonal matrix with elements in
, while the operator A is represented by a non-negative matrix with all elements in
≥0. It is also noted that the definition of the rectified state space
in (100) has removed a two-fold redundancy in the single bi-fermion configuration space and wave distribution, by omitting the negative nodal cell and the negative-valued wave distribution in it, which are always isomorphic to the retained positive nodal cell and positive wave distribution therein.
The quantum state space is clearly a Hilbert space over the field (
, +, *), with the conventionally defined inner product
∈, ai, bi∈
, i∈{1, 2}. The quantum physics of a rebit is also concerned with the vector space
of bounded linear operators on the Hilbert space
, which forms a Banach algebra under the operator norm. With a chosen basis of
, such as {|L1
|P2
−|L2
|P1
, |R1
|P2
−|R2
|P1
}, each element in
is represented by a 2×2 matrix with
-valued entries. Thanks to the one-to-one correspondence
and the bijection m, the rectified state space
can also be made into a vector space, over the field (
′, +, *), with the required vector addition and scalar multiplication operations defined as
then straightforwardly made into a Hilbert space with an inner product defined as
∀ψ′i=(ai, I)|L1|P2
+(bi, π12s
|P2
∈
, (ai, I)∈
′, (bi, π12s
′, si∈{0,1}, i∈{1, 2}. In the Hilbert space
, the value of any wavefunction at any location in the bi-fermion configuration space involves only non-negative numbers in
. Still, two non-zero vectors ψ′1, ψ′2∈
, can be orthogonal and produce a 0′-valued inner product
ψ′1|ψ′2
=(0, I)=(0, π12)∈(
′, +,*). The vector space
of bounded linear operators over the Hilbert space
forms a Banach algebra under the operator norm, which is isomorphic to
and describes the equivalent physics. With a chosen basis of
, such as {|L1
|P2
, |R1
|P2
}, each element in
is represented by a 2×2 matrix with
′-valued entries that involve only non-negative numbers in
and possibly the operator π12 exchanging identical fermions. It is interesting to note that both
and
can be considered as subsets of a larger Hilbert space
over the field
, where
being equipped with the conventionally defined inner product, while both and
can be regarded as subsets of
and
are just the results of
and
modulo the antisymmetric condition upon exchange of identical fermions. For the rectified state space, any wavefunction
is in the conical hull of
denoted by coni(), where all of the basis wavefunctions in
are non-negative-valued in the bi-fermion configuration space, the union of whose supports is the rectified bi-fermion configuration space
*2. Therefore,
is inside a positive convex cone consisting of non-negative definite wavefunctions in
. By the same token, any dyadic operator in
that is of the form Σij|ψ′i
ϕ′j|, ψ′i, ϕ′i∈
, i, j∈
can be interpreted as an element in the positive convex cone coni(
)⊂
, with
Such embeddings of into
and
into
coincide with the embeddings of
into
and
into
as far as the positive parts of the wavefunctions are concerned, thus enabling fermion sign rectification, which is to do quantum computing and Monte Carlo simulations without ever encountering a negative number in
, through the use of rectified bi-fermions in conjunction with the fixed-node diffusion method or the quantum-stochastic operator similarity transformation as in Auxiliary Utility 2.
Clearly, the mapping : (
)
(
) amounts to an isophysics, through which the rectified bi-fermion physics (
) constitutes a representation of the quantum bi-fermion physics (
)), where any observable H∈
other than the identity operator, and the corresponding H′=
(H)∈
may be designated as the Hamiltonians for the respective physics. Since the states in
are all non-negative distributions, any ground state physics of (
) can be isophysically mapped into a classical random walk on the set union of the supports of all wavefunctions in the state space
, governed by a stochastic operator of the form [ψ0(H′)]G′(
H′)[ψ0(H′)]−1, with
>0, H′=
(H),
as having been discussed extensively in the previous section leading to Major Utility 1, as well about equation (98) a few paragraphs above. It is reminded that for any Hamiltonian H′, ψ0 (H′) denotes the ground space of H′, which consists of one unique ground state wavefunction for the present case of a single bi-fermion system. For any H′∈ with ψ0(H′)∈
, it is obvious that one connected component of the support of ψ0 (H′) is exactly the positive nodal cell of the ground state
−1(ψ0(H′)) of the quantum bi-fermion Hamiltonian
−1(H′). Importantly, for a set
that contains orthogonal projection-valued operators of the form H=R±(θ), θ∈[−π, π), which are complete for ground state quantum computations, each of the associated ground states always has one unique positive nodal cell that is topologically connected, such that the Markovian random walks following {[ψ0(H′)]G′(
H′)[ψ0(H′)]−1: H′∈
,
>0} are always ergodic and rapidly mixing. Therefore, the isophysics
: (
)
(
) and Derived Utility 1 of Auxiliary Utility 3 agree, as they must do, on asserting rigorously that the fermionic ground state ψ0(H) of a quantum bi-fermion system is uniquely determined by the boltzmannonic ground state
[ψ0(H)] of the corresponding rectified bi-fermion system on the positive nodal cell of ψ0(H), where the two ground states are both supported and become identical.
It is important to note that a continuous substrate space is not indispensable for devising useful bi-fermions. Rather, a discrete version of the bi-fermion system can be quite simply realized by placing two identical non-interacting fermions on a cyclic grid of 2n points
n≥2, n∈, where x=±n are identified as labeling the same lattice point, with single-particle tunneling/coupling between neighbor sites that is translation-invariant in strength, and an on-site single-particle potential Vx that consists of a deep potential well at x=0 and a steep potential barrier at x=±n, so that the nominal bi-fermion Hamiltonian reads
where γ0>0, γ0>>1, and C0∈R is a constant suitably chosen such that λ0(HBF,0lattice)=0, and
denotes the Kronecker delta function for (x, y)∈2. The nominal bi-fermion Hamiltonian HBF,0lattice has two degenerate ground states ΦL∈L2(
2n2) and ΦR∈L2(
2n2) as the discrete counterparts of the wavefunctions as defined in equations (69-71), which span a two-dimensional Hilbert space as well. Such a bi-fermion made of two identical fermions moving in a discrete substrate space like
2n is referred to as a discrete bi-fermion, as opposed to the continuous bi-fermion being discussed extensively above, which is supported by a continuous substrate space like
. In perfect parallelism to the case of a continuous bi-fermion, the restricted states
with [⋅]Iver being the Iverson bracket, are called the effective computational basis states spanning a computational bi-fermion space span({|↓, |↑
}), which are more convenient and useful for GSQC, although such a computational bi-fermion space is not absolutely complete, manifested in that the leakage error operator
does not vanish identically. It can be easily verified that the leakage error can be made as small as Tr(Eleak(γ0))=O(γ0−3 log3 γ0) in the limit of n→∞, or Tr(Eleak(γ0))=O(γ0−2) in the limit of n→2. Next, perturbative on-site single-particle potentials can be introduced, so that the fermionic Schrödinger operator
effects a computational rebit operator (η/γ0) (σx sin θ+σz cos θ), η∈(−1, 1), θ∈[−π, π). Said operator is simultaneously fermionic Schrödinger and essentially bounded, thus doubly universal.
For yet another example of a discrete bi-fermion, two identical and non-interacting fermions move in a three-point lattice
with on-site energies γ0+cos θ, γ0−cos θ, −γ0 at x=0, 1, 2 respectively, γ0>0, θ∈[−π, π), where a particle can possibly hop between the two sites 0 and 1, but never transfer between 2 and the other sites, such that a bi-fermion is formed with a Hamiltonian
in the second-quantized representation, where ak and ak+ are the fermionic annihilation and creation operators respectively, ∀k∈{0, 1, 2}, such that akal±alak=0, ak+al++al+ak+=0, akal++al+ak=δkl, ∀k, l∈{0, 1, 2}. The Hamiltonian HBFtripod effects a computational rebit operator σx sin θ+σz cos θ over a computational bi-fermion space spanned by the states |↓=a0+a2+|vac
and I|↓
=a1+a2+|vac
, which is identical to the physical bi-fermion space spanned by the lowest two rigorous eigenstates of HBFtripod. In the first-quantized picture, with the system resting at its ground state(s), it is obvious that the low-energy site 2, which is named the first orbit, must always be filled by one fermion, that may be arbitrarily labeled as the first, while the second fermion can only occupy possibly a superposition between sites 0 and 1, that is named a second orbit. When the two fermions have their positions exchanged, the wavefunction of the whole bi-fermion system acquires a negative sign. The sole purpose of the first orbit at site 2 hosting one of the identical fermions is just sign-tracking via the exchange antisymmetry. Although not always fermionic Schrödinger, the Hamiltonian HBFtripod is surely essentially bounded, suitable for GSQC through an SFF-EB Hamiltonian. On the other hand, with such a discrete bi-fermion supported by a three-point lattice
3, henceforth referred to as the
3-supported leakage-free bi-fermion, there is absolutely no rebit leakage error to worry about, as the operator I−|↓
↓|−|↓
↓| strictly vanishes over the physical bi-fermion space, thus, Tr(Eleak(γ0))=0 hold true, ∀γ0=0. Consequently, there is no need to set γ0 to a substantially large value. A choice of γ0=O(1) suffices. These may be considered as compensations for sacrificing the fermionic Schrödinger property of HBFtripod.
From this point on, unless explicitly stated otherwise, a bi-fermion without a modifier shall refer to a general bi-fermion construct that can be wither continuous or discrete, which is made of two identical fermions moving on a general substrate space that can be either continuous as
, or discrete as
n≥2, or another variant of the substrate space. The two states with the lowest energy of such general bi-fermion, denoted as |↓∈LF2 (
2) and |↑
∈LF2 (
2), that may be suitably restricted in the configuration space
2 if useful, necessary, and desired, constitute a Hilbert space
called the computational bi-fermion space. The triple (), with
being a Banach algebra of bounded linear operators over the Hilbert space
, forms a quantum physics/system associated with a single bi-fermion, which implements a computational rebit in the sense of homophysics. It is also straightforward to generalize the discussions between equations (100) and (105) to construct a rectified bi-fermion rebit with a quantum physics/system (
), with
being called the rectified configuration space, +(ϕ) denoting the positive nodal cell of a wavefunction ϕ∈LF2(
2), π12 being the familiar two-fermion exchange operator,
, being the Hilbert space spanned by the states |↓
, |↑
, and (π12|↑
)
over the field of rectified numbers (
, +, *), and
being a Banach algebra of bounded linear operators over the Hilbert space
, with respect to the field of rectified numbers (
, +, *) of course.
Even though a general substrate space ∈{
3,
2n, T} and the quantum physics (
)) or (
)) have already covered a rather broad range of preferred embodiments of bi-fermions, they should be understood by way of examples and by no way of limitations. Indeed, there should be no difficulty for one skilled in the art to devise a construct similar to the bi-fermions that are exemplified in the above, by choosing another variant of the substrate space, or a different number of a preferred species of identical fermions, or subjecting the identical fermions to a differently arranged potential or different ways of interactions. In particular, it can be stressed that the dimension of the substrate space to host identical fermions that constitute a bi-fermion or any similar construct worthy of a quantum computational qubit or rebit, does not have to be 1, and the number of identical fermions comprising each bi-fermion or similar construct can exceed 2, so that there can be 2 or more fermions filling 2 or more lower energy orbits and another fermion occupying and roaming between two nearly degenerate states to encode a qubit/rebit. Still further, a bi-fermion or similar construct can span a d-dimensional Hilbert space, d≥3 to constitute a qudit or relit (that is a qudit with a d-dimensional Hilbert space over the field of real numbers).
Also, there can be inter-particle interactions among the identical fermions constituting a single qubit/rebit, and no difficulty in homophysical implementations and Monte Carlo simulations is entailed by such inter-particle interactions, so long as they are described by diagonal self-adjoint operators in the many-particle configuration space representation. On the other hand, it is important to employ a substrate space that is either at least two-dimensional, or in the one-dimensional case, necessarily containing a subspace that is not simply connected and homeomorphic to the circle group or a cycle graph, regardless of the substrate space being continuous as a manifold or discrete as a topological graph. And, it is essential to have at least two identical fermions moving on such a multi-connected substrate space, if the Hamiltonian has to be of the conventional Schrödinger type. Because, as discussed in the previous section, and it is without loss of generality to take the case of a continuous configuration space for example, the ground state of a single particle under a fermionic Schrödinger, thus equivalent to boltzmannonic stoquastic, Hamiltonian
is always non-degenerate and positive, when the arbitrarily dimensioned Riemannian manifold (, g) is compact and smooth, while the potential V(⋅) is Kato-decomposable and Klauder regular. In another situation, according to a beautiful theorem of Lieb and Mattis [162], a simply connected one-dimensional manifold can never host a degenerate ground state of any system of a number of particles with a base boltzmannonic Hamiltonian of the form Base(HB)=−Σigii−1/2∂i gii3/2∂i+V({xi}i), where ∀i∈
, xi denotes the position of the i-th labeled particle on the one-dimensional manifold, V(⋅) is Klaude regular,
gii(xi)∈>0, ∀xi, regardless of such particles being all identical, all distinguishable, or a mixture of multiple species, and whether or not there are inter-particle interactions. By contrast, it is cer-tainly possible for an at least two-dimensional substrate space in conjunction with a not necessarily Klauder regular potential to accommodate a degenerate ground state, as the previously discussed bi-fermion physics can be equally realized by two fermions confined to within an annular potential well on a two-dimensional plane, where two infinitely high potential walls are narrowly separated to define an annulus with such a small width that the particle motion in the radial direction is effectively frozen to the fundamental mode of lowest energy.
It is also useful to note that an experimental construction or numerical simulation of a bi-fermion-like computational qubit or rebit, being exploited for fermion sign rectification, does not have to involve an actual species of physical fermions as elementary particles or their compositions found in nature. What is truly essential and of only importance is the use of at least two identical and indistinguishable quantum atomic entities that can be artificially labeled by natural numbers in N, where each said quantum atomic entity has at least two different states and is associated with a pair of annihilation and creation operators ak and ak+, k∈ that support a Lie algebra as specified by akal+alak=0, ak+al++al+ak+=0, akal++al+ak=δkl, ∀k, l∈
, such that, the exchange of two identical said quantum atomic entities amounts to an equivalent algebraic representation of the numerical negative sign.
No matter what variants of bi-fermions or similar constructs may be employed to serve as basic units for carrying and processing quantum information, it is straightforward by following virtually the same exact recipes to construct a system of such bi-fermions or similar constructs that either is able to simulate homophysically any quantum physics of interest or has a GSQC Hamiltonian whose ground state can be designed to encode the solution of any prescribed BQP computational problem.
In the latter case, the GSQC Hamiltonian can be so judiciously designed by translating the notions of SFF Hamiltonians and their property of local node-determinacy to said system of bi-fermions or similar constructs, such that the GSQC Hamiltonian can be as well simulated efficiently using MCMC. The only thing to note here, once and for all, is that a discrete bi-fermion may require the use a substrate space ≠
3, if it is preferred to construct an SFF-FS or SFF-DU Hamiltonian for GSQC. On the other hand, it should also be borne in mind that the fermionic Schrödinger property is not indispensable for universal and efficient GSQC as well as MCQC.
Specifically, in a many bi-fermion system implementing a GSQC machine, the bi-fermions are individually distinguishable and interacting through couplings between their constituent fermions of different species, while the two fermions within each bi-fermion are identical and usually non-interacting. Each abstract quantum computational rebit associated with a binary configuration space {0, 1} and a two-dimensional Hilbert space L2({0, 1}) is homophysically implemented as a bi-fermion rebit associated with a configuration space 2 and a two-dimensional Hilbert subspace
∈LF2(
2) as specified in equation (99). An abstract quantum computational n-rebit system, n∈
associated with a 2n-valued configuration space {0, 1}n and a 2n-dimensional Hilbert space L2({0, 1}n) is homophysically mapped to a quantum n-bi-fermion system that consists of n distinguishable quantum bi-fermion rebits associated with a configuration space
and a tensor product Hilbert space ⊗n⊆LF2(
2n), henceforth called the quantum state space of n bi-fermions, which further can be isophysically mapped to a rectified n-bi-fermion system that consists of n distinguishable rectified bi-fermion rebits associated with a configuration space
∀n≥1, henceforth referred to as the simultaneously individually rectified (SIR) configuration space of n bi-fermions, and a tensor product Hilbert space ⊗n⊆L2(
*2n), henceforth referred to as the rectified state space of n bi-fermions, following the standard procedure of constructing/defining a tensor product vector space [285] to represent the quantum state of a many-body system.
An abstract n-rebit (n∈) quantum computer can be realized as either a quantum or a rectified n-bi-fermion system via the following homophysics
. An abstract n-rebit quantum state
is embodied by a system of n bi-fermions at the state
in the second-quantization representation, where
with W=L, P, R respectively the fermion creation operator for the |L, |P
, |R
singe-particle orbit of an i-th fermion species that moves in its substrate space, or
in the first-quantization representation, with
∀(χ1 . . . χn)∈{0,1}n, where ∀i∈[1, n], ∀j∈{1, 2}, |Wji with Wj=L, P, R denotes the j-th artificially labeled identical fermion of the i-th species occupying respectively the |L
, |P
, |R
single-particle orbit. In equations (120-122), the coefficients
(cs
-valued or
-valued, depending upon the codomain of the
homophysics is associated with a quantum or a rectified n-bi-fermion system.
It should be reemphasized that, adopting the so-called logic-0-positive convention or representation, namely, imposing the condition that the coefficient c0 in equations (99) and (100) or the coefficient c0 . . . 0 in equation (119) should be non-negative, is for conventional and notational convenience, which does not induce any loss of generality, since a global phase factor (a +/−sign, to be exact) in or out of a global wavefunction does not change any physics. Besides, were it really necessary to track the overall phase of a wavefunction representing a collection of bi-fermions, such collection of bi-fermions could always be augmented by the addition of a single bi-fermion, called the sign keeper, to which only two possible wavefunctions |R1|P2
−|R
|P1
or |R2
|P1
−|R1
|P2
in the
representation might be assigned, corresponding to |R1
|P2
or |R2
|P1
respectively in the R, representation, so that an overall phase factor for the pre-augmentation collection of bi-fermions would be kept by the additional sign keeper, while the augmented collection of bi-fermions as a whole could still be fully represented by a wavefunction as prescribed in equation (119).
In the first-quantization representation with the two identical fermions of each bi-fermion artificially labeled, an abstract quantum computational n-rebit state |ψ=Σs
is embodied as a superposition of 2n equally weighted wavefunctions |ψχ
BF, (χ1 . . . χn)∈{0, 1}n, each of which is homomorphic to the abstract quantum computational state |ψ
, and each of which can be obtained from another through fermion exchanges {(π12)i: i∈[1, n]}, with (π12)i denoting an exchange of the two identical fermions of the i-th bi-fermion, ∀i∈[1, n]. Furthermore, it is easily verified that, under any quantum gate operation corresponding to a partial Hamiltonian h∈
(y⊗n) that is necessarily symmetric under permutations of identical particles of the same species, the transformation of |ψ
BF can be interpreted as having each wavefunction |ψχ
BF, (χ1 . . . χn)∈{0, 1}n, which behaves much like a boltzmannonic wavefunction of distinguishable particles, subject to a boltzmannonic quantum gate operation Base(h), simultaneously and in parallel. Intuitively, the homophysical mapping between an abstract n-rebit quantum register and a system of n bi-fermions is as if the bi-fermionic system created a 2n-fold “multiverse”, where in each of the 2n “parallel universes”, there were a quantum register homomorphic to the abstract n-rebit register, albeit such a fictitious quantum register comprises artificially labeled identical fermions, and the states of all of the fictitious quantum registers must be superposed with equal weights and specific signs to constitute a |ψ
BF that is antisymmetric under exchanges of identical fermions. Clearly, there exists a 2n-fold redundancy.
Such redundancy can be handled effectively via an orbit partition under a group action [71-76]. When acting on the configuration space of the spatial coordinates of n bi-fermions 2n or on the Hilbert spaces
⊗n of n-bi-fermion wavefunctions, the fermion-exchange operators (π12)i and (π12)j of two different bi-fermions indexed by i, j∈[1, n] commute, so
as a product group is abelian, called the exchange symmetry group of n bi-fermions. All even-parity operator products of the form Πi=1n(π12)iξ
There are exactly two cosets of An in Sn, namely, ∀i∈[1, n], it holds that (π12)iAn ∩An=θ and (π12)iAn ∪An=Sn. Under the group action of An on the set 2n the orbit
such that r=πq} of any point q∈2n is a subset of equivalent coordinates from the standpoint of quantum physics of n bi-fermions, as the fermion-exchange symmetries associated with bi-fermions dictate that all of the points in an orbit Anq, q∈
2n must be assigned exactly the same wave amplitude, including specifically the ± signs, for any legitimate quantum state of a system of n quantum bi-fermions, or equivalently, for a classical random walk simulating the corresponding system of n rectified bi-fermions, all of the points in an orbit Anq, q∈
2n always assume exactly the same probability distribution at the stationary state of the Markov chain. Indeed, when Monte Carlo-simulating a system of n bi-fermions, the Markov chain is best interpreted as a random walk over the quotient space
where a visit of a representative point q∈2n by the walker should be considered as all of the 2n−1 points in the orbit Anq being visited simultaneously, by 2n−1 walker replicas if one will imagine, and a walk from a representative point q∈
2n to another representative point r∈
2n should be understood as representing a Markov state transition from the orbit subset Anq to the orbit subset Anr, that may be pictured as 2n−1×2n−1 walkers moving simultaneously and independently, each of which escapes from a representative point q′∈Anq and arrives at a representative point r′∈Anr. Still further, it is perfectly practical and a good alternative to do Monte Carlo with a Markov chain over the original configuration space
2n with redundancy, but incorporate a step/subroutine of random walk to sample from the alternating group An, which picks a random permutation uniformly from the set An and makes a transition from a given configuration point q∈
2n to any r∈Anq with equal probability. Such random sampling from a permutation group is rather similar to techniques used in conventional quantum Monte Carlo simulations for modeling exchange effects of identical particles, either bosons or fermions [168, 172, 286].
Consider the set of computational basis states {|s1 . . . sn∈L2({0, 1}n): si∈{0, 1}, ∀i∈[1, n]} associated with n computational rebits of a quantum computer, which are henceforth referred to as the computational n-rebit basis states, and the set of homophysical images {
(|s1 . . . sn
)∈y⊗n: si∈{0, 1}, ∀i∈[1, n]}, y∈{
} that are associated with an homophysical n-bi-fermion system in the first quantization representation with the pair of identical fermions of each bi-fermion artificially labeled as the first and the second respectively, which are henceforth referred to as the quantum n-bi-fermion basis states. Except for a negligible subset with a small measure O(γ0−2) in quantum probability (i.e., quantum amplitude squared), for most configuration point q∈
2n, there exists a unique n-bit array or string
signified as
with which the computational basis state
signified as
is homophysically mapped to a unique n-bi-fermion basis state |(s(q))
∈y⊗n such that the point amplitude
q|
(s(q))
≠0, in which case q is said to be signed, with
when q|
(s(q))
>0, and
when q|
(s(q))
<0. Otherwise,
if q∈2n cannot be signed. For any q∈
2n that is signed, let abs(q) denote the unique equivalent point in the orbit Anq, called the absolute value of q, which has every bi-fermion so configured as to have the first identical fermion located in a logic well and the second localized in the Pauli well. Otherwise,
if sign(q)=0. For any signed q∈2, the state vectors sign(q)|s(q)
∈L2({0, 1}n), sign(q)|
(s(q))
∈
⊗n, and π(1−sign(q))/2
(s(q))
∈
⊗n with any odd permutation π∈Sn\An are respectively called the computational n-rebit basis state, the quantum n-bi-fermion basis state, and the rectified n-bi-fermion basis state consistent with the configuration point q. For any signed q∈
2n that corresponds to an n-bit array s(q)=(s1(q) . . . si(q) . . . sn(q))∈{0, 1}n, define
if q cannot be signed. Let
denote the operations of exchanging the pairs of identical fermions simultaneously for, respectively, all of the bi-fermions that register a bit 0, all of the bi-fermions that register a bit 1, and all of the bi-fermions indiscriminately.
Definition 43. (Sentinel Configuration Points, Configuration Space, and Representation)
Given a signed q∈2n corresponding to a bit array (s1 . . . sn)q∈{0, 1}n, let (ρ0, ρ1)∈{0, 1}2 be such that ρi=
i (mod 2), ∀i∈{0,1}. Such a signed q∈
2n is called a sentinel configuration point if q=Π0δ
2n that is signed, let sent(q) denote the unique sentinel configuration point within the equivalence class Anq. The subset
sign(q)≠0}⊆2n is called the sentinel configuration space. For any given quantum or rectified state ψ∈
or ψ∈
the domain-restricted wavefunction
is called the sentinel representation of ψ. In particular, the sentinel representation of an either quantum or rectified n-bi-fermion basis state is called a sentinel n-bi-fermion basis state.
In the above definition, PL2(
n)⊆L2(
2n), y∈{
} denotes a projection operator such that
with being the indicator function for the set
n, and L2(
n) represents a Hilbert space of square integrable functions having
n as their domain, over either the conventional field
of real numbers or the isomorphic field
as specified by equations (107) and (108), depending upon the interested wavefunction is interpreted as ψ∈
⊗n or ψ∈
⊗n. In the latter case, the Hilbert subspace over the field
and supported by the sentinel configuration space
n is denoted specifically as
and called the sentinel state space of n bi-fermions. For any |ψ′=
ψ′(q)|q
dVg(q)∈Sn, the coefficient ψ′(q) is
-valued, ∀q∈
n, which if negative would carry a π∈Sn\An operator representing a negative sign. If n is odd, then for each signed configuration point q∈
2n, there is precisely one index set
(q) whose cardinality
(q)| is odd, with ii ∈{0, 1}, consequently, a negative sign in the coefficient ψ′(q) is equivalent to the operator Πi
into |Πi
, so to leave only the absolute value |ψ′(q)| in the coefficient, while maintaining the closedness and completeness of the sentinel configuration space
n.
From this point on, as often being reminded explicitly, it is convenient and without loss of generality to assume or make the total number of bi-fermions to be odd in any physics or system of concern, because, if necessary, an auxiliary and dummy bi-fermion can always be added to any interested system of bi-fermions, with said auxiliary and dummy bi-fermion not interacting with any bi-fermion or rebit, but just remaining at a fixed ground state and helping to book-keep the negative sign.
Let n with
∈{
} be a configuration space and y⊗n with y∈{
} be a Hilbert space representing either the quantum or the rectified state space of n bi-fermions,
(y⊗n) be a Banach algebra of bounded linear operators on y⊗n, so that (
n, y⊗n,
(y⊗n)) constitutes a quantum physics/system of n bi-fermions, where the n bi-fermions are distinguishable and can interact with each other, while no interaction is allowed between the two identical fermions constituting each bi-fermion. Construct a mapping between physical systems
: (
n, y⊗n,
(y⊗n))
(
)) such that 1)
(U)={sent(q): q∈U}, for any subset U⊆
n,
−1(V)=Uq∈V (Anq∩
n) conversely, for any subset V⊆
n; 2)
and conversely, −1(ϕ)=PFϕ∈Y⊗n, ∀ϕ∈Sn, where PF is the familiar fermionic antisymmetrization operator defined as
∀ϕ∈Sn⊆L2(2n) (note the peculiar definition regarding the operator PF−1); 3)
(O)=PF−1OPF∈
(Sn)⊆
(Sn) for all O∈
(Y⊗n), and conversely,
−1(O*)=PFO*PF−1∈
(Y⊗n)⊆
(Y⊗n), ∀O*∈
(Sn), which define the rules of correspondence and simultaneously the domains and ranges of the mappings between operators. Note that any operator O∈
(y⊗n) is symmetric and invariant under any exchange permutation τ∈Sn, thus the mapping
:
(y⊗n)
(Sn) is one-to-one and invertible.
Auxiliary Utility 10. (Isophysicality of Sentinel Representations)
The mapping : (
n, y⊗n,
(y⊗n))
, Sn,
(Sn)) just defined is an isophysics.
Demonstration. It follows straightforwardly from the definitions.
Note that the sentinel configuration space n and the sentinel state space Sn can be considered rectified spaces, just like the SIR configuration subspace
*2n and the rectified state space
⊗n, where the Hilbert spaces
⊗n and Sn are over the field of rectified numbers
, with the negative sign being implemented by exchanging pairs of identical fermions comprising the bi-fermions, so to avoid the use of any conventional negative real numbers in
, thus afford a pure probabilistic interpretation of any wavefunction in
⊗n or Sn, and facilitate an isophysical simulation of the physics (
*2n,
⊗n,
(
⊗n)) or (
n, Sn,
(Sn)) using Monte Carlo on a classical probabilistic machine. In relation to the original (quantum) physics (
2n,
⊗n,
(
⊗n)) for any interested quantum state in the
⊗n-representation, denoted by ψ∈
⊆LF2(
2n), the rectified physics (
*2n,
⊗n,
(
⊗n)) or (
n, Sn,
(Sn)) works with a wavefunction
or Sn, which is a portion of the positive part of the ψ∈LF2(2n) wavefunction, restricted to a configuration subspace
n ⊂
2n,
∈{
}, which does not intersect any of the negative nodal cells of ψ, namely,
n∩{q∈
2n: ψ(q)<0}=θ. Nevertheless, the rectified wavefunction ψ′=ψ|
For the sake of mathematical formality and rigor, let us spell out the construction of tensor product Hilbert spaces as well as an explicit ground state homophysics between a system of many quantum bi-fermions and a system of many rectified bi-fermions. From the two-dimensional Hilbert spaces {i}i=1n of n∈
distinguishable quantum bi-fermions,
the tensor product Hilbert space
is constructed to describe the whole quantum system of n bi-fermions, which is the quotient of the free vector space F(Πi=1n i) generated by the Cartesian product set Πi=1n
i [285] over the tensor equivalence relation “
” that is characterized by the following equations [285], ∀(u1, . . . , un)∈F(Πi=1n
i), ∀u′i∈
i,
Component-wise Linearity:
(u1, . . . ,ui+u′i, . . . ,un)(u1, . . . ,ui, . . . ,un)+(u1, . . . ,u′i, . . . ,un),∀i∈[1,n], (126)
Commutativity and Distributivity of Scalar Multiplication:
r(u1, . . . ,ui, . . . ,uj, . . . ,un)(u1, . . . ,rui, . . . ,uj, . . . ,un)
(u1, . . . ,ui, . . . ,ruj, . . . un),∀r∈
,∀i,j∈[1,n]. (127)
The equivalence class containing the element (u1, . . . , ui, . . . , un)∈F(øi=1n i),
is called the tensor product of the vectors {ui∈i}i=1n. A fundamental tenet of many-body quantum mechanics is that all possible quantum states of a composite system consisting of multiple parts or particles are represented completely and exactly by the tensor product space generated by the individual Hilbert spaces describing the constituent parts or particles, subject to further constraints of bosonic or fermionic exchange symmetry when indistinguishable particles are involved. In particular, for the concerned case of multiple bi-fermions each consisting of two identical fermions, the “commutativity of scalar multiplication” property of tensor product asserts that fermion-exchange operators associated with any two bi-fermions commute and invert each other, namely, if (π12)i denotes the operation of exchanging the two fermions for the i-th bi-fermion, and (π12)j for the j-th bi-fermion, i, j∈[1, n], then the equivalence identity (π12)i(π12)j
(π12)j(π12)i
I holds, ∀i∈[1, n], ∀j∈[1, n], over the tensor product Hilbert space ⊗i=1n
i.
Similarly, with the state spaces of n∈ distinguishable rectified bi-fermions
the Cartesian product set Πi=1n i generates a free vector space F(Πi=1n
i) over the field of scalars (
, +, *), which reduces to a tensor product Hilbert space
modulo a tensor equivalence relation “” that is characterized by the following equations, ∀(v1, . . . , vn)∈F(Πi=1n
i), ∀v′i∈
i,
Component-wise Linearity:
(v1, . . . ,vi+v′i, . . . ,vn)(v1, . . . ,vi, . . . ,vn)+(v1, . . . ,v′i, . . . ,vn),∀i∈[1,n], (130)
Commutativity and Distributivity of Scalar Multiplication:
r′(v1, . . . ,vi, . . . ,vj, . . . ,vn)(v1, . . . ,r′vi, . . . ,vj, . . . ,vn)
(v1, . . . ,vi, . . . ,r′vj, . . . vn),∀r′∈(
,+,*),∀i,j∈[1,n]. (131)
The equivalence class containing the element (vi, . . . , vi, . . . , vn)∈F(Πi=1n i),
is called the tensor product of the vectors {vi∈i}i=1n. It should be noted that equation (131) makes sense only under the premise that the fermion-exchange operators of any two rectified bi-fermions commute, and invert each other, namely, ∀i∈[1, n], ∀j∈[1, n], the equivalence identity (π12)i(π12)j
(π12)j(π12)j
I holds over the free vector space F(Πi=1n
i), in which case, the subscript “i” in a fermion-exchange operator (π12)i becomes insignificant and can be omitted safely, such that ∀r′=|r|π12(r<0), r∈
, ∀vi∈
1, i∈[1, n], the expression r′vi is always well-defined and equated with |r|(π12)i(r<0) vi. Consequently, there is a large amount of redundancy among the 4n computational basis states for a system of n rectified bi-fermions
{|Wχ1P3−χ1 . . . |WχiP3−χ
i . . . |WχnP3−χ
n: W∈{L,R},χi∈{1,2},∀i∈[1,n]}, (133)
many of which must be regarded as equivalent and the same. Specifically, the equivalence relation
[Πi=1n(π12)iξ1 . . . |WχiP3−χ
i . . . |WχnP3−χ
n
|Wχ1P3−χ
1 . . . |WχiP3−χ
i . . . |WχnP3−χ
n (134)
holds true for any computational basis state |Wχ1 P3−χ1 . . . |Wχi P3−χ
i . . . |Wχn P3−χ
n to so long as Σi=1nξi=0 (mod 2), where ξi∈{0, 1}, ∀i∈[1, n]. Obviously, the equivalence classes of the computational basis states are divided into two groups, which are labeled even and odd respectively, such that each even equivalence class has a representative computational basis state that has no tensor factor of the form |W2Pi
i, W∈{L, R} associated with any rectified bi-fermion i∈[1, n], while each odd equivalence class has a representative computational basis state that has one and only one tensor factor of the form |W2P1
2, W∈{L, R} associated with a certain rectified bi-fermion indexed by i∈[1, n].
The tensor product Hilbert space ⊗i=1n i or ⊗i=1n
i still contains much redundancy, because the state of a system of n quantum or rectified bi-fermions is sufficiently and uniquely represented by an element, called a ray, from the projective Hilbert space
(⊗i=1n
i) or
(⊗i=1n
i), that ignores any nonzero scaling in the respective base Hilbert space,
with the projective equivalence relations and
defined as
∀u,u′∈⊗i=ni,u
u′ if ∃r∈(
≠0,*) such that u′=ru, (137)
∀v,v′∈⊗i=ni,v
v′ if ∃s∈(
≠0,*) such that v′=sv, (138)
In particular, any state [ψ]∈
(⊗i=1n
i) describing a system of n quantum bi-fermions is a
-equivalence class of a representative vector ψ∈⊗i=1n
i such that
in which W1: {0, 1}{L1, R1} is a mapping such that Wi(0)=L1, Wi(1)=R1, and the first nonzero element in the sequence: {rs
n is positive, where the sequence is so arranged with the index (s1 . . . si . . . sn) as a binary integer increasing from 0 to 2n−1. Similarly, any state [ψ′]
∈
(⊗i=1n
i) describing a system of n rectified bi-fermions is a
-equivalence class of a representative vector ψi′∈⊗i=1n
i such that
where ∀(s1 . . . si . . . sn)∈{0, 1}n, the coefficient r′s, and the first nonzero element of the sequence {r′s
, denoted by r′*, is positive, in the sense that r′*=(r*, I)∈
for an r*∈
>0, when again the sequence {r′s
-coefficient r′s
1 . . . |W1 (si)P2
i . . . |W1 (sn)P2
n needs in principle just one operator π12 of fermion-exchange, which can be equivalently applied onto any of the n rectified bi-fermions, to represent a sign flip for the coefficients of the n-rebit computational basis states, although in the previously discussed scheme of SIR representation, it is chosen deliberately to always apply simultaneously an odd number n0∈2
+1 of π12 operators in total, each of which to one of no bi-fermions in a cluster. For a subsystem that collects all of the bi-fermions for ordinary rebits, it is possible, although not absolutely necessary, to always keep all bi-fermions rectified except at most one, called the single unrectified delegate (SUD), to carry an overall π12 operator/sign if present, then any quantum gate U that moves an i-th bi-fermion of ordinary rebit, i∈[1, n] can be effected by firstly transferring the if-present π12 operator from the old SUD to the U-moved i-th bi-fermion, so that the old SUD becomes rectified and the U-moved i-th bi-fermion becomes the new SUD, then applying the gate U, such that, any
-coefficient in the expansion of the quantum state of the subsystem of ordinary rebits, in terms of the computational basis states, similar to what has been illustrated in equation (140), never has more than one operator π12 of fermion-exchange, both before and after any gate operation on a single bi-fermion controlled by multiple bi-fermions.
It is noted that for each i∈[1, n], the Hilbert spaces i for a quantum bi-fermion and
1 for a rectified bi-fermion can both be regarded as subspaces of a larger Hilbert space
Naturally, the tensor product Hilbert spaces ⊗i=1n i and ⊗i=1n
i can be considered as subspaces of the tensor product Hilbert space
which is the quotient space of the free vector space F(Πi=1ni) modulo a tensor equivalence relation
characterized by
Component-wise Linearity:
(w1, . . . ,wi+w′i, . . . ,wn)(w1, . . . ,wi, . . . ,wn)+(w1, . . . ,w′i, . . . ,wn),∀i∈[1,n], (142)
Commutativity and Distributivity of Scalar Multiplication:
r(w1, . . . ,wi, . . . ,wj, . . . ,wn)(w1, . . . ,rwi, . . . ,wj, . . . ,wn)
(w1, . . . ,wi, . . . ,rwj, . . . wn),∀r∈
,∀i,j∈[1,n]. (143)
∀(w1, . . . , wn)∈F(Πi=1n i. ∀w′i∈
i. Each wavefunction in ⊗i=1n
i of the form
with rs≥0, ts
in the conical hull coni[(⊗i=1n i)*] spanned by
with scalar coefficients from the set ≥0, where each wavefunction is non-negative-valued at any location in the bi-fermion configuration space
2n. In this way of interpretation, ⊗i=1n
i is a subset of coni[(⊗i=1n
i)*], which is in turn a convex cone in the Hilbert space ⊗i=1n
i. Similarly, a typical dyad operator in
(⊗i=1n
i) of the form |ψ′
ϕ′|, ψ′, ϕ′∈⊗i=1n
i can be interpreted as an element in the convex cone conf[(⊗i=1n
i)*(⊗i=1n
i)*+]⊂
(⊗i=1n
i), with
Therefore, by virtue of fermion sign rectification and as the basis of Monte Carlo quantum computing, a collection of multiple quantum bi-fermions with a Hilbert space
and the Banach algebra ) can be isophysically represented by a system of multiple rectified bi-fermions with a Hilbert space
and the Banach algebra , through an isophysics
: (
))
(
)) such that any wavefunction ψ′∈
, interpreted as an element in coni[(⊗i=1n
i)*]⊂⊗i=1n
i, is just the positive part of the vector
with ψ and ψ′ mutually and uniquely determining each other through the isophysics . Having all wavefunctions and bounded linear operators positively valued in the first-quantized configuration space representation, the quantum physics (
)) becomes amenable to Monte Carlo simulations on a classical probabilistic computer.
A ground state computational physics is basically concerned with the C0-semigroups, also known as strongly continuous one-parameter semigroups [93], of the Gibbs operators {G(τHk)}k=0K, τ∈[0, ∞) that are associated with the set of self-adjoint FBM tensor monomials {Hk}k=1K and the Hamiltonian of total energy
as the infinitesimal generators, such that
∀k∈[0, K], ∀τ∈[0, ∞). Of particular interest is to drive a GSQC system to its ground state ψ0(H) by applying the Gibbs operators, and to sample from the probability distribution |ψ0(H; q∈2n)|2dVg. If all of the operators Hk, k∈[0, K] are either of a finite rank or unbounded from above, then all G(τHk), k∈[0, K] are compact operators, ∀τ∈(0, ∞). Furthermore, if all Hk, k∈[0, K] are lower-bounded by zero, then {G(τHk)}τ∈[0,∞), k∈[0, K] are all contraction semigroups. GSQC arises naturally from and is widely useful in the contexts of computational physics and quantum chemistry, as well as non-physics-related computational problems of optimizations and searches, where an objective function is encoded in the form of an energy functional resembling the Hamiltonian of a fictitious physical system. However, the power of GSQC far transcends the evaluation and optimization of energy functionals. As having been well established previously [19-22], as well as will be detailed in the next section, GSQC is actually universal for quantum computing in that, any quantum dynamics as a series of (a polynomial number of) unitary operations on a quantum system that is either executing or being simulated by a quantum algorithm, can be encoded into the ground state stationary properties of a larger quantum system with a polynomially bounded overhead, whose configuration space has an extra spatial dimension encoding the time variable of the concerned quantum dynamics. In a sense, GSQC is all that is needed as far as quantum computability and quantum computational complexity are concerned.
Nevertheless, it is still inspiring to note, even just in passing, that a quantum dynamics which either describes a quantum system or is implemented on a quantum computer based on rebits, where the effects of unitary state transformations are emphasized, can be described by a quantum physics (2(n+1),
⊗
⊗
)), n∈
in conjunction with a self-adjoint operator H∈
⊗
) designated as the Hamiltonian representing the total energy of the system, where the Hilbert spaces
and =c0|0
+c1|1
: c0, c1∈
as well as the Banach algebra
⊗
)=
⊗
and the Banach space
⊗
), are all defined with respect to the field of real numbers. By Stone's theorem [93], even though itself may not lie in
⊗
), H can generate a strongly continuous one-parameter unitary group
which acts on the tensor product Hilbert space ⊗
and is contained in the Banach algebra
⊗
), with
being the state space associated with an auxiliary rebit to flag and represent the real and imaginary parts of complex-valued amplitudes, and
representing an operator in that constitutes a matrix embodiment of the imaginary unit, such that i2=−1, with
representing the identity operator in . The matrix i generates a Banach algebra
[i], that consists of matrices with a non-negative definite determinant, the square root of which serves as the norm.
[i] is also a unital and commutative division algebra, constituting an isomorphic representation
[i]≅C of the space of complex numbers, both as an algebraic field and as a topological group. Now
⊗
can be considered as a Hilbert space over the field
[i] which is isomorphic to the field of complex numbers
, and
⊗
) can be regarded as a Banach algebra also over the field
[i]≅
. Therefore, (
2(n+1),
⊗
)) constitutes an isophysical representation for any general quantum physics (
)) involving complex-valued state vectors, with a Hilbert space
and the Banach algebra
being defined with respect to the field of complex numbers . Through another homophysics
using our field of rectified numbers
and an imaginary unit
∈GL(2, ), the universal rebit-based quantum physics (
2(n+1),
⊗
⊗
)) can be homophysically mapped into a quantum system (
*2n+2,
⊗
⊗
)) of rectified bi-fermions, with
representing the rectified state space of a single bi-fermion as defined in (100). Thus, even a quantum dynamics involving complex-valued wavefunctions being transformed by a one-parameter unitary group can be sign-rectified and implemented in a system of many rectified bi-fermions, where both the wavefunctions and the unitary groups are represented without involving any negative real number, consequently, the quantum dynamics is homophysically mapped into a standard Markovian process in classical probability theory, which is amenable to efficient Monte Carlo simulations on a classical probabilistic machine.
In spirit, our method of fermion sign rectification using bi-fermions and representing the numerical “−” sign by the fermion-exchange operator π12 is analogous to the Janzing-Wocjan representation and its application in the Jordan-Gosset-Love technique of stoquastization [288], where the multiplicative group ({1, −1}, *) for the sign of real numbers is represented by the multiplicative group ({I, σx}, *) of operators on an ancilla qubit, so that any designer Hamiltonians for QMA-complete and universal adiabatic computations using excited states can be converted into stoquastic matrices. I and σx are respectively the 2×2 identity matrix and the Pauli matrix measuring spin along the x axis. Also similarly, our method of fermion sign rectification and the Jordan-Gosset-Love technique both entail a multi-fold degeneracy/redundancy of sorts, when encoding the solution of a general BQP problem into an eigenstate of the correspondingly constructed stoquastic Hamiltonian. The Jordan-Gosset-Love technique introduces an ancilla qubit whose σx eigenstates |±=(|0
±|1
)/√{square root over (2)} create two computational manifolds, where the |−
-associated manifold encodes the general BQP problem of interest, while the 1-O-associated manifold corresponds to a stoquastic Hamiltonian that has little to do with the original BQP problem. Our method of fermion sign rectification, using a system of bi-fermions and the fermion-exchange operator π12, encodes the solution of a general BQP problem into the ground state of a many-bi-fermion system consisting of two classes of probability amplitude distributions that are isomorphic and symmetric with respect to each other, with one class being amplitude distributions in positive nodal cells, and the other class being amplitude distributions in negative nodal cells. There is a crucial difference though. While the Jordan-Gosset-Love technique either encodes the desired solution in a certain excited state associated with the |−
state of the ancilla, or alternatively biases the |−
state to a lower energy than the |+
state of the ancilla, thus unfortunately spoils the stoquasticity of the total Hamiltonian, our method of fermion sign rectification has the desired solution encoded exactly in the ground state, whose nodal structure can always be determined locally in the configuration space, thanks to the property of local node-determinacy afforded by the employed SFF Hamiltonian. That is fundamentally why our method of fermion sign rectification overcomes the infamous sign problem and enables efficient Monte Carlo simulations of SFF-FS or SFF-EB systems.
More specifically, the essential property of a designer Hamiltonian being separately frustration-free endows our method of fermion sign rectification with distributive and efficient solutions of nodal surfaces associated with FBM tensor monomials, which are all consistent with and collectively determine the global nodal structure of the unique ground state of the designer Hamiltonian. Localization of nodal surfaces afforded by the property of a designer Hamiltonian being separately frustration-free is the central pillar for solving the sign problem, which can even do without using bi-fermions. Indeed, a general bi-fermion supported by a continuous or discrete configuration space 2 can be equivalently encoded into a pair of ordinary rebits, with the first ordinary rebit bearing the 0 or 1 logic value, and the second indicating a positive or negative sign for the quantum amplitude, such that a bi-fermion state a|↓
+b|↑
with a, b∈
is encoded, namely, isophysically implemented, as
with the representation of (ϕ) involving no negative real numbers. Essentially, the π12 operator that acts on a bi-fermion and represents a negative sign is in turn mapped isophysically to the σx=|1
0|+|0
1| operator acting on the sign-indicating ordinary rebit. Then, any single-bi-fermion-moving GSQC gate
is isophysically implemented as
which is stoquastic, and consequently, the associated Gibbs operator
for any >0 has all non-negative matrix elements when represented in terms of the four computational basis states |00
, |01
, ∥0
, |11
of the conventional two-rebit system, so that the Markovian transition matrix [ϕ]G(
h) [ϕ]−1 becomes a bona fide stochastic matrix in terms of the same basis states. Furthermore, and clearly, it is even possible to employ just one sign-indicating ordinary rebit shared by many ordinary rebits representing a collection of many bi-fermions, because, as being noted before, it is sufficient to restrict all computational basis states of a many-bi-fermion system to contain no more than one negatively configured bi-fermion at any given time, and it is permissible to transfer any π12 operator freely from one bi-fermion to any other bi-fermion.
Universal Quantum Circuits Using Bi-Fermion Rebits
Having rigorously established the BPP solvability of SFF-FS and/or SFF-EB systems, also thor-oughly discussed bi-fermions and interactions among them as examples of building blocks, it only remains to demonstrate how an SFF-FS or SFF-EB system with a designer Hamiltonian for universal GSQC can be constructed out of such bi-fermion building blocks, which is able to encode any BQP computation in the designer Hamiltonian, more specifically, its ground state. In terms of a general quantum system consisting of abstract computational rebits, such a construct has been well established and known as the Feynman-Kitaev construct of time-space circuit-Hamiltonian mapping [19-22, 109, 288-290], which provides a recipe to compose a designer Hamiltonian HFK, called a Feynman-Kitaev Hamiltonian, whose ground state encodes the entire computational history of any BQP circuit that may or may not be fault-tolerant, and is given as a series of self-inverse quantum gates {U(t)}t=tT, where, ∀t∈[1, T]⊆, U(t) is a unitary operator with real-valued entries such that [U(t)]−1=U(t), corresponding to and representing a quantum computational gate.
In the case of quantum error correction being employed in said BQP circuit, it is necessary to note that concatenated quantum fault-tolerant encoding induces a hierarchical structure consisting of multiple levels of relatively called noisy versus coded rebits/gates, where at each level of fault-tolerant encoding, multiple error-prone noisy rebits are used to encode a single coded rebit, which in turn may serve as a noisy rebit at the next level of hierarchy to encode a higher-level coded rebit. Also, at each such level of encoding hierarchy, every “elementary” gate operation on one or two coded rebits comprises, or is realized by, a series of gate operations on the constituent noisy rebits. Now it can be clarified that the aforementioned series of self-inverse quantum gates {U(t)}t=1T refer to quantum operations on the truly physical rebits at the lowest level of the encoding hierarchy, which are the atomic units and building blocks of all of the multi-leveled circuitry of quantum computing, state preparation/measurement, and error-correction encoding/decoding to realize said fault-tolerant BQP circuit.
In a general Feynman-Kitaev construct, the computational history of a possibly fault-tolerant BQP circuit as a series of self-inverse quantum gates {UL(t)}t=1T is ground-state-encoded using a so-called clock register consisting of T+1 clock rebits and a so-called logic register comprising N logic rebits, T∈, N∈
, with the combined system of two quantum registers living in a Hilbert space spanned by effective computational basis states of the form
|c0c1 . . . cTC|l1 . . . lN
L,ct∈{0,1},∀t∈[0,T],li∈{0,1},∀i∈[1,N],
and being governed by a Feynman-Kitaev Hamiltonian
with each of the four partial Hamiltonians on the right hand side being DFF as
where ∀x∈, └x┘ denotes the largest integer that is smaller than or equal to x,
ΠC,t and ΓC,t are single-rebit operators acting on the t-th clock rebit such that ΠC,t|ctC,t=(1−2ct)|ct)C,t, Γt|ct
C,t=|(1−ct)
C,t, ∀ct∈{0,1}, ∀t∈[0, T], with ΠC,t=T+1+ reducing to the identity when there is no rebit to operate upon
is a single-rebit operator acting on the i-th clock rebit such that ZL,i|liL,i=(1−2li)|li
L,i, ∀li∈{0,1}, ∀i∈[1, N], while UL(t) denotes the t-th quantum computational gate that operates on no more than two logic rebits, Hprop, t as defined in equation (153) is called the t-th Feynman-Kitaev propagator (FK-propagator), ∀t∈[1, T]. Here and after in this specification, a subscript “C” indicates a clock rebit or the clock register, a state vector of a clock rebit or the clock register, or an operator acting on one or more clock rebits. By the same token, a subscript “L” signifies a logic rebit or the logic register, or a state vector or operator associated with logic rebits or the logic register. Such subscript “C” or “L” may be omitted when there is no ambiguity as to whether a clock or logic rebit is referred to, especially when a universal index is employed to address all of the rebits uniquely. The partial Hamiltonian Hclock ensures that the so-called domain wall clock [289, 290] states
lie in and span the manifold of the lowest and zero-valued energy, Hinit initializes the logic register to a prescribed initial state
∈{0, 1}N at “time” t=0 associated with and represented by the clock state
while the partial Hamiltonian Hprop, t, ∀t∈[1, T] facilitates a transition of the whole Feynman-Kitaev construct between the state |(t−1))C|ϕt−1L at time t−1 and the state
at time t, through which, the quantum gate UL(t) is effected on the logic register, such that, as t∈ increases from 0 to T, the sequence of unitarily time-evolving quantum states {|ϕt
L: t∈[1, T]}⊆
({0, 1}N) are defined and generated as
for all t∈[1, T] recursively.
For convenience in referring to the partial Hamiltonians, let
therefore, HFK=Σk=14 Hk. The partial Hamiltonians H1 and H2 are straightforwardly DFF. It is precisely in order to render both H3 and H4 DFF that the set of FK-propagators {Hprop, t: t∈[1, T]} are split into the two of them. Define size
As will be demonstrated below, when τ∈(0, ∞), τ=Θ(poly(size(HFK))) is sufficiently large, the Gibbs operators exp(−τHFK) and
are all essentially ground state projectors, and a process of iterating the four Gibbs operators {Gk}k∈[1,4], or the four similarity transformed Markov counterparts
mixes rapidly and converges to the ground state projector exp(−τHFK) or its Markov counterpart. All combined, the Feynman-Kitaev Hamiltonian HFK=Σk=14 Hk is SFF, which confines the system of two quantum registers to within a ground space spanned by the states {|tC|ϕt
L: t∈[0, T]}, called the Feynman-Kitaev history states, and links these Feynman-Kitaev history states into a one-dimensional Feynman-Kitaev lattice, with each lattice site being indexed by a t∈[0, T] and called the t-th Feynman-Kitaev site, or the Feynman-Kitaev time t. On the Feynman-Kitaev lattice, the Hamiltonian HFK is unitarily similar and equivalent to a tridiagonal matrix [20-22, 109, 288-290]
which has
as eigenvalues and the corresponding eigenvectors respectively [20]. Obviously, the Hamiltonian HFK is polynomially gapped, having 0 and 2−2 cos[π/(T+1)]=Ω(T−2) as the lowest and second lowest eigenvalues. The unique ground state of HFK is given by
or in a more compact form using shorthand notations,
√{square root over (T+1)}|ψ0(HFK)=Σt=0T|t
C[Πτ=1tUL(τ)]|l1 . . . lN
L. (157)
Being able to sample from |ψ0(HFK) implies that the result state
is amenable to efficient sampling, with the overhead factor T+1 being polynomially bounded. Better yet, a quantum circuit can be so arranged to have a number T′=Θ(T) of identity gates padded [288-290] after the last nontrivial gate UL(T) for a required state transformation to generate |ϕ*L=|ϕT
L as the result of the desired quantum computation, so that the ground state of the Feynman-Kitaev construct contains T′+1 identical copies of the result state |ϕ*
L. With such padding of identity gates, there is a boosted probability of (T′+1)/(T+T′+1)=Ω(1) to obtain a copy of the result state |ϕ*
L by sampling from the ground state of the Feynman-Kitaev construct. Depending on how the computational complexity Cost(HFK, T, T′) of generating a sample from |ψ0(HFK)
scales as the numbers of gates T and T′ increase, the parameter T′ can be optimized so that the cost Cost(HFK, T, T′)×(T+T′+1)/(T′+1) of producing a sample from the result state |ϕ*
L is minimized. Throughout this specification, unless explicitly stated otherwise, it is always assumed that a Feynman-Kitaev construct is identity-gate-padded properly, and the variable T is redefined to represent the total number of gates in the identity-gate-padded Feynman-Kitaev construct, having an optimal portion of the quantum gates being the identity operator to duplicate the computational result.
As usual let
θ∈[−π, π),
denote the Pauli matrices as single-rebit operators acting on a general computational rebit addressed by a universal index i∈[0, T+N], which may be either an i-th clock rebit when i∈[0, T], or an (i−T)-th logic rebit when i∈[T+1, T+N]. Define shorthand notations for the single-rebit-controlled single-rebit-transforming controlled-R/X/Z gates as
with i, j∈[0, T+N] indexing the clock or logic rebits. It is easily verified that Rj(θ), Xj, Zj, Rij(θ), Xij, and Zij are all self-inverse operators. It is also known that matrix-entry-wise real-valued quantum gates operating on real-valued wavefunctions can implement any quantum circuits [40, 109, 291] in the sense of homophysics, and the composite gate Rij (θ)Xij or Rij (θ)Zij, i, j∈[T+1, T+N] is already universal for any single fixed θ∉π [291]. Therefore, it is more than sufficient to consider an arbitrary BQP algorithm implemented as a fault-tolerant quantum circuit using self-inverse operators [109] from a set of at-most-two-rebit gates
with
and Θ∈ being any set of real number(s) that contains at least one irrational number, where each gate operation does not have to be perfect, but is allowed and assumed to err independently, so long as the probability of error is below a constant threshold set by an employed scheme of quantum fault-tolerant encoding [277-284], or without the use of quantum error correction, as long as the rate of error per gate is lower than a polynomial bound, such that the cumulative probability of errors being treated as hard failures will not become unacceptably too large even after the entire sequence of a polynomial number of quantum gates implementing the BQP algorithm.
Let () be a quantum physics/system representing the Feynman-Kitaev construct of abstract computational rebits, where
is a discrete configuration space,
is a Hilbert space supported by
is a Banach albegra operators over the field , which contains all of the semigroups {exp(−
Hk):
∈[0, ∞), k∈[1, 4]} generated by the partial Hamiltonians of {Hk}k∈[1,4], as well as the semigroup {exp(−
HFK):
∈[0, ∞)} generated by HFK=Σk=14 Hk. It is straightforward to have each abstract computational rebit implemented into, or represented by, a concrete bi-fermion as being constructed and described in the previous section with great details, so to effect a homophysics
: (
)
(
) from the quantum system (
) into a many-fermion system (
), where each of the image operators {
(Hk)}k∈[1,4] is fermionic Schrödinger, or essentially bounded, or both fermionic Schrödinger and essentially bounded, mostly importantly, as well as DFF, because each of the preimages {Hk}k∈[1,4] is DFF by design. The above-mentioned process of iterating the Gibbs operators {Gk}k∈[1,4] or the four Markov operators {Mk}k∈[1,4] that mixes rapidly and converges to the ground state projector exp(−τHFK) or its Markov counterpart is homophysically mapped to a process of iterating the four Gibbs operators {
(Gk)}k∈[1,4] or the four Markov operators {
(Mk)}k∈[1,4] that mixes rapidly and converges to the ground state projector exp(−τ
(HFK)) or its Markov counterpart. Therefore, the homophysical Feynman-Kitaev Hamiltonian
(HFK) is SFF-FS, or SFF-EB, or both SFF-FS and SFF-EB thus SFF-DU. Such
(HFK) is always amenable to efficient simulations via Monte Carlo on a classical computer.
It is important and interesting to note that, as building blocks in homophysical implementations of GSQC systems, particularly the Feynman-Kitaev constructs, the fermionic Schrödinger-oriented bi-fermions and the quantum gate operations on them, may be inherently imperfect and prone to errors and deviations, which necessitate considerations to control and correct them so to prevent their accumulation into a disastrous failure, similar to the need of minimizing qubit and gate error rates and employing quantum error correction in traditional experimental realizations of quantum computing devices. However, unlike the enterprise of experimentally building an actual and physical device for quantum computing, here for MCQC with theoretical and numerical implementations of a quantum circuit using theoretical and numerically simulated bi-fermions, the rebit and gate errors may be by design as a matter of principle, which can be made arbitrarily small easily and cost-effectively, to the limit of completely vanishing, also as a matter of principle. Specifically, the rate of leakage error Tr(Eleak(γ0)) per bi-fermion either can be diminished completely by using an essentially bounded only implementation employing L3 as the substrate space to support bi-fermions, or if the fermionic Schrödinger property is desired, then bi-fermions supported by the substrate spaces
n≥2 can be employed, for which the rate of leakage error Tr(Eleak(γ0)) can be made as small as Tr(Eleak(γ0))=O(γ0−3 log3 γ0) for
when n is large, or as small as Tr(Eleak(γ0))=O(γ0−2) for
in the limit of n→2, namely, the rate of leakage error can be rendered polynomially close to zero-by choosing the parameter γ0>0 sufficiently large but bounded by a polynomial of size(HFK), at the price of a polynomially increased time complexity to simulate such a system via a classical Monte Carlo.
As already been mentioned previously, there are two broadly categorized strategies to deal with the Tr(Eleak(γ0)) probability of error associated with each application of a Feynman-Kitaev gate during MCQC. The first strategy is brute-force, treating any instance of such error as a hard failure, upon which an ongoing Monte Carlo simulation is simply aborted, and a new simulation is restarted from the very beginning. At the most it requires a γ0=O(poly(size(HFK))) to make sure that the total probability of such a hard failure taking place is less than ⅓, over the course of an entire MCQC procedure consisting of an O(poly(size(HFK))) number of Feynman-Kitaev gates, so that said MCQC procedure has a larger than ⅔ probability to succeed, at the price of an O(poly(size(HFK))) complexity overhead for simulating a single Feynman-Kitaev gate. The second strategy takes advantage of the well developed methods of quantum error correction, either employing techniques known as subsystem-, operator-, or Hamiltonian-encoding [273-276] that suppress qubit errors using energy penalties, or more fundamentally and systematically, building the Feynman-Kitaev construct from a quantum circuit {UL(t)}t=1T that has quantum error correction and fault tolerance via concatenated encoding [277-284] built in as being explained previously in this section. Then, by the celebrated quantum threshold theorem (also known as the quantum fault-tolerance theorem), the built-in quantum fault tolerance is able to correct and clean up all of the errors and detrimental effects that take the form of an inaccurate or wrong state in a logic rebit, or a total loss of a rebit (namely, leaking out of its computational basis space), or an inaccurate or wrong logic gate operation leading to such a logic rebit error, at a computational cost of no more than a polylogarithmic overhead factor to the time and space complexities, provided that the error probability Tr(Eleak(γ0)) is below a certain constant threshold [277-284].
Using any of the Algorithms 1 through 8 with the polynomial Lie-Trotter-Kato decomposition of (18), each of the four ground state projectors {Gk}k∈[1,4] or {Mk}k∈[1,4] can be Monte Carlo simulated at the cost of a polynomial-bounded computational complexity. Then using Algorithm 9, the ground state projector exp(−(HFK)) or [ψ0(
(HFK))]exp(−
(HFK))[ψ0(
(HFK))]−1,
∈(0, ∞),
=Θ(poly(size(HFK))) can be simulated by simulating the sequence {Gk}k∈[1,4] or {Mk}k∈[1,4] repeatedly for a polynomial-bounded number of times.
Major Utility 2. (A Second Major Utility of Monte Carlo Quantum Computing)
Any BQP algorithm can be polynomially reduced to the computational problem of simulating an SFF-DU Hamiltonian, with the polynomial reduction also guaranteeing warm starts.
Demonstration. Firstly, if desired, any BQP circuit that executes a prescribed BQP algorithm using a number T0 of quantum gates from a fixed universal set of few-body-moving operators on N0 rebits can be transformed into a fault-tolerant quantum circuit comprising a number T=O((T0+N0)poly(log(T0+N0))) of multi-rebit-controlled single-rebit-transforming gates {U(t)}t=1T on N=O((T0+N0)poly(log(T0+N0))) rebits, with the built-in quantum fault-tolerance ensuring successful completion of quantum computation even under gate and rebit errors, provided that the rate of errors per gate and rebit is below a threshold. Or if no quantum error correction is built into the transformed quantum circuit {U(t)}t=1T, then it is only necessary to set N=N0 and T=O(T0), where a constant overhead T/T0=O(1) may be incurred to decompose each of the quantum gates used by the original BQP circuit into an O(1) number of the multi-rebit-controlled single-rebit-transforming gates in {U(t)}t=1T, as well to pad identity gates at the end of the Feynman-Kitaev construct for boosting the probability of measuring the computational result.
Next, a Feynman-Kitaev construct can be generated for the transformed quantum circuit {U(t)}t=1T, with a Hamiltonian HFK as defined in equations (148) through (153), such that the ground state ψ0(HFK) as a superposition of Feynman-Kitaev history states contains both a component that represents the initial state to start the BQP algorithm and one or more components which encodes the solution to the original computational problem, with each of such solution-encoding components carrying an Ω((T+1)−1/2)-sized probability amplitude. When the Feynman-Kitaev construct is implemented into a GSQC system of bi-fermions through a homophysics , the ground state of the homophysical image
(HFK) can be obtained by iterating the four Gibbs operators {
(Gk)}k∈[1,4] or the four Markov operators {
(Mk)}k∈[1,4] in sequence for an O(poly(size(HFK))) times. Each partial Hamiltonian
(Hk), k∈[1, 4] is DFF as a polynomial Lie-Trotter-Kato decomposition into a number O(T0+N0) of O(1/poly(T0+N0))-almost node-determinate FBM tensor monomials. A warm start is readily obtained by sampling a configuration point from the initial Feynman-Kitaev history state with t=0.
It only remains to show that iterating the ground state projectors {Gk}k∈[1,4] or {Mk}k∈[1,4] in sequence per Algorithm 9 does mix rapidly and converge to the ground state projector exp(−HFK) or its Markov counterpart polynomially fast, therefore the same holds true for and with the homophysical images. To that end, it is noted that repeated applications of the {Gk}k∈[1,4] or {Mk}k∈[1,4] operators ensure that the Feynman-Kitaev construct remains in the low-energy manifold spanned by the Feynman-Kitaev history states, using which as basis states each of the DFF operators H3 and H4 is represented as a particular form of tridiagonal matrix, that is a direct sum of 2×2 matrices, each 2×2 matrix mixing the quantum probability amplitudes between two adjacent Feynman-Kitaev sites. Iterating the sequence of operators {Gk}k∈[1,4] or {Mk}k∈[1,4] constitutes a time-inhomogeneous Markov chain [140, 264, 265], where an effective transition matrix T1 per each iteration, defined as
obviously an irreducible matrix, so that the time-inhomogeneous Markov chain is bound to converge to the unique steady state in which all Feynman-Kitaev sites have an equal probability amplitude, corresponding to the Feynman-Kitaev ground state of equation (156). Further, the T1 matrix corresponds to a linear graph on the Feynman-Kitaev lattice, where the Feynman-Kitaev sites are the vertices, and between each pair of adjacent Feynman-Kitaev sites there is an edge associated with a matrix element that is lower-bounded by Ω(T−1). In other words, the graph has an inverse polynomial-bounded conduc-tance or Cheeger constant, which ensures an inverse polynomial-bounded spectral gap for the T1 matrix by the celebrated Cheeger inequalities [58, 138, 293, 294], which in turn guarantees that the time-inhomogeneous Markov chain of interest converges polynomially fast.
Major Utility 3. (A Third Major Utility of Monte Carlo Quantum Computing)
BQP⊆BPP, therefore BPP=BQP.
Demonstration. By Major Utility 2, any BQP algorithm executed on a suitable fault-tolerant BQP circuit can be encoded into the ground state of an SFF-DU Hamiltonian, that is polynomially gapped and polynomial Lie-Trotter-Kato decomposable into a polynomial number of FBM interactions, each of which is polynomially computable and polynomially gapped. Furthermore, a warm start is guaranteed by such ground state encoding. Then BQP⊆BPP follows straightforwardly from Major Utility 1, while BPP⊆BQP is trivial as well known, thus BPP=BQP.
The significance of BPP=BQP can be hardly missed. It not only answers the long outstanding question of whether the laws of quantum mechanics endow more computational power, but also provides constructive and practical methods for simulating the all-important many-body quantum systems efficiently on classical machines. Furthermore, it opens up new avenues for developing and identifying efficient probabilistic algorithms from the vantage point of quantum computing. Any quantum based or inspired solution to a computational problem translates automatically into an efficient classical probabilistic algorithm. For instance, it is now certain that integer factorization is in BPP, thanks to Shor's celebrated quantum discovery [34] that started the long line of thoughts on quantum computing. Beyond the decision/promise problems in BPP=BQP, as well as function/optimization/search problems that are polynomially reducible or equivalent to them, there are potentially harder computational problems captured by the complexity classes of MA/QMA [41, 45]), which characterize decision/promise problems whose instances can be decided by Arthur the verifier running a bounded-error probabilistic/quantum polynomial time Turing machine, after receiving from Merlin the prover a polynomial-sized classical/quantum proof message, that is, respectively, a string of classical bits whose length is polynomially bounded, or a quantum state of a polynomially bounded number of qubits. There is also the class MQA (Merlin Quantum Arthur [41]), alternatively known as QCMA (Quantum Classical Merlin Arthur [295]), which represents a subset of QMA by restricting the proof message from Merlin to Arthur to be a string of classical bits. Other than the obvious MA⊆MQA⊆QMA, it has been an open question to refine the set relationship. A straightforward Derived Utility of Major Utility 3 is the following:
Derived Utility 5. MA=MQA.
Below are two other interesting Derived Utilities regarding complexity classes.
Derived Utility 6. BQP⊆MA⊆Σ2∩Π2⊆PH.
Demonstration. This follows straightforwardly from Major Utility 3 and the Sipser-Lautemann theorem [296-298].
Derived Utility 7. If NP⊆BQP, then P=BPP=NP=PH.
Demonstration. (NP⊆BQP)∧(BQP⊆BPP)⇒NP⊆BPP⇒P=BPP=NP=PH, where the latter implication has been well known for some time [297, 299].
Further Considerations of Monte Carlo Quantum Computing in Practice
For demonstrating the BPP membership of quantum algorithms, it has been sufficient to construct a quantum circuit using a polynomially bounded number T of self-inverse real-valued gates {U(t): t∈[1, T]} each of which moving no more than two rebits, with a portion of the gates being identities for state copying, then to employ a Feynman-Kitaev construct for GSQC that time-space maps the quantum circuit into a Hamiltonian HFK, whose non-degenerate ground state and excited states are separated by an energy gap that scales inverse quadratically against the number T of quantum gates, which is sufficiently polynomial, although not terribly efficient. Practical Monte Carlo quantum computing applications can no doubt benefit from further optimizations to achieve the most efficient probabilistic simulations.
One type of efficiency consideration tries to minimize the number T of Feynman-Kitaev sites. As having been aforementioned, the many-bodiness of an interaction among rebits/bi-fermions is not necessarily a concern in Monte Carlo quantum computing, when such an interaction is a controlled operation on a number of rebits/bi-fermions conditioned on the Z± projections of a number of other rebits/bi-fermions. While actual physical interactions among three and more particles are rare in nature and difficult to realize in material implementations of physical devices, simulations on classical computers have no problem to model a multi-rebit-controlled multi-rebit-transforming gate effecting a conditional operation on multiple rebits/bi-fermions controlled by the Z± projections of any number of other rebits/bi-fermions, that includes a controlled classical reversible gate array (controlled CRGA), namely, a CRGA operating on a first plurality of rebits/bi-fermions controlled by the Z± projections of a second plurality of rebits/bi-fermions. This possibility can be taken advantage of and used to reduce the number T of Feynman-Kitaev sites, by employing computationally more powerful many-body quantum gates in the forms of controlled CRGAs, as opposed to having always to break down such many-body quantum gates into pieces of few-body interactions.
Indeed, many quantum algorithms often employ classical reversible computations (such as the modular exponentiation function used in the famed Shor's algorithm of factorization [34]) that are CRGAs of the form
F:|s
1
s
2
. . . s
k
|0
⊗l
|s1s2 . . . sk
|F(s1s2 . . . sk)
=|s1s2 . . . sk
|ƒ1ƒ2 . . . ƒl
,k,l∈
, (161)
where (s1 s2 . . . sk)∈{0, 1}l, and ∀i∈[1, l], ƒj=ƒj (s1 s2 . . . sk)∈{0, 1} denotes the j-th bit or component of the function value F(s1 s2 . . . sk), as either a binary integer or a Boolean tuple. Using node-determinate controlled gates {Uj(ƒj)}j=1l of the form {poly({Zi±: i∈[1, k]})j ⊗Xjƒ
where ∀j∈[1, l], the j-th Feynman-Kitaev propagator applies the gate Uj[ƒj(SiS2 . . . sk)], which conditionally flips the j-th bit of the register on the right, as controlled by the state |s1 s2 . . . sk) of the register on the left and in accordance with the Boolean value of the function ƒj(sis2 . . . sk).
Furthermore, it is possible to implement the entire computation F directly within a single clock step through a multi-rebit interaction
without using any auxiliary rebit, where
is treated as a single composite logic gate controlled by the (s1 s2 . . . sk) configuration and in conjunction with the single t-th clock rebit that is implemented as an ordinary rebit, where, as a result of XC,t ⊗FC,t being a multi-rebit-simultaneously-flipping operator, the Feynman-Kitaev propagator Ht(F) is already node-determinate and efficiently computable at the abstract level of interactions among computational rebits, which guarantees node-determinacy and efficient computability for any homophysical image (Ht(F)) downstream as implemented using bi-fermions. As explained in the paragraphs around equation (44), the operator FL, t is doubly node-determinate, therefore, the operators Ht(F) and
(Ht(F)) are guaranteed to be note-determinate. It is also interesting to note that, the operators FL, t, Ht(F), and
(Ht(F)) remain efficiently computable so long as the total number of rebits moved by them is polynomially bounded, since they involve only or mostly projection operators as their tensor factors.
Still further, any essentially idempotent (i.e., idempotent up to a scaling constant) GSQC operator of the form
with c>0, |ψ=cos θ|0
ψ0
+sin θ|1
|ψ1, |ψ0
and |ψ1
being normalized non-negative wavefunctions associated with a fixed set of computational rebits, {|0
, |1
} being the computational basis of a single rebit, θ∈[−π/2, π/2], can be homophysically implemented as an interaction among a system of quantum bi-fermions, which is further fermion sign-rectified and isophysically mapped into a stoquastic operator
where
is a non-negative-definite wavefunction for a system of rectified bi-fermions, whose support is just one specific nodal cell of the associated system of quantum bi-fermions. The isomorphic images of |ψ′ extending to other nodal cells are obtained straightforwardly by exchanging constituent identical fermions, so that the overall quantum ground state wavefunction |ψ″
for the associated system of quantum bi-fermions can be easily reconstructed as
with Gex being the exchange symmetry group of the system of bi-fermions. Then a Markov operator [ψ]exp(−τHψ) [ψ]−1, τ>0 associated with a Gibbs operator exp(−τHψ) of the original system of computational rebits can be homophysically implemented as
which is efficiently computable and directly implemented in a single step of random walk during Monte Carlo simulations, without having to decompose the operator Hψ, or Hψ′, or
or an associated Gibbs operator into a combination of elementary interactions. Namely, the Gibbs transition amplitude
can be efficiently computed for any given pair of configuration points q and r, by firstly identifying a π∈Gex such that both
are within the support of the wavefunction |ψ′, then using the identity
r|exp(−τHψ″)|q
=
r′|exp(−τHψ′)|q′
to compute the desired Gibbs transition amplitude. In case no π∈Gex exists such that (r′, q′)⊆supp(|ψ′)), then set
r|exp(−τHψ″)|q
=0, which enforces the fixed-node diffusion strategy for Monte Carlo simulations.
Another category of efficiency consideration seeks to improve the Ω(T−2) scaling of the Feynman-Kitaev energy gap. In the literature, there have been techniques reported to obtain polynomially improved energy gaps for adiabatic quantum computations and efficient quantum simulations of classical Monte Carlo methods [300-303], culminating in a systematic formulation and solution of “spectral gap amplification” [292], which transforms any Hamiltonian in the form of a sum of frustration-free essentially idempotent self-adjoint operators into a new Hamiltonian with an energy gap that enjoys a polynomially improved scaling. Reference [21] has also alluded to a GSQC Hamiltonian with an energy gap that scales as Ω(T−1). It is also interesting to note another “operator square-rooting” type of technique which rewrites a given Hamiltonian H=ΣtHt with
t∈ into H=(Σt At)(Σt At) with
t∈, then constructs a new Hamiltonian
with |10| and |0
1| operating on an auxiliary rebit, such that the eigenvalues of H1/2 are {±[λn(H)]1/2: n≥0}. However, these spectral gap-improving techniques may not be readily amenable to MCQC, as they either create a degenerate ground state space, or need to operate quantum simulations and adiabatic evolutions on an excited energy level, or produce a new Hamiltonian that is no longer SFF-FS or SFF-EB.
Here I propose a method that adapts the technique of lifting Markov chains [304-306] and designs a lifted Feynman-Kitaev construct which is Monte Carlo-simulated via a lifted Markov chain, so to achieve an O(T) mixing time. A lifted Markov chain [304-306] may be time-inhomogeneous and violate the condition of detailed balance that is usually obeyed by the conventional reversible Markov chains, yet it could still be designed to relax to a desired stationary distribution over time, and do so at a polynomially faster rate of convergence. The proposal is to lift a ring-shaped graph of T∈ vertices for a discrete time Markov chain or its transition matrix into two isomorphic copies at a lower and an upper elevations respectively, similar to what has been exemplified and illustrated in reference [306], where for each state index t∈[0, T−1], there is a t-th vertex at the lower elevation in one-to-one correspondence with a t-th vertex at the upper elevation, both being derived from the t-th vertex of the ring-shaped graph for the original Markov chain before lifting. In each discrete time step indexed by a t∈
, on the ring at the lower elevation, a walker moves predominantly clockwise, hopping from a vertex to its clockwise next neighbor with probability 1−ϵ(T), ϵ(T)=O(T−1), ϵ(T)>0, and on the ring at the upper elevation, a walker moves predominantly counterclockwise, transitioning from a vertex to its counterclockwise next neighbor with probability 1−ϵ(T). The two rings are weakly coupled with each pair of one-to-one corresponding vertices at the two elevations to swap with probability ϵ(T) in each time step t∈
. Such a Markov chain will be referred to as a two-rings-at-two-elevations lifted Markov chain. It is the rapid and mostly unidirectional motion of walkers traversing the state space in O(T) time that enables the lifted Markov chain to reach equilibrium in O(T) time, which is a remarkable improvement over the O(T2) mixing time associated with the usual “diffusive” behavior of a reversible Markov chain.
Given a possibly fault-tolerant quantum circuit as a series of self-inverse quantum gates {U(t)}t=1T to implement a BQP algorithm that computes a result state
from an initial state
N∈, a self-reversing quantum circuit as a sequence of self-inverse quantum gates {UL(t)}t=16
, T>2T*/3, T−2T*/3=O(T*) is constructed accordingly, such that UL(2t)=IL the identity operator for all t∈[1, 3T], and UL(2t−1)=U(t)=(6T−2t+1) for all t∈[1, T*], whereas UL(2t−1)=IL for all t∈[T*+1, 3T−T*] by intentionally padding identity gates to duplicate the result state. Starting from an initial state
the self-reversing quantum circuit {UL(t)}t=16L: t∈[1, 6T]}⊆
({0, 1}N) defined and generated recursively through
for an t∈[1, 6T], with |ϕ2t)L=|ϕ2t−1)L for all t∈[1, 3T], and |ϕt
L=
for all t∈[2T*−1, 6T−2T*], due to the intentional state-copying by identity gates. The quantum circuit {UL(t)}t=16L=|ϕ2t
L=|ϕ6T−2t−1
L=|ϕ6T−2t
L, ∀t∈[1, 3T], in particular, Πt=16
L=|ϕ*
L, t∈[2T*−1, 6T−2T*] around the mid-point of the circuit at t=3T are reversely computed back to the initial state |ϕ6T−1
L=|ϕ6T
L=|ϕ0
L at the end. In other words, the graph of states and transitions associated with the self-reversing quantum circuit {UL(t)}t=16
/6T
. Any index and arithmetic operations on it are interpreted in the sense of modulo-6T arithmetic, with any resulted index t∈
out of the set [0, 6T−1] rolled back into an equivalence representative modulo 6T.
L, a double-line “
” between adjacent vertices represents a graph edge associated with a general quantum gate that may or may not reduce to an identity, while a single-line “
” between adjacent vertices represents a graph edge that is definitively associated with an identity gate.
To generate a lifted Markov chain of two rings at two elevations, a Feynman-Kitaev construct for a self-reversing quantum circuit as a sequence of self-inverse quantum gates {UL(t)}t=16 can be constructed using a GSQC system comprising a clock register of 6T clock rebits, a logic register of N logic rebits, and a single elevation rebit, whose Hilbert space is spanned by effective computational basis states of the form
|c0c1 . . . c6T−1C|l1 . . . lN
L|z
E,ct∈{0,1},∀t∈[0,T],li∈{0,1},∀i∈[1,N],z∈{0,1},
where z=0 is associated with the lower elevation, and z=1 the upper, each of the clock rebits may be implemented as a bi-fermion, while the elevation and the logic rebits may all be embodied as ordinary rebits of single bi-fermions or any conventional non-bi-fermion rebits. The Feynman-Kitaev construct shall employ the so-called pulse clock [289, 290] and be governed by a real (or Markovian) time t-dependent FBM frustration-free Hamiltonian
where ∀t∈, t %4 denotes the remainder of t modulo 4, [⋅]Iver is the Iverson bracket, ∀t∈[1, 6T], Wctrl, t and Wprop, t (k), k∈[0, 3] are called the t-th control switch and the t-th Feynman-Kitaev propagator depending on k∈[0, 3] respectively, which are defined as
where (t+1) %2 evaluates to 1 if and only if t is even, otherwise it is 0, the operator v(ZE−, t, k) reduces to a scalar-valued potential v(z, t, k)∈|, when projected onto each eigenstate |z
E, z∈{0, 1} of the operator
More details will be given below on the compositions and operations of the partial Hamiltonians Hclock, Hinit, Hprop, t(k), t∈[1, 6T], k∈[0, 3], and Hswap(k), k∈[0, 3], which are called the clock Hamiltonian, the initialization Hamiltonian, the Feynman-Kitaev propagators, and the elevation swap Hamiltonian respectively.
An effective computational basis state |c0c1 . . . c6T−1), of the clock register is called a pulse clock state when a t∈[0, 6T−1] exists such that ct=1 while ct′=0 for all t′∈[0, 6T−1]\{t}, in which case
is used as a shorthand notation for the basis state, which represents the Feynman-Kitaev time t∈[0, 6T−1]. That only the pulse clock states are allowed for the clock register at the lowest energy is guaranteed by the clock Hamiltonian Hclock of equation (167), which is easily computable for any given configuration point q∈ during a Monte Carlo simulation over a configuration space
as a homophysical image of the computational rebit configuration space
by simply parsing q to determine the computational 6T-rebit basis state
that is consistent with q∈, then reducing the
-diagonal clock Hamiltonian to Hclock˜[1−Σt=06
C
s(q)|C, when computing any Gibbs transition amplitude from q to another configuration point r∈
. The initialization Hamiltonian Hinit sets up the predetermined initial state
for the logic register at the Feynman-Kitaev time
Then the Feynman-Kitaev propagators {Hprop, t(k): t∈[1, T], k∈[0, 3]} will generate the series of quantum states {|ϕtL: t∈[1, 6T]}⊆
({0, 1}N) recursively through |ϕt
L=UL(t)|ϕt−1
L for all t∈[1, 6T]. At any fixed instant of the real (or Markovian) time t∈
, or at any finer-resolved time instant within each discrete t-step, the instantaneous Hamiltonian t∈
is frustration-free, whose ground state(s) can only live in the linear space that is spanned by the so-called Feynman-Kitaev history states {|t)C|ϕt
L|z
E: t∈[0, 6T−1], z∈{0,1}}.
Definition 44. (SFF Group of Hamiltonians, SFF Periodic Sequence of Hamiltonians)
A group of Hamiltonians {Hk: k∈[0, K−1]} supported by a configuration space is said to generate a periodic sequence of Hamiltonians {H(t): t∈
} with period K, when H(t)=Ht % K, ∀t∈
, where t % K denotes the remainder of t modulo K. Such periodic sequence of Hamiltonians in turn generates a time-inhomogeneous Markov chain [140, 264, 265] with a periodic sequence of Markov operators or transition matrices {M(t): t∈
}, where
exp{−(t)(H(t)}[ψ0(H(t))]−1 as specified in Auxiliary Utility
,
(t)>0 is a t-dependent constant, ∀t∈
.
Starting from an initial probability distribution ϕ02(q∈), such {M(t): t∈
}, being called a periodic Markov chain, generates a sequence of probability distributions {ϕt2(q∈
): t∈{0}∪
} such that
A period-averaged probability distribution ϕt2
is defined as
∀t∈{0}∪. When
exists in the topology of total variation distance and is independent of the initial ϕ02, then the generating periodic Markov chain {M(t): t∈} is said to converge, and
ϕ∞2) is called its stationary distribution, which generalizes the like-named notion for a time-homogeneous, irreducible, and reversible Markov chain.
A group of Hamiltonians {Hk: k∈[0, K−1]}, and the periodic sequence of Hamiltonians {H(t)=H(t−1)% K: t∈} generated by it, are both called separately frustration-free (SFF), when every Hamiltonian in the group {Hk: k∈[0, K−1]} satisfies all the conditions as specified in Definition 38 for it to be SFF, and further, the periodic sequence of Hamiltonians {H(t): t∈
} generates a periodic Markov chain that converges to a stationary distribution
ϕ∞2
.
When homophysically implemented into a system of bi-fermions, the homophysical image of the time-dependent Feynman-Kitaev Hamiltonian {HFK(t): t∈} as defined in equations (166) through (172), still denoted by the same {HFK(t): t∈
} for brevity of the mathematical formulas, is an SFF periodic sequence of Hamiltonians with period 4, which is generated by a group of Hamiltonians {HFK, k}k∈[0,3], with
Exactly to which of the two homophysical Feynman-Kitaev Hamiltonians is referred to by the same notation {HFK(t): t∈} should always be easily inferred from the context without ambiguity. Moreover, when applicable, the adjective “SFF” is often used before a reference of Feynman-Kitaev Hamiltonian {HFK(t): t∈
} to clearly indicate that it refers to the homophysical implementation in a system of bi-fermions which does possess the SFF property.
Let 0<v0=O(poly(T N)) and 0<0=O(poly(T N)) be two sufficiently large but still polynomially bounded constants. In order for the SFF Feynman-Kitaev Hamiltonian {HFK(t): t∈
} to generate a useful and efficient lifted Markov chain, the
-valued potential v(z, t, k) as a function of (z, t, k)∈{0, 1}×[0, 6T−1]×[0, 3] is so designed that a periodic Markov chain
works by repeating 4-phase cycles (or called 4-stroke cycles) on each 6-vertex segment of the ring-shaped graph as depicted in
Phase I: in the first phase/stroke with t % 4=k=1, the potential values are set as v(0, 6t+2, 1)=−v(0, 6t+4, 1)=v0, and v(0, 6t, 1)=0 for the lower elevation, while v(1, 6t+4, 1)=−v(1, 6t, 1)=v0, and v(1, 6t+2, 1)=0 for the higher elevation, ∀t∈[0, T−1], and the Markov transition matrix M(t) is applied with (t)=
0, such that an equilibrium distribution is reached, which has the probability amplitude concentrated in the Feynman-Kitaev history states {|(6t+i)
C|(ϕ6t+i
L|0
E: i∈{2, 3}} at the lower elevation, and the Feynman-Kitaev history states {|(6t+j)
C|(ϕ6t+j
L|1
E: j∈{4, 5}} at the higher elevation, moreover, the probability amplitude is substantially equally distributed between each pair of adjacent and connected Feynman-Kitaev history states at both the lower and the higher elevations;
Phase 2: in the second phase/stroke with t %4=k=2, the potential values are set as v(0, 6t+4, 2)=−v(0, 6t, 2)=v0, and v(0, 6t 2, 2)=0 for the lower elevation, while v(1, 6t 2, 2)=−v(1, 6t 4, 2)=v0, and v(1, 6t, 2)=0 for the higher elevation, ∀t∈[0, T−1], and the Markov transition matrix M(t) is applied with (t)=
0, such that an equilibrium distribution is reached, which has the probability amplitude concentrated in the Feynman-Kitaev history states {|(6t+i)
C|(ϕ6t+i
L|0
E: i∈{4, 5}} at the lower elevation, and the Feynman-Kitaev history states {|(6t+j)
C|(ϕ6t+j
L|1
E: j∈{2, 3}} at the higher elevation, moreover, the probability amplitude is substantially equally distributed between each pair of adjacent and connected Feynman-Kitaev history states at both the lower and the higher elevations;
Phase 3: in the third phase/stroke with t %4=k=3, the potential values are set as v(z, 6t, 3)=−v(z, 6t+2, 3)=v0, while v(z, 6t+4, 3)=0, ∀t∈[0, T−1], ∀z∈{0, 1}, and the Markov transition matrix M(t) is applied with (t)=
0, to effect transfer of probability amplitude from the Feynman-Kitaev history states {|(6t+i)
C|(ϕ6t+i
L|z
E: i∈[2, 5]} to the adjacent Feynman-Kitaev history states {|(6t+j)
C|(ϕ6t+j
L|z
E: j∈{0, 1}}, such that when an equilibrium distribution is reached, each of the Feynman-Kitaev history states {|(6t+i)
C|(ϕ6t+i
L|z
E: i∈[2, 5]} retains merely an exponentially small thus negligible probability amplitude, while each of the Feynman-Kitaev history states {|(6t+j)
C|(ϕ6t+j
L|z
E: j∈{0, 1}} gets substantially all of the available probability amplitude, ∀t∈[0, T−1], ∀z∈{0,1};
Phase: in the fourth phase/stroke with t % 4=k=0, all of the Feynman-Kitaev propagators are switched off, while the elevation swap Hamiltonian Hswap(k=0) is turned on, so that the Markov operator [ϕ0(Hswap (k))]exp{−(t)Hswap (k)}[ψ0(Hswap(k))]−1 is applied with
(t)=O(ϵ(T))=O(T−1), to effect a cross-elevation transition with probability ϵ(T)=O(T−1).
It is now easily seen that the repeated 4-phase or 4-stroke cycles of the periodic Markov chain {M(t): t∈} amount to a shift register-type of operation for the probability amplitude in each of the ring-shaped graphs at an elevation z∈{0, 1}, in conjunction with a brief and weak coupling periodically between the two elevations. At any instant of real time t∈
, essentially all of the probability amplitude is localized at a single pair of consecutive vertices in each 6-vertex segment as depicted in
It can be easily verified that the periodic Markov chain {M(t): t∈} of equation (173) implements precisely the two-rings-at-two-elevations lifted Markov chain as desired, where, at the lower elevation, a walker moves predominantly clockwise from a vertex to its neighbor with probability 1−ϵ(T), while at the upper elevation, a walker moves predominantly counterclockwise from a vertex to its neighbor with probability 1−ϵ(T), and the two rings are weakly coupled with a transition probability ϵ(T)=O(T−1). For this reason, {M(t): t∈
} of equation (173) is called a lifted periodic Markov chain. It follows straightforwardly from well established results [304-306] that the lifted periodic Markov chain {M(t): t∈
} of equation (173) converges in O(T) time to its stationary distribution, which is an equilibrium distribution with all Feynman-Kitaev history states equally populated. On the other hand, it is without difficulty to repeat the familiar analysis to demonstrate that the periodic Feynman-Kitaev Hamiltonian {HFK(t): t∈
} as defined in equations (166) through (172) can be made PLTKD, with each additive term in the Lie-Trotter-Kato decomposition being homophysically implementable as a node-determinate and efficiently computable tensor monomials on bi-fermions. In summary, the periodic Feynman-Kitaev Hamiltonian {HFK(t): t∈
} and the lifted periodic Markov chain {M(t): t∈
} generated by it facilitate highly efficient methods of MCQC.
By way of example and not limitation, the following algorithm describes an overall process for solving a general classical or quantum computational problem via Monte Carlo quantum computing, which firstly designs a quantum algorithm that solves the given computational problem, then syn-thesizes a quantum circuit and creates a Feynman-Kitaev construct whose bi-fermion implementation has either a time-independent SFF Hamiltonian or an SFF periodic sequence of Hamiltonians, to generate either a time-homogenous or a lifted periodic Markov chain, finally runs a Monte Carlo simulation using said Markov chain and derives from its stationary distribution a solution to the computational problem.
Algorithm 10. (Monte Carlo Quantum Computing)
As being discussed and specified previously, both multi-rebit-controlled single-rebit-transforming and multi-rebit-controlled multi-rebit-transforming quantum gates can be used to construct the quantum circuit in step 10.2, although a typical multi-rebit-transforming quantum gate might entail a different and likely higher computational cost than a single-rebit-transforming quantum gate for MCQC implementations and simulations. The quantum circuit should then be optimized so that the sum of the computational costs of all multi-rebit-transforming or single-rebit-transforming quantum gates is minimized.
It is only trivial a point to note that quantum mechanics and computing are not limited to isolated quantum systems described by a pure quantum state. Rather, they can be, as have already been, easily generalized to open quantum systems in mixed quantum states described by density matrices (also known as density operators) [4, 310-312]. A quantum computation executes a BQP algorithm as a sequence of primitive gate operations to a general quantum system associated with a description size N∈, where the description size characterizes the minimally required number of classical bits to encode the setup and parameters of the quantum system. In one preferred embodiment, N is the size of a configuration space associated with the quantum system. In another preferred embodiment, N counts the number of particles or modes of field constituting the quantum system. Each primitive gate operation is a so-called superoperator effecting a transformation on density matrices representing states of the physical system [310, 311], with said transformation involving multiplications and taking traces of tensor-factor matrices, and being implemented either numerically or physically up to an O(1/poly(N))-bounded relative error. Said tensor-factor matrices are associated with low-dimensional subsystems and have dimensions upper-bounded by a constant or O(log N). The quantum system is initially in either a simple classical state with each constituent particle or mode of field individually set to a predetermined basis state, or a simple quantum state that can be generated from a simple classical state followed by an application of an O(poly(N)) number of primitive gates. The density matrices representing the possible initial states and result states of a quantum system after an application of an O(poly(N)) number of primitive gates constitute a space of the so-called BQP computable states, which captures all of the relevant states to quantum computing. Quantum states out of the space of BQP computable (states) are too much entangled to be accessible and useful [313-315].
Definition 45. (BQP Computable Density Matrices)
A density matrix representing a state of a quantum system with a description size N∈ is BQP computable when the represented state of the quantum system can be generated by applying an O(poly(N)) number of primitive gated implemented numerically or physically up to an O(1/poly(N))-bounded relative error, said quantum system being initially in a simple classical state having each constituent particle or mode of field individually set to a predetermined basis state.
Quantum computing is just to generate or manipulate BQP computable density matrices, while classical computing, as a special case of quantum computing, is just to generate and manipulate a particular kind of BQP computable states that are necessarily diagonal and non-negative. The proposition of Major Utility 3, BPP=BQP, simply says that the computational power of generating and manipulating only diagonal and non-negative density matrices is polynomially equivalent to that of generating and manipulating all BQP computable density matrices.
Definition 46. (Classical and Quantum Computational Procedures)
A quantum computational procedure is a computational process that employs a BQP algorithm to generate and manipulate a plurality of BQP computable density matrices, said density matrices representing pure or mixed quantum states of a physical system. A classical computational procedure is a special case of a quantum computational procedure with respect to the same physical system, which employs a BPP algorithm to generate and manipulate a plurality of BQP computable density matrices that are necessarily diagonal and non-negative, said density matrices representing classical probabilistic states of said physical system.
Definition 47. (Minimally Entangled Density Matrices, Substantially Entangled Density Matrices, Practically Substantially Entangled Density Matrices)
Consider a quantum system supported by a configuration space , and a density matrix M representing a state of the quantum system. The density matrix M is said to be minimally entangled, or substantially entangle, or practically substantially entangled, when its configuration coordinate representation {
r|M|q
: (r, q)∈
2} as a density on the measurable space (
2, Bor(
2)) is minimally entangled, or substantially entangle, or practically substantially entangled, respectively, where Bor(
2) denotes the smallest σ-algebra consisting all of the Borel subsets of
2.
The methods and theory presented in this specification have shown that MCQC is able to implement quantum computational procedures on classical computers, so to generate and simulate many of the densities as well as density matrices which have hitherto been found or thought to be practically substantially entangled.
Lastly, taking cue from the hierarchically leveled specialization, organization, and cooperation of programming languages and software in the tremendously successful industry of classical computing, where there are low-level programming languages such as machine codes and assembly languages, that are strongly coupled to specific hardware architectures and machine instruction sets, which can run very efficiently but are difficult to use and hardly portable, and by contrast, high-level programming languages that provide strong abstraction from the hardware and architecture details of the machines being used, which are much more friendly to programmers and easily portable, as well as importantly, compilers that translate source codes from high-level programming languages to low-level programming languages and often perform code optimizations during the course, it is easily envisioned that a large scale adoption of Monte Carlo quantum computing will benefit from an MCQC compiler that lies in between and bridges two specialized domains of quantum computing and applications, where in the higher-level domain, users and applications of quantum computing can be oblivious of the underlying MCQC mechanism, particularly the MCMC details, but focus on constructing/programming general and abstract quantum algorithms/circuits that can be run on a bona fide, yet-to-be-built, materially quantum piece of hardware, as well as on a Monte Carlo-simulated quantum process using an ordinary classical computer in conjunction with the presently specified method of Monte Carlo quantum computing, while in the lower-level domain, an MCQC compiler and programming and software utilities take care automatically, and free users of higher-level quantum programming and applications from considerations of:
According to a preferred embodiment,
At step 710, data or signals describing a multivariable system are received, which comprise data or signals that represent a configuration space 711, an SFF partial Hamiltonian 712, and an objective operator 713. In one preferred embodiment, said data or signals are computer-readable numerical data stored in memory devices, which are read, written, processed, and transferred by computing devices. In another preferred embodiment, data or signals are physical signals including but not limited to electric charges, electric currents, voltages, magnetic fields, magnetic moments, electromagnetic fields, electromagnetic waves, masses of matter, number of particles, strengths of forces, physical locations, velocities of motion, mechanical energies, mechanical waves, chemical and material properties.
Configuration space 711 in turn comprises a plurality of configuration points, which are associated with a plurality of canonical variables. In one preferred embodiment, configuration space 711 is the collection of spatial coordinates of particles constituting a many-body quantum system with particle position operators being the canonical variables. In another preferred embodiment, the configuration points are strings of 0 or 1 valued binary variables representing configurations of spins, or qubits that comprise a quantum computing device. In yet another preferred embodiment, the configuration points are lattice points of a lattice model, namely, a physical model that is defined on a lattice, such as a quantum Ising model, a Heisenberg model of magnetism, a Hubbard model, a t-J model, or a lattice field theory. In yet another preferred embodiment, the configuration points are vertices of a graph on which a classical random walk or a quantum walk takes place. In still another preferred embodiment, every configuration point in configuration space 711 is a pair of two coordinates, with each of the two coordinates representing values of canonical variables associated with particles constituting a many-body quantum system. In yet another preferred embodiment, every configuration point in configuration space 711 is a Feynman path, which is a tuple of a plurality of coordinates, with each of the plurality of coordinates representing values of canonical variables associated with particles constituting a many-body quantum system. Each preferred embodiment is given by way of example but no means of limitation. Configuration space 711 is endowed with a topology and comprising 2D, where each pair of different such connected components are topologically disconnected, each individual connected component is topologically connected and has a dimension smaller than or equal to D2, D2∈
, D2≥0, while at least one of the connected components contains at least one configuration point requiring a minimum number D2 of numerical values to label and distinguish from other configuration points.
SFF partial Hamiltonian 712 is a function or operator involving the canonical variables, which is a combination of a plurality of DFF or GFF partial Hamiltonians, each of which involves a substantially small number of the canonical variables. In one preferred embodiment, SFF partial Hamiltonian 712 is a linear sum of DFF or GFF partial Hamiltonians. In another preferred embodiment, SFF partial Hamiltonian 712 is Lie-Trotter-Kato decomposable into a sum of DFF or GFF partial Hamiltonians. The SFF partial Hamiltonian is associated with an SFF ground state, which is a practically substantially entangled density on said configuration space. In one preferred embodiment, the SFF ground state is a element in a Hilbert space of functions supported by the configuration space. In another preferred embodiment, the SFF ground state is a density matrix supported by the configuration space comprising configuration points as a pair of two coordinates, with each of the two coordinates representing values of canonical variables associated with particles constituting a many-body quantum system. Each of the DFF or GFF partial Hamiltonians in turn is associated with one or more efficiently computable partial ground states, one of which as a density on the configuration space is substantially the same as the SFF ground state. That a partial ground state denoted by ψ′0 is substantially the same as the SFF ground state denoted by ψ0 means, for substantially any pair of configuration points q and r on which both ψ′0 and ψ0 are defined, that the fraction ψ′0(r)/ψ′0(q) is substantially the same as the fraction ψ0(r)/ψ0(q) in one preferred embodiment, or that the plus or minus numerical sign of the fraction ψ′0(r)/ψ′0(q) is substantially the same as the plus or minus numerical sign of the fraction ψ0(r)/ψ0(q) in another preferred embodiment.
Importantly, the partial ground states associated with the same GFF partial Hamiltonian are substantially mutually non-overlapping in the configuration space, in the sense of a substantially negligible upper bound for the set of node-uncertain configuration points, that are the configuration points on which two different partial ground states of the same partial Hamiltonian exist, whose numerical values make a product that is substantially non-negligible comparing to the sum of the norms of the same two numerical values. Also importantly, each DFF partial Hamiltonian is a direct sum of FBM partial Hamiltonians, where each FBM partial Hamiltonian moves a configuration subspace that is annihilated by any other different FBM partial Hamiltonian, in other words or alternatively, any pair of different FBM partial Hamiltonians in the same direct sum of a DFF partial Hamiltonian multiply into a null operator. In the construction of a Feynman-Kitaev Hamiltonian, it is crucial to split the Feynman-Kitaev propagators into two groups, with each group forming a direct sum and constituting a DFF partial Hamiltonian.
In one preferred embodiment, SFF partial Hamiltonian 712 is the total energy operator of a many-body quantum system. In another preferred embodiment, SFF partial Hamiltonian 712 is a finite—although high-dimensional matrix in association with a graph, a grid or mesh, a lattice of points, or a connected network. In yet another preferred embodiment, SFF partial Hamiltonian 712 is a lower-bounded self-adjoint operator acting on square-integrable functions supported by configuration space 711. In yet another preferred embodiment, SFF partial Hamiltonian 712 is an operator generating a strongly continuous one-parameter semigroup of operators acting on square-integrable functions supported by configuration space 711. Each preferred embodiment is given by way of example but no means of limitation.
Objective operator 713 is an operator depending on the canonical variables, which enters a linear, quadratic, or another form of nonlinear functional involving the SFF ground state to produce an expectation value. In one preferred embodiment, objective operator 713 is the same as SFF partial Hamiltonian 712 that measures the total energy of a physical system. In another preferred embodiment, objective operator 713 is a configuration coordinate projection operator, or called a position measurement operator, that produces an expectation value indicating the probability of finding the multivariable system at a configuration point within a predetermined subset of configuration space 711. Each preferred embodiment is given by way of example but no means of limitation.
Process step 720 further comprises step 721 and step 722 to generate a plurality of result sampling points, which can be used to compute a numerical estimate of the expectation value produced by objective operator 713. In step 721, a current sampling point (CSP) variable holding a CSP value is initialized to a configuration point, on which the SFF ground state is valued substantially different from 0. Step 722 repeats an iterative procedure for a predetermined number N of times, where the iterative procedure further comprises sub-step 723 and sub-step 724. Sub-step 723 executes a transition from the CSP value held by the CSP variable to a new sampling point in the configuration space, according to a transition probability matrix that depends on one or more of the partial Hamiltonian(s) and uses the partial ground states associated with the one or more partial Hamiltonian(s), where the transition probability matrix has a probability density function as a stationary distribution, which is related to the partial ground states of the one or more DFF or GFF partial Hamiltonian(s). In one preferred embodiment, said probability density function is point-wise the square of the absolute value of a partial ground state. Sub-step 724 transfers the CSP value held by the CSP variable to one of one or more result container(s) as a result sampling point, then resets the CSP value held by the CSP variable to the new sampling point. In another preferred embodiment, the very first result sampling point is labeled by +1, and for the subsequent result sampling points denoted by q and r that are obtained before and after a sub-step 723 respectively, the label for r will be that of q multiplied by +1 or −1 depending upon whether the same or oppositely signed numerical values are assumed by the partial ground state used by the transition probability matrix for the execution of sub-step 723, then accordingly, each of the result sampling points will be transferred to a positive result container or a negative result container depending upon whether the result sampling point is labeled by +1 or −1 respectively. In another preferred embodiment, the negative result container in the previous preferred embodiment defaults to void, and all of the would-be (−1)-labeled result sampling points are discarded. In yet another preferred embodiments, more than two result containers are used. Each preferred embodiment is given by way of example but no means of limitation.
The result sampling points obtained in process step 720 enable a numerical estimate for the expectation value associated with objective operator 713, with the numerical estimate having a relative error smaller than or equal to a formula a(D1+D2)bN−c when both D1+D2 and N becoming substantially large, with a, b, and c being constants, both a and c further being positive.
According to a preferred embodiment,
At step 810, data or signals describing a multivariable system are received, which comprise data or signals that represent a multi-dimensional space (MDS) of the first kind 811, a plurality of groups of low-dimensional densities (LDDs) 812, and an objective operator 813. In one preferred embodiment, said data or signals are computer-readable numerical data stored in memory devices, which are read, written, processed, and transferred by computing devices. In another preferred embodiment, data or signals are physical signals including but not limited to electric charges, electric currents, voltages, magnetic fields, magnetic moments, electromagnetic fields, electromagnetic waves, masses of matter, number of particles, strengths of forces, physical locations, velocities of motion, mechanical energies, mechanical waves, chemical and material properties.
MDS of the first kind 811 in turn comprises a plurality of multi-dimensional points (MDPs) of a first kind. In one preferred embodiment, MDS of the first kind 811 is the collection of spatial coordinates of particles constituting a many-body quantum system with particle position operators being the canonical variables. In another preferred embodiment, the MDPs of the first kind are strings of 0 or 1 valued binary variables representing configurations of spins, or qubits that comprise a quantum computing device. In yet another preferred embodiment, the MDPs of the first kind are lattice points of a lattice model, namely, a physical model that is defined on a lattice, such as a quantum Ising model, a Heisenberg model of magnetism, a Hubbard model, a t-J model, or a lattice field theory. In yet another preferred embodiment, the MDPs of the first kind are vertices of a graph on which a classical random walk or a quantum walk takes place. In still another preferred embodiment, every MDP in MDS of the first kind 811 is a pair of two coordinates, with each of the two coordinates representing values of canonical variables associated with particles constituting a many-body quantum system. In yet another preferred embodiment, every MDP in MDS of the first kind 811 is a Feynman path, which is a tuple of a plurality of coordinates, with each of the plurality of coordinates representing values of canonical variables associated with particles constituting a many-body quantum system. Each preferred embodiment is given by way of example but no means of limitation. MDS of the first kind 811 is endowed with a topology and comprising 2D, where each pair of different such connected components are topologically disconnected, each individual connected component is topologically connected and has a dimension smaller than or equal to D2, D2∈
, D2≥0, while at least one of the connected components contains at least one MDP requiring a minimum number D2 of numerical values to label and distinguish from other MDPs. MDS of the first kind 811 is associated with a plurality of means of factoring itself into a low-dimensional space (LDS) comprising a plurality of low-dimensional points (LDPs) and an MDS of a second kind comprising a plurality of MDPs of a second kind, such that each of the MDPs of the first kind in MDS of the first kind 811 is substantially the same as one and only ordered pair of an LDP and an MDP of the second kind, whereas said LDS is endowed with an LDS topology and comprises 2d
, where each pair of different such connected LDS components are topologically disconnected, each of such connected LDS components is topologically connected and has a dimension smaller than or equal to d2, d2∈
, d2≥0, at least one of the connected LDS components comprises one or more of said LDPs that require(s) a minimum number d2 of numerical values to label and distinguish, wherein 2d
All LDDs in the plurality of groups of LDDs 812 are efficiently computable. Each said group of LDDs is associated with an LDS and an MDS of the second kind according to one of said plurality of means of factoring MDS of the first kind 811, with each said LDD included in such group of LDDs being associated with a portion of the MDS of the second kind associated with the group of LDDs, thus each of the LDDs is associated with a portion of MDS of the first kind 811, such that each MDP included in said portion of MDS of the first kind 811 is substantially the same as one and only one ordered pair of an LDP included in the LDS and an MDP of the second kind associated with the group of LDDs, whereas each pair of said portions of MDS of the first kind 811 associated respectively with two different LDDs included in the same group of LDDs are substantially non-overlapping with respect to a volumetric measure, such that the volumetric measure of the intersection set of said pair of said portions of said MDS of the first kind is substantially negligible comparing to the volumetric measure of each of the pair of said portions of said MDS of the first kind, wherein the plurality of groups of LDDs 812 collectively determine a practically substantially entangled high-dimensional density (HDD) that is uniquely defined over MDS of the first kind 811, such that the HDD is substantially the same as each LDD in the corresponding portion of MDS of the first kind 811 associated with that LDD. In one preferred embodiments, a group of LDDs consists of the eigenvectors of a Gibbs operator associated with the largest eigenvalue, while in another preferred embodiment, a group of LDDs comprises general signed densities. Furthermore, in one preferred embodiment, said volumetric measure is a Lebesgue volume measure with respect to a Riemannian metric on a Riemannian manifold, while in another preferred embodiment, said volumetric measure is a probability measure with a probability density function defined in relation to the HDD and the LDDs. Still further, in one preferred embodiment, the HDD is a ground state wavefunction of a physical system, while in another preferred embodiment, the HDD is a general signed density over a multi-dimensional space. In yet another preferred embodiment, the HDD is a element in a Hilbert space of functions supported by the MDS. In still another preferred embodiment, the HDD is a density matrix supported by the MDS comprising MDPs as a pair of two coordinates, with each of the two coordinates representing values of canonical variables associated with particles constituting a many-body quantum system. Each preferred embodiment is given by way of example but no means of limitation.
Objective operator 813 is an operator that enters a linear, quadratic, or another form of nonlinear functional involving the HDD to produce an expectation value. In one preferred embodiment, objective operator 813 is a partial Hamiltonian that measures the total energy of a physical system. In another preferred embodiment, objective operator 813 is an MDP projection operator, or called a position measurement operator, that produces an expectation value indicating the probability of finding the multivariable system at an MDP within a predetermined subset of MDS of the first kind 811. Each preferred embodiment is given by way of example but no means of limitation.
Process step 820 further comprises step 821 and step 822 to generate a plurality of result sampling points, which can be used to compute a numerical estimate of the expectation value produced by objective operator 813. In step 821, a current sampling point (CSP) variable holding a CSP value is initialized to an MDP, on which the HDD is valued substantially different from 0. Step 822 repeats an iterative procedure for a predetermined number N of times, where the iterative procedure further comprises sub-step 823 and sub-step 824. Sub-step 823 executes a transition from the CSP value held by the CSP variable to a new sampling point in MDS of the first kind 811, according to a transition probability matrix that depends on one or more groups of the plurality of groups of LDDs 812 and uses the LDDs in the one or more groups of LDDs, where the transition probability matrix has a probability density function as a stationary distribution, which is related to one or more groups of the plurality of groups of LDDs 812. In one preferred embodiment, said probability density function is point-wise the square or sum of squares of the absolute value(s) of one or more LDD(s). Sub-step 824 transfers the CSP value held by the CSP variable to one of one or more result container(s) as a result sampling point, then resets the CSP value held by the CSP variable to the new sampling point. In one preferred embodiment, the very first result sampling point is labeled by +1, and for the subsequent result sampling points denoted by q and r that are obtained before and after a sub-step 823 respectively, the label for r will be that of q multiplied by +1 or −1 depending upon whether the same or oppositely signed numerical values are assumed by an LDD used by the transition probability matrix for the execution of sub-step 823, then accordingly, each of the result sampling points will be transferred to a positive result container or a negative result container depending upon whether the result sampling point is labeled by +1 or −1 respectively. In another preferred embodiment, the negative result container in the previous preferred embodiment defaults to void, and all of the would-be (−1)-labeled result sampling points are discarded. In yet another preferred embodiments, more than two result containers are used. Each preferred embodiment is given by way of example but no means of limitation.
The result sampling points obtained in process step 820 enable a numerical estimate for the expectation value associated with objective operator 813, with the numerical estimate having a relative error smaller than or equal to a formula a(D1+D2)bN−c when both D1+D2 and N becoming substantially large, with a, b, and c being constants, both a and c further being positive.
According to a preferred embodiment,
At step 910, a description of a quantum algorithm is received, which comprises: a list 911 of a plurality qubits, an initial state 912, a linearly ordered sequence 913 of a plurality of quantum gates, and a quantum measurement operator 914. Each qubit in list 911 is associated with a single-qubit configuration space, a single-qubit Hilbert space that comprising state vectors called single-qubit states, as well as single-qubit dynamical variables as operators acting on the single-qubit states. A Cartesian product of multiple single-qubit configuration spaces becomes a multi-qubit configuration space. In one preferred embodiment, the multi-qubit configuration space is continuous. In another preferred embodiment, the multi-qubit configuration space is a topological manifold. In yet another preferred embodiment, the multi-qubit configuration space is a discrete graph or a discrete space of lattice points. In yet another preferred embodiment, the multi-qubit configuration space is a continuous-discrete mixed space, namely, a space with both continuous and discrete coordinates or degrees of freedom. In still another preferred embodiment, every configuration point in the configuration space is a pair of two coordinates, with each of the two coordinates representing values of canonical variables associated with particles constituting a many-body quantum system. In yet another preferred embodiment, every configuration point in the configuration space is a Feynman path, which is a tuple of a plurality of coordinates, with each of the plurality of coordinates representing values of canonical variables associated with particles constituting a many-body quantum system. Each preferred embodiment is given by way of example but no means of limitation. A tensor product of multiple single-qubit Hilbert spaces becomes a multi-qubit Hilbert space supported by the multi-qubit configuration space. Each element in the multi-qubit Hilbert space is called a multi-qubit state, or state vector. Initial state 912 is a multi-qubit state. In one preferred embodiment, initial state 912 is a element in a Hilbert space of functions supported by the configuration space. In another preferred embodiment, initial state 912 is a density matrix supported by the configuration space comprising configuration points as a pair of two coordinates, with each of the two coordinates representing values of canonical variables associated with particles constituting a many-body quantum system. The multi-qubit Hilbert space has an inner product defined, which takes any pair of multi-qubit states as input and produces a numerical value as output, with the output numerical value also referred to as the inner product between the two input multi-qubit states. Two multi-qubit states are said to be orthogonal, or one multi-qubit state is said to be orthogonal to the other, when the inner product between them is substantially 0. Each quantum gate in linearly ordered sequence 913 of a plurality of quantum gates involves a number of the single-qubit dynamical variables. Each such quantum gate acts on and transforms the multi-qubit states. The plurality of quantum gates in 913 are linearly ordered in the sense that, between any pair of such quantum gates, there is always one called earlier, and the other called later. In one preferred embodiment, some of the quantum gates are unitary matrices or operators. In another preferred embodiment, some of the quantum gates are non-unitary matrices or operators. In yet another preferred embodiment, some of the quantum gates are superoperators that map density matrices to density matrices [310-312]. Quantum measurement operator 914 also involves a number of single-qubit dynamical variables, as well as acts on and transforms the multi-qubit states. In one preferred embodiment, quantum measurement operator 914 is a partial Hamiltonian that measures energy. In another preferred embodiment, quantum measurement operator 914 is a position operator that measures the coordinates of one or more qubit(s) in their respective single-qubit configuration space(s). Each preferred embodiment is given by way of example but no means of limitation. In the described quantum algorithm, a multi-qubit system comprising multiple qubits is initially at initial state 912. Then the quantum gates in linearly ordered sequence 913 are applied to the multi-qubit system to produce a result multi-qubit state that is a practically substantially entangled density supported by the multi-qubit configuration space. Finally, a quantum measurement is performed using quantum measurement operator 914 against the result multi-qubit state, so to yield an interested expectation value of quantum measurement operator 914.
Step 920 constructs a Feynman-Kitaev (FK) register, which further comprises a sub-step 921 of allocating a plurality of FK logic bits associated with a plurality of logic states and a plurality of FK clock bits associated with a clock configuration space consisting of clock configuration points, a sub-step 922 of creating a plurality of FK time projectors with each said FK time projector fixing said clock configuration points, and a sub-step 923 of creating a plurality of FK time propagators, where said plurality of FK time propagators are divided into a first group of first FK time propagators and a second group of second FK time propagators. Each said first FK time propagator is able to connect different clock configuration points in a first subset of said clock configuration space. Each said second FK time propagator is able to connect different clock configuration points in a second subset of said clock configuration space. A first graph is formed having said configuration points in the first subset of said clock configuration space as vertices and said first FK time propagators as edges. A second graph is formed having said configuration points in the second subset of said clock configuration space as vertices and said second FK time propagators as edges. It is crucial to make sure that the first graph and the second graph are disjoint.
Step 930 constructs an SFF partial Hamiltonian that is a combination of a plurality of directly frustration-free (DFF) or ground state frustration-free (GFF) partial Hamiltonians, which further comprises a sub-step 931 of creating an FK state initializer as a tensor product between an FK time projector and a logic state initializer, a sub-step 932 of creating a first FK state operator as a combination of a plurality of first FK state propagators, and a sub-step 933 of creating a second FK state operator as a combination of a plurality of second FK state propagators. In sub-step 931, the FK state initializer is a DFF or GFF partial Hamiltonian, and the logic state initializer produces an initial logic state, while the initial logic state is related to said initial state. In sub-step 932, the first FK state operator is a DFF or GFF partial Hamiltonian, while each said first FK state propagator is a tensor product between a first FK time propagator and one of said quantum gates transforming said logic states. In sub-step 933, the second FK state operator is a DFF or GFF partial Hamiltonian, while each said second FK state propagator is a tensor product between a second FK time propagator and one of said quantum gates transforming said logic states. Importantly, both the first FK state operator and the second FK state operator are a direct sum of FBM partial Hamiltonians, where each FBM partial Hamiltonian moves a configuration subspace that is annihilated by any other different FBM partial Hamiltonian, in other words or alternatively, any pair of different FBM partial Hamiltonians in the same direct sum of a first FK state operator or a second FK state operator multiply into a null operator. An SFF partial Hamiltonian so constructed in step 930 has a non-degenerate ground state, which in association with said quantum measurement operator 914 produces a numerical estimate for the expectation value of quantum measurement operator 914.
According to a preferred embodiment,
Step 1010 generates data or signals describing a multivariable system, which further comprises a sub-step 1011 of allocating a configuration space, a sub-step 1012 of creating an SFF partial Hamiltonian, and a sub-step 1013 of generating an objective operator. In one preferred embodiment, said data or signals are computer-readable numerical data stored in memory devices, which are read, written, processed, and transferred by computing devices. In another preferred embodiment, data or signals are physical signals including but not limited to electric charges, electric currents, voltages, magnetic fields, magnetic moments, electromagnetic fields, electromagnetic waves, masses of matter, number of particles, strengths of forces, physical locations, velocities of motion, mechanical energies, mechanical waves, chemical and material properties.
The configuration space allocated in sub-step 1011 comprises a plurality of configuration points, which are associated with a plurality of canonical variables. In one preferred embodiment, the configuration space is the collection of spatial coordinates of particles constituting a many-body quantum system with particle position operators being the canonical variables. In another preferred embodiment, the configuration points are strings of 0 or 1 valued binary variables representing configurations of spins, or qubits that comprise a quantum computing device. In yet another preferred embodiment, the configuration points are lattice points of a lattice model, namely, a physical model that is defined on a lattice, such as a quantum Ising model, a Heisenberg model of magnetism, a Hubbard model, a t-J model, or a lattice field theory. In yet another preferred embodiment, the configuration points are vertices of a graph on which a classical random walk or a quantum walk takes place. In still another preferred embodiment, every configuration point in the configuration space is a pair of two coordinates, with each of the two coordinates representing values of canonical variables associated with particles constituting a many-body quantum system. In yet another preferred embodiment, every configuration point in the configuration space is a Feynman path, which is a tuple of a plurality of coordinates, with each of the plurality of coordinates representing values of canonical variables associated with particles constituting a many-body quantum system. Each preferred embodiment is given by way of example but no means of limitation. The configuration space is endowed with a topology and comprising 2D, where each pair of different such connected components are topologically disconnected, each individual connected component is topologically connected and has a dimension smaller than or equal to D2, D2∈
, D2≥0, while at least one of the connected components contains at least one configuration point requiring a minimum number D2 of numerical values to label and distinguish from other configuration points.
The SFF partial Hamiltonian created in sub-step 1012 is a function or operator involving the canonical variables, which is a combination of a plurality of partial Hamiltonians, each of which involves a substantially small number of the canonical variables. In one preferred embodiment, the SFF partial Hamiltonian is a linear sum of partial Hamiltonians. In another preferred embodiment, the SFF partial Hamiltonian is Lie-Trotter-Kato decomposable into a sum of partial Hamiltonians. The SFF partial Hamiltonian is associated with an SFF ground state, which is a practically substantially entangled density on said configuration space. In one preferred embodiment, the SFF ground state is a element in a Hilbert space of functions supported by the configuration space. In another preferred embodiment, the SFF ground state is a density matrix supported by the configuration space comprising configuration points as a pair of two coordinates, with each of the two coordinates representing values of canonical variables associated with particles constituting a many-body quantum system. Each of the partial Hamiltonians in turn is associated with one or more efficiently computable partial ground states, one of which as a density on the configuration space is substantially the same as the SFF ground state. That a partial ground state denoted by ψ′0 is substantially the same as the SFF ground state denoted by ψ0 means, for substantially any pair of configuration points q and r on which both ψ′0 and ψ0 are defined, that the fraction ψ′0(r)/ψ′0(q) is substantially the same as the fraction ψ0(r)/ψ0(q) in one preferred embodiment, or that the plus or minus numerical sign of the fraction ψ′0(r)/ψ′0(q) is substantially the same as the plus or minus numerical sign of the fraction ψ0(r)/ψ0(q) in another preferred embodiment. Importantly, the partial ground states associated with the same partial Hamiltonian are substantially mutually non-overlapping in the configuration space, in the sense of a substantially negligible upper bound for the set of node-uncertain configuration points, that are the configuration points on which two different partial ground states of the same partial Hamiltonian exist, whose numerical values make a product that is substantially non-negligible comparing to the sum of the norms of the same two numerical values. In one preferred embodiment, the SFF partial Hamiltonian is the total energy operator of a many-body quantum system. In another preferred embodiment, the SFF partial Hamiltonian is a finite-although high-dimensional matrix in association with a graph, a grid or mesh, a lattice of points, or a connected network. In yet another preferred embodiment, the SFF partial Hamiltonian is a lower-bounded self-adjoint operator acting on square-integrable functions supported by the configuration space allocated in sub-step 1011. In yet another preferred embodiment, the SFF partial Hamiltonian is an operator generating a strongly continuous one-parameter semigroup of operators acting on square-integrable functions supported by the configuration space allocated in sub-step 1011. Each preferred embodiment is given by way of example but no means of limitation.
The objective operator generated in sub-step 1013 is an operator depending on the canonical variables, which enters a linear, quadratic, or another form of nonlinear functional involving the SFF ground state to produce an expectation value. In one preferred embodiment, the objective operator is the same as the SFF partial Hamiltonian created in sub-step 1012 that measures the total energy of a physical system. In another preferred embodiment, the objective operator is a configuration coordinate projection operator, or called a position measurement operator, that produces an expectation value indicating the probability of finding the multivariable system at a configuration point within a predetermined subset of the configuration space allocated in sub-step 1011. Each preferred embodiment is given by way of example but no means of limitation.
According to a preferred embodiment,
At step 1110, data or signals describing a multivariable system are generated, which comprise data or signals that represent a multi-dimensional space (MDS) of the first kind 1111, a plurality of groups of low-dimensional densities (LDDs) 1112, and an objective operator 1113. In one preferred embodiment, said data or signals are computer-readable numerical data stored in memory devices, which are read, written, processed, and transferred by computing devices. In another preferred embodiment, data or signals are physical signals including but not limited to electric charges, electric currents, voltages, magnetic fields, magnetic moments, electromagnetic fields, electromagnetic waves, masses of matter, number of particles, strengths of forces, physical locations, velocities of motion, mechanical energies, mechanical waves, chemical and material properties.
MDS of the first kind 1111 in turn comprises a plurality of multi-dimensional points (MDPs) of a first kind. In one preferred embodiment, MDS of the first kind 1111 is the collection of spatial coordinates of particles constituting a many-body quantum system with particle position operators being the canonical variables. In another preferred embodiment, the MDPs of the first kind are strings of 0 or 1 valued binary variables representing configurations of spins, or qubits that comprise a quantum computing device. In yet another preferred embodiment, the MDPs of the first kind are lattice points of a lattice model, namely, a physical model that is defined on a lattice, such as a quantum Ising model, a Heisenberg model of magnetism, a Hubbard model, a t-J model, or a lattice field theory. In yet another preferred embodiment, the MDPs of the first kind are vertices of a graph on which a classical random walk or a quantum walk takes place. Each preferred embodiment is given by way of example but no means of limitation. MDS of the first kind 1111 is endowed with a topology and comprising 2D
All LDDs in the plurality of groups of LDDs 1112 are efficiently computable. Each said group of LDDs is associated with an LDS and an MDS of the second kind according to one of said plurality of means of factoring MDS of the first kind 1111, with each said LDD included in such group of LDDs being associated with a portion of the MDS of the second kind associated with the group of LDDs, thus each of the LDDs is associated with a portion of MDS of the first kind 1111, such that each MDP included in said portion of MDS of the first kind 1111 is substantially the same as one and only one ordered pair of an LDP included in the LDS and an MDP of the second kind associated with the group of LDDs, whereas each pair of said portions of MDS of the first kind 1111 associated respectively with two different LDDs included in the same group of LDDs are substantially non-overlapping with respect to a volumetric measure, such that the volumetric measure of the intersection set of said pair of said portions of said MDS of the first kind is substantially negligible comparing to the volumetric measure of each of the pair of said portions of said MDS of the first kind, wherein the plurality of groups of LDDs 1112 collectively determine a practically substantially entangled high-dimensional density (HDD) that is uniquely defined over MDS of the first kind 1111, such that the HDD is substantially the same as each LDD in the corresponding portion of MDS of the first kind 1111 associated with that LDD. In one preferred embodiments, a group of LDDs consists of the eigenvectors of a Gibbs operator associated with the largest eigenvalue, while in another preferred embodiment, a group of LDDs comprises general signed densities. Furthermore, in one preferred embodiment, said volumetric measure is a Lebesgue volume measure with respect to a Riemannian metric on a Riemannian manifold, while in another preferred embodiment, said volumetric measure is a probability measure with a probability density function defined in relation to the HDD and the LDDs. Still further, in one preferred embodiment, the HDD is a ground state wavefunction of a physical system, while in another preferred embodiment, the HDD is a general signed density over a multi-dimensional space. Each preferred embodiment is given by way of example but no means of limitation.
Objective operator 1113 is an operator that enters a linear, quadratic, or another form of nonlinear functional involving the HDD to produce an expectation value. In one preferred embodiment, objective operator 1113 is a partial Hamiltonian that measures the total energy of a physical system. In another preferred embodiment, objective operator 1113 is an MDP projection operator, or called a position measurement operator, that produces an expectation value indicating the probability of finding the multivariable system at an MDP within a predetermined subset of MDS of the first kind 1111. Each preferred embodiment is given by way of example but no means of limitation.
Also the contemporary approach and translation by J. Merker: S. Lie and F. Engel, Theory of Transformation Groups I: General Properties of Continuous Transformation Groups (Springer, 2015).
English Translation: Functional Anal. Appl., vol. 2, pp. 282-291 (1968).
16, pp. 971-1023 (2016).
The present application claims the benefit under 35 U.S.C. § 119(e) of U.S. Provisional Application Ser. No. 63/130,847, filed Dec. 27, 2020. Said U.S. Provisional Application Ser. No. 63/130,847 is hereby incorporated by reference in its entirety.
Number | Date | Country | |
---|---|---|---|
63130847 | Dec 2020 | US |