The present disclosure relates to the field of quantum computing, specifically to the field of quantum processor characterization.
The development of useful quantum computers relies crucially on the reduction of implementation errors—differences between the actual and ideal implementations of quantum logic operations. The long-term strategy for dealing with such errors is quantum error correction, which, however, requires very low error rates to begin with, as well as an excessive overhead in the number of qubits, relative to existing hardware. Complementary strategies, suited for dealing with errors in small circuits on existing hardware, are collectively termed quantum error suppression and mitigation. Both the improvement of hardware to meet the conditions for error correction, and the majority of error suppression and mitigation protocols, require a detailed and high-resolution characterization of implementation errors. Though the characterization of implementation errors in single qubits or qubit pairs is common practice, it is widely understood as computationally intractable for more than just a few qubits—unless various simplifying and often un-realistic assumptions are made. This state of affairs prevents the detailed and high-resolution characterization of state-of-the-art quantum processors, currently numbering tens to hundreds of qubits.
As an example, given a Markovian model for implementation errors to be characterized, the most detailed characterization protocol known to date is termed gate set tomography (GST) as described for example in Nielsen et al. “Gate Set Tomography” Quantum 5, 557 (2021), which is hereby incorporated by reference. Here ‘gate set’ corresponds to a set of quantum logic operations, generally including quantum states, gates and measurements, which are characterized simultaneously, relative to one another. High resolution is achieved in GST by using long periodic sequences of gates, such that the estimation error scales as the inverse of the sequence length. The flexibility in choosing the implementation errors to be characterized, and the fact that gates, states and measurements are all being characterized, means that GST can be viewed as generalizing and unifying various pre-existing protocols, such as state, measurement, process and Hamiltonian tomography. But the detailed output of GST naturally makes it resource extensive, in terms of both quantum and classical run times, as well as classical memory. As a result, the open-source implementation of GST, which is arguably the industry standard for detailed characterization, currently only supports single- or two-qubit GST as a standard use case, see Nielsen et al. “Probing quantum processor performance with pyGSTi,” Quantum Science and Technology 5, 044002 (2020).
A common approach to bypass the computational barrier of many-qubit characterization is to characterize a small patch (i.e. a small subset) of qubits in a large device, simply ignoring all qubits outside the patch. As an example, all published experimental uses of GST to date have used either single- or two-qubit patches. The above approach is also utilized by fast GST-like characterization protocols, including the single-qubit ‘robust phase estimation’ and variations thereof, Kimmel et al. “Robust calibration of a universal single-qubit gate set via robust phase estimation” Phys. Rev. A 92, 06235 (2015) and Landa et al. “Experimental Bayesian estimation of quantum state preparation, measurement, and gate errors in multi-qubit devices” Phys. Rev. Research 4, 013199 (2022), and the two-qubit ‘Floquet calibration’, Arute et al. “Observation of separated dynamics of charge and spin in the Fermi-Hubbard model” arXiv:2010.07965 (2020), and ‘Hamiltonian error amplifying tomography’, Sundaresan et al. “Reducing Unitary and Spectator Errors in Cross Resonance with Optimized Rotary Echoes” PRX Quantum 1, 020318 (2020). Similarly, most randomized characterization protocols are commonly applied to single- or two-qubit patches. However, gates intended to act within a particular patch also act unintentionally on additional, typically neighboring, qubits. This important phenomenon is known as cross-talk, and such cross-talk errors between the patch and its neighborhood cannot be captured by single-patch characterization protocols.
The Applicant has found that cross-talk between the characterized patch and its neighborhood leads to systematic estimation errors of implementation errors occurring within the patch, essentially since qubits out of the patch play the role of an unmodeled environment. Such systematic errors are herein referred to as patching errors. It has been found that patching errors are, in many realistic scenarios, a limiting factor in the performance of single-patch characterization protocols, as well as more general patch-based characterization protocols utilizing data from multiple patches.
Patch-based characterization protocols suffer systematic patching errors, as defined above. The present disclosure provides a method for overcoming this difficulty, enabling an efficient, detailed and high-resolution characterization of implementation errors in single patches, which is not limited by patching errors. As a result, the characterizations of multiple patches can be combined into a reliable full device characterization. The method involves novel complementary ‘neighboring gate sequences’ acting on neighboring qubits in addition to standard ‘characterization gate sequences’ in a given patch. While the characterization gate sequences are devised to produce measurement outcomes which are sensitive to implementation errors in the patch, the neighboring gate sequences are chosen to decrease patching errors, by reducing sensitivity to implementation errors involving qubits outside the patch (i.e. cross-talk). This results in single-patch and full-device estimates carrying small patching errors, which are demonstrated to be below statistical errors in practical regimes of interest.
In general, the characterization method disclosed herein may be usable for ranking a plurality of quantum processors based on the fitted model parameters (e.g. by computing a distance measure such as fidelity or diamond distance between ideal and actual implementations of at least some or each quantum operation). In other embodiments, the characterization method may be usable for providing optimal control over the quantum processor by modifying how pulses implement the quantum logic operations on the quantum processor based on the fitted model parameters. In some embodiments, the characterization method may be usable for providing error suppression and mitigation by recompiling a quantum algorithm based on the fitted model parameters. For example, the recompiling may be performed to minimize the use of the noisiest gates. Quantum algorithms and their applications are generally described in the literature (see e.g. www.quantumalgorithimzoo.org). For example, the quantum algorithm may be a factoring algorithm to find a prime factorization of an n-bit integer for use in cryptography.
In accordance with a first aspect of the presently disclosed subject matter, there is provided a computer implemented method, for characterizing implementation errors in a set of quantum logic operations of a quantum processor. The set of quantum operations is acting on a subset of qubits, the subset of qubits is defining a qubit patch. The implementation errors involve the qubit patch and are modeled by a model. The model includes a plurality of model parameters. The method comprises applying a set of quantum circuits to the quantum processor, wherein at least some of the quantum circuits comprise a characterization gate sequence and a neighboring gate sequence. The characterization gate sequence is applicable to the qubit patch and is configured to provide measurement outcomes that are sensitive to at least some of the model's parameters. That is, a small change in said model parameters leads to a large change in (the probability distribution of) measurement outcomes obtained after applying a characterization sequence. The neighboring gate sequence is applicable to at least some neighboring qubits that are outside the patch and interacting with the qubit patch. The neighboring gate sequence is configured to reduce the sensitivity of measurement outcomes to implementation errors involving environment qubits outside the qubit patch. The method further comprises: measuring the patch qubits using a measurement apparatus of the quantum processor; repeating the aforementioned actions of application of a quantum circuit and of measurement, to collect a set of frequencies, where the frequencies are associated with a measurement outcome and a quantum circuit; computing a value of the model parameters by fitting the model to the set of frequencies.
In addition to the above features, a computer implemented method, for characterizing implementation errors in a set of quantum logic operations of a quantum processor, according to this aspect of the presently disclosed subject matter can optionally comprise one or more of features (i) to (xxxiv) below, in any technically possible combination or permutation:
In accordance with a second aspect of the presently disclosed subject matter, there is provided a computer implemented method for characterizing a quantum processor. The method comprises successively applying the method according to the first aspect of the presently disclosed subject matter to a plurality of patches of qubits of said quantum processor.
In addition to the above features, a computer implemented method for characterizing a quantum processor according to this aspect of the presently disclosed subject matter can optionally comprise one or more of features (i) to (iv) below, in any technically possible combination or permutation:
In accordance with a third aspect of the presently disclosed subject matter, there is provided a non-transient computer-readable storage-medium storing computer instructions, wherein the computer instructions are used for causing a computer to execute the method according to the first or second aspects of the presently disclosed subject matter.
In accordance with a fourth aspect of the presently disclosed subject matter, there is provided a computer program product, comprising a computer program, wherein the computer program, when executed by a computer, implements the method according to the first or second aspects of the presently disclosed subject matter.
In accordance with a fifth aspect of the presently disclosed subject matter, there is provided a method for characterizing a quantum processor including a plurality of qubits, the method comprising applying a characterization protocol to a qubit patch including a subset of qubits, wherein the characterization protocol includes reducing a patching error due to interactions between qubits inside the qubit patch and qubits outside the qubit patch.
In addition to the above features, a method for characterizing a quantum processor including a plurality of qubits, according to this aspect of the presently disclosed subject matter can optionally comprise one or more of features (i) to (vi) below, in any technically possible combination or permutation:
In accordance with a sixth aspect of the presently disclosed subject matter, there is provided a quantum circuit for use in characterizing implementation errors in a set of quantum logic operations of a quantum processor. The set of quantum operations is acting on a subset of qubits, the subset of qubits is defining a qubit patch. The implementation errors involve the qubit patch and are modeled by a model. The model includes a plurality of model parameters. The quantum circuit comprises: a characterization gate sequence applicable to the qubit patch and configured to provide measurement outcomes sensitive to at least some of the model parameters; a neighboring gate sequence applicable to at least some neighboring qubits interacting with the qubit patch, wherein the neighboring gate sequence is configured to reduce a sensitivity of measurement outcomes to implementation errors involving environment qubits outside the qubit patch.
In addition to the above features, a quantum circuit for use in characterizing implementation errors in a set of quantum logic operations of a quantum processor, according to this aspect of the presently disclosed subject matter can optionally comprise one or more of features (i) to (iv) below, in any technically possible combination or permutation:
In accordance with a seventh aspect of the presently disclosed subject matter, there is provided a system comprising a computer and a quantum processor, the computer having a gate-level access to the quantum processor, the system being configured to perform the method according to the first, second, fifth, or sixth aspects of the presently disclosed subject matter.
In the present disclosure, the following terms and their derivatives may be understood according to the below explanations:
The term “qubit” may refer to a 2-level quantum system.
The term “qudit” may refer to a M-level quantum system, with M≥2. This includes the qubit case M=2, and may include the case M=∞ corresponding to a bosonic mode.
The term ‘axis’ may refer to a geometrical axis of rotation in the Bloch-sphere, such as the x-axis, representation of a qubit.
The term ‘Hamiltonian’ may refer to, depending on the context, a Hamiltonian operator or any term thereof, or may refer to having the Hamiltonian property—absence of decoherence.
The term “multi-qubit interaction” may refer to an implementation error involving (m>1) multiple qubits.
The term “quantum logic operation” may refer to any of the following operations applied to a quantum processor: initialization, measurement, gate application and reset operation. Additionally, it may also refer to pulse application, mid-circuit measurement and/or adaptive measurement.
The term ‘model’ may refer to a mathematical model for the implementation of a set of quantum logic operations in a quantum processor, i.e. a function θ(θ) mapping a plurality of model parameters θ to a particular implementation (θ).
The term “fitting” may refer to any form of selecting or updating a model based on measurement outcomes obtained from a quantum processor. This includes e.g. the computation of a maximum likelihood point in a given model parameter space, or the Bayesian update of a probability distribution over said parameter space.
Both model and fitting procedure may incorporate machine learning methods, such as artificial neural networks and optimization algorithms used for fitting such models to data.
The term “hypergraph” may refer to a generalization of a graph in which edges can join any number of vertices.
The term “patch” may refer to a subset of qubits in a quantum processor.
The term “patching error” may refer to systematic estimation errors occurring in a characterization protocol applied to a patch in a quantum processor, due to unwanted interactions between the characterized patch and additional qubits in said quantum processor.
The term “environment qubit” may refer to a qubit outside of a qubit patch defined in a characterization method according to the present disclosure.
The term “neighboring qubit” may refer to an environment qubit (i.e. a qubit outside of said patch) interacting with the qubit patch.
The term “gate sequence” may refer to a plurality of gates (i.e. quantum logic operations) applicable to given qubits in a given temporal arrangement. In particular, the term “characterization gate sequence” may refer to a plurality of gates applicable to (patch) qubits in the patch and the term “neighboring gate sequence” may refer to a plurality of gates applicable to (neighboring) qubits outside the qubit patch. In particular, a gate sequence may comprise a successive plurality of bursts of gates in a given temporal arrangement, wherein the gates of each burst are applied directly successively. As will be described herein, a gate sequence (e.g. the neighboring gate sequence) may be composed of a plurality of intertwined gate sequences (e.g. the context gate sequence and the dynamical decoupling gate sequence).
It is understood that the terms “at least one” and “at least some” are used herein to provide a general description of the method. For the sake of brevity, the description is not repeated by interchanging these terms by the term “each”. However, an embodiment for which the terms “at least some” and/or “at least one” is replaced by the term “each” or “substantially each” is also herein disclosed.
In order to better understand the subject matter that is disclosed herein and to exemplify how it may be carried out in practice, embodiments will now be described, by way of non-limiting example only, with reference to the accompanying drawings, in which:
rotations or a quantum state around the x-axis of the Bloch sphere.
Single-Qubit States and Rotations
Standard representations of single-qubit states and the meaning of rotation operations used in the following description are reviewed hereinbelow.
States which are not pure are termed mixed, and may be represented as points in the interior of the Bloch sphere—the Bloch-ball. In this case, the vector representation is replaced with a density operator representation. In
state Xπ/2|ψ220 and by π to the state Xπ|ψ230.
Quantum Logic Operations and Implementation Errors
The mathematical formalism used in the following description is reviewed hereinbelow.
A quantum processor, or quantum processing unit (QPU), may generally include a plurality of qubits and the following types of quantum logic operations:
The union of these three sets may be referred to as a set of quantum logic operations, or “gate set”,
={E|}∪{G}∪{|ρ}. (1)
The term “supported” means available as a native capability. Non-native gates may be replaced by equivalent combination of native gates. Similarly, non-native initial states or measurements may be obtained by applying native gates after applying a natively available initial state, or before the application of a natively available measurement. It is customary to collectively refer to the state preparations and measurements {|ρ}∪{E|} as SPAM.
Mathematically, each initial state p is a density operator, i.e. a positive semi-definite and unit-trace operator on a d-dimensional Hilbert space, where d=2n and n is the number of qubits. By choosing a Hilbert-Schmidt-orthonormal basis for Hermitian operators, each initial state can be represented as a vector |ρ∈d
Each element in the gate set will have an ideal implementation, as intended by the hardware manufacturer. The ideal gate set may be denoted by
0
={
E
0
|}∪{G
0}∪{|ρ0}. (2)
As described in more detail below, we do not restrict the ideal gates G0 to act non-trivially only on a small subset of qubits, e.g. to be 1- or 2-qubit gates. The ideal gates may act non-trivially on all qubits in the QPU, and can therefore be viewed as quantum-circuit-layers. Accordingly, we refer to a product of gates as a ‘sequence’ or a ‘circuit’ interchangeably. In some embodiments of the disclosed method, the gate set to be characterized contains at least one SPAM operation, or at least one non-idle gate, i.e. a gate G such that G o is not the identity matrix on all qubits.
The set of implementation errors may be defined as the difference between the actual implementation in Equation (1) and the ideal implementation in Equation (2). A Markovian error model may be used for simplicity without limitation of the scope of the present disclosure. In particular, this means that each gate set element may have a unique implementation error, independent of other circumstances. Such circumstances can include quantum operations applied previously, the time of day, etc. Additionally, it is noteworthy that given a Markovian error model on the full device, restricting attention to a particular patch generally leads to non-Markovian errors on that patch.
Each of the ideal gates G0=U0⊗U0* corresponds to a unitary operator U0∈SU(d), which acts on density operators as ρU0ρU0554. A coherent/unitary/Hamiltonian implementation error corresponds to an inaccurate unitary implementation, i.e., G=⊗U* with U≠U0. An incoherent/dissipative implementation error corresponds to a slightly non-unitary implementation, due to un-wanted interactions between the QPU and the rest of the universe.
Commonly, quantum processors admit a unique native initial state, ideally given by |ρ0=|0⊗n, where |0 corresponds to the state |00|, or simply |0, on each qubit. |ρ0 may also be written as |0, where 0 is now understood as an n-bit, rather than a single-bit, number. Similarly, Ei═=i| may correspond to a parallel measurement of all qubits in the 0-1 basis, each resulting in a binary outcome. These n bits may then be collected into an n-bit number i∈{0, . . . , d−1}. SPAM implementation errors can be understood as unwanted gates acting on the SPAM, e.g., |ρ=Gρ═ρ0.
Given the basic quantum logic operations in the gate set, each quantum circuit which can be implemented on the QPU corresponds to the application of a particular sequence of gates, labeled by j=(j1, . . . , jt) to the initial state, and a subsequent measurement which outputs one of the possible outcomes i=0, . . . , d−1. The symbol t may denote the number of gates in a sequence. The probability for the ith outcome for the sequence j in the gate set is given by
p
i
=p
i,j()=Ei|Gj
By repeating the same circuit N times, or shots, the frequencies fi=Ni/N may be collected, where Ni is the number of times the outcome i was obtained. The frequencies are multinomial random variables, with means pi, and variances pi(1−pi)/N, and therefore correspond to experimentally available approximations for the probabilities pi, with statistical errors ˜1/√{square root over (N)}. The ability to send quantum circuits and shot numbers to the QPU and receive the resulting measurement outcomes may be referred to as gate-level access to the QPU. This is the standard level of access provided to users by hardware manufacturers. A more detailed level of access may be referred to as ‘pulse-level’, and corresponds to users which can define and manipulate the time-dependent laser pulses generating gates. Such users may be interested in a direct characterization of pulses, rather than of the corresponding gates, and the method presented in the present disclosure is applicable to such pulse-level characterization protocols.
Finally, an important implication of equation (3) is that outcome probabilities are invariant under basis changes A∈GL(d2),
E
i
|
E
i
′|=
E
i
|A (4)
G
j
G
j
′=A
−1
G
j
A
|ρ|ρ=A−1|ρ
which are therefore referred to as “gauge transformations”. It follows that gate sets related by gauge transformations, ˜′, are physically indistinguishable, and a given QPU is actually described, at gate-level, by an equivalence class of such gate sets. A similar gauge-equivalence occurs also in a pulse-level description of QPUs.
Gate Set Tomography (GST)
Gate set tomography (GST) is a well-known protocol for the characterization of quantum logic operations. It is described herein as an example of a detailed characterization protocol. Its main drawback is the exponential scaling of required resources with the number of qubits, currently presenting difficulty in applying it to quantum processors numbering more than a few qubits.
The GST protocol is reviewed hereinbelow as a basis for the later description of embodiments of the present disclosure.
The goal of GST is to characterize implementation errors, given gate-level access to the QPU. This goal may be achieved in a few steps:
Characterization protocols, and GST in particular, require both quantum and classical resources. A practical way to quantify the quantum resources is to measure the QPU time, which is the amount of time the data acquisition step takes. This is generally a device-dependent function TQPU(N, {j}) of the number of shots per sequence, and the set of characterization sequences. As an example, on superconducting-qubit devices, the length t=tj of each sequence may not be a substantial factor influencing the QPU time because gate times are short i.e., of the order of about 10 ns. However, the number of distinct sequences Nseq=|{j}| may be a substantial factor influencing the QPU time. First, the number of distinct sequences controls the total number of shots Nseq×N which is significant because reset times may be long, i.e., in the order of about 100 μs. Second, TQPU can depend directly on Nseq, depending on how sequences are compiled into pulses acting on qubits.
Herein more details are provided on the sequence selection (step 2 of the GST protocol), which are relevant for the subsequent description of embodiments in the present disclosure.
In order to achieve a high sensitivity to model parameters, GST uses mostly-periodic long gate sequences, of the form FgqF′, with logarithmically-space powers q between 1 and qmax>>1, e.g. q∈{2k}k=1log
The parameter variations δθ that may be amplified are those for which δg=(∂g/∂θ)∂θ commutes with the germ, [g, δg]=0. This defines a linear space of amplified variations for each germ, amp(g). A first goal of a sequence selection algorithm may be to choose enough germs gα such that the space of all amplified variations Σαamp(gα) has the maximal rank.
The initial and final circuits F, F′ are referred to as fiducial circuits and may be configured such as to prepare a set of initial states and measurements, starting from the natively available |ρ and Ei|. A second goal of a sequence selection algorithm may be to choose, for each germ, a set of fiducial circuits which, collectively, have a large overlap with all amplified variations.
Both goals of sequence selection may be obtained by maximizing the Jacobian J=∂p/∂θ, evaluated at the ideal point in parameter space θ0, defined such that (θ0)=0. This is done by minimizing the (properly normalized and regularized) score function Tr(JTJ)−1), which approximates the statistical estimation error (measured by a Frobenius distance) of a GST protocol using the sequences from which J is constructed. In this manner, the sequence selection algorithm chooses sequences which minimize the error bars of estimated model parameters. At the end of a successful sequence selection, each amplified parameter will have a statistical error δθ˜1/(tmax√{square root over (N)})«ϵmax, so the largest amplified errors on the QPU can be characterized with a small relative error.
Neighboring Gate Sequences
As described in the Background section, detailed characterization methods known in the art, such as GST, require computational resources which quickly become infeasible as the number of qubits grows. A standard approach to dealing with this problem is to segment the plurality of qubits into “patches”—small subsets of qubits—and characterize each patch by itself, while ignoring the remaining qubits.
Naive segmentation is problematic due to crosstalk—unwanted interactions—between qubits which are in the patch and qubits of the QPU which are not in the patch, referred to as ‘environment qubits’. As demonstrated below, patch-environment interactions lead to an inaccurate characterization of implementation errors in the patch, and may be the limiting factor in the performance of single-patch characterization protocols, and multi-patch protocols constructed from these. The characterization error in single-patch protocols introduced by patch-environment interactions may be referred to as ‘patching error’.
The present disclosure addresses the above problem by adding ‘neighboring gate sequences’ in parallel to the characterization sequences (such as GST sequences described above), said neighboring gate sequences being applied to a subset of environment qubits, referred to as ‘neighboring qubits’, or as the ‘patch neighborhood’. The neighboring qubits may comprise qubits outside the patch (i.e., environment qubits) which are interacting with the qubit patch for example as defined by an interaction graph, or more generally, a hypergraph (which may also be referred to as a coupling map). The vertices of the interaction hypergraph may represent the qubits in the QPU, while the hyperedges may correspond to subsets of multiple qubits suspected to be interacting via multi-qubit implementation errors, or crosstalk. In some embodiments, the interaction hypergraph may be specified by a user. In some embodiments, the hypergraph may be defined such that hyperedges correspond to subsets of multiple qubits on which native multi-qubit gates exist in the QPU. The qubits in such subsets must be physically coupled in order to allow for the application of said native gates, and it is therefore natural to expect that these qubits will also suffer significant crosstalk. While characterization sequences are configured to produce measurement outcomes which are highly sensitive to implementation errors in the characterized patch, the neighboring gate sequences are configured to reduce the sensitivity of measurement outcomes to implementation errors involving the environment qubits, and more specifically, the neighboring qubits. Such a manipulation of the neighboring qubits results in reduced patching errors, enabling a reliable characterization of individual patches, and as a result, of full quantum processors.
The method may include a step 610 of applying a set of quantum circuits on the quantum processor. Applying said set of quantum circuits may include applying characterization gate sequences 613 to said qubit patch and applying neighboring gate sequences 617 to neighboring qubits. The characterization gate sequence may for example be a GST sequence. The neighboring qubits—as explained hereinabove with reference to
The method may further include a step of measuring 620 said qubits. The measuring of said qubits may be performed using a measurement apparatus of said quantum processor. In some embodiments, the measuring may include measuring some or all of the qubits of the quantum processor. In some embodiments, the measuring may include measuring the patch qubits.
The method may further include a step of repeating 630 steps 610-620 to collect a set of frequencies associated with a measurement outcome and a quantum circuit. In some embodiments, measuring said qubits may be performed on patch qubits and environment qubits. The method may then may further comprise computing reduced frequencies from the set of all frequencies by summing over frequencies corresponding to outcomes that differ only in the value of environment qubits. In some other embodiments, measurement of the patch qubits may be performed separately from the environment qubits, and the step of computing reduced frequencies may not be required.
The method may further include a step of fitting 640 the model parameters to said set of frequencies. The fitting principle may be similar to the fitting principles used for the GST protocol as explained hereinbelow.
In some embodiments, the method may be successively applied to a series of different qubit patches. This may provide a full characterization of the quantum processor.
In some embodiments, the method may be repeatedly applied to a same qubit patch, and fitted parameters obtained in each repetition may be used to construct both neighboring gate sequences and characterization gate sequences for the next repetition.
This may provide a characterization of the patch over time.
In some embodiments, the neighboring gate sequence may be configured to reduce a patching error estimated by a perturbative expansion.
In some embodiments, the neighboring gate sequence may be configured to cancel or reduce the leading orders in said perturbative expansion.
Patching Errors: Mathematical Definition
Hereinbelow, a detailed example, including a precise mathematical definition for patching errors is provided. Within this example, a perturbative computation of patching errors is subsequently performed, followed by examples of neighboring gate sequences motivated by the perturbative expansion, as well as numerical simulations demonstrating the performance of said neighboring gate sequences in realistic scenarios.
An n-qubit QPU, with native gate set QPU={Gj} including 1- and 2-qubit gates is provided. It is assumed that each gate may be represented as Gj=eh
An artificial separation of the n qubits into npatch qubits, to be considered as the ‘patch’, and the nenv=n−npatch complementary qubits, to be considered as the ‘environment’ may be made. In terms of error generators, this corresponds to a splitting
L
j
=L
j,patch
+L
j,env
+H
j,int
into patch, environment, and (Hamiltonian) interaction parts,
Note that an operator, or super-operator, A is said to ‘act only on the patch’ if it can be written as A=Apatch⊗Ienv, where Ienv is the identity operator, or super-operator, on the environment. In our notation we use Apatch to denote both the full A and its restriction to the patch, and the precise meaning should be clear from context. An analogous statement holds for operators that ‘act only on the environment’, of the form A=Ipatch⊗Aenv.
The goal of a characterization protocol applied to a single patch (‘single-patch protocol’) may be to characterize implementation errors Lj,patch involving only the patch qubits, and occurring in a subset of gates {Gj}j∈j
patch
={G
j,patch
=e
h
+L
}j∈J
In order to achieve this goal, a single-patch protocol will send quantum circuits j=j1, . . . , jt), with jk∈Jpatch, to the QPU, and receive N random samples per-sequence from the ‘reduced probabilities’
p
i,j
=
E
i,patch
|
I
env
|G
j
. . . G
j
G
j
|ρenv|ρpatch. (5)
These correspond to the preparation of an ideally tensor-product initial state |ρenv|ρpatch, the application of the chosen quantum circuit, and finally the ideal partial trace operation on the environment Ienv|, and a measurement of the patch qubits Ei,patch|. Note that we implicitly account for SPAM implementation errors as the first and last gates in each gate sequence.
A single-patch characterization protocol does not account for implementation errors involving qubits out of the patch, and in particular, assumes Hint=0. Under this assumption, the action of each gate Gj with j∈Jpatch becomes
G
j
=e
h
+L
+L
=e
h
+L
⊗e
L
=G
j,patch
⊗e
L
,
and the reduced probabilities simplify to
p
i,j
(0)
=
E
i,patch
|G
j
,patch
. . . G
j
,patch|ρpatchƒ, (6)
as would be the case if the environment qubits did not exist. Since the single-patch protocol assumes it receives samples from pi,j(0), but instead receives samples from pi,j, it will output an incorrect estimate for implementation errors in the patch, {Lj,patch}j∈J
δp=p−p(0)
The corresponding systematic (i.e., infinite statistics, N=∞) deviation in the estimate of Lj,patch, or alternatively patch, may be defined the patching error itself.
As an example for a single-patch characterization protocol, consider GST applied to a single patch. A parameterization, or model,
θpatch(θ)={Gj,patch(θ)}j∈J
is first chosen. This parameterization may be of the error generator form Gj,patch(θ)=eh
θest(0)=θreal,
or, more accurately, that the left- and right-hand sides of this equation lead to gauge equivalent gate sets, patch(θest(0))˜patch(θreal). For Hint≠0, the probabilities patching error δpi,j=[fi,j]−pi,j(0) leads to a systematic (N=∞) estimation error
δθest=θest−θreal,
which may be referred to as a ‘parameter patching error’. A quantitative relation between δθest and δp is given by
δθest=J−1δp+0(δp2), (7)
where J=∂p/∂θ is the Jacobian of parameterized probabilities, computed at θest, and J−1 is its suitably defined pseudo-inverse, already mentioned above.
Layer Characterization, Context Gates and Context Dependence
Above we considered a subset of gates {Gj}j∈J
The above generalization suggests a possible refinement of the goal of a single-patch characterization protocol: characterize the patch gate set patch acting on the patch qubits, in the ‘context gate set’
context
={G
j,context
=e
h
}j∈J
acting on the environment qubits. That is, the gate Gj,context should in principle be applied to the environment qubits whenever the gate Gj,patch is applied to the patch qubits. A gate sequence that consists of context gates may be referred to as context gate sequence. Note that the above refinement is relevant even if all context gates are idle gates hj,env=0, and that we omit Lj,env from Gj,context to stress that these implementation errors are not modeled or characterized by a single-patch protocol.
In order to quantify the extent to which applying context gates in parallel to corresponding patch gates is important, the notion of ‘context dependence’ may be defined. Consider two different patch gates Gj
Context dependence leads to a tension between reducing patching errors, and minimizing the time spent in incorrect contexts. This translates to a tension between neighboring gates sequences and context gate sequences, both acting on neighboring qubits during the application of characterization gate sequences. Generally speaking, strong context dependence favors ‘dilute’ neighboring gate sequences, which suppress patching errors efficiently with only few gates.
Below we demonstrate a perturbative analysis of patching errors. For simplicity, the analysis is carried out assuming all patch gates are meant to act in the idle context, and while ignoring context dependence. We then demonstrate in simulations that neighboring gate sequences motivated by the perturbative analysis can reduce patching errors efficiently even in context dependent scenarios.
Patching Errors: Perturbative Computation
A perturbative computation of the probabilities patching error δpi,j, subsequently used to construct examples of neighboring gate sequences is detailed hereinbelow. The perturbative expansion may require that Lpatch, Lenv, Hint are bounded, in a suitable operator-norm, by ϵpatch, ϵenv, ϵint≤ϵmax«1. It may further be required that tϵmax«1, where t=tj is the length of the sequence j.
For each sequence, it is convenient to express the perturbative expansion of δpi,j using an ‘effective error generator’ jeff, which is a super-operator on the patch such that
δpi,j=Ei,patch|Gj
The effective error generator is conveniently obtained by first expanding Ienv|Gj
j
eff=j,1eff+j,2eff+O(t3ϵint3),
where j,meff=O(tmϵintm). We then further expand j,meff in Lpatch and Lenv,
The first line in j,1eff corresponds to the first and leading order in the perturbative expansion, while the other explicit parts of j,1eff and j,2eff makeup the second order. The integral ∫u may refer to ∫0tdu and ∫v<u may refer to ∫0t du ∫ou dv. We also defined the piecewise constant Hint(u)=Hj
Initial States, Neighborhood-Mixed States and Initial Twirl
The Applicant has found that for particular initial states of the environment qubits ρenv, the expression «Ienv|H′int(u)|ρenv» appearing in the first two lines of Eq. (9) vanishes. For such states, Equations (8)-(10) reduce to
As a first example, assume Hint is a sum of terms Hx,y acting as ρx⊗ρyi[hx⊗Zy, ρx⊗ρy], where hx is a Hamiltonian operator on a patch qubit x, and Zy is the Pauli-Z operator in an environment qubit y. Here ρx, ρy are density matrices on qubits x, y. Under the above assumption, the expression «Ienv|Hint′(u)|ρenv» vanishes if ρenv is such that its reduced density matrix ρy on each neighboring qubit y (as defined above) is given by |+=|++|, where |+=|+X=(|0+|1)/√{square root over (2)}. As an example, the state
|ρenv=|+⊗n
where nneigh is the number of neighboring qubits, satisfies this condition. The above state can be efficiently prepared from the native state |ρenv=|0⊗n
More generally, the Applicant has found that if the initial state of the environment qubits ρenv is such that its reduced density matrix ρy on each neighboring qubit y is maximally mixed, i.e., ρx=Ib/2, the expression «Ienv|Hint′(u)|ρenv» appearing in the first two lines of Eq. (9) vanishes, for any Hint. Such a state may be referred to as ‘neighborhood-mixed’, and denoted ρenv(NM). Relevant examples for neighborhood-mixed states are
and the closely related
|ρenv(NM)=(|0⊗n
The above states are mixed but ‘separable’, or ‘un-entangled’, and can therefore be prepared efficiently, as described below. It should be noted that a neighborhood-mixed state can be pure, but must then be entangled, requiring a non-trivial quantum circuit to prepare. An example for such a neighborhood-mixed state is the “GHZ state” |GHZ»=|GHZGHZ|, where |GHZ=(|0⊗n
In some embodiments, a patching error may be estimated using a perturbative expansion and the neighboring gate sequences may comprise initializing the environment qubits in a state configured to cancel or at least reduces leading orders in said perturbative expansion.
In some embodiments, the neighboring gate sequences may comprise initializing the environment qubits in a neighborhood-mixed state.
In some embodiments, the neighboring gate sequences may comprise initializing (only) the neighborhood qubits in a neighborhood-mixed state.
In some embodiments, the definition of a neighborhood-mixed state does not include the maximally mixed state (i.e. the state where the density operator of all environment qubits is proportional the unit-operator).
Based on the interpretation of mixed states as probability distributions over pure states, the Applicant has found that the preparation of some neighborhood-mixed states can be obtained by including in the neighboring gate sequences an ‘initial twirl’. The latter corresponds to assigning to a given characterization sequence a number of distinct neighboring gate sequences, and averaging over the corresponding frequencies. These distinct neighboring sequences may differ in their initial gate, such that the averaging corresponds to the preparation of a mixed (and neighborhood-mixed) initial state. In other words, the set of quantum circuits may include (or may consist of a series of) quantum circuit clusters wherein each quantum circuit cluster comprises a plurality of quantum circuits having a same characterization gate sequence and a distinct neighboring sequence. In these embodiments, the method may include combining (e.g. averaging) the frequencies collected on quantum circuits of the same cluster. The distinct neighboring sequences may differ in their initial gate, such that the combining (e.g. averaging) corresponds to the preparation of a mixed (and neighborhood-mixed) initial state. The initial twirl thus eliminates the first-order in the perturbative expansion of the patching error, while requiring no complex initialization procedures.
As an example, the initial twirl corresponding to the state in Eq. (12) may be such that, for each characterization sequence, two distinct neighboring sequences are assigned, the first beginning with the idle gate and the second with an Xn gate, both applied in parallel to all neighboring qubits, and a subsequent averaging of the two distinct frequencies for each characterization sequence. A characterization method including the above initial twirl may include for example the following steps:
The QPU time required may be TQPU(N/2, {j′}∪{j″}). Compared to the QPU time TQPU(N, {j′}) of the bare sequences, the number of sequences doubled, but the total number of shots is unchanged.
Reiterating, the patching error with initial twirl (or neighborhood-mixing) may be of second-order only −[fi,j]−pi,j(0)=O(tj2ϵmax2)—and not first order, as may be without initial-twirl/neighborhood-mixing −[fi,j′]−pi,j(0)=O(tjϵmax). The statistical error (in the frequencies) may not change: [fi,j]=[fi,j′]=O(1/N).
As explained below and shown for example in
The characterization sequence 740 applied on the patch qubits 710 may be a GST sequence determined in accordance with the GST protocol. The characterization sequence may comprise an initial fiducial sequence F 742, a final fiducial sequence F′ 746 to prepare the qubits to measurement, and multiple applications of a germ g 744. The neighboring sequence 750 applied on the neighboring qubits 720 may comprise an initial twirl 753 and a dynamical decoupling sequence 756.
As explained above, the neighboring gate sequence 750 may include initializing the environment qubits in a neighborhood-mixed state. In particular, the neighboring sequence 750 may include an initial twirl 753 that may include applying a π rotation along the x-axis for each neighboring qubit only for half of the repetitions of each characterization sequence. For the other half, no rotation may be applied, symbolized as applying identity gates.
In some embodiments, at least some of the dynamical decoupling gates (or sets of gates) 756 are applied synchronously with germs, i.e. a dynamical decoupling (“DD”) operation may begin simultaneously with the beginning of a germ.
In the present disclosure, the term ‘coordination’ may refer to a relative timing in the application of the gates. The timing may be affected by the characteristics of the processor. For example, implementation errors dictate that even simple idling of any qubit may be sensitive to the time spent idling. Examples of relative timing between gates may include simultaneous beginning of one or more gate(s) applied on one or more qubit(s) and of one or more other gate(s) applied on one or more other qubit(s), simultaneous ending of one or more gate(s) applied on one or more qubit(s) and of one or more other gate(s) applied on one or more other qubit(s), or a predetermined delay between one or more gate(s) applied on one or more qubit(s) and one or more other gates applied to the same or one or more other qubits.
The quantum circuit may further include an initial fiducial sequence F 742B, a final fiducial sequence F′ 746B, and measurement 760B. The germs 744B and the fiducial sequences 742B 746B may form the characterization gate sequence 740B.
In some embodiments, initial and final fiducial sequences F 742B, F′ 746B may also be performed together with corresponding context gates (not shown) and the initial twirl may be performed before the context gate corresponding to the (initial) fiducial sequence F 742B.
It is noteworthy that a difference in the application of dynamical decoupling, between the embodiments shown in
In some other embodiments, the averaging of frequencies fi,j′, fi,j″ may be replaced by an averaging of the corresponding fitted model parameters. That is, the model is fitted once to the frequencies fi,j′, resulting in the estimate θest′ of model parameters, and once to the frequencies fi,j″, resulting in the estimate θest″. These estimates are then averaged, θest=(θest′+θest″)/2.
In some other embodiments, the clustering may include more than two quantum circuits per cluster, wherein each quantum circuit of a given cluster may comprise different neighboring sequences and the same characterization sequence.
In some other embodiments, the averaging may be weighted.
In some other embodiments, the averaging may be replaced by a non-linear function.
As explained above, for every quantum circuit j the patching error satisfies δpi,j=O(tjϵmax) for a generic initial state ρenv, but δpi,j=O(tj2ϵmax2) for a neighborhood-mixed initial state ρenv(NM). The dependence of these bounds on tj is generically saturated for mostly-periodic long characterization sequences Gj
where Hint,0 denotes the projection of Hint onto the commutant of the ideal gate Gideal=eh (recall the commutation requirement presented in the overview of GST). Similarly, when the environment qubits are in a neighborhood-mixed state, the leading order integrals in the expansion of jeff (see Eq. (11)) are given by:
As long as the commutant projection Hint,0 does not vanish, which is the generic case, the leading t=tj scaling in the effective error generator is indeed saturated. This holds also for general mostly-periodic sequences.
As demonstrated above, accidental amplification implies an un-favorable scaling of patching errors with sequence length, in mostly-periodic characterization sequences. The Applicant has found that in order to eliminate the accidental amplification of out-of-model implementation errors, while maintaining the carefully constructed amplification of in-model implementation errors, neighboring gate sequences including de-amplification sequences may be used. In some embodiments, such de-amplification sequences may be non-periodic. In some embodiments, such sequences may be periodic and have a de-amplification period differing from a period of the characterization sequence. In some embodiments, such sequences may be un-structured and/or random, or carefully structured for de-amplification. In particular, structured ‘dynamical decoupling’ sequences may be used. Dynamical decoupling (DD) is well known in the art for other applications. As a basic definition, an m′th order DD sequence applied to a given set of qubits is one which, for ideal and instantaneous DD gates, eliminates coherent errors involving said set of qubits up to (and including) the m′th order in perturbation theory. A standard example for a first order DD sequence is given by the so called XY4 sequence, which may be written as X-Y-X-Y. Applied to a qubit subjected to an unwanted Hamiltonian evolution etH=I+O(tH), the XY4 sequence (with ideal and instantaneous gates) modifies the evolution to YπetH/4XπetH/4YπetH/4XπetH/4=I+O(t2H2). DD is used, in some embodiments of this disclosure, to eliminate the systematic patching errors that arise in patch-based characterization protocols.
In some embodiments of this disclosure, the gates of the DD neighboring sequences may be coordinated with the gates of the characterization sequence. In some embodiments, the characterization sequence may have a periodic part (e.g. a GST sequence), and the DD sequence may be applied in parallel to the periodic part of the sequences (i.e., the GST germs). The synchronization may be such that each dynamical decoupling gate may be applied exactly with the first gate of the germ that may correspond it. The DD gates may be evenly spread over the germ repetitions. An algorithm for generating a quantum circuit including an almost-periodic characterization gate sequence and a neighboring gate sequence incorporating DD, for use in a characterization method according to the present disclosure, may for example include the following steps:
Furthermore, the QPU time is unchanged by the additional of DD neighboring sequences, TQPU(N, {jDD})=TQPU(N , {j}), as the DD gates may be applied in parallel to the characterization sequence.
The accidental amplification may thus be eliminated, reducing patching errors as follows. For a first-order (or higher) DD sequence, the patching error may be [fi,jDD]−pi,j(0)=O(ϵmax). Neighborhood-mixing (or initial twirl) used in conjunction with a second-order) (or higher) DD sequence may result in a patching error [fi,jDD+NM]−pi,j(0)=O(tjϵmax2). The statistical error may be unchanged.
In some embodiments, the characterization method described herein may be repeated using a plurality of different (distinct) patches. i.e. the characterization method may be successively applied to a plurality of subsets of qubits of a quantum processor.
In some embodiments, the plurality of distinct patches may cover all of the qubits of the quantum processor. In some embodiments, the plurality of distinct patches may overlap, i.e. different patches may have at least one qubit in common. Overlapping patches may be desired, for example, in order to collectively characterize all interactions specified by the interaction hypergraph. In some embodiments, some of the patches may contain more than two qubits (e.g., three, four, five or more qubits). In some embodiments, the sequences corresponding to different patches (including both characterization and neighboring gate sequences) may be applied in parallel, in order to reduce the overall QPU time.
While the method set forth hereinabove presumed qubits, the method is more generally applicable to QPUs utilizing quantum systems with M≥2 logical states. Quantum systems with two or more logical states are known as ‘qudits’. The mathematical framework of the presently disclosed characterization method has no dependency on the binary nature of qubits. In qudit embodiments, the neighborhood-mixed state (|0⊗n
where m=1, . . . , M. Using initial twirl, the N repetitions (shots per sequence) are split to M ‘sub-repetitions’, with initializing in each sub-repetition the environment's qudits to one of each of the M states. For example, in a processor built using three-level systems (‘qutrits’), the neighborhood mixed state is (|0⊗n
Note that the disclosed method is based on a logical description of the QPU, and may be applied if qubits are ‘encoded’ or ‘logical’ rather than ‘physical’—i.e. corresponding to general 2-dimensional subspaces in the space of physical states of the QPU. The same statement holds for a QPU described by logical qudits, and is relevant in particular for the application of the disclosed method to QPUs implementing quantum error correction, computation in decoherence free subspaces, and topological quantum computation.
Simulation results
The simulated characterization sequences j correspond to the application of Xπ/2t to the patch qubit, with time t=2, . . . , 9, corresponding to the different curves, and the measurement outcome is i=0. Each ϵint corresponds to the standard deviation of a normal distribution from which the interaction Hamiltonian Hint is sampled. Similarly, Lpatch=Hpatch is sampled, with a fixed ϵpatch=10−3. The y-axis corresponds to the standard deviation of δpi,j over many samples from these distributions, scaled by t2(t) for t even (odd). (The mean of δpi,j over randomly-sampled coherent implementation errors vanishes).
Visible in the graphs are eye-guides for linear scaling 931931′ and for quadratic scaling 932932′ with ϵint. Without the initial twirl, the patching error of the odd-length sequences, marked with solid circles, scales as tϵint 913 and the patching error of the even-length sequences, marked with hollow squares, scales as t2ϵint2 914 for ϵint»ϵpatch, and as t2ϵintϵpatch for ϵint«ϵpatch 915. With initial twirl, the tϵint and t2ϵintϵpatch contributions vanish, and the patching is reduced significantly. Both with initial twirl and without initial twirl, the patching error saturates at high ϵint values 933933′.
For simplicity, the errors for different parameters are averaged, and this average (y-axis) is plotted as a function of the number of germs in the characterization sequence (x-axis). Only significant implementation errors, with magnitude larger than 10−4 are included in the average. Every sequence was repeated for N=300 shots. Four datasets are plotted. The first data set corresponds to ‘bare’ characterization sequences, with no added neighboring gate sequences (shown as circle). The remaining data sets correspond to characterization sequences with the following types of neighboring gate sequences: initial twirl (shown as square), initial twirl followed by structured second order dynamical-decoupling sequences (shown as triangle) and initial twirl followed by an equally spaced random sequence (shown as crosses).
A significant reduction of patching error (compared to the statistical error) for the coherent parameters of qubit q1 is achieved as shown in graph 1030, as predicted by the perturbative expansion. In general, it is noteworthy that without initial twirl, for some parameters
for some parameters while with the initial twirl,
for all parameters.
With reference to
The exemplary PC 1200 further includes one or more storage devices 1140, such as a hard disk drive for reading from and writing to a hard disk, a magnetic disk drive for reading from or writing to a removable magnetic disk, and/or an optical disk drive for reading from or writing to a removable optical disk (such as a CD-ROM or other optical media). Such storage devices can be connected to the system bus 1130 by a hard disk drive interface, a magnetic disk drive interface, and/or an optical drive interface, respectively. The drives and their associated computer readable media provide nonvolatile storage of computer-readable instructions, data structures, program modules, and other data for the PC 1200. Other types of computer-readable media which can store data that is accessible by a PC, such as magnetic cassettes, flash memory, digital video disks, CDs, DVDs, RAMs, NVRAMs, ROMs, and the like, may also be used in the exemplary operating environment. As used herein, the terms storage, memory, and computer-readable media may not include or encompass propagating carrier waves or signals per se.
A number of program modules may be stored in the storage devices 1140, including an operating system, one or more application programs, other program modules, and program data. Storage of results of quantum measurements and instructions for obtaining such measurements (and/or instructions for performing any embodiment of the disclosed technology) can be stored in the storage devices 1140. A user may enter commands and information into the PC 1100 through one or more input devices 1150 such as a keyboard and a pointing device such as a mouse. Other input devices may include a digital camera, microphone, joystick, game pad, satellite dish, scanner, or the like. These and other input devices are often connected to the one or more processing units 1110 through a serial port interface that is coupled to the system bus 1130, but may be connected by other interfaces such as a parallel port, game port, or universal serial bus (USB). A monitor 1180 or other type of display device is also connected to the system bus 1130 via an interface, such as a video adapter. Other peripheral output devices 1160, such as speakers and printers (not shown), may be included. In some cases, a user interface is displayed so that a user can input a circuit for synthesis, and verify successful synthesis.
The PC 1200 may operate in a networked environment using logical connections to one or more remote computers, such as a remote computer 1190. In some examples, one or more network or communication connections 1170 are included. The remote computer 1190 may be another PC, a server, a router, a network PC, or a peer device or other common network node, and typically includes many or all of the elements described above relative to the PC 1100, although only a memory storage device 1195 has been illustrated in
When used in a LAN networking environment, the PC 1100 is connected to the LAN through a network interface. When used in a WAN networking environment, the PC 1100 typically includes a modem or other means for establishing communications over the WAN, such as the Internet. In a networked environment, program modules depicted relative to the personal computer 1100, or portions thereof, may be stored in the remote memory storage device or other locations on the LAN or WAN. The network connections shown are exemplary, and other means of establishing a communications link between the computers may be used.
With reference to
With reference to
The quantum processing unit(s) can be one or more of, but are not limited to: (a) a superconducting quantum computer; (b) an ion trap quantum computer; (c) a topological quantum computer using e.g. Majorana zero modes; (d) a photonic quantum computer; or (e) a neutral atom quantum computer. The sets of gates (e.g., using any of the disclosed embodiments) can be sent into (or otherwise applied to) the quantum processing unit(s) via control lines 1270 at a controller 1250. In the illustrated example, the desired quantum computing process is implemented with the aid of one or more controllers 1250 that are specially adapted to control a corresponding one of the quantum processor(s) 1210. The classical processor 1220 can further interact with measuring/monitoring devices (e.g., readout devices) 1280 to help control and implement the desired quantum computing process (e.g., by reading or measuring out data results from the quantum processing units once available, etc.).
Having described and illustrated the principles of the disclosed technology with reference to the illustrated embodiments, it will be recognized that the illustrated embodiments can be modified in arrangement and detail without departing from such principles. For instance, elements of the illustrated embodiments shown in software may be implemented in hardware and vice-versa. Also, the technologies from any example can be combined with the technologies described in any one or more of the other examples. It will be appreciated that procedures and functions such as those described with reference to the illustrated examples can be implemented in a single hardware or software module, or separate modules can be provided. The particular arrangements above are provided for convenient illustration, and other arrangements can be used.
Number | Date | Country | |
---|---|---|---|
63395962 | Aug 2022 | US |