Any and all applications for which a foreign or domestic priority claim is identified in the Application Data Sheet as filed with the present application are hereby incorporated by reference under 37 CFR 1.57.
The present disclosure relates to computer systems that include a hybrid combination of classical computers and quantum computers that mutually exchange data therebetween. More specifically, the present disclosure relates to computer systems and methods that are configured to receive input data from an apparatus and to use a model implemented at least in part on a quantum computer to process the input data to generate corresponding output data for use in controlling operation of the apparatus. Furthermore, the present disclosure relates to software products including executable code for causing a data processor to implement the aforesaid methods.
Probabilistic graphical models can be used to describe dependencies of variables in complex systems, for example dependencies of random variables. Such models are capable of being used to perform two important tasks: learning and inference. Such learning yields a model that approximates observed data distributions of the complex systems. Moreover, inference uses the model to answer queries about unobserved variables given observations of other variables. Contemporarily, in general, exact inference is computationally intractable, so methods that can produce approximate solutions to queries using computing systems are desirable.
According to a first aspect, there is provided a control system for controlling or monitoring a real physical system, wherein the control system includes a hybrid combination of a classical computer and a quantum computer, wherein the control system is configured to receive input data at the classical computer from the real physical system, wherein the classical computer and the quantum computer are configured to exchange data therebetween, and to use a variational inference arrangement executed on the hybrid combination to process the input data to generate corresponding output data from the classical computer for use in controlling or monitoring operation of the real physical system, wherein the variational inference arrangement is implemented at least in part by using at least one Bayesian network model arrangement either on the classical computer or on the quantum computer, and a Born machine arrangement implemented using the quantum computer.
In some cases, the variational inference arrangement may comprise a Bayesian network model on which inference may be performed. In some cases, the Bayesian network model can be implemented using a classical computer. In some cases, the Bayesian network model can be a trained model. The variational inference arrangement may further comprise a Born machine. In some examples, the Born machine may be trained to perform inference on the Bayesian network model. In some implementations, the Born machine may comprise a quantum circuit (e.g., a parametrized quantum circuit) implemented on the quantum computer. In some cases, the Bayesian network model and the Born machine may both be implemented on the quantum computer.
The invention is of advantage in that the use of the at least one model, namely at least one generative model, implemented using the Born machine arrangement on the quantum computer is capable of providing an enhanced degree of inference and control accuracy in respect of the real physical system. Implementing the model using the Born machine arrangement on the quantum computer may improve an accuracy of the control system, and may allow the control system to infer an operating condition of the real physical system controlled or monitored by the control system.
Optionally, for the control system, the Born machine arrangement of the variational inference arrangement (e.g., used to perform inference on the at least one Bayesian network model arrangement) is configured to be taught by using an objective function for at least one of:
(i) adversarial teaching based on minimizing a Kullback-Leibler (KL) divergence of the true posterior, relying on a classifier that estimates probability ratios; and
(ii) teaching using a kernelized Stein discrepancy (KSD) requiring explicit priors and likelihoods.
Optionally, for the control system, the Bayesian network model arrangement includes a nested series of models, for example a nested series of hidden Markov models (HMMs).
Optionally, the control system is configured to infer an operating condition of the physical system from an error signal used to compensate for deviations in operation of the physical system relative to a learnt representation of the physical system, wherein the learnt representation of the physical system is implemented using the at least one Bayesian network model arrangement that is implemented using either the classical computer or the quantum computer or a hybrid combination thereof.
In some cases, the classical computer is in communication with the quantum computer, and executes program instructions that are stored in a non-transitory memory of the quantum computing system. In some cases, the non-transitory memory can be a memory of the classical computer.
In some implementations, the control system may include at least one conventional classical computer (also referred to as classical computer or binary data computer) and at least one quantum computer in communication with the classical computer. In some cases, the classical computer may be in communication with the quantum computer. The classical computer may exchange data with the quantum computer via one or more data links or a data interface.
In some examples, the operation of the quantum computer may be controlled by the classical computer or a separate quantum computer controller (e.g., using an FPGA), based on computer executable instructions stored in a non-transitory memory of the classical computer or the quantum computer controller.
In some cases, the control system (e.g., the classical computer in the control system) may include at least one electronic processor (also referred to as a hardware processor) and a non-transitory memory configured to store data and specific computer-executable instructions. The electronic processor may be in communication with the memory and may be configured to execute specific computer-executable instructions to perform at least a portion of the computational and control tasks associated with receiving input data, processing the input data using the variational inference arrangement, and generating output data.
According to a second aspect, there is provided a method of using a control system for controlling or monitoring a real physical system, wherein the control system includes a hybrid combination of a classical computer and a quantum computer, and wherein the method includes:
receiving input data at the classical computer from the real physical system;
exchanging data between the classical computer and the quantum computer; and
using a variational inference arrangement executed on the hybrid combination to process the input data to generate corresponding output data from the classical computer for use in controlling or monitoring operation of the real physical system, wherein the variational inference arrangement is implemented at least in part by using at least one Bayesian network model arrangement either on the classical computer or on the quantum computer or both, and a Born machine arrangement implemented using the quantum computer.
Optionally, the method includes arranging for the at least one Bayesian network model arrangement, for example implemented using a hidden Markov model arrangement, to include a nested series of models, wherein at least one of the series of models is implemented using the classical or quantum computer or both. In some examples, the at least one Bayesian network model arrangement may be implemented using models different from hidden Markov models, for example explicitly defined models.
Optionally, the method includes configuring at least one Born machine model arrangement of the variational inference arrangement to be taught by using an objective function for at least one of:
(i) adversarial teaching based on minimizing a Kullback-Leibler (KL) divergence of the true posterior, relying on a classifier that estimates probability ratios; and
(ii) teaching using a kernelized Stein discrepancy (KSD) requiring explicit priors and likelihoods.
Optionally, the method includes configuring the control system to infer an operating condition of the real physical system from an error signal used to compensate for deviations in operation of the real physical system relative to a learnt representation of the real physical system, wherein the learnt representation of the real physical system is implemented using the at least one Bayesian network model arrangement that is at least partially implemented using the quantum computer.
According to a third aspect, there is provided a software product, wherein the software product includes executable code for causing a data processor to implement the method of the second aspect.
In some cases, the method of the second aspect may be implemented, at least partially, by using data processing hardware to execute specific instructions stored in a machine-readable data storage medium.
Embodiments of the disclosure will be described with reference to the following diagrams, wherein:
The present disclosure relates to computer systems that include a hybrid combination of classical computers and quantum computers that mutually exchange data therebetween; for example, the computer systems are configured to receive input data from apparatus and to use variational inference implemented at least in part on the quantum computers to process the input data to generate corresponding output data from the classical computers for use in controlling operation of the apparatus; the computer systems are, for example, implemented as control systems. Moreover, the present disclosure relates to methods for using computer systems that include a hybrid combination of classical computers and quantum computers that mutually exchange data therebetween; for example, the methods include configuring the computer systems to receive input data from apparatus and using variational inference implemented at least in part on the quantum computers to process the input data to generate corresponding output data from the classical computers for use in controlling operation of the apparatus.
Probabilistic graphical models can be used to describe dependencies of variables in complex systems, for example dependencies of random variables. Examples of probabilistic graphical models include Bayesian networks and Markov networks, for example hidden Markov models (HMM). Such models are capable of being used to perform two important tasks: learning and inference. Such learning yields a model that approximates observed data distributions of the complex systems. Moreover, inference uses the model to answer queries about unobserved variables given observations of other variables. Contemporarily, in general, exact inference is computationally intractable, so producing approximate solutions to queries becomes a desired goal. Improving an accuracy of the approximate solutions is a technical problem that the present disclosure seeks to address.
Probabilistic graphical models may be used in applications where an inference is to be derived; for example, the applications can include health care and medicine (e.g., triage and clinical diagnosis), biology, genetics and forensics, finance and fault diagnosis, question answering systems for troubleshooting and customer assistance, and the like. Other example applications include: optimal control of mechanical systems (e.g., robotic arms) whose state can be inferred via the use of positional sensors or cameras, and object tracking via cameras, radar, infrared and other electromagnetic wave sensors. Such applications have a crucial requirement for quantifying an uncertainty of conclusions inferred from the models. Quantifying the uncertainty is a technical problem. The technical problem is susceptible to being addressed by using a posterior distribution to quantify the uncertainty. Moreover, the posterior distribution can be used in downstream tasks such as determining a likeliest configuration of unobserved variables, which best explains observed data.
The approximate inference methods may broadly fall into two categories: (i) Markov chain Monte Carlo (MCMC) methods, and (ii) variational inference (VI) methods.
The MCMC methods produce samples from a true posterior distribution in an asymptotic limit. Conversely, VI methods are machine learning techniques that cast inference as an optimization problem over a parameterized family of probability distributions. A technical problem that the present disclosure seeks to address is how to improve these methods, for example in practical real-time control applications.
In some cases, MCMC methods may be implemented using quantum computing resources, e.g., by replacing standard MCMC with quantum annealing hardware when training some types of Bayesian networks and Markov networks. Despite initially promising empirical results, it has been found to be very difficult to show that using quantum computing resources is susceptible to providing any benefit. However, it has been considered that using gate-based quantum computers would be a more promising approach; there exist algorithms for implementing MCMC methods with proven asymptotic advantage, but they require error correction and other features that are not available in contemporary quantum computers.
Thus, an objective technical problem addressed in the present disclosure relates to providing an improved configuration of a quantum computer that uses variational inference (VI), for example to control an operation of an apparatus in a more optimal manner.
According to a first aspect, there is provided a control system for controlling or monitoring a real physical system, wherein the control system includes a hybrid combination of a classical computer and a quantum computer, wherein the control system is configured to receive input data at the classical computer from the real physical system, wherein the classical computer and the quantum computer are configured to exchange data therebetween, and to use a variational inference arrangement executed on the hybrid combination to process the input data to generate corresponding output data from the classical computer for use in controlling or monitoring operation of the real physical system, wherein the variational inference arrangement is implemented at least in part by using at least one Bayesian network model arrangement either on the classical computer or on the quantum computer or both, and a Born machine arrangement implemented using the quantum computer.
According to a second aspect, there is provided a method of using a control system for controlling or monitoring a real physical system, wherein the control system includes a hybrid combination of a classical computer and a quantum computer, and wherein the method includes:
receiving input data at the classical computer from the real physical system;
exchanging data between the classical computer and the quantum computer; and
using a variational inference arrangement executed on the hybrid combination to process the input data to generate corresponding output data from the classical computer for use in controlling or monitoring operation of the real physical system, wherein the variational inference arrangement is implemented at least in part by using at least one Bayesian network model arrangement either on the classical computer or on the quantum computer or both, and a Born machine arrangement implemented using the quantum computer.
According to a third aspect, there is provided a software product, wherein the software product includes executable code for causing a data processor to implement the method of the second aspect.
The present disclosure is concerned with performing variational inference (VI) using a quantum computer, for example using a noisy intermediate scale quantum (NISQ) computer; beneficially, the VI is used in a situation for improving control of operation of a real physical system to enhance its operation, for example to provide better control accuracy, better energy efficiency, better operating performance, better operating efficiency, faster response, better safety of operation, and so forth. Embodiments of the present disclosure benefit from improving inference that is achievable using classical probabilistic models by using quantum computer processing resources. For example, embodiments of the present disclosure use “Born machines” implemented via parameterized quantum circuits, wherein such machines are capable of functioning as quantum machine learning models that exhibit high expressivity. Moreover, embodiments of the present disclosure beneficially use gradient-based methods and amortization in the training phase of quantum generative models such as Born machines, as will be elucidated in greater detail below.
Some embodiments of the present disclosure optionally use unsupervised learning methods; however, such unsupervised learning methods rely on complexity and learning theory arguments for distributions, and on quantum non-locality and contextuality. Moreover, embodiments of the present disclosure use variational inference (VI), as aforementioned, that is applicable to real physical systems, for example for controlling complex manufacturing apparatus, robotics systems, self-driving vehicles and so forth. Results of the VI can be used at decision points in control algorithms that determine operation of the aforesaid real physical systems.
Next, in general overview, VI that is used in embodiments of the present disclosure will be elucidated in greater detail. When developing a probabilistic model p over some set of random variables y, the variables y can be continuous or discrete, depending on the situation. Moreover, there will be available evidence for some variables in the model p; these variables that are supported by evidence x⊆y are then observed, for example measured, namely fixed to values of the evidence, wherein it is convenient to use a vector notation x to denote a realization of these observed variables. It is assumed that x∼p(x), meaning that the probabilistic model p is able to capture a distribution of observed data. In an event that it is desirable to infer a posterior distribution of unobserved variables of the model p, these are conveniently included in a set z:=y\x. Denoting these unobserved data by a vector z, it is desirable to compute the posterior distribution p(z|x), namely a conditional probability of z given x. By definition, the conditional probability can be expressed as a joint divided by a marginal, namely: p(z|x)=p(x,z)/p(x). Moreover, the joint can be written as p(x,z)=p(x|z)p(z). Conveniently, Bayes' theorem combines the two identities and yields a relationship: p(z|x)=p(x|z)p(z)/p(x).
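To render the foregoing identities concrete, there follows a minimal Python sketch that computes a posterior by direct application of Bayes' theorem to a toy two-variable discrete model; the numerical values of the prior p(z) and the likelihood p(x|z) are illustrative assumptions only and are not taken from the present disclosure.

```python
import numpy as np

# Hypothetical two-variable toy model: latent z in {0,1}, observed x in {0,1}.
p_z = np.array([0.7, 0.3])            # prior p(z)
p_x_given_z = np.array([[0.9, 0.1],   # likelihood p(x|z=0)
                        [0.2, 0.8]])  # likelihood p(x|z=1)

x = 1                                 # observed evidence
joint = p_x_given_z[:, x] * p_z       # p(x,z) = p(x|z) p(z)
marginal = joint.sum()                # p(x) = sum over z of p(x,z)
posterior = joint / marginal          # Bayes: p(z|x) = p(x,z) / p(x)
print(posterior)                      # conditional probability of z given x=1
```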
It will be appreciated that Bayesian networks are known to be targeted to solve inference problems and are beneficially used in embodiments of the present disclosure. A Bayesian network describes a set of random variables with a clear conditional probability structure. This structure is a directed acyclic graph where conditional probabilities are modelled by explicit distributions, for example using tables or neural networks. Referring to
In
Contemporarily, it is found that inference is a computationally hard task to implement using classical (binary) computers, in all but the simplest probabilistic models. Exact inference in Bayesian networks with discrete variables is #P-complete. Moreover, even approximate inference is non-deterministic polynomial-time hard (NP-hard); thus, unless some particular constraints are applied, such inference calculations are intractable using, for example, classical (binary) computers. Embodiments of the present disclosure seek to address such intractable calculations by using quantum computers. Conventionally, when tackling such inference computation using classical (binary) computers, there is performed a “forward pass” and unbiased samples are obtained from a joint (x,z)∼p(x,z)=p(x|z)p(z). However, obtaining unbiased samples from a posterior z∼p(z|x)=p(x,z)/p(x) is intractable due to an associated unknown normalization constant. However, in such a situation, it is feasible to implement MCMC sampling by constructing an ergodic Markov chain whose stationary distribution is a desired posterior. A problem with such ergodic Markov chains is that they may converge slowly in practice. In contrast to ergodic Markov chains, variational inference (VI) is often faster in high dimensions but does not provide guarantees. Without such guarantees, it is therefore a technical risk to apply such VI methods to control systems where safety is a critical issue. It will be appreciated that VI is configured to optimize a variational distribution q by minimizing its “distance” from a true posterior p.
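The contrast between the cheap “forward pass” over the joint and the intractable posterior can be illustrated with a short sketch; the toy distributions below are assumed for illustration only, the point being that ancestral sampling needs no normalization constant.

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy distributions (illustrative values only).
p_z = np.array([0.7, 0.3])
p_x_given_z = np.array([[0.9, 0.1],
                        [0.2, 0.8]])

def sample_joint(size):
    """Ancestral "forward pass": draw z ~ p(z), then x ~ p(x|z)."""
    z = rng.choice(len(p_z), p=p_z, size=size)
    x = np.array([rng.choice(p_x_given_z.shape[1], p=p_x_given_z[zi]) for zi in z])
    return x, z

x, z = sample_joint(size=5)
# Unbiased samples (x,z) ~ p(x,z) are cheap to obtain; by contrast, sampling
# z ~ p(z|x) directly would require the normalization constant p(x), which is
# intractable for large models.
```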
In embodiments of the present disclosure, there is used a quantum Born machine arrangement as a candidate generator for variational distributions. Born machines are capable of functioning as highly expressive models that naturally represent discrete distributions as a result of quantum measurements. Additionally, the quantum Born machine arrangement is susceptible to being trained by gradient-based optimization.
Embodiments of the present disclosure use an inherent probabilistic nature of quantum mechanics to model probability distributions of classical data in one or more simultaneous states within a quantum computer. When considering binary data z∈{0,1}^n, where n is the number of variables in question, the Born machine is a normalized quantum state |ψ(θ)⟩ that is parameterized by θ and that outputs n-bit strings with probabilities qθ(z)=|⟨z|ψ(θ)⟩|². Here, |z⟩ represents computational basis states, wherein sampling the probabilities is implemented as a simple measurement in a quantum computer hosting the Born machines. Optionally, other forms of discrete data can be processed by using suitable encoding. For example, when using amortization, a given variational distribution requires observed variables to be conditioned on before being processed further. Optionally, in embodiments of the present disclosure, the observation x is used as an additional parameter in order to yield a pure state where output probabilities are qθ(z|x)=|⟨z|ψ(θ,x)⟩|². For example, in
It will be appreciated that Born machines are susceptible to being applied for benchmarking hybrid quantum-classical systems, generative modeling, finance, anomaly detection, and for demonstrating quantum advantage. These models can be realized in a variety of ways, in both classical and quantum computers. When the models are realized via certain classes of quantum circuits, they are classically intractable to simulate. A quantum computer cannot efficiently implement arbitrary quantum circuits, thus some Born machines are intractable even for a quantum computer. For example, instantaneous quantum poly-time (IQP) circuits are Born machines with O(poly(n)) parameters that yield classically intractable distributions in an average case under widely accepted complexity theoretic assumptions. Thus, quantum Born machines have expressive power that is larger than that of classical models, including neural networks and, in part, matrix product states. It can be shown that the model remains classically intractable throughout training, which is itself a form of quantum advantage. Prior to the present disclosure, it had not been described how to perform variational inference using models having intractable likelihoods and discrete variables, such as Born machines. Embodiments of the present disclosure seek to address this technical problem of intractable models.
In some embodiments of the present disclosure, it will be appreciated that a useful way to classify probabilistic models is as follows: prescribed models provide an explicit parametric specification of a given distribution, whereas implicit models define only the data generation process. Moreover, it will be appreciated that Born machines can be effectively regarded as implicit models. Furthermore, it will be appreciated that it is feasible to obtain an unbiased sample provided that it is feasible to execute a corresponding circuit on a quantum computer and measure an associated computational basis; however, it requires exponential resources to estimate the distribution even on a quantum computer. It will be appreciated that there exist classes of Born machines for which a dependency between parameters and distribution is clear. For example, when restricting a Born machine to an IQP circuit, amplitudes ⟨z|ψ(θ)⟩ are proportional to partition functions of complex Ising models. Yet, estimating arbitrary amplitudes remains intractable.
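As an illustration of the Born rule qθ(z)=|⟨z|ψ(θ)⟩|², there follows a sketch that classically simulates sampling bit-strings from a small parameterized state; the two-qubit state and the angle θ are hypothetical, and on quantum hardware the sampling step would be a computational-basis measurement rather than a classical pseudo-random draw.

```python
import numpy as np

rng = np.random.default_rng(1)

def born_machine_sample(psi, size):
    """Sample n-bit strings with probabilities q(z) = |<z|psi>|**2.

    `psi` is a normalized state vector of length 2**n; this classical
    simulation stands in for a measurement on the quantum computer.
    """
    probs = np.abs(psi) ** 2
    n = int(np.log2(len(psi)))
    idx = rng.choice(len(psi), p=probs, size=size)
    # Convert basis-state indices to n-bit strings.
    return np.array([[(i >> (n - 1 - b)) & 1 for b in range(n)] for i in idx])

# Hypothetical 2-qubit state parameterized by a single angle theta.
theta = 0.3
psi = np.array([np.cos(theta), 0.0, 0.0, np.sin(theta)])  # normalized state
samples = born_machine_sample(psi, size=10)               # rows are bit-strings
```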
It will be appreciated that implicit models are challenging to train by using standard methods. A major challenge is designing objective functions, precisely because likelihoods are “prohibited”. Valid objectives involve only statistical quantities (such as expectation values) that can be efficiently estimated from samples.
Embodiments of the present disclosure beneficially implement variational inference (VI) by using Born machines. Embodiments of the present disclosure are configured to implement inference methods that apply to classical graphical models and classical data; such methods are approximate, efficient, and apply also to graphical models that are not Bayesian networks.
Next, operator variational inference (OPVI) will be described in greater detail. OPVI is a general method that uses mathematical operators to design objectives for an approximate posterior. Suitable operators to use in the OPVI are those for which:

sup_{f∈ℱ} h(𝔼_{z∼qθ(z|x)}[(O^{p,q}f)(z)])  Eq. (1)

wherein f(·)∈ℝ^d is a test function within a chosen family ℱ, O^{p,q} is an operator that depends on p(z|x) and q(z|x), and h(·)∈[0,∞] yields a non-negative objective.
In some embodiments, two methods are used that follow directly from two operator choices. The operator choices result in objectives based on the Kullback-Leibler (KL) divergence and the Stein discrepancy. The former utilizes an adversary to make a computation tractable, whereas in the latter, tractability arises from computing a kernel function.
The KL divergence is an example of an f-divergence, while the Stein discrepancy is in a class of integral probability metrics, namely two fundamentally different families of probability distance measures. OPVI can therefore yield methods from these two different families under a suitable choice of operator. However, these two families intersect non-trivially only at the total variation distance (TVD). It is for this reason that TVD is used as a benchmark in embodiments of the present disclosure.
Optionally, an objective function for VI used in embodiments of the present disclosure is the Kullback-Leibler (KL) divergence of the true posterior relative to a corresponding approximate posterior; this is regarded as being an adversarial method. This objective function is obtainable from Eq. (1) by choosing f and h to be identity functions, and by choosing the operator O^{p,q}=log qθ(z|x)−log p(z|x), which yields:

KL(θ)=𝔼_{x∼p(x)}𝔼_{z∼qθ(z|x)}[log qθ(z|x)−log p(z|x)]  Eq. (2)

wherein qθ is a variational distribution parameterized by θ. The objective's minimum is zero and is attained by the true posterior qθ(z|x)=p(z|x), ∀x.
By using a prior-contrastive approach, substituting Bayes' formula p(z|x)=p(x|z)p(z)/p(x) into Eq. (2) provides:

KL(θ)=𝔼_{x∼p(x)}𝔼_{z∼qθ(z|x)}[log(qθ(z|x)/p(z))−log p(x|z)]−H[p(x)]  Eq. (3)

wherein the entropy H[p(x)]:=−𝔼_{x∼p(x)}[log p(x)] is constant with respect to qθ and can be ignored. There is assumed an explicit conditional p(x|z) that can be efficiently computed, for example when the observed x are “leaves” of a Bayesian network.
In some embodiments, there is beneficially used a Born machine to model a variational distribution qθ(z|x)=|⟨z|ψ(θ,x)⟩|². There is thereby provided an implicit model, wherein the ratio qθ(z|x)/p(z) cannot be computed efficiently. Thus, embodiments of the present disclosure are configured to make use of an adversarial method for estimating the aforesaid ratio approximately. It will be appreciated that the ratio can be estimated from an output of a binary classifier. Thus, ascribing samples (z,x)∼qθ(z|x)p(x) to a first class, and samples (z,x)∼p(z)p(x) to a second class, and defining a binary classifier dϕ, parameterized by ϕ, that outputs a probability dϕ(z,x) that the pair (z,x) belongs to the first class, there is derived 1−dϕ(z,x) that indicates a probability that (z,x) belongs to the second class. However, it will be appreciated that there exist many possible choices of objective function for the classifier. It is convenient therefore to consider a cross entropy:
ℒ_KL(ϕ;θ)=𝔼_{x∼p(x)}[𝔼_{z∼qθ(z|x)}[log dϕ(z,x)]+𝔼_{z∼p(z)}[log(1−dϕ(z,x))]]  Eq. (4)

wherein an optimal classifier that makes the equation Eq. (4) a maximum is given by:

dϕ*(z,x)=qθ(z|x)/(qθ(z|x)+p(z))  Eq. (5)

Since the probabilities in Eq. (5) are unknown, the classifier must be trained on a dataset of samples. Such training does not pose a problem because samples from the Born machine qθ(z|x) and the prior p(z) are easy to obtain by assumption. Once the classifier is trained, the logit transformation provides the log-odds of a data point coming from the Born machine joint qθ(z|x)p(x) versus the prior joint p(z)p(x). The log-odds are approximations to the log-ratio of the two distributions:

logit(dϕ(z,x)):=log(dϕ(z,x)/(1−dϕ(z,x)))≈log(qθ(z|x)/p(z))  Eq. (6)
which is exact if dϕ is the optimal classifier in Eq. (5). It is thereby possible to avoid having to compute the problematic ratio term in the KL divergence directly. Applying this result to Eq. (3) and ignoring constant terms, the final objective for the Born machine is:

ℒ_KL(θ;ϕ)=𝔼_{x∼p(x)}𝔼_{z∼qθ(z|x)}[logit(dϕ(z,x))−log p(x|z)]  Eq. (7)
The optimization can be performed in tandem as:

ϕ*=argmax_ϕ ℒ_KL(ϕ;θ), θ*=argmin_θ ℒ_KL(θ;ϕ)  Eq. (8)

using gradient ascent and descent, respectively. It will be appreciated that the expectation under qθ of the gradient of log qθ(z|x) with respect to θ vanishes. Thus, under the assumption of an optimal classifier dϕ, the gradient of Eq. (7) is significantly simplified. The gradient of Eq. (8) is derived in APPENDIX A.
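A classical stand-in can illustrate the tandem optimization of Eq. (8); in the sketch below a softmax model plays the role of the Born machine qθ(z), and a tabular logistic classifier plays the role of dϕ, so the uniform prior, the stand-in likelihood and the step sizes are all assumptions rather than the disclosed quantum implementation.

```python
import numpy as np

rng = np.random.default_rng(2)

n_states = 8                                  # bit-strings of n = 3 variables
p_prior = np.full(n_states, 1.0 / n_states)   # assumed uniform prior p(z)
log_lik = rng.normal(size=n_states)           # stand-in for log p(x|z)

theta = np.zeros(n_states)                    # q_theta(z) = softmax(theta)
phi = np.zeros(n_states)                      # tabular classifier logits

for step in range(500):
    q = np.exp(theta - theta.max()); q /= q.sum()
    # Ascent on the classifier objective (Eq. 4): the optimal tabular
    # classifier satisfies logit(d_phi) = log(q/p), so nudge phi toward it.
    phi += 0.5 * (np.log(q) - np.log(p_prior) - phi)
    # Descent on the Born machine objective (Eq. 7), using the softmax
    # score-function gradient of E_q[logit(d_phi) - log p(x|z)].
    obj = phi - log_lik
    theta -= 0.1 * q * (obj - np.dot(q, obj))

# q now approximates the posterior, i.e. q(z) proportional to
# p_prior(z) * exp(log_lik(z)).
```

On a quantum computer, the descent step would instead act on circuit parameters, for example via gradient estimates obtained from measurement samples.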
A more intuitive interpretation of the procedure just described in the foregoing is as follows. The log-likelihood in Eq. (3) can be expanded by using Bayes' formula as log p(x|z)=log(p(z|x)/p(z))+log p(x), wherein Eq. (3) can be rewritten as:

𝔼_{x∼p(x)}𝔼_{z∼qθ(z|x)}[log(qθ(z|x)/p(z))−log(p(z|x)/p(z))]
Comparing the expression in the square brackets with Eq. (6) reveals that the objective is a difference between two log-odds, wherein the first log-odds is given by an optimal classifier for the approximate posterior and prior, and the second log-odds is given by a hypothetical classifier for the true posterior and prior. The adversarial method as used in embodiments of the present disclosure is illustrated in
As an alternative to using the aforesaid adversarial method in embodiments of the present disclosure, an alternative kernelized method can be employed, as will next be described in greater detail.
Beneficially, another possible objective function for variational inference (VI) as used in embodiments of the present disclosure is the Stein discrepancy (SD) of the true posterior from the approximate one. The Stein discrepancy is obtained from Eq. (1) by assuming that the image of f has the same dimension as z, choosing h to be the absolute value, and choosing O^{p,q} to be a Stein operator. A Stein operator is independent of q and is characterized by having zero expectation under the true posterior for all functions f in the chosen family ℱ. Then, for binary variables, a possible Stein operator is (O^p f)(z)=s_p(x,z)^T f(z)−tr(Δf(z)), wherein:

s_p(x,z)_i=1−p(x,¬_i z)/p(x,z)  Eq. (9)

is the difference function, and

(Δf(z))_ij=(f(z))_j−(f(¬_i z))_j  Eq. (10)

is the partial difference operator, wherein ¬_i z flips the i-th bit in binary vector z. It is shown in APPENDIX B that this is a valid Stein operator for binary variables. Plugging these definitions into Eq. (1), there is obtained:

SD[qθ(z|x)∥p(z|x)]=sup_{f∈ℱ}|𝔼_{z∼qθ(z|x)}[s_p(x,z)^T f(z)−tr(Δf(z))]|  Eq. (11)
At this point, it is feasible to parameterize the test function f and obtain an adversarial objective that is similar to the aforesaid examples of adversarial functions. However, by restricting the Hilbert space norm of f to be at most 1, the supremum in Eq. (11) can be calculated in closed form by using a kernel. A result is thereby obtained that is the kernelized Stein discrepancy (KSD):
KSD[qθ(z|x)∥p(z|x)]=√(𝔼_{z,z′∼qθ(z|x)}[k_p(z,z′|x)])  Eq. (12)

wherein k_p is the Stein kernel. For binary variables, the Stein kernel in Eq. (12) can be written as:

k_p(z,z′|x)=s_p(x,z)^T k(z,z′)s_p(x,z′)−s_p(x,z)^T Δ_{z′}k(z,z′)−Δ_z k(z,z′)^T s_p(x,z′)+tr[Δ_{z,z′}k(z,z′)]  Eq. (13)
The Stein kernel, k_p, depends on another kernel, k. For n unobserved Bernoulli variables, one possibility is the generic Hamming kernel k(z,z′)=exp(−H(z,z′)/n), wherein H(z,z′) denotes the Hamming distance between the bit-strings z and z′.
The KSD is a valid discrepancy measure if this “internal” kernel, k, is positive definite, which is the case for the Hamming kernel. Thus, in summary, constraining ∥f∥≤1 in Eq. (11) and substituting the KSD, there is thereby obtained:
ℒ_KSD(θ)=𝔼_{x∼p(x)}[KSD[qθ(z|x)∥p(z|x)]]  Eq. (14)
and the problem consists of finding min_θ ℒ_KSD(θ). The gradient of Eq. (14) is derived in APPENDIX C. Moreover, the kernelized method is illustrated in
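For binary variables, the kernelized objective of Eqs. (12) to (14) can be sketched as follows; the helper names are illustrative, the internal kernel is the generic Hamming kernel described above, and an explicit joint p(x,z) is assumed to be supplied as a callable.

```python
import numpy as np

def flip(z, i):
    """Return a copy of binary vector z with the i-th bit flipped."""
    z2 = z.copy(); z2[i] ^= 1
    return z2

def hamming_kernel(z, zp, n):
    return np.exp(-np.sum(z != zp) / n)

def score_diff(p_joint, x, z):
    """Difference function of Eq. (9): s_p(x,z)_i = 1 - p(x, flip_i z)/p(x,z)."""
    pz = p_joint(x, z)
    return np.array([1.0 - p_joint(x, flip(z, i)) / pz for i in range(len(z))])

def stein_kernel(p_joint, x, z, zp):
    """Stein kernel k_p(z,z'|x) of Eq. (13) with a scalar Hamming kernel."""
    n = len(z)
    k = lambda a, b: hamming_kernel(a, b, n)
    s, sp = score_diff(p_joint, x, z), score_diff(p_joint, x, zp)
    # Partial differences of the kernel in each argument.
    dz = np.array([k(z, zp) - k(flip(z, i), zp) for i in range(n)])
    dzp = np.array([k(z, zp) - k(z, flip(zp, j)) for j in range(n)])
    tr = sum(k(z, zp) - k(flip(z, i), zp) - k(z, flip(zp, i))
             + k(flip(z, i), flip(zp, i)) for i in range(n))
    return k(z, zp) * (s @ sp) - s @ dzp - dz @ sp + tr

def ksd(p_joint, x, samples):
    """KSD estimate of Eq. (12): sqrt of the mean Stein kernel over sample pairs."""
    vals = [stein_kernel(p_joint, x, z, zp) for z in samples for zp in samples]
    return np.sqrt(max(np.mean(vals), 0.0))

# Hypothetical usage: ksd(p_joint, x_obs, [np.array([0, 1, 0]), np.array([1, 1, 0])])
```

In expectation, the estimate vanishes when the samples are drawn from the true posterior, which is what makes minimizing it over θ a valid training objective.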
When developing embodiments of the present disclosure, various experiments were performed to verify operation of the embodiments. Methods used in embodiments of the present disclosure were validated using a canonical “sprinkler” Bayesian network. Moreover, the envisaged applications of the embodiments also encompass regimes where there are continuous observed variables, demonstrated on a hidden Markov model (HMM), wherein multiple observations can be incorporated effectively via amortization. Furthermore, there is also considered a larger discrete model, a “lung cancer” Bayesian network, for demonstrating the methods using a quantum computer, for example a quantum computer as hosted by IBM.
Embodiments of the present disclosure can be verified as a proof of principle by using a “sprinkler” Bayesian network. For classically simulating a method of the disclosure on the “sprinkler” network of
For example, for the KL objective, there is beneficially utilized a multi-layer perceptron (MLP) classifier comprising 3 input units, 6 ReLU hidden units and 1 sigmoid output unit; however, it will be appreciated that other numbers of input units, ReLU hidden units and sigmoid output units can optionally be used. Moreover, beneficially, the classifier is trained with a dataset of 100 samples from a prior p(C, R, S) and 100 samples from a Born machine q(C, R, S|W=tr). There is beneficially used stochastic gradient descent with batches of size 10 and a learning rate of 0.03. For the KSD objective, there is beneficially used a Hamming kernel of a type as aforementioned. For both the KL objective and the KSD objective, there is computed a total variation distance (TVD) of the true and approximate posteriors at each epoch. However, TVD cannot be efficiently computed in general and is shown only for small examples.
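A sketch of the classifier just described, using a hypothetical PyTorch implementation, is given below; the random stand-in tensors would in practice be replaced by actual draws from the prior p(C, R, S) and from the Born machine q(C, R, S|W=tr).

```python
import torch
import torch.nn as nn

# MLP classifier: 3 input units, 6 ReLU hidden units, 1 sigmoid output unit.
classifier = nn.Sequential(
    nn.Linear(3, 6), nn.ReLU(),
    nn.Linear(6, 1), nn.Sigmoid(),
)
optimizer = torch.optim.SGD(classifier.parameters(), lr=0.03)
loss_fn = nn.BCELoss()

# 100 samples from the Born machine (label 1) and 100 from the prior (label 0);
# random placeholders are used here instead of real samples.
born_samples = torch.randint(0, 2, (100, 3)).float()
prior_samples = torch.randint(0, 2, (100, 3)).float()
data = torch.cat([born_samples, prior_samples])
labels = torch.cat([torch.ones(100, 1), torch.zeros(100, 1)])

for epoch in range(50):
    perm = torch.randperm(len(data))
    for i in range(0, len(data), 10):          # batches of size 10
        idx = perm[i:i + 10]
        optimizer.zero_grad()
        loss = loss_fn(classifier(data[idx]), labels[idx])
        loss.backward()
        optimizer.step()

# Per Eq. (6), the trained classifier's logit approximates log q(z|x)/p(z).
```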
Next, continuous observed variables and amortization with a hidden Markov model will be described.
When using the adversarial variational inference (VI) method in a hidden Markov model (HMM) in the example shown in
These represent an unknown “regime” at a time t. The regime affects how the observable data is generated. There are used Gaussian observed variables (filled circles in
Beneficially, when two time series observations x(1), x(2) are sampled from the HMM, the sampled observations can be used as an empirical data distribution. These time series are illustrated in
Instead of fitting two approximate posteriors separately, there is beneficially used a single Born machine with amortization |ψ(θ,x)⟩. There is used the Ansatz in
Histograms included in
Rather than focusing on the mode, it is feasible to make use of the whole distribution to estimate some quantity of interest. Such use of the whole distribution is achieved by taking samples from the Born machine and using them in a Monte Carlo estimate. For example, it is feasible to predict an expected value of a next latent variable z_{T+1}, given available observations. For a datapoint x^{(1)}, making such a prediction entails computing an estimation of:

𝔼[z_{T+1}|x^{(1)}]=𝔼_{z∼qθ(z|x^{(1)})}[𝔼_{z_{T+1}∼p(z_{T+1}|z_T)}[z_{T+1}]]

wherein the outer expectation is estimated by a Monte Carlo average over samples z drawn from the Born machine, and the inner expectation is available in closed form from the known transition probabilities of the HMM.
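A minimal sketch of this Monte Carlo estimate follows; the posterior sampler and the transition probabilities are placeholders standing in for the trained Born machine and for the known transition model of the HMM.

```python
import numpy as np

rng = np.random.default_rng(3)

T = 4                                 # number of time steps, as above
p_next = np.array([0.1, 0.8])         # assumed p(z_{T+1}=1 | z_T=0) and p(z_{T+1}=1 | z_T=1)

def sample_posterior(n_samples):
    """Placeholder for z ~ q_theta(z|x^(1)); returns binary paths z_1..z_T."""
    return rng.integers(0, 2, size=(n_samples, T))

z = sample_posterior(1000)
estimate = np.mean(p_next[z[:, -1]])  # Monte Carlo average of E[z_{T+1} | z_T]
```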
Next, use of IBMQ with the aforesaid “lung cancer” Bayesian network will be described. The lung cancer network (also referred to as an “Asia” network) is an example of a medical diagnosis Bayesian network, as illustrated in
Based on the graph structure in
p(A)p(T|A)p(X|I)p(I|T,L)p(D|B,I)p(B|S)p(L|S)p(S)  Eq. (18)
In APPENDIX D, there is provided an explicit probability table (Table I) for completeness. Modifying an illustrative example of a potential “real-world” use case in [47], a given patient potentially presents in a clinic with an “illness” (I=tr) but no shortness of breath (D=fa) as a symptom. Furthermore, an X-ray reveals a negative result (X=fa). However, there is no patient history available and there is no knowledge available regarding which underlying disease is actually present. Beneficially, there is conditioned on the observed “evidence” variables, namely x: X, D and I. The remaining five are the latent variables, z, so 5 qubits are required to perform the computation.
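For a model of this size, the exact posterior over the five latent variables can also be enumerated by brute force, which is useful as a reference when benchmarking the Born machine output (for example via the TVD); in the sketch below, the `joint` callable implementing the factorization of Eq. (18) is assumed to be available, for example built from the probability tables of APPENDIX D.

```python
import itertools

def posterior(joint, evidence):
    """Enumerate p(z|x) over all 2**5 latent configurations (A, T, S, L, B)."""
    weights = {}
    for z in itertools.product([0, 1], repeat=5):
        latents = dict(zip("ATSLB", z))
        weights[z] = joint(latents, evidence)       # unnormalized p(x, z)
    total = sum(weights.values())                   # marginal p(x)
    return {z: w / total for z, w in weights.items()}

# Hypothetical usage with evidence X=fa, D=fa, I=tr:
# post = posterior(joint, {"X": 0, "D": 0, "I": 1})
```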
From the foregoing, it will be appreciated that two variational inference (VI) methods as described in the foregoing are susceptible to being used in embodiments of the present disclosure. By implementing the VI methods using a Born machine hosted on a quantum computer, highly expressive posteriors can be given by quantum models, for example using hidden Markov models (HMM).
The first VI method is based on minimizing the Kullback-Leibler (KL) divergence of the true posterior and relies on a classifier that estimates probability ratios; the resulting adversarial training may be challenging due to there being a large number of hyper-parameters and stability issues. Moreover, the first method requires an ability to (i) sample from the prior p(z), (ii) sample from the Born machine qθ(z|x), and (iii) calculate the likelihood p(x|z). It is feasible to apply the method even when the prior is implicit (for example, as in a quantum-assisted Helmholtz machine).
The second VI method is based on a kernelized Stein discrepancy (KSD). However, a limitation of the second method is that it requires explicit priors and likelihoods. Advantageously, the second method provides plenty of flexibility in the choice of kernel. Beneficially, a generic Hamming kernel is used to compute similarities between bit-strings.
In the first and second methods used in embodiments of the present disclosure, the posterior is approximated by using traced out additional qubits. Such a solution allows trading a larger number of qubits for reduced circuit depth for a given constant expressivity of the hidden Markov model used.
Next, some practical embodiments of the present disclosure will be described, starting from first principles.
Referring to
In some implementations, the quantum computing arrangement 10 may include at least one classical (binary) computer 20 coupled in combination with at least one quantum computer 30. Optionally, the classical (binary) computer 20 and the quantum computer 30 are spatially co-located. Alternatively, optionally, the classical (binary) computer 20 and the quantum computer 30 are spatially mutually remote but are coupled together via a data communication network, for example a wireless network or the Internet, for example as in the Internet of Things (“IoT”) when embodiments of the present disclosure are implemented as a distributed interconnected array of IoT devices. The classical computer 20 may include a binary data memory 60 and a processing arrangement 50, for example an array of electronic processors. The classical computer 20 is coupled to an input/output interface 40 that is configured to receive external data Dinput and also to output data Doutput.
In some embodiments, the quantum computer 30 may include an Ansatz configuration module 70, an array of qubits 80, and a qubit measuring arrangement 90. In operation, the Ansatz configuration module 70 is used to configure initial values (or an initial quantum state) of the qubits 80. In some cases, the quantum computer 30 may comprise one or more quantum gates configured to operate on the array of qubits (e.g., to transform an initial quantum state of one or more qubits in the qubit array to a final quantum state). Moreover, the Ansatz configuration module 70 also configures circuits of quantum gates to operate on the qubits 80. The qubit measuring arrangement 90 is used to measure values of the qubits 80 when the aforesaid quantum circuit (akin to a temporal algorithm executed on the qubits 80) has been executed on the qubits 80. In some cases, the number of quantum operations performed by the quantum circuit and/or the longest path in the quantum circuit may be referred to as the depth of the quantum circuit. In some cases, a path in the quantum circuit may comprise a sequence of quantum operations performed to transform the initial quantum states to the final quantum states. The quantum circuit has a finite depth on account of quantum noise arising in the qubits 80 that would ultimately result in decoherence of the qubits 80.
In some embodiments, one or more Born machines 100 may be implemented using the qubits and/or the quantum circuits, wherein the Born machines 100 are used to implement one or more models 110. In some cases, the one or more models may comprise one or more hidden Markov models (HMMs). In some cases, the one or more models may be associated with one or more hidden Markov models (HMMs). As will be described in more detail below, the one or more models 110 can be configured in layers, namely in a “nested” configuration. Moreover, in some cases, the classical computer 20 may be configured to compute one or more hidden Markov models of the aforesaid one or more hidden Markov models (HMMs) implemented using the quantum computer 30. In various implementations, the quantum computer 30 can be implemented using cryogenically cooled Josephson junctions, ion traps, or quantum photonics devices (e.g., integrated and on-chip quantum photonic devices), but is not limited thereto. In some cases, the classic computer 20 may be integrated with the quantum computer 30 on a single platform. For example, the classic computer 20 may be implemented using conventional integrated circuits (e.g., silicon-based integrated circuits) and the quantum computer can be implemented using integrated photonic circuits (e.g., based on silicon photonic platforms). Moreover, the quantum computer 30 beneficially includes in a range of 10 to 500 qubits, optionally in a range of 50 to 500 qubits.
Referring next to
As described above,
In some implementations, the hidden Markov model 120 shown in
From the models, for example hidden Markov models, it can be feasible to determine inferences regarding an operating status of the real physical system 160, and also make inferences regarding how to control the real physical system 160. From the foregoing, it will be appreciated that two variational inference (VI) methods as described in the foregoing may be used in various embodiments of the present disclosure. In some cases, by implementing the VI methods using the Born machine hosted on the quantum computer 30, highly expressive posteriors can be given by using Bayesian network models, for example hidden Markov models (HMMs), that more accurately represent what is occurring in the real physical system.
In some examples, the first VI method used can be based on minimizing the Kullback-Leibler (KL) divergence of the true posterior and relies on a classifier that estimates probability ratios; the resulting adversarial training may be challenging due to there being a large number of hyper-parameters and stability issues. Moreover, the first method requires an ability to (i) sample from the prior p(z), (ii) sample from the Born machine qθ(z|x), and (iii) calculate the likelihood p(x|z).
In some cases, the method may be applied even when the prior is implicit (for example, as in a quantum-assisted Helmholtz machine). Alternatively or additionally, the second VI method used can be based on a kernelized Stein discrepancy (KSD). In some cases, the second method can be limited by the requirement of having explicit priors and likelihoods. Advantageously, the second method may provide plenty of flexibility in the choice of kernel. Beneficially, a generic Hamming kernel is used to compute similarities between bit-strings.
In some cases, the real physical system 160 can be: a self-driving vehicle, an aircraft, an airport facility, a robotic device, a manufacturing facility, a renewable energy facility, a greenhouse for horticulture, a nuclear power station, an energy storage facility, an electric power grid, an engine, a collection of sensors monitoring the operation of an industrial facility, a financial electronic exchange, a computer network, or a question answering system for troubleshooting or medical triage, but is not limited thereto. In some cases, measured signals from the real physical system 160, for example derived from one or more sensors or control inputs, may be provided to the classical computer 20 and processed by the quantum computing arrangement 10 to generate output signals to control operation of the real physical system 160, for example via actuators, motors, hydraulic valves, pneumatic valves, and so forth. In some cases, the one or more sensors may include at least one of: optical sensors, gas sensors, pH sensors, cameras, tactile sensors, chemical sensors, temperature sensors, radar imaging sensors, ultrasound imaging sensors, ionizing radiation sensors, inertial navigation sensors (INS), humidity sensors, microphones, switches, joysticks, biosensors, and so forth.
Some embodiments of the present disclosure optionally use one or more generative models, for example one or more Bayesian network models or one or more hidden Markov models (HMMs), for performing variational inference by teaching the one or more Born machine models to represent the real physical system 160 and then interrogating the one or more Born machine models to derive an inference regarding operation of the real physical system 160 and how to control its operation; for example, to control steering, signaling, applied engine or electric motor power and braking when the real physical system 160 is a self-driving vehicle, to infer the real position of a moving vehicle from electromagnetic sensor data such as radar or lidar, to suggest more likely informative next steps in a clinical journey following triage and test results, to identify causes of abnormal readings from the sensors of a mechanical apparatus, or to identify abnormal workloads and usages that could signal cyberattacks in a computer network.
In some implementations, the Born machine models can be used in a control arrangement 150. In
In some cases, in the control arrangement 150, input control data Cinput is used to manage operation of the real physical system 160 giving rise to corresponding real outputs Cout:1 to Cout:m, wherein m is an integer, and also serves as an input to control the hidden Markov model (HMM) arrangement 170. Simulated outputs C′out:1 to C′out:m are generated by the HMM arrangement 170, which may generate output data usable for predicting how the real physical system 160 should be functioning. A difference between Cout:1 to Cout:m relative to C′out:1 to C′out:m is generated by a differential amplifier arrangement 180 to generate one or more error signals CE that can be used to compensate for deviations in operations of the real physical system 160 (e.g., from a predicted or suggested operation based on the outputs from the HMM arrangement 170). The error signals CE provide a useful measure of a condition of the real physical system 160, for example component ageing, component maladjustment, or onset of component failure, as well as maintaining operation closely in calibration with the model arrangement 170. Optionally, a hidden Markov model of the model arrangement 170 is trained to analyze the error signals CE and diagnose therefrom any potential faults, onset of potential faults and miscalibration of the real physical system 160. Such insight is extremely valuable when the real physical system 160 is, for example, a nuclear power plant suffering an onset of neutron embrittlement of its critical operating parts, a rechargeable battery system of an electric vehicle, a wind turbine for electricity production, an aircraft system, and so forth.
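The differential stage just described can be sketched as follows; the channel values below are illustrative placeholders for the measured outputs Cout:1 to Cout:m and the model-predicted outputs C′out:1 to C′out:m.

```python
import numpy as np

def error_signal(c_out, c_out_model):
    """Differential stage: C_E = C_out - C'_out, per output channel."""
    return np.asarray(c_out) - np.asarray(c_out_model)

c_out = np.array([1.02, 0.97, 0.48])        # measured outputs (placeholders)
c_out_model = np.array([1.00, 1.00, 0.50])  # model-predicted outputs (placeholders)
c_e = error_signal(c_out, c_out_model)
# A persistent drift in c_e may indicate component ageing or miscalibration,
# and the error signal can itself be analyzed by a trained model.
```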
In the present disclosure, there are provided numerous mathematical derivations supporting a comprehensive implementation of models, for example hidden Markov models but not limited thereto, using Born machines implemented on quantum computers. It will be appreciated that these models are especially accurate and expressive compared with hidden Markov models implemented solely on classical computers. Moreover, it will be appreciated that the hidden Markov models implemented using quantum computers have a wide range of applications in controlling, monitoring and diagnosing operation of complex real physical systems and are therefore capable of providing a highly beneficial technical effect.
Referring next to
There is provided an example method of using the control arrangement 150 for controlling or monitoring the real physical system 160, wherein, in some implementations, the method may include: receiving input data at the classical computer 20 from the real physical system 160; exchanging data between the classical computer 20 and the quantum computer 30; and using a variational inference arrangement executed on the hybrid combination of the classical computer 20 and the quantum computer 30 to process the input data to generate corresponding output data for use in controlling or monitoring operation of the real physical system 160.
The classical computer 20 can also be referred to as being a “conventional computer”. Moreover, the control arrangement 150 can also be referred to as being a “control system”.
In some cases, the method of using the control arrangement 150 includes arranging for at least one model arrangement 170 to include a nested series of models, wherein at least one of the series of models is implemented using the quantum computer 30. Optionally, the at least one model arrangement 170 includes at least one hidden Markov model arrangement.
In some cases, the method includes configuring at least one model of the variational inference arrangement to be taught by using an objective function for at least one of: (i) adversarial teaching based on minimizing a Kullback-Leibler (KL) divergence of the true posterior, relying on a classifier that estimates probability ratios; and (ii) teaching using a kernelized Stein discrepancy (KSD) requiring explicit priors and likelihoods.
In some cases, the method includes configuring the control arrangement 150 to infer an operating condition of the real physical system 160 from an error signal used to compensate for deviations in operation of the real physical system 160 relative to a learnt representation of the real physical system 160, wherein the learnt representation of the real physical system 160 is implemented using the at least one model arrangement 170, for example a hidden Markov model arrangement, that is at least partially implemented using the quantum computer 30.
In some implementations, when a nested series of models is used, for example first, second and third models (for example hidden Markov models HMMs):
(i) the first model 170(1) is trained to process sensor signals, for example to reduce noise and to filter out extraneous effects (wherein the first model 170(1) can be implemented on the classical computer 20);
(ii) the second model 170(2) is trained to make decisions based on filtered sensor signals provided from the first model 170(1) together with user command signals (wherein the second model 170(2) is implemented using the quantum computer 30, to achieve enhanced accuracy); and
(iii) the third model 170(3) is trained to generate output signals for driving the real physical system 160 (wherein the third model 170(3) can be implemented on the classical computer 20).
In the case of a self-driving vehicle:
(a) the first model 170(1) is trained to interpret sensor signals, such as those from a dashboard camera, ultrasonic proximity sensors, a temperature sensor and a humidity sensor;
(b) the second model 170(2) is trained to make decisions regarding driving direction, braking, acceleration, steering, indicating; and
(c) the third model 170(3) is trained to generate commands for the vehicle (for example, braking signals, motor drive signals, indicator light signals) that take into account road surface condition (e.g. wet, ice, snow, loose gravel), engine/motor power, battery power, road incline.
In other words, there is used:
(i) a first group of models in the series for filtering input data,
(ii) a second (middle) group in the series for making strategic inference and decisions, based on outputs from the first group, and
(iii) a third group of models in the series for receiving results from the second group, and for converting these results for output control purposes. “Group” here in (i) to (iii) denotes “one or more”.
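A minimal sketch of this three-group configuration follows; the stage implementations are placeholders, and in the embodiments the middle (decision) group would be the one implemented using the quantum computer 30.

```python
from typing import Callable, Sequence

def nested_pipeline(stages: Sequence[Callable], sensor_data):
    """Pass data through filtering, decision and output-conversion groups in turn."""
    result = sensor_data
    for stage in stages:
        result = stage(result)
    return result

# Placeholder stages: filter -> decide -> actuate.
denoise = lambda x: [v for v in x if v is not None]    # first group (classical)
decide = lambda x: sum(x) / max(len(x), 1)             # second group (quantum in practice)
actuate = lambda d: {"brake": d > 0.5, "throttle": max(0.0, 1.0 - d)}

commands = nested_pipeline([denoise, decide, actuate], [0.2, None, 0.9])
```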
Beneficially, by using nested models, each model can become specialized at performing its particular function, so that the models in synergy provide superlative vehicle control and driving accuracy. Such a manner of configuring the nested series of models is applicable to other applications, for example aircraft navigation and flight control, rocket flight control, ship navigation, robotics, manufacturing machinery control, chemical processing works control, and so forth.
It will be understood that the above description is given by way of example only and that various modifications may be made by those skilled in the art. The above specification, examples, and data provide a complete description of the structure and use of exemplary embodiments. Although various embodiments have been described above with a certain degree of particularity, or with reference to one or more individual embodiments, those skilled in the art could make numerous alterations to the disclosed embodiments without departing from the scope of this specification.
In some implementations, a control system may be configured to control a real physical system (e.g., the real physical system 160). In some cases, the control system may determine inferences regarding an operating status of the real physical system 160, make inferences regarding how to control the real physical system 160, and provide feedback to the real physical system based at least in part on the determined inferences.
In some implementations, the control system may comprise a computing arrangement or system (e.g., the computing arrangement 10) that includes a hybrid combination of classical computers (also referred to as classic computers) and the quantum computers that are in communication with each other (e.g., the classic computer 20 and the quantum computer 30).
In some cases, the classical computer may include a non-transitory memory configured to store specific computer-executable instructions and a hardware processor (e.g., an electronic processor) in communication with the non-transitory memory. In some examples, the quantum computer may include one or more qubits and one or more quantum gates configured to act on the one or more qubits. In some implementations, the quantum computer can be in communication with the classical computer via one or more communication links or via an interface. In some embodiments, the classical computer may execute instructions associated with a quantum compiling algorithm to compile output data generated by the classical computer to data and commands usable to configure the quantum computer and execute a quantum algorithm using one or more quantum circuits. In various implementations, the classical computer and the quantum computer can be included in different enclosures and communicate via a wired or wireless link. In some cases, the classical computer can be part of a distributed computing system (e.g., a cloud computing system).
In some examples, the control system may use a variational inference (VI) method and implement the VI method, at least partially, using a Born machine hosted on the quantum computer. In some cases, the Born machine may be implemented using parameterized quantum circuits. In some cases, the parameterized quantum circuits may function as quantum machine learning models. In some cases, the VI method may comprise one or more Bayesian network models (e.g., hidden Markov models or HMMs). In some such cases, the Bayesian network models may comprise a nested series of models (e.g., a nested series of HMMs).
In some examples, the implementation of a VI method by the control system may comprise teaching a Born machine model to represent the real physical system and then interrogating the Born Machine model to derive an inference regarding operation of the real physical system and how to control its operation.
The process 1400 begins at block 1402 where the control system receives input data (e.g., data usable for managing operation of the real physical system) from a data source. In some cases, the data source can be a user interface of the control system (e.g., a user interface of the classical computer). In some cases, the data source may include a memory (e.g., a non-transitory memory of the real physical system, the control system or that of a computing system different from control system), or a sensor arrangement (e.g., a sensor arrangement associated with the real physical system) that is configured to generate and stream sensor data. In some examples, the data source can be in communication with the control system via a wireless network. In some examples, the control system may receive the input data via the classical computer included in its computing system. In some cases, input data may be derived from the real physical system.
At block 1404, the control system may provide the input data to a variational inference arrangement to generate output data. The output data may comprise one or more inferences regarding an operating condition of the real physical system. In some cases, the variational inference arrangement may be included in the computing system of the control system and may comprise one or more Bayesian network models (e.g., one or more HMMs). In some examples, the Bayesian network models may be implemented using a combination of the classical computers and the quantum computers. In some cases, the variational inference arrangement includes at least one Born machine implemented on the quantum computer. In some embodiments, the variational inference arrangement may generate the output data by executing instructions stored on a memory of the classical computer and performing quantum computing operations using one or more quantum circuits of the quantum computer. In some cases, the quantum computing operations may include quantum computing using the Born machine configured in the quantum computer. In some implementations, at least one of the one or more Bayesian network models can be a trained model configured to provide a representation of the real physical system. In some examples, a Bayesian network model may have been trained using training data acquired from the real physical system (e.g., by operating the system in all its states under various operating conditions). In some cases, at least one of the one or more Bayesian network models can be trained by using an objective function. In some cases, the training may comprise minimizing a Kullback-Leibler (KL) divergence of a true posterior, relying on a classifier that estimates a probability ratio. In some cases, the training may comprise using a kernelized Stein discrepancy (KSD) requiring explicit priors and likelihoods. In some cases, the Born machine can be a trained Born machine. For example, the Born machine may have been trained before receiving the input data using the methods described above. In some cases, a trained Bayesian network model may comprise a trained Born machine.
At block 1406, the control system may receive system output data from the real physical system that is controlled by the control system. In some cases, the real physical system may generate the system output data using system input data. In some cases, the system input data may comprise the input data provided to the control system at block 1402.
At block 1408, the control system may use the system output data and the output data generated using the variational inference arrangement to generate an error signal. In some cases, the control system may generate the error signal by feeding the system output data and the output data, or one or more signals associated with the system output data and the output data, to a differential amplifier. In some cases, the error signal may comprise a difference between the output data generated by the variational inference arrangement and the system output data generated by the real physical system. In some cases, the error signal may provide a measure of a condition of the real physical system, for example component ageing, component maladjustment, or onset of component failure. In some cases, one or more of the Bayesian network models (e.g., a hidden Markov model) implemented by the control system may be configured or trained to analyze the error signal and generate data usable for diagnosing potential faults, onset of potential faults, and miscalibration of the real physical system.
At block 1410, the control system may feed the error signal to an input of the real physical system, thereby providing feedback usable for controlling or correcting operation of the real physical system.
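A minimal sketch of blocks 1406 to 1410 follows; it assumes that the differential-amplifier comparison is realized in software as a subtraction and that the error signal is applied as proportional feedback, with the gain value and all identifiers being illustrative only.

```python
# Illustrative sketch of blocks 1406-1410: compare the model's inferred
# output with the real system's measured output, form an error signal,
# and feed it back to the system input as a corrective term.
import numpy as np

def error_signal(inferred: np.ndarray, measured: np.ndarray) -> np.ndarray:
    """Block 1408: difference between model output and system output."""
    return inferred - measured

def control_step(system_input, inferred, measured, gain=0.1):
    """Block 1410: apply the error signal as proportional feedback."""
    e = error_signal(inferred, measured)
    return system_input - gain * e  # corrected input fed back to the system

# Toy usage: a persistent offset in the error signal could indicate
# component ageing or miscalibration (block 1408 diagnostics).
inferred = np.array([1.00, 0.50])   # output data from the VI arrangement
measured = np.array([1.08, 0.47])   # system output data from the system
new_input = control_step(np.array([0.2, 0.2]), inferred, measured)
```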
In some examples, at least a portion of the operations described with respect to the process 1400 may be performed by a processor of the control system (e.g., a processor of the classical computer). In some examples, at least a portion of the operations described with respect to block 1404 may be performed by the quantum computer. In some examples, the processes performed by the quantum computer may be managed and controlled by a controller of the quantum computer based at least in part on data received from the classical computer. In some cases, the controller of the quantum computer may include a hardware processor and a non-transitory memory storing computer-executable instructions, different from those of the classical computer. In some cases, the controller of the quantum computer may use the computer-executable instructions to configure the variational inference arrangement, e.g., by configuring a Born machine. In some cases, configuring the Born machine may comprise preparing quantum states and quantum circuits.
In some implementations, the qubits may comprise photons having photonic states, and the quantum gates may comprise photonic components configured to control and manipulate the photonic states.
Modifications to embodiments of the present disclosure described in the foregoing are possible without departing from the scope of the present disclosure as defined by the accompanying claims. Expressions such as “including”, “comprising”, “incorporating”, “consisting of”, “have”, “is” used to describe and claim the present invention are intended to be construed in a non-exclusive manner, namely allowing for items, components or elements not explicitly described also to be present. Reference to the singular is also to be construed to relate to the plural; as an example, “at least one of” indicates “one of” in an example, and “a plurality of” in another example; moreover, “one or more” is to be construed in a likewise manner.
The phrases “in an embodiment”, “according to an embodiment” and the like generally mean the particular feature, structure, or characteristic following the phrase is included in at least one embodiment of the present disclosure, and may be included in more than one embodiment of the present disclosure. Importantly, such phrases do not necessarily refer to the same embodiment.
The term “computer” or “computing-based device” is used herein to refer to any device with processing capability such that it executes instructions. Those skilled in the art will realize that such processing capabilities are incorporated into many different devices and therefore the terms “computer” and “computing-based device” each include personal computers (PCs), servers, mobile telephones (including smart phones), tablet computers, set-top boxes, media players, games consoles, personal digital assistants, wearable computers, and many other devices.
The methods described herein are performed, in some examples, by software in machine readable form on a tangible, non-transitory storage medium, e.g., in the form of a computer program comprising computer program code adapted to perform the operations of one or more of the methods described herein when the program is run on a computer and where the computer program may be embodied on a non-transitory computer readable medium. The software is suitable for execution on a parallel processor or a serial processor such that the method operations may be carried out in any suitable order, or simultaneously.
This acknowledges that software is a valuable, separately tradable commodity. It is intended to encompass software, which runs on or controls “dumb” or standard hardware, to carry out the desired functions. It is also intended to encompass software which “describes” or defines the configuration of hardware, such as HDL (hardware description language) software, as is used for designing silicon chips, or for configuring universal programmable chips, to carry out desired functions.
Those skilled in the art will realize that storage devices utilized to store program instructions are optionally distributed across a network. For example, a remote computer is able to store an example of the process described as software. A local or terminal computer is able to access the remote computer and download a part or all of the software to run the program. Alternatively, the local computer may download pieces of the software as needed, or execute some software instructions at the local terminal and some at the remote computer (or computer network). Those skilled in the art will also realize that, by utilizing conventional techniques, all or a portion of the software instructions may be carried out by a dedicated circuit, such as a digital signal processor (DSP), a programmable logic array, or the like.
Any range or device value given herein may be extended or altered without losing the effect sought, as will be apparent to the skilled person.
Although the subject matter has been described in language specific to structural features and/or methodological acts, it is to be understood that the subject matter defined in the appended claims is not necessarily limited to the specific features or acts described above. Rather, the specific features and acts described above are disclosed as example forms of implementing the claims.
It will be understood that the benefits and advantages described above may relate to one embodiment or may relate to several embodiments. The embodiments are not limited to those that solve any or all of the stated problems or those that have any or all of the stated benefits and advantages. No single feature or group of features is necessary or indispensable to every embodiment.
Conditional language used herein, such as, among others, “can,” “could,” “might,” “may,” “e.g.,” and the like, unless specifically stated otherwise, or otherwise understood within the context as used, is generally intended to convey that certain embodiments include, while other embodiments do not include, certain features, elements and/or steps. Thus, such conditional language is not generally intended to imply that features, elements, and/or steps are in any way required for one or more embodiments or that one or more embodiments necessarily include logic for deciding, with or without author input or prompting, whether these features, elements, and/or steps are included or are to be performed in any particular embodiment. The terms “comprising,” “including,” “having,” and the like are synonymous and are used inclusively, in an open-ended fashion, and do not exclude additional elements, features, acts, operations, blocks, and so forth. Also, the term “or” is used in its inclusive sense (and not in its exclusive sense) so that when used, for example, to connect a list of elements, the term “or” means one, some, or all of the elements in the list. In addition, the articles “a,” “an,” and “the” as used in this application and the appended claims are to be construed to mean “one or more” or “at least one” unless specified otherwise.
As used herein, a phrase referring to “at least one of” a list of items refers to any combination of those items, including single members. As an example, “at least one of: A, B, or C” is intended to cover: A; B; C; A and B; A and C; B and C; and A, B, and C. Conjunctive language such as the phrase “at least one of X, Y, and Z,” unless specifically stated otherwise, is otherwise understood with the context as used in general to convey that an item, term, etc. may be at least one of X, Y, or Z. Thus, such conjunctive language is not generally intended to imply that certain embodiments require at least one of X, at least one of Y, and at least one of Z to each be present.
The operations of the methods described herein may be carried out in any suitable order, or simultaneously where appropriate. Additionally, individual blocks may be deleted from, combined with other blocks, or rearranged in any of the methods without departing from the scope of the subject matter described herein. Aspects of any of the examples described above may be combined with aspects of any of the other examples described to form further examples without losing the effect sought.
It will be understood that the above description is given by way of example only and that various modifications may be made by those skilled in the art. The above specification, examples, and data provide a complete description of the structure and use of exemplary embodiments. Although various embodiments have been described above with a certain degree of particularity, or with reference to one or more individual embodiments, those skilled in the art could make numerous alterations to the disclosed embodiments without departing from the scope of this specification.
Reference is made to a research paper “Variational inference with a quantum computer” that is hereby incorporated by reference; the research paper is available in Physical Review Applied 16, 044057 (2021), accessible at https://journals.aps.org/prapplied/pdf/10.1103/PhysRevApplied.16.044057
Reference is made to APPENDIX B: “Gradients for the adversarial method” as published in the aforesaid research paper “Variational inference with a quantum computer”.
Reference is made to APPENDIX C: “The Stein Operator” as published in the aforesaid research paper “Variational inference with a quantum computer”.
Reference is made to APPENDIX D: “Gradients for the kernelized method” as published in the aforesaid research paper “Variational inference with a quantum computer”.
Reference is made to APPENDIX E: “Learning curves for the adversarial method” as published in the aforesaid research paper “Variational inference with a quantum computer”.
Reference is made to APPENDIX F: “Probability table for the ‘lung cancer’ network” as published in the aforesaid research paper “Variational inference with a quantum computer”.
General scientific publications are provided at the end of the aforesaid research paper “Variational inference with a quantum computer”.
Number | Date | Country | Kind |
---|---|---|---|
2103338.6 | Mar 2021 | GB | national |