DEVICE, COMPUTER-IMPLEMENTED METHOD OF ACTIVE LEARNING FOR OPERATING A PHYSICAL SYSTEM

CROSS REFERENCE

The present application claims the benefit under 35 U.S.C. § 119 of German Patent Application No. DE 10 2022 201 453.7 filed on Feb. 11, 2022, which is expressly incorporated herein by reference in its entirety.

FIELD

The present invention concerns a device and a computer-implemented method of active learning in particular for operating a physical system.

SUMMARY

According to an example embodiment of the present invention, a computer-implemented method of active learning in particular for operating a physical system comprises providing a data set that comprises data points, wherein each data point comprises an input for operating the physical system and a first observation and a second observation of the physical system, training a multi-output Gaussian process for predicting the first observation for a given input with the data set, training a Gaussian process for predicting the second observation for a given input with the data set, determining with the data set an input for operating the physical system for that an information gain or uncertainty about an operation of the physical system when operating the physical system with the input is larger than for at least one other input and for that a probability that the Gaussian process predicts a second observation that meets a condition exceeds a threshold, determining, in particular measuring, the first observation and the second observation that result from operating the physical system the determined input and adding a data point to the data set that comprises the determined input and the determined first observation and the determined second observation. This method provides safe active learning.

Determining the input preferably comprises determining the input to be different than inputs that the data points in the data set comprise. This avoids using redundant measurements and thus saves cost related to measuring data.

Determining the input may comprise sampling the input from possible inputs for the physical system. The possible inputs define the set of values that are useful for evaluating.

According to an example embodiment of the present invention, the method may comprise determining the input with an acquisition function that is defined depending on the input and the data set. This allows to evaluate the input based on already measured inputs and the observations that correspond to these inputs. This improves the safe active learning in a way that the already measured data helps predict what new inputs are useful, and thus improve the efficiency and save measuring cost by only measuring the seemingly informative new points.

In one example embodiment of the present invention, the acquisition function models the information gain or the uncertainty about the operation of the physical system when operating the physical system with the input, wherein the input is determined for that according to the acquisition function the information gain or uncertainty is larger than it is according to the acquisition function for at least one other input. This allows selecting the input that allows faster learning as the other input.

According to an example embodiment of the present invention preferably, the condition models operating states in which the physical system likely operates safely, wherein when it is determined that the condition is met, the determined input is selected for operating the physical system and/or that the condition models operating states in which the physical system likely operates unsafely, wherein when it is determined that the condition is not met, the determined input is not selected for operating the physical system. This allows selecting the input that is likely to be safe and avoids operating the physical system in an unsafe operating mode.

The method may comprise sending an instruction comprising the determined input. The method may thus be used on a back end machine for instructing the physical system when it is remote from the back end machine.

The method may comprise operating the physical system according to the instruction and measuring the first observation and the second observation that result from operating the physical system according to the instruction. The method may thus be executed at least in part on the physical system.

The method may comprise receiving the first observation and the second observation characterizing the operation of the physical system while operating it according to the instruction, or receiving operating data characterizing the operation of the physical system while operating it according to the instruction and determining the first observation and the second observation depending on the operating data. The method may thus be used on the back end machine for evaluating the physical system that is remote from the back end machine.

The physical system may be a technical system like, e.g., a computer-controlled machine, in particular a robotic system, a robot, a vehicle, a domestic appliance, a power tool, a manufacturing machine, a personal assistant or an access control system, wherein the method comprises capturing or receiving the first observation and/or the second observation or capturing operating data and determining the first observation and/or the second observation depending on the operating data.

According to an example embodiment of the present invention, the first observation and/or the second observation and/or the operating data may comprise sensor signals, in particular digital images, preferably video, radar, LiDAR, ultrasonic, motion, thermal images, or audio, or acceleration, or speed, or roll, or pitch, or steering angle, or yaw angle, torque, revolution, temperature, or corresponding synthetic data.

According to an example embodiment of the present invention, the device for operating a physical system comprises at least one processor and at least one memory, wherein the at least one memory is adapted to store a data set, wherein the at least one processor is adapted to operate the physical system according to the method. This device is capable of achieving what is described above for the method.

A computer program comprises computer readable instructions, that, when executed by the computer, cause the computer to execute the method according to the present invention.

Further advantageous embodiments are apparent from the following description and the figures.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 schematically depicts a device for operating a physical system and the physical system, according to an example embodiment of the present invention.

FIG. 2 depicts a flowchart of a method for operating the physical system, according to an example embodiment of the present invention.

DETAILED DESCRIPTION OF EXAMPLE EMBODIMENTS

FIG. 1 schematically depicts a device 100 for operating a physical system 102.

The physical system 102 may be a computer-controlled machine, like a robot, a vehicle, a domestic appliance, a power tool, a manufacturing machine, a personal assistant an engine, in particular an internal combustion or electric engine, or an access control system.

The device 100 comprises at least one processor 104 and at least one memory 106.

The at least one memory 106 is adapted to store a data set.

The at least one processor 104 is adapted to perform steps in a method to operate the physical system 102 that is described below.

The device 100 is adapted to be trained for operating the physical system 102 depending on at least a part of the data set. The device 100 is adapted to operate the physical system 102, in particular during and/or after training of the device 100. The device 100 comprises an interface 108. The interface 108 is adapted to output instructions 110 for operating the physical system 102. The physical system 102 comprises at least one actuator 112. The at least one actuator 112 is adapted to execute the instructions 110.

The device 100 in the example is adapted to determine or read from the at least one memory 106 at least a part of the data set. The interface 108 is for example adapted to receive operating data 114. In the example, the physical system 102 comprises at least one sensor 116. The at least one sensor 116 is adapted to capture the operating data 114 or to capture a measurement and determine the operating data 114 from the measurement.

In the example, the device 100 is adapted to instruct the at least one actuator 112 to operate the physical system 102 in an operating mode according to the instructions 110 and return operating data 114 when operating in this operating mode. This means the device 100 is adapted to actively select the operating mode in that data is captured and/or transmitted. Selecting the operating mode allows exploring specific operating modes. This may be used to reduce the risk of operating the physical system 102 in an unsafe operating mode and reduces data traffic and wear of the physical system 102 in a test bench setup, in case only specific operating modes are selected.

According to the following description, a multi-output Gaussian process model f and a Gaussian process model h model a behavior of the physical system 102 and have an input x_a. The instructions 110 are determined depending on the input x_a. The operating data 114 comprises a first observation that is modelled by the multi-output Gaussian process model f and a second observation z_athat is modelled by the Gaussian process model h. In one embodiment the second observation z_ais not modeled by the multi-output Gaussian process model f. In one embodiment the second observation z_ais modeled by the multi-output Gaussian process model f. This means the second observation z_amay be a component of the multi-dimensional first observation y_a.

The first observation may be a full observation y_athat is characterized in the example by operating data 114 that is observed at the same time. The first observation may be a partial observation y_pa. Partial in this context refers to observing a part of the available full observations y_aat the same time, while another part is not observerd at this time. An index p indicates what part of the full observation y_ais observed at the same time.

In the example, the input x_acharacterizes a target value for operating the physical system 102.

In one example, the physical system is an engine. For operating the engine, the target value is e.g. target revolutions, target torque, and/or for an internal combustion engine, its target air mass flow.

In the example, the first observation characterizes an actual operating mode or state of the physical system 102. In the example for the engine, the first observation characterizes e.g. rounds per minute, torque, and/or for the internal combustion engine, its air mass flow and/or its emission.

In the example, the second observation z_acharacterizes an actual operating mode or state of the physical system 102. In the example for the engine, the second observation z_acharacterizes a temperature of the engine.

The Gaussian process model h models a value of the second observation z_a. The value of the second observation z_ain the example must meet a condition in order to operate the physical system 102 safely. The second observation z_ain the example corresponds to a physical quantity and in the example characterizes an actual state of the engine, e.g. its temperature.

The device 100 is in one example adapted to explore the operation of the physical system 102. The device 100 is for example adapted to select with the multi-output Gaussian process model f and the Gaussian process model h data points that shall be captured. In the example, the device 100 is adapted to output instructions 110 that correspond to the input x_afor that, according to the Gaussian process model h, the physical system 102 operates safely, and for that according to the multi-output Gaussian process model f, the information gain about the operation of the physical system 102 is larger than for other input. The instructions 110 cause the physical system 102 to operate in order to observe its reaction. The instructions 110 may comprise information regarding the first observations that shall be observed. For partially observing, the instructions may be determine depending on the index p or may include the index p. The device 100 is adapted in this example to capture the operating data 114 at a plurality of data points that are selected with this multi-output Gaussian process model f and the Gaussian process model h. This device 100 may be adapted to generate training data based on the input x_aand the resulting the observation y_ain a fully observed case or the partial observation y_pain a partially observed case. In the partially observed case, the device 100 may be adapted to generate training data based on the index p. The device 100 may be adapted to train a machine learning system with the training data.

Instead of engine related sensor data, other sensor data may be used as well. The device 100 may be adapted to analyze data of the following types, which may be obtained by receiving sensor signals: digital images, e.g. video, radar, LiDAR, ultrasonic, motion, thermal images, or audio, or acceleration, or speed, or roll, or pitch, or steering angle, or yaw angle, torque, revolution, temperature or corresponding synthetic data. The device 100 may be adapted to avoid operating states of the physical system 102, in which the physical system 102 is damaged or damages its environment.

An exemplary algorithm for active learning for multi-output Gaussian processes requires as input a parameter δ∈(0,1], an initial data set D₀. The algorithm uses the multi-output Gaussian process model f and the Gaussian process model h.

The algorithm comprises a number I of iterations and is represented below:

D
_i
=D
₀

for i=0to I−1 do

determine hyperparameters for the models f and h with the data set D_i,

perform a prediction with the models f and h,

determine the input x_awith an acquisition function,

measure with the determined input x_ain the fully observed case or with the determined input x_aand the index p in the partially observed case, the first observation, i.e. either y_afor the fully observed case or y_pafor the partially observed case, and the second observation z_a,

add a data point comprising the determined input x_ain the fully observed case or the determined input x_aand the index p in the partially observed case, the first observation and the second observation z_ato the data set D_i+1for the next iteration,

end for

The algorithm determines models f and h and the data set D_i.

For the fully observed case, the acquisition function is α(·)∈ custom-character such that x_a=argmax_x{(α(x,D_i)x∈D_Pool,i)}. The data set D_i+1becomes Di∪{x_a,y_a,z_a},

wherein D_Pool,iis a given data set that comprises possible inputs x that are not yet in the data set D_i. Preferably, the given data set D_Pool,icomprises a plurality or all possible inputs x that the machine 102 can use to operate. The first observation y_aand the second observation z_acorresponding to the inputs x may or may not be given in the given data set D_Pool,i. In the example, these are not given and require the later described measuring steps in order to obtain them. In the example, once x_ais determined, the input x that corresponds to this x_ais removed from the given data set D_Pool,i.

In one embodiment, the given data set D_Pool,ifor the fully observed case comprises per input x also a given first observation y and a given second observation z that are assigned to the respective input x. In the example, once x_ais determined, the entiry triple {x,y,z} that comprises the input x is removed from the given data set D_Pool,i.

For the partially observed case, the acquisition function is a(·)∈ custom-character such that (x_a, p_a)=argmax_x,p{(α(x, p, D_i)|(x, p)∈D_Pool,i)}. The data point {x_a,y_pa,z_a} comprises y_pathat corresponds to [y_a]_p. The data set D_i+1becomes D_i∪{x_a, y_pa,z_a}.

In one embodiment, the given data set D_Pool,ifor the partially observed case comprises per input x and index p also a given partial first observation y_pand a given second observation z that are assigned to the respective input x. In the example, once x_ais determined, the entiry triple {x,y_p,z} that comprises the input x and the first observation corresponding to the index p is removed from the given data set D_Pool,i.

For safely operating the physical system 102, the input x_ais selected subject to

ξ(x_a)>1-δ

wherein ξ(x_a) represents a probability, that the physical system 102 is operating safely with x_a. Safe in this context may refer to an operation of the physical system 102 that mitigates destroying or damaging the physical system 102 or any part of it. Safe in this context may refer to an operation of the physical system 102 that mitigates damaging or destroying an environment of the physical system 102 or any part of it. Safe in this context may refer to an operation of the physical system 102 that allows damaging or destroying a part of the physical system 102, e.g. in a fatigue test, while mitigating adverse effects on other parts of the physical system 102 or its environment.

In these examples, it is assumed that the physical system 102 operates safely, if the probability ξ(x_a) exceeds a threshold, e.g., 1-δ in particular for a small δ ∈(0,1].

The corresponding ξ(x_a) is determined from the model h and δ is for example given by an expert as input to the algorithm.

The acquisition function a(·) in one example is, e.g. a predictive entropy

$α (\cdot, D) = H (\cdot, D) = \frac{1}{2} \log (❘ \sum ❘) + \frac{1}{2} R \log (❘ 2 π e ❘)$

In the fully observed case R=P and Σ is a covariance of a posterior p(f(x_*)|x_*,D) of the multi-output Gaussian process model

y
_a
=f(x_a)+ε_a=Wg(x_a)+ε_a∈ custom-character

with independent and identically distributed noise [ε_α]_p˜N(0,σ_p²) for p=1,2, . . . P, a linear transformation W∈ custom-character , and latent Gaussian processes, GP, g_l(·)=[g(·)]_l˜GP(0,k_l()) for l=1,2, . . . L, wherein P and L are finite, k_l() is a bounded kernel function, and each element of W is bounded by a constant. Consideringf_p(·)=[f(·)]p, the multi-output Gaussian processes has a zero mean and the covariance of f_p(x) and f_p,(x′) is

$\sum_{l = 1}^{L} W_{pl} W_{p^{'} l} k_{l} (x, x^{'}) =: η_{p, p^{'}} (x, x^{'})$

The posterior p(f(x_*)|x_*,D) is a multivariate Gaussian custom-character (μ(x_*),Σ(x_*)) of a collection of observations of the output Y:{y_i∈}N_i=1^Nwith mean

μ(x_*)=Ω_N*^T(Ω_NN+diag({σ_i²}_p=1^p)⊗I_N⁻¹Y

and covariance

Σ(x_*)=Ω_**−Ω_N*^T(Ω_NN+diag({σ_i²}_p=1^P)⊗I_N)⁻¹Ω_N*.

wherein ⊗ is the Kronecker product, wherein

[ω_**]_p,p,=η_p,p,(X_*,x_*)

[ω_**]η_p,p,η_p,p,(X,x_*)

[ω_N*]_{(p−1)N+1:pN,p},=η_p,p,(X,x_*)∈ custom-character

[ω_NN]_{(p−1)N+1:pN,(p′−1)N+1:p′N)}=η_p,p,(X,X)∈ custom-character

Y=(y₁₁, . . . ,Y_1N, . . . ,Y_P1, . . . ,Y_PN)^T

X:{X
_i}_i=1^N

D
_i
={x,y,z|x∈X,y∈Y,z∈Z}

wherein y_pnis the p-th component of the n-th observarion, i.e. y_pn=[Y_n]_p.

In case the output is partially observed, some components are omitted. This saves measuring cost and computational cost due to smaller ω, R=1 and because Σ is the variance of a partially observed multi-output Gaussian model

y
_pn
=[f(x_n)+ε_n]_p=[Wg(x_n)+ε_n]p∈ custom-character

with mean

μ(X_*,p_*)=[ω_N_sum*]_all,p*^T{circumflex over (ω)}_N_sum_N_sum⁻¹Y_Φ

and variance

$\sum (x_{*}, p_{*}) = η_{p_{*}, p_{*}} (x_{*}, x_{*}) - {[Ω_{N_{s u m^{*}}}]}_{a l l, p_{*}}^{T} {{\hat{Ω}}_{N_{s u m} N_{s u m}}^{- 1} [Ω_{N_{s u m^{*}}}]}_{a l l, p_{*}}$

for observations of the output

YΦ={y
_p
_k
_n
_k}_k=1^N^sum

wherein Φ: (p,n)→k is a re-indexing bijection with (p_k,n_k)=(Φ⁻¹(k), wherein an output domain of Φ is ∩[−NP+N_sum+1, N_sum], where {1, . . . ,N_sum} are the new indices of the observations, wherein

{circumflex over (ω)}_N_sum_N_sum=ω_N_sum_N_sum+diag({σ₁²}_k=1^N¹,{σ₂²}_k=1^N², . . . ,{σ_p_k²}_k=1^N^sum)

and wherein N_pis a number of outputs with p-th componend observerd and N_sum=Σ_i=1^pN_i.

Notice that, in the fully observed case, N₁=N₂= . . . N_P, =N and N_sum=PN.

The hyperparameters of the Gaussian model h may be determined considering that h:→ describes values Z⊆ of the physical quantity of the physical system 102, wherein h has a Gaussian process prior.

In an example, for values X∈ and Z∈, a predictive distribution p(h(x_*)|x_*,X,Z) is used to determine whether the threshold is exceeded probabilistically considering a probability

ξ(x):=∫_-∞^z^max(Z|μ_h(x), var_h(x))dz

at x∈X and under a condition for the value z, e.g. that the value z must not exceed a threshold z_max∈Z for a safe operation of the physical system 102.

The threshold Z_maxis provided by an expert either from experience, domain expertise or experiments.

In an example, the predictive distribution p(h(x_*)|x_*,X,Z) is used to determine whether the threshold is exceeded probabilistically considering a probability

ξ(x):=∫_z_min^∞(z|μ_h(x), var_h(x))dz

at x∈X and under a condition for the value z, e.g. that the value z must not be less than a threshold Z_min∈Z for a safe operation of the physical system 102.

The threshold Z_minis provided by an expert either from experience, domain expertise or experiments.

These examples comprise an integral of a standard normal distribution .

In the Gaussian process model h described above, the hyperparameters specify a mean function m→ and a positive definite kernel function k:×→ as GP prior for the respective function. The model h is for example for the data set D_i={x_i,y_i,z_i}_i=1^N

h˜GP(m(·),k(),z_i=h(x_i)+∈_i,∈_i,∈_i˜(0,σ²)

In the example of the method that is described below, the mean function m is a zero mean m=0. The posterior, i.e. the prediction for a second observation z, for this example is

p(h(x_*)|x_*,D)=(μ(x_*, var(x_*))

wherein

μ(x_*)=K_N^T(K_NN+σ²I)⁻¹(Z₁, . . . ,Z_N)

var(x_*)=k(x_*,x_*)−K_N^T(K_NN+σ²I)⁻¹K_N*

wherein K_N*∈ and K_NN∈ are matrices with [K_N*]_i=k(x_i,x_*) and [K_NN]_i,j=k(x_i,x_j) and I is the identity matrix of corresponding dimension N.

The prediction of the second observation z is a distribution that has the mean μ(x_*) and variance var(x_*). The method may comprise using the mean μ(x_*) directly as mean noise free prediction of the second observation z at x_*. The method may comprise using the distribution to draw samples of the second observation z at x_*.

The method is applied alike to non-zero mean functions.

The input X_iin the example is of dimension D. The input X_imay be used as instructions 110 or mapped to the instructions 110 with a mapping. The instructions 110 may comprise the index p. A range of values of the operating data 114 in the example is proportional to a range of values of the first observation, e.g. y_aor y_pa, and the second observation z_a. The first observation, e.g. y_aor y_pa, may comprise scalar or multi-dimensional data. The second observation z_ain the example is scalar. The operating data 114 may comprise the first observation, e.g. y_aor y_pa, and the second observation z_a

The hyperparameters of the Gaussian processes described above define the respective mean or mean function and the respective kernel. For zero mean, the hyperparameters define the kernel.

Determining the hyperparameters may comprise optimizing the hyperparameters with an optimization e.g. gradient based approaches or sampling the hyperparameters, e.g. with Monte Carlo sampling or expectation propagation. Expectation propagation is for example performed according to T. P. Minka, “Expectation Propagation for Approximate Bayesian Inference,” in Uncertainty in Artificial Intelligence. Morgan Kaufmann, 2001.

A method of active learning in particular for operating the physical system 102 is described with reference to FIG. 2. The steps of the method are processed iteratively.

The method comprises a step 202.

The step 202 comprises providing the data set D_i. In a first iteration, the data set D_iis initialized e.g. with D_i=D₀.

The step 202 comprises providing a threshold Z_threshodthat is e.g. either z_minor z_maxas described above.

Each data point of the data set D_icomprises an input x EX characterizing an instruction 110 for operating the physical system 102.

Each data point of the data set D_icomprises a first observation, e.g. either y_a∈Y for the fully observed case or y_pa∈Y for the partially observed case, and a second observation z_a∈Z of the physical system 102.

Afterwards, a step 204 is executed.

In the step 204, the method comprises training the multi-output Gaussian process f for predicting the first observation, e.g. either y_a∈Y for the fully observed case or y_pa∈Y for the partially observed case, depending on the given input x_awith the data set D_i.

In the step 204, the method comprises training the Gaussian process h for predicting the second observation z_afor a given input x_awith the data set D_i.

According to an example, redundant data points of the data set D_i, if there are any, are considered only once. Redundant in this context means, only one data point with input x_ais used, if multiple data points with the same x_aexist. The data points with the same x_amay comprise different second observations z_a.

Afterwards, a step 206 is executed.

In the step 206, the method comprises determining with the acquisition function a(·) an input x_a. By design of the models, this data point that comprises the input x_ahas a high probability ξ(x_a) that the Gaussian process h predicts a second observation z_athat meets the condition for the second observation z_a. In the example the probability ξ(x_a) exceeds the threshold ξ(x_a)>1-δ. The condition for the second observation may be determined depending on either the threshold z_maxor the threshold z_min.

The acquisition function α(·) is defined depending on the input x_aand the data set D_i.

In the partially observed case, the acquisition function α(·) is used that, as described above, that depends on the input x_aand the index p.

According to an example, the acquisition function α(·) models an information gain or an uncertainty about the operation of the physical system 102. The information gain or the uncertainty may be modeled using the predicted entropy.

In an example, the input x_ais selected from the given data set D_Pool,ifor that according to the acquisition function α(·) the information gain or uncertainty is larger than it is according to the acquisition function α(·) for another input of possible inputs for the physical system 102. Preferably, the input x_awith the largest value of the acquisition function α(·) is selected. This means the input x_amaximizes the acquisition function α(·). This means, the information gain and the uncertainty are the largest within the considered data points.

This means, selecting the input x_amay comprise determining the input x_afor that the acquisition function α(·) is larger than for other inputs.

In the example, the given input x that corresponds to the selected input x_ais removed from the given data set D_Pool,i.

In the fully observed case, the given data set D_Pool,icomprises the triples described above for this case, and the triple comprising the given input x that corresponds to the selected input x_ais removed from the given data set D_Pool,i.

In the partially observed case, the given data set D_Pool,icomprises the triples described above for this case, and the triple comprising the given input x that corresponds to the selected input x_amay be removed from the given data set D_Pool,i, once the first observations y_pathat corresponds to the indices p have been added to the data set D_i. According to one example, the triple comprising the second observation z_acomprises p first observations and is removed once the data set D_icomprises p triples with all of these p first observations

Afterwards, a step 208 is executed.

In the step 208, the method comprises sending the instruction 110 comprising the input x_afor operating the physical system 102 to the physical system 102. In the example the instruction 110 is determined to comprise the input x_a.

In a step 210, the method comprises operating the physical system 102 according to the instruction 110.

The step 210 in the example comprises measuring at the physical system 102.

In the fully observed case, the method comprises in step 210 measuring at the physical system 102 the first observation y_aand the second observation z_aor operating data 114 corresponding to these, while operating the physical system 102 with the input x_afrom the selected data point.

In the partially observed case, the method comprises in step 210 measuring at the physical system 102 the first observation y_paand the second observation z_aor operating data 114 corresponding to these while operating the physical system 102 with the input x_afrom the selected data point.

In the partially observed case, measuring of the same second observation z_amore than once, may be avoided. For example, the method may comprise evaluating whether a measurement for the second observation z_aand the input x_ahas already been made, and skip measuring the second observation z_awhile operating the machine with this input x_a. Alternatively, the measurement may be made. In this case, step 204 may comprise using the same second observation z_afor the triples in the data set D_ithat result from the same input x_a. This avoids duplicate second observation z_ain the data set D_i.

In a step 212, the method comprises receiving the operating data 114 characterizing the operation of the physical system 102 while operating it according to the instruction 110 that includes the input x_aof the selected data point.

The step 212 may instead comprise receiving operating data 114 characterizing the operation of the physical system 102 while operating it according to the instruction 110 and determining the the first observation y_aand the second observation z_aor the first observation y_paand the second observation z_adepending on the operating data 114.

Afterwards, a step 214 is executed.

In the step 214 the method comprises adding a data point to the data set D_i.

In the fully observed case, the data point comprises the input x_aand the measured first observation y_aand the measured second observation z_aIn the partially observed case, the resulting data point comprises the input x_aand the measured first observation y_paand the measured second observation z_a.

The data set D_imay be stored, e.g. as training data.

In the example, the condition models operating states in which the physical system 102 is likely to operate safely. When it is determined that the condition is met, the data point is selected for operating the physical system 102.

In one example, the condition models operating states in which the physical system 102 is likely to operate unsafely. When it is determined that the condition is not met, the data point is not selected for operating the physical system 102.

Afterwards, the step 202 is executed in a next iteration.

DEVICE, COMPUTER-IMPLEMENTED METHOD OF ACTIVE LEARNING FOR OPERATING A PHYSICAL SYSTEM

Information

Publication Number

Date Filed

Date Published

Inventors

Original Assignees

CPC

International Classifications

Abstract

Description

Claims

Priority Claims (1)