This invention pertains to the field of cyber-physical systems (CPS) and, in particular, to models approximating the behavior of CPS and to methods, based on those models, for monitoring the actual CPS, wherein the CPS are partially observable hybrid systems.
Formal verification provides strong safety guarantees about models of cyber-physical systems. Hybrid system models describe the required interplay of computation and physical dynamics, which is crucial to guarantee what computations lead to safe physical behavior (e.g., cars should not collide). Control computations that affect physical dynamics must act in advance to avoid possibly unsafe future circumstances. Formal verification then ensures that the controllers correctly identify and provably avoid unsafe future situations under a certain model of physics. But any model of physics necessarily deviates from reality and, moreover, any observation with real sensors and manipulation with real actuators is subject to uncertainty. This makes runtime validation a crucial step to monitor whether the model assumptions hold for the real system implementation.
Correctness arguments for cyber-physical systems crucially depend on models to enable predictions about their future behavior. Absent any models, neither tests nor verification provide any correctness results except for the limited amount of concrete (test) cases, because they cannot provide predictions for other cases. Models of physics are crucial in CPSs, but necessarily deviate from reality. Even cyber components may come with surprises when their detailed implementation is more complex than their verification model. Linking cyber and physical components with sensors and actuators adds another layer of uncertainty. These discrepancies are inevitable and call into question how safety analysis results about models can ever transfer to CPS implementations. Not using models, however, would invalidate all predictions.
Safety of the CPS implementation at runtime crucially depends on a faithful implementation of the model. However, even though rigorous correct-by-construction approaches promise provably correct implementation of the software portion of the model, their correctness is still predicated on meeting the assumptions of the model about the physical environment and the physical effects of actuators. Whether or not these environment and actuator effects are faithfully represented in the model can only be checked from actual measurements during system operation at runtime. That way, runtime monitoring can help to check the present run of a CPS implementation.
The key question, however, is what property needs to be runtime-monitored and what a runtime monitor says about the safety of the system. There is a fundamental asymmetry of the power of runtime monitoring in CPS compared to purely discrete software systems. In pure software, it may suffice to monitor the critical property itself and suspend the software upon violation, raising an exception to propagate the violation to surrounding code for mitigation. In cyber-physical systems any such attempt would be fundamentally flawed, because there is no way of reverting time and trying something else. For example, a desired property of a self-driving car may be to always keep at least 1 ft distance to other cars. Runtime monitoring this property, however, would raise an alarm when the car is less than 1 ft away, which (when going fast) makes a collision inevitable and all runtime monitoring futile. The underlying problem is that this property expresses unrealistic implicit assumptions about the continuous dynamics of a car.
Formal verification and validation play a crucial role in making cyber-physical systems safe. Formal methods make strong guarantees about the system behavior if accurate models of the system can be obtained, including models of the controller and of the physical dynamics. In CPS, models are essential; but any model that can be built necessarily deviates from the real world. If the real system fits to the model, the behavior of the real system can be guaranteed to satisfy the correctness properties verified with respect to the model. If not, the verification results about the model need not transfer to the real system.
Cyber-physical systems span controllers and the relevant dynamics of the environment. Since safety is crucial for CPS, their models (e. g., hybrid system models) need to be verified formally. Formal verification guarantees that a model is safe with respect to a safety property. The remaining task is to validate whether the models are adequate, such that the verification results transfer to the system implementation.
Actual system execution, however, provides many opportunities for surprising deviations from the model. Faults may cause the system to function improperly, sensors may deliver uncertain values, and actuators may suffer from disturbance. The formal verification may have assumed simpler ideal-world dynamics for tractability reasons or made unrealistically strong assumptions about the behavior of other agents in the environment. Simpler models are often better for real-time decisions and optimizations, because they make predictions feasible to compute at the required rate. The same phenomenon of simplicity for predictability is often exploited for the models in formal verification and validation. As a consequence, the verification results obtained about models of a CPS only apply to the actual CPS at runtime to the extent that the system fits to the model.
Validation (i.e., checking whether a CPS implementation fits to a model) is an interesting but difficult problem. CPS models are more difficult to analyze than ordinary (discrete) programs because of the physical plant, the environment, sensor inaccuracies, and actuator disturbance. In CPS, models are essential; but any model necessarily deviates from the real world. Still, good models can be correct within certain error margins.
For an unbroken chain of correctness guarantees, runtime monitors are synthesized in a provably correct way from provably safe hybrid system models. The present disclosure advances these techniques to make the synthesized monitoring conditions robust to partial observability from sensor uncertainty and partial controllability due to actuator disturbance. We show that the monitoring conditions result in provable safety guarantees with fallback controllers that react to monitor violations at runtime. We evaluate the method by synthesizing monitoring conditions from existing case studies on train and road traffic control.
Herein we introduce a method (referred to herein as “ModelPlex”) to synthesize monitors by theorem proving. The method uses sound proof rules to formally verify that a model is safe and to synthesize provably correct monitors that validate compliance of system executions with the model.
This invention performs runtime model validation, i.e., it validates whether a model (shown, for example, as verified model 12 in the figures) matches the observed behavior of the actual CPS.
If an observed system execution fits to the verified model 12, then this execution is safe according to the offline verification result about the model. If it does not fit, then the system is potentially unsafe because it no longer has an applicable safety proof, so a verified fail-safe action to avoid safety risks is initiated. Checking whether a system execution fits to a verified model 12 includes checking that the actions chosen by the (unverified) controller implementation fit to one of the choices and requirements of the verified controller model. It also includes checking that the observed states can be explained by the plant model. The crucial questions are: How can a compliance monitor be synthesized that provably represents the verified model? How much safety margin does a system need to ensure that fail-safe actions are initiated early enough for the system to remain safe even if its behavior ceases to comply with the model?
The second question is related to feedback control and can only be answered when assuming constraints on the deviation of the real system dynamics from the plant model. Otherwise, if the real system can be infinitely far off from the model, safety guarantees are impossible. By the sampling theorem in signal processing, such constraints further enable compliance monitoring solely on the basis of sample points instead of the unobservable intermediate states about which no sensor data exists. When such constraints are not available, the method of the present invention still generates verifiably correct runtime tests, which detect deviation from the model at the sampling points, not just between them. A fail-safe action will then lead to best-effort mitigation of safety risks (rather than guaranteed safety).
As presented herein, the present invention is a method to synthesize verifiably correct runtime validation monitors automatically. ModelPlex uses theorem proving with sound proof rules to turn hybrid system models into monitors in a verifiably correct way. Upon non-compliance with the model, ModelPlex initiates provably safe fail-safe actions.
ModelPlex is a principle to build and verify high-assurance controllers for safety-critical computerized systems that interact physically with their environment. It guarantees that verification results about CPS models transfer to the real system by safeguarding against deviations from the verified model. Monitors created by ModelPlex are provably correct and check at runtime whether or not the actual behavior of a CPS complies with the verified model and its assumptions. Upon non-compliance, ModelPlex initiates fail-safe fallback strategies. To initiate the fallback strategies early enough, ModelPlex uses prediction on the basis of disturbed plant models to check safety for the next control cycle. This way, ModelPlex ensures that verification results about a model of a CPS transfer to the actual system behavior at runtime.
For partially observable systems, the approach disclosed herein provides offline safety proofs, which ensure that a controller avoids all safety violations with respect to an explicit model of physics, and offline monitor synthesis, which turns these models into runtime monitors that act in advance with provable safety guarantees about the true CPS. Lastly, the method provides offline proofs to justify that all future model behavior will be safe if the monitor is satisfied at runtime. For example, the specification “keep at least 1 ft distance to other cars” in the disclosed approach considers the dynamics and therefore results in a monitor “always keep sufficient stopping distance” that acts ahead of time, as opposed to after a collision is inevitable.
The methods disclosed herein advance techniques for these questions from prior work, disclosed in U.S. patent application Ser. No. 15/026,690, incorporated herein in its entirety, with built-in systematic ways of coping with the inevitable complications of sensor uncertainty and actuator disturbance in partially observable hybrid systems.
For hybrid systems verification, differential dynamic logic (dL) is used, which has a notation for hybrid systems as hybrid programs. dL allows us to make statements that we want to be true for all runs of a hybrid program ([α]ϕ) or for at least one run (⟨α⟩ϕ) of a hybrid program.
Both constructs are necessary to derive safe monitors: [α]ϕ proofs are needed to ensure that all behavior of a model (including controllers) is safe; ⟨α⟩ϕ proofs are needed to find monitor specifications that detect whether or not a system execution fits the verified model.
Table 1 summarizes the relevant syntax fragment of hybrid programs together with an informal semantics. The semantics ρ(α) of hybrid program α is a relation on initial and final states of running α. The set of dL formulas is generated by the following grammar (~ ∈ {<, ≤, =, ≥, >} and θ1, θ2 are arithmetic expressions in +, −, ·, / over the reals):
ϕ ::= θ1 ~ θ2 | ¬ϕ | ϕ∧ψ | ϕ∨ψ | ϕ→ψ | ∀x ϕ | ∃x ϕ | [α]ϕ | ⟨α⟩ϕ
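For illustration only, this grammar maps directly onto a recursive data structure. The following C sketch (type and field names are our own, not part of the invention; terms and hybrid programs are left abstract) mirrors one constructor per production:

```c
/* Sketch of an AST for the dL formula grammar above; names are
   illustrative.  Terms and hybrid programs are left abstract. */
typedef struct Term Term;        /* theta: +, -, *, / over the reals */
typedef struct Program Program;  /* hybrid program alpha */

typedef enum {
    F_CMP,                       /* theta1 ~ theta2, ~ in {<,<=,=,>=,>} */
    F_NOT, F_AND, F_OR, F_IMPLY, /* logical connectives */
    F_FORALL, F_EXISTS,          /* quantifiers binding a variable */
    F_BOX, F_DIAMOND             /* [alpha]phi and <alpha>phi */
} FormulaKind;

typedef struct Formula {
    FormulaKind kind;
    struct Formula *sub1, *sub2; /* subformulas; sub2 unused for F_NOT */
    Term *lhs, *rhs;             /* comparison operands for F_CMP */
    const char *var;             /* bound variable for quantifiers */
    Program *prog;               /* program for F_BOX / F_DIAMOND */
} Formula;
```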
Differential dynamic logic comes with a theorem prover implementing a verification technique for proving correctness properties of hybrid programs. One example of a theorem prover, and the one used in the preferred embodiment of the invention, is KeYmaera.
CPS are almost impossible to get right without sufficient attention to prior analysis, for instance by formal verification and formal validation techniques. We assume that we are given a verified model of a CPS, i.e., that formula (1) below is proved valid. Note that differential dynamic logic (dL) and the theorem prover KeYmaera are used to illustrate the concepts herein. The concept of ModelPlex is not predicated on the use of KeYmaera to prove (1); other verification techniques could be used to establish validity of this formula. The flexibility of the underlying logic dL, its support for both [α]ϕ and ⟨α⟩ϕ, and its proof calculus, however, are exploited for systematically constructing monitors from proofs in the sequel.
ϕ → [α*]ψ with invariant φ s.t. φ → [α]φ, ϕ → φ, and φ → ψ   (1)
Eq. (1) expresses that all runs of the hybrid system α*, which start in states that satisfy the precondition ϕ and repeat the model α arbitrarily many times, must end in states that satisfy the postcondition ψ. Eq. (1) is proved using some form of induction, which shows that a loop invariant φ holds after every run of α if it was true before. The model α is a hybrid system model 100 of a CPS, which means that it describes both the discrete control actions of the controllers in the system and the continuous physics of the plant and the system's environment.
The safety guarantees obtained by proving Eq. (1) about the model α* transfer to the real system if the actual CPS execution fits to α*. Because safety properties should be preserved, a CPS γ fits to a model α* if the CPS reaches at most those states that are reachable by the model, i.e., ρ(γ) ⊆ ρ(α*). However, we do not know γ and therefore need to find a condition based on α* that can be checked at runtime to see if concrete runs of γ behave like α*.
Checking the postcondition ψ is not sufficient because, if ψ does not hold, the system is already unsafe. Checking the invariant φ is insufficient as well, because if φ does not hold, the controller can no longer guarantee safety, even though the system may not yet be unsafe. But if we detect when a CPS is about to deviate from α* before leaving φ, we can still switch to a fail-safe controller to prevent ¬ψ from ever happening.
As shown in the figures, three kinds of monitors are synthesized: a model monitor 20, a controller monitor 22, and a prediction monitor 24.
Model Monitor 20—
In each state νi, the sample point νi−1 from the previous execution γi−1 is tested for deviation from a single run of α (not α*), i.e., test (νi−1, νi) ∈ ρ(α). If violated, other verified properties may no longer hold for the system. The system, however, is still safe if a prediction monitor 24 was satisfied on νi−1. Frequent violations indicate an inadequate model that should be revised to better reflect reality.
Controller Monitor 22—
In intermediate state ṽi, the current controller decisions of the implementation γctrl are tested for compliance with the model, i.e., test (νi, ṽi) ∈ ρ(αctrl). Controller monitors 22 are designed for switching between controllers, similar to Simplex. If violated, the commands from a fail-safe controller replace the current controller's decisions to ensure that no unsafe commands are ever actuated.
Prediction Monitor 24—
In intermediate state ṽi, the worst-case safety impact of the current controller decisions is tested with respect to the predictions of a bounded-deviation plant model αδplant, which has a tolerance around the model plant αplant, i.e., check νi+1 ⊨ φ for all νi+1 such that (ṽi, νi+1) ∈ ρ(αδplant). Note that all νi+1 are checked simultaneously by checking ṽi for a characterizing condition of αδplant. If violated, the current control choice is not guaranteed to keep the system safe until the next control cycle and, thus, a fail-safe controller takes over.
The assumption for the prediction monitor 24 is that the real execution is not arbitrarily far off the plant models used for safety verification, because otherwise guarantees can be made neither on unobservable intermediate states nor on the safety of the future system evolution. A separation of disturbance causes in the models can be used: ideal plant models αplant for correctness verification purposes, implementation deviation plant models αδplant for monitoring purposes. Any deviation model (e.g., piecewise constant disturbance, differential inclusion models of disturbance) is supported, as long as the deviation is bounded and differential invariants can be found. It is assumed that monitor evaluations are at most some ε time units apart (e.g., along with a recurring controller execution). Note that disturbance in αδplant is more manageable compared to α*, since monitoring can focus on single runs of α instead of repetitions.
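To make the placement of these monitors concrete, the following C sketch (all names hypothetical; an illustration, not the claimed implementation) shows one control cycle invoking the model monitor 20, controller monitor 22, and prediction monitor 24, with a fail-safe fallback on any violation:

```c
#include <stdbool.h>

/* Hypothetical interfaces; the three monitor bodies would be the
   synthesized quantifier-free arithmetic conditions. */
typedef struct { double x[4]; } State;

extern State read_sensors(void);
extern State run_controller(const State *v);   /* unverified implementation */
extern State run_failsafe(const State *v);     /* verified fallback */
extern void  actuate(const State *u);

extern bool model_monitor(const State *prev, const State *cur);  /* 20 */
extern bool ctrl_monitor(const State *cur, const State *cand);   /* 22 */
extern bool pred_monitor(const State *cur, const State *cand);   /* 24 */

void control_cycle(State *prev) {
    State cur = read_sensors();

    /* Model monitor 20: did the observed evolution since the last sample
       fit one execution of the model alpha? */
    bool model_ok = model_monitor(prev, &cur);

    State cand = run_controller(&cur);

    /* Controller monitor 22: is the proposed command allowed by the
       verified controller model?  Prediction monitor 24: does it keep
       the system safe for the next cycle under bounded disturbance? */
    if (!model_ok || !ctrl_monitor(&cur, &cand) || !pred_monitor(&cur, &cand))
        cand = run_failsafe(&cur);  /* provably safe fallback commands */

    actuate(&cand);
    *prev = cur;
}
```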
A check that inspects states of the actual CPS to detect deviation from the model α* is derived systematically. First, a notion of state recall is established, which shows that, when all previous state pairs comply with the model, compliance of the entire execution can be checked by checking the latest two states (νi−1, νi).
Definition 1 (State Recall).
V is used to denote the set of variables whose state we want to recall. γV− ≡ ⋀x∈V x=x− is used to express a characterization of the values of variables in a state prior to a run of α, where the fresh variables x− are always presumed to occur solely in γV−. The variables x− can be used to recall this state. Likewise, γV+ ≡ ⋀x∈V x=x+ is used to characterize the posterior states, with fresh variables x+.
With this notation, the following lemma states that an interconnected sequence of α transitions forms a transition of α*.
Lemma 1 (Loop Prior and Posterior State).
Let α be a hybrid program and α* be the program that repeats α arbitrarily many times. Assume that all consecutive pairs of states (νi−1, νi) ∈ ρ(α) of n∈ℕ+ executions, whose valuations are recalled with γVi ≡ ⋀x∈V x=xi and γVi−1, are plausible with respect to the model α, i.e., ⊨ ⋀1≤i≤n (γVi−1 → ⟨α⟩γVi) with γV− ≡ γV0 and γV+ ≡ γVn. Then, the sequence of states originates from an α* execution from γV0 to γVn, i.e., ⊨ γV− → ⟨α*⟩γV+.
Lemma 1 enables us to check compliance with the model α* up to the current state by checking reachability of a posterior state from a prior state on each execution of α (i.e., online monitoring, which is easier because the loop was eliminated). To find compliance checks systematically, we construct formula (2), which relates a prior state of a CPS to its posterior state through at least one path through the model α. Consecutive states for α* mean before and after executions of α (i.e., at the sample points between repetitions α; α; α, not within a single α).
γV− → ⟨α⟩γV+   (2)
Eq. (2) is satisfied in a state ν if there is at least one run of the model α starting in the state ν recalled by γV− and resulting in a state ω recalled by γV+. In other words, at least one path through α explains how the prior state ν got transformed into the posterior state ω. The dL formula (2) characterizes the state transition relation of the model α directly, so its violation indicates that the observed transition does not comply with the model. Compliance at all intermediate states cannot be observed by real-world sensors.
In principle, Eq. (2) would be a monitor, because it relates a prior state to a posterior state through the model of a CPS, but the formula is difficult, if not impossible, to evaluate at runtime, because it refers to a hybrid system α, which includes nondeterminism and differential equations. The basic observation is that any formula that is equivalent to Eq. (2) but conceptually easier to evaluate in a state would be a correct monitor 112. We use theorem proving to simplify Eq. (2) into quantifier-free first-order real arithmetic so that it can be evaluated efficiently at runtime. The resulting first-order real arithmetic formula can easily be implemented in a runtime monitor and executed along with the actual controller. A monitor 112 is executable code that returns true only if the transition from the prior system state to the posterior state is compliant with the model 100. Thus, deviations from the model 100 can be detected at runtime, so that appropriate fallback and mitigation strategies can be initiated.
The complexity for evaluating an arithmetic formula over the reals for concrete numbers is linear in the formula size, as opposed to deciding the validity of such formulas, which is doubly exponential. Evaluating the same formula on floating point numbers is inexpensive but may yield wrong results due to rounding errors; on exact rationals, the bit-complexity can be non-negligible. Interval arithmetic is used to obtain reliable results efficiently.
This section introduces the nature of ModelPlex monitor specifications, the approach in this invention for generating such specifications from hybrid system models, and how to turn those specifications into monitor code that can be executed at runtime along with the controller.
A ModelPlex specification 104 corresponds to the d formula (2). If the current state of a system does not satisfy a ModelPlex specification 104, some behavior that is not reflected in the model 100 occurred (e.g., the wrong control action was taken, unanticipated dynamics in the environment occurred, sensor uncertainty led to unexpected values, or the system was applied outside the specified operating environment).
A model monitor χm checks that two consecutive states v and w can be explained by an execution of the model α, i.e., (v,w)∈ρ(α). In the sequel, BV(α) are the bound variables of α, FV(ϕ) are the free variables of ϕ, Σ is the set of all variables, and A∖B denotes the set of variables being in some set A but not in some other set B. Furthermore, v|A is used to denote v projected onto the variables in A.
Theorem 1 (Model Monitor Correctness).
Let α* be provably safe, so ⊨ ϕ → [α*]ψ. Let Vm = BV(α) ∪ FV(ψ). Let ν0, ν1, ν2, ν3, . . . be a sequence of states with ν0 ⊨ ϕ that agree on Σ∖Vm, i.e., ν0|Σ∖Vm = νk|Σ∖Vm for all k. If (νi−1, νi) ⊨ χm for all i ≥ 1 with

χm ≡ ϕ|const → ⟨α⟩γVm+

then the system execution is safe, i.e., νk ⊨ ψ for all k, where ϕ|const denotes the conditions of ϕ that involve only constants that do not change in α, i.e., FV(ϕ|const) ∩ BV(α) = ∅.
The approach shown herein to generate monitor specifications from hybrid system models 102 takes a verified dL formula (1) as input 100 and produces a monitor χm in quantifier-free first-order form as output. The algorithm involves the following steps:
A dL formula (1) about a model α of the form ϕ → [α*]ψ is turned into a specification conjecture (3) of the form ϕ|const → ⟨α⟩γVm+.
Theorem proving is applied to the specification conjecture (3) until no further proof rules are applicable and only first-order real arithmetic formulas remain open (see process 106, resulting in monitor conditions 108 containing only first-order logic (FOL), in the figures).
The monitor specification χm is the conjunction of the unprovable first-order real arithmetic formulas from the open sub-goals.
Generate the Monitor Conjecture.
dL formula (1) is mapped syntactically to a specification conjecture of the form (3). By design, this conjecture will not be provable. But the unprovable branches of a proof attempt will reveal information that, had it been in the premises, would make (3) provable. Through γVm+, the conjecture refers to the posterior state of a run of α, so the conditions revealed by the proof attempt relate the prior state of the run to its posterior state.
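For illustration, consider a hypothetical one-variable flow model (not one of the case studies) α ≡ (f:=*; ?f≤K); x′=f, whose bound variables are V = {x, f}. The specification conjecture (3) then reads

ϕ|const → ⟨(f:=*; ?f≤K); x′=f⟩(x=x+ ∧ f=f+)

and a proof attempt leaves open arithmetic conditions that relate the posterior values to the prior ones, e.g., f+ ≤ K together with the existence of a duration t ≥ 0 such that x+ = x + f+·t.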
Use Theorem Proving to Analyze the Specification Conjecture.
The proof rules of dL are used to analyze the specification conjecture (3). These proof rules syntactically decompose a hybrid model 100 into easier-to-handle parts, which leads to sequents with first-order real arithmetic formulas towards the leaves of a proof. Using real arithmetic quantifier elimination, we close sequents with logical tautologies, which do not need to be checked at runtime since they always evaluate to true for any input. The conjunction of the remaining open sequents is the monitor specification 104, which implies (2).
A complete sequence of proof rules applied to the monitor conjecture of the water tank example is described below. Most steps are simple when analyzing specification conjectures: sequential composition (;), nondeterministic choice (∪), deterministic assignment (:=), and logical connectives (∧r etc.) replace current facts with simpler ones or branch the proof. Challenges arise from handling nondeterministic assignment and differential equations in hybrid programs.
First, consider nondeterministic assignment x:=*. The proof rule for nondeterministic assignment (*) results in a new existentially quantified variable. By sequent proof rule ∃r, this existentially quantified variable is instantiated with an arbitrary term θ, which is often a new logical variable that is implicitly existentially quantified. Weakening (Wr) removes facts that are no longer necessary.
Optimization 1 (Instantiation Trigger).
If the variable xi is not changed in the remaining α, xi = xi+ is in γV+, and xi+ is not bound in the remaining α, then the existential quantifier is instantiated with the corresponding post-state variable xi+ of the specification conjecture, since subsequent proof steps would reveal θ = xi+ anyway.
Otherwise, a new logical variable is introduced which may result in an existential quantifier in the monitor specification if no further constraints can be found later in the proof.
Next, differential equations are handled. Even when the differential equation can be solved, existentially and universally quantified variables remain. Inspect the corresponding proof rule from the dL calculus. For differential equations, it must be proven that there exists a duration t, such that the differential equation stays within the evolution domain H throughout all intermediate times t̃ and the result satisfies φ at the end. At this point there are three options: (i) instantiate the existential quantifier per Optimization 1, if the duration will be revealed later in the proof; (ii) introduce a new logical variable, which is the generic case that always yields correct results but may produce monitor specifications that are harder to evaluate; or (iii) apply quantifier elimination to obtain an equivalent quantifier-free result.
The analysis of the specification conjecture finishes with collecting the open sequents from the proof to create the monitor specification 104, χm ≡ ⋀(open sequents). The collected open sequents may include new logical variables and new (Skolem) function symbols that were introduced for nondeterministic assignments and differential equations when handling existential or universal quantifiers. The invertible quantifier rule i∃ is used to re-introduce existential quantifiers for the new logical variables. Often, the now quantified logical variables are discovered to be equal to one of the post-state variables later in the proof, because those variables did not change in the model after the assignment. If this is the case, proof rule ∃σ can be used to further simplify the monitor specification by substituting the corresponding logical variable x with its equal term θ.
A controller monitor χc, shown as reference number 112 in the figures, checks whether the decisions of an (unverified) controller implementation comply with the verified controller model αctrl. It is synthesized from a specification conjecture over the controller fragment of the model, as formalized in Theorem 2.
Theorem 2 (Controller Monitor Correctness).
Let α be of the canonical form αctrl; αplant. Assume ϕ → [α*]ψ has been proven with invariant φ as in (1). Let ν ⊨ ϕ|const ∧ φ, as checked by χm (Theorem 1). Furthermore, let ṽ be a post-controller state. If (ν, ṽ) ⊨ χc with χc ≡ ϕ|const → ⟨αctrl⟩γV+, then (ν, ṽ) ∈ ρ(αctrl).
Up to now exact ideal-world models have been considered. However, real-world clocks drift, sensors measure with some uncertainty, and actuators are subject to disturbance. This makes the exact models safe but too conservative, which means that monitors for exact models are likely to fall back to a fail-safe controller unnecessarily often. This section is a discussion of how ModelPlex specifications are found such that the safety property (1) and the monitor specification become more robust to expected uncertainty and disturbance. That way, only unexpected deviations beyond those captured in the normal operational uncertainty and disturbance of α* cause the monitor to initiate fail-safe actions.
In dL we can, for example, use nondeterministic assignment from an interval to model sensor uncertainty and piecewise constant actuator disturbance, or differential inequalities for actuator disturbance. Such models include nondeterminism about sensed values in the controller model and often need more complex physics models than differential equations with polynomial solutions.
Currently, for finding model monitors, our prototype tool solves differential equations by the proof rule (′). Thus, it finds model monitor specifications for differential algebraic equations with polynomial solutions and for differential algebraic inequalities, which can be refined into solvable differential algebraic equations. For prediction monitors (discussed below), dL techniques are used for finding differential variants and invariants, differential cuts, and differential auxiliaries to handle differential equations and inequalities without polynomial solutions.
With controller monitors, non-compliance of a controller implementation with respect to the modeled controller can be detected right away. With model monitors, non-compliance of the actual system dynamics with respect to the modeled dynamics can be detected when it first occurs. In either non-compliance case, the system switches to a fail-safe action, which is verified using standard techniques. The crucial question is: can such a method always guarantee safety? The answer is linked to the image computation problem in model checking (i.e., approximation of states reachable from a current state), which is known not to be semi-decidable by numerical evaluation at points; approximation with uniform error is only possible if a bound is known for the continuous derivatives. This implies that additional assumptions are needed about the deviation between the actual and the modeled continuous dynamics to guarantee compliance for unobservable intermediate states. Unbounded deviation from the model between sample points is simply unsafe, no matter how hard a controller tries. Hence, worst-case bounds capture how well reality is reflected in the model.
A prediction monitor is derived to check whether a current control decision will be able to keep the system safe for time ε even if the actual continuous dynamics deviate from the model. A prediction monitor checks the current state, because all previous states are ensured by a model monitor and subsequent states are then safe by (1).
Definition 2 (ε-Bounded Plant with Disturbance δ).
Let αplant be a model of the form x′=θ & H. An ε-bounded plant with disturbance δ, written αδplant, is a plant model of the form x0:=0; (f(θ,δ) ≤ x′ ≤ g(θ,δ) & H ∧ x0 ≤ ε) for some f, g, with fresh clock variable x0 satisfying x0′=1 and ε > 0. The disturbance δ is constant if x∉δ; it is additive if f(θ,δ) = θ−δ and g(θ,δ) = θ+δ.
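For example, with additive disturbance, a hypothetical one-dimensional plant x′=v becomes the ε-bounded plant x0:=0; (v−δ ≤ x′ ≤ v+δ & x0 ≤ ε): the true rate of change may deviate from the modeled rate v by at most δ at any time, and each prediction only needs to cover at most ε time units until the next monitor evaluation.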
Theorem 3 (Prediction Monitor Correctness).
Let α* be provably safe, i.e., ϕ → [α*]ψ has been proved using invariant φ as in (1). Let Vp = BV(α) ∪ FV([α]φ). Let ν ⊨ ϕ|const ∧ φ, as checked by χm from Theorem 1. Further assume ṽ such that (ν, ṽ) ∈ ρ(αctrl), as checked by χc from Theorem 2. If (ν, ṽ) ⊨ χp with χp ≡ (ϕ|const ∧ φ) → ⟨αctrl⟩(γVp+ ∧ [αδplant]φ), then ṽ ⊨ [αδplant]φ, i.e., the invariant φ holds for all states reachable within the next control cycle despite bounded disturbance δ.
By adding a controller execution αctrl prior to the disturbed plant model, we synthesize prediction monitors that take the actual controller decisions into account. For safety purposes, a monitor definition without the controller, χp ≡ (ϕ|const ∧ φ) → [αδplant]φ, could be used, but doing so results in a conservative monitor, which has to keep the CPS safe without knowledge of the actual controller decision.
One useful characteristic of ModelPlex beyond soundness is that monitor synthesis is computable, which yields a synthesis algorithm, and that the correctness of those synthesized monitors with respect to their specification is decidable. See Theorem 4.
Theorem 4 (Decidability and Computability).
Assume canonical models of the form α ≡ αctrl; αplant without nested loops, with solvable differential equations in αplant and disturbed plants αδplant with constant additive disturbance δ (see Definition 2). Then, monitor correctness is decidable, i.e., the formulas χm → ⟨α⟩γV+, χc → ⟨αctrl⟩γV+, and χp → ⟨αctrl⟩(γV+ ∧ [αδplant]φ) are decidable. Also, monitor synthesis is computable, i.e., the functions synthm: ⟨α⟩γV+ ↦ χm, synthc: ⟨αctrl⟩γV+ ↦ χc, and synthp: ⟨αctrl⟩(γV+ ∧ [αδplant]φ) ↦ χp are computable.
Preconditions, postconditions and invariants are crucial conditions in CPS design. Monitors for these conditions can check (i) whether or not it is safe to start a particular controller (i. e., check that the precondition of a controller is satisfied), (ii) whether or not a controller complies with its specification (i. e., check that a controller delivers set values that satisfy its post condition), and (iii) whether or not the system is still within its safety bounds (i. e., check that the loop invariant of α* is satisfied).
Precondition and post condition monitors are useful to decide whether or not it is safe to invoke a controller in the current state, and whether or not to trust a controller output. An invariant monitor of a CPS α* captures the main assumptions that have to be true throughout system execution. When an invariant monitor is unsatisfied, it may no longer be safe to run the CPS; a fail-safe controller can act as a mitigation strategy.
Design-by-contract monitors are useful to monitor specific design decisions, which are explicitly marked in the model. Our approach systematically creates monitors for a complete specification of the behavior of the model.
Once we found a model monitor, controller monitor, or prediction monitor specification, we want to turn it into an actual monitor implementation (e.g., in C). The main challenge is to reliably transfer the monitor specification, which is evaluated over the reals ℝ, into executable code that uses floating point representations. We use the interval arithmetic library Apron to represent each real arithmetic value with an interval of a pair of floating point numbers. The interval reliably contains the real value.
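The following C sketch illustrates the principle with a hand-rolled outward-rounded interval type (the actual implementation uses the Apron library; all names here are our own):

```c
#include <fenv.h>
#include <stdbool.h>
#pragma STDC FENV_ACCESS ON

/* Minimal interval sketch: [lo, hi] reliably contains the real value. */
typedef struct { double lo, hi; } Ival;

/* Outward-rounded addition: round toward -inf for the lower bound and
   toward +inf for the upper bound, so no real value is lost. */
static Ival iadd(Ival a, Ival b) {
    Ival r;
    fesetround(FE_DOWNWARD); r.lo = a.lo + b.lo;
    fesetround(FE_UPWARD);   r.hi = a.hi + b.hi;
    fesetround(FE_TONEAREST);
    return r;
}

/* Three-valued comparison a <= b over intervals; UNKNOWN results are
   treated conservatively as monitor violations. */
typedef enum { IV_TRUE, IV_FALSE, IV_UNKNOWN } Tri;
static Tri ileq(Ival a, Ival b) {
    if (a.hi <= b.lo) return IV_TRUE;   /* certainly true  */
    if (a.lo >  b.hi) return IV_FALSE;  /* certainly false */
    return IV_UNKNOWN;
}
```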
For certification purposes, one still has to argue for the correctness of the actual machine code of the synthesized monitor. This entails that the transformation from the monitor specification as a first-order formula into actual code that evaluates the formula must be formally verified. If the synthesized code is in a high-level language, a certified compiler can be used to produce machine code. Such a comprehensive proof chain suitable for certification is part of our ongoing research.
When we design a system for a fail-safe fallback controller ctrlsafe, it is important to know within which bounds the fail-safe controller can still keep our CPS safe, and which design limits we want a controller implementation to obey. The invariant of a CPS with the fail-safe fallback controller describes the safety bounds. When we start the fail-safe fallback controller ctrlsafe in a state where its invariant G is satisfied, it will guarantee to keep the CPS in a state that satisfies the safety property ψ.
So, to safely operate an experimental controller ctrlexp, we want a monitor that informs us when the experimental controller can no longer guarantee the invariant of the fail-safe controller or when it is about to violate the design limits.
A design for a CPS with a fail-safe fallback controller, therefore, involves proving two properties. First, we prove that the fail-safe controller ctrlsafe ensures the safety property ψ as in Eq.(4) below. This property is only provable if we discover an invariant G for the CPS with the fail-safe controller. Then we use G as the safety condition for generating a prediction monitor.
ϕ → [(ctrlsafe; plant)*@inv(G)]ψ   (4)
With this generic structure in mind, we can design for a fallback controller invoked by a model monitor χm, controller monitor χc, or prediction monitor χp. Upon violation of either χm, χc, or χp by the actual system execution, the set values of a fail-safe controller are used instead.
Algorithm 1 lists the ModelPlex specification conjecture analysis algorithm 106, which turns a specification conjecture into an actual monitor, and is shown in flow-chart form in the figures.
To illustrate the behavior of the water tank model with a fallback controller, we created two monitors: monitor χm validates the complete model (as in the examples throughout this work) and is executed at the beginning of each control cycle (before the controller runs). Monitor χc validates only the controller of the model α (it compares the prior and post states of the controller model αctrl) and is executed after the controller but before control actions are issued. Thus, monitor χc resembles conventional runtime verification approaches, which do not check CPS behavior for compliance with the complete hybrid model. This way, unexpected deviations from the model are detected at the beginning of each control cycle, while unsafe control actions are detected immediately before they are taken. With only monitor χm in place, an additional control cycle would be required to detect an unsafe control action, whereas with only monitor χc in place, deviations from the model would be missed. Note that monitor χm could be run in place of χc to achieve the same effect. But monitor χm implements a more complicated formula, which is unnecessary when only the controller output should be validated.
In the simulation, the controller runs for the first three controller cycles, behaving unsafely on the third controller cycle. Monitor χc immediately detects this violation at t=4, because on the third controller cycle setting f=5/2 violates the monitor condition.
The fail-safe action at t=4 drains the tank and, after that, normal operation continues until t=12. Unexpected disturbance x′=f+1/20 occurs throughout t∈[12,14], which is detected by monitor χm. Note that such a deviation would remain undetected with conventional approaches (monitor χc is completely unaware of the deviation). In this simulation run, the disturbance is small enough to let the fail-safe action at t=14 keep the water tank in a safe state.
CPS are almost impossible to get right without sufficient attention to prior analysis, for instance by formal verification. Performed offline, these approaches result in a verified model of a CPS, i.e. Eq.(5) is proved valid, for example using the differential dynamic logic proof calculus implemented in KeYmaera X:
A→[α*]S (5)
Eq. (5) expresses that all runs of the hybrid system α*, which start in states that satisfy the precondition A and repeat α arbitrarily many times, only end in states that satisfy the postcondition S. The model α* is a hybrid system model of a CPS, which means that it describes both the discrete control actions of the controllers in the system and the continuous physics of the plant and the system's environment.
Whether or not the control choices, actuator effects, and environment effects are faithfully represented in the model can only be checked from actual measurements. Intuitively, such checks compare the values x in state νi−1 to the values x+ in state νi, taken at successive sample times, to determine compatibility of the unknown system behavior γi−1 with the model α*, as shown in the figures.
For example, the values x=2 and x+=3 are compliant with the differential equation x′=2, because starting the program at x can produce x+, as witnessed by a proof of the dL formula x=2 ∧ x+=3 → ⟨x′=2⟩ x+=x. No x+<x is compliant with the program, because the program can reach only states where x+≥x.
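As a minimal sketch (setting aside floating-point rounding, which the interval evaluation discussed earlier addresses; the function name is illustrative), the compliance check for this one-line program reduces to a single comparison:

```c
#include <stdbool.h>

/* Monitor for the program x' = 2: a run of some duration t >= 0 with
   x_post = x_pre + 2*t exists if and only if x_post >= x_pre. */
static bool monitor_xprime2(double x_pre, double x_post) {
    return x_post >= x_pre;
}
```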
ModelPlex, as described in U.S. patent application Ser. No. 15/026,690, provides the basis for obtaining such runtime checks from hybrid system models both automatically and in a provably correct manner. ModelPlex analyzes hybrid programs α, starting from monitor conditions in dL of the form ⟨α⟩γ+, where γ+ is a shortcut notation for ⋀x∈BV(α) x+=x to collect the effect of executing α. The satisfaction relation (w,v) ⊨ ϕ can be used to refer to a monitor condition ϕ that is satisfied over two states w and v.
The satisfaction relation (w,v) ⊨ ϕ of dL formula ϕ for a pair of states (w,v) evaluates ϕ in the state resulting from state w by interpreting variable x+ as v(x) for all x∈V, i.e., (w,v) ⊨ ϕ iff w′ ⊨ ϕ, where the state w′ agrees with w except that w′(x+)=v(x) for all x∈V.
The central correctness result about ModelPlex, as shown by Theorem 1, guarantees that the resulting monitoring formula preserves safety (i.e., if the monitoring formula evaluates to true with current sensor measurements and control choices, then the system is safe).
In the following sections, the ModelPlex monitor synthesis techniques are advanced to account for actuator disturbance and sensor uncertainty in partially observable cases.
Actuator disturbance results in discrepancies between the chosen control decisions u and their physical effect ũ (e.g., wheel slip). A typical pattern to model piecewise constant actuator disturbance chooses a nondeterministic value ũ in the Δ-neighborhood around the control choice u (but does not affect other state variables) as input to the plant: u:∈ctrl(x); ũ:∈UΔ(u); x′=f(x,ũ).
Actuator disturbance is partially observable by monitoring the difference between the intended effect x′=f(x,u) and the actual effect x′=f(x,ũ) from observable quantities. As a side effect, safety and invariant properties of models do not mention the perturbed actuator effect. For example, a car controller can estimate deviation from its acceleration choice only after the fact by observing the actual resulting speed difference. The safety property of interest is usually about the actual resulting speed and not the perturbed acceleration. For monitor synthesis, we therefore adapt the input-output relation γ+ to conjecture existence of an effective actuator effect ũ+ that explains all other effects collected in γ+ about a program α. The monitor condition, shown in Eq. (6), can be turned into arithmetical form with the prior synthesis process and preserves safety by Theorem 5.
⟨α⟩∃ũ+ γ+   (6)
Theorem 5 (Monitor with Actuator Disturbance Preserves Invariants).
Let α be a hybrid program of the shape u:∈ctrl(x); ũ:∈UΔ(u); x′=f(x,ũ). Let α* preserve an invariant J where ũ∉FV(J), so J→[α]J is valid. Assume the system transitions from state w to v, which agree on BV(α)ᶜ, and assume w∈[[J]]. If the monitor condition in Eq. (6) is satisfied (i.e., (w,v) ⊨ ⟨α⟩∃ũ+ γ+), then the invariant is preserved (i.e., v∈[[J]]).
Theorem 5 extends to multiple control outputs u and their effects with disturbance ũ in a straightforward way. The controller u:∈ctrl(x) does not access ũ, and ũ∉FV(J) is therefore natural, as no information about ũ needs to be maintained in the invariant J.
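As an illustration (a hypothetical car model, not one of the case studies), u:∈ctrl(x) chooses an acceleration a, the disturbed effect ã:∈UΔ(a) captures that the drivetrain realizes some ã with |ã−a| ≤ Δ, and the plant follows x′=v, v′=ã. Monitor condition (6) then asks whether some effective ã+ within the disturbance bound explains the observed speed change, without requiring ã itself to be measured.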
Partial Observability from Sensor Uncertainty
Monitor correctness requires that all bound variables of a program α are monitored (i.e., BV(α)⊆FV(γ+)). This is the appropriate behavior except for variables that are unobservable in the CPS implementation. For example, when controllers use sensors to obtain information, only the measurement is known, but not the true value that the sensor is measuring. If a variable is unobservable then it cannot be included in a runtime monitor. But there may still be indirect implications about unobservable quantities when they are related to observable quantities, which necessitates monitors that indirectly check the properties of the true quantity from the measured quantity.
Unobservability results in a crucial difference between monitoring and control (and its safety proofs). Controllers estimate true behavior by taking measurements and observing the effect of their decisions in the next measurements that are again taken from the true values. Monitors must decide whether the measurements explain a true behavior that fits to the expected behavior model.
The measurements of a controller are taken from points that are linked through the underlying true behavior by assumption. Monitors cannot make this assumption, which results in a number of challenges that are discussed in this section: (i) check existence of behavior as explanation from a set of potential true values into a set of potential next true values; (ii) link explanations to a full path through sets of potential true values around a history of observations; and (iii) guarantee safety in both cases.
A variable y with y∈x is used to refer to an unobservable state variable, and ŷ is used to denote a measurement of y with some uncertainty Δ, so ŷ∈UΔ(y). We assume non-faulty sensors that function according to their characteristics (i.e., they reliably deliver values that deviate from the true values by at most some known sensor uncertainty Δ). In an implementation, sensor fusion methods for detecting sensor faults and correcting measurement outliers can satisfy this assumption. Definition 4 captures what it means for states to be similar with respect to a measurement that is subject to uncertainty.
Definition 4 (Δ-Uncertainty-Similar States).
w ≈̂yΔ v is used to denote the fact that w=v on {y}ᶜ with w(y)∈UΔ(v(ŷ)), and we say that state w is Δ-uncertainty-similar to state v.
In the following paragraphs, techniques are developed to synthesize monitors that indicate whether or not there exist states that are Δ-uncertainty-similar to the measured states and connected through a program α: for measured states w and v, do there exist uncertainty-similar states w̃ and ṽ such that w̃ ≈̂yΔ w, ṽ ≈̂yΔ v, and (w̃, ṽ)∈ρ(α)?
Monitoring based on measurements requires a decision as to whether the true values y fit to a model α by only looking at the measurements ŷ. Intuitively, this can be answered by finding an unobservable prior value y∈UΔ(ŷ) close to the measurement ŷ, such that running the model α on this possible y predicts a next unobservable y+ that is within measurement uncertainty y+∈UΔ(ŷ+) of the next measurement ŷ+. This intuition is illustrated in the figures.
The goal when formalizing this intuition into monitoring conditions is to shift proof effort offline. Therefore, offline proofs are combined with online monitoring: (i) offline proving that the modeled dynamics ensure that all potential true values around the measurements satisfy an invariant property for safety (contraction, see Definition 5); and (ii) online monitoring that some potential true values around the measurements can be connected with the modeled dynamics (See Eq. 7). Both requirements are expressible by quantifying over potential true values in d.
Definition 5 (Contraction). The contraction ∀y∈Uδ(ŷ)J of margin δ≥0 ensures that J holds for all potential true values y in the δ-neighborhood of the measurement ŷ. Program β is contraction-safe for margin δ with measurement uncertainty Δ if ∀y∈Uδ(ŷ)J → [u:∈β; x′=f(x,u); ?t=ε; ŷ:∈UΔ(y)]∀y∈Uδ(ŷ)J is valid.
Monitor condition Eq. (7) answers the question of the plausibility of two measurements ŷ and ŷ+ taken t=ε time apart according to the model dynamics by asking for the existence of true values y and y+.
χm ≡ ∃y∈UΔ(ŷ) ⟨u:∈ctrl(ŷ); x′=f(x,u); ?t=ε; ŷ:∈UΔ(y)⟩(∃y+ γ+)   (7)
To facilitate monitor synthesis, both measurements ŷ and ŷ+ are needed in a single loop iteration. Therefore, Eq. (7) takes fresh measurements after x′=f(x,u). Now monitor condition (7) indicates that there exist true values close to the measurements; for these true values to have the desired properties, the controller u:∈ctrl(ŷ) must be contraction-safe, as shown by Theorem 6.
Theorem 6 (Pairwise Measurement Monitor Preserves Invariants).
Let α be a program of the shape u:∈ctrl(ŷ); x′=f(x,u); ?t=ε; ŷ:∈UΔ(y) with contraction-safe controller u:∈ctrl(ŷ) for margin Δ and measurement uncertainty Δ (i.e., ∀y∈UΔ(ŷ)J → [α]∀y∈UΔ(ŷ)J is valid). Assume the system transitions from w to v, which agree on BV(α)ᶜ, with non-faulty sensors, so w∈[[y∈UΔ(ŷ)]] and v∈[[y∈UΔ(ŷ)]]. If the contraction w∈[[∀y∈UΔ(ŷ)J]] holds and the pairwise measurement monitor satisfies (w,v) ⊨ ∃y∈UΔ(ŷ)⟨α⟩(∃y+ γ+), then J is preserved (i.e., v∈[[J]]).
Theorem 6 extends to vectors of several unobservable variables y and their measured variables ŷ in a straightforward way. The test ?t=ε is reflected in monitor condition (7), so the monitor is only satisfied for states w and v according to the sampling interval ε (formally: v(t)=w(t)+ε for clock t′=1).
Besides safety, monitoring for the existence of unobservable values y that fit to the present measurements ŷ also guarantees that the variation between true values is bounded, but not that it forms a linked chain of model executions, because true values are freshly estimated from only the last measurement (see the figures). Proposition 1 makes this precise.
Proposition 1.
Let α be a hybrid program of the form u:∈ctrl(ŷ); x′=f(x,u); ?t=ε; ŷ:∈UΔ(y). Assume the system transitions through the sequence of states v0, v1, . . . , vn, which agree on BV(α)ᶜ, such that (vi−1,vi) ⊨ χm with Eq. (7) for all 1≤i≤n. Then there are wi−1 ≈̂yΔ vi−1 and μi ≈̂yΔ vi such that (wi−1, μi)∈[[α]].
Proposition 2 and Corollary 1 bound the maximum variation between true values y and measurements ŷ that the monitor condition must allow according to the model.
Proposition 2.
Let α be a hybrid program of the form u:∈ctrl(ŷ); x′=f(x,u); ?t=ε; ŷ:∈UΔ(y). Assume the model transitions (w,v)∈[[α]] with intermediate states μ and μ̂ such that (w,μ)∈[[u:∈ctrl(ŷ)]], (μ,μ̂)∈[[x′=f(x,u)]], and (μ̂,v)∈[[ŷ:∈UΔ(y)]]. If (w,v) ⊨ χm, then the variation distance in a single α-step is bounded: v(y)∈U4Δ(w(y)+μ̂(y)−μ(y)) and v(y)∈U3Δ(w(ŷ)+μ̂(y)−μ(y)).
Corollary 1.
Assume the model transitions through a sequence of states v0, v1, . . . , vn with intermediate states μi and μ̂i s.t. (vi−1,μi)∈[[u:∈ctrl(ŷ)]], (μi,μ̂i)∈[[x′=f(x,u); ?t=ε]], and (μ̂i,vi)∈[[ŷ:∈UΔ(y)]]. If (vi−1,vi) ⊨ χm for all 1≤i≤n, then the variation is bounded: vn(y)∈U2Δ(n+1)(v0(y)+Σi=1..n(μ̂i(y)−μi(y))).
As a consequence of Proposition 2 and Corollary 1, Theorem 6 is useful for single-step consistency checks: such a monitor (i) keeps at least Δ>0 safety margin with a contraction-safe controller because otherwise freshly estimating true y from measurements on every iteration does not preserve invariants, and (ii) detects “large” deviations that occur in a single monitoring step. However, pairwise consistency is unable to detect gradual drift and therefore has to be safeguarded for its entire 2Δ(n+1) deviation over n steps, which inhibits motion, and cannot exploit improving the safety margin Δ over a history of measurements.
Even if measurement pairs in isolation do not exceed the detectable single-step violation of the previous section, the resulting aggregated drift over multiple measurements is still detectable when we keep a history of the control choices. Instead of an explicit list of measurement and control choice histories, the actuation and measurement history can be aggregated into what really matters: acceptable bounds for the upcoming true values (area 902 in
A rolling state estimator [l,u]:=e(ŷ0, ŷ, y−y0, Δ, [l0,u0]) updates the estimate [l,u] from the previous measurement ŷ0, the current measurement ŷ, the modeled interpolated plant effect y−y0, and the previous estimate [l0,u0] with y0∈U[l0,u0](ŷ0).
The rolling state estimator updates estimates y∈U[l,u](ŷ) of true y on every measurement such that the history of measurements is preserved in aggregate form. On each step, the monitor checks for existence of a true state y in the estimate and uses the rolling state estimator to incorporate the current measurement ŷ into the estimate for the next check. This also means that the plant effect y−y0 is existentially quantified in the monitor condition of Eq.(8) below, so does not need to be directly observable.
The following basic rolling state estimator is non-diverging:
l = max(−Δ, ŷ0 − ŷ + y − y0 + l0)
u = min(Δ, ŷ0 − ŷ + y − y0 + u0)
For a sequence of n measurements, a non-diverging rolling state estimator keeps tighter bounds compared to the 2Δ(n+1) bounds of Corollary 1 without measurement history. Note that measurements typically vary due to sensor uncertainty, so the estimate almost surely even improves over time by observing measurements.
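A minimal C sketch of this update rule follows (names are illustrative; it assumes the upper bound is clipped at +Δ, symmetric to the lower bound):

```c
#include <math.h>

/* Rolling state estimator e(yhat0, yhat, dy, Delta, [l0, u0]):
   updates bounds [l, u] on the true value y around the current
   measurement yhat.  dy is the modeled plant effect y - y0 over
   the last control cycle. */
typedef struct { double l, u; } Estimate;

static Estimate estimate_update(double yhat0, double yhat, double dy,
                                double Delta, Estimate prev) {
    double shift = yhat0 - yhat + dy;
    Estimate e;
    e.l = fmax(-Delta, shift + prev.l); /* never looser than -Delta */
    e.u = fmin( Delta, shift + prev.u); /* never looser than +Delta */
    return e;
}
```

On each control cycle, the monitor first checks the current estimate and then calls the update with the new measurement and modeled plant effect to obtain the bounds for the next check.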
The monitor condition (8) checks the plausibility of a history of measurements ŷ:∈UΔ(y) by estimating the true y∈U[l,u](ŷ) from the observations:

χm ≡ ∃y∈U[l,u](ŷ) ⟨α⟩(∃y+ γ+)   (8)
The dynamics x′=f(x,u) over time are reflected by the rolling state estimator in the estimate y∈U[l,u](ŷ), which guarantees safety by Theorem 7.
Theorem 7 (Monitor with rolling state estimator maintains invariants).
Let α be a program of the form y0:=y; ŷ0:=ŷ; u:∈ctrl(ŷ); x′=f(x,u); ?t=ε; ŷ:∈UΔ(y); [l,u]:=e(ŷ0, ŷ, y−y0, Δ, [l0,u0]) with a contraction-safe program y0:=y; ŷ0:=ŷ; u:∈ctrl(ŷ) for margin [l,u] and measurement uncertainty Δ. Assume the system transitions from w to v, which agree on BV(α)ᶜ, with non-faulty sensors, so both w∈[[y∈UΔ(ŷ)]] and v∈[[y∈UΔ(ŷ)]]. If w∈[[∀y∈U[l,u](ŷ)J]] and the monitor condition (8) is satisfied, i.e., (w,v) ⊨ ∃y∈U[l,u](ŷ)⟨α⟩(∃y+ γ+), then the invariant J is maintained: v∈[[J]].
Theorem 7 extends to vectors of several unobservable variables y and their measured variables ŷ in a straightforward way.
In summary, the approach disclosed herein improves over existing runtime monitoring techniques with provably correct monitor conditions, explicit dynamics with sensor uncertainty and actuator disturbance, and shifts expensive computation offline.
Other methods start from purely discrete monitor specifications and therefore the continuous dynamics are implicit and unchecked (e.g., a monitor specification “always positive distance to obstacles” has the implicit unrealistic assumption that a car can stop instantaneously from any speed). In contrast, the approach disclosed herein starts from safety requirements about dynamical hybrid system models with continuous dynamics and therefore synthesizes monitor specifications that are provably correct with respect to the dynamics model (e.g., a specification “always positive distance to obstacles” in the approach herein results in a monitor “always sufficient stopping distance” that acts ahead of time as opposed to after a collision).
Methods that rely on discrete time, or that combine discrete-time with continuous-time descriptions, require additional assumptions on the continuous dynamics between sampling points to be sound. The approach herein handles these dynamics explicitly, characterizes partial controllability and partial observability, and distinguishes between deviation caused by mere uncertainty and actually unsafe environment behavior.
Some methods rely on extensive runtime computations (e.g., reachable sets). The approach herein, in contrast, performs expensive computations offline. At runtime, the resulting formula is only evaluated in real arithmetic for concrete sensor values and control decisions, which enables fast enough responses.
Crucially, the approach herein proves correctness properties that correctly link satisfied monitors to offline safety proofs, resulting in monitors that warn ahead of time, and accounts for partial controllability and partial observability so that the monitored system inherits the safety guarantees about the model.
Provable guarantees about the safety of cyber-physical systems at runtime are crucial as systems become increasingly autonomous. Formal verification techniques provide an important basis by proving safety of CPS models, which then requires transferring the guarantees of offline proofs to system execution. The approach herein addresses the key question of how offline proofs transfer by runtime monitoring, and crucially, what property needs to be runtime-monitored to imply safety of the monitored system. The disclosed techniques significantly extend previous methods to models of practical interest by implementing proof tactics for differential dynamic logic that correctly synthesize monitor conditions that are robust to bounded sensor uncertainty and bounded actuator disturbance, which are both fundamental sources of partial observability in models.
This application claims the benefit of U.S. Provisional Patent Application Ser. No. 62/917,007, filed Nov. 15, 2018. In addition, this application is a continuation-in-part of U.S. patent application Ser. No. 15/026,690, filed Apr. 1, 2016, which claims the benefit, under 35 U.S.C. § 371, of PCT Application No. PCT/US14/60019, filed Oct. 10, 2014, which claims the benefit of U.S. Provisional Application No. 61/961,366, filed on Oct. 11, 2013, and U.S. Provisional Application No. 61/999,318, filed Jul. 23, 2014. All of these applications are hereby incorporated in their entireties.
This invention was made with government support under contract CNS-1054246 from the National Science Foundation (NSF), contract FA8750-18-C-0092 from the Defense Advanced Research Projects Agency (DARPA), and contract FA9550-16-1-0228 from the Air Force Office of Scientific Research (AFOSR). The government has certain rights in this invention.