Low-pass adaptive/neural controller device and method with improved transient performance

BACKGROUND OF THE INVENTION

1. Field of the Invention

The present invention generally relates to the control of systems with time-varying unknown parameters, time-varying bounded disturbances, and matched uncertain nonlinearities, and in particular to controllers designed to adapt to parameters that vary in uncertain ways.

2. Background Description

The conventional Model Reference Adaptive Controller (MRAC) was developed to control linear systems in the presence of parametric uncertainties. The development of this architecture has been facilitated by the Lyapunov stability theory that defines sufficient conditions for stable performance, but offers no means for characterizing the system's input/output performance during the transient phase. System uncertainties during the transient phase have led to unpredictable and/or undesirable situations, involving control signals of high frequency or large amplitudes, large transient errors or slow convergence rate of tracking errors, to name a few. Application of adaptive/neural controllers has therefore been largely restricted. Large deviation implies poor transient performance. It could even lead to instability, especially for neural controllers where the signals are required to be inside a compact set where approximation is conducted.

One such situation is bandwidth limitation in the control channel, especially in mechanical actuators. A high frequency control signal is impractical as it can lead to destabilization of the system. Another circumstance where use of conventional adaptive/neural controllers is limited is where the system models are mostly based on low-frequency approximations. A high frequency control signal can easily excite the omitted high frequency dynamics of the system and lead to unpredictable consequences.

The transient performance of adaptive/neural controllers depends on unknown parameters, reference input, and adaptive gain in a nonlinear way. Extensive tuning of adaptive gains and Monte-Carlo runs have been the primary methods for enabling the transition of adaptive control solutions to real world applications. This approach has rendered verification and validation of adaptive controllers overly challenging. Moreover, there is no systematic way of selecting design parameters that would yield the desired transient performance for all possible scenarios.

As compared to the linear systems theory, several important aspects of transient performance analysis seem to be missing in the prior art. First, all the bounds in the prior art are computed for tracking errors only, and not for control signals. Although the latter can be deduced from the former, it is straightforward to verify that the ability to adjust the former may not extend to the latter in the case of nonlinear control laws. Second, since the purpose of adaptive control is to ensure stable performance in the presence of modeling uncertainties, one needs to ensure that the changes in reference input and unknown parameters due to possible faults or unexpected uncertainties do not lead to unacceptable transient deviations or oscillatory control signals, implying that a retuning of adaptive parameters is required. Finally, one needs to ensure that whatever modifications or solutions are suggested for performance improvement of adaptive controllers, they are not achieved via high-gain feedback.

SUMMARY OF THE INVENTION

It is therefore an object of the present invention to provide an adaptive/neural controller design with improved transient performance.

Another object of the invention is to provide a systematic way of selecting design parameters that would yield desired transient performance for all possible scenarios.

A further object of the invention is to make it easier to verify and validate adaptive controllers.

We invented a novel ℒ₁ adaptive/neural control architecture that permits fast adaptation and yields guaranteed transient response simultaneously for both the system's input and output signals, in addition to providing asymptotic tracking. The main feature of the invention is rapid adaptation with a guaranteed low-frequency control signal. The ability to adapt rapidly ensures the desired transient performance for both the system's input and output signals, simultaneously, while a low-pass filter in the feedback loop attenuates the high-frequency components in the control signal.

The ℒ₁ adaptive/neural controller can be applied to systems that have time-varying unknown parameters with an arbitrary rate of variation, and also ensures the desired transient performance for both input and output signals of the system. We prove that by increasing the adaptation gain one can achieve arbitrarily close transient and asymptotic tracking for both input and output signals simultaneously.

No matter how configured, in a high-gain feedback controller or MRAC, a large feedback gain or adaptive gain leads to reduced phase or time-delay margins. We demonstrate that increasing the adaptive gain will not hurt the time-delay margin of the closed-loop system with the ℒ₁ adaptive control architecture, in contrast to conventional adaptive or feedback schemes.

An aspect of the invention is a low-pass adaptive/neural controller for a dynamic system, comprising a reference input for the dynamic system, the dynamic system being described by a dynamic model subject to time-varying unknown parameters and an unknown time-varying disturbance, there being a measured output of the dynamic system. A companion model, described by the dynamic model, has adaptive estimates substituted in the dynamic model for the time-varying unknown parameters and the unknown time-varying disturbance, there being a computed output for the companion model. The system has means for generating a control signal to be applied to the dynamic system and the companion model so that the measured output tracks the reference input, the generating means having a low-pass filter to attenuate high frequency components in the control signal. In a further aspect of the invention, the low-pass filter is a stable transfer function and is applied by the generating means so that both the control signal and a difference between the measured output and the reference input achieve a target stability within a transient period.

BRIEF DESCRIPTION OF THE DRAWINGS

The foregoing and other objects, aspects and advantages will be better understood from the following detailed description of a preferred embodiment of the invention with reference to the drawings, in which:

FIG. 1 is a schematic block diagram of a closed loop system with an ℒ₁ adaptive controller.

FIG. 2 is a schematic block diagram of a Linear Time Invariant (LTI) closed loop system.

FIG. 3 is a graph showing application of the ℒ₁ gain stability requirement for a control signal objective in an exemplar dynamical system.

FIG. 4a is a graph showing the system state vector, the companion model of the invention, and a bounded reference signal; FIG. 4b is a graph showing the time history of the control signal where a time varying disturbance signal is

σ(t)=sin(πt).

FIG. 5a is a graph showing the system state vector, the companion model of the invention, and a bounded reference signal; FIG. 5b is a graph showing the time history of the control signal where a time varying disturbance signal is

σ(t)=cos(x₁(t))+2 sin(10t)+cos(15t).

FIG. 6a is a graph showing the system state vector, the companion model of the invention, and a bounded reference signal; FIG. 6b is a graph showing the time history of the control signal where a time varying disturbance signal is

σ(t)=cos(x₁(t))+2 sin(100t)+cos(150t).

FIG. 7 is a schematic of interconnected LTI systems.

FIGS. 8a and 8b are graphs of MRAC performance and time histories, respectively, for r=100 and Γ_c=0.04.

FIGS. 8c and 8d are graphs of MRAC performance and time histories, respectively, for r=100 and Γ_c=0.2.

FIGS. 9a and 9b are graphs of MRAC performance and time histories, respectively, for r=400 and Γ_c=0.04.

FIGS. 10a and 10b are graphs of MRAC performance and time histories, respectively, for r=25 and Γ_c=0.04.

FIG. 11 is a schematic showing a closed loop system with an ℒ₁ adaptive controller.

FIG. 12 is a schematic showing a closed loop reference LTI system.

FIGS. 13a and 13b are graphs showing the equivalent results of cascading low-pass and high-pass systems.

FIGS. 14a and 14b are graphs showing application of the ℒ₁ gain stability requirement for differing adaptive gain values.

FIGS. 15a and 15b are graphs showing simulation results and time histories, respectively, for an ℒ₁ adaptive controller.

FIGS. 16a and 16b are graphs showing performance and time history, respectively, for an ℒ₁ adaptive controller.

FIGS. 17a and 17b are graphs showing performance results and time histories, respectively, for an ℒ₁ adaptive controller.

FIGS. 18a and 18b are graphs showing performance and time history, respectively, for an ℒ₁ adaptive controller.

DETAILED DESCRIPTION OF A PREFERRED EMBODIMENT OF THE INVENTION

Problem Formulation

Consider the following system dynamics:

$\begin{matrix} \dot{x} (t) = A_{m} x (t) + b (ω u (t) + θ^{⊤} (t) x (t) + σ (t)), & \\ y (t) = c^{⊤} x (t), x (0) = x_{0}, & Eq . 1 \end{matrix}$

where x∈ℝⁿ is the system state vector (measurable), u∈ℝ is the control signal, y∈ℝ is the regulated output, b, c∈ℝⁿ are known constant vectors, A_m is a known n×n matrix, ω∈ℝ is known, θ(t)∈ℝⁿ is a vector of time-varying unknown parameters, while σ(t)∈ℝ is a time-varying disturbance. Without loss of generality, we assume that

θ(t)∈Θ, |σ(t)|≦Δ, t≧0, Eq.2

where Θ is a known compact set and Δ∈ℝ⁺ is a known (conservative) ℒ_∞ bound of σ(t).

The control objective is to design a full-state feedback adaptive controller to ensure that y(t) tracks a given bounded reference signal r(t) both in transient and steady state, while all other error signals remain bounded.

We further assume that θ(t) and σ(t) are continuously differentiable and their derivatives are uniformly bounded:

$\begin{matrix} \| \dot{θ} (t) \|_{2} \leq d_{θ} < \infty, | \dot{σ} (t) | \leq d_{σ} < \infty, \forall t \geq 0, & Eq . 3 \end{matrix}$

where ∥•∥₂ denotes the 2-norm, while the numbers d_θ, d_σ can be arbitrarily large.

ℒ₁ Adaptive Controller

In this section, we develop a novel adaptive control architecture for the system in Eq.1 that permits complete transient characterization for both u(t) and x(t).

The elements of the ℒ₁ adaptive controller are introduced next:

Companion Model: We consider the following companion model:

$\begin{matrix} \dot{\hat{x}} (t) = A_{m} \hat{x} (t) + b (ω u (t) + {\hat{θ}}^{⊤} (t) x (t) + \hat{σ} (t)), & \\ \hat{y} (t) = c^{⊤} \hat{x} (t), \hat{x} (0) = x_{0}, & Eq . 4 \end{matrix}$

which has the same structure as the system in Eq.1. The only difference is that the unknown parameters θ(t), σ(t) are replaced by their adaptive estimates θ̂(t), σ̂(t) that are governed by the following adaptation laws.

Adaptive Laws: Adaptive estimates are given by:

$\begin{matrix} \dot{\hat{θ}} (t) = Γ_{θ} Proj (- x (t) {\tilde{x}}^{⊤} (t) P b, \hat{θ} (t)), \hat{θ} (0) = {\hat{θ}}_{0}, & Eq . 5 \\ \dot{\hat{σ}} (t) = Γ_{σ} Proj (- {\tilde{x}}^{⊤} (t) P b, \hat{σ} (t)), \hat{σ} (0) = {\hat{σ}}_{0}, & Eq . 6 \end{matrix}$

where x̃(t)=x̂(t)−x(t) is the error signal between the state of the system and the companion model, Γ_θ=Γ_c I_{n×n}∈ℝ^{n×n} and Γ_σ=Γ_c are adaptation gains with Γ_c∈ℝ⁺, and P is the solution of the algebraic Lyapunov equation

A_m^⊤P+PA_m=−Q, Q>0.
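The matrix P only has to be computed once, offline. The following is a minimal sketch of one way to do it, assuming a small state dimension: the Lyapunov equation is rewritten as a linear system in the entries of P and solved by Gaussian elimination. The function names `gauss_solve` and `solve_lyapunov`, and the choice Q=I, are illustrative assumptions; the matrix A_m is the one used in the robot-arm example (Eq.36).

```python
def gauss_solve(M, rhs):
    """Solve M x = rhs by Gaussian elimination with partial pivoting."""
    n = len(rhs)
    aug = [row[:] + [rhs[i]] for i, row in enumerate(M)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        aug[col], aug[pivot] = aug[pivot], aug[col]
        for r in range(col + 1, n):
            f = aug[r][col] / aug[col][col]
            for c in range(col, n + 1):
                aug[r][c] -= f * aug[col][c]
    x = [0.0] * n
    for r in range(n - 1, -1, -1):
        s = sum(aug[r][c] * x[c] for c in range(r + 1, n))
        x[r] = (aug[r][n] - s) / aug[r][r]
    return x


def solve_lyapunov(A, Q):
    """Return P with A^T P + P A = -Q (unknown P[k][l] stored at index k*n + l)."""
    n = len(A)
    M = [[0.0] * (n * n) for _ in range(n * n)]
    rhs = [0.0] * (n * n)
    for i in range(n):
        for j in range(n):
            row = i * n + j
            for k in range(n):
                M[row][k * n + j] += A[k][i]   # (A^T P)[i][j] = sum_k A[k][i] P[k][j]
                M[row][i * n + k] += A[k][j]   # (P A)[i][j]   = sum_k P[i][k] A[k][j]
            rhs[row] = -Q[i][j]
    p = gauss_solve(M, rhs)
    return [[p[i * n + j] for j in range(n)] for i in range(n)]


A_m = [[0.0, 1.0], [-1.0, -1.4]]   # from Eq.36
Q = [[1.0, 0.0], [0.0, 1.0]]       # assumed choice of Q > 0
b = [0.0, 1.0]
P = solve_lyapunov(A_m, Q)
Pb = [sum(P[i][j] * b[j] for j in range(2)) for i in range(2)]  # vector used in Eq.5, Eq.6
print(P, Pb)
```

For larger systems a dedicated Lyapunov solver from a numerical library would normally be preferred over this dense linear solve.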

Control Law: The control signal is generated by:
$\begin{matrix} u (s) = C (s) \overline{r} (s), & Eq . 7 \\ \overline{r} (t) = \frac{k_{g} r (t) - {\hat{θ}}^{⊤} (t) x (t) - \hat{σ} (t)}{ω}, & Eq . 8 \\ k_{g} = - \frac{1}{c^{⊤} A_{m}^{- 1} b}, & Eq . 9 \end{matrix}$

where r̄(s) is the Laplace transformation of r̄(t), the gain k_g gives the transfer function k_g c^⊤(sI−A_m)⁻¹b unit DC gain, k∈ℝ⁺ is a feedback gain, and C(s) is any strictly proper stable transfer function with low-pass gain C(0)=1. One simple choice is
$\begin{matrix} C (s) = \frac{ω k}{s + ω k} . & Eq . 10 \end{matrix}$
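In a sampled implementation the first-order filter of Eq.10 reduces to a one-line recursion. The sketch below assumes the filter input is held constant over each sample period (zero-order hold); the helper name `make_lowpass`, the sample time, and the numerical values of ω and k are illustrative assumptions, not values prescribed by the text.

```python
import math


def make_lowpass(omega, k, dt):
    """Discrete-time version of C(s) = omega*k / (s + omega*k).

    Assumes a zero-order hold on the input, which maps the pole at
    -omega*k to exp(-omega*k*dt).
    """
    a = math.exp(-omega * k * dt)
    state = {"u": 0.0}

    def step(r_bar):
        # DC gain is (1 - a) / (1 - a) = 1, matching C(0) = 1.
        state["u"] = a * state["u"] + (1.0 - a) * r_bar
        return state["u"]

    return step


dt = 1e-3                                  # sample time in seconds (assumed)
filt = make_lowpass(omega=1.0, k=30.0, dt=dt)
for _ in range(2000):                      # constant input: output settles at the input
    u_dc = filt(1.0)

filt = make_lowpass(omega=1.0, k=30.0, dt=dt)
peak_hf = 0.0
for n in range(4000):                      # sinusoid at ten times the filter bandwidth
    u = filt(math.sin(300.0 * n * dt))
    if n >= 2000:                          # skip the initial transient
        peak_hf = max(peak_hf, abs(u))

print(u_dc, peak_hf)
```

The constant input passes through unchanged, while the component well above the bandwidth ωk is strongly attenuated, which is the behavior the control law relies on.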

Stability Requirement

Further, let
$\begin{matrix} L = \max_{θ (t) \in Θ} \sum_{i = 1}^{n} | θ_{i} (t) |, & Eq . 11 \end{matrix}$

where θ_i(t) is the i^th element of θ(t), and Θ is the compact set defined in Eq.2. We now state the ℒ₁ performance requirement that ensures stability of the entire system and desired transient performance.

ℒ₁-gain stability requirement: Design C(s) to ensure that

∥G(s)∥_{ℒ₁}L<1, Eq.12

where G(s)=(sI−A_m)⁻¹b(1−C(s)).

The complete ℒ₁ adaptive controller consists of Eq.4, Eq.5, Eq.6 and Eq.7 subject to the ℒ₁-gain stability requirement in Eq.12.

The ℒ₁ adaptive controller is illustrated in FIG. 1. The system to be controlled 110 is coupled with companion model 120. Companion model 120 has the same structure as system 110, but the unknown parameters are replaced by their adaptive estimates 130. Controller 140 generates a control signal u 145.

In case of constant θ(t), the stability requirement of the ℒ₁ adaptive controller can be simplified. For the specific choice of C(s) in Eq.10, the stability requirement of the ℒ₁ adaptive controller is reduced to
$\begin{matrix} A_{g} = [\begin{matrix} A_{m} + b θ^{⊤} & b ω \\ - k θ^{⊤} & - k ω \end{matrix}] & Eq . 13 \end{matrix}$

being Hurwitz for all θ∈Θ.
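The Hurwitz condition on A_g can be checked numerically for candidate values of θ. The sketch below is restricted to a two-state plant, so that A_g is 3×3 and the Routh-Hurwitz test for a cubic applies directly. The function names and the sample values θ=[5, 0], k=10 and k=0.1 are illustrative assumptions; checking a few sample points does not by itself establish the condition for all θ∈Θ.

```python
def build_A_g(A_m, b, theta, omega, k):
    """Assemble A_g of Eq.13 for a 2-state plant (result is 3x3)."""
    rows = []
    for i in range(2):
        rows.append([A_m[i][0] + b[i] * theta[0],
                     A_m[i][1] + b[i] * theta[1],
                     b[i] * omega])
    rows.append([-k * theta[0], -k * theta[1], -k * omega])
    return rows


def det3(M):
    """Determinant of a 3x3 matrix by cofactor expansion along the first row."""
    return (M[0][0] * (M[1][1] * M[2][2] - M[1][2] * M[2][1])
            - M[0][1] * (M[1][0] * M[2][2] - M[1][2] * M[2][0])
            + M[0][2] * (M[1][0] * M[2][1] - M[1][1] * M[2][0]))


def is_hurwitz_3x3(M):
    """Routh-Hurwitz test on det(sI - M) = s^3 + a2 s^2 + a1 s + a0."""
    trace = M[0][0] + M[1][1] + M[2][2]
    minors = (M[0][0] * M[1][1] - M[0][1] * M[1][0]
              + M[0][0] * M[2][2] - M[0][2] * M[2][0]
              + M[1][1] * M[2][2] - M[1][2] * M[2][1])
    a2, a1, a0 = -trace, minors, -det3(M)
    return a2 > 0 and a0 > 0 and a2 * a1 > a0


A_m = [[0.0, 1.0], [-1.0, -1.4]]   # from Eq.36
b = [0.0, 1.0]
theta = [5.0, 0.0]                 # hypothetical constant parameter inside [-10, 10]^2

fast = is_hurwitz_3x3(build_A_g(A_m, b, theta, omega=1.0, k=10.0))
slow = is_hurwitz_3x3(build_A_g(A_m, b, theta, omega=1.0, k=0.1))
print(fast, slow)
```

For this sample θ the open-loop matrix A_m+bθ^⊤ is unstable, and the check passes only when the filter gain k is large enough, which illustrates why the requirement constrains the choice of C(s).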

Closed-Loop Reference System

We now consider the following closed-loop LTI reference system with its control signal and system response being defined as follows:
$\begin{matrix} {\dot{x}}_{ref} (t) = A_{m} x_{ref} (t) + b (ω u_{ref} (t) + θ^{⊤} (t) x_{ref} (t) + σ (t)), & Eq . 14 \\ u_{ref} (s) = C (s) \frac{{\overline{r}}_{ref} (s)}{ω}, x_{ref} (0) = x_{0}, & Eq . 15 \\ y_{ref} (t) = c^{⊤} x_{ref} (t), & Eq . 16 \end{matrix}$

where r̄_ref(s) is the Laplace transformation of the signal

r̄_ref(t)=−θ^⊤(t)x_ref(t)−σ(t)+k_g r(t),

and k_g is introduced in Eq.9.

Tracking Performance

Let

H(s)=(sI−A_m)⁻¹b. Eq.17

It is proved that there exists c_o∈ℝⁿ such that
$\begin{matrix} c_{o}^{⊤} H (s) = \frac{N_{n} (s)}{N_{d} (s)}, & Eq . 18 \end{matrix}$

where the order of N_d(s) is one more than the order of N_n(s), and both N_n(s) and N_d(s) are stable polynomials.

Theorem 1: Given the system in Eq.1 and the ℒ₁ adaptive controller defined via Eq.4, Eq.5, Eq.6 and Eq.7 subject to Eq.12, we have:

∥x−x_ref∥_{ℒ_∞}≦γ₁, Eq.19
∥u−u_ref∥_{ℒ_∞}≦γ₂, Eq.20

where
$\begin{matrix} γ_{1} = \frac{\| C (s) \|_{ℒ_{1}}}{1 - \| H (s) (1 - C (s)) \|_{ℒ_{1}} L} \sqrt{\frac{θ_{m}}{λ_{\max} (P) Γ_{c}}}, & Eq . 21 \\ γ_{2} = \| \frac{C (s)}{ω} \|_{ℒ_{1}} L γ_{1} + \| \frac{C (s)}{ω} \frac{1}{c_{o}^{⊤} H (s)} c_{o}^{⊤} \|_{ℒ_{1}} \sqrt{\frac{θ_{m}}{λ_{\max} (P) Γ_{c}}} . & Eq . 22 \end{matrix}$

Corollary 1: Given the system in Eq.1 and the ℒ₁ adaptive controller defined via Eq.4, Eq.5, Eq.6 and Eq.7 subject to Eq.12, we have:
$\begin{matrix} \lim_{Γ_{c} \to \infty} (x (t) - x_{ref} (t)) = 0, \forall t \geq 0, & Eq . 23 \\ \lim_{Γ_{c} \to \infty} (u (t) - u_{ref} (t)) = 0, \forall t \geq 0. & Eq . 24 \end{matrix}$

Thus, the tracking error between x(t) and x_ref(t), as well as between u(t) and u_ref(t), is uniformly bounded by a constant inversely proportional to the square root of Γ_c, as Eq.21 and Eq.22 show. This implies that during the transient one can achieve arbitrarily close tracking performance for both signals simultaneously by increasing Γ_c.
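The dependence of the bounds on the adaptation gain can be made explicit by collecting everything in Eq.21 and Eq.22 that does not involve Γ_c into two constants. The symbols K₁ and K₂ below are shorthand introduced only for this illustration, and the calculation assumes that θ_m, which is not defined in this section, does not itself depend on Γ_c.

```latex
\begin{aligned}
\gamma_1 &= \frac{K_1}{\sqrt{\Gamma_c}}, \qquad
\gamma_2 = \frac{K_2}{\sqrt{\Gamma_c}}, \\
K_1 &= \frac{\|C(s)\|_{\mathcal{L}_1}}{1 - \|H(s)(1 - C(s))\|_{\mathcal{L}_1} L}
       \sqrt{\frac{\theta_m}{\lambda_{\max}(P)}}, \\
K_2 &= \left\|\frac{C(s)}{\omega}\right\|_{\mathcal{L}_1} L\, K_1
     + \left\|\frac{C(s)}{\omega}\,\frac{1}{c_o^{\top} H(s)}\, c_o^{\top}\right\|_{\mathcal{L}_1}
       \sqrt{\frac{\theta_m}{\lambda_{\max}(P)}}, \\
\frac{\gamma_i\big|_{\Gamma_c = 10^{4}}}{\gamma_i\big|_{\Gamma_c = 10^{2}}}
    &= \sqrt{\frac{10^{2}}{10^{4}}} = \frac{1}{10}, \qquad i = 1, 2.
\end{aligned}
```

Under these assumptions, raising the adaptation gain by a factor of 100 tightens both bounds by a factor of 10, with the filter C(s) held fixed.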

Design Guidelines

We note that the control law u_ref(t) in the closed-loop reference system, which is used in the analysis of ℒ_∞ norm bounds, is not implementable since its definition involves the unknown parameters. Theorem 1 ensures that the ℒ₁ adaptive controller approximates u_ref(t) both in transient and steady state. So, it is important to understand how these bounds can be used for ensuring uniform transient response with desired specifications. We notice that the following ideal control signal
$\begin{matrix} u_{ideal} (t) = \frac{k_{g} r (t) - θ^{⊤} (t) x_{ref} (t) - σ (t)}{ω} & Eq . 25 \end{matrix}$

is the one that leads to the desired system response:

$\begin{matrix} {\dot{x}}_{ref} (t) = A_{m} x_{ref} (t) + b k_{g} r (t), & Eq . 26 \\ y_{ref} (t) = c^{⊤} x_{ref} (t), & Eq . 27 \end{matrix}$

by cancelling the uncertainties exactly. In the closed-loop reference system Eq.14-Eq.16, u_ideal(t) is further low-pass filtered by C(s) in Eq.15 to have a guaranteed low-frequency range. Thus, the reference system in Eq.14-Eq.16 has a different response as compared to Eq.26, Eq.27 with Eq.25. It should be noted that C(s) may be selected to ensure that in case of constant θ the response of x_ref(t), u_ref(t), can be made as close as possible to Eq.26 with Eq.25. In case of fast varying θ(t), it is obvious that the bandwidth of the controller needs to be matched correspondingly.

Time-Delay Margin Analysis

We consider the following LTI system with output measurement delay:
$\begin{matrix} ζ_{l} (s) = \frac{1}{1 - C (s)} (r_{b} (s) - r_{f} (s)), r_{f} (s) = C (s) (1 + θ^{⊤} \overline{H} (s)) ζ_{l_{d}} (s), & Eq . 28 \end{matrix}$

where r_b(s) is the Laplace transformation of bounded signal r_b(t). The block-diagram of the closed-loop system in Eq.28 is shown in FIG. 2.

The open-loop transfer function of the system in Eq.28 is:

H_o(s)=C(s)(1+θ^⊤H̄(s))/(1−C(s)) Eq.29

whose phase margin P (H_o(s)) can be derived easily from its Bode plot. The time-delay margin of the open-loop transfer function is given by:

T (H_o(s))=P (H_o(s))/ω_c Eq.30

where P (H_o(s)) is the phase margin of the open-loop system H_o(s), and ω_c is the cross-over frequency of H_o(s).
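As a rough illustration of Eq.30, the sketch below locates the gain crossover of an open-loop transfer function by bisection and divides the phase margin (in radians) by the crossover frequency. It is worked for the special case θ=0 with C(s) from Eq.10, for which Eq.29 reduces to H_o(s)=ωk/s; this special case, the function name `time_delay_margin`, and the values ω=1, k=10 are assumptions made for the example. The routine also assumes that |H_o(jw)| falls through 1 exactly once on the search interval.

```python
import cmath
import math


def time_delay_margin(H_o, w_lo=1e-3, w_hi=1e4, iters=200):
    """Return (T, phase_margin, w_c) for an open-loop transfer function H_o(s).

    Bisection on a logarithmic frequency grid; assumes |H_o(jw)| is above 1
    at w_lo, below 1 at w_hi, and crosses 1 only once in between.
    """
    for _ in range(iters):
        w_mid = math.sqrt(w_lo * w_hi)
        if abs(H_o(1j * w_mid)) > 1.0:
            w_lo = w_mid
        else:
            w_hi = w_mid
    w_c = math.sqrt(w_lo * w_hi)
    phase_margin = math.pi + cmath.phase(H_o(1j * w_c))   # radians
    return phase_margin / w_c, phase_margin, w_c


omega, k = 1.0, 10.0                 # assumed values


def H_o(s):
    # Eq.29 with theta = 0 and C(s) = omega*k/(s + omega*k): C/(1 - C) = omega*k/s
    return omega * k / s


T, pm, w_c = time_delay_margin(H_o)
print(T, pm, w_c)
```

In this special case the crossover sits at ωk with a phase margin of π/2, so the margin is π/(2ωk): a wider filter bandwidth shortens the tolerable delay, independently of the adaptation gain.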

With regard to the time-delay margin of the closed-loop ℒ₁ adaptive controller, we have:

Theorem 2: Given the system in Eq.1 and the ℒ₁ adaptive controller defined via Eq.4, Eq.5-Eq.6 and Eq.7 subject to Eq.12, where Γ_c and Δ are large enough, the closed-loop adaptive system is stable in the presence of time delay τ in its output if τ<T (H_o(s)), where T (H_o(s)) is defined in Eq.30.

Novel Features

This new adaptive controller generates low-pass control signals. It has a constructive design technique for ensuring the desired bandwidth of the generated control signals. Thus, it can meet the bandwidth limitations of the actuators. It has improved transient performance. Using fast adaptation, it guarantees desired transient performance for both control signal and system response. It enables time-delay margin analysis. It has the ability to ensure convergence of the tracking error to zero in steady state. This new architecture permits faster rates of adaptation without generating high-frequency control signals and without destabilizing the system. It has a proof for stable performance, which is a basic requirement for every control system design. We define a new stability criterion which gives a systematic design algorithm for required specifications.

All the discussions above apply to neural adaptive controllers and higher dimensional systems, as well.

Variations

The above described architecture may be varied in several ways without departing from the spirit of the invention.

For example, if the objective is only to get rid of high frequency oscillation in the control signal, the control signal may be filtered without setting a large adaptive gain.

In another variation, the low pass filter may be applied to all or only a part of the signal r̄(t), as in Eq.31,
$\begin{matrix} u (s) = k_{g} r (t) + C (s) \overline{r} (s), where & Eq . 31 \\ \overline{r} (t) = \frac{- {\hat{θ}}^{⊤} (t) x (t) - \hat{σ} (t)}{ω} . & Eq . 32 \end{matrix}$

In this variation the closed-loop reference system must also be modified. However, similar results related to transient performance can be obtained.

In a further variation, other signals can be used for the scaled reference input r(t), depending on the tracking objective.

Another variation is to express the companion model and control law in other equivalent forms.

Extensions

Detailed results and proofs for the ℒ₁ adaptive controller can be found in the attached papers. Based on the ℒ₁ adaptive controller, an ℒ₁ neural controller can be established that guarantees transient performance; its details are given in the attached paper. We also note that all these results can be extended to Multiple Input Multiple Output systems easily.

Results Demonstration

As an illustrative example, consider a single-link robot arm which is rotating on a vertical plane. The system dynamics are given by:
$\begin{matrix} I \ddot{q} (t) + \frac{MgL \cos q (t)}{2} + F (t) \dot{q} (t) + F_{1} (t) q (t) + \overline{σ} (t) = u (t), & Eq . 33 \end{matrix}$

where q(t) and q̇(t) are measured angular position and velocity, respectively, u(t) is the input torque, I is the given moment of inertia, M is the unknown mass, L is the unknown length, F(t) is an unknown time-varying friction coefficient, F₁(t) is position dependent external torque, and σ̄(t) is an unknown bounded disturbance. The control objective is to design u(t) to achieve tracking of the bounded reference input r(t) by q(t). Let

x=[q q̇]^⊤.

The system in Eq.33 can be presented in the state-space form as:
$\begin{matrix} \dot{x} (t) = Ax (t) + b (\frac{u (t)}{I} + \frac{MgL \cos (x_{1} (t))}{2 I} + \frac{\overline{σ} (t)}{I} + \frac{F_{1} (t)}{I} x_{1} (t) + \frac{F (t)}{I} x_{2} (t)), x (0) = x_{0}, y (t) = c^{⊤} x (t), & Eq . 34 \end{matrix}$

where x₀ is the initial condition,
$\begin{matrix} A = [\begin{matrix} 0 & 1 \\ 0 & 0 \end{matrix}], b = [\begin{matrix} 0 \\ 1 \end{matrix}], c = [\begin{matrix} 1 \\ 0 \end{matrix}] . & Eq . 35 \end{matrix}$

The system can be further put into the form:
$\begin{matrix} \dot{x} (t) = A_{m} x (t) + b (ω u (t) + θ^{⊤} (t) x (t) + σ (t)), y (t) = c^{⊤} x (t), x (0) = x_{0}, where A_{m} = [\begin{matrix} 0 & 1 \\ - 1 & - 1.4 \end{matrix}], b = [\begin{matrix} 0 \\ 1 \end{matrix}], c = [\begin{matrix} 1 \\ 0 \end{matrix}], ω = \frac{1}{I}, θ (t) = {[\frac{1 + F_{1} (t)}{I}, 1.4 + \frac{F (t)}{I}]}^{⊤}, σ (t) = \frac{MgL \cos (x_{1} (t))}{2 I} + \frac{\overline{σ} (t)}{I} . & Eq . 36 \end{matrix}$

Let ω=1, and the unknown control effectiveness, time-varying parameters and disturbance be given by:

θ(t)=[2+cos(πt), 2+0.3 sin(πt)+0.2 cos(2t)]^⊤,
σ(t)=sin(πt), Eq.37

so that the compact sets can be conservatively chosen as

Θ=[−10,10], Δ=[−10,10]. Eq.38

For implementation of the ℒ₁ adaptive controller Eq.4, Eq.5-Eq.6 and Eq.7, we need to verify the ℒ₁-gain stability requirement in Eq.12. Letting

C(s)=ω_k/(s+ω_k),

we have 1−C(s)=s/(s+ω_k), so that
$\begin{matrix} G (s) = \frac{s}{s + ω_{k}} H (s), where & Eq . 39 \\ H (s) = [\begin{matrix} \frac{1}{s^{2} + 1.4 s + 1} \\ \frac{s}{s^{2} + 1.4 s + 1} \end{matrix}] . & Eq . 40 \end{matrix}$

We can check easily that for our selection of compact sets in Eq.38, the resulting L=20 in Eq.11. FIG. 3 shows a plot 320 of ∥G(s)∥_{ℒ₁}L as a function of ω_k and compares it to 1 (item 310). We notice that for ω_k>30, we have

∥G(s)∥_{ℒ₁}L<1.

Finally, we set the adaptive gain as Γ_c=10000.
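A sweep of the kind plotted in FIG. 3 can be approximated numerically. The sketch below is one possible approach, not the procedure used to produce the figure: it assumes G(s)=H(s)(1−C(s)) as defined with Eq.12, with C(s)=ω_k/(s+ω_k) and H(s) from Eq.40, realizes both entries of G(s) in controllable canonical form, integrates the impulse response with a fixed-step Runge-Kutta scheme, and takes the larger of the two integrals of |g_i(t)|. The function name `l1_norm_G`, the horizon, the step size and the two sample bandwidths are assumptions; the computed values depend on these settings.

```python
def l1_norm_G(omega_k, T=12.0, dt=1e-3):
    """Approximate the L1 gain of G(s) = H(s) * s / (s + omega_k), H(s) as in Eq.40."""
    # Common denominator (s^2 + 1.4 s + 1)(s + omega_k) = s^3 + a2 s^2 + a1 s + a0
    a2 = 1.4 + omega_k
    a1 = 1.0 + 1.4 * omega_k
    a0 = omega_k

    def f(x):
        # Controllable canonical form: x1' = x2, x2' = x3, x3' = -a0 x1 - a1 x2 - a2 x3
        return (x[1], x[2], -a0 * x[0] - a1 * x[1] - a2 * x[2])

    x = (0.0, 0.0, 1.0)      # state immediately after a unit impulse on the input
    int1 = int2 = 0.0        # running integrals of |g_1| = |x2| and |g_2| = |x3|
    for _ in range(int(T / dt)):
        k1 = f(x)
        k2 = f(tuple(x[i] + 0.5 * dt * k1[i] for i in range(3)))
        k3 = f(tuple(x[i] + 0.5 * dt * k2[i] for i in range(3)))
        k4 = f(tuple(x[i] + dt * k3[i] for i in range(3)))
        x_new = tuple(x[i] + dt / 6.0 * (k1[i] + 2 * k2[i] + 2 * k3[i] + k4[i])
                      for i in range(3))
        int1 += 0.5 * dt * (abs(x[1]) + abs(x_new[1]))   # trapezoidal rule
        int2 += 0.5 * dt * (abs(x[2]) + abs(x_new[2]))
        x = x_new
    return max(int1, int2)   # single input, two outputs: take the larger row


L_bound = 20.0                               # L from Eq.11 for the sets in Eq.38
narrow = l1_norm_G(5.0) * L_bound            # narrow filter bandwidth
wide = l1_norm_G(200.0) * L_bound            # wide filter bandwidth
print(narrow, wide)
```

Evaluating the function over a grid of ω_k values and comparing the product with 1 gives a curve of the same kind as item 320 in FIG. 3, and shows how widening the filter bandwidth drives the product below the threshold.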

Turning now to FIGS. 4a and 4b, we see the simulation results of the ℒ₁ adaptive controller for the reference input r=cos(πt). FIG. 4a shows graphs of the system state vector 410, the companion model 420 of the invention, and the bounded reference signal 430. FIG. 4b is a graph showing the time history of the control signal u(t) where a time varying disturbance signal is

σ(t)=sin(πt).

Next, we consider a different disturbance signal:

σ(t)=cos(x₁(t))+2 sin(10t)+cos(15t).

The simulation results are shown in FIGS. 5a and 5b. FIG. 5a shows graphs of the system state vector 510, the companion model 520 of the invention, and the bounded reference signal 530. FIG. 5b is a graph showing the time history of the control signal u(t).

Finally, we consider much higher frequencies in the disturbance:

σ(t)=cos(x₁(t))+2 sin(100t)+cos(150t).

The simulation results are shown in FIGS. 6a and 6b. FIG. 6a shows graphs of the system state vector 610, the companion model 620 of the invention, and the bounded reference signal 630. FIG. 6b is a graph showing the time history of the control signal u(t).

We note that the ℒ₁ adaptive controller guarantees smooth and uniform transient performance in the presence of different unknown nonlinearities and time-varying disturbances. The controller frequencies are exactly matched with the frequencies of the disturbance that it is supposed to cancel out. We also notice that the system state vector signal x₁(t) and the companion model signal x̂₁(t) are almost the same in FIGS. 4a, 5a and 6a.

We will now present an implementation of the invention, providing further detail and adaptations.

I. Introduction

As described above, transient performance in the implementation can be characterized both for the system input and output signals. To achieve this, a Companion Model Adaptive Control (CMAC) architecture is introduced and its equivalence to MRAC is shown. The difference between CMAC and MRAC is in the definition of the error signal for adaptive laws, which consequently allows for incorporation of a low-pass filter in the feedback loop of CMAC and enables us to enforce the desired transient performance by increasing the adaptation gain. For proof of asymptotic stability, the ℒ₁ gain of a cascaded system, comprised of this filter and the closed-loop desired transfer function, is required to be less than the inverse of the upper bound on the norm of unknown parameters used in projection based adaptation laws. Thus, with the low-pass filter in the loop, the ℒ₁ adaptive controller is guaranteed to stay in the low-frequency range even in the presence of high adaptive gains and large reference inputs. The ideal (non-adaptive) version of this ℒ₁ adaptive controller is used along with the main system dynamics to define a closed-loop reference system, which gives an opportunity to estimate performance bounds in terms of ℒ_∞ norms for both the system's input and output signals as compared to the same signals of this reference system. These bounds immediately imply that the transient performance of the control signal in MRAC cannot be characterized. Design guidelines for selection of the low-pass filter ensure that the closed-loop reference system approximates the desired system response, despite the fact that it depends upon the unknown parameter. Thus, the desired tracking performance is achieved by systematic selection of the low-pass filter, which in its turn enables fast adaptation, as opposed to high-gain designs leading to increased control efforts.

The paper is organized as follows. Section II states some preliminary definitions, and Section III gives the problem formulation. In Section IV, we recall the conventional MRAC design and introduce the Companion Model Adaptive Controller (CMAC), which is a reparameterization of MRAC. In Section V, a new ℒ₁ adaptive controller is presented. Stability and tracking results of the ℒ₁ adaptive controller are presented in Section VI. Comparison of the performance of the ℒ₁ adaptive controller, MRAC and the high gain controller is discussed in Section VIII. In Section IX, simulation results are presented, while Section X concludes the paper.

II. Preliminaries

In this Section, we recall some basic definitions and facts from linear systems theory.

Definition 1: For a signal ξ(t), t≧0, ξ∈ℝⁿ, its truncated ℒ_∞ norm and ℒ_∞ norm are defined as
$\| ξ_{t} \|_{ℒ_{\infty}} = \max_{i = 1, \dots, n} (\sup_{0 \leq τ \leq t} | ξ_{i} (τ) |), \| ξ \|_{ℒ_{\infty}} = \max_{i = 1, \dots, n} (\sup_{τ \geq 0} | ξ_{i} (τ) |),$

where ξ_i is the i^th component of ξ.

Definition 2: The ℒ₁ gain of a stable proper single-input single-output system H(s) is defined to be ∥H(s)∥_{ℒ₁}=∫₀^∞|h(t)|dt, where h(t) is the impulse response of H(s), computed via the inverse Laplace transform
$h (t) = \frac{1}{2 π i} \int_{α - i \infty}^{α + i \infty} H (s) e^{st} d s, t \geq 0,$

in which the integration is done along the vertical line x=α>0 in the complex plane.

Proposition: A continuous time LTI system (proper) with impulse response h(t) is stable if and only if ∫₀^∞|h(τ)|dτ<∞. A proof can be found in [1] (page 81, Theorem 3.3.2).

Definition 3: For a stable proper m input n output system H(s) its ℒ₁ gain is defined as
$\begin{matrix} \| H (s) \|_{ℒ_{1}} = \max_{i = 1, \dots, n} (\sum_{j = 1}^{m} \| H_{ij} (s) \|_{ℒ_{1}}), & (1) \end{matrix}$

where H_ij(s) is the i^th row j^th column element of H(s).

The next lemma extends the results of Example 5.2. ([2], page 199) to general multiple input multiple output systems.

Lemma 1: For a stable proper multi-input multi-output (MIMO) system H(s) with input r(t)∈ℝ^m and output x(t)∈ℝⁿ, we have

∥x_t∥_{ℒ_∞}≦∥H(s)∥_{ℒ₁}∥r_t∥_{ℒ_∞}, ∀t>0. (2)

Proof. Let x_i(t) be the i^th element of x(t), r_j(t) be the j^th element of r(t), H_ij(s) be the i^th row j^th column element of H(s), and h_ij(t) be the impulse response of H_ij(s). Then for any t′∈[0, t], we have
$\begin{matrix} x_{i} (t^{'}) = \int_{0}^{t^{'}} (\sum_{j = 1}^{m} h_{ij} (t^{'} - τ) r_{j} (τ)) d τ . & (3) \end{matrix}$

From (3) it follows that
$\begin{matrix} | x_{i} (t^{'}) | \leq \int_{0}^{t^{'}} (\sum_{j = 1}^{m} | h_{ij} (t^{'} - τ) | | r_{j} (τ) |) d τ \leq \\ \int_{0}^{t^{'}} (\sum_{j = 1}^{m} | h_{ij} (t^{'} - τ) |) d τ (\max_{j = 1, \dots, m} \sup_{0 \leq τ \leq t^{'}} | r_{j} (τ) |) \leq \\ \sum_{j = 1}^{m} (\int_{0}^{t^{'}} | h_{ij} (τ) | d τ) (\max_{j = 1, \dots, m} \sup_{0 \leq τ \leq t^{'}} | r_{j} (τ) |), \end{matrix}$

and hence ∥x_{i_t}∥_{ℒ_∞}≦(Σ_{j=1}^m∥H_ij(s)∥_{ℒ₁})∥r_t∥_{ℒ_∞}. It follows from (1) that
$\begin{matrix} \| x_{t} \|_{ℒ_{\infty}} = \max_{i = 1, \dots, n} \| x_{i_{t}} \|_{ℒ_{\infty}} \leq \max_{i = 1, \dots, n} (\sum_{j = 1}^{m} \| H_{ij} (s) \|_{ℒ_{1}}) \| r_{t} \|_{ℒ_{\infty}} \\ = \| H (s) \|_{ℒ_{1}} \| r_{t} \|_{ℒ_{\infty}} \end{matrix}$

for any t≧0. The proof is complete.
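Lemma 1 can be spot-checked numerically on a scalar example. The sketch below assumes the first-order system H(s)=1/(s+a), whose ℒ₁ gain is 1/a, driven by a unit-amplitude square wave, and compares the observed peak of the output with the bound of Eq.(2). The system, the input, the forward-Euler integration and the function name `peak_response` are all assumptions chosen for illustration.

```python
import math


def peak_response(a, r, dt, steps):
    """Forward-Euler run of x' = -a*x + r(t), x(0) = 0; returns (sup |x|, sup |r|)."""
    x, x_peak, r_peak = 0.0, 0.0, 0.0
    for n in range(steps):
        r_n = r(n * dt)
        x += dt * (-a * x + r_n)
        x_peak = max(x_peak, abs(x))
        r_peak = max(r_peak, abs(r_n))
    return x_peak, r_peak


def square(t):
    # Bounded input with sup |r| = 1
    return 1.0 if math.sin(t) >= 0.0 else -1.0


a = 2.0
l1_gain = 1.0 / a          # integral of |exp(-a*t)| over [0, infinity)
x_peak, r_peak = peak_response(a, square, dt=1e-3, steps=20000)
print(x_peak, l1_gain * r_peak)
```

The observed peak stays below ∥H(s)∥_{ℒ₁}∥r∥_{ℒ_∞}=0.5 and comes close to it, since a square wave dwells at its extreme value long enough for the output to nearly settle.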

Corollary 1: For a stable proper MIMO system H(s), if the input r(t)∈ℝ^m is bounded, then the output x(t)∈ℝⁿ is also bounded as ∥x∥_{ℒ_∞}≦∥H(s)∥_{ℒ₁}∥r∥_{ℒ_∞}.

Lemma 2: For a cascaded system H(s)=H₂(s)H₁(s), where H₁(s) is a stable proper system with m inputs and l outputs and H₂(s) is a stable proper system with l inputs and n outputs, we have ∥H(s)∥_{ℒ₁}≦∥H₂(s)∥_{ℒ₁}∥H₁(s)∥_{ℒ₁}.

Proof. Let y(t)∈ℝⁿ be the output of H(s)=H₂(s)H₁(s) in response to input r(t)∈ℝ^m. It follows from Lemma 1 that

∥y(t)∥≦∥y∥_{ℒ_∞}≦∥H₂(s)∥_{ℒ₁}∥H₁(s)∥_{ℒ₁}∥r∥_{ℒ_∞} (4)

for any bounded r(t). Let H_i(s), i=1, . . . , n be the i^th row of the system H(s). It follows from (1) that there exists i such that

∥H(s)∥_{ℒ₁}=∥H_i(s)∥_{ℒ₁}. (5)

Let h_ij(t) be the j^th element of the impulse response of the system H_i(s). For any T, let

r_j(t)=sgn h_ij(T−t), t∈[0,T], ∀j=1, . . . , m. (6)

It follows from Definition 1 that ∥r∥_{ℒ_∞}=1, and hence ∥y(t)∥≦∥H₂(s)∥_{ℒ₁}∥H₁(s)∥_{ℒ₁}, ∀t≧0. For r(t) satisfying (6), the i^th element of the output satisfies
$\begin{matrix} y_{i} (T) = \int_{t = 0}^{T} \sum_{j = 1}^{m} h_{ij} (T - t) r_{j} (t) d t \\ = \int_{t = 0}^{T} \sum_{j = 1}^{m} | h_{ij} (T - t) | d t \\ = \sum_{j = 1}^{m} (\int_{t = 0}^{T} | h_{ij} (t) | d t) . \end{matrix}$

Therefore, it follows from (4) that for any T, Σ_{j=1}^m(∫_{t=0}^T|h_ij(t)|dt)≦∥H₂(s)∥_{ℒ₁}∥H₁(s)∥_{ℒ₁}. As T→∞, it follows from (5) that
$\begin{matrix} \| H (s) \|_{ℒ_{1}} = \| H_{i} (s) \|_{ℒ_{1}} \\ = \lim_{T \to \infty} \sum_{j = 1}^{m} (\int_{t = 0}^{T} | h_{ij} (t) | d t) \leq \| H_{2} (s) \|_{ℒ_{1}} \| H_{1} (s) \|_{ℒ_{1}}, \end{matrix}$

and this completes the proof.
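A small numerical illustration of Lemma 2 for the SISO case follows. It is a sketch under stated assumptions: the two second-order systems (ζ=0.2, ω_n=1 and ζ=0.3, ω_n=2), the step size, and the helper names are hypothetical choices made for the example. The impulse response of the cascade is obtained by discrete convolution, and its ℒ₁ norm is compared with the product of the individual norms.

```python
import math

def impulse_response(zeta, wn, dt, n):
    """Samples of h(t) for H(s) = wn^2 / (s^2 + 2*zeta*wn*s + wn^2), 0 < zeta < 1."""
    wd = wn * math.sqrt(1.0 - zeta ** 2)
    return [(wn ** 2 / wd) * math.exp(-zeta * wn * k * dt) * math.sin(wd * k * dt)
            for k in range(n + 1)]

def l1_norm(h, dt):
    """Riemann-sum approximation of the integral of |h(t)|."""
    return sum(abs(v) for v in h) * dt

def cascade(h2, h1, dt):
    """Impulse response of H2(s)H1(s): discrete convolution of h2 and h1."""
    return [sum(h2[k - j] * h1[j] for j in range(k + 1)) * dt
            for k in range(len(h1))]

dt, n = 0.02, 2000                      # 40 s window
h1 = impulse_response(0.2, 1.0, dt, n)  # assumed H1(s)
h2 = impulse_response(0.3, 2.0, dt, n)  # assumed H2(s)

norm_1 = l1_norm(h1, dt)
norm_2 = l1_norm(h2, dt)
norm_cascade = l1_norm(cascade(h2, h1, dt), dt)

print(norm_cascade, norm_1 * norm_2)
```

The inequality is generally not tight: sign changes in the two impulse responses partially cancel inside the convolution, so the cascade norm is typically below the product.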

Consider an interconnected LTI system in FIG. 7, where w₁εIR^n₁, w₂εIR^n₂, M(s) is a stable proper system with n₂ inputs and n₁ outputs, and Δ(s) is a stable proper system with n₁ inputs and n₂ outputs.

Theorem 1: (ℒ₁ Small Gain Theorem) The interconnected system in FIG. 7 is stable if ∥M(s)∥_ℒ₁∥Δ(s)∥_ℒ₁<1.

The proof follows from Theorem 5.6 ([1], page 218), written for the ℒ₁ gain.

Consider a linear time invariant system:

{dot over (x)}(t)=Ax(t)+bu(t), (7)

where xεIRⁿ, uεIR, bεIRⁿ, AεIR^n×n is Hurwitz, and assume that the transfer function (sI−A)⁻¹b is strictly proper and stable. Notice that it can be expressed as:
$\begin{matrix} {(sI - A)}^{- 1} b = \frac{n (s)}{d (s)}, & (8) \end{matrix}$

where d(s)=det(sI−A) is an n^th order stable polynomial, and n(s) is an n×1 vector with its i^th element being a polynomial function:
$\begin{matrix} n_{i} (s) = \sum_{j = 1}^{n} n_{ij} s^{j - 1} & (9) \end{matrix}$

Lemma 3: If (AεIR^n×n, bεIRⁿ) is controllable, the matrix N with its i^th row j^th column entry n_ij is full rank.

Proof. Controllability of (A, b) for the LTI system in (7) implies that given an initial condition x(t₀)=0 and arbitrary x_t₁εIRⁿ and arbitrary t₁, there exists u(τ), τε[t₀, t₁] such that x(t₁)=x_t₁. If N is not full rank, then there exists a non-zero vector μεIRⁿ such that μ^⊤n(s)=0. Then it follows that for x(t₀)=0 one has μ^⊤x(τ)=0, ∀τ>t₀, regardless of the input. This contradicts x(t₁)=x_t₁, in which x_t₁εIRⁿ is assumed to be an arbitrary point. Therefore, N must be full rank, and the proof is complete.

Lemma 4: If (A, b) is controllable and (sI−A)⁻¹b is strictly proper and stable, there exists cεIRⁿ such that the transfer function c^τ(sI−A)⁻¹b is minimum phase with relative degree one, i.e. all its zeros are located in the left half plane, and its denominator is one order larger than its numerator.

Proof. It follows from (8) that
$\begin{matrix} c^{⊤} (sI - A)^{-1} b = \frac{c^{⊤} N [s^{n - 1} \dots 1]^{⊤}}{d(s)}, & (10) \end{matrix}$

where NεIR^n×n is the matrix with its i^th row j^th column entry n_ij introduced in (9). We choose {overscore (c)}εIRⁿ such that {overscore (c)}^τ[s^(n−1) . . . 1]^τ is a stable polynomial of order n−1. Since (A, b) is controllable, it follows from Lemma 3 that N is full rank. Let c=(N⁻¹)^τ{overscore (c)}. Then it follows from (10) that
$c^{⊤} (sI - A)^{-1} b = \frac{\overline{c}^{⊤} [s^{n - 1} \dots 1]^{⊤}}{d(s)}$

has relative degree 1 with all its zeros in the left half plane.

III. Problem Formulation

Consider the following single-input single-output system dynamics:

{dot over (x)}(t)=Ax(t)+bu(t), x(0)=x₀
y(t)=c^τx(t), (11)

where xεIRⁿ is the system state vector (measurable), uεIR is the control signal, b, cεIRⁿ are known constant vectors, A is an unknown n×n matrix, and yεIR is the regulated output.

The control objective is to design an adaptive controller to ensure that y(t) tracks a given bounded continuous reference signal r(t) both in transient and steady state, while all other error signals remain bounded. More rigorously, the control objective can be stated as

y(s)≈D(s)r(s), (12)

where y(s), r(s) are Laplace transformations of y(t), r(t) respectively, and D(s) is a strictly proper stable LTI system that specifies the desired transient and steady state performance.

Following the convention, we introduce the following matching assumption:

Assumption 1: There exist a Hurwitz matrix A_mεIR^n×n and a vector of ideal parameters θεIRⁿ such that (A_m, b) is controllable and A_m−A=bθ^τ. We further assume that the unknown parameter θ belongs to a given compact convex set Ω, i.e. θεΩ.

In the next section, we present two equivalent control architectures that can guarantee the steady state tracking of the bounded reference input r(t). We further use one of those to develop a novel adaptive control architecture with guaranteed transient performance.

IV. MRAC and Companion Model Adaptive Controller

A. Model Reference Adaptive Controller

Let

{dot over (x)}_m(t)=A_mx_m(t)+bk_gr(t), x_m(0)=x₀
y_m(t)=c^τx_m(t) (13)

be the state space representation of the desired transfer function D(s), where x_mεIRⁿ, A_m is an n×n matrix, and k_g is a design gain. Usually A_m is chosen such that the triple (A_m, b, c) approximates D(s) so that y_m(s)≈D(s)r(s) with comparable transient and steady state specifications, subject to the matching condition in Assumption 1.

Theorem 2: [MRAC] The following direct adaptive feedback/feedforward controller

u_MRAC(t)={circumflex over (θ)}^τ(t)x(t)+k_gr(t), (14)
{circumflex over ({dot over (θ)})}(t)=ΓProj({circumflex over (θ)}(t), x(t)e^τ(t)Pb), {circumflex over (θ)}(0)={circumflex over (θ)}₀, (15)

in which {circumflex over (θ)}(t)εIRⁿ are the adaptive parameters, Proj(•,•) denotes the projection operator, e(t)=x_m(t)−x(t) is the tracking error, ΓεIR^n×n is a positive definite matrix of adaptation gains, and P=P^τ>0 is the solution of the algebraic Lyapunov equation A_m^τP+PA_m=−Q for arbitrary Q>0, ensures that
$\lim_{t \to \infty} e (t) = 0.$

A proof can be found in [3]. Indeed, the tracking error dynamics with the control law (14), (15) can be written as:

{dot over (e)}(t)=A_me(t)−b{tilde over (θ)}^τ(t)x(t), e(0)=0, {tilde over (θ)}(t)≜{circumflex over (θ)}(t)−θ. (16)

Using standard Lyapunov arguments and Barbalat's lemma, one can prove that
$\lim_{t \to \infty} e (t) = 0.$

B. Companion Model Adaptive Controller

Theorem 3: [CMAC] Given a bounded reference input signal r(t) of interest to track, the following direct adaptive feedback/feedforward controller

u_CMAC(t)={circumflex over (θ)}^τ(t)x(t)+k_gr(t), (17)
{circumflex over ({dot over (θ)})}(t)=ΓProj(x(t){tilde over (x)}^τ(t)Pb,{circumflex over (θ)}(t)),{circumflex over (θ)}(0)={circumflex over (θ)}₀, (18)

in which {circumflex over (θ)}(t)εIRⁿ are the adaptive parameters, {tilde over (x)}(t)={circumflex over (x)}(t)−x(t) is the tracking error between system dynamics in (11) and the following companion system

{circumflex over ({dot over (x)})}(t)=A_m{circumflex over (x)}(t)+b(u(t)−{circumflex over (θ)}^τ(t)x(t)),{circumflex over (x)}(0)=x₀
ŷ(t)=c^τ{circumflex over (x)}(t), (19)

ensures that
$\lim_{t \to \infty} \tilde{x} (t) = 0.$

The proof is straightforward. Indeed, subject to Assumption 1, the system dynamics in (11) can be rewritten as:

{dot over (x)}(t)=A_mx(t)+b(u(t)−θ^τx(t)), x(0)=x₀
y(t)=c^τx(t). (20)

Notice that the companion model in (19) shares the same structure with (20), while the control law in (17), (18) reduces the closed-loop dynamics of the companion model to the desired reference model in (13):

{circumflex over ({dot over (x)})}(t)=A_m{circumflex over (x)}(t)+bk_gr(t), {circumflex over (x)}(0)=x₀. (21)

We also notice that the closed-loop tracking error dynamics are the same as in (16):

{tilde over ({dot over (x)})}(t)=A_m{tilde over (x)}(t)−b{tilde over (θ)}^τ(t)x(t), {tilde over (x)}(0)=0. (22)

Since the closed-loop companion model in (21) is bounded, from standard Lyapunov arguments and Barbalat's lemma it follows that
$\lim_{t \to \infty} \tilde{x} (t) = 0.$

Thus, the companion model adaptive control architecture is equivalent to MRAC. The following remark is in order.

Remark 1: The matching assumption implies that the ideal tracking controller is given by the following linear relationship
$\begin{matrix} u_{ideal}(t) = θ^{⊤} x(t) + k_{g} r(t), \text{ where} & (23) \\ k_{g} = - \frac{1}{c^{⊤} A_{m}^{-1} b} . & (24) \end{matrix}$

The choice of k_g in (24) ensures that for constant r one has
$\lim_{t \to \infty} y (t) = r$

in both architectures.
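As a brief check of (24), using a standard final-value argument that is not spelled out in the text: the reference model (13) gives y_m(s)=c^τ(sI−A_m)⁻¹bk_gr(s), and since A_m is Hurwitz, a constant input r yields
$\lim_{t \to \infty} y_{m}(t) = \lim_{s \to 0} c^{⊤} (sI - A_{m})^{-1} b k_{g} r = - c^{⊤} A_{m}^{-1} b k_{g} r = r ,$

where the last equality substitutes k_g from (24). Since the tracking error converges to zero in both architectures, y(t) inherits the same limit.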

C. Bounded Tracking Error Signal

For both architectures MRAC and CMAC, one can prove that the tracking error can be rendered arbitrarily small by increasing the adaptive gain. The main result is given by the following lemma.

Lemma 5: Let Γ=Γ_cI, where Γ_cεIR⁺, and I is the identity matrix. For the system in (20)
$\begin{matrix} \| \tilde{x}(t) \| \leq \sqrt{\frac{\overline{θ}_{\max}}{λ_{\min}(P) Γ_{c}}}, \quad \overline{θ}_{\max} \overset{Δ}{=} \max_{θ \in Ω} \sum_{i = 1}^{n} 4 θ_{i}^{2}, \quad \forall t \geq 0, & (25) \end{matrix}$

and λ_min(P) is the minimum eigenvalue of P.

Proof. The candidate Lyapunov function, which can be used to prove asymptotic convergence of tracking error to zero in Theorems 2 and 3, is given by V({tilde over (x)}(t), {tilde over (θ)}(t))={tilde over (x)}^τ(t)P{tilde over (x)}(t)+{tilde over (θ)}^τ(t)Γ⁻¹{tilde over (θ)}(t). The following upper bound is straight-forward to derive: {tilde over (x)}^τ(t)P{tilde over (x)}(t)≦V(t)≦V(0), ∀t≧0. The projection algorithm ensures that {circumflex over (θ)}(t)εΩ, ∀t≧0, and therefore
$\begin{matrix} \max_{t \geq 0} {\tilde{θ}}^{⊤} (t) Γ^{- 1} \tilde{θ} (t) \leq \frac{{\overline{θ}}_{\max}}{Γ_{c}}, \forall t \geq 0, & (26) \end{matrix}$

where {overscore (θ)}_max is defined in (25). Since {tilde over (x)}(0)=0, then V(0)={tilde over (θ)}^τ(0)Γ⁻¹{tilde over (θ)}(0), which leads to
$\tilde{x}^{⊤}(t) P \tilde{x}(t) \leq \frac{\overline{θ}_{\max}}{Γ_{c}},$

t≧0. Since λ_min(P)∥{tilde over (x)}(t)∥²≦{tilde over (x)}^τ(t)P{tilde over (x)}(t), then
$\| \tilde{x}(t) \| \leq \sqrt{\frac{\overline{θ}_{\max}}{λ_{\min}(P) Γ_{c}}} .$
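To make the bound in (25) concrete, the following sketch solves the Lyapunov equation for a 2×2 case and evaluates the bound for several adaptation gains. It uses the A_m and Q=I of the simulation example in Section IV-D; the set Ω is not specified in the text, so the box Ω={θ: |θ_i|≦10} is an assumption made purely for illustration, as are all function names.

```python
import math

def det3(m):
    """Determinant of a 3x3 matrix given as nested lists."""
    return (m[0][0] * (m[1][1] * m[2][2] - m[1][2] * m[2][1])
            - m[0][1] * (m[1][0] * m[2][2] - m[1][2] * m[2][0])
            + m[0][2] * (m[1][0] * m[2][1] - m[1][1] * m[2][0]))

def solve3(M, rhs):
    """Solve a 3x3 linear system by Cramer's rule."""
    d = det3(M)
    sol = []
    for col in range(3):
        Mc = [row[:] for row in M]
        for i in range(3):
            Mc[i][col] = rhs[i]
        sol.append(det3(Mc) / d)
    return sol

def lyap_2x2(A, Q):
    """Solve A^T P + P A = -Q for symmetric P = [[p1, p2], [p2, p3]]."""
    (a11, a12), (a21, a22) = A
    # Entries (1,1), (1,2), (2,2) of A^T P + P A as linear functions of p1, p2, p3.
    M = [[2 * a11, 2 * a21, 0.0],
         [a12, a11 + a22, a21],
         [0.0, 2 * a12, 2 * a22]]
    p1, p2, p3 = solve3(M, [-Q[0][0], -Q[0][1], -Q[1][1]])
    return [[p1, p2], [p2, p3]]

def lambda_min_sym2(P):
    """Smallest eigenvalue of a symmetric 2x2 matrix."""
    mean = 0.5 * (P[0][0] + P[1][1])
    radius = math.hypot(0.5 * (P[0][0] - P[1][1]), P[0][1])
    return mean - radius

A_m = [[0.0, 1.0], [-1.0, -1.4]]
Q = [[1.0, 0.0], [0.0, 1.0]]
P = lyap_2x2(A_m, Q)
lam_min = lambda_min_sym2(P)

# ASSUMPTION: Omega = {theta : |theta_i| <= 10}; then theta_bar_max = 4*(10^2 + 10^2).
theta_bar_max = 4.0 * (10.0 ** 2 + 10.0 ** 2)

def error_bound(gamma_c):
    """Right-hand side of (25) for adaptation gain gamma_c."""
    return math.sqrt(theta_bar_max / (lam_min * gamma_c))

for gamma_c in (0.04, 4.0, 400.0):
    print(gamma_c, error_bound(gamma_c))
```

The bound shrinks as 1/√Γ_c, so each hundredfold increase in the gain tightens it by a factor of ten.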

D. Transient Performance

Theorems 2 and 3 state that the tracking error goes to zero asymptotically as t→∞. Lemma 5 states that the tracking error can be reduced by increasing the adaptation gain Γ_c. The following simulations demonstrate that increasing the adaptation gain Γ_c indeed leads to better transient tracking, but results in unacceptable high-frequency oscillations in the control signal. For simulation purposes, the following system parameters have been selected:
$A = \begin{bmatrix} 0 & 1 \\ -5 & 3.1 \end{bmatrix}, \; A_{m} = \begin{bmatrix} 0 & 1 \\ -1 & -1.4 \end{bmatrix}, \; b = [0 \;\; 1]^{⊤}, \; c = [1 \;\; 0]^{⊤}, \; θ = [4 \;\; -4.5]^{⊤} .$

The choice of Γ_c=0.04 and Q=I leads to desired tracking performance for the reference input r=100, FIGS. 2(a), 2(b). FIGS. 2(c) and 2(d) demonstrate that increasing the adaptive gain improves the transient tracking at the price of high frequency oscillations in the control signal.

FIGS. 3(a) and 3(b) plot the response of the adaptive controller to reference input r=400, without retuning of the adaptive controller. The response to reference input r=25 without retuning the control parameters results in slow convergence, FIGS. 4(a) and 4(b).
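The MRAC loop described above can be reproduced with a few lines of code. The sketch below is a minimal forward-Euler simulation of (11), (13), (14), (15) with the listed matrices. Several points are assumptions rather than details from the text: the projection operator is omitted, the initial conditions x(0)=x_m(0)=0 and {circumflex over (θ)}(0)=0 are chosen for the example, and the step size is picked for numerical stability at Γ_c=0.04 and r=100 only. Larger gains or reference inputs make the adaptive loop stiffer and would need a smaller step.

```python
import math

def simulate_mrac(gamma_c, r, T=10.0, dt=1e-4):
    """Forward-Euler simulation of plant (11), model (13), control (14), law (15).

    Returns (max ||e(t)||, final y, final y_m). Projection is omitted (assumption).
    """
    theta_hat = [0.0, 0.0]     # initial parameter estimate (assumption)
    x = [0.0, 0.0]             # plant state, x(0) = 0 (assumption)
    xm = [0.0, 0.0]            # reference model state, x_m(0) = x(0)
    kg = 1.0                   # (24): -1 / (c^T A_m^{-1} b) = 1 for these matrices
    Pb = (0.5, 1.0 / 1.4)      # P b, with P solving A_m^T P + P A_m = -I
    max_err = 0.0
    for _ in range(int(round(T / dt))):
        u = theta_hat[0] * x[0] + theta_hat[1] * x[1] + kg * r   # control law (14)
        e = (xm[0] - x[0], xm[1] - x[1])                         # e = x_m - x
        s = e[0] * Pb[0] + e[1] * Pb[1]                          # scalar e^T P b
        max_err = max(max_err, math.hypot(e[0], e[1]))
        dx = (x[1], -5.0 * x[0] + 3.1 * x[1] + u)                # plant with unknown A
        dxm = (xm[1], -xm[0] - 1.4 * xm[1] + kg * r)             # reference model
        dth = (gamma_c * x[0] * s, gamma_c * x[1] * s)           # (15) without Proj
        x = [x[0] + dt * dx[0], x[1] + dt * dx[1]]
        xm = [xm[0] + dt * dxm[0], xm[1] + dt * dxm[1]]
        theta_hat = [theta_hat[0] + dt * dth[0], theta_hat[1] + dt * dth[1]]
    return max_err, x[0], xm[0]

max_err, y_final, ym_final = simulate_mrac(gamma_c=0.04, r=100.0)
print(max_err, y_final, ym_final)
```

Calling `simulate_mrac` with other values of `r` (for example 25 or 400) while keeping `gamma_c` fixed is one way to explore the sensitivity to the reference input that the figures illustrate.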

These simulations carry two important messages: a) increasing the adaptation gain leads to improved transient tracking performance at the price of high-frequency oscillations in the control signal, b) every change in the reference input requires retuning of the adaptive controller to recover the transient tracking performance. Similar deterioration in the transient tracking performance can be observed if one changes the unknown parameters in the system or the initial conditions. In other words, there is no systematic way of selecting design parameters that would yield the desired transient performance for all possible changes in the system dynamics. On the other hand, the bandwidth limitations of mechanical actuators render implementation of high-frequency control signals overly challenging. Even if implemented, a high-frequency control signal can easily excite the high-frequency dynamics of the system, omitted in the modeling, and lead to destabilization.

V. ℒ₁ Adaptive Controller

In this section, we develop a novel adaptive control architecture that permits complete transient characterization for both system input and output signals. Towards that end, notice that using the matching condition in Assumption 1 the dynamics in (11) can be rewritten as in (20):

{dot over (x)}(t)=A_mx(t)−bθ^τx(t)+bu(t), x(0)=x₀
y(t)=c^τx(t). (27)

The following control structure

u(t)=u₁(t)+u₂(t), u₁(t)=−K^τx(t), (28)

where u₂(t) is the adaptive controller to be determined later, while K is a nominal design gain and can be set to zero, leads to the following partially closed-loop dynamics:

{dot over (x)}(t)=A_ox(t)−bθ^τx(t)+bu₂(t), x(0)=x₀
y(t)=c^τx(t). (29)

The choice of K needs to ensure that A_o=A_m−bK^τ is Hurwitz or, equivalently, that

H_o(s)=(sI−A_o)⁻¹b (30)

is stable. One obvious choice is K=0. For the linearly parameterized system in (29), we consider the following companion model

{circumflex over ({dot over (x)})}(t)=A_o{circumflex over (x)}(t)+b(u₂(t)−{circumflex over (θ)}^τ(t)x(t)), {circumflex over (x)}(0)=x₀
ŷ(t)=c^τ{circumflex over (x)}(t) (31)

along with the adaptive law for {circumflex over (θ)}(t):

{circumflex over ({dot over (θ)})}(t)=ΓProj(x(t){tilde over (x)}^τ(t)P_ob,{circumflex over (θ)}(t)), {circumflex over (θ)}(0)={circumflex over (θ)}₀, (32)

where {tilde over (x)}(t)={circumflex over (x)}(t)−x(t) is the tracking error, Γ=Γ_cI_n×nεIR^n×n is the matrix of adaptation gains, and P_o is the solution of the algebraic equation A_o^τP_o+P_oA_o=−Q_o, Q_o>0.

Letting

{overscore (r)}(t)={circumflex over (θ)}^τ(t)x(t), (33)

the companion model in (31) can be viewed as a low-pass system with u₂(t) being the control signal and {overscore (r)}(t) being a time-varying disturbance, which is not prevented from having high-frequency oscillations. Instead of (17), we consider the following control design for (31):

u₂(s)=C(s)({overscore (r)}(s)+k_gr(s)), (34)

where u₂(s), {overscore (r)}(s), r(s) are the Laplace transformations of u₂(t), {overscore (r)}(t), r(t), respectively, C(s) is a stable and strictly proper system with low-pass gain C(0)=1, and k_g is
$\begin{matrix} k_{g} = \lim_{s \to 0} \frac{1}{c^{⊤} H_{o}(s)} = \frac{1}{c^{⊤} H_{o}(0)} . & (35) \end{matrix}$

The complete ℒ₁ adaptive controller consists of (28), (31), (32), (34), and the closed-loop system with it is illustrated in FIG. 11.
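The structure formed by (28), (31), (32), (34) can be sketched as a short simulation. This is an illustrative sketch, not the implementation of the invention: it assumes K=0 (so A_o=A_m), a first-order filter C(s)=ω/(s+ω) with ω=160, Q_o=I, the plant parameters of Section IV-D, zero initial conditions, no projection operator, and forward-Euler integration. The values Γ_c=100 and r=1 are chosen only so that the fixed step size remains adequate.

```python
import math

def simulate_l1(gamma_c=100.0, omega=160.0, r=1.0, T=10.0, dt=1e-4):
    """Euler simulation of plant (27), companion model (31), law (32), filter (34).

    Returns (max ||x_tilde(t)||, final y, final u2). K = 0 and no Proj (assumptions).
    """
    theta_hat = [0.0, 0.0]   # initial estimate (assumption)
    x = [0.0, 0.0]           # plant state
    xh = [0.0, 0.0]          # companion model state, x_hat(0) = x(0)
    u2 = 0.0                 # state/output of C(s) = omega / (s + omega)
    kg = 1.0                 # (35) with K = 0: 1 / (c^T H_o(0)) = 1 for these matrices
    Pb = (0.5, 1.0 / 1.4)    # P_o b, with P_o solving A_o^T P_o + P_o A_o = -I
    max_xt = 0.0
    for _ in range(int(round(T / dt))):
        r_bar = theta_hat[0] * x[0] + theta_hat[1] * x[1]     # (33)
        xt = (xh[0] - x[0], xh[1] - x[1])                     # x_tilde = x_hat - x
        s = xt[0] * Pb[0] + xt[1] * Pb[1]                     # x_tilde^T P_o b
        max_xt = max(max_xt, math.hypot(xt[0], xt[1]))
        dx = (x[1], -5.0 * x[0] + 3.1 * x[1] + u2)            # plant, u = u2 since K = 0
        dxh = (xh[1], -xh[0] - 1.4 * xh[1] + u2 - r_bar)      # companion model (31)
        dth = (gamma_c * x[0] * s, gamma_c * x[1] * s)        # (32) without Proj
        du2 = -omega * u2 + omega * (r_bar + kg * r)          # low-pass law (34)
        x = [x[0] + dt * dx[0], x[1] + dt * dx[1]]
        xh = [xh[0] + dt * dxh[0], xh[1] + dt * dxh[1]]
        theta_hat = [theta_hat[0] + dt * dth[0], theta_hat[1] + dt * dth[1]]
        u2 += dt * du2
    return max_xt, x[0], u2

max_xt, y_final, u2_final = simulate_l1()
print(max_xt, y_final, u2_final)
```

The only structural difference from the CMAC law (17) is the line computing `du2`: the sum {overscore (r)}(t)+k_gr(t) is passed through the low-pass filter instead of being applied directly.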

Consider the closed-loop companion model in (31) with the control signal defined in (34). It can be viewed as an LTI system with two inputs r(t) and {overscore (r)}(t):

{circumflex over (x)}(s)={overscore (G)}(s){overscore (r)}(s)+G(s)r(s) (36)
{overscore (G)}(s)=H_o(s)(C(s)−1) (37)
G(s)=k_gH_o(s)C(s), (38)

where {circumflex over (x)}(s), {overscore (r)}(s) are the Laplace transformations of the signals {circumflex over (x)}(t), {overscore (r)}(t), respectively. We note that {overscore (r)}(t) is related to {circumflex over (x)}(t), u(t) and r(t) via nonlinear relationships.

Remark 2: Since both H_o(s) and C(s) are strictly proper stable systems, one can easily check that {overscore (G)}(s) and G(s) are strictly proper stable systems, even though 1−C(s) is only proper.

Let
$\begin{matrix} θ_{\max} = \max_{θ \in Ω} \sum_{i = 1}^{n} | θ_{i} |, & (39) \end{matrix}$

where θ_i is the i^th element of θ, and Ω is the compact set in which the unknown parameter lies. We now give the ℒ₁ performance requirement that ensures stability of the entire system and desired transient performance, as discussed later in Section VI.

ℒ₁-gain requirement: Design K and C(s) to satisfy

∥{overscore (G)}(s)∥_ℒ₁θ_max<1. (40)
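The requirement in (40) can be checked numerically for a candidate filter. The following sketch approximates ∥{overscore (G)}(s)∥_ℒ₁ by integrating the impulse response of {overscore (G)}(s)=H_o(s)(C(s)−1) from (37). The choices K=0, C(s)=ω/(s+ω), the A_m of Section IV-D, and the box Ω={θ: |θ_i|≦10} (hence θ_max=20) are assumptions for illustration; the function name is hypothetical.

```python
def gbar_l1_norm(omega, T=20.0, dt=1e-4):
    """Approximate ||G_bar(s)||_L1 for G_bar(s) = H_o(s)(C(s) - 1), K = 0.

    The impulse response of C(s) - 1 is omega*exp(-omega*t) - delta(t). The delta
    term sets the state of H_o to x(0+) = -b; the exponential is generated by z.
    G_bar is n x 1, so its L1 norm is the largest row-wise integral of |g_i(t)|.
    """
    x1, x2 = 0.0, -1.0        # x(0+) = -b with b = [0, 1]^T
    z = omega                 # z(t) = omega * exp(-omega * t)
    int1 = int2 = 0.0
    for _ in range(int(round(T / dt))):
        int1 += abs(x1) * dt
        int2 += abs(x2) * dt
        dx1 = x2
        dx2 = -x1 - 1.4 * x2 + z      # A_o = A_m from the simulation example
        dz = -omega * z
        x1, x2, z = x1 + dt * dx1, x2 + dt * dx2, z + dt * dz
    return max(int1, int2)

theta_max = 20.0              # ASSUMPTION: Omega = {theta : |theta_i| <= 10}, see (39)
norm_20 = gbar_l1_norm(20.0)
norm_160 = gbar_l1_norm(160.0)
lam = norm_160 * theta_max    # left-hand side of (40) for omega = 160

print(norm_20, norm_160, lam)
```

Raising the filter bandwidth ω lowers ∥{overscore (G)}(s)∥_ℒ₁, so (40) can be met for a larger set Ω at the cost of passing higher frequencies into the control channel.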

VI. Analysis of ℒ₁ Adaptive Controller

A. Stability and Asymptotic Convergence

Consider the following Lyapunov function candidate:

V({tilde over (x)}(t), {tilde over (θ)}(t))={tilde over (x)}^τ(t)P_o{tilde over (x)}(t)+{tilde over (θ)}^τ(t)Γ⁻¹{tilde over (θ)}(t), (41)

where P_o and Γ are introduced in (32). It follows from (29) and (31) that

{tilde over ({dot over (x)})}(t)=A_o{tilde over (x)}(t)−b{tilde over (θ)}^τ(t)x(t), {tilde over (x)}(0)=0. (42)

Hence, it is straightforward to verify from (32) that

{dot over (V)}(t)≦−{tilde over (x)}^τ(t)Q_o{tilde over (x)}(t)≦0. (43)

Notice that the result in (43) is independent of u₂(t), and, hence, Lemma 5 also holds for the ℒ₁ adaptive controller along with its adaptive law in (32). However, one cannot deduce stability from it. One needs to prove in addition that with the ℒ₁ adaptive controller the state of the companion model will remain bounded. Boundedness of the system state then will follow.

Theorem 4: Given the system in (27) and the ℒ₁ adaptive controller defined via (28), (31), (32), (34) subject to (40), the tracking error {tilde over (x)}(t) converges to zero asymptotically:
$\begin{matrix} \lim_{t \to \infty} \tilde{x}(t) = 0. & (44) \end{matrix}$

Proof. Let λ_min(P_o) be the minimum eigenvalue of P_o. From (41) and (43) it follows that

λ_min(P_o)∥{tilde over (x)}(t)∥²≦{tilde over (x)}^τ(t)P_o{tilde over (x)}(t)≦V(t)≦V(0),

implying that
$\begin{matrix} \| \tilde{x}(t) \|^{2} \leq \frac{V(0)}{λ_{\min}(P_{o})}, \quad t \geq 0. & (45) \end{matrix}$

From Definition 1,
$\| \tilde{x} \|_{ℒ_{\infty}} = \max_{i = 1, \dots, n, \; t \geq 0} | \tilde{x}_{i}(t) | .$

The relationship in (45) ensures that
$\max_{i = 1, \dots, n, \; t \geq 0} | \tilde{x}_{i}(t) | \leq \sqrt{\frac{V(0)}{λ_{\min}(P_{o})}},$

and therefore for all t>0 one has
$\| \tilde{x}_{t} \|_{ℒ_{\infty}} \leq \sqrt{\frac{V(0)}{λ_{\min}(P_{o})}} .$

Using the triangle inequality for norms implies that
$\begin{matrix} | \| \hat{x}_{t} \|_{ℒ_{\infty}} - \| x_{t} \|_{ℒ_{\infty}} | \leq \sqrt{\frac{V(0)}{λ_{\min}(P_{o})}} . & (46) \end{matrix}$

The projection algorithm in (32) ensures that {circumflex over (θ)}(t)εΩ, ∀t≧0. The definition of {overscore (r)}(t) in (33) implies that ∥{overscore (r)}_t∥_ℒ∞≦θ_max∥x_t∥_ℒ∞. Substituting for ∥x_t∥_ℒ∞ from (46) leads to the following
$\begin{matrix} \| \overline{r}_{t} \|_{ℒ_{\infty}} \leq θ_{\max} ( \| \hat{x}_{t} \|_{ℒ_{\infty}} + \sqrt{\frac{V(0)}{λ_{\min}(P_{o})}} ) . & (47) \end{matrix}$

It follows from Lemma 1 that ∥{circumflex over (x)}_t∥_ℒ∞≦∥{overscore (G)}(s)∥_ℒ₁∥{overscore (r)}_t∥_ℒ∞+∥G(s)∥_ℒ₁∥r_t∥_ℒ∞, which along with (47) gives the following upper bound
$\begin{matrix} \| \hat{x}_{t} \|_{ℒ_{\infty}} \leq \| \overline{G}(s) \|_{ℒ_{1}} θ_{\max} ( \| \hat{x}_{t} \|_{ℒ_{\infty}} + \sqrt{\frac{V(0)}{λ_{\min}(P_{o})}} ) + \| G(s) \|_{ℒ_{1}} \| r_{t} \|_{ℒ_{\infty}} . & (48) \end{matrix}$

Let
$\begin{matrix} λ = \| \overline{G}(s) \|_{ℒ_{1}} θ_{\max} . & (49) \end{matrix}$

From (40) it follows that λ<1. The relationship in (48) can be written as
$(1 - λ) \| \hat{x}_{t} \|_{ℒ_{\infty}} \leq λ \sqrt{\frac{V(0)}{λ_{\min}(P_{o})}} + \| G(s) \|_{ℒ_{1}} \| r_{t} \|_{ℒ_{\infty}};$

and hence
$\begin{matrix} \| \hat{x}_{t} \|_{ℒ_{\infty}} \leq \frac{λ \sqrt{\frac{V(0)}{λ_{\min}(P_{o})}} + \| G(s) \|_{ℒ_{1}} \| r_{t} \|_{ℒ_{\infty}}}{1 - λ} . & (50) \end{matrix}$

Since V(0), λ_min(P_o), ∥G(s)∥_ℒ₁, ∥r_t∥_ℒ∞, λ are all finite and λ<1, the relationship in (50) implies that ∥{circumflex over (x)}_t∥_ℒ∞ is finite for any t>0, and hence {circumflex over (x)}(t) is bounded. The relationship in (46) implies that ∥x_t∥_ℒ∞ is also finite for all t>0, and therefore x(t) is bounded. The adaptive law in (32) ensures that the estimates {circumflex over (θ)}(t) are also bounded. Hence, it can be checked easily from (42) that {tilde over ({dot over (x)})}(t) is bounded, and it follows from Barbalat's lemma that
$\lim_{t \to \infty} \tilde{x}(t) = 0.$

B. Reference System

In this section we characterize the reference system that the ℒ₁ adaptive controller in (28), (31), (32), (34) tracks both in transient and steady state, and this tracking is valid for both the input and the output signals of the system. Towards that end, consider the following ideal version of the adaptive controller in (28), (34):

u_ref(s)=C(s)(k_gr(s)+θ^τx_ref(s))−K^τx_ref(s), (51)

where x_ref(s) is used to denote the Laplace transformation of the state x_ref(t) of the closed-loop system. The closed-loop system (20) with the controller (51) is given in FIG. 12.

Remark 3: Notice that when C(s)=1 and K=0, one recovers the reference model of MRAC, and the controller in (51) reduces to the one in (23). If C(s)≠1 and K≠0, then the control law in (51) changes the bandwidth of u_ideal(t)=θ^τx(t)+k_gr(t) in (23).

The control law in (51) leads to the following closed-loop dynamics:

x_ref(s)=H_o(s)(k_gC(s)r(s)+(C(s)−1)θ^τx_ref(s))
y_ref(s)=c^τx_ref(s), (52)

which can be explicitly solved for x_ref(s):

x_ref(s)=(I−(C(s)−1)H_o(s)θ^τ)⁻¹H_o(s)k_gC(s)r(s).

Hence, it follows from (37) and (38) that

x_ref(s)=(I−{overscore (G)}(s)θ^τ)⁻¹G(s)r(s). (53)

Lemma 6: If ∥{overscore (G)}(s)∥_ℒ₁θ_max<1, then

(i) (I−{overscore (G)}(s)θ^τ)⁻¹is stable;
(ii) (I−{overscore (G)}(s)θ^τ)⁻¹G(s) is stable. (54)

Proof. It follows from (1) that
$\| \overline{G}(s) θ^{⊤} \|_{ℒ_{1}} = \max_{i = 1, \dots, n} ( \| \overline{G}_{i}(s) \|_{ℒ_{1}} ( \sum_{j = 1}^{n} | θ_{j} | ) ),$

where {overscore (G)}_i(s) is the i^th element of {overscore (G)}(s), and θ_j is the j^th element of θ. From (39) we have Σ_j=1ⁿ|θ_j|≦θ_max, and hence
$\begin{matrix} \| \overline{G}(s) θ^{⊤} \|_{ℒ_{1}} \leq \max_{i = 1, \dots, n} ( \| \overline{G}_{i}(s) \|_{ℒ_{1}} ) θ_{\max} = \| \overline{G}(s) \|_{ℒ_{1}} θ_{\max}, \quad \forall θ \in Ω . & (55) \end{matrix}$

Under the condition of the lemma, (55) gives ∥{overscore (G)}(s)θ^τ∥_ℒ₁<1. Thus, Theorem 1 ensures that the LTI system (I−{overscore (G)}(s)θ^τ)⁻¹ is stable. Since G(s) is stable by Remark 2, it follows that (I−{overscore (G)}(s)θ^τ)⁻¹G(s) is stable.

C. System Response and Control Signal of the ℒ₁ Adaptive Controller

Letting

r₁(t)={tilde over (θ)}^τ(t)x(t), (56)

we notice that {overscore (r)}(t) in (33) can be rewritten as {overscore (r)}(t)=θ^τ({circumflex over (x)}(t)−{tilde over (x)}(t))+r₁(t). Hence, the companion model in (36) can be rewritten as {circumflex over (x)}(s)={overscore (G)}(s)(θ^τ{circumflex over (x)}(s)−θ^τ{tilde over (x)}(s)+r₁(s))+G(s)r(s), where r₁(s) is the Laplace transformation of r₁(t) defined in (56), and further put into the form:

{circumflex over (x)}(s)=(I−{overscore (G)}(s)θ^τ)⁻¹(−{overscore (G)}(s)θ^τ{tilde over (x)}(s)+{overscore (G)}(s)r₁(s)+G(s)r(s)). (57)

It follows from (42) and (56) that {tilde over ({dot over (x)})}(t)=A_o{tilde over (x)}(t)−br₁(t), and hence

{tilde over (x)}(s)=−H_o(s)r₁(s). (58)

Using the expression of {overscore (G)}(s) from (37), the state of the companion model can be presented as

{circumflex over (x)}(s)=(I−{overscore (G)}(s)θ^τ)⁻¹(−{overscore (G)}(s)θ^τ{tilde over (x)}(s)−(C(s)−1){tilde over (x)}(s)+G(s)r(s)),

which can be further put into the form:

{circumflex over (x)}(s)=(I−{overscore (G)}(s)θ^τ)⁻¹G(s)r(s)+(I−{overscore (G)}(s)θ^τ)⁻¹(−{overscore (G)}(s)θ^τ{tilde over (x)}(s)−(C(s)−1){tilde over (x)}(s)).

Using x_ref(s) from (53) and recalling the definition of {tilde over (x)}(s)={circumflex over (x)}(s)−x(s), one arrives at

x(s)=x_ref(s)−(I+(I−{overscore (G)}(s)θ^τ)⁻¹({overscore (G)}(s)θ^τ+(C(s)−1)I)){tilde over (x)}(s). (59)

The expressions in (28), (34) and (51) lead to the following expression of the control signal

u(s)=u_ref(s)+C(s)r₁(s)+(C(s)θ^τ−K^τ)(x(s)−x_ref(s)). (60)

D. Asymptotic Performance and Steady State Error

Theorem 5: Given the system in (27) and the ℒ₁ adaptive controller defined via (28), (31), (32), (34) subject to (40), we have:
$\begin{matrix} \lim_{t \to \infty} \| x(t) - x_{ref}(t) \| = 0, & (61) \\ \lim_{t \to \infty} | u(t) - u_{ref}(t) | = 0. & (62) \end{matrix}$

Proof. Let

r₂(s)=(I+(I−{overscore (G)}(s)θ^τ)⁻¹({overscore (G)}(s)θ^τ+(C(s)−1)I)){tilde over (x)}(s). (63)

It follows from (59) that

r₂(t)=x_ref(t)−x(t). (64)

The signal r₂(t) can be viewed as the response of the LTI system

H₂(s)=I+(I−{overscore (G)}(s)θ^τ)⁻¹({overscore (G)}(s)θ^τ+(C(s)−1)I) (65)

to the bounded error signal {tilde over (x)}(t). It follows from (54) and Remark 2 that (I−{overscore (G)}(s)θ^τ)⁻¹, {overscore (G)}(s), C(s) are stable and, therefore, H₂(s) is stable. Hence, from (44) we have
$\lim_{t \to \infty} r_{2} (t) = 0.$

Let

r₃(s)=C(s)r₁(s)+(C(s)θ^τ−K^τ)(x(s)−x_ref(s)). (66)

It follows from (60) that

r₃(t)=u(t)−u_ref(t). (67)

Since the projection operator ensures that {tilde over (θ)}(t) is bounded, it follows from (42) and (44) that
$\lim_{t \to \infty} r_{1} (t) = 0.$

Since C(s) is a stable proper system, it follows from (61) that
$\lim_{t \to \infty} r_{3} (t) = 0.$

Lemma 7: Given the system in (27) and the ℒ₁ adaptive controller defined via (28), (31), (32), (34) subject to (40), if r(t) is constant, then
$\lim_{t \to \infty} y (t) = r .$

Proof. Since

y_ref(t)=c^τx_ref(t), (68)

it follows from (61) that
$\begin{matrix} \lim_{t \to \infty} (y (t) - y_{ref} (t)) = 0. & (69) \end{matrix}$

From (53) it follows that y_ref(s)=c^τ(I−{overscore (G)}(s)θ^τ)⁻¹G(s)r(s). The final value theorem ensures
$\begin{matrix} \lim_{t \to \infty} y_{ref}(t) = \lim_{s \to 0} c^{⊤} (I - \overline{G}(s) θ^{⊤})^{-1} G(s) r = c^{⊤} H_{o}(0) C(0) k_{g} r . & (70) \end{matrix}$

The definition of k_g in (35) leads to
$\lim_{t \to \infty} y (t) = r .$
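The limit in (70) can be expanded one step further; the following worked lines restate the argument and add no new assumptions. Because C(0)=1, (37) gives {overscore (G)}(0)=H_o(0)(C(0)−1)=0, so the inverse reduces to the identity at s=0. Using (38) and then (35):
$\begin{matrix} \lim_{s \to 0} c^{⊤} (I - \overline{G}(s) θ^{⊤})^{-1} G(s) r = c^{⊤} (I - 0)^{-1} k_{g} H_{o}(0) C(0) r \\ = \frac{c^{⊤} H_{o}(0)}{c^{⊤} H_{o}(0)} r = r . \end{matrix}$

Combined with (69), this gives the stated limit for y(t), and it shows that the steady-state value does not depend on the unknown θ.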

In addition to the constant reference input signal r, we need to characterize the closed-loop system response with the ℒ₁ controller to a time varying input r(t). This is analyzed in the following sections.

E. Transient Performance

We note that (A_m−bK^τ, b) is the state space realization of H_o(s). Since (A_m, b) is controllable, it can be proved easily that (A_m−bK^τ, b) is also controllable. It follows from Lemma 4 that there exists c_oεIRⁿ such that
$\begin{matrix} c_{o}^{⊤} H_{o} (s) = \frac{N_{n} (s)}{N_{d} (s)}, & (71) \end{matrix}$

where the order of N_d(s) is one more than the order of N_n(s), and both N_n(s) and N_d(s) are stable polynomials.

Theorem 6: Given the system in (27) and the ℒ₁ adaptive controller defined via (28), (31), (32), (34) subject to (40), we have:
$\begin{matrix} \| x - x_{ref} \|_{ℒ_{\infty}} \leq \frac{γ_{1}}{\sqrt{Γ_{c}}}, & (72) \\ \| y - y_{ref} \|_{ℒ_{\infty}} \leq \| c^{⊤} \|_{ℒ_{1}} \frac{γ_{1}}{\sqrt{Γ_{c}}}, & (73) \\ \| u - u_{ref} \|_{ℒ_{\infty}} \leq \frac{γ_{2}}{\sqrt{Γ_{c}}}, & (74) \end{matrix}$

where ∥c^τ∥_ℒ₁ is the ℒ₁ gain of c^τ and
$\begin{matrix} γ_{1} = \| H_{2}(s) \|_{ℒ_{1}} \sqrt{\frac{\overline{θ}_{\max}}{λ_{\min}(P_{o})}}, & (75) \\ γ_{2} = \| C(s) \frac{1}{c_{o}^{⊤} H_{o}(s)} c_{o}^{⊤} \|_{ℒ_{1}} \sqrt{\frac{\overline{θ}_{\max}}{λ_{\min}(P_{o})}} + \| C(s) θ^{⊤} - K^{⊤} \|_{ℒ_{1}} γ_{1} . & (76) \end{matrix}$

Proof. It follows from (63), (65) and Lemma 1 that ∥r₂∥_ℒ∞≦∥H₂(s)∥_ℒ₁∥{tilde over (x)}∥_ℒ∞, while Lemma 5 implies that
$\begin{matrix} \| \tilde{x} \|_{ℒ_{\infty}} \leq \sqrt{\frac{\overline{θ}_{\max}}{λ_{\min}(P_{o}) Γ_{c}}} . & (77) \end{matrix}$

Therefore,
$\| r_{2} \|_{ℒ_{\infty}} \leq \| H_{2}(s) \|_{ℒ_{1}} \sqrt{\frac{\overline{θ}_{\max}}{λ_{\min}(P_{o}) Γ_{c}}},$

which leads to (72). The upper bound in (73) follows from (72) and Lemma 2 directly. From (58) we have
$\begin{matrix} r_{3}(s) = C(s) \frac{1}{c_{o}^{⊤} H_{o}(s)} c_{o}^{⊤} H_{o}(s) r_{1}(s) + (C(s) θ^{⊤} - K^{⊤}) (x(s) - x_{ref}(s)) \\ = - C(s) \frac{1}{c_{o}^{⊤} H_{o}(s)} c_{o}^{⊤} \tilde{x}(s) + (C(s) θ^{⊤} - K^{⊤}) (x(s) - x_{ref}(s)), \end{matrix}$

where c_ois introduced in (71). It follows from (71) that
$C (s) \frac{1}{c_{o}^{⊤} H_{o} (s)} = C (s) \frac{N_{d} (s)}{N_{n} (s)},$

where N_d(s) and N_n(s) are stable polynomials and the order of N_n(s) is one less than the order of N_d(s). Since C(s) is stable and strictly proper, the complete system
$C(s) \frac{1}{c_{o}^{⊤} H_{o}(s)}$

is proper and stable, which implies that its ℒ₁ gain exists and is finite. Hence, we have
$\| r_{3} \|_{ℒ_{\infty}} \leq \| C(s) \frac{1}{c_{o}^{⊤} H_{o}(s)} c_{o}^{⊤} \|_{ℒ_{1}} \| \tilde{x} \|_{ℒ_{\infty}} + \| C(s) θ^{⊤} - K^{⊤} \|_{ℒ_{1}} \| x - x_{ref} \|_{ℒ_{\infty}} .$

Lemma 5 leads to the upper bound in (74):
$\| r_{3} \|_{ℒ_{\infty}} \leq \| C(s) \frac{1}{c_{o}^{⊤} H_{o}(s)} c_{o}^{⊤} \|_{ℒ_{1}} \sqrt{\frac{\overline{θ}_{\max}}{λ_{\min}(P_{o}) Γ_{c}}} + \| C(s) θ^{⊤} - K^{⊤} \|_{ℒ_{1}} \| x - x_{ref} \|_{ℒ_{\infty}} .$

Corollary 2: Given the system in (27) and the ℒ₁ adaptive controller defined via (28), (31), (32), (34) subject to (40), we have:
$\begin{matrix} \lim_{Γ_{c} \to \infty} (x(t) - x_{ref}(t)) = 0, \quad \forall t \geq 0, & (78) \\ \lim_{Γ_{c} \to \infty} (y(t) - y_{ref}(t)) = 0, \quad \forall t \geq 0, & (79) \\ \lim_{Γ_{c} \to \infty} (u(t) - u_{ref}(t)) = 0, \quad \forall t \geq 0. & (80) \end{matrix}$

Corollary 2 states that x(t), y(t) and u(t) follow x_ref(t), y_ref(t) and u_ref(t) not only asymptotically but also during the transient, provided that the adaptive gain is selected sufficiently large. Thus, the control objective is reduced to designing K and C(s) to ensure that the reference LTI system has the desired response D(s).

Remark 4: Notice that if we set C(s)=1, then the ℒ₁ adaptive controller degenerates into a CMAC type, which is equivalent to MRAC. In that case
$\| C(s) \frac{1}{c_{o}^{⊤} H_{o}(s)} c_{o}^{⊤} \|_{ℒ_{1}}$

cannot be finite, since H_o(s) is strictly proper. Therefore, from (76) it follows that γ₂→∞, and hence for the control signal in CMAC or MRAC one can not reduce the bound in (74) by increasing the adaptive gain.

VII. Design of the ℒ₁ Adaptive Controller

We proved that the error between the state and the control signal of the closed-loop system with the ℒ₁ adaptive controller in (27), (28), (31), (32), (34) (FIG. 11) and the state and the control signal of the closed-loop reference system in (51), (53) (FIG. 12) can be rendered arbitrarily small by choosing a large adaptive gain. Therefore, the control objective is reduced to determining K and C(s) to ensure that the reference system in (51), (53) (FIG. 12) has the desired response D(s) from r(t) to y_ref(t). Notice that the reference system in FIG. 12 depends upon the unknown parameter θ.

Consider the following signals:

y_des(s)=c^τG(s)r(s)=C(s)k_gc^τH_o(s)r(s), (81)
u_des(s)=k_gC(s)(1+C(s)θ^τH_o(s)−K^τH_o(s))r(s). (82)

We note that u_des(t) depends on the unknown parameter θ, while y_des(t) does not.

Lemma 8: For the LTI system in FIG. 12, subject to (40), the following upper bounds hold:
$\begin{matrix} { y_{ref} - y_{des} }_{𝔷_{\infty}} \leq \frac{λ}{1 - λ} { c^{⊤} }_{𝔷_{1}} { G (s) }_{𝔷_{1}} { r }_{𝔷_{\infty}}, & (83) \\ { y_{ref} - y_{des} }_{𝔷_{\infty}} \leq \frac{1}{1 - λ} { c^{⊤} }_{𝔷_{1}} { h_{3} }_{𝔷_{\infty}}, & (84) \\ { u_{ref} - u_{des} }_{𝔷_{\infty}} \leq \frac{λ}{1 - λ} { C (s) θ^{⊤} - K^{⊤} }_{𝔷_{1}} { G (s) }_{𝔷_{1}} { r }_{𝔷_{\infty}}, & (85) \\ { u_{ref} - u_{des} }_{𝔷_{\infty}} \leq \frac{1}{1 - λ} { C (s) θ^{⊤} - K^{⊤} }_{𝔷_{1}} { h_{3} }_{𝔷_{\infty}}, & (86) \end{matrix}$

where λ is defined in (49), and h₃(t) is the inverse Laplace transformation of

H₃(s)=(C(s)−1)C(s)r(s)k_gH_o(s)θ^τH_o(s). (87)

Proof. It follows from (52) and (53) that y_ref(s)=c^τ(I−{overscore (G)}(s)θ^τ)⁻¹G(s)r(s). Following Lemma 6, the condition in (40) ensures the stability of the reference LTI system. Since (I−{overscore (G)}(s)θ^τ)⁻¹is stable, then one can expand it into convergent series and further write
$\begin{matrix} \begin{matrix} y_{ref} (s) = c^{⊤} (I + \sum_{i = 1}^{\infty} {(\overline{G} (s) θ^{⊤})}^{i}) G (s) r (s) \\ = y_{des} (s) + c^{⊤} (\sum_{i = 1}^{\infty} {(\overline{G} (s) θ^{⊤})}^{i}) G (s) r (s) . \end{matrix} Let r_{4} (s) = c^{⊤} (\sum_{i = 1}^{\infty} {(\overline{G} (s) θ^{⊤})}^{i}) G (s) r (s) . Then & (88) \\ \begin{matrix} r_{4} (t) = y_{ref} (t) - y_{des} (t), & \forall t \geq 0. \end{matrix} & (89) \end{matrix}$

The relationship in (55) implies that ∥{overscore (G)}(s)θ^τ∥ custom character ₁≦λ, and it follows from Lemma 2 that
$\begin{matrix} { r_{4} }_{ℒ_{\infty}} \leq (\sum_{i = 1}^{\infty} λ^{i}) { c^{⊤} }_{ℒ_{1}} { G }_{ℒ_{1}} { r }_{ℒ_{\infty}} = \frac{λ}{1 - λ} { c^{⊤} }_{ℒ_{1}} { G }_{ℒ_{1}} { r }_{ℒ_{\infty}} . & (90) \end{matrix}$

The relationship in (88) can be equivalently written as
$y_{ref} (s) = y_{des} (s) + c^{⊤} (\sum_{i = 1}^{\infty} {(\overline{G} (s) θ^{⊤})}^{i - 1}) \overline{G} (s) θ^{⊤} G (s) r (s),$

which along with (37), (38) and (87) leads to
$\begin{matrix} y_{ref} (s) = y_{des} (s) + c^{⊤} (\sum_{i = 1}^{\infty} {(\overline{G} (s) θ^{⊤})}^{i - 1}) (C (s) - 1) C (s) r (s) k_{g} H_{o} (s) θ^{⊤} H_{o} (s) \\ = y_{des} (s) + c^{⊤} (\sum_{i = 1}^{\infty} {(\overline{G} (s) θ^{⊤})}^{i - 1}) H_{3} (s) . \end{matrix}$

Lemma 1 immediately implies that ∥r₄∥_ℒ∞≦(Σ_{i=1}^∞λ^{i−1})∥c^⊤∥_ℒ₁∥h₃∥_ℒ∞, and since Σ_{i=1}^∞λ^{i−1}=1/(1−λ), this gives (84). Comparing u_des(s) in (82) to u_ref(s) in (51), it follows that u_des(s) can be written as u_des(s)=k_gC(s)r(s)+(C(s)θ^⊤−K^⊤)x_des(s), where x_des(s)=C(s)k_gH_o(s)r(s). Therefore u_ref(s)−u_des(s)=(C(s)θ^⊤−K^⊤)(x_ref(s)−x_des(s)). Hence, it follows from Lemma 1 that ∥u_ref−u_des∥_ℒ∞≦∥C(s)θ^⊤−K^⊤∥_ℒ₁∥x_ref−x_des∥_ℒ∞. Using the same steps as for ∥y_ref−y_des∥_ℒ∞, we have
$\| x_{ref} - x_{des} \|_{ℒ_{\infty}} \leq \frac{λ}{1 - λ} \| G (s) \|_{ℒ_{1}} \| r \|_{ℒ_{\infty}}, \quad \| x_{ref} - x_{des} \|_{ℒ_{\infty}} \leq \frac{1}{1 - λ} \| h_{3} \|_{ℒ_{\infty}},$

which leads to the upper bounds in (85) and (86).
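The two scale factors in Lemma 8, λ/(1−λ) and 1/(1−λ), come from summing the geometric series Σλ^i starting at i=1 and at i=0, respectively. The following is a minimal sketch that checks this numerically; the function names are hypothetical, and the sample values λ=0.1725 and λ=0.3984 are simply the ones reported in the simulation section.

```python
def series_factor(lam: float, first_power: int, terms: int = 500) -> float:
    """Partial sum of lam**i for i = first_power, ..., first_power + terms - 1."""
    if not 0.0 <= lam < 1.0:
        raise ValueError("the geometric series converges only for 0 <= lam < 1")
    return sum(lam ** i for i in range(first_power, first_power + terms))


def bound_factors(lam: float) -> tuple:
    """Closed forms used in (83)-(86): (lam / (1 - lam), 1 / (1 - lam))."""
    return lam / (1.0 - lam), 1.0 / (1.0 - lam)


if __name__ == "__main__":
    for lam in (0.1725, 0.3984, 0.9):
        from_one, from_zero = bound_factors(lam)
        print(f"lam={lam}: sum from i=1 -> {from_one:.4f}, sum from i=0 -> {from_zero:.4f}")
```

Both factors grow without bound as λ approaches 1, which is one way to read why a small λ tightens the bounds in (83)-(86).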

Thus, the problem is reduced to finding a strictly proper stable C(s) to ensure that

(i) λ<1 or ∥h₃∥_ℒ∞ are sufficiently small, (91)
(ii) y_des(s)≈D(s)r(s), (92)

where D(s) is the desired LTI system introduced in (12). Then, Theorem 6 and Lemma 8 will imply that the output y(t) of the system in (27) and the ℒ₁ adaptive control signal u(t) will follow y_des(t) and u_des(t) both in transient and steady state with quantifiable bounds, given in (73), (74) and (83)-(86).

Notice that λ<1 is required for stability. From (81)-(86), it follows that for achieving y_des(s)≈D(s)r(s) it is desirable to ensure that λ or ∥h₃∥_ℒ∞ is sufficiently small and, in addition, that C(s)c^⊤H_o(s)≈D(s). We notice that these requirements are not in conflict with each other. Using Lemma 2, one can consider the following conservative upper bound:

λ=∥Ḡ(s)∥_ℒ₁θ_max=∥H_o(s)(C(s)−1)∥_ℒ₁θ_max≦∥H_o(s)∥_ℒ₁∥C(s)−1∥_ℒ₁θ_max. (93)

Thus, minimization of λ can be achieved from two different perspectives: i) fix C(s) and minimize ∥H_o(s)∥_ℒ₁, ii) fix H_o(s) and minimize the ℒ₁-gain of one of the cascaded systems ∥H_o(s)(C(s)−1)∥_ℒ₁, ∥(C(s)−1)r(s)∥_ℒ₁ or ∥C(s)(C(s)−1)∥_ℒ₁ via the choice of C(s).

i) High-gain design. Set C(s)=D(s). Then minimization of ∥H_o(s)∥_ℒ₁ can be achieved via high-gain feedback by choosing K sufficiently large. However, minimizing ∥H_o(s)∥_ℒ₁ via large K leads to large poles of H_o(s), which is typical for high-gain design methods. Since C(s) is a strictly proper system containing the dominant poles of the closed-loop system in k_gc^⊤H_o(s)C(s) and k_gc^⊤H_o(0)=1, we have k_gc^⊤H_o(s)C(s)≈C(s)=D(s). Hence, the system response will be y_ref(s)≈D(s)r(s). We note that with large feedback K, the performance of the ℒ₁ adaptive controller degenerates into a high-gain type. The shortcoming of this design is that the high-gain feedback K leads to a reduced phase margin and consequently affects robustness.

ii) Design without linear feedback. As in MRAC, assume that we can select A_m to ensure

$\begin{matrix} k_{g} c^{⊤} H_{o} (s) \approx D (s) . & (94) \end{matrix}$

Let

$\begin{matrix} C (s) = \frac{ω}{s + ω} . & (95) \end{matrix}$

Lemma 9: For any single input n-output strictly proper stable system H_o(s) the following is true:
$\lim_{ω \to \infty} \| (C (s) - 1) H_{o} (s) \|_{ℒ_{1}} = 0.$

Proof. It follows from (95) that
$(C (s) - 1) H_{o} (s) = \frac{- s}{s + ω} H_{o} (s) = \frac{- 1}{s + ω} s H_{o} (s) .$

Since H_o(s) is strictly proper and stable, sH_o(s) is stable and has relative degree ≧0, and hence ∥sH_o(s)∥_ℒ₁ is finite. Since
$\| \frac{- 1}{s + ω} \|_{ℒ_{1}} = \frac{1}{ω},$

it follows from (2) that
$\| (C (s) - 1) H_{o} (s) \|_{ℒ_{1}} \leq \frac{1}{ω} \| s H_{o} (s) \|_{ℒ_{1}},$

and the proof is complete.
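As an illustration of Lemma 9, the sketch below evaluates the ℒ₁ norm of (C(s)−1)H_o(s) numerically for a scalar example. The choice H_o(s)=1/(s+a) with a=2, the function names, and the trapezoidal integration are assumptions made for this example only. For this H_o, sH_o(s)=1−a/(s+a), so under the common convention that a direct feedthrough term contributes its absolute value, ∥sH_o(s)∥_ℒ₁=2 and the bound in the proof becomes 2/ω.

```python
import math


def impulse_response(t: float, a: float, w: float) -> float:
    """Impulse response of (C(s)-1)H_o(s) = -s/((s+w)(s+a)), valid for w != a."""
    return (a * math.exp(-a * t) - w * math.exp(-w * t)) / (w - a)


def l1_norm(a: float, w: float, t_end: float = 15.0, dt: float = 1e-4) -> float:
    """Trapezoidal approximation of the integral of |h(t)| over [0, t_end]."""
    steps = int(round(t_end / dt))
    total = 0.0
    prev = abs(impulse_response(0.0, a, w))
    for k in range(1, steps + 1):
        cur = abs(impulse_response(k * dt, a, w))
        total += 0.5 * (prev + cur) * dt
        prev = cur
    return total


def lemma9_bound(w: float) -> float:
    """(1/w) * ||s H_o(s)||_L1 for H_o(s) = 1/(s+a); the norm equals 2 (assumed convention)."""
    return 2.0 / w


if __name__ == "__main__":
    for w in (5.0, 20.0, 80.0, 320.0):
        print(f"w={w}: norm={l1_norm(2.0, w):.5f}, bound={lemma9_bound(w):.5f}")
```

The computed norm shrinks as the filter bandwidth ω grows and stays below 2/ω, in line with the limit stated in the lemma.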

Lemma 9 states that if one chooses k_gc^⊤H_o(s)≈D(s), then by increasing the bandwidth of the low-pass system C(s), it is possible to render ∥Ḡ(s)∥_ℒ₁ arbitrarily small. With large ω, the pole −ω due to C(s) can be neglected, and H_o(s) is the dominant reference system, leading to

y_ref(s)≈k_gc^⊤H_o(s)r(s)≈D(s)r(s).

We note that k_gc^⊤H_o(s) is exactly the reference model of the MRAC design. Therefore this approach is equivalent to mimicking MRAC, and, hence, high-gain feedback can be completely avoided.

However, increasing the bandwidth of C(s) is not the only choice for minimizing ∥Ḡ(s)∥_ℒ₁. Since C(s) is a low-pass filter, its complement 1−C(s) is a high-pass filter with its cutoff frequency approximating the bandwidth of C(s). Since both H_o(s) and C(s) are strictly proper systems, Ḡ(s)=H_o(s)(C(s)−1) is equivalent to cascading a low-pass system H_o(s) with a high-pass system C(s)−1. If one chooses the cutoff frequency of C(s)−1 larger than the bandwidth of H_o(s), then Ḡ(s) is a “no-pass” system, and hence its ℒ₁ gain can be rendered arbitrarily small. This can be achieved via higher order filter design methods. The illustration is given in FIG. 13.
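The "no-pass" behaviour of the cascade can be inspected in the frequency domain. The sketch below is only a proxy: it evaluates the peak magnitude of each channel of Ḡ(jω)=H_o(jω)(C(jω)−1) on a frequency grid, which for a stable scalar channel is a lower bound on its ℒ₁ norm rather than the ℒ₁ norm itself. The H_o(s) used is the one in (98), C(s) is the first-order filter in (95), and the function names and grid limits are assumptions for illustration.

```python
def g_bar(jw: complex, w_c: float) -> tuple:
    """Both channels of G_bar(jw) = H_o(jw) (C(jw) - 1) for H_o in (98), C = w_c/(s + w_c)."""
    den = jw * jw + 1.4 * jw + 1.0
    h_o = (1.0 / den, jw / den)          # low-pass plant channels
    c_minus_1 = -jw / (jw + w_c)         # high-pass complement of C(s)
    return tuple(h * c_minus_1 for h in h_o)


def peak_gain(w_c: float, points: int = 2000) -> float:
    """Largest channel magnitude on a log grid from 1e-2 to 1e4 rad/s."""
    peak = 0.0
    for i in range(points + 1):
        w = 10.0 ** (-2.0 + 6.0 * i / points)
        peak = max(peak, max(abs(g) for g in g_bar(complex(0.0, w), w_c)))
    return peak


if __name__ == "__main__":
    for w_c in (10.0, 40.0, 160.0):
        print(f"filter bandwidth {w_c}: peak |G_bar(jw)| = {peak_gain(w_c):.5f}")
```

Pushing the cutoff of C(s)−1 further above the bandwidth of H_o(s) lowers the peak, which is the qualitative effect described above.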

To minimize ∥h₃∥_ℒ∞, we note that ∥h₃∥_ℒ∞ can be upper bounded in two ways:

(i) ∥h₃∥_ℒ∞≦∥(C(s)−1)r(s)∥_ℒ₁∥h₄∥_ℒ∞,

where h₄(t) is the inverse Laplace transformation of H₄(s)=C(s)k_gH_o(s)θ^⊤H_o(s), and

(ii) ∥h₃∥_ℒ∞≦∥(C(s)−1)C(s)∥_ℒ₁∥h₅∥_ℒ∞,

where h₅(t) is the inverse Laplace transformation of H₅(s)=r(s)k_gH_o(s)θ^⊤H_o(s).

We note that since r(t) is a bounded signal and C(s), H_o(s) are stable proper systems, ∥h₄∥_ℒ∞ and ∥h₅∥_ℒ∞ are finite. Therefore, ∥h₃∥_ℒ∞ can be minimized by minimizing ∥(C(s)−1)r(s)∥_ℒ₁ or ∥(C(s)−1)C(s)∥_ℒ₁. Following the same arguments as above and assuming that r(t) is in the low-frequency range, one can choose the cutoff frequency of C(s)−1 to be larger than the bandwidth of the reference signal r(t) to minimize ∥(C(s)−1)r(s)∥_ℒ₁. For minimization of ∥C(s)(C(s)−1)∥_ℒ₁, notice that if C(s) is an ideal low-pass filter, then C(s)(C(s)−1)=0 and hence ∥h₃∥_ℒ∞=0. Since an ideal low-pass filter is not physically implementable, one can minimize ∥C(s)(C(s)−1)∥_ℒ₁ via an appropriate choice of C(s).

The approaches presented above ensure that C(s)≈1 in the bandwidth of r(s) and H_o(s). Therefore it follows from (81) that y_des(s)=C(s)k_gc^⊤H_o(s)r(s)≈k_gc^⊤H_o(s)r(s), which along with (94) yields y_des(s)≈D(s)r(s).

Remark 5: From Corollary 2 and Lemma 8 it follows that the ℒ₁ adaptive controller can generate a system response to track (81) and (82) both in transient and steady state if we set the adaptive gain large and minimize λ or ∥h₃∥_ℒ∞. Notice that u_des(t) in (82) depends upon the unknown parameter θ, while y_des(t) in (81) does not. This implies that for different values of θ, the ℒ₁ adaptive controller will generate different control signals (dependent on θ) to ensure uniform system response (independent of θ). This is natural, since different unknown parameters imply different systems, and to have similar response for different systems the control signals have to be different. Herein lies the advantage of the ℒ₁ adaptive controller: it controls a partially known system as an LTI feedback controller would have done if the unknown parameters were known. Finally, we note that if the term k_gC(s)C(s)θ^⊤H_o(s) is dominated by k_gC(s)K^⊤H_o(s), then the controller in (82) turns into a robust one, and consequently the ℒ₁ adaptive controller degenerates into a robust design.

Remark 6: It follows from (78) that y_ref(t), u_ref(t) approximate the unknown system's response and the ℒ₁ adaptive control signal, if the latter is implemented with large adaptive gain. It follows from (53) that y(t) approximates the response of the LTI system c^⊤(I−Ḡ(s)θ^⊤)⁻¹G(s) to r(t), hence its transient performance such as overshoot and settling time can be derived for every value of θ. If we further minimize λ or ∥h₃∥_ℒ∞, it follows from Lemma 8 that y(t) approximates the response of the LTI system C(s)c^⊤H_o(s). In this case, the ℒ₁ adaptive controller leads to uniform transient performance of y(t) independent of the value of the unknown parameter θ. It follows from (80) then that the same is true for the ℒ₁ adaptive control signal u(t). For the resulting ℒ₁ adaptive control signal one can characterize the transient specifications, such as its amplitude and rate of change, for every θ∈Ω, using u_des(t) for it.

VIII. Discussion

We use a scalar system to compare the performance of the ℒ₁ adaptive controller and a high-gain controller. Towards that end, let ẋ(t)=θx(t)+u(t), where x∈ℝ is the measurable system state, u∈ℝ is the control signal, and θ∈ℝ is unknown and belongs to a given compact set [θ_min, θ_max]. Let u(t)=−kx(t)+kr(t), leading to the following closed-loop system:

ẋ(t)=(θ−k)x(t)+kr(t).

We need to choose k>θ_max to guarantee stability. We note that both the steady state error and the transient performance depend on the unknown parameter value θ. By further introducing a proportional-integral controller, one can achieve zero steady state error. If one chooses k>>max{θ_max, θ_min}, it leads to the high-gain system
$x (s) = \frac{k}{s - (θ - k)} r (s) \approx \frac{k}{s + k} r (s) .$
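The dependence of the high-gain loop on θ can be checked with a short simulation. The following is a minimal sketch, assuming a constant reference r, zero initial state and forward-Euler integration; the function names and the sample values of θ and k are illustrative assumptions. For a constant r the closed loop ẋ=(θ−k)x+kr settles at kr/(k−θ), which approaches r only as k grows.

```python
def simulate_high_gain(theta: float, k: float, r: float = 1.0,
                       t_end: float = 20.0, dt: float = 1e-3) -> float:
    """Forward-Euler simulation of x' = (theta - k) x + k r from x(0) = 0; returns x(t_end)."""
    if k <= theta:
        raise ValueError("stability requires k > theta")
    x = 0.0
    for _ in range(int(round(t_end / dt))):
        x += dt * ((theta - k) * x + k * r)
    return x


def steady_state(theta: float, k: float, r: float = 1.0) -> float:
    """Equilibrium of the closed loop for constant r: k r / (k - theta)."""
    return k * r / (k - theta)


if __name__ == "__main__":
    for theta in (-1.0, 0.5):
        for k in (2.0, 10.0, 100.0):
            print(f"theta={theta}, k={k}: x(t_end)={simulate_high_gain(theta, k):.4f}, "
                  f"k/(k-theta)={steady_state(theta, k):.4f}")
```

The steady-state offset from r shrinks with larger k but never vanishes for θ≠0, which is why the text mentions adding integral action.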

To apply the ℒ₁ adaptive controller, let the desired reference system be
$H_{o} (s) = \frac{1}{s + 2} .$

Let u₁=−2x, k_g=2, leading to
$D (s) = \frac{2}{s + 2} .$

Choose C(s) as in (95) with large ω_n, and set the adaptive gain Γ_c large. Then it follows from Theorem 6 that
$\begin{matrix} x (s) \approx x_{ref} (s) = C (s) k_{g} H_{o} (s) r (s) \approx \frac{ω_{n}}{s + ω_{n}} \frac{2}{s + 2} r (s) \approx \frac{2}{s + 2} r (s) & (96) \\ u (s) \approx u_{ref} (s) = (- 2 + θ) x_{ref} (s) + 2 r (s) . & (97) \end{matrix}$

The relationship in (96) implies that the control objective is met, while the relationship in (97) states that the ℒ₁ adaptive controller approximates u_ref(t), which cancels the unknown θ.
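The approximation in (96) can be illustrated by comparing unit-step responses of the cascade C(s)k_gH_o(s) and of k_gH_o(s) alone, with H_o(s)=1/(s+2) and k_g=2 as above. The sketch below is an assumption-laden illustration: it uses forward-Euler integration, a unit-step reference, and hypothetical names, and it looks only at the reference system, not at the adaptive loop itself.

```python
def max_step_mismatch(w_n: float, t_end: float = 5.0, dt: float = 1e-4) -> float:
    """Largest gap between unit-step responses of C(s) k_g H_o(s) and k_g H_o(s).

    C(s) = w_n / (s + w_n), H_o(s) = 1 / (s + 2), k_g = 2; forward-Euler integration.
    """
    k_g, r = 2.0, 1.0
    v = 0.0        # state of the low-pass filter C(s) driven by k_g * r
    x = 0.0        # state of H_o(s) driven by the filtered signal
    x_ideal = 0.0  # state of H_o(s) driven directly by k_g * r
    worst = 0.0
    for _ in range(int(round(t_end / dt))):
        dv = w_n * (k_g * r - v)
        dx = -2.0 * x + v
        dx_ideal = -2.0 * x_ideal + k_g * r
        v += dt * dv
        x += dt * dx
        x_ideal += dt * dx_ideal
        worst = max(worst, abs(x - x_ideal))
    return worst


if __name__ == "__main__":
    for w_n in (10.0, 50.0, 200.0):
        print(f"w_n={w_n}: max |x_ref - x_ideal| = {max_step_mismatch(w_n):.5f}")
```

The mismatch decreases as ω_n increases, consistent with dropping the fast pole −ω_n in the last step of (96).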

IX. Simulations

Consider the same simulation example from Section IV-D. We now give the complete ℒ₁ adaptive controller for this system. We set K=0, Γ_c=40000, and implement the ℒ₁ adaptive controller following (28), (31), (32) and (34). First, we give an analysis of the ℒ₁ adaptive controller. It follows from (30) that
$\begin{matrix} H_{o} (s) = [\begin{matrix} \frac{1}{s^{2} + 1.4 s + 1} \\ \frac{s}{s^{2} + 1.4 s + 1} \end{matrix}] & (98) \end{matrix}$

and hence
$\begin{matrix} y_{des} (s) = C (s) c^{T} H_{o} (s) r (s) = \frac{1}{s^{2} + 1.4 s + 1} C (s) r (s) . & (99) \end{matrix}$

Next, we check stability of this ℒ₁ adaptive controller. It follows from (39) that θ_max=20, and ∥Ḡ∥_ℒ₁ can be calculated numerically. In FIG. 14(a), we plot
$\begin{matrix} λ = \| \overline{G} \|_{ℒ_{1}} θ_{\max} = \| \frac{1}{s^{2} + 1.4 s + 1} \frac{ω}{s + ω} \|_{ℒ_{1}} θ_{\max} & (100) \end{matrix}$

with respect to ω and compare it to 1. We notice that for ω>30, we have λ<1, and the ℒ₁-gain requirement for stability is satisfied. So, we can choose
$\begin{matrix} C (s) = \frac{160}{s + 160} & (101) \end{matrix}$

to ensure that λ<0.01, which consequently leads to improved performance bounds in (83)-(86). For ω=160, we have λ=∥Ḡ(s)∥_ℒ₁θ_max=0.1725<1, so the ℒ₁-gain requirement in (40) is indeed satisfied.

Next, we compute the bound between y_ref(t) and y_des(t) in (99). It follows from (87) that
${ h_{3} }_{𝔷_{\infty}} \leq \max_{θ \in Ω} { (C (s) - 1) C (s) k_{g} H_{o} (s) θ^{T} H_{o} (s) }_{ℒ_{1}} { r }_{ℒ_{\infty}} .$

For C(s) and H_o(s) in (101) and (98), it can be numerically verified that
$\begin{matrix} \max_{θ \in Ω} { (C (s) - 1) C (s) k_{g} H_{o} (s) θ^{T} H_{o} (s) }_{ℒ_{1}} = 0.0946, & (102) \end{matrix}$

and it follows from (84) that ∥y_ref−y_des∥ custom character _∞≦0.0946∥r∥_∞. Therefore, we can state that
$y_{ref} (s) \approx y_{des} (s) = C (s) c^{T} H_{o} (s) r (s) = \frac{1}{s^{2} + 1.4 s + 1} \frac{160}{s + 160} r (s) .$

Similarly, it follows from (86) that u_ref(t) approximates u_des(t), i.e.
$u_{ref} (s) \approx u_{des} (s) = 2 \frac{160}{s + 160} (1 + \frac{160}{s + 160} θ^{⊤} [\begin{matrix} \frac{1}{s^{2} + 1.4 s + 1} \\ \frac{s}{s^{2} + 1.4 s + 1} \end{matrix}]) r (s) .$

With large adaptive gain, it follows from Theorem 6 that y(t)≈y_ref(t), u(t)≈u_ref(t), ∀t≧0, and hence
$y (s) \approx \frac{1}{s^{2} + 1.4 s + 1} \frac{160}{s + 160} r (s) \approx \frac{1}{s^{2} + 1.4 s + 1} r (s)$ $u (s) \approx 2 \frac{160}{s + 160} (1 + \frac{160}{s + 160} θ^{⊤} [\begin{matrix} \frac{1}{s^{2} + 1.4 s + 1} \\ \frac{s}{s^{2} + 1.4 s + 1} \end{matrix}]) r (s) \approx 2 r (s) + θ^{⊤} [\begin{matrix} \frac{1}{s^{2} + 1.4 s + 1} \\ \frac{s}{s^{2} + 1.4 s + 1} \end{matrix}] r (s),$

if one just considers the dominant poles. The simulation results of the ℒ₁ adaptive controller are shown in FIGS. 15(a)-15(b) for reference inputs r=25, 100, 400, respectively. We note that it leads to scaled control input and system response for scaled reference input, as compared to MRAC in FIGS. 8(a)-10(b). FIGS. 16(a)-16(b) show the system response and control signal for reference input r(t)=100 cos(0.2t), without any retuning of the controller.

Next, we consider a higher order filter with low adaptive gain Γ_c=400,
$C (s) = \frac{3 ω^{2} s + ω^{3}}{{(s + ω)}^{3}} .$

In FIG. 14(a), we plot
$\begin{matrix} λ = { \overline{G} }_{L_{1}} θ_{\max} = { \frac{1}{s^{2} + 1.4 s + 1} \frac{3 w^{2} s + w^{3}}{{(s + w)}^{3}} }_{ℒ_{1}} θ_{\max} & (103) \end{matrix}$

with respect to ω and compare it to 1. We notice that when ω>25, we have λ<1 and the ℒ₁-gain requirement in (40) is satisfied. Letting ω=50 leads to λ=0.3984, and therefore ∥y_ref−y_des∥_ℒ∞≦0.0721∥r∥_ℒ∞. Following similar arguments as above,
$\begin{matrix} y (s) \approx y_{des} (s) = \frac{1}{s^{2} + 1.4 s + 1} \frac{3 ω^{2} s + ω^{3}}{{(s + ω)}^{3}} r (s) \approx \frac{1}{s^{2} + 1.4 s + 1} r (s), & (104) \end{matrix}$

if one just considers the dominant poles. The simulation results of the ℒ₁ adaptive controller are shown in FIGS. 11(a)-11(b), for reference inputs r=25, 100, 400, respectively. We note that it again leads to scaled control input and system response for scaled reference input, as compared to MRAC in FIGS. 8(a)-10(b). In addition, we notice that this performance is achieved by a much smaller adaptive gain as compared to the design with the first order C(s). FIGS. 18(a)-18(b) show the system response and control signal for reference input r(t)=100 cos(0.2t), without any retuning of the controller.

Remark 7: The simulations pointed out that with higher order filter C(s) one could use relatively small adaptive gain. While a rigorous relationship between the choice of adaptive gain and the order of filter cannot be derived, an insight into this can be gained from the following analysis. It follows from (27), (28) and (34) that

x(s)=G(s)r(s)+H_o(s)θ^⊤x(s)+H_o(s)C(s)r̄(s), (105)

while the companion model in (36) can be rewritten as

x̂(s)=G(s)r(s)+H_o(s)(C(s)−1)r̄(s).

We note that r̄(t) is divided into two parts. Its low-frequency component C(s)r̄(s) is what the system in (105) gets, while the complementary high-frequency component (C(s)−1)r̄(s) goes into the companion model. If the bandwidth of C(s) is large, then it can suppress only the high frequencies in r̄(t), which appear only in the presence of large adaptive gain. A properly designed higher order C(s) can filter more effectively, with reduced tailing effects, and hence can generate a similar λ with smaller bandwidth. This further implies that similar performance can be achieved with smaller adaptive gain.

Note the following references referred to in the above discussion: [1] is P. Ioannou and J. Sun, Robust Adaptive Control (Prentice Hall, 1996); [2] is H. K. Khalil, Nonlinear Systems (Prentice Hall, Englewood Cliff, N.J., 2002); [3] is J.-J. E. Slotine and W. Li, Applied Nonlinear Control (Prentice Hall, Englewood Cliffs, N.J., 1991).

While the invention has been described in terms of preferred embodiments, those skilled in the art will recognize that the invention can be practiced with modification within the spirit and scope of the appended claims.
