Low-pass adaptive/neural controller device and method with improved transient performance

Information

  • Patent Application
  • 20060217819
  • Publication Number
    20060217819
  • Date Filed
    March 23, 2006
    18 years ago
  • Date Published
    September 28, 2006
    18 years ago
Abstract
A novel 1 adaptive/neural control architecture provides a device and method that permits fast adaptation and yields guaranteed transient response simultaneously for both the system's input and output signals, in addition to providing asymptotic tracking. The main feature of the invention is rapid adaptation with a guaranteed low frequency control signal. The ability to adapt rapidly ensures the desired transient performance for both the system's input and output signals, simultaneously, while a low-pass filter in the feedback loop attenuates the high-frequency components in the control signal.
Description
BACKGROUND OF THE INVENTION

1. Field of the Invention


The present invention generally relates to the control of systems with time-varying unknown parameters, time-varying bounded disturbances, and matched uncertain nonlinearities, and in particular to controllers designed to adapt to parameters that vary in uncertain ways.


2. Background Description


The conventional Model Reference Adaptive Controller (MRAC) was developed to control linear systems in the presence of parametric uncertainties. The development of this architecture has been facilitated by the Lyapunov stability theory that defines sufficient conditions for stable performance, but offers no means for characterizing the system's input/output performance during the transient phase. System uncertainties during the transient phase have led to unpredictable and/or undesirable situations, involving control signals of high frequency or large amplitudes, large transient errors or slow convergence rate of tracking errors, to name a few. Application of adaptive/neural controllers has therefore been largely restricted. Large deviation implies poor transient performance. It could even lead to instability, especially for neural controllers where the signals are required to be inside a compact set where approximation is conducted.


One such situation is bandwidth limitation in the control channel, especially in mechanical actuators. A high frequency control signal is impractical as it can lead to destabilization of the system. Another circumstance where use of conventional adaptive/neural controllers is limited is where the system models are mostly based on low-frequency approximations. A high frequency control signal can easily excite the omitted high frequency dynamics of the system and lead to unpredictable consequences.


The transient performance of adaptive/neural controllers depends on unknown parameters, reference input, and adaptive gain in a nonlinear way. Extensive tuning of adaptive gains and Monte-Carlo runs have been the primary methods for enabling the transition of adaptive control solutions to real world applications. This approach has rendered verification and validation of adaptive controllers overly challenging. Moreover, there is no systematic way of selecting design parameters that would yield the desired transient performance for all possible scenarios.


As compared to the linear systems theory, several important aspects of transient performance analysis seem to be missing in the prior art. First, all the bounds in the prior art are computed for tracking errors only, and not for control signals. Although the latter can be deduced from the former, it is straightforward to verify that the ability to adjust the former may not extend to the latter in the case of nonlinear control laws. Second, since the purpose of adaptive control is to ensure stable performance in the presence of modeling uncertainties, one needs to ensure that the changes in reference input and unknown parameters due to possible faults or unexpected uncertainties do not lead to unacceptable transient deviations or oscillatory control signals, implying that a retuning of adaptive parameters is required. Finally, one needs to ensure that whatever modifications or solutions are suggested for performance improvement of adaptive controllers, they are not achieved via high-gain feedback.


SUMMARY OF THE INVENTION

It is therefore an object of the present invention to provide an adaptive/neural controller design with improved transient performance.


Another object of the invention is to provide a systematic way of selecting design parameters that would yield desired transient performance for all possible scenarios.


A further object of the invention is to make it easier to verify and validate adaptive controllers.


We invented a novel custom character1 adaptive/neural control architecture that permits fast adaptation and yields guaranteed transient response simultaneously for both the system's input and output signals, in addition to providing asymptotic tracking. The main feature of the invention is rapid adaptation with a guaranteed low frequency control signal. The ability to adapt rapidly ensures the desired transient performance for both the system's input and output signals, simultaneously, while a low-pass filter in the feedback loop attenuates the high-frequency components in the control signal.


The custom character1 adaptive/neural controller can be applied to systems that have time-varying unknown parameters with an arbitrary rate of variation, and also ensures the desired transient performance for both input and output signals of the system. We prove that by increasing the adaptation gain one can achieve arbitrarily close transient and asymptotic tracking for both input and output signals simultaneously.


No matter how configured, in a high-gain feedback controller or MRAC large gain or adaptive gain leads to reduced phase or time-delay margins. We demonstrate that increasing the adaptative gain will not hurt the time-delay margin of the closed-loop system with the custom character1 adaptive control architecture, in contrast to conventional adaptive or feedback schemes.


An aspect of the invention is A low-pass adaptive/neural controller for a dynamic system, comprising a reference input for the dynamic system, the dynamic system being described by a dynamic model subject to time-varying unknown parameters and an unknown time-varying disturbance, there being a measured output of the dynamic system. A companion model, described by the dynamic model, has adaptive estimates substituted in the dynamic model for the time-varying unknown parameters and the unknown time-varying disturbance, there being a computed output for the companion model. The system has means for generating a control signal to be applied to the dynamic system and the companion model so that the measured output tracks the reference input, the generating means having a low-pass filter to attenuate high frequency components in the control signal. In a further aspect of the invention, the low-pass filter is a stable transfer function and is applied by the generating means so that both the control signal and a difference between the measured output and the reference input achieve a target stability within a transient period.




BRIEF DESCRIPTION OF THE DRAWINGS

The foregoing and other objects, aspects and advantages will be better understood from the following detailed description of a preferred embodiment of the invention with reference to the drawings, in which:



FIG. 1 is a schematic block diagram of a closed loop system with an custom character1 adaptive controller.



FIG. 2 is a schematic block diagram of a Linear Time Invariant (LTI) closed loop system.



FIG. 3 is a graph showing application of the custom character1 gain stability requirement for a control signal objective in an exemplar dynamical system.



FIG. 4
a is a graph showing the system state vector, the companion model of the invention, and a bounded reference signal; FIG. 4b is a graph showing the time history of the control signal where a time varying disturbance signal is

Υ(t)=sin(πt).



FIG. 5
a is a graph showing the system state vector, the companion model of the invention, and a bounded reference signal; FIG. 5b is a graph showing the time history of the control signal where a time varying disturbance signal is

σ(t)=cos(x1(t))+2 sin(10t)+cos(15t).



FIG. 6
a is a graph showing the system state vector, the companion model of the invention, and a bounded reference signal; FIG. 6b is a graph showing the time history of the control signal where a time varying disturbance signal is

σ(t)=cos(x1(t))+2 sin(100t)+cos(150t).



FIG. 7 is a schematic of interconnection LTI systems.



FIGS. 8
a and 8b are graphs of MRAC performance and time histories, respectively, for r=100 and Γc=0.04.



FIGS. 8
c and 8d are graphs of MRAC performance and time histories, respectively, for r=100 and Γc=0.2.



FIGS. 9
a and 9b are graphs of MRAC performance and time histories, respectively, for r=400 and Γc=0.04.



FIGS. 10
a and 10b are graphs of MRAC performance and time histories, respectively, for r=25 and Γc=0.04.



FIG. 11 is a schematic showing a closed loop system with an custom character1 adaptive controller.



FIG. 12 is a schematic showing a closed loop reference LTI system.



FIGS. 13
a and 13b are graphs showing the equivalent results of cascading low-pass and high-pass systems.



FIGS. 14
a and 14b are graphs showing application of the custom character1 gain stability requirement for differing adaptive gain values.



FIGS. 15
a and 15b are graphs showing simulation results and time histories, respectively, for an custom character1 adaptive controller.



FIGS. 16
a and 16b are graphs showing performance and time history, respectively, for an custom character1 adaptive controller.



FIGS. 17
a and 17b are graphs showing performance results and time histories, respectively, for an custom character1 adaptive controller.



FIGS. 18
a and 18b are graphs showing performance and time history, respectively, for an custom character1 adaptive controller.




DETAILED DESCRIPTION OF A PREFERRED EMBODIMENT OF THE INVENTION

Problem Formulation


Consider the following system dynamics:

{dot over (x)}(t)=Amx(t)+bu(t)+θτ(t)x(t)+σ(t)),
y(t)=cτx(t), x(0)=x0,  Eq.1

where xεIRn is the system state vector (measurable), uεIR is the control signal, yεIR is the regulated output, b,cεIRn are known constant vectors, Am is a known n×n matrix, ωεIR is known, θ(t)εIRn is a vector of time-varying unknown parameters, while σ(t)εIR is a time-varying disturbance. Without loss of generality, we assume that

θ(t)εΘ, |σ(t)≦Δ, t≧0,  Eq.2

where Θ is a known compact set and ΔεIR+ is a known (conservative) custom character bound of σ(t).


The control objective is to design a full-state feedback adaptive controller to ensure that y(t) tracks a given bounded reference signal r(t) both in transient and steady state, while all other error signals remain bounded.


We further assume that θ(t) and σ(t) are continuously differentiable and their derivatives are uniformly bounded:

∥{dot over (θ)}(t)∥2≦dθ<∞, |{dot over (σ)}(t)|≦dσ<∞, ∀t≧0,  Eq.3

where


∥•∥2 denotes the 2-norm, while the numbers dθ, dσ can be arbitrarily large.



custom character
1 Adaptive Controller


In this section, we develop a novel adaptive control architecture for the system in Eq.1 that permits complete transient characterization for both u(t) and x(t).


The elements of custom character1 adaptive controller are introduced next:


Companion Model: We consider the following companion model:

{circumflex over ({dot over (x)})}(t)=Am{circumflex over (x)}(t)+b(ωu(t)+{circumflex over (θ)}τ(t)x(t)+{circumflex over (σ)}(t)),
ŷ(t)=cτ{circumflex over (x)}(t), {circumflex over (x)}(0)=x0,  Eq.4

which has the same structure as the system in Eq.1. The only difference is that the unknown parameters

θ(t), σ(t)

are replaced by their adaptive estimates

{circumflex over (θ)}(t), {circumflex over (σ)}(t)

that are governed by the following adaptation laws.


Adaptive Laws: Adaptive estimates are given by:

{circumflex over ({dot over (θ)})}(t)=ΓθProj(−x(t){tilde over (x)}τ(t)Pb,{circumflex over (θ)}(t)),{circumflex over (θ)}(0)={circumflex over (θ)}0  Eq.5
{circumflex over ({dot over (σ)})}(t)=ΓσProj(−{tilde over (x)}τ(t)Pb,{circumflex over (σ)}(t)),{circumflex over (σ)}(0)={circumflex over (σ)}0  Eq.6

where

{tilde over (x)}(t)={circumflex over (x)}(t)−x(t)

is the error signal between the state of the system and the companion model,

ΓθcIn×nεIRn×nσc

are adaptation gains with

ΓcεIR+,

and P is the solution of the algebraic equation

AmτP+PAm=−Q, Q>0.


Control Law: The control signal is generated by:
u(s)=C(s)r_(s),whereEq.7r_(t)=kgr(t)-θ^(t)x(t)-σ^(t)ω,Eq.8kg=1cAm-1b.Eq.9

And where


kεIR+ is a feedback gain, while C(s) is any strictly proper stable transfer function with low-pass gain C(0)=1. One simple choice is
C(s)=ωks+ωk.Eq.10

Stability Requirement


Further, let
L=maxθ(t)Θi=1nθi(t),Eq.11

where θi(t) is the ith element of θ(t), Θ is the compact set defined in (2). We now state the custom character1 performance requirement that ensures stability of the entire system and desired transient performance.



custom character
1-gain stability requirement: Design C(s) to ensure that

G(s)∥custom character1L<1,  Eq.12

where

G(s)=(sI−Am)−1b(1−C(s)).


The complete custom character1 adaptive controller consists of Eq.4, Eq.5, Eq.6 and Eq.7 subject to custom character1-gain stability requirement in Eq.12.


The custom character1 adaptive controller is illustrated in FIG. 1. The system to be controlled 110 is coupled with companion model 120. Companion model 120 has the same structure as system 110, but the unknown parameters are replaced by their adaptive estimates 130. Controller 140 generates a control signal u 145.


In case of constant θ(t), the stability requirement of the custom character1 adaptive controller can be simplified. For the specific choice of C(s) in Eq.10, the stability requirement of custom character1 adaptive controller is reduced to
Ag=[Am+bθbω-kθ-kω]Eq.13

being Hurwitz for all θεΘ.


Closed-Loop Reference System


We now consider the following closed-loop LTI reference system with its control signal and system response being defined as follows:
x.ref(t)=Amxref(t)+b(ωuref(t)+θ(t)xref(t)+σ(t)),Eq.14uref(s)=C(s)r_ref(s)ω,xref(0)=x0,Eq.15yref(t)=cxref(t),Eq.16

where


{overscore (r)}ref(s) is the Laplace transformation of the signal

{overscore (r)}ref(t)=−θτ(t)xref(t)−σ(t)+kgr(t),

And kg is introduced in Eq.9.


Tracking Performance


Let

H(s)=(sI−Am)−1b.  Eq.17

It is proved that there exists

coεIRn

such that
coH(s)=Nn(s)Nd(s),Eq.18

where the order of Nd(s) is one more than the order of Nn(s), and both Nn(s) and Nd(s) are stable polynomials.


Theorem 1: Given the system in (1) and the custom character1 adaptive controller defined via Eq.4, Eq.5, Eq.6 and Eq.7 subject to Eq.12, we have:

x−xrefcustom character≦γ1,  Eq.19
u−urefcustom character≦γ2,  Eq.20

where
γ1=C(s)11-H(s)(1-C(s))1Lθmλmax(P)Γc,Eq.21γ2=C(s)ω1Lγ1+C(s)ω1coH(s)co1θmλmax(P)Γc.Eq.22

Corollary 1: Given the system in Eq.1 and the custom character1 adaptive controller defined via Eq.4, Eq.5, Eq.6 and Eq.7 subject to Eq.12, we have:
limΓc(x(t)-xref(t))=0,t0,Eq.23limΓc(u(t)-uref(t))=0,t0.Eq.24


Thus, the tracking error between x(t) and xref(t), as well between u(t) and uref(t), is uniformly bounded by a constant inverse proportional to Γc. This implies that during the transient one can achieve arbitrarily close tracking performance for both signals simultaneously by increasing Γc.


Design Guidelines


We note that the control law uref(t) in the closed-loop reference system, which is used in the analysis of custom character norm bounds, is not implementable since its definition involves the unknown parameters. Theorem 1 ensures that the custom character1 adaptive controller approximates uref(t) both in transient and steady state. So, it is important to understand how these bounds can be used for ensuring uniform transient response with desired specifications. We notice that the following ideal control signal
uideal(t)=kgr(t)-θ(t)xref(t)-σ(t)ωEq.25

Is the one that leads to the desired system response:

{dot over (x)}ref(t)=Amxref(t)+bkgr(t)  Eq.26
yref(t)=cτxref(t)  Eq.27

by cancelling the uncertainties exactly. In the closed-loop reference system Eq.14-Eq.16, uideal(t) is further low-pass filtered by C(s) in Eq.15 to have a guaranteed low-frequency range. Thus, the reference system in Eq.14-Eq.16 has a different response as compared to Eq.26, Eq.27 with Eq.25. It should be noted that C(s) may be selected to ensure that in case of constant θ the response of xref(t), uref(t), can be made as close as possible to Eq.26 with Eq.25. In case of fast varying θ(t), it is obvious that the bandwidth of the controller needs to be matched correspondingly.


Time-Delay Margin Analysis


We consider the following LTI system with output measurement delay:
ζl(s)=11-C(s)(rb(s)-rf(s)),rf(s)=C(s)(1+θH_(s))ζld(s),Eq.28

where rb(s) is the Laplace transformation of bounded signal rb(t). The block-diagram of the closed-loop system in Eq.28 is shown in FIG. 2.


The open-loop transfer function of the system in Eq.28 is:

Ho(s)=C(s)(1+θτ{overscore (H)}(s))/(1−C(s))  Eq.29

whose phase margin P (Ho(s)) can be derived easily from its Bode plot. The time-delay margin of the open-loop transfer function is given by:

T (Ho(s))=P (Ho(s))/ωc  Eq.30

where P (Ho(s)) is the phase margin of the open-loop system Ho(s), and ωc is the cross-over frequency of Ho(s).


With regard to the time-delay margin of the closed-loop custom character1 adaptive controller, we have:


Theorem 2: Given the system in (1) and the custom character1 adaptive controller defined via Eq.4, Eq.5-Eq.6 and Eq.7 subject to Eq.12, where Γc and Δ are large enough, the closed-loop adaptive system is stable in the presence of time delay τ in its output if τ<T (Ho(s)), where T (Ho(s)) is defined in Eq.30.


Novel Features


This new adaptive controller generates low-pass control signals. It has constructive design technique for ensuring desired bandwidth of the generated control signals. Thus, it can meet the bandwidth limitations of the actuators. It has improved transient performance. Using fast adaptation, it guarantees desired transient performance for both control signal and system response. It enables time-delay margin analysis. It has the ability to ensure convergence of tracking error to zero in the steady state performance. This new architecture permits faster rates of adaptation without generating high-frequency control signals and without destabilizing the system. It has a proof for stable performance, which is a basic requirement for every control system design. We define a new stability criterion which gives a systematic design algorithm for required specifications.


All the discussions above apply to neural adaptive controllers and higher dimensional systems, as well.


Variations


The above described architecture may be varied in several ways without departing from the spirit of the invention.


For example, if the objective is only to get rid of high frequency oscillation in the control signal, the control signal may be filtered without setting a large adaptive gain.


In another variation, the low pass filter may be applied to all or only a part of the signal


{overscore (r)}(t) as in (31),
u(s)=kgr(t)+C(s)r_(s),whereEq.31r_(t)=-θ^(t)x(t)-σ^(t)ω.Eq.32

In this variation the closed-loop reference system must also be modified. However, similar results related to transient performance can be obtained.


In a further variation, other signals can be used for scaled reference input r(t), depending on your tracking objective.


Another variation is to express the companion model and control law in other equivalent forms.


Extensions


Detailed results and proof of the custom character1 adaptive controller can be found in the attached papers. Based on custom character1 adaptive controller, custom character1 neural controller can be established which guarantees transient performance with its details in attached paper. We also note that all these results can be extended into Multiple Input Multiple Output systems easily.


Results Demonstration


As an illustrative example, consider a single-link robot arm which is rotating on a vertical plane. The system dynamics are given by:
Iq¨(t)+MgLcosq(t)2+F(t)q.(t)+F1(t)q(t)+σ_(t)=u(t),Eq.33

where


q(t) and {dot over (q)}(t) are measured angular position and velocity, respectively, u(t) is the input torque, I is the given moment of inertia, M is the unknown mass, L is the unknown length, F(t) is an unknown time-varying friction coefficient, F1(t) is position dependent external torque, and

    • {overscore (σ)}(t) is an unknown bounded disturbance. The control objective is to design u(t) to achieve tracking of the bounded reference input r(t) by q(t). Let

      x=[q{dot over (q)}]τ.

      The system in (33) can be presented in the state-space form as:
      x.(t)=Ax(t)+b(u(t)I+MgLcos(x1(t))2I+σ(t)I+F1(t)Ix1(t)+F(t)Ix2(t)),x(0)=x0,y(t)=cx(t),Eq.34

      Where X0 is the initial condition,
      A=[0100],b=[01],c=[10].Eq.35

      The system can be further put into the form:
      x.(t)=Amx(t)+b(ωu(t)+θ(t)x(t)+σ(t)),y(t)=cx(t),x(0)=x0,whereAm=[01-1-1.4],b=[01],c=[10],w=1I,θ(t)=[1+F1(t)I1.4+F(t)I],σ(t)=MgLcos(x1(t))2I+σ(t)I.Eq.36

      Let ω=1, and the unknown control effectiveness, time-varying parameters and disturbance be given by:

      θ(t)=[2+cos(πt)2+0.3 sin(πt)+0.2 cos(2t)]τ;
      σ(t)=sin(πt)  Eq.37

      so that the compact sets can be conservatively chosen as

      Θ=[−10,10],Δ=[−10,10].  Eq.38


For implementation of the custom character1 adaptive controller Eq.4, Eq.5-Eq.6 and Eq.7, we need to verify the custom character1 stability requirement in Eq.12. Letting

C(s)=ωα/(s+ωα),

we have
G(s)=ωαs+ωαH(s),whereEq.39H(s)=[1s2+1.4s+1ss2+1.4s+1].Eq.40


We can check easily that for our selection of compact sets in Eq.38, the resulting L=20 in Eq.11. As shown in FIG. 3, a plot 320 of

    • ∥G(s)∥custom character1L as a function of ωk and compare it to 1 (item 310). We notice that for ωk>30, we have

      G(s)∥custom character1L<1

      Finally, we set the adaptive gain as Γc=10000.


Turning now to FIGS. 4a and 4b, we see the simulation results of the custom character1 adaptive controller for the reference input r=cos(πt). FIG. 4a shows graphs of the system state vector 410, the companion model 420 of the invention, and the bounded reference signal 430. FIG. 4b is a graph showing the time history of the control signal u(t) where a time varying disturbance signal is

σ(t)=sin(πt).


Next, we consider a different disturbance signal:

σ(t)=cos(x1(t))+2 sin(10t)+cos(15t).


The simulation results are shown in FIGS. 5a and 5b. FIG. 5a shows graphs of the system state vector 510, the companion model 520 of the invention, and the bounded reference signal 530. FIG. 5b is a graph showing the time history of the control signal u(t).


Finally, we consider much higher frequencies in the disturbance:

σ(t)=cos(x1(t))+2 sin(100t)+cos(150t).


The simulation results are shown in FIGS. 6a and 6b. FIG. 6a shows graphs of the system state vector 610, the companion model 620 of the invention, and the bounded reference signal 630. FIG. 6b is a graph showing the time history of the control signal u(t).


We note that the custom character1 adaptive controller guarantees smooth and uniform transient performance in the presence of different unknown nonlinearities and time-varying disturbances. The controller frequencies are exactly matched with the frequencies of the disturbance that it is supposed to cancel out. We also notice that the system state vector signal

x1(t)

and the companion model signal

{circumflex over (x)}1(t)

are almost the same in FIGS. 4a, 5a and 6a.


We will now present an implementation of the invention, providing further detail and adaptations.


I. Introduction

As described above, transient performance in the implementation can be characterized both for the system input and output signals. To achieve this, a Companion Model Adaptive Control (CMAC) architecture is introduced and its equivalence to MRAC is shown. The difference between CMAC and MRAC is in definition of the error signal for adaptive laws, which consequently allows for incorporation of a low-pass filter in the feedback loop of CMAC and enables us to enforce the desired transient performance by increasing adaptation gain. For proof of asymptotic stability, the custom character1 gain of a cascaded system, comprised of this filter and the closed-loop desired transfer function, is required to be less than the inverse of the upper bound on the norm of unknown parameters used in projection based adaptation laws. Thus, with the low-pass filter in the loop, the custom character1 adaptive controller is guaranteed to stay in the low-frequency range even in the presence of high adaptive gains and large reference inputs. The ideal (non-adaptive) version of this custom character1 adaptive controller is used along with the main system dynamics to define a closed-loop reference system, which gives an opportunity to estimate performance bounds in terms of custom character norms for both system's input and output signals as compared to the same signals of this reference system. These bounds immediately imply that the transient performance of the control signal in MRAC cannot be characterized. Design guidelines for selection of the low-pass filter ensure that the closed-loop reference system approximates the desired system response, despite the fact that it depends upon the unknown parameter. Thus, the desired tracking performance is achieved by systematic selection of the low-pass filter, which in its turn enables fast adaptation, as opposed to high-gain designs leading to increased control efforts.


The paper is organized as follows. Section II states some preliminary definitions, and Section III gives the problem formulation. In Section IV, we recall the conventional MRAC design and introduce the Companion Model Adaptive Controller (CMAC), which is a reparameterization of MRAC. In Section V, a new custom character1 adaptive controller is presented. Stability and tracking results of the custom character1 adaptive controller are presented in Section VI. Comparison of the performance of custom character1 adaptive controller, MRAC and the high gain controller are discussed in section VIII. In section IX, simulation results are presented, while Section X concludes the paper.


II. Preliminaries

In this Section, we recall some basic definitions and facts from linear systems theory.


Definition 1: For a signal ξ(t), t≧0, ξεIRn, its truncated custom character norm and custom character norm are defined as
ξt=maxi=1,,n(sup0τtξi(τ)),ξ=maxi=1,,n(supτ0ξi(τ)),

where ξi is the ith component of ξ.


Definition 2: The custom character1 gain of a stable proper single-input single-output system H(s) is defined to be ∥H(s)∥custom character1=∫0|h(t)|dt, where h(t) is the impulse response of H(s), computed via the inverse Laplace transform
h(t)=12πiα-iα+iH(s)ests,t0,

in which the integration is done along the vertical line x=α>0 in the complex plane.


Proposition: A continuous time LTI system (proper) with impulse response h(t) is stable if and only if ∫0|h(τ)|dτ<∞. A proof can be found in [1] (page 81, Theorem 3.3.2).


Definition 3: For a stable proper m input n output system H(s) its custom character1 gain is defined as
H(s)1=maxi=1,,n(j=1mHij(s)1),(1)

where Hij(s) is the ith row jth column element of H(s).


The next lemma extends the results of Example 5.2. ([2], page 199) to general multiple input multiple output systems.


Lemma 1: For a stable proper multi-input multi-output (MIMO) system H(s) with input r(t)εIRm and output x(t)εIRn, we have

xtcustom character≦∥H∥1∥rtcustom character, ∀t>0.  (2)

Proof. Let xi(t) be the ith element of x(t), rj(t) be the jth element of r(t), Hij(s) be the ith row jth element of H(s), and Hij(t) be the impulse response of Hij(s). Then for any t′ε[0, t], we have
xi(t)=0t(j=1mhij(t-τ)rj(τ))τ.(3)

From (3) it follows that
xi(t)0t(j=1mhij(t-τ)rj(τ))τ0t(j=1mhij(t-τ))τ(maxj=1,,msup0τtrj(τ))j=1m(0thij(τ)τ)(maxj=1,,msup0τtrj(τ)),

and hence ∥xitcustom character≦(Σj=1m∥Hij(s)∥custom character1)∥rtz,900 . It follows from (1) that
xt=maxi=1,,nxitmaxi=1,,n(j=1mHij(s)1)rt=H(s)1rt

for any t≧0. The proof is complete.


Corollary 1: For a stable proper MIMO system H(s), if the input r(t)εIRm is bounded, then the output x(t)εIRn is also bounded as ∥x∥custom character≦∥H(s)∥custom character1∥τ∥custom character.


Lemma 2: For a cascaded system H(s)=H2(s)H1(s), where H1(s) is a stable proper system with m inputs and l outputs and H2(s) is a stable proper system with l inputs and n outputs, we have ∥H(s)∥custom character1≦∥H2(s)∥custom character1∥H1(s)∥custom character1.


Proof. Let y(t)εIRn be the output of H(s)=H1(s)H2(s) in response to input r(t)εIRm. It follows from Lemma 1 that

y(t)∥≦∥y∥≦∥H2(s)∥custom character1∥H1(s)∥custom character1∥r∥  (4)

for any bounded r(t). Let Hi(s), i=1, . . . , n be the ith row of the system H(s). It follows from (1) that there exists i such that

H(s)∥custom character1=∥Hi(s)∥custom character1.  (5)

Let hij(t) be the jth element of the impulse response of the system Hi(s). For any T, let

rj(t)=sgnhij(T−t), tε[0,T], ∀j=1, . . . , m.  (6)

It follows from Definition 1 that ∥r∥custom character=1, and hence ∥y(t)∥≦∥H2(s)∥custom character1∥H1(s)∥custom character1, ∀t≧0. For r(t) satisfying (6), we have
y(T)=t=0Tj=1mhij(T-t)rj(t)t=t=0Tj=1mhij(T-t)t=j=1m(t=0Thij(t)t).

Therefore, it follows from (4) that for any T, Σj=1m(∫t=0T|hij(t)|dt)≦∥H2(s)∥custom character1∥H1custom character1. As T→∞, it follows from (5) that
H(s)1=Hi(s)1=limTj=1m(t=0Thij(t)t)H2(s)1H1(s)1,

and this completes the proof.
embedded image


Consider an interconnected LTI system in FIG. 7, where w1εIRn1, w2εIRn2, M(s) is a stable proper system with n2 inputs and n1 outputs, and Δ(s) is a stable proper system with n1 inputs and n2 outputs.


Theorem 1: (custom character1 Small Gain Theorem) The interconnected system in FIG. 7 is stable if ∥M(s)∥custom character1∥Δ(s)∥custom character1<1.


The proof follows from Theorem 5.6 ([1], page 218), written for custom character1 gain.


Consider a linear time invariant system:

{dot over (x)}(t)=Ax(t)+bu(t),  (7)

where xεIRn, uεIR, bεIRn, AεIRn×n is Hurwitz, and assume that the transfer function (sI−A)−1b is strictly proper and stable. Notice that it can be expressed as:
(sI-A)-1b=n(s)d(s),(8)

where (d(s)=det(sI−A) is a nth order stable polynomial, and n(s) is a n×1 vector with its ith element being a polynomial function:
ni(s)=j=1nnijsj-1(9)


Lemma 3: If (AεIRn×n, bεIRn) is controllable, the matrix N with its ith row jth column entry nij is full rank.


Proof. Controllability of (A, b) for the LTI system in (7) implies that given an initial condition x(t0)=0 and arbitrary xt1εIRn and arbitrary t1, there exists u(τ), τε[t0, t1] such that x(t1)=xt1. If N is not full rank, then there exists a non-zero vector uεIRn, such that uτn(s)=0. Then it follows that for x(t0)=0 one has uτ(τ)x(τ)=0, ∀τ>t0. This contradicts x(t1)=xt1, in which xt1εIRn is assumed to be an arbitrary point. Therefore, N must be full rank, and the proof is complete.


Lemma 4: If (A, b) is controllable and (sI−A)−1b is strictly proper and stable, there exists cεIRn such that the transfer function cτ(sI−A)−1b is minimum phase with relative degree one, i.e. all its zeros are located in the left half plane, and its denominator is one order larger than its numerator.


Proof. It follows from (8) that
c(sI-A)-1b=cN[sn-11]d(s),(10)

where NεIRn×n is matrix with its ith row jth column entry nij introduced in (9). We choose {overscore (c)}εIRn such that {overscore (c)}τ[sn-1 . . . 1]τ is a stable n−1 order polynomial. Since (A, b) is controllable, it follows from Lemma 3 that N is full rank. Let c=(N−1)τ{overscore (c)}. Then it follows from (10) that
c(sI-A)-1b=c_[sn-11]d(s)

has relative degree 1 with all its zeros in the left half plane.


III. Problem Formulation

Consider the following single-input single-output system dynamics:

{dot over (x)}(t)=Ax(t)+bu(t), x(0)=x0
y(t)=cτx(t),  (11)

where xεIRn is the system state vector (measurable), uεIR is the control signal, b, cεIRn are known constant vectors. A is an unknown n×n matrix, yεIR is the regulated output.


The control objective is to design an adaptive controller to ensure that y(t) tracks a given bounded continuous reference signal r(t) both is transient and steady state, while all other error signals remain bounded. More rigorously, the control objective can be stated as

y(s)≈D(s)r(s),  (12)

where y(s), r(s) are Laplace transformations of y(t), r(t) respectively, and D(s) is a strictly proper stable LTI system that specifies the desired transient and steady state performance.


Following the convention, we introduce the following matching assumption:


Assumption 1: There exist a Hurwitz matrix AmεIRn×n and a vector of ideal parameters θεIRn such that (Am, b) is controllable and Am−A=bθτ. We further assume the unknown parameter θ belongs to a given compact convex set θεΩ.


In the next section, we present two equivalent control architectures that can guarantee the steady state tracking of the bounded reference input r(t). We further use one of those to develop a novel adaptive control architecture with guaranteed transient performance.


IV. MRAC and Companion Model Adaptive Controller

A. Model Reference Adaptive Controller


Let

{dot over (x)}m(t)=Amxm(t)+bkgr(t), xm(0)=x0
ym(t)=cτxm(t)  (13)

be the state space representation of the desired transfer function D(s), where xmεIRn, Am is an n×n matrix kg is a design gain. Usually Am is chosen such that the triple (Am, b, c) approximates D(s) so that ym(s)≈D(s)r(s) with comparable transient and steady steady specifications, subject to the matching condition in Assumption 1.


Theorem 2: [MRAC] The following direct adaptive feedback/feedforward controller

uMRAC(t)={circumflex over (θ)}τ(t)x(t)+kgr(t),  (14)
{circumflex over ({dot over (θ)})}(t)=ΓProj({circumflex over (θ)}(t), x((t)eτ(t)Pb), {circumflex over (θ)}(0)={circumflex over (θ)}0,  (15)

in which {circumflex over (θ)}(t)εIRn are the adaptive parameters, Proj(•,•) denotes the projection operator, e(t)=xm(t)−x(t) is the tracking error, ΓεIRn×n is a positive definite matrix of adaptation gains, and P=Pτ>0 be the solution of the algebraic equation AmτP+PAm=−Q for arbitrary Q>0, ensures that
limte(t)=0.

A proof can be found in [3]. Indeed, the tracking error dynamics with the control law (14), (15) can be written as:

{dot over (e)}(t)=Ame(t)−b{tilde over (θ)}τ(t)x(t), e(0)=0, {tilde over (θ)}(t)custom character{circumflex over (θ)}(t)−θ.(16)

Using standard Lyapunov arguments and Barbalat's lemma, one can prove that
limte(t)=0.

B. Companion Model Adaptive Controller


Theorem 3: [CMAC] Given a bounded reference input signal r(t) of interest to track, the following direct adaptive feedback/feedforward controller

uCMAC(t)={circumflex over (θ)}τ(t)x(t)+kgr(t).  (17)
{circumflex over ({dot over (θ)})}(t)=ΓProj(x(t){tilde over (x)}τ(t)Pb,{circumflex over (θ)}(t)),{circumflex over (θ)}(0)={circumflex over (θ)}0,  (18)

in which {circumflex over (θ)}(t)εIRn are the adaptive parameters, {tilde over (x)}(t)={circumflex over (x)}(t)−x(t) is the tracking error between system dynamics in (11) and the following companion system

{circumflex over ({dot over (x)})}(t)=Am{circumflex over (x)}(t)+b(u(t)−{circumflex over (θ)}τ(t)x(t)),{circumflex over (x)}(0)=x0
ŷ(t)=cτ{circumflex over (x)}(t),  (19)

ensures that
limtx~(t)=0.

The proof is straightforward. Indeed, subject to Assumption 1, the system dynamics in (11) can be rewritten as:

{dot over (x)}(t)=Amx(t)+b(u(t)−θτx(t)), x(0)=x0
y(t)=cτx(t).  (20)

Notice that the companion model in (19) shares the same structure with (20), while the control law in (17), (18) reduces the the closed loop dynamics of the companion model to the desired reference model in (13):

{circumflex over ({dot over (x)})}(t)=Am{circumflex over (x)}(t)+bkgr(t), {circumflex over (x)}(0)=x0.  (21)

We also notice that the closed-loop tracking error dynamics are the same as in (16):

{tilde over ({dot over (x)})}(t)=Am{tilde over (x)}(t)−b{tilde over (θ)}τ(t)x(t), {tilde over (x)}(0)=0.  (22)

Since the closed-loop companion model in (21) is bounded, from standard Lyapunov arguments and Barbalat's lemma it follows that
limtx~(t)=0.

Thus, the companion model adaptive control architecture is equivalent to MRAC. The following remark is in order.


Remark 1: The matching assumption implies that the ideal tracking controller is given by the following linear relationship
uideal(t)=θx(t)+kgr(t),where(23)kg=-1cAm-1b.(24)

The choice of kg in (24) ensures that for constant r one has
limty(t)=r

in both architectures.


C. Bounded Tracking Error Signal


For both architectures MRAC and CMAC, one can prove that the tracking error can be rendered arbitrarily small by increasing the adaptive gain. The main result is given by the following lemma.


Lemma 5: Let Γ=ΓcII, where ΓcεIR+, and II is the identity matrix. For the system in (20)
x(t)θ_maxλmin(P)Γc,θ_max=ΔmaxθΩi=1n4θi2,t0,(25)

and λmin(P) is the minimum eigenvalue of P.


Proof. The candidate Lyapunov function, which can be used to prove asymptotic convergence of tracking error to zero in Theorems 2 and 3, is given by V({tilde over (x)}(t), {tilde over (θ)}(t))={tilde over (x)}τ(t)P{tilde over (x)}(t)+{tilde over (θ)}τ(t)Γ−1{tilde over (θ)}(t). The following upper bound is straight-forward to derive: {tilde over (x)}τ(t)P{tilde over (x)}(t)≦V(t)≦V(0), ∀t≧0. The projection algorithm ensures that {circumflex over (θ)}(t)εΩ, ∀t≧0, and therefore
maxt0θ~(t)Γ-1θ~(t)θ_maxΓc,t0,(26)

where {overscore (θ)}max is defined in (25). Since {tilde over (x)}(0)=0, then V(0)={tilde over (θ)}τ(0)Γ−1{tilde over (θ)}(0), which leads to
x~(t)Px~(t)θmaxΓc,

t≧0. Since λmin(P)∥{tilde over (x)}∥2≦{tilde over (x)}τ(t)P{tilde over (x)}(t), then
x~(t)θ_maxλmin(P)Γc.

D. Transient Performance


Theorems 2 and 3 state that the tracking error goes to zero asymptotically as t→∞. Lemma 5 states that the tracking error can be reduced by increasing the adaptation gain Γc. The following simulations demonstrate that increasing the adaptation gain Γc indeed leads to better transient tracking, but results in unacceptable high-frequency oscillations in the control signal. For simulation purposes, the following system parameters have been selected:
A=[01-53.1],Am=[01-1-1.4],b=[01],c=[10],θ=[4-4.5].

The choice of Γc=0.04 and Q=I leads to desired tracking performance for the reference input r=100, FIGS. 2(a), 2(b). FIGS. 2(c) and 2(d) demonstrate that increasing the adaptive gain improves the transient tracking at the price of high frequency oscillations in the control signal.


FIGS. 3(a) and 3(b) plot the response of the adaptive controller to reference input r=400, without retuning of the adaptive controller. The response to reference input r=25 without retuning the control parameters results in slow convergence, FIGS. 4(a) and 4(b).


These simulations imply two important messages: a) increasing the adaptation gain leads to improved transient tracking performance at the price of high-frequency oscillations in the control signal, b) every change in the reference input implies that retuning of adaptive controller needs to be done to recover the transient tracking performance. Similar deterioration in the transient tracking performance can be observed if one changes the unknown parameters in the system or the initial conditions. Otherwise saying, there is no systematic way of selecting design parameters that would yield the desired transient performance for all possible changes in the system dynamics. On the other hand, the bandwidth limitations of mechanical actuators render implementation of high-frequency control signals
embedded image


overly challenging. Even if implemented, high-frequency control signal can easily excite the high-frequency dynamics of the system, omitted in the modeling, and lead to destabilization.
embedded image


V. 1 Adaptive Controller

In this section, we develop a novel adaptive control architecture that permits complete transient characterization for both system input and output signals. Towards that end, notice that using the matching condition in Assumption 1 the dynamics in (11) can be rewritten as in (20):

{dot over (x)}(t)=Amx(t)−τx(t)+bu(t), x(0)=x0
y(t)=cτx(t).  (27)

The following control structure

u(t)=u1(t)+u2(t), u1(t)=−Kτx(t),  (28)

where u2(t) is the adaptive controller to be determined later, while K is a nominal design gain and can be set to zero, leads to the following partially closed-loop dynamics:

{dot over (x)}(t)=Aox(t)−τx(t)+bu2(t), x(0)=x0
y(t)=cτx(t).  (29)

The choice of K needs to ensure that Ao=Am−bKτ is Hurwitz or, equivalently, that

Ho(s)=(sI−Ao)−1b  (30)

is stable. One obvious choice is K=0. For the linearly parameterized system in (29), we consider the following companion model

{circumflex over ({dot over (x)})}(t)=Ao{circumflex over (x)}(t)+b(u2(t)−{circumflex over (θ)}τ(t)x(t)), {circumflex over (x)}(0)=x0
ŷ(t)=cτ{circumflex over (x)}(t)  (31)

along with the adaptive law for {circumflex over (θ)}(t):

{circumflex over ({dot over (θ)})}(t)=ΓProj(x(t){tilde over (x)}τ(t)Pob,{circumflex over (θ)}(t)), {circumflex over (θ)}(0)={circumflex over (θ)}0,  (32)

where {tilde over (x)}(t)={circumflex over (x)}(t)−x(t) is the tracking error, ΓεIRn×ncIn×n is the matrix of adaptation gains, and Po is the solution of the algebraic equation AoτPo+PoAo=−Qo, Qo>0.


Letting

{overscore (r)}(t)={circumflex over (θ)}τ(t)x(t),  (33)

the companion model in (31) can be viewed as a low-pass system with u(t) being the control signal, {overscore (r)}(t) being a time-varying disturbance, which is not prevented from having high-frequency oscillations. Instead of (17), we consider the following control design for (31):

u2(s)=C(s)({overscore (r)}(s)+kgr(s)),  (34)

where u2(s), {overscore (r)}(s), r(s) are the Laplace transformations of u2(t), {overscore (r)}(t), r(t), respectively, C(s) is a stable and strictly proper system with low-pass gain C(0)=1, and kg is
kg=lims01cHo(s)=1cHo(0).(35)

The complete custom character1 adaptive controller consists of (28), (31), (32), (34), and closed-loop system with it is illustrated in FIG. 11.


Consider the closed-loop companion model in (31) with the control signal defined in (34). It can be viewed as an LTI system with two inputs r(t) and
embedded image


{overscore (r)}(t):

{circumflex over (x)}(s)={overscore (G)}(s){overscore (r)}(s)+G(s)r(s)  (36)
{overscore (G)}(s)=Ho(s)(C(s)−1)  (37)
G(s)=kgHo(s)C(s),  (38)

where {circumflex over (x)}(s), {overscore (r)}(s) are the Laplace transformations of the signals {circumflex over (x)}(t), {overscore (r)}(t), respectively. We note that {overscore (r)}(t) is related to {circumflex over (x)}(t), u(t) and r(t) via nonlinear relationships.


Remark 2: Since both Ho(s) and C(s) are strictly proper stable systems, one can check easily that {overscore (G)}(s) and G(s) are strictly proper stable systems, even though that 1−C(s) is proper.


Let
θmax=maxθΩi=1nθi,(39)

where θi is the ith element of θ. Ω is the compact set, where the unknown parameter lies. We now give the custom character1 performance requirement that ensures stability of the entire system and desired transient performance, as discussed later in Section VI.



custom character
1-gain requirement: Design K and C(s) to satisfy

{overscore (G)}(s)∥custom character1θmax<1.  (40)


VI. Analysis of 1 Adaptive Controller

A. Stability and Asymptotic Convergence


Consider the following Lyapunov function candidate:

V({circumflex over (x)}(t), {tilde over (θ)}(t))={tilde over (x)}τ(t)Po{tilde over (x)}(t)+{circumflex over (θ)}τ(t−1{tilde over (θ)}(t),  (41)

where Po and Γ are introduced in (32). It follows from (29) and (31) that

{circumflex over ({dot over (x)})}(t)=Ao{tilde over (x)}(t)−b{tilde over (θ)}τ(t)x(t), {tilde over (x)}(0)=0.  (42)

Hence, it is straightforward to verify from (32) that

{dot over (V)}(t)≦−{tilde over (x)}τ(t)Qo{tilde over (x)}(t)≦0.  (43)

Notice that the result in (43) is independent of u2(t), and, hence, Lemma 5 also holds for the custom character1 adaptive controller along with its adaptive law in (32). However, one cannot deduce stability from it. One needs to prove in addition that with the custom character1 adaptive controller the state of the companion model will remain bounded. Boundedness of the system state then will follow.


Theorem 4: Given the system in (27) and the custom character1 adaptive controller defined via (28), (31), (32), (34) subject to (40), the tracking error {tilde over (x)}(t) converges to zero asymptotically:
limtx(t)=0.(44)

Proof. Let λmin(Po) be the minimum eigenvalue of Po. From (41) and (43) it follows that

λmin(Po)∥{tilde over (x)}(t)∥2≦{tilde over (x)}τ(t)Po{tilde over (x)}(t)≦V(t)≦V(0),

implying that
x~(t)2V(0)λmin(Po),t0.(45)

From Definition 1,
x~=maxi=1,n,t0x~i(t).

The relationship in (45) ensures that
maxi=1,,n,t0x~i(t)V(0)λmin(Po),

and therefore for all t>0 one has
x~tV(0)λmin(Po).

Using the triangular relationship for norms implies that
x^t=xt𝔷V(0)λmin(Po).(46)

The projection algorithm in (15) ensures that {circumflex over (θ)}(t)εΩ, ∀t≧0. The definition of {overscore (r)}(t) in (33) implies that ∥{overscore (r)}tcustom character≦θmax∥xt. Substituting for ∥xtcustom character from (46) leads to the following
r_t𝔷θmax(x^t+V(0)λmin(Po)).(47)

It follows from Lemma 1 that ∥{circumflex over (x)}tcustom character≦∥{overscore (G)}(s)∥custom character1∥{overscore (r)}tcustom character+∥G(s)∥custom character1∥rtcustom character, which along with (47) gives the following upper bound
x^tG_(s)𝔷1θmax(x^t+V(0)λmin(Po))+G(s)𝔷1rt.Let(48)λ=G_(s)𝔷1θmax.(49)

From (40) it follows that λ<1. The relationship in (48) can be written as
(1-λ)x^tλV(0)λmin(Po)+G(s)𝔷1rt;

and hence
x^tλV(0)λmin(Po)+G(s)𝔷1rt1-λ.(50)

Since V(0), λmin(Po), ∥G(s)∥custom character, λ are all finite and λ<1, the relationship in (50) implies that ∥{circumflex over (x)}tcustom character is finite for any t>0, and hence {circumflex over (x)}(t) is bounded. The relationship in (46) implies that ∥xtcustom character is also finite for all t>0, and therefore
embedded image

x(t) is bounded. The adaptive law in (32) ensures that the estimates {circumflex over (θ)}(t) are also bounded. Hence, it can be checked easily from (22) that {tilde over ({dot over (x)})}(t) is bounded, and it follows from Barbalat's lemma that
limtx~(t)=0.

B. Reference System


In this section we characterize the reference system that the custom character1 adaptive controller in (28), (31), (32), (34) tracks both in transient and steady state, and this tracking is valid for system's both input and output signals. Towards that end, consider the following ideal version of the adaptive controller in (28), (34):

uref(s)=C(s)(kgr(s)+θτxref(s))−Kτxref(s),  (51)

where xref(s) is used to denote the Laplace transformation of the state xref(t) of the closed-loop system. The closed-loop system (20) with the controller (51) is given in FIG. 12.


Remark 3: Notice that when C(s)=1 and K=0, one recovers the reference model of MRAC, and the controller in (51) reduces to the one in (23). If C(s)≠1 and K≠0, then the control law in (51) changes the bandwidth of uideal(t)=θτx(t)+kgr(t) in (23).


The control law in (51) leads to the following closed-loop dynamics:

xref(s)=Ho(s)(kgC(s)r(s)+(C(s)−1)θτxref(s))
yref(s)=cτxref(s),  (52)

which can be explicitly solved for xref(s):

xref(s)=(I−(C(s)−1)Ho(sτ)−1Ho(s)kgC(s)r(s).

Hence, it follows from (37) and (38) that

xref(s)=(I−{overscore (G)}(sτ)−1G(s)r(s).  (53)

Lemma 6: If ∥{overscore (G)}(s)∥custom character1θmax<1, then

(i) (I−{overscore (G)}(sτ)−1 is stable;
(ii) (I−{overscore (G)}(sτ)−1G(s) is stable.  (54)

Proof. It follows from (1) that
G_(s)θ𝔷1=maxi=1,,n(G_i(s)𝔷1(j=1nθj)),

where {overscore (G)}i(s) is the ith element of G(s), and θj is the jth element of θ. From (39) we have Σj=1nj|≦θmax, and hence
G_(s)θ1maxi=1,n(G_i(s)1)θmax=G_(s)1θmax,θεΩ.(55)

∥{overscore (G)}(s)θτcustom character1<1, Thus, Theorem 1 ensures that the LTI system (I−{overscore (G)}(s)θτ)−1 is stable. Since G(s) is stable, then it follows from Remark 2 that (I−{overscore (G)}(s)θτ)−1G(s) is stable.


C. System Response and Control Signal of the custom character1 Adaptive Controller


Letting

r1(t)={tilde over (θ)}τ(t)x(t),  (56)

we notice that {overscore (r)}(t) in (33) can be rewritten as {overscore (r)}(t)=θτ({circumflex over (x)}(t)−{tilde over (x)}(t))+r1(t). Hence, the companion model in (36) can be rewritten as {circumflex over (x)}(s)={tilde over (G)}(s)(θτ{circumflex over (x)}(s)−θτ{tilde over (x)}(s)+r1(s))+G(s)r(s), where r1(s) is the Laplace transformation of r1(t) defined in (56), and further put into the form:

{circumflex over (x)}(s)=(I−{overscore (G)}(sτ)−1(−{overscore (G)}(sτ{tilde over (x)}(s)+{overscore (G)}(s)r1(s)+G(s)r(s)).  (57)

It follows from (42) and (56) that {tilde over ({dot over (x)})}(t)=Ao{tilde over (x)}(t)−br1(t), and hence

{tilde over (x)}(s)=−Ho(s)r1(s).  (58)

Using the expression of {overscore (G)}(s) from (37), the state of the companion model can be presented as

{circumflex over (x)}(s)=(I−{overscore (G)}(sτ)−1(−{overscore (G)}(sτ{tilde over (x)}(s)−(C(s)−1){tilde over (x)}(s)+G(s)r(s)),

which can be further put into the form:

{circumflex over (x)}(s)=(I−{overscore (G)}(sτ)−1G(s)r(s)+(I−{overscore (G)}(sτ)−1(−{overscore (G)}(sτ{tilde over (x)}(s)−(C(s))−1){tilde over (x)}(s)).

Using xref(s) from (53) and recalling the definition of {tilde over (x)}(s)={circumflex over (x)}(s)−x(s), one arrives at

x(s)=xref(s)−(I+(I−{overscore (G)}(sτ)−1({overscore (G)}(sτ+(C(s)−1)I)){tilde over (x)}(s).  (59)

The expressions in (28), (34) and (51) lead to the following expression of the control signal

u(s)=uref(s)+C(s)r1(s)+(C(sτ−Kτ)(x(s)−xr(s)).  (60)

D. Asymptotic Performance and Steady State Error


Theorem 5: Given the system in (27) and the custom character1 adaptive controller defined via (28), (31), (32), (34) subject to (40), we have:
limtx(t)-xref(t)=0,(61)limtu(t)-uref(t)=0.(62)

Proof. Let

r2(s)=(I+(I−{overscore (G)}(sτ)−1({overscore (G)}(sτ+(C(s)−1)I)){tilde over (x)}(s).  (63)

It follows from (59) that

r2(t)=xref(t)−x(t).  (64)

The signal r2(t) can be viewed as the response of the LTI system

H2(s)=I+(I−{overscore (G)}(sτ)−1({overscore (G)}(sτ+(C(s)−1)I)  (65)

to the bounded error signal {tilde over (x)}(t). It follows from (54) and Remark 2 that (I−{overscore (G)}(s)θτ)−1, {overscore (G)}(s), C(s) are stable and, therefore, H2(s) is stable. Hence, from (44) we have
limtr2(t)=0.


Let

r3(s)=C(s)r1(s)+(C(sτ−Kτ)(x(s)−xr(s)).  (66)

It follows from (60) that

r3(t)=u(t)−uref(t).  (67)

Since the projection operator ensures that {tilde over (θ)}(t) is bounded, it follows from (42) and (44) that
limtr1(t)=0.

Since C(s) is a stable proper system, it follows from (61) that
limtr3(t)=0.


Lemma 7: Given the system in (27) and the custom character1 adaptive controller defined via (28), (31), (32), (34) subject to (40), if r(t) is constant, then
limty(t)=r.

Proof. Since

yref(t)=cτxref(t),  (68)

it follows from (61) that
limt(y(t)-yref(t))=0.(69)

From (53) it follows that yref(s)=cτ(I−{overscore (G)}(s)θτ)−1G(s)r(s). The end value theorem ensures
limtyref(t)=lims0c(I-G_(s)θ)-1G(s)r=cHo(0)C(0)kgr.(70)

Definition of kg in (35) leads to
limty(t)=r.


In addition to the constant reference input signal r, we need to characterize the closed-loop system response with the custom character1 controller to a time varying input r(t). This is analyzed in the following sections.


E. Transient Performance


We note that (Am−bKτ, b) is the state space realization of Ho(s). Since (Am, b) is controllable, it can be proved easily that (Am−bKτ, b) is also controllable. It follows from Lemma 4 that there exists coεIRn such that
coHo(s)=Nn(s)Nd(s),(71)

where the order of Nd(s) is one more than the order of Nn(s), and both Nn(s) and Nd(s) are stable polynomials.


Theorem 6: Given the system in (27) and the custom character1 adaptive controller defined via (28), (31), (32), (34) subject to (40), we have:
x-xrefγ1Γc,(72)y-yrefc1γ1Γc,(73)u-urefγ2Γc,(74)

where ∥cτcustom character1 is the custom character1 gain of cτ and
γ1=H2(s)1θ_maxλmax(Po),(75)γ2=C(s)1coHo(s)co1θ_maxλmax(Po)+C(s)θ-K1γ1.(76)


Proof. It follows from (63), (65) and Lemma 1 that ∥r2custom character≦∥H2(s)∥custom character1∥{tilde over (x)}∥custom character, while Lemma 5 implies that
x~θ_maxλmax(Po)Γc.(77)

Therefore,
r2𝔷H2(s)𝔷1θ_maxλmax(Po)Γc,

which leads to (72). The upper bound in (73) follows from (72) and Lemma 2 directly. From (58) we have
r3(s)=C(s)1coTHo(s)coHo(s)r1(s)+(C(s)θ-K)(x(s)-xr(s))=-C(s)1coHo(s)cox~(s)+(C(s)θ-K)(x(s)-xr(s)),

where co is introduced in (71). It follows from (71) that
C(s)1coHo(s)=C(s)Nd(s)Nn(s),

where Nd(s), in Nn(s) are stable polynomials and the order of Nn(s) is one less than the order of Nd(s). Since C(s) is stable and strictly proper, the complete system
C(s)1coHo(s)

is proper and stable, which implies that its custom character1 gain exists and is finite. Hence, we have
r3𝔷C(s)1coHo(s)co𝔷1x~𝔷+C(s)θ-K𝔷1x-xr𝔷.

Lemma 5 leads to the upper bound in (74):
r3𝔷C(s)1coHo(s)co𝔷1θ_maxλmax(Po)Γc+C(s)θ-K𝔷1x-xr𝔷.


Corollary 2: Given the system in (27) and the custom character1 adaptive controller defined via (28), (31), (32), (34) subject to (40), we have:
limΓc->(x(t)-xref(t))=0,t0,(78)limΓc->(y(t)-yref(t))=0,t0,(79)limΓc->(u(t)-uref(t))=0,t0.(80)


Corollary 2 states that x(t), y(t) and u(t) follow xref(t), yref(t) and uref(t) not only asymptotically but also during the transient, provided that the adaptive gain is selected sufficiently large. Thus, the control objective is reduced to designing K and C(s) to ensure that the reference LTI system has the desired response D(s).


Remark 4: Notice that if we set C(s)=1, then the custom character1 adaptive controller degenerates into a CMAC type, which is equivalent to MRAC. In that case
C(s)1coHo(s)co𝔷1

cannot be finite, since Ho(s) is strictly proper. Therefore, from (76) it follows that γ2→∞, and hence for the control signal in CMAC or MRAC one can not reduce the bound in (74) by increasing the adaptive gain.


VII. Design of the 1 Adaptive Controller

We proved that the error between the state and the control signal of the closed-loop system with custom character1 adaptive controller in (27), (28), (31), (32), (34) (FIG. 11) and the state and the control signal of the closed-loop reference system in (51), (53) (FIG. 12) can be rendered arbitrarily small by choosing large adaptive gain. Therefore, the control objective is reduced to determining K and C(s) to ensure that the reference system in (51), (53) (FIG. 12) has the desired response D(s) from r(t) to yref(t). Notice that the reference system in FIG. 12 depends upon the unknown parameter θ.


Consider the following signals:

ydes(s)=cτG(s)r(s)=C(s)kgcτHo(s)r(s),  (81)
udes(s)=kgC(s)(1+C(sτHo(s)−KτHo(s))r(s).  (82)

We note that udes(t) depends on the unknown parameter θ, while ydes(t) does not.


Lemma 8: For the LTI system in FIG. 12, subject to (40), the following upper bounds hold:
yref-ydes𝔷λ1-λc𝔷1G(s)𝔷1r𝔷,(83)yref-ydes𝔷11-λc𝔷1h3𝔷,(84)uref-udes𝔷λ1-λC(s)θ-K𝔷1G(s)𝔷1r𝔷,(85)uref-udes𝔷11-λC(s)θ-K𝔷1h3𝔷,(86)

where λ is defined in (49), and h3(t) is the inverse Laplace transformation of

H3(s)=(C(s)−1)C(s)r(s)kgHo(sτHo(s).  (87)

Proof. It follows from (52) and (53) that yref(s)=cτ(I−{overscore (G)}(s)θτ)−1G(s)r(s). Following Lemma 6, the condition in (40) ensures the stability of the reference LTI system. Since (I−{overscore (G)}(s)θτ)−1 is stable, then one can expand it into convergent series and further write
yref(s)=c(I+i=1(G_(s)θ)i)G(s)r(s)=ydes(s)+c(i=1(G_(s)θ)i)G(s)r(s).Letr4(s)=c(i=1(G_(s)θ)i)G(s)r(s).Then(88)r4(t)=yref(t)-ydes(t),t0.(89)

The relationship in (55) implies that ∥{overscore (G)}(s)θτcustom character1≦λ, and it follows from Lemma 2 that
r4(i=1λi)c1G1r=λ1-λc1G1r.(90)

The relationship in (88) can be equivalently written as
yref(s)=ydes(s)+c(i=1(G_(s)θ)i-1)G_(s)θG(s)r(s),

which along with (37), (38) and (87) leads to
yref(s)=ydes(s)+c(i=1(G_(s)θ)i-1)(C(s)-1)C(s)r(s)kgHo(s)θHo(s)=ydes(s)+c(i=1(G_(s)θ)i-1)H3(s).

Lemma 1 immediately implies that ∥r4custom character≦(Σi=1λi-1)∥cτcustom character1∥h3custom character. Comparing udes(s) in (82) to uref(s) in (51) it follows that udes(s) can be written as udes(s)=kgC(s)r(s)+(C(s)θτ−Kτ)xdes(s), where xdes(s)=C(s)kgHo(s)r(s). Therefore uref(s)−udes(s)=(C(s)θτ−Kτ)(xref(s)−xdes(s)). Hence, it follows from Lemma 1 that ∥uref−udescustom character≦∥C(s)θτ−Kτcustom character1∥xref−xdescustom character. Using the same steps as for ∥yref−ydescustom character, we have
xref-xdesλ1-λG(s)1r,xref-xdesλ1-λh3,

which leads to the upper bounds in (85) and (86).


Thus, the problem is reduced to finding a strictly proper stable C(s) to ensure that

(i) λ<1 or ∥h3custom character are sufficiently small,  (91)
(ii) ydes(s)≈D(s)r(s),  (92)

where D(s) is the desired LTI system introduced in (12). Then, Theorem 6 and Lemma 8 will imply that the output y(t) of the system in (27) and the custom character1 adaptive control signal u(t) will follow ydes(t) and udes(t) both in transient and steady state with quantifiable bounds, given in (73), (74) and (83)-(86).


Notice that λ<1 is required for stability. From (81)-(86), it follows that for achieving ydes(s)≈D(s)r(s) it is desirable to ensure that λ or ∥h3custom character are sufficiently small and, in addition, C(s)cτHo(s)≈D(s). We notice that these requirements are not in conflict with each other. So, using Lemma 2, one can consider the following conservative upper bound

λ=∥{overscore (G)}(s)∥custom character1θmax=∥Ho(s)(C(s)−1∥custom character1θmax≦∥Ho(s)∥custom character1∥C(s)−1∥custom character1θmax.  (93)

Thus, minimization of λ can be achieved from two different perspectives: i) fix C(s) and minimize ∥Ho(s)∥custom character1, ii) fix Ho(s) and minimize the custom character1-gain of one of the cascaded systems ∥Ho(s)(C(s)−1)∥custom character1, ∥(C(s)−1)r(s)∥custom character1 or ∥C(s)(C(s)−1)∥custom character1 via the choice of C(s).


i) High-gain design. Set C(s)=D(s). Then minimization of ∥Ho(s)∥custom character1 can be achieved via high-gain feedback by choosing K sufficiently large. However, minimized ∥Ho(s)∥custom character1 via large K leads to large poles of Ho(s), which is typical for high-gain design methods. Since C(s) is a strictly proper system containing the dominant poles of the closed-loop system in kgcτHo(s)C(s) and kgcτHo(0)=1, we have kgcτHo(s)C(s)≈C(s)=D(s). Hence, the system response will be yref(s)≈D(s)r(s). We note that with large feedback K, the performance of custom character1 adaptive controller degenerates into a high-gain type. The shortcoming of this design is that the high gain feedback K leads to a reduced phase margin and consequently affects robustness.


ii) Design without linear feedback. As in MRAC, assume that we can select Am to ensure
kgcHo(s)D(s).Let(94)C(s)=ws+w.(95)


Lemma 9: For any single input n-output strictly proper stable system Ho(s) the following is true:
limw->(C(s)-1)Ho(s)1=0.

Proof. It follows from (95) that
(C(s)-1)Ho(s)=-ss+wHo(s)=-1s+wsHo(s).

Since Ho(s) is strictly proper and stable, sHo(s) is stable and has relative degree ≧0, and hence ∥sHo(s)∥custom character1 is finite. Since
-1s+w1=1w,

it follows from (2) that
(C(s)-1)Ho(s)11wsHo(s)1,

and the proof is complete.


Lemma 9 states that if one chooses kgcτHo(s)r(s)≈D(s), then by increasing the bandwidth of the low-pass system C(s), it is possible to render ∥{overscore (G)}(s)∥custom character1 arbitrarily small. With large ω, the pole −ω due to C(s) is omitted, and Ho(s) is the dominant reference system leading to

yref(s)≈kgcτHo(s)r(s)≈D(s)r(s).

We note that kgcτHo(s) is exactly the reference model of the MRAC design. Therefore this approach is equivalent to mimicking MRAC, and, hence, high-gain feedback can be completely avoided.


However, increasing the bandwidth of C(s) is not the only choice for minimizing ∥{overscore (G)}(s)∥custom character1. Since C(s) is a low-pass filter, its complementary 1−C(s) is a high-pass filter with its cutoff frequency approximating the bandwidth of C(s). Since both Ho(s) and C(s) are strictly proper systems, {overscore (G)}(s)=Ho(s)(C(s)−1) is equivalent to cascading a low-pass system Ho(s) with a high-pass system C(s)−1. If one chooses the cut-off frequency of C(s)−1 larger than the bandwidth of Ho(s), it ensures that {overscore (G)}(s) is a “no-pass” system, and hence its custom character1 gain can be rendered arbitrarily small. This can be achieved via higher order filter design methods. The illustration is given in FIG. 13.


To minimize ∥h3custom character, we note that ∥3custom character can be upperbounded in two ways:

(i) ∥h3custom character∥(C(s)−1)r(s)∥custom character1∥h4custom character,
embedded image

where h4(t) is the inverse Laplace transformation of H4(s)=C(s)kgHo(s)θτHo(s), and

(ii) ∥h3custom character≦∥(C(s)−1)C(s)∥custom character1∥h5custom character,

where h5(t) is the inverse Laplace transformation of H5(s)=r(s)kgHo(s)θτHo(s).


We note that since r(t) is a bounded signal and C(s), Ho(s) are stable proper systems, ∥h4custom character and ∥h5custom character are finite. Therefore, ∥h3custom character can be minimized by minimizing ∥(C(s)−1)r(s)∥custom character1 or ∥(C(s)−1)C(s)∥custom character1. Following the same arguments as above and assuming that r(t) is in low-frequency range, one can choose the cut-off frequency of C(s)−1 to be larger than the bandwidth of the reference signal r(t) to minimize ∥(C(s)−1)r(s)∥custom character1. For minimization of ∥C(s)(C(s)−1)∥custom character1 notice that if C(s) is an ideal low-pass filter, then C(s)(C(s)−1)=0 and hence ∥h3custom character=0. Since an ideal low-pass filter is not physically implementable, one can minimize ∥C(s)(C(s)−1)∥custom character1 via appropriate choice of C(s).


The above presented approaches ensure that C(s)≈1 in the bandwidth of r(s) and Ho(s). Therefore it follows from (81) that ydes(s)=C(s)kgcτHo(s)r(s)≈kgcτHo(s)r(s), which along with (94) yields ydes(s)≈D(s)r(s).


Remark 5: From Corollary 2 and Lemma 8 it follows that the custom character1 adaptive controller can generate a system response to track (81) and (82) both in transient and steady state if we set the adaptive gain large and minimize λ or ∥h3custom character. Notice that udes(t) in (82) depends upon the unknown parameter θ, while ydes(t) in (81) does not. This implies that for different values of θ, the custom character1 adaptive controller will generate different control signals (dependent on θ) to ensure uniform system response (independent of θ). This is natural, since different unknown parameters imply different systems, and to have similar response for different systems the control signals have to be different. Here is the obvious advantage of the custom character1 adaptive controller in a sense that it controls a partially known system as an LTI feedback controller would have done if the unknown parameters were known. Finally, we note that if the term kgC(s)C(s)θτHo(s) is dominated by kgC(s)KτHo(s), then the controller in (82) turns into a robust one, and consequently the custom character1 adaptive controller degenerates into robust design.


Remark 6: It follows from (78) that yref(t), uref(t) approximate the unknown system's response and the custom character1 adaptive control signal, if the latter is implemented with large adaptive gain. It follows from (53) that y(t) approximates the response of the LTI system cτ(I−{overscore (G)}(s)θτ)−1G(s) to r(t), hence its transient performance such as overshoot and settling time can be derived for every value of θ. If we further minimize λ or ∥h3custom character, it follows from Lemma 8 that y(t) approximates the response of the LTI system C(s)cτHo(s). In this case, the custom character1 adaptive controller leads to uniform transient performance of y(t) independent of the value of the unknown parameter θ. It follows from (80) then that the same is true for the custom character1 adaptive control signal u(t). For the resulting custom character1 adaptive control signal one can characterize the transient specifications such as its amplitude and rate change for every θεΩ, using udes(t) for it.


VIII. Discussion

We use a scalar system to compare the performance of custom character1 adaptive and a high-gain controllers. Towards that end, let {dot over (x)}(t)=θx(t)+u(t) where xεIR is the measurable system state, uεIR is the control signal and θεIR is unknown, which belongs to a given compact set [θmin, θmax]. Let u(t)=−kx(t)+kr(t), leading to the following closed-loop system:

{dot over (x)}(t)=(θ−k)x(t)+kr(t).

We need to choose k>θmax to guarantee stability. We note that both the steady state error and transient performance depend on the unknown parameter value θ. By further introducing a proportional-integral controller, one can achieve zero steady state error. If one chooses k>>max{θmax, θmin}, it leads to high-gain system
x(s)=ks-(θ-k)r(s)ks+kr(s).


To apply the custom character1 adaptive controller, let the desired reference system be
Ho(s)=1s+2.

Let u1=−2x, kg=2, leading to
D(s)=2s+2.

Choose C(s) as in (95) with large ωn, and set adaptive gain Γc large. Then it follows from Theorem 6 that
x(s)xref(s)=C(s)kgHo(s)r(s)ωns+ωn2s+22s+2(96)u(s)uref(s)=(-2+θ)xref(s)+2r(s).(97)

The relationship in (96) implies that the control objective is met, while the relationship in (97) states that the custom character1 adaptive controller approximates uref(t), which cancels the unknown θ.


IX. Simulations

Consider the same simulation example from Section IV-D. We give now the complete custom character1 adaptive controller for this system. We set K=0, Γc=40000, and implement the L1 adaptive controller following (28), (31), (32) and (34). First, we give analysis of the custom character1 adaptive controller. It follows from (30) that
Ho(s)=[1s2+1.4s+1ss2+1.4s+1](98)

and hence
ydes(s)=C(s)cTHo(s)r(s)=1s2+1.4s+1C(s)r(s).(99)

Next, we check stability of this custom character1 adaptive controller. It follows from (39) that θmax=20, and ∥{overscore (G)}∥L1 can be calculated numerically. In FIG. 14(a), we plot
λ=G_L1θmax=1s2+1.4s+1ωs+ω1θmax(100)

with respect to ω and compare it to 1. We notice that for ω>30, we have λ<1, and the custom character1 gain requirement for stability is guaranteed. So, we can choose
C(s)=160s+160(101)

to ensure that λ<0.01, which consequently leads to improved performance bounds in (83)-(86). For ω=160, we have λ=∥{overscore (G)}(s)∥custom character1θmax=0.1725<1, so the custom character1-gain requirement in (40) is indeed satisfied.
embedded image


Next, we compute the bound between yref(t) and ydes(t) in (99). It follows from (87) that
h3𝔷maxθΩ(C(s)-1)C(s)kgHo(s)θTHo(s)1r.

For C(s) and Ho(s) in (101) and (98), it can be numerically verified that
maxθΩ(C(s)-1)C(s)kgHo(s)θTHo(s)1=0.0946,(102)

and it follows from (84) that ∥yref−ydescustom character≦0.0946∥r∥custom character. Therefore, we can state that
yref(s)ydes(s)=C(s)cTHo(s)r(s)=1s2+1.4s+1160s+160r(s).

Similarly, it follows from (86) that uref(t) approximates udes(t), i.e.
uref(s)udes(s)=2160s+160(1+160s+160θ[1s2+1.4s+1ss2+1.4s+1])r(s).

With large adaptive gain, it follows from Theorem 6 that y(t)≈yref(t), u(t)≈uref(t), ∀t≧0, and hence
y(s)1s2+1.4s+1160s+160r(s)1s2+1.4s+1r(s)u(s)2160s+160(1+160s+160θ[1s2+1.4s+1ss2+1.4s+1])r(s)2r(s)+θ[1s2+1.4s+1ss2+1.4s+1]r(s),

if one just considers the dominant poles. The simulation results of the custom character1 adaptive controller are shown in FIGS. 15(a)-15(b) for reference inputs r=25, 100, 400, respectively. We note that it leads to scaled control input and system response for scaled reference input, as compared to MRAC in FIGS. 8(a)-10(b). FIG. 16(a)-16(b) show the system response and control signal for reference input r(t)=100 cos(0.2t), without any retuning of the controller.
embedded imageembedded image


Next, we consider a higher order filter with low adaptive gain Γc=400,
C(s)=3w2s+w3(s+w)3.

In FIG. 14(a), we plot
λ=G_L1θmax=1s2+1.4s+13w2s+w3(s+w)31θmax(103)

with respect to ω and compare it to 1. We notice that when ω>25, we have λ<1 and the custom character1-gain requirement in (40) is satisfied. Letting ω=50 leads to λ=0.3984, and therefore ∥yref−ydescustom character≦0.0721∥r∥custom character. Following similar arguments above
y(s)ydes(s)=1s2+1.4s+13w2s+w3(s+w)3r(s)1s2+1.4s+1r(s),(104)

if one just considers the dominant poles. The simulation results of the custom character1 adaptive controller are shown in FIGS. 11(a)-11(b), for reference inputs r=25, 100, 400, respectively. We note that it again leads to scaled control input and system response for scaled reference input, as compared to MRAC in FIGS. 8(a)-10(b). In addition, we notice that this performance is achieved by a much smaller adaptive gain as compared to the design with the first order C(s). FIG. 18(a)-18(b) show the system response and control signal for reference input r(t)=100 cos(0.2t), without any retuning of the controller.
embedded image


Remark 7: The simulations pointed out that with higher order filter C(s) one could use relatively small adaptive gain. While a rigorous relationship between the choice of adaptive gain and the order of filter cannot be derived, an insight into this can be gained from the following analysis. It follows from (27), (28) and (34) that

x(s)=G(s)r(s)+Ho(sτx(s)+Ho(s)C(s){overscore (r)}(s),  (105)

while the companion model in (36) can be rewritten as

{circumflex over (x)}(s)=G(s)r(s)+Ho(s)(C(s)−1){overscore (r)}(s).


We note that {overscore (r)}(t) is divided into two parts. Its low-frequency component C(s){overscore (r)}(s) is what the system in (105) gets, while the complementary high-frequency component (C(s)−1){overscore (r)}(s) goes into the companion model. If the bandwidth of C(s) is large, then it can suppress only the high frequencies in {overscore (r)}(t), which appear only in the presence of large adaptive gain. A properly designed higher order C(s) can be more effective to serve the purpose of filtering with reduced tailing effects, and, hence can generate similar λ with smaller bandwidth. This further implies that similar performance can be achieved with smaller adaptive gain.


Note the following references referred to in the above discussion: [1] is P. Ioannou and J. Sun, Robust Adaptive Control (Prentice Hall, 1996); [2] is H. K. Khalil, Nonlinear Systems (Prentice Hall, Englewood Cliff, N.J., 2002); [3] is J.-J. E. Slotine and W. Li, Applied Nonlinear Control (Prentice Hall, Englewood Cliffs, N.J., 1991).


While the invention has been described in terms of preferred embodiments, those skilled in the art will recognize that the invention can be practiced with modification within the spirit and scope of the appended claims.

Claims
  • 1. A low-pass adaptive/neural controller for a dynamic system, comprising: a reference input for the dynamic system, the dynamic system being described by a dynamic model subject to time-varying unknown parameters and an unknown time-varying disturbance, there being a measured output of the dynamic system; a companion model, described by the dynamic model, adaptive estimates being substituted in the dynamic model for the time-varying unknown parameters and the unknown time-varying disturbance, there being a computed output for the companion model; and means for generating a control signal to be applied to the dynamic system and the companion model so that the measured output tracks the reference input, said generating means having a low-pass filter to attenuate high frequency components in the control signal; wherein said low-pass filter is a stable transfer function and is applied by said generating means so that both the control signal and a tracking error difference between the measured output and the reference input achieve a target stability and desired performance asymptotically within a transient period.
  • 2. The controller of claim 1, wherein an adaptive gain applied by said generating means is made very large in order to regulate the tracking error asymptotically within desired bounds during the transient period.
  • 3. The controller of claim 1, wherein an error signal difference between said measured output and said computed output is used to generate the adaptive estimates.
  • 4. The controller of claim 1, wherein the control signal is generated by applying said stable transfer function to a feedback signal derived from the reference input, said measured output and the adaptive estimates.
  • 5. The controller of claim 4, wherein the low-pass filter is applied to only a part of the feedback signal.
  • 6. The controller of claim 1, wherein the low-pass filter cascaded with the desired reference system has L1 gain, the L1 gain being less than an inverse of an upper bound of a norm of the unknown parameters.
  • 7. A method for adaptively controlling a dynamic system, comprising: defining an error signal between a measured output of the dynamic system and a computed output of a companion model, the dynamic system being described by a dynamic model subject to time-varying unknown parameters and an unknown time-varying disturbance, adaptive estimates for the unknown parameters and the unknown disturbance being substituted in the companion model to produce the computed output; generating a control signal with a low-pass filter to attenuate high frequency components in the control signal; and applying the control signal to the dynamic system and the companion model so that the measured output tracks a reference input, wherein said low-pass filter is a stable transfer function and is applied by said generating means so that both the control signal and a tracking error difference between the measured output and the reference input achieve a target stability and desired performance asymptotically within a transient period.
  • 8. The method of claim 7, wherein an adaptive gain of the control signal is made very large in order to regulate the tracking error asymptotically within desired bounds during the transient period.
  • 9. The method of claim 7, wherein an error signal difference between said measured output and said computed output is used to generate the adaptive estimates.
  • 10. The method of claim 7, wherein the control signal is generated by applying said stable transfer function to a feedback signal derived from the reference input, said measured output and the adaptive estimates.
  • 11. The method of claim 10, wherein the low-pass filter is applied to only a part of the feedback signal.
  • 12. The method of claim 7, wherein an adaptive gain of the control signal is less than an inverse of an upper bound of a norm of the unknown parameters.
Parent Case Info

This invention claims priority from U.S. Provisional Patent Application Ser. No. 60/664,187 filed on Mar. 23, 2005 and entitled Low-pass Adaptive/Neural Controller Design with Improved Transient Performance, which is incorporated herein by reference. The invention was made under partial support from contract numbers F49620-03-1-0443 and FA9550-05-1-0157 with the Air Force Office of Scientific Research, and also partial support by ADVANCE VT Institutional Transformation Research Seed Grant from the National Science Foundation.

Provisional Applications (1)
Number Date Country
60664187 Mar 2005 US