1. Field of the Invention
The present invention generally relates to the control of systems with time-varying unknown parameters, time-varying bounded disturbances, and matched uncertain nonlinearities, and in particular to controllers designed to adapt to parameters that vary in uncertain ways.
2. Background Description
The conventional Model Reference Adaptive Controller (MRAC) was developed to control linear systems in the presence of parametric uncertainties. The development of this architecture has been facilitated by the Lyapunov stability theory that defines sufficient conditions for stable performance, but offers no means for characterizing the system's input/output performance during the transient phase. System uncertainties during the transient phase have led to unpredictable and/or undesirable situations, involving control signals of high frequency or large amplitudes, large transient errors or slow convergence rate of tracking errors, to name a few. Application of adaptive/neural controllers has therefore been largely restricted. Large deviation implies poor transient performance. It could even lead to instability, especially for neural controllers where the signals are required to be inside a compact set where approximation is conducted.
One such situation is bandwidth limitation in the control channel, especially in mechanical actuators. A high frequency control signal is impractical as it can lead to destabilization of the system. Another circumstance where use of conventional adaptive/neural controllers is limited is where the system models are mostly based on low-frequency approximations. A high frequency control signal can easily excite the omitted high frequency dynamics of the system and lead to unpredictable consequences.
The transient performance of adaptive/neural controllers depends on unknown parameters, reference input, and adaptive gain in a nonlinear way. Extensive tuning of adaptive gains and Monte-Carlo runs have been the primary methods for enabling the transition of adaptive control solutions to real world applications. This approach has rendered verification and validation of adaptive controllers overly challenging. Moreover, there is no systematic way of selecting design parameters that would yield the desired transient performance for all possible scenarios.
As compared to the linear systems theory, several important aspects of transient performance analysis seem to be missing in the prior art. First, all the bounds in the prior art are computed for tracking errors only, and not for control signals. Although the latter can be deduced from the former, it is straightforward to verify that the ability to adjust the former may not extend to the latter in the case of nonlinear control laws. Second, since the purpose of adaptive control is to ensure stable performance in the presence of modeling uncertainties, one needs to ensure that the changes in reference input and unknown parameters due to possible faults or unexpected uncertainties do not lead to unacceptable transient deviations or oscillatory control signals, implying that a retuning of adaptive parameters is required. Finally, one needs to ensure that whatever modifications or solutions are suggested for performance improvement of adaptive controllers, they are not achieved via high-gain feedback.
It is therefore an object of the present invention to provide an adaptive/neural controller design with improved transient performance.
Another object of the invention is to provide a systematic way of selecting design parameters that would yield desired transient performance for all possible scenarios.
A further object of the invention is to make it easier to verify and validate adaptive controllers.
We invented a novel 1 adaptive/neural control architecture that permits fast adaptation and yields guaranteed transient response simultaneously for both the system's input and output signals, in addition to providing asymptotic tracking. The main feature of the invention is rapid adaptation with a guaranteed low frequency control signal. The ability to adapt rapidly ensures the desired transient performance for both the system's input and output signals, simultaneously, while a low-pass filter in the feedback loop attenuates the high-frequency components in the control signal.
The 1 adaptive/neural controller can be applied to systems that have time-varying unknown parameters with an arbitrary rate of variation, and also ensures the desired transient performance for both input and output signals of the system. We prove that by increasing the adaptation gain one can achieve arbitrarily close transient and asymptotic tracking for both input and output signals simultaneously.
No matter how configured, in a high-gain feedback controller or MRAC large gain or adaptive gain leads to reduced phase or time-delay margins. We demonstrate that increasing the adaptative gain will not hurt the time-delay margin of the closed-loop system with the 1 adaptive control architecture, in contrast to conventional adaptive or feedback schemes.
An aspect of the invention is A low-pass adaptive/neural controller for a dynamic system, comprising a reference input for the dynamic system, the dynamic system being described by a dynamic model subject to time-varying unknown parameters and an unknown time-varying disturbance, there being a measured output of the dynamic system. A companion model, described by the dynamic model, has adaptive estimates substituted in the dynamic model for the time-varying unknown parameters and the unknown time-varying disturbance, there being a computed output for the companion model. The system has means for generating a control signal to be applied to the dynamic system and the companion model so that the measured output tracks the reference input, the generating means having a low-pass filter to attenuate high frequency components in the control signal. In a further aspect of the invention, the low-pass filter is a stable transfer function and is applied by the generating means so that both the control signal and a difference between the measured output and the reference input achieve a target stability within a transient period.
The foregoing and other objects, aspects and advantages will be better understood from the following detailed description of a preferred embodiment of the invention with reference to the drawings, in which:
a is a graph showing the system state vector, the companion model of the invention, and a bounded reference signal;
Υ(t)=sin(πt).
a is a graph showing the system state vector, the companion model of the invention, and a bounded reference signal;
σ(t)=cos(x1(t))+2 sin(10t)+cos(15t).
a is a graph showing the system state vector, the companion model of the invention, and a bounded reference signal;
σ(t)=cos(x1(t))+2 sin(100t)+cos(150t).
a and 8b are graphs of MRAC performance and time histories, respectively, for r=100 and Γc=0.04.
c and 8d are graphs of MRAC performance and time histories, respectively, for r=100 and Γc=0.2.
a and 9b are graphs of MRAC performance and time histories, respectively, for r=400 and Γc=0.04.
a and 10b are graphs of MRAC performance and time histories, respectively, for r=25 and Γc=0.04.
a and 13b are graphs showing the equivalent results of cascading low-pass and high-pass systems.
a and 14b are graphs showing application of the 1 gain stability requirement for differing adaptive gain values.
a and 15b are graphs showing simulation results and time histories, respectively, for an 1 adaptive controller.
a and 16b are graphs showing performance and time history, respectively, for an 1 adaptive controller.
a and 17b are graphs showing performance results and time histories, respectively, for an 1 adaptive controller.
a and 18b are graphs showing performance and time history, respectively, for an 1 adaptive controller.
Problem Formulation
Consider the following system dynamics:
{dot over (x)}(t)=Amx(t)+b(ωu(t)+θτ(t)x(t)+σ(t)),
y(t)=cτx(t), x(0)=x0, Eq.1
where xεIRn is the system state vector (measurable), uεIR is the control signal, yεIR is the regulated output, b,cεIRn are known constant vectors, Am is a known n×n matrix, ωεIR is known, θ(t)εIRn is a vector of time-varying unknown parameters, while σ(t)εIR is a time-varying disturbance. Without loss of generality, we assume that
θ(t)εΘ, |σ(t)≦Δ, t≧0, Eq.2
where Θ is a known compact set and ΔεIR+ is a known (conservative) ∞ bound of σ(t).
The control objective is to design a full-state feedback adaptive controller to ensure that y(t) tracks a given bounded reference signal r(t) both in transient and steady state, while all other error signals remain bounded.
We further assume that θ(t) and σ(t) are continuously differentiable and their derivatives are uniformly bounded:
∥{dot over (θ)}(t)∥2≦dθ<∞, |{dot over (σ)}(t)|≦dσ<∞, ∀t≧0, Eq.3
where
∥•∥2 denotes the 2-norm, while the numbers dθ, dσ can be arbitrarily large.
1 Adaptive Controller
In this section, we develop a novel adaptive control architecture for the system in Eq.1 that permits complete transient characterization for both u(t) and x(t).
The elements of 1 adaptive controller are introduced next:
Companion Model: We consider the following companion model:
{circumflex over ({dot over (x)})}(t)=Am{circumflex over (x)}(t)+b(ωu(t)+{circumflex over (θ)}τ(t)x(t)+{circumflex over (σ)}(t)),
ŷ(t)=cτ{circumflex over (x)}(t), {circumflex over (x)}(0)=x0, Eq.4
which has the same structure as the system in Eq.1. The only difference is that the unknown parameters
θ(t), σ(t)
are replaced by their adaptive estimates
{circumflex over (θ)}(t), {circumflex over (σ)}(t)
that are governed by the following adaptation laws.
Adaptive Laws: Adaptive estimates are given by:
{circumflex over ({dot over (θ)})}(t)=ΓθProj(−x(t){tilde over (x)}τ(t)Pb,{circumflex over (θ)}(t)),{circumflex over (θ)}(0)={circumflex over (θ)}0 Eq.5
{circumflex over ({dot over (σ)})}(t)=ΓσProj(−{tilde over (x)}τ(t)Pb,{circumflex over (σ)}(t)),{circumflex over (σ)}(0)={circumflex over (σ)}0 Eq.6
where
{tilde over (x)}(t)={circumflex over (x)}(t)−x(t)
is the error signal between the state of the system and the companion model,
Γθ=ΓcIn×nεIRn×n,Γσ=Γc
are adaptation gains with
ΓcεIR+,
and P is the solution of the algebraic equation
AmτP+PAm=−Q, Q>0.
Control Law: The control signal is generated by:
And where
kεIR+ is a feedback gain, while C(s) is any strictly proper stable transfer function with low-pass gain C(0)=1. One simple choice is
Stability Requirement
Further, let
where θi(t) is the ith element of θ(t), Θ is the compact set defined in (2). We now state the 1 performance requirement that ensures stability of the entire system and desired transient performance.
1-gain stability requirement: Design C(s) to ensure that
∥G(s)∥
where
G(s)=(sI−Am)−1b(1−C(s)).
The complete 1 adaptive controller consists of Eq.4, Eq.5, Eq.6 and Eq.7 subject to 1-gain stability requirement in Eq.12.
The 1 adaptive controller is illustrated in
In case of constant θ(t), the stability requirement of the 1 adaptive controller can be simplified. For the specific choice of C(s) in Eq.10, the stability requirement of 1 adaptive controller is reduced to
being Hurwitz for all θεΘ.
Closed-Loop Reference System
We now consider the following closed-loop LTI reference system with its control signal and system response being defined as follows:
where
{overscore (r)}ref(s) is the Laplace transformation of the signal
{overscore (r)}ref(t)=−θτ(t)xref(t)−σ(t)+kgr(t),
And kg is introduced in Eq.9.
Tracking Performance
Let
H(s)=(sI−Am)−1b. Eq.17
It is proved that there exists
coεIRn
such that
where the order of Nd(s) is one more than the order of Nn(s), and both Nn(s) and Nd(s) are stable polynomials.
Theorem 1: Given the system in (1) and the 1 adaptive controller defined via Eq.4, Eq.5, Eq.6 and Eq.7 subject to Eq.12, we have:
∥x−xref∥
∥u−uref∥
where
Corollary 1: Given the system in Eq.1 and the 1 adaptive controller defined via Eq.4, Eq.5, Eq.6 and Eq.7 subject to Eq.12, we have:
Thus, the tracking error between x(t) and xref(t), as well between u(t) and uref(t), is uniformly bounded by a constant inverse proportional to Γc. This implies that during the transient one can achieve arbitrarily close tracking performance for both signals simultaneously by increasing Γc.
Design Guidelines
We note that the control law uref(t) in the closed-loop reference system, which is used in the analysis of ∞ norm bounds, is not implementable since its definition involves the unknown parameters. Theorem 1 ensures that the 1 adaptive controller approximates uref(t) both in transient and steady state. So, it is important to understand how these bounds can be used for ensuring uniform transient response with desired specifications. We notice that the following ideal control signal
Is the one that leads to the desired system response:
{dot over (x)}ref(t)=Amxref(t)+bkgr(t) Eq.26
yref(t)=cτxref(t) Eq.27
by cancelling the uncertainties exactly. In the closed-loop reference system Eq.14-Eq.16, uideal(t) is further low-pass filtered by C(s) in Eq.15 to have a guaranteed low-frequency range. Thus, the reference system in Eq.14-Eq.16 has a different response as compared to Eq.26, Eq.27 with Eq.25. It should be noted that C(s) may be selected to ensure that in case of constant θ the response of xref(t), uref(t), can be made as close as possible to Eq.26 with Eq.25. In case of fast varying θ(t), it is obvious that the bandwidth of the controller needs to be matched correspondingly.
Time-Delay Margin Analysis
We consider the following LTI system with output measurement delay:
where rb(s) is the Laplace transformation of bounded signal rb(t). The block-diagram of the closed-loop system in Eq.28 is shown in
The open-loop transfer function of the system in Eq.28 is:
Ho(s)=C(s)(1+θτ{overscore (H)}(s))/(1−C(s)) Eq.29
whose phase margin P (Ho(s)) can be derived easily from its Bode plot. The time-delay margin of the open-loop transfer function is given by:
T (Ho(s))=P (Ho(s))/ωc Eq.30
where P (Ho(s)) is the phase margin of the open-loop system Ho(s), and ωc is the cross-over frequency of Ho(s).
With regard to the time-delay margin of the closed-loop 1 adaptive controller, we have:
Theorem 2: Given the system in (1) and the 1 adaptive controller defined via Eq.4, Eq.5-Eq.6 and Eq.7 subject to Eq.12, where Γc and Δ are large enough, the closed-loop adaptive system is stable in the presence of time delay τ in its output if τ<T (Ho(s)), where T (Ho(s)) is defined in Eq.30.
Novel Features
This new adaptive controller generates low-pass control signals. It has constructive design technique for ensuring desired bandwidth of the generated control signals. Thus, it can meet the bandwidth limitations of the actuators. It has improved transient performance. Using fast adaptation, it guarantees desired transient performance for both control signal and system response. It enables time-delay margin analysis. It has the ability to ensure convergence of tracking error to zero in the steady state performance. This new architecture permits faster rates of adaptation without generating high-frequency control signals and without destabilizing the system. It has a proof for stable performance, which is a basic requirement for every control system design. We define a new stability criterion which gives a systematic design algorithm for required specifications.
All the discussions above apply to neural adaptive controllers and higher dimensional systems, as well.
Variations
The above described architecture may be varied in several ways without departing from the spirit of the invention.
For example, if the objective is only to get rid of high frequency oscillation in the control signal, the control signal may be filtered without setting a large adaptive gain.
In another variation, the low pass filter may be applied to all or only a part of the signal
{overscore (r)}(t) as in (31),
In this variation the closed-loop reference system must also be modified. However, similar results related to transient performance can be obtained.
In a further variation, other signals can be used for scaled reference input r(t), depending on your tracking objective.
Another variation is to express the companion model and control law in other equivalent forms.
Extensions
Detailed results and proof of the 1 adaptive controller can be found in the attached papers. Based on 1 adaptive controller, 1 neural controller can be established which guarantees transient performance with its details in attached paper. We also note that all these results can be extended into Multiple Input Multiple Output systems easily.
Results Demonstration
As an illustrative example, consider a single-link robot arm which is rotating on a vertical plane. The system dynamics are given by:
where
q(t) and {dot over (q)}(t) are measured angular position and velocity, respectively, u(t) is the input torque, I is the given moment of inertia, M is the unknown mass, L is the unknown length, F(t) is an unknown time-varying friction coefficient, F1(t) is position dependent external torque, and
For implementation of the 1 adaptive controller Eq.4, Eq.5-Eq.6 and Eq.7, we need to verify the 1 stability requirement in Eq.12. Letting
C(s)=ωα/(s+ωα),
we have
We can check easily that for our selection of compact sets in Eq.38, the resulting L=20 in Eq.11. As shown in
Turning now to
σ(t)=sin(πt).
Next, we consider a different disturbance signal:
σ(t)=cos(x1(t))+2 sin(10t)+cos(15t).
The simulation results are shown in
Finally, we consider much higher frequencies in the disturbance:
σ(t)=cos(x1(t))+2 sin(100t)+cos(150t).
The simulation results are shown in
We note that the 1 adaptive controller guarantees smooth and uniform transient performance in the presence of different unknown nonlinearities and time-varying disturbances. The controller frequencies are exactly matched with the frequencies of the disturbance that it is supposed to cancel out. We also notice that the system state vector signal
x1(t)
and the companion model signal
{circumflex over (x)}1(t)
are almost the same in
We will now present an implementation of the invention, providing further detail and adaptations.
As described above, transient performance in the implementation can be characterized both for the system input and output signals. To achieve this, a Companion Model Adaptive Control (CMAC) architecture is introduced and its equivalence to MRAC is shown. The difference between CMAC and MRAC is in definition of the error signal for adaptive laws, which consequently allows for incorporation of a low-pass filter in the feedback loop of CMAC and enables us to enforce the desired transient performance by increasing adaptation gain. For proof of asymptotic stability, the 1 gain of a cascaded system, comprised of this filter and the closed-loop desired transfer function, is required to be less than the inverse of the upper bound on the norm of unknown parameters used in projection based adaptation laws. Thus, with the low-pass filter in the loop, the 1 adaptive controller is guaranteed to stay in the low-frequency range even in the presence of high adaptive gains and large reference inputs. The ideal (non-adaptive) version of this 1 adaptive controller is used along with the main system dynamics to define a closed-loop reference system, which gives an opportunity to estimate performance bounds in terms of ∞ norms for both system's input and output signals as compared to the same signals of this reference system. These bounds immediately imply that the transient performance of the control signal in MRAC cannot be characterized. Design guidelines for selection of the low-pass filter ensure that the closed-loop reference system approximates the desired system response, despite the fact that it depends upon the unknown parameter. Thus, the desired tracking performance is achieved by systematic selection of the low-pass filter, which in its turn enables fast adaptation, as opposed to high-gain designs leading to increased control efforts.
The paper is organized as follows. Section II states some preliminary definitions, and Section III gives the problem formulation. In Section IV, we recall the conventional MRAC design and introduce the Companion Model Adaptive Controller (CMAC), which is a reparameterization of MRAC. In Section V, a new 1 adaptive controller is presented. Stability and tracking results of the 1 adaptive controller are presented in Section VI. Comparison of the performance of 1 adaptive controller, MRAC and the high gain controller are discussed in section VIII. In section IX, simulation results are presented, while Section X concludes the paper.
In this Section, we recall some basic definitions and facts from linear systems theory.
Definition 1: For a signal ξ(t), t≧0, ξεIRn, its truncated ∞ norm and ∞ norm are defined as
where ξi is the ith component of ξ.
Definition 2: The 1 gain of a stable proper single-input single-output system H(s) is defined to be ∥H(s)∥
in which the integration is done along the vertical line x=α>0 in the complex plane.
Proposition: A continuous time LTI system (proper) with impulse response h(t) is stable if and only if ∫0∞|h(τ)|dτ<∞. A proof can be found in [1] (page 81, Theorem 3.3.2).
Definition 3: For a stable proper m input n output system H(s) its 1 gain is defined as
where Hij(s) is the ith row jth column element of H(s).
The next lemma extends the results of Example 5.2. ([2], page 199) to general multiple input multiple output systems.
Lemma 1: For a stable proper multi-input multi-output (MIMO) system H(s) with input r(t)εIRm and output x(t)εIRn, we have
∥xt∥
Proof. Let xi(t) be the ith element of x(t), rj(t) be the jth element of r(t), Hij(s) be the ith row jth element of H(s), and Hij(t) be the impulse response of Hij(s). Then for any t′ε[0, t], we have
From (3) it follows that
and hence ∥xi
for any t≧0. The proof is complete.
Corollary 1: For a stable proper MIMO system H(s), if the input r(t)εIRm is bounded, then the output x(t)εIRn is also bounded as ∥x∥
Lemma 2: For a cascaded system H(s)=H2(s)H1(s), where H1(s) is a stable proper system with m inputs and l outputs and H2(s) is a stable proper system with l inputs and n outputs, we have ∥H(s)∥
Proof. Let y(t)εIRn be the output of H(s)=H1(s)H2(s) in response to input r(t)εIRm. It follows from Lemma 1 that
∥y(t)∥≦∥y∥
for any bounded r(t). Let Hi(s), i=1, . . . , n be the ith row of the system H(s). It follows from (1) that there exists i such that
∥H(s)∥
Let hij(t) be the jth element of the impulse response of the system Hi(s). For any T, let
rj(t)=sgnhij(T−t), tε[0,T], ∀j=1, . . . , m. (6)
It follows from Definition 1 that ∥r∥
Therefore, it follows from (4) that for any T, Σj=1m(∫t=0T|hij(t)|dt)≦∥H2(s)∥
and this completes the proof.
Consider an interconnected LTI system in
Theorem 1: (1 Small Gain Theorem) The interconnected system in
The proof follows from Theorem 5.6 ([1], page 218), written for 1 gain.
Consider a linear time invariant system:
{dot over (x)}(t)=Ax(t)+bu(t), (7)
where xεIRn, uεIR, bεIRn, AεIRn×n is Hurwitz, and assume that the transfer function (sI−A)−1b is strictly proper and stable. Notice that it can be expressed as:
where (d(s)=det(sI−A) is a nth order stable polynomial, and n(s) is a n×1 vector with its ith element being a polynomial function:
Lemma 3: If (AεIRn×n, bεIRn) is controllable, the matrix N with its ith row jth column entry nij is full rank.
Proof. Controllability of (A, b) for the LTI system in (7) implies that given an initial condition x(t0)=0 and arbitrary xt
Lemma 4: If (A, b) is controllable and (sI−A)−1b is strictly proper and stable, there exists cεIRn such that the transfer function cτ(sI−A)−1b is minimum phase with relative degree one, i.e. all its zeros are located in the left half plane, and its denominator is one order larger than its numerator.
Proof. It follows from (8) that
where NεIRn×n is matrix with its ith row jth column entry nij introduced in (9). We choose {overscore (c)}εIRn such that {overscore (c)}τ[sn-1 . . . 1]τ is a stable n−1 order polynomial. Since (A, b) is controllable, it follows from Lemma 3 that N is full rank. Let c=(N−1)τ{overscore (c)}. Then it follows from (10) that
has relative degree 1 with all its zeros in the left half plane.
Consider the following single-input single-output system dynamics:
{dot over (x)}(t)=Ax(t)+bu(t), x(0)=x0
y(t)=cτx(t), (11)
where xεIRn is the system state vector (measurable), uεIR is the control signal, b, cεIRn are known constant vectors. A is an unknown n×n matrix, yεIR is the regulated output.
The control objective is to design an adaptive controller to ensure that y(t) tracks a given bounded continuous reference signal r(t) both is transient and steady state, while all other error signals remain bounded. More rigorously, the control objective can be stated as
y(s)≈D(s)r(s), (12)
where y(s), r(s) are Laplace transformations of y(t), r(t) respectively, and D(s) is a strictly proper stable LTI system that specifies the desired transient and steady state performance.
Following the convention, we introduce the following matching assumption:
Assumption 1: There exist a Hurwitz matrix AmεIRn×n and a vector of ideal parameters θεIRn such that (Am, b) is controllable and Am−A=bθτ. We further assume the unknown parameter θ belongs to a given compact convex set θεΩ.
In the next section, we present two equivalent control architectures that can guarantee the steady state tracking of the bounded reference input r(t). We further use one of those to develop a novel adaptive control architecture with guaranteed transient performance.
A. Model Reference Adaptive Controller
Let
{dot over (x)}m(t)=Amxm(t)+bkgr(t), xm(0)=x0
ym(t)=cτxm(t) (13)
be the state space representation of the desired transfer function D(s), where xmεIRn, Am is an n×n matrix kg is a design gain. Usually Am is chosen such that the triple (Am, b, c) approximates D(s) so that ym(s)≈D(s)r(s) with comparable transient and steady steady specifications, subject to the matching condition in Assumption 1.
Theorem 2: [MRAC] The following direct adaptive feedback/feedforward controller
uMRAC(t)={circumflex over (θ)}τ(t)x(t)+kgr(t), (14)
{circumflex over ({dot over (θ)})}(t)=ΓProj({circumflex over (θ)}(t), x((t)eτ(t)Pb), {circumflex over (θ)}(0)={circumflex over (θ)}0, (15)
in which {circumflex over (θ)}(t)εIRn are the adaptive parameters, Proj(•,•) denotes the projection operator, e(t)=xm(t)−x(t) is the tracking error, ΓεIRn×n is a positive definite matrix of adaptation gains, and P=Pτ>0 be the solution of the algebraic equation AmτP+PAm=−Q for arbitrary Q>0, ensures that
A proof can be found in [3]. Indeed, the tracking error dynamics with the control law (14), (15) can be written as:
{dot over (e)}(t)=Ame(t)−b{tilde over (θ)}τ(t)x(t), e(0)=0, {tilde over (θ)}(t){circumflex over (θ)}(t)−θ.(16)
Using standard Lyapunov arguments and Barbalat's lemma, one can prove that
B. Companion Model Adaptive Controller
Theorem 3: [CMAC] Given a bounded reference input signal r(t) of interest to track, the following direct adaptive feedback/feedforward controller
uCMAC(t)={circumflex over (θ)}τ(t)x(t)+kgr(t). (17)
{circumflex over ({dot over (θ)})}(t)=ΓProj(x(t){tilde over (x)}τ(t)Pb,{circumflex over (θ)}(t)),{circumflex over (θ)}(0)={circumflex over (θ)}0, (18)
in which {circumflex over (θ)}(t)εIRn are the adaptive parameters, {tilde over (x)}(t)={circumflex over (x)}(t)−x(t) is the tracking error between system dynamics in (11) and the following companion system
{circumflex over ({dot over (x)})}(t)=Am{circumflex over (x)}(t)+b(u(t)−{circumflex over (θ)}τ(t)x(t)),{circumflex over (x)}(0)=x0
ŷ(t)=cτ{circumflex over (x)}(t), (19)
ensures that
The proof is straightforward. Indeed, subject to Assumption 1, the system dynamics in (11) can be rewritten as:
{dot over (x)}(t)=Amx(t)+b(u(t)−θτx(t)), x(0)=x0
y(t)=cτx(t). (20)
Notice that the companion model in (19) shares the same structure with (20), while the control law in (17), (18) reduces the the closed loop dynamics of the companion model to the desired reference model in (13):
{circumflex over ({dot over (x)})}(t)=Am{circumflex over (x)}(t)+bkgr(t), {circumflex over (x)}(0)=x0. (21)
We also notice that the closed-loop tracking error dynamics are the same as in (16):
{tilde over ({dot over (x)})}(t)=Am{tilde over (x)}(t)−b{tilde over (θ)}τ(t)x(t), {tilde over (x)}(0)=0. (22)
Since the closed-loop companion model in (21) is bounded, from standard Lyapunov arguments and Barbalat's lemma it follows that
Thus, the companion model adaptive control architecture is equivalent to MRAC. The following remark is in order.
Remark 1: The matching assumption implies that the ideal tracking controller is given by the following linear relationship
The choice of kg in (24) ensures that for constant r one has
in both architectures.
C. Bounded Tracking Error Signal
For both architectures MRAC and CMAC, one can prove that the tracking error can be rendered arbitrarily small by increasing the adaptive gain. The main result is given by the following lemma.
Lemma 5: Let Γ=ΓcII, where ΓcεIR+, and II is the identity matrix. For the system in (20)
and λmin(P) is the minimum eigenvalue of P.
Proof. The candidate Lyapunov function, which can be used to prove asymptotic convergence of tracking error to zero in Theorems 2 and 3, is given by V({tilde over (x)}(t), {tilde over (θ)}(t))={tilde over (x)}τ(t)P{tilde over (x)}(t)+{tilde over (θ)}τ(t)Γ−1{tilde over (θ)}(t). The following upper bound is straight-forward to derive: {tilde over (x)}τ(t)P{tilde over (x)}(t)≦V(t)≦V(0), ∀t≧0. The projection algorithm ensures that {circumflex over (θ)}(t)εΩ, ∀t≧0, and therefore
where {overscore (θ)}max is defined in (25). Since {tilde over (x)}(0)=0, then V(0)={tilde over (θ)}τ(0)Γ−1{tilde over (θ)}(0), which leads to
t≧0. Since λmin(P)∥{tilde over (x)}∥2≦{tilde over (x)}τ(t)P{tilde over (x)}(t), then
D. Transient Performance
Theorems 2 and 3 state that the tracking error goes to zero asymptotically as t→∞. Lemma 5 states that the tracking error can be reduced by increasing the adaptation gain Γc. The following simulations demonstrate that increasing the adaptation gain Γc indeed leads to better transient tracking, but results in unacceptable high-frequency oscillations in the control signal. For simulation purposes, the following system parameters have been selected:
The choice of Γc=0.04 and Q=I leads to desired tracking performance for the reference input r=100, FIGS. 2(a), 2(b). FIGS. 2(c) and 2(d) demonstrate that increasing the adaptive gain improves the transient tracking at the price of high frequency oscillations in the control signal.
FIGS. 3(a) and 3(b) plot the response of the adaptive controller to reference input r=400, without retuning of the adaptive controller. The response to reference input r=25 without retuning the control parameters results in slow convergence, FIGS. 4(a) and 4(b).
These simulations imply two important messages: a) increasing the adaptation gain leads to improved transient tracking performance at the price of high-frequency oscillations in the control signal, b) every change in the reference input implies that retuning of adaptive controller needs to be done to recover the transient tracking performance. Similar deterioration in the transient tracking performance can be observed if one changes the unknown parameters in the system or the initial conditions. Otherwise saying, there is no systematic way of selecting design parameters that would yield the desired transient performance for all possible changes in the system dynamics. On the other hand, the bandwidth limitations of mechanical actuators render implementation of high-frequency control signals
overly challenging. Even if implemented, high-frequency control signal can easily excite the high-frequency dynamics of the system, omitted in the modeling, and lead to destabilization.
In this section, we develop a novel adaptive control architecture that permits complete transient characterization for both system input and output signals. Towards that end, notice that using the matching condition in Assumption 1 the dynamics in (11) can be rewritten as in (20):
{dot over (x)}(t)=Amx(t)−bθτx(t)+bu(t), x(0)=x0
y(t)=cτx(t). (27)
The following control structure
u(t)=u1(t)+u2(t), u1(t)=−Kτx(t), (28)
where u2(t) is the adaptive controller to be determined later, while K is a nominal design gain and can be set to zero, leads to the following partially closed-loop dynamics:
{dot over (x)}(t)=Aox(t)−bθτx(t)+bu2(t), x(0)=x0
y(t)=cτx(t). (29)
The choice of K needs to ensure that Ao=Am−bKτ is Hurwitz or, equivalently, that
Ho(s)=(sI−Ao)−1b (30)
is stable. One obvious choice is K=0. For the linearly parameterized system in (29), we consider the following companion model
{circumflex over ({dot over (x)})}(t)=Ao{circumflex over (x)}(t)+b(u2(t)−{circumflex over (θ)}τ(t)x(t)), {circumflex over (x)}(0)=x0
ŷ(t)=cτ{circumflex over (x)}(t) (31)
along with the adaptive law for {circumflex over (θ)}(t):
{circumflex over ({dot over (θ)})}(t)=ΓProj(x(t){tilde over (x)}τ(t)Pob,{circumflex over (θ)}(t)), {circumflex over (θ)}(0)={circumflex over (θ)}0, (32)
where {tilde over (x)}(t)={circumflex over (x)}(t)−x(t) is the tracking error, ΓεIRn×n=ΓcIn×n is the matrix of adaptation gains, and Po is the solution of the algebraic equation AoτPo+PoAo=−Qo, Qo>0.
Letting
{overscore (r)}(t)={circumflex over (θ)}τ(t)x(t), (33)
the companion model in (31) can be viewed as a low-pass system with u(t) being the control signal, {overscore (r)}(t) being a time-varying disturbance, which is not prevented from having high-frequency oscillations. Instead of (17), we consider the following control design for (31):
u2(s)=C(s)({overscore (r)}(s)+kgr(s)), (34)
where u2(s), {overscore (r)}(s), r(s) are the Laplace transformations of u2(t), {overscore (r)}(t), r(t), respectively, C(s) is a stable and strictly proper system with low-pass gain C(0)=1, and kg is
The complete 1 adaptive controller consists of (28), (31), (32), (34), and closed-loop system with it is illustrated in
Consider the closed-loop companion model in (31) with the control signal defined in (34). It can be viewed as an LTI system with two inputs r(t) and
{overscore (r)}(t):
{circumflex over (x)}(s)={overscore (G)}(s){overscore (r)}(s)+G(s)r(s) (36)
{overscore (G)}(s)=Ho(s)(C(s)−1) (37)
G(s)=kgHo(s)C(s), (38)
where {circumflex over (x)}(s), {overscore (r)}(s) are the Laplace transformations of the signals {circumflex over (x)}(t), {overscore (r)}(t), respectively. We note that {overscore (r)}(t) is related to {circumflex over (x)}(t), u(t) and r(t) via nonlinear relationships.
Remark 2: Since both Ho(s) and C(s) are strictly proper stable systems, one can check easily that {overscore (G)}(s) and G(s) are strictly proper stable systems, even though that 1−C(s) is proper.
Let
where θi is the ith element of θ. Ω is the compact set, where the unknown parameter lies. We now give the 1 performance requirement that ensures stability of the entire system and desired transient performance, as discussed later in Section VI.
1-gain requirement: Design K and C(s) to satisfy
∥{overscore (G)}(s)∥
A. Stability and Asymptotic Convergence
Consider the following Lyapunov function candidate:
V({circumflex over (x)}(t), {tilde over (θ)}(t))={tilde over (x)}τ(t)Po{tilde over (x)}(t)+{circumflex over (θ)}τ(t)Γ−1{tilde over (θ)}(t), (41)
where Po and Γ are introduced in (32). It follows from (29) and (31) that
{circumflex over ({dot over (x)})}(t)=Ao{tilde over (x)}(t)−b{tilde over (θ)}τ(t)x(t), {tilde over (x)}(0)=0. (42)
Hence, it is straightforward to verify from (32) that
{dot over (V)}(t)≦−{tilde over (x)}τ(t)Qo{tilde over (x)}(t)≦0. (43)
Notice that the result in (43) is independent of u2(t), and, hence, Lemma 5 also holds for the 1 adaptive controller along with its adaptive law in (32). However, one cannot deduce stability from it. One needs to prove in addition that with the 1 adaptive controller the state of the companion model will remain bounded. Boundedness of the system state then will follow.
Theorem 4: Given the system in (27) and the 1 adaptive controller defined via (28), (31), (32), (34) subject to (40), the tracking error {tilde over (x)}(t) converges to zero asymptotically:
Proof. Let λmin(Po) be the minimum eigenvalue of Po. From (41) and (43) it follows that
λmin(Po)∥{tilde over (x)}(t)∥2≦{tilde over (x)}τ(t)Po{tilde over (x)}(t)≦V(t)≦V(0),
implying that
From Definition 1,
The relationship in (45) ensures that
and therefore for all t>0 one has
Using the triangular relationship for norms implies that
The projection algorithm in (15) ensures that {circumflex over (θ)}(t)εΩ, ∀t≧0. The definition of {overscore (r)}(t) in (33) implies that ∥{overscore (r)}t∥
It follows from Lemma 1 that ∥{circumflex over (x)}t∥
From (40) it follows that λ<1. The relationship in (48) can be written as
and hence
Since V(0), λmin(Po), ∥G(s)∥
x(t) is bounded. The adaptive law in (32) ensures that the estimates {circumflex over (θ)}(t) are also bounded. Hence, it can be checked easily from (22) that {tilde over ({dot over (x)})}(t) is bounded, and it follows from Barbalat's lemma that
B. Reference System
In this section we characterize the reference system that the 1 adaptive controller in (28), (31), (32), (34) tracks both in transient and steady state, and this tracking is valid for system's both input and output signals. Towards that end, consider the following ideal version of the adaptive controller in (28), (34):
uref(s)=C(s)(kgr(s)+θτxref(s))−Kτxref(s), (51)
where xref(s) is used to denote the Laplace transformation of the state xref(t) of the closed-loop system. The closed-loop system (20) with the controller (51) is given in
Remark 3: Notice that when C(s)=1 and K=0, one recovers the reference model of MRAC, and the controller in (51) reduces to the one in (23). If C(s)≠1 and K≠0, then the control law in (51) changes the bandwidth of uideal(t)=θτx(t)+kgr(t) in (23).
The control law in (51) leads to the following closed-loop dynamics:
xref(s)=Ho(s)(kgC(s)r(s)+(C(s)−1)θτxref(s))
yref(s)=cτxref(s), (52)
which can be explicitly solved for xref(s):
xref(s)=(I−(C(s)−1)Ho(s)θτ)−1Ho(s)kgC(s)r(s).
Hence, it follows from (37) and (38) that
xref(s)=(I−{overscore (G)}(s)θτ)−1G(s)r(s). (53)
Lemma 6: If ∥{overscore (G)}(s)∥
(i) (I−{overscore (G)}(s)θτ)−1 is stable;
(ii) (I−{overscore (G)}(s)θτ)−1G(s) is stable. (54)
Proof. It follows from (1) that
where {overscore (G)}i(s) is the ith element of G(s), and θj is the jth element of θ. From (39) we have Σj=1n|θj|≦θmax, and hence
∥{overscore (G)}(s)θτ∥
C. System Response and Control Signal of the 1 Adaptive Controller
Letting
r1(t)={tilde over (θ)}τ(t)x(t), (56)
we notice that {overscore (r)}(t) in (33) can be rewritten as {overscore (r)}(t)=θτ({circumflex over (x)}(t)−{tilde over (x)}(t))+r1(t). Hence, the companion model in (36) can be rewritten as {circumflex over (x)}(s)={tilde over (G)}(s)(θτ{circumflex over (x)}(s)−θτ{tilde over (x)}(s)+r1(s))+G(s)r(s), where r1(s) is the Laplace transformation of r1(t) defined in (56), and further put into the form:
{circumflex over (x)}(s)=(I−{overscore (G)}(s)θτ)−1(−{overscore (G)}(s)θτ{tilde over (x)}(s)+{overscore (G)}(s)r1(s)+G(s)r(s)). (57)
It follows from (42) and (56) that {tilde over ({dot over (x)})}(t)=Ao{tilde over (x)}(t)−br1(t), and hence
{tilde over (x)}(s)=−Ho(s)r1(s). (58)
Using the expression of {overscore (G)}(s) from (37), the state of the companion model can be presented as
{circumflex over (x)}(s)=(I−{overscore (G)}(s)θτ)−1(−{overscore (G)}(s)θτ{tilde over (x)}(s)−(C(s)−1){tilde over (x)}(s)+G(s)r(s)),
which can be further put into the form:
{circumflex over (x)}(s)=(I−{overscore (G)}(s)θτ)−1G(s)r(s)+(I−{overscore (G)}(s)θτ)−1(−{overscore (G)}(s)θτ{tilde over (x)}(s)−(C(s))−1){tilde over (x)}(s)).
Using xref(s) from (53) and recalling the definition of {tilde over (x)}(s)={circumflex over (x)}(s)−x(s), one arrives at
x(s)=xref(s)−(I+(I−{overscore (G)}(s)θτ)−1({overscore (G)}(s)θτ+(C(s)−1)I)){tilde over (x)}(s). (59)
The expressions in (28), (34) and (51) lead to the following expression of the control signal
u(s)=uref(s)+C(s)r1(s)+(C(s)θτ−Kτ)(x(s)−xr(s)). (60)
D. Asymptotic Performance and Steady State Error
Theorem 5: Given the system in (27) and the 1 adaptive controller defined via (28), (31), (32), (34) subject to (40), we have:
Proof. Let
r2(s)=(I+(I−{overscore (G)}(s)θτ)−1({overscore (G)}(s)θτ+(C(s)−1)I)){tilde over (x)}(s). (63)
It follows from (59) that
r2(t)=xref(t)−x(t). (64)
The signal r2(t) can be viewed as the response of the LTI system
H2(s)=I+(I−{overscore (G)}(s)θτ)−1({overscore (G)}(s)θτ+(C(s)−1)I) (65)
to the bounded error signal {tilde over (x)}(t). It follows from (54) and Remark 2 that (I−{overscore (G)}(s)θτ)−1, {overscore (G)}(s), C(s) are stable and, therefore, H2(s) is stable. Hence, from (44) we have
Let
r3(s)=C(s)r1(s)+(C(s)θτ−Kτ)(x(s)−xr(s)). (66)
It follows from (60) that
r3(t)=u(t)−uref(t). (67)
Since the projection operator ensures that {tilde over (θ)}(t) is bounded, it follows from (42) and (44) that
Since C(s) is a stable proper system, it follows from (61) that
Lemma 7: Given the system in (27) and the 1 adaptive controller defined via (28), (31), (32), (34) subject to (40), if r(t) is constant, then
Proof. Since
yref(t)=cτxref(t), (68)
it follows from (61) that
From (53) it follows that yref(s)=cτ(I−{overscore (G)}(s)θτ)−1G(s)r(s). The end value theorem ensures
Definition of kg in (35) leads to
In addition to the constant reference input signal r, we need to characterize the closed-loop system response with the 1 controller to a time varying input r(t). This is analyzed in the following sections.
E. Transient Performance
We note that (Am−bKτ, b) is the state space realization of Ho(s). Since (Am, b) is controllable, it can be proved easily that (Am−bKτ, b) is also controllable. It follows from Lemma 4 that there exists coεIRn such that
where the order of Nd(s) is one more than the order of Nn(s), and both Nn(s) and Nd(s) are stable polynomials.
Theorem 6: Given the system in (27) and the 1 adaptive controller defined via (28), (31), (32), (34) subject to (40), we have:
where ∥cτ∥
Proof. It follows from (63), (65) and Lemma 1 that ∥r2∥
Therefore,
which leads to (72). The upper bound in (73) follows from (72) and Lemma 2 directly. From (58) we have
where co is introduced in (71). It follows from (71) that
where Nd(s), in Nn(s) are stable polynomials and the order of Nn(s) is one less than the order of Nd(s). Since C(s) is stable and strictly proper, the complete system
is proper and stable, which implies that its 1 gain exists and is finite. Hence, we have
Lemma 5 leads to the upper bound in (74):
Corollary 2: Given the system in (27) and the 1 adaptive controller defined via (28), (31), (32), (34) subject to (40), we have:
Corollary 2 states that x(t), y(t) and u(t) follow xref(t), yref(t) and uref(t) not only asymptotically but also during the transient, provided that the adaptive gain is selected sufficiently large. Thus, the control objective is reduced to designing K and C(s) to ensure that the reference LTI system has the desired response D(s).
Remark 4: Notice that if we set C(s)=1, then the 1 adaptive controller degenerates into a CMAC type, which is equivalent to MRAC. In that case
cannot be finite, since Ho(s) is strictly proper. Therefore, from (76) it follows that γ2→∞, and hence for the control signal in CMAC or MRAC one can not reduce the bound in (74) by increasing the adaptive gain.
We proved that the error between the state and the control signal of the closed-loop system with 1 adaptive controller in (27), (28), (31), (32), (34) (
Consider the following signals:
ydes(s)=cτG(s)r(s)=C(s)kgcτHo(s)r(s), (81)
udes(s)=kgC(s)(1+C(s)θτHo(s)−KτHo(s))r(s). (82)
We note that udes(t) depends on the unknown parameter θ, while ydes(t) does not.
Lemma 8: For the LTI system in
where λ is defined in (49), and h3(t) is the inverse Laplace transformation of
H3(s)=(C(s)−1)C(s)r(s)kgHo(s)θτHo(s). (87)
Proof. It follows from (52) and (53) that yref(s)=cτ(I−{overscore (G)}(s)θτ)−1G(s)r(s). Following Lemma 6, the condition in (40) ensures the stability of the reference LTI system. Since (I−{overscore (G)}(s)θτ)−1 is stable, then one can expand it into convergent series and further write
The relationship in (55) implies that ∥{overscore (G)}(s)θτ∥
The relationship in (88) can be equivalently written as
which along with (37), (38) and (87) leads to
Lemma 1 immediately implies that ∥r4∥
which leads to the upper bounds in (85) and (86).
Thus, the problem is reduced to finding a strictly proper stable C(s) to ensure that
(i) λ<1 or ∥h3∥
(ii) ydes(s)≈D(s)r(s), (92)
where D(s) is the desired LTI system introduced in (12). Then, Theorem 6 and Lemma 8 will imply that the output y(t) of the system in (27) and the 1 adaptive control signal u(t) will follow ydes(t) and udes(t) both in transient and steady state with quantifiable bounds, given in (73), (74) and (83)-(86).
Notice that λ<1 is required for stability. From (81)-(86), it follows that for achieving ydes(s)≈D(s)r(s) it is desirable to ensure that λ or ∥h3∥
λ=∥{overscore (G)}(s)∥
Thus, minimization of λ can be achieved from two different perspectives: i) fix C(s) and minimize ∥Ho(s)∥
i) High-gain design. Set C(s)=D(s). Then minimization of ∥Ho(s)∥
ii) Design without linear feedback. As in MRAC, assume that we can select Am to ensure
Lemma 9: For any single input n-output strictly proper stable system Ho(s) the following is true:
Proof. It follows from (95) that
Since Ho(s) is strictly proper and stable, sHo(s) is stable and has relative degree ≧0, and hence ∥sHo(s)∥
it follows from (2) that
and the proof is complete.
Lemma 9 states that if one chooses kgcτHo(s)r(s)≈D(s), then by increasing the bandwidth of the low-pass system C(s), it is possible to render ∥{overscore (G)}(s)∥
yref(s)≈kgcτHo(s)r(s)≈D(s)r(s).
We note that kgcτHo(s) is exactly the reference model of the MRAC design. Therefore this approach is equivalent to mimicking MRAC, and, hence, high-gain feedback can be completely avoided.
However, increasing the bandwidth of C(s) is not the only choice for minimizing ∥{overscore (G)}(s)∥
To minimize ∥h3∥
(i) ∥h3∥
where h4(t) is the inverse Laplace transformation of H4(s)=C(s)kgHo(s)θτHo(s), and
(ii) ∥h3∥
where h5(t) is the inverse Laplace transformation of H5(s)=r(s)kgHo(s)θτHo(s).
We note that since r(t) is a bounded signal and C(s), Ho(s) are stable proper systems, ∥h4∥
The above presented approaches ensure that C(s)≈1 in the bandwidth of r(s) and Ho(s). Therefore it follows from (81) that ydes(s)=C(s)kgcτHo(s)r(s)≈kgcτHo(s)r(s), which along with (94) yields ydes(s)≈D(s)r(s).
Remark 5: From Corollary 2 and Lemma 8 it follows that the 1 adaptive controller can generate a system response to track (81) and (82) both in transient and steady state if we set the adaptive gain large and minimize λ or ∥h3∥
Remark 6: It follows from (78) that yref(t), uref(t) approximate the unknown system's response and the 1 adaptive control signal, if the latter is implemented with large adaptive gain. It follows from (53) that y(t) approximates the response of the LTI system cτ(I−{overscore (G)}(s)θτ)−1G(s) to r(t), hence its transient performance such as overshoot and settling time can be derived for every value of θ. If we further minimize λ or ∥h3∥
We use a scalar system to compare the performance of 1 adaptive and a high-gain controllers. Towards that end, let {dot over (x)}(t)=θx(t)+u(t) where xεIR is the measurable system state, uεIR is the control signal and θεIR is unknown, which belongs to a given compact set [θmin, θmax]. Let u(t)=−kx(t)+kr(t), leading to the following closed-loop system:
{dot over (x)}(t)=(θ−k)x(t)+kr(t).
We need to choose k>θmax to guarantee stability. We note that both the steady state error and transient performance depend on the unknown parameter value θ. By further introducing a proportional-integral controller, one can achieve zero steady state error. If one chooses k>>max{θmax, θmin}, it leads to high-gain system
To apply the 1 adaptive controller, let the desired reference system be
Let u1=−2x, kg=2, leading to
Choose C(s) as in (95) with large ωn, and set adaptive gain Γc large. Then it follows from Theorem 6 that
The relationship in (96) implies that the control objective is met, while the relationship in (97) states that the 1 adaptive controller approximates uref(t), which cancels the unknown θ.
Consider the same simulation example from Section IV-D. We give now the complete 1 adaptive controller for this system. We set K=0, Γc=40000, and implement the L1 adaptive controller following (28), (31), (32) and (34). First, we give analysis of the 1 adaptive controller. It follows from (30) that
and hence
Next, we check stability of this 1 adaptive controller. It follows from (39) that θmax=20, and ∥{overscore (G)}∥L
with respect to ω and compare it to 1. We notice that for ω>30, we have λ<1, and the 1 gain requirement for stability is guaranteed. So, we can choose
to ensure that λ<0.01, which consequently leads to improved performance bounds in (83)-(86). For ω=160, we have λ=∥{overscore (G)}(s)∥
Next, we compute the bound between yref(t) and ydes(t) in (99). It follows from (87) that
For C(s) and Ho(s) in (101) and (98), it can be numerically verified that
and it follows from (84) that ∥yref−ydes∥
Similarly, it follows from (86) that uref(t) approximates udes(t), i.e.
With large adaptive gain, it follows from Theorem 6 that y(t)≈yref(t), u(t)≈uref(t), ∀t≧0, and hence
if one just considers the dominant poles. The simulation results of the 1 adaptive controller are shown in FIGS. 15(a)-15(b) for reference inputs r=25, 100, 400, respectively. We note that it leads to scaled control input and system response for scaled reference input, as compared to MRAC in FIGS. 8(a)-10(b).
Next, we consider a higher order filter with low adaptive gain Γc=400,
In
with respect to ω and compare it to 1. We notice that when ω>25, we have λ<1 and the 1-gain requirement in (40) is satisfied. Letting ω=50 leads to λ=0.3984, and therefore ∥yref−ydes∥
if one just considers the dominant poles. The simulation results of the 1 adaptive controller are shown in FIGS. 11(a)-11(b), for reference inputs r=25, 100, 400, respectively. We note that it again leads to scaled control input and system response for scaled reference input, as compared to MRAC in FIGS. 8(a)-10(b). In addition, we notice that this performance is achieved by a much smaller adaptive gain as compared to the design with the first order C(s).
Remark 7: The simulations pointed out that with higher order filter C(s) one could use relatively small adaptive gain. While a rigorous relationship between the choice of adaptive gain and the order of filter cannot be derived, an insight into this can be gained from the following analysis. It follows from (27), (28) and (34) that
x(s)=G(s)r(s)+Ho(s)θτx(s)+Ho(s)C(s){overscore (r)}(s), (105)
while the companion model in (36) can be rewritten as
{circumflex over (x)}(s)=G(s)r(s)+Ho(s)(C(s)−1){overscore (r)}(s).
We note that {overscore (r)}(t) is divided into two parts. Its low-frequency component C(s){overscore (r)}(s) is what the system in (105) gets, while the complementary high-frequency component (C(s)−1){overscore (r)}(s) goes into the companion model. If the bandwidth of C(s) is large, then it can suppress only the high frequencies in {overscore (r)}(t), which appear only in the presence of large adaptive gain. A properly designed higher order C(s) can be more effective to serve the purpose of filtering with reduced tailing effects, and, hence can generate similar λ with smaller bandwidth. This further implies that similar performance can be achieved with smaller adaptive gain.
Note the following references referred to in the above discussion: [1] is P. Ioannou and J. Sun, Robust Adaptive Control (Prentice Hall, 1996); [2] is H. K. Khalil, Nonlinear Systems (Prentice Hall, Englewood Cliff, N.J., 2002); [3] is J.-J. E. Slotine and W. Li, Applied Nonlinear Control (Prentice Hall, Englewood Cliffs, N.J., 1991).
While the invention has been described in terms of preferred embodiments, those skilled in the art will recognize that the invention can be practiced with modification within the spirit and scope of the appended claims.
This invention claims priority from U.S. Provisional Patent Application Ser. No. 60/664,187 filed on Mar. 23, 2005 and entitled Low-pass Adaptive/Neural Controller Design with Improved Transient Performance, which is incorporated herein by reference. The invention was made under partial support from contract numbers F49620-03-1-0443 and FA9550-05-1-0157 with the Air Force Office of Scientific Research, and also partial support by ADVANCE VT Institutional Transformation Research Seed Grant from the National Science Foundation.
Number | Date | Country | |
---|---|---|---|
60664187 | Mar 2005 | US |