Heterogeneous multi-agent systems formed by networks of agents having different dynamics and dimensions present a significantly broader class of multiagent systems than their heterogeneous and homogeneous counterparts that consist of networks of agents having different dynamics with the same dimension and identical dynamics, respectively. Therefore, the distributed control of this class of multiagent systems has been an attractive research topic in the systems and control field.
In particular, the cooperative output regulation problem of heterogeneous (in dynamics and dimension) linear time-invariant multiagent systems, where the output of all agents synchronize to the output of the leader, over general fixed directed communication graph topologies have been recently investigated. This problem can be regarded as the generalization of the linear output regulation problem to multi-agent systems. Therefore, distributed control approaches to this problem can be classified into two types: feedforward design methodology and internal model principle. With the former methodology, the feedforward gain of each agent relies on the solution of the regulator equations; hence, this methodology is known to be not robust to plant uncertainties. On the other hand, the latter methodology is robust with respect to small variations of the plant parameters. However, it cannot be applied when the transmission zero condition does not hold.
The common denominator of these results is that an exosystem, which has an unforced linear time-invariant dynamics, generates both a reference trajectory and external disturbances to be tracked and rejected by networks of agents. Specifically, the system matrix of the exosystem is explicitly used by controllers of all agents in some systems and a proper subset of agents in other systems; or each agent incorporates a p-copy internal model of this matrix in its controller. In practical applications, however, it can be a challenge to precisely know the system matrix of the exosystem, even the dynamical structure of the exosystem; especially, when an external leader interacts with the network of agents or a control designer simply injects optimized trajectory commands to the network based on, for example, an online path planning algorithm. To allow ultimately bounded tracking error in such cases, an alternative, generalized definition is needed for the cooperative output regulation problem.
In this disclosure, the cooperative output regulation problem of heterogeneous (in dynamics and dimension) linear time-invariant multiagent systems over general fixed directed communication graph topologies is considered. A new definition of the linear cooperative output regulation problem that is more suitable for practical applications is presented. For internal model based distributed dynamic state feedback, output feedback with local measurement, and output feedback control laws, the solvability of this problem by first assuming a global condition is investigated and then a local sufficient condition under standard assumptions is provided.
The approach of this disclosure is relevant to previous works in which the linear cooperative output regulation problem with an internal model based distributed dynamic state feedback control law are studied. In particular, some previous systems use an output feedback control under an output feedback stabilizability condition. In addition to the generalized definition of the linear cooperative output regulation problem, this disclosure differs from previous approaches at least in terms of the following points.
This disclosure considers not only dynamic state feedback but also dynamic output feedback with local measurement and dynamic output feedback, where the output feedback stabilizability is not assumed.
To prove the existence of a unique solution to the matrix equations that is important for the solvability of the problem, previous approaches decomposes the matrix equations, which include the overall dynamics of the multi-agent system, into matrix equations, which deal with the dynamics of each agent separately. In contrast, Lemma 3B described herein, which is also applicable to dynamic output feedback cases, guarantees that these matrix equations have a unique solution without the need to decompose them.
In addition, a few gaps in the related results of previous approaches are illustrated and fixed.
The disclosure provides a system for controlling motion of a vehicle in a group of vehicles. In one embodiments, the system includes a communication interface, a vehicle platform for travelling among the group of vehicles, and an electronic processor. The electronic processor is configured to determine a local virtual tracking error signal and a controller state signal. The electronic processor is also configured to determine a self-navigation input control signal based on the local virtual tracking error signal and the controller state signal. The self-navigation input control signal is for navigating the vehicle platform when travelling as a member of the group of vehicles. A trajectory of an exosystem is based on a boundedness condition. The trajectory of the exosystem includes external disturbances and a trajectory of a leader vehicle of the group of vehicles. The vehicle communicates with other vehicles in the group of vehicles via a fixed augmented directed connected communication graph topology. Each vehicle in the group of vehicles is stabilizable. Each vehicle in the group of vehicles satisfies a transmission zero condition. Design matrices of the vehicle satisfy an internal model principle.
The disclosure also provides a method for controlling motion of a vehicle in a group of vehicles. In one embodiment, the method includes determining, with an electronic processor of the vehicle, a local virtual tracking error signal and a controller state signal. The method also includes determining, with the electronic processor, a self-navigation input control signal based on the local virtual tracking error signal and the controller state signal. The self-navigation input control signal is for navigating a vehicle platform of the vehicle when travelling as a member of the group of vehicles. A trajectory of an exosystem is based on a boundedness condition. The trajectory of the exosystem includes external disturbances and a trajectory of a leader vehicle of the group of vehicles. The vehicle communicates with other vehicles in the group of vehicles via a fixed augmented directed connected communication graph topology. Each vehicle in the group of vehicles is stabilizable. Each vehicle in the group of vehicles satisfies a transmission zero condition. Design matrices of the vehicle satisfy an internal model principle.
Other aspects of the invention will become apparent by consideration of the detailed description and accompanying drawings.
Before any embodiments of the disclosure are explained in detail, it is to be understood that the invention is not limited in its application to the details of construction and the arrangement of components set forth in the following description or illustrated in the following drawings. The invention is capable of other embodiments and of being practiced or of being carried out in various ways.
In what follows of this document, two embodiments are presented. The first embodiment should be regarded as an early preliminary result performed by the inventors, where the second embodiment captures the first embodiment as a special case and in a more mathematically-elegant fashion.
A standard notation is used in the first embodiment. Specifically, R denotes the set of real numbers, denotes the set of complex numbers, Rn denotes the set of n×1 real column vectors, Rn denotes the set of n×m real matrices, In denotes the n×n identity matrix, 1n denotes the n×1 vector of all ones, and ≙ denotes equality by definition. In addition, (⋅)T for transpose, (⋅)−1 for inverse, det(⋅) for determinant ker(⋅) for kernel, ρ(⋅) for spectral radius, ∥⋅∥ for any norm in Rn, |⋅|2 for the Euclidean norm, diag(⋅) for block diagonal operator, and ⊗ for the Kronecker product. Furthermore, the following notation from [33] is adopted. If a, b∈Rn, then the statement “a≤b” is equivalent to “ak≤bk” for all k=1, . . . , n. If “a≤b” and T∈Rn×n is a nonnegative matrix (i.e., all elements of T are nonnegative), than Ta≤Tb. Finally, the space p for 1≤p<∞ is defined as the set of all piecewise continuous functions u: [0 ∞)→Rm such that ∥u(t)∥p=(∫0∞∥u(t)∥p dt)1/p<∞ and extended space pe is defined using ur (i.e., truncated u) instead of u [9].
Next, the graph theoretical notation used in the first embodiment, which is based on [15]. In particular, consider a time-invariant directed graph =(V, E), where V={v1, . . . , vN} is a nonempty finite set of N nodes and E⊂V×V is a set of edges. Each node in V corresponds to a follower agent in the network. There is an edge rooted at node vj and ended at vi, i.e. (vj, vi)∈E if and only if vi receives information from vj. A=[aij]∈RN×N denotes the adjacency matrix which describes the graph structure, that is aij=1⇔(vj, vi)∈E and aij=0 otherwise. Repeated edges and self loops are not allowed, that is aii=0, ∀i∈ with ={1, . . . , N}. The set of neighbors of node vi is denoted as Ni={j|(vj, vi)∈E}. In-degree and Laplacian matrices are defined as D=diag (d1, . . . , dN) with
and L=D−A, respectively. Thus L has zero row sums (i.e., L1N=0). A directed path from node vi to node vj is a sequence of successive edges in the form of {(vivp), (vp, vq), . . . , (vr, vj)}. A directed graph is said to have a spanning tree if there is a root node such that it has directed paths to all other nodes in the graph. Augmented time-invariant directed graph is defined as =(
Problem Formulation. Consider a system of N follower agents with heterogeneous linear time-invariant dynamics, subject to external disturbances, exchanging information amount each other using their local measurement according to a fixed and directed communication graph topology . For example, the dynamics of follower agent i∈ can be given by
{dot over (x)}i(t)=Aixi(t)+Biui(t)+δi(t), xi(0)=xi0, t≥0, (101)
yi(t)=Cixi(t), (102)
with state xi(t)∈n
In contrast to unforced linear time-invariant exosystems that are studied in the literature (e.g., see [7,30]), the leader node is a command generator for the set of follower agents with dynamics given by (101) and (102). From this perspective, the dynamics of the leader node can have any (for example, linear or nonlinear) dynamics with any dimension provided that it has a unique solution and it can even be a static system. Similarly, external disturbances cannot be generated by known unforced linear time-invariant exosystem.
To state the objective considered here, the follow equation is defined
ei(t)≙yi(t)−y0(t), (103)
as the output tracking error between the output of each follower agent and the output of the leader. In particular, considering the heterogeneous multiagent system, subject to unknown external disturbances, given by equations (101) and (102) together with the output of the leader y0 (t) having unknown dynamics, the objective is to establish a distributed control architecture ui (t) for all agents i in such that the output tracking error given by equation (103) becomes uniformly ultimately bounded. If, in addition, the external disturbances and the output of the leader are constant or they converge to constant vectors, then the output of each follower agent asymptotically converges to the output of the leader y0(t) (i.e., asymptotic synchronization). This invention makes the following assumptions to achieve this objective.
ASSUMPTION 1A. There exist αi≥0 and β≥0 such that
∥δi(t)∥2≤αi<∞, ∀t≥0, ∀i∈, (104)
∥{dot over (y)}0(t)∥2≤β<∞, ∀t≥0. (105)
ASSUMPTION 2A. The augmented graph has a spanning tree with the root node being the leader node.
ASSUMPTION 3A. There exist K1i and K2i such that Ami and CiAmi−1Bmi are nonsingular for all i∈, where Ami ≙Ai−BiK1i∈Rn
ASSUMPTION 4A. The pair (Ai, Bi) is stabilizable for all i∈.
ASSUMPTION 5A. Each follower agent satisfies rank
Note that Assumption 1A is standard and it even allows the external disturbances and the leader to have unbounded signals provided that their derivatives are bounded. Furthermore, from Remark 3.2 in [15], Assumption 2A can be a necessary condition for the cooperative tracking problem considered in this disclosure. It should also be noted that Assumption 3A shows uniform ultimate boundedness of the output tracking error given by equation (103). Specifically, by applying Lemma 2.5.2 in [1], necessary conditions for Assumption 3A are stated as l≤ni, l≤mi, rank (Ci)=l (i.e., Ci is full row rank), rank (K2i)=1 (i.e., K2i is full column rank), and rank (Bi)≥l. In the proposed approach B, Assumptions 1A, 2A, and 3A and a global sufficient condition are used to achieve the objective stated above. In addition, Assumptions 4A and 5A are further utilized to achieve the same objective, but with an agent-wise local sufficient condition.
Distributed Control Architecture. Based on the stated objective and assumptions in the foregoing discussions, the proposed distributed control architecture is presented. For this purpose, first recall that the leader node is observed from a nonempty proper subset of nodes in graph . If all follower agents observe the leader, independent controller can be designed for each follower even though the controller architecture proposed herein it still applicable. In particular, if node vi observes the leader node v0, then there exists an edge (v0, vi) with weighting gain ki>0. Next, each node i in has access to its own state xi(t) and relative output error, that is, (yi(t)−yj(t)) for all j∈i. Similar to [30], the local virtual output tracking error can be defined as
Departing from results in [30,7], an auxiliary dynamics (compensator) with a pair G1i, G2i is defined that incorporates an l-copy internal model of the exosystem, which has unforced linear time-invariant dynamics. This is due to the fact that the leader dynamics is assumed to be unknown and the external disturbances are not due to the exosystem in this invention. Instead, the following auxiliary dynamics are utilized which represents the integration of the local virtual output tracking error, to address the stated objective in the previous section
żi(t)=evi(t), zi(0)=zi0, t≥0. (108)
Note that equation (108) can be viewed to have the pair (0, Il) incorporating an l-copy internal model, but it does not necessarily match with the dynamics of the leader and the spectral properties of the external disturbances unless the output of the leader and the external disturbances are generated by an unforced linear time-invariant exosystem yielding constant output and disturbances. Building on the above definitions, the local cooperative controller considered in this invention has the form
ui(t)=−K1ixi(t)−K2izi(t). (109)
The Proposed Approach: B): Global and Local Sufficient Stability Conditions. Considering the objective, assumptions, and the proposed distributed control architecture in previous sections, a global sufficient condition is established for the uniform ultimate boundedness of the output tracking error, where it is shown the conditions when this result reduces to asymptotic synchronization. Based on a converse theorem for linear time-invariant systems, we then derive an agent-wise local sufficient condition by utilizing a small gain theorem and stabilizability and detectability of global multiagent system dynamics, which is shown that both of them are independent of graph topology except one necessary condition for cooperative tracking problem.
Global Sufficient Stability Condition. In order to express the closed-loop dynamics of follower agents in a compact form, let x(t)≙[x1T(t), . . . , xNT(t)]T ∈R
{dot over (x)}(t)=Amx(t)+Bmz(t)+δ(t), x(0)=x0, t≥0. (110)
Local virtual output tracking error given by equation (107) can also be equivalently written as
Now, using equation (102), equation (111) can be further rewritten as
To express the auxiliary dynamics of all followers in a compact form, let
and C≙diag (C1, . . . , CN) (F is finite matrix here by Assumption 2A). From equations (102), (108), and (112), it is given
ż(t)=[(IN−FA)⊗Il]Cx(t)−(FK⊗1l)y0a(t),
z(0)=z0, t>0. (113)
By letting y(t)≙[y1T(t), . . . , yNT(t)]T∈RNl, the output equation of all follower agents is given by
y(t)=Cx(t). (114)
Finally, let η(t)≙[xT(t)zT(t)]T, ω(t)≙[δT (t),y0aT(t)]T,
and Cg≙[C, 0]. Using equations (110), (113), and (114), the closed-loop dynamics of follower agents together with their auxiliary dynamics can be compactly represented as
{dot over (η)}(t)=Agη(t)+Bgω(t), η(0)=η0, t≥0, (115)
y(t)=Cgη(t). (116)
In order to state Theorem 1A, which provides the global sufficient condition for uniformly ultimately bounded output tracking error, we require two lemmas. The first lemma shows that CgAg−1[Iñ,0]T=0 and y0a(t) is inherently in the kernel of CgAg−1[0, −FK⊗Il]T+INl. The second lemma provides an inequality to upper bound the output tracking error.
LEMMA 1A. If Assumptions 2A and 3A hold, then
−CgAg−1Bgω(t)=y0a(t). (117)
Proof. Starting by proving that Ag is nonsingular. From Proposition 2.8.7 in [1], it is known that Am and −[(IN−FA)⊗Il]CAm−1Bm are nonsingular, then Ag is nonsingular. Based on Assumption 3A, it can be observed that the first sufficient condition, Am being nonsingular, is satisfied. Further, −[(IN−FA)⊗Il]CAm−1Bm is nonsingular. For this purpose, first note that Am−1=diag(Am1−1, . . . , AmN−1) by at least Assumption 3A and Lemma 2.8.2 in [1]. Then, CAm−1Bm=diag(C1Am1−1Bm1, . . . , CNAmN−1BmN), and hence, CAm−1Bm is nonsingular by Assumption 3A. Furthermore, the theorem in [25] is applicable here because of Assumption 2A. This theorem implies that IN−FA is nonsingular, and hence, (IN−FA)⊗Il is nonsingular by Proposition 7.1.7 in [1]. Since CAm−1Bm and (IN−FA)⊗Il are nonsingular matrices, the second sufficient condition, −[(IN−FA)⊗Il]CAm−1Bm is nonsingular, is satisfied. Thus, Ag is nonsingular.
Next, let
Using the definition of Ag−1, Bg, Cg, and ω(t), the following equation is obtained
−CgAg−1Bg(t)=−CM1δ(t)+CM2(FK⊗Il)y0a(t). (118)
From Proposition 2.8.7 in [1],
M1=Am−1+Am−1Bm(−[(IN−FA)⊗Il]CAm−1Bm)−1
x[(IN−FA)⊗Il]CAm−1
=Am−1−Am−1Bm(CAm−1Bm)−1
x[(IN−FA)⊗Il]−1[(IN−FA)⊗Il]CAm−1
=Am−1−Am−1Bm(CAm−1Bm)−1CAm−1 (119)
M2=−Am−1Bm(−[(IN−FA)⊗Il]CAm−1Bm)−1,
=Am−1Bm(CAm−1Bm)−1[(IN−FA)−1⊗Il]. (120)
Now, inserting M1 and M2 into equation (118), the following equation is obtained
−CgAg−1Bbω(t)=[(IN−FA)−1⊗Il](FK⊗Il)y0a(t),
=[(IN−FA)−1FK⊗Il](1N⊗y0(t)
=(IN−FA)−1FK1N⊗y0(t). (121)
Note that F is nonsingular by Assumption 2A and F−1=D+K, t then F−1(IN−FA)1N=(D+K−A)1N=(L+K)1N=K1N. Thus, (IN−FA)−1FK1N=1N (i.e., each row of (IN−FA)−1FK has a sum equal to 1). Using the foregoing property, the equality given in equation (117) from equation (121) is obtained.
LEMMA 2A. If Assumptions 2A and 3A hold, then
∥ei(t)∥2≤∥Cg∥2∥ζ(t)∥2, ∀i∈, (122)
where ζ(t)≙η(t)+Ag−1Bgω(t) is the assistant state.
Proof. From the proof of Lemma 1, it is known that Ag is nonsingular under Assumptions 2A and 3A. Using equation (116) and the assistant state ζ(t), the following is obtained
y(t)=Cg(ζ(t)−Ag−1Bgω(t)). (123)
Now, from Lemma 1A, equation (123) can be rewritten as
y(t)=Cgζ(t)+y0a(t). (124)
Using the fact that ∥ei(t)∥2≤∥y(t)−y0a(t)∥2, ∀i∈, we have
∥ei(t)∥2≤∥Cgζ(t)∥2, ∀i∈ (125)
Then, equation (122) immediately follows from equation (125).
THEOREM 1A. Consider the heterogeneous multiagent system given by equations (101) and (102) together with the output of the leader y0(t). In addition, consider the local cooperative controller given by equation (109) along with equations (107) and (108). Let Assumptions 1A, 2A, and 3A hold. If Ag is Hurwitz, then
∥ei(t)∥2≤max{
where ζ0=ζ(0),
with c and λ are being positive constants that satisfy ∥eA
Proof. The time derivative of ζ(t) can be expressed as
{dot over (ζ)}(t)={dot over (η)}(t)+Ag−1Bg{dot over (ω)}(t). (127)
Inserting equation (115) into equation (127) and using η(t)=ζ(t)−Ag−1Bgω(t), equation (127) can be rewritten as
{dot over (ζ)}(t)=Agζ(t)+Ag−1Bgω(t), ζ(0)=ζ0, t≥0. (128)
Then, the solution of (128) can be written as
ζ(t)=eA
Since Ag is Hurwitz, there exist positive constants c and λ such that ∥eA
∥ζ(t)∥2≤ce−λt∥0∥2+∫0tce−λ(t−τ)∥Ag−1Bg∥2∥{dot over (ω)}(τ)∥2dτ. (130)
Note that ∥{dot over (ω)}(t)∥22=∥{dot over (y)}0a(t)∥22+∥{dot over (δ)}(t)∥22. Moreover, {dot over (y)}0a(t)=1N⊗{dot over (y)}0(t), and hence, ∥{dot over (y)}0a(t)∥22=N∥{dot over (y)}0(t)∥22 and ∥{dot over (δ)}(t)∥22=Σi=1N∥{dot over (δ)}(t)∥22. Based on Assumption 1A, ∥{dot over (w)}(t)∥2≤√{square root over (Nβ2+α2)} for all t≥0. Thus, the upper bound of the assistant state is given by
From equation (131) and Lemma 2A,
Now, it follows from equation (132) and the fact that a1(t)+a2≤max{2a1(t), 2a2}, ∀t≥0, which holds for any a1(t)≥0 and a2≥0, (26) holds.
The following corollary is now immediate.
COROLLARY 1A. If the external disturbance of a follower agent is time-varying (i.e., ∃i∈N such that ai>0) or the leader has time-varying output (i.e., β>0), equation (126) implies that there exists T≥0 such that
∥ei(t)∥2≤
∥ei(t)∥2≤b, ∀t≥T, ∀i∈. (134)
Proof By Assumption 3A, it is possible that ∥Ci∥2≠0 for all i∈, and hence, ∥Cg∥2>0. Furthermore, rank (Bg)≥
From equation (126), either
Thus, equations (133) and (134) are satisfied. In the latter case, equations (133) and (134) are satisfied with T=0 trivially.
REMARK 1A. Theorem 1A shows that the ultimate bound b of the output tracking error of each follower agent is associated with the bound on the time derivative of δ(t) and y0(t) (i.e., α and β). For example, as α and β decrease (respectively, increase), b decreases (respectively, increases). If, in addition, each follower agent is subject to constant external disturbance and the leader has constant output (i.e., α=0 and β=0), it is clear from (26) that b=0, and hence, the output tracking error of each follower agent goes to zero asymptotically (i.e., limt→∞ei(t)=0, ∀i∈).
REMARK 2A. Since the solution of linear time-invariant systems are known, we use this advantage in the stability analysis conducted in Theorem 1A and Corollary 1A. On the other hand, for uniform ultimate boundedness, one can also apply Lyapunov-like theorems such as Theorem 4.18 in [9] and Theorem 4.5 in [11] (e.g., see [31]), or resort to the final value theorem (e.g., see [36]).
From Remark 1A, it is known that output tracking error of each follower agent converges to zero when each follower agent is subject to constant external disturbance and leader has constant output. The following intuitive question now arises: Does the output tracking error of each follower agent still converge to zero if the external disturbances and the output of the leader converge to constant vectors? Since Theorem 1A is not used to answer this question, the following corollary may now be useful.
COROLLARY 2A. Let Assumptions 1A, 2A, and 3A hold. If Ag is Hurwitz, limt→∞δi(t)=δi*∈n
Proof. Since the assistant system in equation (128) is linear time-invariant and Ag is Hurwitz, equation (128) is input-to-state stable. Since limt→∞y0(t)=r* and {dot over (y)}0(t) is uniformly continuous on [0, ∞), limt→∞{dot over (y)}0(t)=0 from l number of independent applications of Barbalat's lemma. Thus, limt→∞{dot over (y)}0a(t)=0. Similarly, limt→∞{dot over (δ)}(t)=0. It now follows from the derivation given after Definition 4.6 in [5] that limt→∞{dot over (ω)}(t)=0 implies limt→∞ζ(t)=0 owing to the input-to-state stability of equation (128). Finally, limt→∞ei(t)=0, ∀i∈, follows from Lemma 2A.
REMARK 3A. It is clear from the proof of Corollary 2A that if Ag is Hurwitz, limt→∞{dot over (δ)}i(t)=0, ∀i∈, and limt→∞{dot over (y)}0(t)=0, then limt→∞ei(t)=0, ∀i∈. That is, asymptotic synchronization can be achieved even if the external disturbances and the output of the leader do not converge to constant vectors. For example, ln(t+1) does not have a limit but its derivative 1/(t+1) tends to zero as t→∞.
Agent-wise Local Sufficient Stability Condition. The main purpose of this section is to derive agent-wise local sufficient condition that provides Hurwitz Ag. For this purpose, the stabilizability and detectability of the global dynamics given by equations (115) and (116) is established. Then, agent-wise local sufficient condition are derived, which provides input-output stability of the global dynamics, by applying a version of the small gain theorem from Theorem 6.2.2.12 in [32]. The input-output stability of global dynamics by itself may not imply that Ag is Hurwitz. Therefore, stabilizability and detectability of finite-dimensional linear time-invariant systems should be carefully tracked to rule out the possibility of unstable hidden modes, and hence, conclude from input-output stability that system matrix is Hurwitz.
For the sake of completeness, a well-known converse theorem (e.g., see Corollary 9.1.80 in [2]) is restated in Theorem 2A using global dynamics given by equations (115) and (116). For finite-dimensional linear time-invariant systems, 2 stability and uniform bounded-input, bounded-output stability are equivalent notions of input-output stability and are used interchangeably in the literature (e.g., see Remark 2 in [13]). Based on Theorem 2A, the derived agent-wise local sufficient condition for input-output stability provides Hurwitz Ag.
THEOREM 2A [2]. Suppose that the pair (Ag, Bg) is stabilizable and the pair (Ag, Cg) is detectable. If the linear time-invariant system given by equations (115) and (116) is 2 stable, then Ag is Hurwitz.
Stabilizability and Detectability of Global Multiagent System Dynamics. In order to derive an agent-wise local sufficient condition, Theorem 2A is used. Thus, it is first needed to establish the stabilizability of the pair (Ag, Bg) and the detectability of the pair (Ag, Cg). These are given in Lemma 3A and Lemma 4A, respectively.
LEMMA 3A. If Assumptions 2A and 3A hold, then the pair (Ag, Bg) is controllable.
Proof. Define the following matrix
where κ∈. By Popov-Belevitch-Hautus test for controllability, the pair (Ag, Bg) is controllable if and only if rank ((κ))=
LEMMA 4A. If Assumptions 2A and 3A hold and Ami is Hurwitz for all i∈, then the pair (Ag, Cg) is detectable.
Proof. An example goal is to show that if ω(t)≡0 and y(t)≡0, then η(t)→0 as t→∞. For this purpose, first let ω(t)≡0, then rewrite equations (115) and (116) as two interconnected systems given by
{dot over (x)}(t)=Amx(t)+Bmz(t), x(0)=x0, t≥0, (136)
y(t)=Cx(t), (137)
and
ż(t)=[(IN−FA)⊗Ii]y(t), z(0)=z0, t≥0 (138)
One can show detectability of the pair (Ag, Cg) from Popov-Belevitch-Hautus test for detectability (i.e., Theorem 16.5 in [6]). For that proof, detectability counterpart of (κ) needs to be represented as a multiplication of two matrices and Corollary 2.5.10 in [1] should be applied. Since the presented proof requires less space, it is preferred.
Next, let y(t)≡0. Then, from equation (138), ż(t)≡0, and hence, z(t)≡z0. To show that z0=0, a contradiction argument as follows can be used. Suppose z0≠0 and take Laplace transform of equations (136) and (137). Thus
Y(s)=C(sI−Am)−1BmZ(s)+C(sI−Am)−1x0, (139)
where Z(s)=1/sz0. Since Ami is Hurwitz for all i∈, Am is Hurwitz. Therefore, a final value theorem as follows can be applied:
It is explained in Lemma 1A that CAm−1Bm is nonsingular due to the Assumption 3A. Thus, it implies that ker(−CAm−1Bm)={0}. Since z0≠0, limt-∞y(t)≠0 that is a contradiction to the fact that y(t)≡0; therefore, z0=0.
Until now, it has been established that if ω(t)≡0 and y(t)≡0, then z(t)≡0. To conclude that η(t)→0 as t→∞, it should be shown that x(t)→0 as t→∞. Note that z(t) and y(t) are the input and the output of the system equations (136) and (137), respectively. Recall that Am is Hurwitz, y(t)≡0, and z(t)≡0. Thus, from equations (136) and (137), x(t)→0 as t→∞.
REMARK 4A. Stabilizability (controllability implies stabilizability) and detectability of the global dynamics given by equations (115) and (116) do not require any information from graph topology (except the necessary condition given in Assumption 2A). Compared to stabilizability, detectability of the global dynamics is established if Ami is also Hurwitz for all i∈. By Assumption 4A, notice that there always exists Kli such that Ami is Hurwitz for all i∈.
A Small Gain Analysis. In this subsection, a version of the small gain theorem given in [32] is used, which is proposed for large-scale systems, to establish the finite gain 2 stability of the global dynamics in equations (115) and (116). By applying Theorem 2A, the agent-wise local sufficient condition for stability of Ag can be determined.
Define ξi(t)≙[xiT(t),ziT(t)]T for i∈ and consider the dynamics of each follower given by equations (101) and (108) with equation (112)
{dot over (ξ)}i(t)=Āiξi(t)+
where
and a positive constant ϕi is introduced to have control over Bfi, which affects the gain of the follower agents. Using equation (109), the definition of Ami and Bmi and recalling equation (102), the dynamics of each follower can equivalently be represented as
{dot over (ξ)}i(t)=Afiξi(t)+Bfivi(t), ξi(0)=ξi0, t≥0 (142)
yi(t)=Cfiξi(t), (143)
where
and Cfi=[Ci 0]. The transfer matrix of the system equations (142) and (143), which is denoted by gi(s), satisfies
gi(s)=Cfi(sI−Afi)−1Bfi. (144)
Assumptions 4A and 5A ensure the stabilizability of the pair (Āi,
Conversely, equations (142) and (143) are stabilizable and detectable for all i∈ when Am is Hurwitz for all i∈and Assumption 3A holds. Specifically, since rank (Bfi)=ni+l, the pair (Afi, Bfi) is controllable from controllability matrix test (i.e., Theorem 12.1 in [6]). Furthermore, by following the similar steps in the proof of Lemma 4A, it can be shown that if Ami is Hurwitz, the pair (Afi, Cfi) is detectable under Assumption 3A. Therefore, if all poles of gi(s) have negative real part (γi is finite) for all i∈, then Afi is Hurwitz for all i∈.
THEOREM 3A. Consider Assumptions 2A and 3A. Let Ami be Hurwitz for all i∈. If
ρ(Γ)ρ(FA)<1, (146)
then Ag is Hurwitz, where Γ≙diag (γ1, . . . , γN).
Proof. It is first shown that equations (115) and (116) are 2 stable with finite gain. This part of the proof can be regarded as an application of Theorem 6.2.2.12 in [32]. Since F is finite, which is owing to Assumption 2A, and A is finite, then F is finite from (46). Therefore, under the stated assumptions and conditions, Afi is Hurwitz for all i∈, and hence, equations (142) and (143) are 2 stable with finite gain γi given by equation (145) for all i∈. Now, the following inequality is determined,
∥yiτ(t)∥
Using the definition vi(t), Minkowski's inequality, and letting y0(t)∈2 and δi(t)∈2 for all i∈, the inequality from equation (147) is given by
Let pτ≙[∥y1τ(t)∥
pτ≤ΓΦ
where equation (149) can also be written as
(IN−ΓFA)pτ≤ΓΦ
Note that Γ is positive-definite diagonal matrix ρ(Γ)=max1≤i≤N γi, and FA is nonnegative matrix. Then, the following inequality is obtained from Lemma 8 in [7]
ρ(ΓFA)≤ρ(Γ)ρ(FA). (151)
Since equation (146) holds, we have the following from equation (151)
ρ(ΓFA)<1. (152)
From Lemma 6.2.1.8 and Lemma 6.2.1.9 in [32], it is known that IN−ΓFA has an inverse that has all nonnegative elements because ΓFA is nonnegative matrix and equation (152) holds. Since (IN−ΓFA)−1 is nonnegative matrix, both sides of equation (150) are can be multiplied by (IN−ΓFA)−1. Thus,
pτ≤(IN−ΓFA)−1ΓΦ
Since the right hand side of equation (153) is independent oft it is concluded from Lemma 2.1.12 in [32] that yi(t)∈2 for all i∈. Hence, equation (153) directly implies that there exists
∥y(t)∥
It follows from equation (154) that equation (115) and (116) are 2 stable with finite gain. Since Assumptions 2A and 3A hold and Ami is Hurwitz for all ∈, equations (115) and (116) are stabilizable and detectable from Lemma 3A and Lemma 4A. Therefore, Ag is Hurwitz from Theorem 2A.
REMARK 5A. If Assumptions 2A, 3A, 4A, and 5A hold, K1i, K2i are designed such that Ami is Hurwitz for all i∈, and the sufficient condition given by equation (146) is satisfied, then Ag is Hurwitz from Theorem 3A. In addition to the foregoing assumptions, if Assumption 1A holds, then uniformly ultimately bounded output tracking error between output of each follower and the output of the leader is achieved by Theorem 1A. Similar to [7,16, 30], Theorem 3A provides agent-wise local condition with a clear link between input-output stability and internal stability of global dynamics given by equations (115) and (116). In contrast to the distributed output regulation problems considered in [27,7,16,29,30], the controller design described herein does not depend on the dynamics of an exosystem.
REMARK 6A. Since ρ(Γ)=max1≤i≤N γi, the sufficient condition given in equation (146) basically implies γiρ(FA)<1, ∀i∈. Therefore, Theorem 3A provides agent-wise local sufficient condition for controller design. If the sufficient condition given by equation (146) in Theorem 3A is replaced with equation (152), it is clearly seen that Theorem 3A is still valid and equations (152) decreases conservatism in sufficient condition. However, equations (152) does not provide agent-wise local sufficient condition anymore. It can be an alternative global sufficient condition, which is together with Hurwitz Ami for all i∈, to the one given in Theorem 1A which states that Ag is Hurwitz. It is also noted that the sufficient condition in equation (146) can be satisfied by solving algebraic Riccati equation (e.g., see Lemma 9 in [7]) or linear matrix inequality (e.g., see Theorem 6 in [30]).
For acyclic directed graphs, derived distributed criterion for controller design is not only agent-wise but also graph-wise local except for the necessary condition given by Assumption 2A. It is shown in the next result.
COROLLARY 3A. Consider Assumptions 2A and 3A. Let Ami be Hurwitz and γi is finite for all i∈. If the directed graph is acyclic (i.e., contains no loop), then Ag is Hurwitz.
Proof. Similar to [34,29], the nodes in can be relabeled such that i>j if (vivj)∈E since is acyclic. Then, the adjacency matrix A of the directed is lower triangular with zero diagonal entries. In this case, FA is also lower triangular matrix with zero diagonal entries. Thus, ρ(FA)=0 and sufficient condition given by equation (146) in Theorem 3A is automatically satisfied. It now follows from Theorem 3A that Ag is Hurwitz.
REMARK 7A. For acyclic graph, obtaining Hurwitz Ag is reduced to designing Hurwitz Ami together with any finite γi is finite for all i∈ if Assumptions 2A, 3A, 4A, and 5A hold. In terms of being agent-wise and graph-wise local, this result is consistent with the results in [34,29] which are obtained by applying similarity transformation.
To illustrate the performance of the proposed distributed controller architecture described in this embodiment, the following two numerical examples are presented. The first example has nonlinear leader dynamics and the second one has linear leader dynamics. For both examples, five follower agents are considered with the following system, input, and output matrices:
and the augmented graph shown in
Now, linear quadratic theory is used to design K1i and K2i. In particular, Q1,4,5=diag(10,1,10,1) and R1,1=diag(1,1) are used to penalize ξ1,4,5(t) and u1,4,5(t), respectively. Similarly, Q2,3=diag(1,1,1,1,2,2) and R2,3=diag(1,1,1) are used to penalize ξ2,3(t) and u2,3(t), respectively. With these design parameters, Assumption 3A is satisfied and Ami is Hurwitz for all i∈. For the given graph, ρ(FA)=0.6334. Letting ϕi=100 for all i∈, then ρ(Γ)=1.0156. Thus, Ag is Hurwitz from Theorem 3A. In the simulations, initial conditions for the follower agents are as follows
x10=[1 0.6]T, x20=[−1 0 −0.2 0]T, x30=[−0.8 −0.4 0 0]T, x40=[0.6 0]T, x50=[0 0.5]T.
EXAMPLE 1. In this example, the dynamics of the leader is nonlinear and has the form:
{dot over (x)}0
{dot over (x)}0
{dot over (x)}0
y0
y0
This leader dynamics is from the exercise problems given in [10]. Regarding x0
and hence, there exists β that satisfies equation (105). Moreover, follower agents are subject to external disturbances, which satisfy equation (104), as follows: δ1(t)=[−0.2, 1−e−0.02t]T, δ2(t)=[0.1 cos(0.1t) 0, 0, −0.1]T, δ3(t)=[0, 0, 0.05 sin(4t), 0]T, δ4(t)=[0.5, 0.4]T, and δ5(t)=[0.01t 0]T, t≥0. Thus, Assumption 1A holds for this example.
EXAMPLE 2. The dynamics of the leader is now given by the following linear system
where u0(t)=1, for t≥0. Since the leader has linear time-invariant dynamics and its system matrix is Hurwitz, we have input-to-state stable leader dynamics. Furthermore, by applying final value theorem, the steady-state value of the output is found to be equal to [1−0.5]T (i.e., r*=[1−0.5]T). In addition, note that {dot over (y)}0(t) is uniformly continuous since ÿ0(t) is bounded owing to the boundedness of x0(t), u0(t), and {dot over (u)}0(t). Furthermore, external disturbances are given as follows: δ1(t)=[0.2 −0.5+e−0.5t]T, δ2(t)=[0.3 0 −0.3e−0.2t sin(t) 0]T, δ3(t)=[0 0.3 0 −0.2]T, δ4(t)=[0.5 0.1e−0.4t sin(4t)]T, δ5(t)=[1−e−0.3t 0]T. Similar to y0(t), given disturbances satisfy the conditions in Corollary 2A. Thus, Corollary 2A guarantees asymptotic synchronization and this fact is demonstrated in
It is worth noting that the distributed controller gains are selected without using any information from leader dynamics and external disturbances. Same controller gains are used for the examples which are presented in this disclosure.
References Related to the First Embodiment
A standard notation is used in the second and third embodiments. Specifically, , n, and n×m respectively denote the sets of all real numbers, real column vectors, and n×m matrices; 1n and In respectively denote the n×1 vector of all ones and the n×n identity matrix; and “≙” denotes equality by definition. In this disclosure, all real matrices are defined over the field of complex numbers. In this disclosure, write (⋅)T for the transpose and ∥⋅∥2 for the (induced) two norm of a matrix; σ(⋅) for the spectrum and ρ(⋅) for the spectral radius of a square matrix; (⋅)−1 for the inverse of a nonsingular matrix; and ⊗ for the Kronecker product. Finally, diag(A1, . . . , An) is a block-diagonal matrix with entries (A1, . . . , An) on its diagonal. Definition 4.4.4 in [1] is followed for the spectrum.
Next, the graph theoretical notation used in the second and third embodiments, which is based on [9], is concisely stated. In particular, consider a fixed (i.e., time-invariant) directed graph =(, ), where ={v1, . . . , vN} is a nonempty finite set of N nodes and ⊂× is a set of edges. Each node in corresponds to a follower agent. There is an edge rooted at node vj and ended at vi (i.e. (vj, vi))∈ if and only if vi receives information from vj. =[aij]∈N×N denotes the adjacency matrix, which describes the graph structure; that is aij>0⇔(vj, vi)∈ and aij=0 otherwise. Repeated edges and self loops are not allowed; that is aii=0, ∀i∈with ={1, . . . , N}. The set of neighbors of node vi is denoted as Ni={j|(vj, vi)∈}. In-degree matrix is defined as =diag(d1, . . . , dn) with di=Σj∈N
The concept of internal model introduced next slightly modifies Definition 1.22 and Remark 1.24 in [5].
Definition 1. Given any square matrix A0, a triple of matrixes (M1, M2, M3) is said to incorporate a p-copy internal model of the matrix A0 if
or
M1=G1, M2=G2, M3=0, (202)
where Sl, l=1, 2, 3, 4, is any matrix with appropriate dimension, T is any nonsingular matrix with an appropriate dimension, the zero matrix in M3 has as many rows as those of G1, and
G1=diag(β1, . . . ,βp), G2=diag(σ1, . . . , σp),
where for l=1, . . . , p, βl∈s
a) The pair (βl, σl) is controllable.
b) The minimal polynomial of A0 equals the characteristic polynomial of βl.
Problem Formulation. Consider a system of N (follower) agents with heterogeneous linear time-invariant dynamics subject to external disturbances over a fixed directed communication graph topology . The dynamics of agent i∈can be given by
{dot over (x)}i(t)=Aixi(t)+Biui(t)+δi(t), xi(0)=xi0, t≥0,
yi(t)=Cixi(t)+Diui(t),
with state xi(t)∈n
Let ω(t)≙[r0T(t),δT(t)]T∈q be the solution of the unknown exosystem, where q=qr+qδ. Instead of assuming that the exosystem has an unforced linear time-invariant dynamics with a known system matrix (for example, see [14, 4, 16]), this disclosure considers that the exosystem has (partially or completely) unknown dynamics. From this perspective, the exosystem can represent any (for example, linear or nonlinear) dynamics provided that its solution is unique and satisfies the conditions given later in Assumptions 1B and 2B.
Define Ei≙[0 Eδi] and R≙[Rr 0]. Furthermore, let ei(t)≙yi(t)−y0(t) be the tracking error. The state of each agent and its tracking error can be defined as
{dot over (x)}i(t)=Aixi(t)+Biui(t)+Eiω(t), xi(O)=xi0, t≥0, (203)
ei(t)=Cixi(t)+Diui(t)−Rω(t). (204)
In this disclosure, the tracking error ei(t) is available to a nonempty proper subset of agents. If all agents observe the leader, decentralized controller can be designed for each agent even though the distributed controllers described herein are still applicable. In particular, if node vi observes the leader node v0, then there exists an edge (v0, vi) with weighting gain ki>0; otherwise ki=0. Each agent has also access to the relative output error; that is, yi(t)−yj(t) for all j∈Ni. Similar to [16], the local virtual tracking error can be defined as
Next, three classes of distributed control laws are defined based on additional available information for each agent.
1) Dynamic State Feedback. If each agent has full access to its own state xi(t), then the dynamic state feedback control law can be defined as
ui(t)=K1ixi(t)+K2izi(t), (206)
żi(t)=G1izi(t)+G2ievi(t), zi(0)=zi0, t≥0, (207)
where zi(t)∈z
2) Dynamic Output Feedback with Local Measurement. If each agent has local measurement output ymi(t)∈pi of the form
ymi(t)=Cmixi(t)+Dmiui(t), (208)
then the dynamic output feedback control law with local measurement is given by
ui(t)=
żi(t)=M1izi(t)+M2ievi(t)+M3iymi(t), zi(0)=zi0, t≥0, (210)
where zi(t)∈z
3) Dynamic Output Feedback. If each agent does not have additional information; that is, the local virtual tracking error evi(t) is the only available information to it, then the dynamic output feedback control law is given by
ui(t)=
żi(t)=M1izi(t)+M2ievi(t), zi(0)=zi0, t≥0, (212)
where zi(t)∈z
This disclosure makes the following first and second assumptions before define the problem.
ASSUMPTION 1B. A0∈q×q has no eigenvalues with negative real parts.
ASSUMPTION 2B. There exist k>0 such that
∥A0ω(t)−{dot over (ω)}(t)∥2≤k<∞, ∀t≥0,
where {dot over (ω)}(t) is a piecewise continuous function in time. The definition given in page 650 of [7] is followed.
Assumption 1B is standard in linear output regulation theory (for example, see Remark 1.3 in [5]). Assumption 2B is required to show the ultimate boundedness of the tracking error and it automatically holds if the exosystem has an unforced linear time-invariant dynamics with the system matrix A0.
Based on a definition of the linear cooperative output regulation problem in [14, 4], the problem considered in this disclosure can be defined as follows.
Definition 2. Given the system in equations (203) and (204) together with the exosystem, which satisfies Assumptions 1B and 2B, and the fixed augmented directed graph, find a distributed control law of the form of equations (206) and (207), or equation (209) and (210), or equations (211) and (212) such that:
a) The resulting closed-loop system matrix is Hurwitz.
b) The tracking error ei(t) is ultimately bounded with ultimate bound b for all initial conditions of the closed-loop system and for all i∈; that is, there exists b>0 and for each initial condition of the closed-loop system, there is T≥0 such that ∥ei(t)∥2≤b, ∀t≥T, ∀i∈.
c) If limt→∞A0ω(t)−{dot over (ω)}(t)=0, then for all initial conditions of the closed-loop system limt→∞ei(t)=0, ∀i∈.
This disclosure makes the following addition assumptions to solve this problem.
ASSUMPTION 3B. The fixed augmented directed graph has a spanning tree with the root node being the leader node.
ASSUMPTION 4B. The pair (Ai, Bi) is stabilizable for all i∈.
ASSUMPTION 5B. For all λ∈σ(A0),
ASSUMPTION 6B. The triple (G1i, G2i, 0) incorporates a p-copy internal model of A0 for all i∈.
ASSUMPTION 7B. The pair (Ai, Cmi) is detectable for all i∈.
ASSUMPTION 8B. The pair (Ai, Ci) is detectable for all i∈.
Assumption 3B is natural to solve the stated problem (for example, see Remark 3.2 in [9]). Similar to Assumption 1B, Assumptions 4B, 5B, 6B, 7B, and 8B are standard in linear output regulation theory (for example, see Chapter 1 of [5]). Assumptions 1B, 2B, 3B, 4B, 5B, and 6B can be used for dynamic state feedback. To utilize some results from dynamic state feedback in the absence of full state information, each agent requires the estimation of its own state. For this purpose, Assumption 7B and Assumption 8B are included for dynamic output feedback with local measurement and dynamic output feedback, respectively.
Solvability of the Problem
For the three different distributed control laws introduced previously herein, the solvability of the problem given in Definition 2 can be investigated. First, property a) of Definition 2 is assumed and it is shown, under mild conditions, that properties b) and c) of Definition 2 are satisfied. Second, an agent-wise local sufficient condition (i.e., distributed criterion) is provided for property a) of Definition 2 (i.e., the stability of the closed-loop system matrix) under standard assumptions.
Before describing the solvability of the problem for each distributed control law, the following definitions are presented that are used throughout this description to express the closed-loop systems in compact forms, some results related to the communication graph topology, and a key lemma about the solvability of matrix equations, which play a role on the solvability of the problem.
Define the following matrices: Φ≙diag(Φ1, . . . , ΦN), Φ=A, B, C, D, E; Φm≙diag (Φm1, . . . , ΦmN), Φm=Cm,Dm; Kl≙diag(Kl1, . . . , KlN), l=1, 2; A0a=IN⊗A0, and Ra=IN⊗R. Further let x(t)≙[x1T(t), . . . , xNT(t)]T∈
Observing yi(t)−yj(t)=ei(t)−ej(t) and recalling di=Σj∈N
Let
and ≙(IN−)⊗Ip. Here, in should be noted that d1+k1>0, ∀i∈ by Assumption 3B; hence, is well-defined. From equation (213), we have
ev(t)=e(t). (214)
Similar to Lemma 3.3 in [9], the following lemma is for IN−.
Lemma 1B. Under Assumption 3B, IN− is non-singular. In addition, all its eigenvalues have positive real parts.
Proof. Under Assumption 3B, IN− satisfies conditions of the theorem in [13]. Thus, it is nonsingular. Since the singularity is eliminated, all the eigenvalues of IN− have positive real parts by the Gershgorin circle theorem (see, for example, Fact 4.10.17 in [1]).
Remark 1B. Since IN− is nonsingular under Assumption 3B, so is by Proposition 7.1.7 in [1]. Then, it is clear from equation (214) that ei(t) is bounded for all i∈ if and only if evi(t) is bounded for all i∈; limt→∞ei(t)=0, ∀i∈ if and only if limt→∞evi(t)=0, ∀i∈.
Now looking at the spectral radius of .
Lemma 2B. Under Assumption 3B, ρ()<1.
Proof. By Lemma 1B, all the eigenvalues of IN− have positive real parts under Assumption 3B. This directly implies from the Fact 6.2.1 in [17] that the leading principal minors of IN− are all positive as IN− is a square matrix whose off-diagonal elements are all nonpositive. Since is a nonnegative square matrix and the leading principal minors of IN− are all positive, ρ()<1 from Lemma 6.2.1.8 in [17].
Next, a lemma is described that extends the field of application of Lemma 1.27 in [5] to heterogeneous (in dynamics and dimension) linear time-invariant multiagent systems over general fixed directed graph communication graph topologies.
Lemma 3B. Let Assumptions 1B and 3B hold. Suppose the triple (M1, M2, M3) incorporates an N p-copy internal model of A0a. If
is Hurwitz, where Â, {circumflex over (B)}, Ĉ, Ĉm, {circumflex over (D)}, and {circumflex over (D)}m are any matrices with appropriate dimensions, then the matrix equations
XA0a=ÂX+{circumflex over (B)}Z+Ê, (215)
ZA0a=M1Z+M2(ĈX+{circumflex over (D)}Z+{circumflex over (F)})+M3(ĈmX+{circumflex over (D)}mZ), (216)
have unique solutions X and Z for any matrices Ê and {circumflex over (F)} of appropriate dimensions. Furthermore, X and Z satisfy
0=ĈX+{circumflex over (D)}Z+{circumflex over (F)}. (217)
In other words, the conclusion is that the matrix equations
XcA0a=AcXc+Bc, (218)
0=CcXc+Dc, (219)
have a unique solution Xc, where
Proof. Note that equations (215) and (216) (respectively, equation (217)) can be equivalently written as equation (218) (respectively, equation (219)). Note also that σ(A0a)=σ(A0). Since Assumption 1B holds and A, is Hurwitz, A0a and Ac have no eigenvalues in common. Thus, the Sylvester equation in equation (218) has a unique solution X, =[XT ZT]T by the first part of Proposition A.2 in [5]. In addition, we show that X and Z also satisfy equation (217). To this end, let
Note that if the triple (M1, M2, M3) takes the form of equation (202), equation (216) already satisfies equation (220), where
Dynamic State Feedback
Let
wherein
{dot over (x)}(t)=(A+BK1)x+BK2z+Eωa(t), x(0)=x0, t≥0, (221)
ż(t)=G1z(t)+G2ev(t), z(0)=z0, t≥0, (222)
e(t)=(C+DK1)x(t)+DK2z(t)−Raωa(t). (223)
Next, insert equation (223) into equation (214) and replace the obtained expression with the one in equation (222). Define
Then, the closed-loop system defined by equations (203), (204), (205), (206), and (207) becomes
{dot over (x)}g(t)=Agxg(t)+Bgωa(t), xg(0)=xg0, t≥0 (224)
e(t)=Cgxg(t)+Dgωa(t), (225)
where
Theorem 1B. Let Assumptions 1B, 2B, 3B, and 6B hold. If Ag is Hurwitz, then the distributed dynamic state feedback control given by equations (206) and (207) can be used in solving the problem in Definition 2.
Proof. By the definition of A0a minimal polynomials for A0a and A0 are the same. Thus, the triple (G1, G2, 0) incorporates an N p-copy internal model of A0a under Assumption 6B. Let (M1, M2, M3)≙(G1,G2,0). Let also Â≙A+BK1, {circumflex over (B)}≙BK2, Ĉ≙C+DK1, Ĉm≙0, {circumflex over (D)}≙DK2, {circumflex over (D)}m≙0, Ê≙E, and {circumflex over (F)}≙−Ra. Then, the quadruple (Ag, Bg, Cg, Dg) takes the form of (Ac, Bc, Cc, Dc) in Lemma 3B. In addition, Ag is Hurwitz and Assumptions 1B and 3B hold. Hence, Lemma 3B is applicable and it implies that the matrix equations
XgA0a=AgXg=Bg, (226)
0=CgXg=Dg, (227)
have a unique solution Xg. Additional discussion on the solvability of equations (226) and (227) are described later in the disclosure.
Under Assumption 2B, ∥A0aωa(t)−{dot over (ω)}a(t)∥2≤Nk, ∀t≥0 since ∥A0aωa(t)−{dot over (ω)}a(t)∥22=N∥A0ω(t)−{dot over (ω)}(t)∥22. Let
e(t)=Cg
Now, the solution of equation (228) can be written as
Since Ag is Hurwitz, there exist c>0 and α>0 such that ∥eA
Using the fact ∥ei(t)∥2≤e(t)∥2, ∀i∈ and observing ∥e(t)∥2≤∥Cg∥2∥
∥ei(t)∥2≤ce−αt∥Cg∥2∥
Where b′=c∥Cg∥2∥Xg∥2Nkα−1. For a given ∈>0, we have either c∥Cg∥2∥
In the latter case, the foregoing inequality may hold for all t≥0.
Thus, ei(t) may be ultimately bounded with the ultimate bound b≙b′+∈ for all
If limt→∞A0ω(t)−{dot over (ω)}(t)=0, then limt→∞A0ωa(t)−{dot over (ω)}a(t)=0. Since Ag is Hurwitz and the system in equations (228) is linear time-invariant when A0aωa(t)−{dot over (ω)}a(t) is viewed as an input to the system, equation (228) is input-to-state stable with respect to this piecewise continuous input (for example, see Chapter 4.9 in [7]). Thus, limt→∞A0ωa(t)−{dot over (ω)}a(t)=0 implies limt→∞
Remark 2B. The ultimate bound b of the tracking error for each agent can be associated with the bound k in Assumption 2B. For example, as k decreases (respectively, increases), b decreases (respectively, increases). To elucidate the role of Assumptions 1B and 2B in practice, the following example scenarios may be considered:
a) When the piecewise continuity and boundedness of {dot over (ω)}(t) are the only information that is available to a control designer, the triple (0,Ip,0) incorporating a p-copy internal model of A0=0 is quite natural; hence, equation (207) can become a distributed integrator. Moreover, Xg in b can be explicitly expressed in terms of Ag and Bg; that is, Xg=−Ag−1Bg by equation (226).
b) When the piecewise continuity and boundedness of {dot over (ω)}(t), the boundedness of (t), and some frequencies in ω(t) are available to a control designer, the triple (G1i, G2i, 0) incorporating a p-copy internal model of A0, which includes these frequencies and zero eigenvalues, is an alternative to the pure distributed integrator.
Remark 3B. As it is shown in Theorem 1B, asymptotic synchronization can be achieved when limt→∞A0ω(t)−ω(t)=0. Next provided are sufficient conditions to check this condition can be determined as follows. If A0=0 holds, limt→∞{dot over (ω)}(t)=0 can replace limt→∞A0ω(t)−{dot over (ω)}(t)=0; hence ω(t)≡ω* (ω* is finite) in place of a), and limt→∞ω(t)=ω* and {dot over (ω)}(t) is uniformly contouring in place of b). If one of the following conditions holds
a) {dot over (ω)}(t)=A0ω(t), ω(0)=ω0, t≥0;
b) limt→∞eA
then limt→∞A0ω(t)−{dot over (ω)}(t)=0. Note that a) may imply b). From Barbalat's lemma given by Lemma 8.2 in [8], b) may imply that limt→∞A0eA
To obtain an agent-wise local sufficient condition that assures property a) of Definition 2 under some standard assumptions, let,
and
{dot over (ξ)}i(t)=Āiξi(t)+
ei(t)=
Next, define the matrices
C
fi≙[Ci+DiK1iDiK2i].
Using equations (206), (230), and (231) can be written as
ξi(t)=Afiξi(t)+Bfiμi(t), ξi(0)=ξi0, t≥0, (232)
ei(t)=Cfiξi(t). (233)
Let, in addition, Ψf≙diag(Ψfi, . . . , ΨfN), Ψ=A, B, C and ξ(t)≙[ξ1T(t), . . . , ξNT(t)]T. Then, equations (232) and (233) can be written in the compact form given by
{dot over (ξ)}(t)=Afξ(t)+Bf(⊗Ip){tilde over (w)}(t), ξ(0)=ξ0, t≥0, (234)
{tilde over (z)}(t)=Cfξ(t), (235)
where e(t)={tilde over (w)}(t)={tilde over (z)}(t). Observe that the system in equations (234) and (235) can take the form of equation (212) in [4]. Therefore, Theorem 2 in [4] is supposed to be used immediately. However, its statement is not correct as it is written. A counterexample is described further later in the disclosure.
This paragraph uses the notation and the terminology from [4]. Readers are referred to (12), Theorem 1, and Theorem 2 in [4]. It should be noted that Theorem 2 relies on Theorem 1 and this theorem is derived by means of Theorem 11.8 and Lemma 11.2 in [19]. According to the mentioned results and Chapter 5.3, which is devoted to the notion of internal stability for the system of interest, in [19], it is clear that the following condition should be added to the hypothesis of Theorem 1: Let the realization of T(s) given by (12) be stabilizable and detectable. With this modification, not only the gap in Theorem 1, but also the one in Theorem 2 is filled.
It is understood that the system in equations (234) and (235) is stabilizable and detectable if Af is Hurwitz. Thus, the new condition is satisfied if Afi is Hurwitz for all i∈.
Remark 4B. Assumptions 4B, 5B, and 6B can ensure the stability of the pair EQN for all i∈. Therefore, K1i and K2i can be chosen such that Afi is Hurwitz for all i∈.
Remark 4B. Assumptions 4B, 5B, and 6B ensure the stabilizability of the pair (Āi,
Let gfi(s)≙Cfi(sI−Afi)−1Bfi. We now state the following theorem for the dynamic state feedback case.
Theorem 2B. Let Assumption 3B hold and Afi be Hurwitz for all i∈. If
∥gfi∥∞ρ()<1, ∀i∈, (236)
where ∥gfi∥∞ is the H∞ norm of gfi(s), then Ag is Hurwitz.
Proof. It follows from Theorem 2 in [4] and the above discussion.
Remark 5B. The inequality given by equation (236) is an agent-wise local sufficient condition; that is, it paves the way for independent controller design for each agent. For the connection between this condition and an algebraic Riccati equation (respectively, linear matrix inequality), this disclosure refers to Lemma 9 in [4] (respectively, Theorem 6 in [16]). Moreover, it is understood from Lemma 2B that ρ()<1 under Assumption 3B. Therefore, we can restate Theorem 2B by replacing equation (236) with ∥gfi∥∞≤1, ∀i∈. In this statement, although the condition becomes more conservative, it is not only agent-wise local but also graph-wise local except Assumption 3. Finally, it should be noted that if the graph considered in Theorem 2B contains no loop (i.e., acyclic), then the nodes in can be relabeled such that i>j when (vj, vi)∈. Thus, A is similar to a lower triangular matrix with zero diagonal entries, so is . This implies that ρ()=0; hence, Theorem 2B does not require the condition given by equation (236) anymore. In terms of being agent-wise and graph-wise local, this special case is consistent with the result in [18].
Dynamic Output Feedback with Local Measurement
Let
where xi(t) is the estimate of the state xi(t),
ui(t)=K1i{circumflex over (x)}i(t)+K2i
To estimate the state xi(t), the following local Luenberger observer is employed
{circumflex over ({dot over (x)})}i(t)=Ai{circumflex over (x)}i(t)+Biui(t)+Hi(ymi(t)−Cmi{circumflex over (x)}i(t)−Dmiui(t)), {circumflex over (x)}i(0)={circumflex over (x)}i0, t≥0, (238)
where H; is the observer gain matrix. Using equation (237), equation (238) can be written as
{circumflex over ({dot over (x)})}i(t)=(Ai+BiK1i−Hi(Cmi+DmiK1i)){circumflex over (x)}i(t)+Hiymi(t)+(Bi−HiDmi)K2i
Let also
By equations (239) and (240), one can define the triple (M1i, M2i, M3i) in equation (210) as
Using equation (208), equation (239) can be rewritten as
{circumflex over ({dot over (x)})}i(t)=HiCmixi(t)+(Ai+BiK1i−HiCmi){circumflex over (x)}i(t)+BiK2i
Next, define {circumflex over (x)}(t)≙[{circumflex over (x)}iT(t), . . . , {circumflex over (x)}NT(t)]T∈
{dot over (x)}(t)=Ax(t)+BK1{circumflex over (x)}(t)+BK2
{circumflex over ({dot over (x)})}(t)=HCmx(t)+(A+BK1−HCm){circumflex over (x)}(t)+BK2
e(t)=Cx(t)+DK1{circumflex over (x)}(t)+DK2
Now, insert equation (246) into equation (214) and replace the obtained expression with the one in equation (245). Let
where
η(t)=Aηη(t)+Bηωa(t), η(0)=η0, t≥0, (247)
e(t)=Cηη(t)+Dηωa(t) (248)
where
For the following result, define AHi≙Ai−HiCmi and AH≙A−HCm. By Assumption 7B, Hi can be chosen such that AHi is Hurwitz for all i∈.
Theorem 3B. Let Assumptions 1B, 2B, 3B, and 6B hold. If Ag is Hurwitz and AHi is Hurwitz for all i∈, then the distributed dynamic output feedback control with local measurement given by equations (209) and (210) can solve the problem in Definition 2.
Proof. Let K≙[K1 K2], Â≙A, {circumflex over (B)}≙BK, Ĉ≙C, Ĉm≙Cm, {circumflex over (D)}≙DK, {circumflex over (D)}m ≙DmK, Ê≙E, {circumflex over (F)}≙−Ra,
Then, observe that the quadruple (Aη, Bη, Cη, Dη) takes the form of (Ac, Bc, Cc, Dc) in Lemma 3B. Recall from the proof of Theorem 1B that the triple (G1, G2, 0) incorporates an N p-copy internal model of A0a under Assumption 6B. Thus, the triple (M1, M2, M3) may also incorporate an N p-copy internal model of A0a. It is given that Assumptions 1B and 3B hold. In order to apply Lemma 3B, it should be shown that An is Hurwitz under the conditions that Ag is Hurwitz and AHi is Hurwitz for all i∈. To this end, the following elementary row and column operations are performed on Aη. First, subtract row 1 from row 2 and add column 2 to column 1. Second, interchange rows 2 and 3, and interchange columns 2 and 3. Thus, we obtain the matrix given by
Considering the performed elementary row and column operations, one can verify that Aη is similar to Āη; hence, they have the same eigenvalues. Since Āη is upper block triangular, σ(Āη)=σ(Ag)∪σ(AH). Note that AH is Hurwitz as AHi is Hurwitz for all i∈. It is also given that Ag is Hurwitz. Thus, Aη is Hurwitz. Then, the matrix equations
XηA0a=AηXη+Bη,
0=CηXη+Dη,
have a unique solution Xη by Lemma 3B.
Following similar steps to those in the proof of Theorem 1B, it can be shown under Assumption 2B that ei(t) is ultimately bounded with an ultimate bound for all η0 and for all i∈. If, in addition, limt→∞A0ω(t)−{dot over (ω)}(t)=0 then for all η0 limt→∞ei(t)=0, ∀i∈.
Since the condition on AHi is both agent-wise and graph-wise local, an agent-wise local sufficient condition that ensures property a) of Definition 2 can be determined by determining an agent-wise local sufficient condition, under standard assumptions, for the stability of Ag, which is already given in Theorem 2B.
Dynamic Output Feedback
Define zi(t),
{circumflex over ({dot over (x)})}i(t)=(Ai+BiK1i−Li(Ci+DiK1i)){circumflex over (x)}i+Lievi(t)+(Bi−LiDi)K2i
where Li is the observer gain matrix. Let
Define {circumflex over (x)}(t) and {circumflex over (x)}(t) as described previously herein and L≙diag(L1, . . . , LN). Inserting equation (237) into equations (203) and (204), using equation (250), equation (240), and the above definitions, equations (203), (212), and (204) can be expressed by equation (243),
{circumflex over ({dot over (x)})}i(t)=(A+BK1−L(C+DK1)){circumflex over (x)}(t)+(B−LD)K2
equation (245), and equation (246). Next, insert equation (246) into equation (214) and replace the obtained expression not only with the one in equation (245) but also with the one in equation (251). In addition, define η(t) as described previously herein. Then, the closed-loop system of equations (203), (204), (205), (211), and (212) can be expressed by equations (247) and (248) if the second row of Aη is replaced with
[LCA+BK1−L(C+DK1−DK1)(B−LD+LD)K2]
and the second row of Bη is replace with −LRa.
Theorem 4B. Let Assumptions 1B, 2B, 3B, and 6B hold. If the resulting Aη is Hurwitz, then the distributed dynamic output feedback control given by equations (211) and (212) solves the problem in Definition 2.
Proof. Define K, Â, {circumflex over (B)}, Ĉ, {circumflex over (D)}, Ê, and {circumflex over (F)}, as in the proof of Theorem 3B. Let Ĉm≙0, {circumflex over (D)}m≙0, and M3≙0. Define also the pair (M1, M2) by replacing the triple (H, Cm, Dm) in M1 (respectively, the zero matrix in M2) given by equation (249) with (L, C, D) (respectively, L). Then, observe that the resulting quadruple (Aη, Bη, Cη, Dη) takes the form of (Ac, Bc, Cc, Dc) in Lemma 3B. By the same argument in the proof of Theorem 3B, the resulting triple (M1, M2, M3) incorporates an N p-copy internal model of A0a under Assumption 6B. Since, in addition, Assumptions 1B, 2B, and 3B hold and Aη is Hurwitz, the rest of the proof can be completed by following the steps given in the proof of Theorem 1B.
Now, a goal can be to determine an agent-wise local sufficient condition that assures property a) of Definition 2 under some standard assumptions. For this purpose, define μi(t) as in as described previously herein and let ζi(t)≙[xiT(t),{circumflex over (x)}iT(t),
Furthermore, consider equations (203), (212), (213), and (204) when ω(t)≡0. By inserting equation (211) into the considered equations, we have
{dot over (ζ)}i(t)=AFiζi(t)+BFiμi(t), ζi(0)=ζi0, t≥0, (252)
ei(t)=CFtζi(t). (253)
Remark 6B. Let ALi≙Ai−LiCi. By performing the elementary row and column operations given in the proof of Theorem 3B on AFi, it can be shown that σ(AFi)=σ(Afi)∪σ(ALi). Note that by Assumption 8B, Li can be chosen such that ALi is Hurwitz for all i∈. In conjunction with Remark 4B, this shows that under Assumptions 4B, 5B, 6B, and 8B, it is possible to find K1i, K2i, and Li such that AFi is Hurwitz for all i∈.
Let gFi(s)≙CFi(sI−AFi)−1BFi. For the dynamic output feedback case, the following theorem can be described.
Theorem 5B. Let Assumption 3B hold and AFi be Hurwitz for all i∈. If
∥gFi∥∞ρ()<1, ∀i∈. (254)
then the resulting Aη is Hurwitz.
Proof. It follows by comparing equations (252) and (253) with equations (232) and (233).
To illustrate the performance of the proposed distributed controller architecture described in this embodiment, the following two numerical examples with different exosystems are presented. In particular, the first (respectively, second) example presents the distributed dynamic state (respectively, output) feedback control law. For both examples, we consider five agents with the following system, input, output, and direct feedthrough matrices
and the augmented graph shown in
Example 3. In this example, the disturbance δ(t) and the trajectory of the leader r0(t) satisfy the following dynamics
{dot over (r)}
0(t)=−t03(t)+u0(t), r0(0), t≥0,
respectively, where
By the solution of the disturbance dynamics with the given initial condition, {dot over (δ)}(t) is bounded. Since u0(t) is piecewise continuous and bounded, r0(t) is bounded by Example 4.25 in [7]; hence, {dot over (r)}0(t) is piecewise continuous and bounded. Clearly, {dot over (ω)}(t) is piecewise continuous and bounded. Furthermore, the exosystem affects the state of each agent and its tracking error through matrices
Suppose the piecewise continuity and boundedness of {dot over (ω)}(t) are the only information that is known about the exosystem. As it is suggested in part a) of Remark 2B, let A0=0 and (G1i, G2i)=(0,1) for all i∈. Thus, Assumptions 1B, 2B, 5B, and 6B hold. With the following controller parameters
K1
Afi is Hurwitz for all i∈ and the condition given by equation (236) is satisfied. Thus, Ag is Hurwitz by Theorem 2B. As Theorem 1B promises, ultimately bounded tracking error is observed in
Example 2. The disturbance and the trajectory of the leader satisfy
{dot over (δ)}(t)=e−0.1t, δ(0)=1, t≥0
respectively. Moreover, Eδ
Suppose the unforced parts of the given dynamics are available to the control designer and the forcing terms are known to be piecewise continuous and convergent to zero. Then, let
and
Hence, Assumptions 1B, 5B, and 6B hold. In addition, limt→∞A0ω(t)−{dot over (ω)}(t)=0. Note that Assumption 2B automatically holds since A0ω(t)−{dot over (ω)}(t) is piecewise continuous and convergent. With the following controller parameters
K1i=−[5.1794 0.7932], Li=[17 80.2]T,
K2i=−[2 5.4458 10.3182], i=1,4,5,
L
i[−187 756 600]T,
AFi is Hurwitz for all i∈ and the condition given by equation (254) is satisfied. Thus, Aη is Hurwitz by Theorem 5B. Furthermore, it is guaranteed by Theorem 4B that limt→∞ei(t)=0, ∀i∈ and this fact is demonstrated in
Solvability of Equations (226) and (227)
Section III in [4] also studies the solvability of the matrix equations in equations (226) and (227), which correspond to the matrix equations given by (6) in [4], with an alternative approach. Specifically, the last paragraph of Section III in [4] lists three sufficient conditions based on Remark 3.8 of [6] to guarantee that these matrix equations have a unique solution. However, it cannot be guaranteed as it claimed in [4]. This subsection aims to present the gaps between the conditions and the existence of a unique solution to the matrix equations, propose appropriate modifications that fill these gaps, and explain the motivation behind the disclosed approach. For this purpose, the first focus is on Definition 3.7 and Remark 3.8 in [6] to fix a problem in [6]. Then, the conditions listed in [4] are revisited to point out the missing one. Finally, a motivational example is provided and the difference between the approach in [4] and the one in this disclosure is highlighted.
In this paragraph, the notation and the terminology in [6] are adopted and readers are referred to (3.5), (3.6), (3.8), Definition 3.7, and Remark 3.8 in [6]. The problem in [6] is that the conditions of Remark 3.8 do not ensure the stabilizability of the pair given by (3.8). Moreover, this problem is directly transferred to [4]. To illustrate this point, the following system, input, output, and direct feedthrough matrices of the plant; and system matrix of the exosystem are considered
C=[0.5 −0.5], D=0, A1=0.
It can be easily checked that the plant and the exosystem above satisfy the first and the second condition of Remark 3.8. Note that m(s)=s is the minimal polynomial of A1. Then, choose the pair (β1,σ1) in (3.6) as follows
It is obvious that the pair (β1,σ1) is controllable and the minimal polynomial of A1 divides the characteristic polynomial of β1. Thus, the pair (G1, G2)≙(β1,σ1) incorporates a 1-copy internal model of A1 according to Definition 3.7. Now investigated the stabilizability of the pair in equation (3.8). This pair is not controllable by the controllability matrix test (for example, see Theorem 12.1 in [3]) and the eigenvalues of the first matrix of this pair are −1, 0, 1, and 2. The eigenvector test for stabilizability (for example, see Theorem 14.1 in [3]) reveals that unstable eigenvalue 1 is the uncontrollable mode; that is, the pair in (3.8) is not stabilizable. Hence, there do not exist K1 and K2 such that Ac defined in (3.5) is Hurwitz. This counterexample to Remark 3.8 is obtained due to the fact that the constructed G1 violates Property 1.5 in [5]. In fact, J. Huang (personal communications, Jun. 9, 2018) recognizes the problem in Remark 3.8; hence, he adds Property 1.5 as a condition to Lemma 1.27 of [5]. Also noted is that the proof of Lemma 1.26 in [5] is still valid even if Assumption 1.1 in [5] is removed from the hypotheses of Lemma 1.26.
In this disclosure, Definition 1 modifies the second property of Definition 1.22 given after (1.58) in [5]. This modification guarantees that Property 1.5 in [5] automatically holds if Assumption 5B holds. Based on the foregoing discussions, it is clear that Remark 4B is true.
The following two paragraphs adopt the notation and the terminology from [4]. Readers are referred to (5), (6), (7), (8), (10), Definition 2, Lemma 2, Section IIB, and Section III in [4]. It is shown in Section III that if the matrix equations in (8) have solutions X1i and X2i for i=1, . . . , N, then the ones in (7) have solutions X1=diag(X11, . . . , X1N) and X2=diag(X21, . . . , X2N); that is, the matrix equations in (6) has a solution X=[X1TX2T]T. Furthermore, it is claimed that if the three conditions listed in the last paragraph of Section III hold, then the matrix equations in (8) have unique solutions X1i and X2i for i=1, . . . , N. In section II.B, S is assumed to have no strictly stable modes. However, these conditions do not guarantee the unique solutions. For, consider A1=0, B1=1, C1=1, D1=0, S=0, R=1, P1=1, F1=0, and G1=1. It can be easily checked that the listed conditions are satisfied and Property 1.5 in [5] is not violated. Choose K1=0 and H1=0. From the first matrix equation in (8), we get 1=0, which is a contradiction. Next, the problem in the claim is pointed out. First, observe that the matrix equations in (8) can be equivalently written as the matrix equations given by (1.70) and (1.71) in [5]. Then, Lemma 1.27 in [5], one can note that the following condition is missed in the claim: Ã; given after (10) is Hurwitz for i=1, . . . , N. After the suggested modification above, Ki and Hi can always be chosen such that Ãi is Hurwitz under the listed conditions. It can be shown that this condition, together with the assumption on S, ensures that zero matrices are the unique solutions to the off-block-diagonal matrix equations in (7) by adding Gc((Cc=DcKc)X1+DcHcX2−Rc) to the left side of the second equation in (7) that gives an equivalent form of (7) and applying the first part of Proposition A.2 in [5]. In conclusion, if the assumption on S holds, the third condition in the list holds for i=1, . . . , N, and Ãi is Hurwitz for i=1, . . . , N, then the matrix equations in (6) has a unique solution X.
According to Lemma 2B, the problem in Definition 2 is solved if the assumption on S holds, Al given after (5) is Hurwitz, and the matrix equations in (6) have a unique solution X. Although the approach utilized during the derivation of the listed conditions does not take into account the assumption on Al, which is required to solve the problem in Definition 2, one may wonder the answer of the following question: Let the listed conditions hold and Al be Hurwitz. Then, can it be concluded that Ãi is Hurwitz for i=1, . . . , N? The answer is no. That is, the missing condition cannot be satisfied by assuming that the listed conditions hold and Al is Hurwitz. To clarify this point, consider the system parameters of the agents, the system matrix of the exosystem, and the adjacency matrix of *.
A
3=1, B3=−1, C3=1, D3=0, S=0,
Choose (Fi,Gi)=(0,1), i=1,2,3. It can be easily checked that the listed conditions are satisfied and Property 1.5 in [5] is not violated. One can also obtain W, which is required to construct Ai, from *. Then, choose the remaining parameters of the controllers as follows
K
2=−[104.56 57.936 14.828], H2=−80, K3=0.8, H3=1.
With this setup, it can be verified that 3 is not Hurwitz even though A is Hurwitz.
Based on the previous example, the following question arises: Is the missing condition in [4] necessary to ensure that the matrix equations given by (6) in [4] have a unique solution? In fact, this question is the motivation behind the key lemma (i.e., Lemma 3B) of this disclosure and the answer is no. In contrast to Section III in [4], the approach in Lemma 3B does not decompose matrix equations, which consist of the overall dynamics of the multiagent system, into matrix equations, which deal with the dynamics of each agent separately; hence, the missing condition in [4] is not required in Lemma 3B. Furthermore, not only dynamic stated feedback but also dynamic output feedback with local measurement and dynamic output feedback effectively utilize Lemma 3B to solve the stated problem in Definition 2 (see Theorems 1B, 3B, and 4B).
Since the proof of Theorem 1 and the statement of Theorem 4 in [16] use the approach in Section III of [4], the description in this subsection will also be helpful for the readers of the results in [16].
On Theorem 2 in [2]
In this subsection, the notion and the terminology in [4] are adopted and readers are referred to (5), (10), (15), and Theorem 2 in [4]. Now, consider the system parameters of the agent, the system matrix of the exosystem, and the adjacency matrix of * given by
Choose (F1, G1)=(0, I2) and
Note that W=1 from *; hence, Al given after (5) is nothing but Ã1 given after (10). With this setup, it can be verified that T1(s) given before Theorem 2 is stable and the condition in (15) is automatically satisfied, but Al is not Hurwitz. This counterexample is obtained because the realization of T1(s) is neither stabilizable nor detectable.
The above setup also applies to Theorem 5 in [16] since it relies on Theorem 2 and its conditions are satisfied. It should be noted that although Assumptions 1-4 in [16] and Property 1.5 in [5] are not listed in the hypothesis of Theorem 5 in [16], this counterexample does not violate them.
References Related to the Second Embodiment
In one embodiment, the capable agent 110 and the other agents 120 represent six respective vehicles comprising a capable vehicle 110 and other vehicles 120 in a group of vehicles of the multiagent system 100. Arrows between the vehicles represent an example of peer-to-peer communication paths for exchanging navigation information among the six vehicles of the multiagent system 100. The dynamics of each of the agents is represented, for example, by linearized equations of translation dynamics described in the first and second embodiments.
The vehicle agent device 300 may be similar or substantially the same as a capable agent 110 or one of the other agents 120 in the multiagent system having a distributed control architecture, as described with respect to
A group of the vehicles may be referred to as a swarm and may comprise a plurality of vehicles that travel over time in a one dimensional spatial system, a two dimensional spatial system, or a three dimensional spatial system to reach a common destination. The common destination may be a fixed destination or one that changes position over time. For example, a group may pursue or follow a target vehicle or an object. The direction of travel of the group may change over time. The direction of travel of the group may be initiated by a vehicle agent device 300 that functions as a capable agent 110. The direction of travel of the group may further be implemented autonomously by one or more vehicle agent devices that function as other agents 120 of the group.
In one embodiment, a vehicle platform may comprise an automobile that travels over land or roads to pursue a moving target vehicle. In another embodiment, the vehicle platform may comprise flying crafts such as planes or drones that travel in the air or space towards a moving target. In another embodiment, the vehicle platform may comprise boats or submarines that travel in or under water. The moving target may travel in the air, space, water, or along the ground. In another embodiment, a plurality of vehicles of a group may travel towards a fixed destination. For example, vehicles traveling in a group may carry passengers, goods, or materials to a common fixed destination. However, the disclosure is not limited to any specific type of vehicle platform or mode of travel, and any suitable vehicle platform or mode of travel may be controlled by the vehicle agent device 300 to be a member of a group of vehicles.
The vehicle agent device 300 shown in
A vehicle agent device 300 that operates as a capable agent 110 may be referred to as a capable agent device. A vehicle agent device 300 that operates as one of the other agents 120 may be referred to as a following agent device. A vehicle agent device 300 may refer to either or both of a capable agent device or a following agent device.
The navigation logic 324 may determine a direction for a course of travel in one, two, or three dimensions for a vehicle platform connected to the vehicle agent device 300. For example, the navigation logic 324 may determine when, where, and how a connected vehicle platform should change its spatial orientation and to what degree its spatial orientation should change in its course of travel within a group. In one embodiment, the navigation logic 324 may utilize location information from the GNSS receiver 332 and may utilize navigation mapping software and USGS data, for example, to determine the course of travel. The navigation mapping software may track the location of the vehicle agent device 300 based on the GNSS location information. In some embodiments, the navigation logic 324 may utilize information received via one or more of the agent sensor devices 322 to determine the course of travel. For example, the agent sensor devices 322 may include ultrasonic sensors, infrared (IR) sensors, cameras with vision processing, a light detection and ranging system (LIDAR), peer-to-peer wireless radio communication, a sound navigation and ranging (SONAR) system, and the like. For example, the information from the agent sensor devices 322 may provide the location or relative location of one or more other vehicles or objects that the vehicle agent device 300 and its connected platform vehicle are following. The one or more other vehicles or objects may be a target of interest, such as a vehicle or object that the group as a whole is following or following. Moreover, the one or more other vehicles or objects sensed by the agent sensor devices 322 may include one or more other vehicles of a group that the vehicle agent device 300 is a member of and travelling among. This form of sensing of the other vehicles of the group may be referred to peer-to-peer communication among vehicles of the group, and may enable a plurality of vehicle agent devices 300 of the group to each autonomously determine their own direction of travel and/or speed.
A capable agent device may serve as a leader of a group of vehicles, and may receive or determine its own navigation parameters, for example, for direction of travel and speed, in a variety of ways. In one embodiment, when the vehicle agent device 300 functions as a capable agent device, the navigation logic 324 may determine a course direction for the capable agent device based on instructions received via a wireless interface of the communication interfaces 320, from an external control station 360 (see
A capable agent device may communicate with the external control station 360 based on any suitable wireless technology or protocols. For example, the external communications may be transmitted via wireless wide area, local area, or personal area networks. Furthermore, the external communications may be implemented using, without limitation, cellular, satellite, Wi-Fi, Bluetooth, two-way radio, half duplex radio, and military or public safety communication systems. The external control station 360 may communicate direction information only with capable agent devices, and may not provide direction information to any of the following agent devices. Furthermore, the external control station 360 and the capable agent device may not have a global knowledge of the state of the group that the vehicle agent device 300 is a member of and travelling among. For example, the location and/or direction of travel of all of the following agent devices may not be known to, and are not communicated by the external control station 360, the capable agent device, or the following agent devices. Instead, the direction of travel of following agent devices travelling among a group, depend on one or more of peer-to-peer communication, local measurements, and autonomous navigation determination. In other words, the external control station 360 may provide navigation control information only to the capable agent device.
In some embodiments, the vehicle platform connected to a capable agent device may be a piloted vehicle. The pilot of the vehicle platform may provide navigation control input to the capable agent device via the graphical user interface 334 or by piloting the vehicle platform of the capable agent device. The pilot input may be received in addition to, or in place of, the information received from the external control station 360. The pilot's and/or external control station 360 control input may communicated peer-to-peer to one or more following agent devices, and may be further propagated peer-to-peer throughout the group.
In some embodiments, the navigation logic 324 may determine navigation parameters for group vehicle navigation commands based on input from one or more agent sensor devices 322 and location information received from the GNSS receiver 332 (for example, data received from vehicle sensors). For example, the navigation and parameters may be based on sensor information received while the vehicle platform and the vehicle agent device 300 are trained on the target of interest and encounter objects or obstacles along a course traveled while tracking the target of interest. Alternatively, the navigation parameters for group vehicle navigation commands may be based on program instructions stored in the memory 340 and/or data from the agent sensor devices 322 received while the capable agent travels along a programmed course.
In some embodiments, there may be more than one capable agent device that functions to lead a group of following vehicles. The multiple capable agent devices may not communicate with each other. However, if the multiple capable agent devices propagate different peer-to-peer commands to the following agent devices of the group, for example, for different navigation scaling factors, the scale factors may eventually reach a consensus value in the group, for example, an average of the different scale factors.
The vehicle navigation command generator 328 may generate commands to the vehicle platform connected to the vehicle agent device 300. The commands may be generated utilizing information received from the group navigation logic. The commands may control the direction of travel of the vehicle platform and the distance of the vehicle platform from any other vehicle agent vehicle platforms that are travelling as a member of the same group. The speed of following agent devices of a group may depend directly or indirectly on the speed of the capable agent device of the group. The speed of the capable agent device may be controlled by the external control station 360, a pilot, or the speed of an object or vehicle that the capable agent device is following.
The vehicle navigation command generator 328 may generate navigation commands for peer-to-peer communication to one or more neighboring following agent devices. The navigation commands for neighboring following agent devices are communicated between agents and may include an agent's current position, scaling factor, rotation angle (to control the group's orientation), and a local integral state vector (to construct the local control).
The vehicle navigation controllers 330 may comprise one or more control interfaces to, for example, steering, elevation, speed, or braking systems in the vehicle platform that the vehicle agent device 300 is connected to. The vehicle navigation controllers 330 may communicate the generate group vehicle navigation commands to the steering, elevation, speed, or braking systems in order for the vehicle platform to perform as a member of the group and perform group tasks. In some embodiments, the vehicle navigation controller 330 controls the movement of the vehicle agent device 300 based on the self-navigation input control signal described herein.
In some embodiments, the electronic processor 338 may be communicatively coupled to, the I/O interface 370, one or more the communication interfaces 320, the agent sensor devices 322, the navigation logic 324, the vehicle navigation command generator 328, the vehicle navigation controllers 330, the GNSS receiver 332, the graphical user interface 334, the electronic processor 338, the memory 340, the camera 350, the microphone 352, the display 354, the speaker 356, the network interface 364 and the user interfaces 366.
The memory 340 may store program instructions 346 that when executed by the electronic processor 338 may cause the electronic processor 338 to perform or support functions of the vehicle agent device 300 according to the embodiments.
In various embodiments, electronic processor 338 may be a uniprocessor system including one electronic processor 338, or a multiprocessor system including several electronic processors 338 (for example, two, four, eight, or another suitable number). Electronic processors 338 may be any suitable processor capable of executing instructions. For example, in various embodiments, the electronic processors 338 may implement any of a variety of instruction set architectures (ISAs), such as the x86, PowerPC, SPARC, or MIPS ISAs, or any other suitable ISA. In multiprocessor systems, each of the electronic processors 338 may commonly, but not necessarily, implement the same ISA.
In some embodiments, at least one electronic processor 338 may be a graphics processing unit. A graphics processing unit or GPU may be considered a dedicated graphics-rendering device. Modern GPUs may be very efficient at manipulating and displaying computer graphics, and their highly parallel structure may make them more effective than typical CPUs for a range of complex graphical algorithms. For example, a graphics processor may implement a number of graphics primitive operations in a way that makes executing them much faster than drawing directly to the screen with a host central processing unit (CPU). In various embodiments, the image processing methods disclosed herein may, at least in part, be implemented by program instructions configured for execution on one of, or parallel execution on two or more of, such GPUs. The GPU(s) may implement one or more application programmer interfaces (APIs) that permit programmers to invoke the functionality of the GPU(s). Suitable GPUs may be commercially available from vendors such as NVIDIA Corporation, ATI Technologies (AMD), and others.
The memory 340 may be configured to store program instructions 346 and/or program data (for example, capable agent data 342 and navigation data 344) accessible by the electronic processor 338 and/or by the navigation logic, 324, the vehicle navigation command generator 328, and/or the vehicle navigation controllers 330, among other elements of the vehicle agent device 300. In various embodiments, the memory 340 may be implemented using any suitable memory technology, such as static random access memory (SRAM), synchronous dynamic RAM (SDRAM), nonvolatile/Flash-type memory, or any other type of memory. In the illustrated embodiment, program instructions and data implementing desired functions, such as those described above for various embodiments, are shown stored within the memory 340 as program instructions 346 and data storage 342 and 344. In other embodiments, program instructions and/or data may be received, sent or stored upon different types of computer-accessible media or on similar media separate from the memory 340 or vehicle agent device 300. Moreover, in some embodiments, a database that is accessible via the network interface 364 may store, among other things, data for implementing desired functions, such as those described above for various embodiments. Generally speaking, a computer-accessible medium may include storage media or memory media such as magnetic or optical media, for example, disk or CD/DVD-ROM coupled to the vehicle agent device 300 via the I/O interface 370. Program instructions and data stored via a computer-accessible medium may be transmitted by transmission media or signals such as electrical, electromagnetic, or digital signals, which may be conveyed via a communication medium such as a network and/or a wireless link, such as may be implemented via network interface 364.
In one embodiment, I/O interface 370 may be configured to coordinate I/O traffic between electronic processor 338, memory 340, one or more of the communication interfaces 320, the agent sensor devices 322, the navigation logic 324, the vehicle navigation command generator 328, the vehicle navigation controllers 330, the GNSS receiver 332, the graphical user interface 334, and any peripheral devices in the vehicle agent device 300, including network interface 364 or other peripheral interfaces, such as the camera 350, microphone 352, display 345, speaker 356, and user interfaces 366. In some embodiments, I/O interface 370 may perform any necessary protocol, timing or other data transformations to convert data signals from one component (for example, the memory 340) into a format suitable for use by another component (for example, electronic processor 338). In some embodiments, I/O interface 370 may include support for devices attached through various types of peripheral buses, such as a variant of the Peripheral Component Interconnect (PCI) bus standard or the Universal Serial Bus (USB) standard, for example. In some embodiments, the function of I/O interface 370 may be split into two or more separate components, such as a north bridge and a south bridge, for example. In addition, in some embodiments some or all of the functionality of I/O interface 370, such as an interface to memory 340, may be incorporated directly into electronic processor 338.
The network interface 364 may be configured to allow data to be exchanged between the vehicle agent device 300 and other devices attached to a network, such as computer systems, a database, or between nodes of the vehicle agent device 300. In various embodiments, network interface 364 may support communication via wired or wireless general data networks, for example: via telecommunications/telephony networks such as voice networks or digital fiber communications networks; via storage area networks such as Fiber Channel SANs, or via any other suitable type of network and/or communications protocol.
The user interfaces may support, in some embodiments, one or more of display terminals, keyboards, keypads, touchpads, scanning devices, voice or optical recognition devices, or any other devices suitable for entering or retrieving data by one or more vehicle agent device 300. Multiple user input/output devices may be present in the vehicle agent device 300 or may be distributed on various nodes of the vehicle agent device 300. In some embodiments, similar input/output devices may be separate from the vehicle agent device 300 and may interact with one or more nodes of the vehicle agent device 300 through a wired or wireless connection, such as over network interface 364.
Those skilled in the art will also appreciate that, while various items are illustrated as being stored in memory or on storage while being used, these items or portions of them may be transferred between memory and other storage devices for purposes of memory management and data integrity. Alternatively, in other embodiments some or all of the software components may execute in memory on another device and communicate with the illustrated vehicle agent device 300 via inter-device communication. Some or all of the system components or data structures may also be stored (for example, as instructions or structured data) on a computer-accessible medium or a portable article to be read by an appropriate drive, various examples of which are described above. In some embodiments, instructions stored on a computer-accessible medium separate from the vehicle agent device 300 may be transmitted to the vehicle agent device 300 via transmission media or signals such as electrical, electromagnetic, or digital signals, conveyed via a communication medium such as a network and/or a wireless link. Various embodiments may further include receiving, sending or storing instructions and/or data implemented in accordance with the foregoing description upon a computer-accessible medium. Accordingly, the present embodiments may be practiced with other system configurations.
At block 410, the electronic processor 338 of the vehicle determines a controller state signal. For example, the electronic processor 338 may determine the controller state signal in accordance with the controller state zi(t) described in the second embodiment.
At block 415, the electronic processor 338 of the vehicle determines self-navigation input control signal based on the local virtual tracking error signal and the controller state signal. For example, the electronic processor 338 may determine the self-navigation input control signal (for example, ui(t)) in accordance with equation (206), equation (208), and/or equation (211) described in the second embodiment.
In some embodiments, the electronic processor 338 receives data from a plurality of vehicle sensors (for example, the agent sensor devices 322). In some embodiments, the electronic processor 338 determines a local measurement output signal (for example, ymi(t)) based on the data received from the plurality of vehicle sensors. For example, the electronic processor 338 may determine the local measurement output signal based on equation (208) described in the second embodiment. In some embodiments, the electronic processor 338 determines the self-navigation input control signal based on the local measurement output signal. For example, the electronic processor 338 may determine the self-navigation input control signal based on equations (208) and (209) described in the second embodiment.
In some embodiments, the electronic processor 338 determines the location virtual tracking error signal based on a tracking error signal, a relative output error signal, or both. In some embodiments, the electronic processor 338 receives the tracking error signal (for example, ei(t)) from a capable agent vehicle. In some embodiments, the electronic processor 338 receives the relative output error signal (for example, yi(t)−yj(t)) from a neighboring agent vehicle. In some embodiments, the electronic processor 338 receives data from a neighboring vehicle (for example a position or a heading) and determine the relative output error signal (for example, yi(t)−yj(t)) based at least in part on the data received from the neighboring agent vehicle.
In some embodiments, the electronic processor 338 receives determines a self-state signal (for example, xi(t)) based on equations (206) and (207) described in the second embodiment.
In this disclosure, the cooperative output regulation problem of heterogeneous linear time-invariant multiagent systems over fixed directed communication graph topologies is described. Among other things, this disclosure provides a new definition of the linear cooperative output regulation problem (see Definition 2), which allows a broad class of functions to be tracked and rejected by a network of agents, and focused on an internal model based distributed control approach. For the three different distributed control laws (i.e., dynamic state feedback, dynamic output feedback with local measurement, and dynamic output feedback), global and local sufficient conditions are determined (see Theorems 1B, 2B, 3B, 4B, and 5B).
The approach in this disclosure is relevant, for example, to the linear cooperative output regulation problem with an internal model based distributed dynamic state feedback control law. This disclosure considers not only dynamic state feedback but also dynamic output feedback with local measurement and dynamic output feedback, where the output feedback stabilizability is not assumed. To prove the existence of a unique solution to the matrix equations that is important for the solvability of the problem, previously systems decompose these matrix equations, which consist of the overall dynamics of the multiagent system, into matrix equations, which deal with the dynamics of each agent separately. In contrast, Lemma 3B as described herein, which is also applicable to dynamic output feedback cases, guarantees that these matrix equations have a unique solution without the need to decompose them.
Various features and advantages are set forth in the following claims.
This application is a non-provisional of and claims benefit of U.S. Provisional Application No. 62/540,813, filed on Aug. 3, 2017, the entire contents of which are incorporated herein by reference.
This invention was made with government support CMMI1657637 awarded by the National Science Foundation. The Government has certain rights in the invention
Number | Name | Date | Kind |
---|---|---|---|
6691151 | Cheyer et al. | Feb 2004 | B1 |
7036128 | Julia et al. | Apr 2006 | B1 |
20070203693 | Estes | Aug 2007 | A1 |
20170329348 | Li | Nov 2017 | A1 |
20180101169 | Applewhite | Apr 2018 | A1 |
Entry |
---|
Yucelen et al., “Control of multivehicle systems in the presence of uncertain dynamics,” International Journal of Control, 2013, 86(9):1540-1553 (Year: 2013). |
Adib Yaghmaie et al., “Output regulation of heterogeneous linear multi-agent systems with differential graphical game,” International Journal of Robust and Nonlinear Control, 2016, 26:2256-2278. |
Adib Yaghmaie et al., “Output regulation of linear heterogeneous multi-agent systems via output and state feedback,” Automatica, 2016, 67:157-164. |
Cai et al., “The adaptive distributed observer approach to the cooperative output regulation of linear multi-agent systems,” Automatica, 2017, 75:299-305. |
Cao et al., “Leader-follower consensus of linear multi-agent systems with unknown external disturbances,” Systems & Control Letters, 2015, 82:64-70. |
Fiedler et al., “On matrices with non-positive off-diagonal elements and positive principal minors,” Czechoslovak Mathematical Journal, 1962, 12(3):382-400. |
Francis et al., “The internal model principle of control theory,” Automatica, 1976, 12(5). |
Huang et al., “Cooperative output regulation of heterogeneous multi-agent systems: an H∞ criterion,” IEEE Transactions on Automatic Control, 2014, 59(1):267-273. |
Huang et al., “On a robust nonlinear servomechanism problem,” IEEE Transactions on Automatic Control, 1994, 39(7):1510-1513. |
Kofman, “Non conservative ultimate bound estimation in LTI perturbed systems,” Automatica, 2005, 41:1835-1838. |
Kottenstette et al., “On relationships among passivity, positive realness, and dissipativity in linear systems,” Automatica, 2014, 50(4): 18 pages. |
Li et al., “Distributed robust consensus control of multi-agent systems with heterogeneous matching uncertainties,” Automatica, 2014, 50(3): 883-889. |
Li et al., “Distributed tracking control for linear multiagent systems with a leader of bounded unknown input,” IEEE Transactions on Automatic Control, 2013, 58(2):518-523. |
Li et al., “Synchronised output regulation of leader-following heterogeneous networked systems via error feedback,” International Journal of Systems Science, 2016, 47(4): 755-764. |
Modares et al., “Optimal model-free output synchronization of heterogeneous systems using off-policy reinforcement learning,” Automatica, 2016, 71:334-341. |
Moylan et al., “On the stability and well-posedness of interconnected nonlinear dynamical systems,” IEEE Transactions on Circuits and Systems, 1980, 27(11):1097-1101. |
Olfati-Saber et al., “Consensus and cooperation in networked multi-agent systems,” Proceedings of the IEEE, 2007, 95(1):215-233. |
Peng et al., “Cooperative tracking and estimation of linear multi-agent systems with a dynamic leader via iterative learning,” International Journal of Control, 2014, 87(6): 1163-1171. |
Sarsilmaz et al., “On control of heterogeneous multiagent systems with unknown leader dynamics,” ASME Dynamic Systems and Control Conference, 2017. |
Sarsilmaz et al., “On control of heterogeneous multiagent systems: A dynamic measurement output feedback approach,” in American Control Conference, 2018, 6 pages. |
Shivakumar et al., “A sufficient condition for nonvanishing of determinants,” Proceedings of the American Mathematical Society, 1974, 43(1):63-66. |
Sontag, “Input to state stability: Basic concepts and results,” Nonlinear and Optimal Control Theory, 2006, pp. 163-220. |
Sontag, “The ISS philosophy as a unifying framework for stability-like behavior,” Nonlinear Control in the Year 2000, 2000, pp. 443-468. |
Su et al., “Cooperative output regulation of linear multi-agent systems by output feedback,” Systems & Control Letters, 2012, 61(12):1248-1253. |
Su et al., “Cooperative output regulation of linear multiagent systems,” IEEE Transactions on Automatic Control, 2012, 57(4): 1062-1066. |
Tang, “Leader-following coordination problem with an uncertain leader in a multi-agent system,” IET Control Theory and Applications, 2014, 8(10): 773-781. |
Tran et al., “On control of multiagent formations through local interactions,” in IEEE Conference on Decision and Control, 2016. |
Wang et al., “A Distributed Control Approach to a Robust Output Regulation Problem for Multi-Agent Linear Systems,” IEEE Transactions on Automatic Control, 2010, 55(12): 2891-2895. |
Wieland et al., “An internal model principle is necessary and sufficient for linear output synchronization,” Automatica, 2011, 47(5): 1068-1074. |
Willems, “The generation of Lyapunov functions for input-output stable systems,” SIAM Journal on Control, 1971, 9(1):105-134. |
Yucelen et al., “Control of multivehicle systems in the presence of uncertain dynamics,” International Journal of Control, 2013, 86(9):1540-1553. |
Number | Date | Country | |
---|---|---|---|
62540813 | Aug 2017 | US |