Distributed control of heterogeneous multi-agent systems

Information

  • Patent Grant
  • 10983532
  • Patent Number
    10,983,532
  • Date Filed
    Friday, August 3, 2018
    6 years ago
  • Date Issued
    Tuesday, April 20, 2021
    3 years ago
Abstract
Systems and methods for controlling motion of a vehicle in a group of vehicles. In one embodiment, the system includes a communication interface, a vehicle platform for travelling among the group of vehicles, and an electronic processor. The electronic processor is configured to determine a local virtual tracking error signal and a controller state signal. The electronic processor is also configured to determine a self-navigation input control signal based on the local virtual tracking error signal and the controller state signal. The self-navigation input control signal is for navigating the vehicle platform. A trajectory of an exosystem is based on a boundedness condition. The vehicle communicates with other vehicles in the group of vehicles via a fixed augmented directed connected communication graph topology. Each vehicle in the group of vehicles is stabilizable and satisfies a transmission zero condition. Design matrices of the vehicle satisfy an internal model principle.
Description
BACKGROUND OF THE INVENTION

Heterogeneous multi-agent systems formed by networks of agents having different dynamics and dimensions present a significantly broader class of multiagent systems than their heterogeneous and homogeneous counterparts that consist of networks of agents having different dynamics with the same dimension and identical dynamics, respectively. Therefore, the distributed control of this class of multiagent systems has been an attractive research topic in the systems and control field.


In particular, the cooperative output regulation problem of heterogeneous (in dynamics and dimension) linear time-invariant multiagent systems, where the output of all agents synchronize to the output of the leader, over general fixed directed communication graph topologies have been recently investigated. This problem can be regarded as the generalization of the linear output regulation problem to multi-agent systems. Therefore, distributed control approaches to this problem can be classified into two types: feedforward design methodology and internal model principle. With the former methodology, the feedforward gain of each agent relies on the solution of the regulator equations; hence, this methodology is known to be not robust to plant uncertainties. On the other hand, the latter methodology is robust with respect to small variations of the plant parameters. However, it cannot be applied when the transmission zero condition does not hold.


The common denominator of these results is that an exosystem, which has an unforced linear time-invariant dynamics, generates both a reference trajectory and external disturbances to be tracked and rejected by networks of agents. Specifically, the system matrix of the exosystem is explicitly used by controllers of all agents in some systems and a proper subset of agents in other systems; or each agent incorporates a p-copy internal model of this matrix in its controller. In practical applications, however, it can be a challenge to precisely know the system matrix of the exosystem, even the dynamical structure of the exosystem; especially, when an external leader interacts with the network of agents or a control designer simply injects optimized trajectory commands to the network based on, for example, an online path planning algorithm. To allow ultimately bounded tracking error in such cases, an alternative, generalized definition is needed for the cooperative output regulation problem.


SUMMARY OF THE INVENTION

In this disclosure, the cooperative output regulation problem of heterogeneous (in dynamics and dimension) linear time-invariant multiagent systems over general fixed directed communication graph topologies is considered. A new definition of the linear cooperative output regulation problem that is more suitable for practical applications is presented. For internal model based distributed dynamic state feedback, output feedback with local measurement, and output feedback control laws, the solvability of this problem by first assuming a global condition is investigated and then a local sufficient condition under standard assumptions is provided.


The approach of this disclosure is relevant to previous works in which the linear cooperative output regulation problem with an internal model based distributed dynamic state feedback control law are studied. In particular, some previous systems use an output feedback control under an output feedback stabilizability condition. In addition to the generalized definition of the linear cooperative output regulation problem, this disclosure differs from previous approaches at least in terms of the following points.


This disclosure considers not only dynamic state feedback but also dynamic output feedback with local measurement and dynamic output feedback, where the output feedback stabilizability is not assumed.


To prove the existence of a unique solution to the matrix equations that is important for the solvability of the problem, previous approaches decomposes the matrix equations, which include the overall dynamics of the multi-agent system, into matrix equations, which deal with the dynamics of each agent separately. In contrast, Lemma 3B described herein, which is also applicable to dynamic output feedback cases, guarantees that these matrix equations have a unique solution without the need to decompose them.


In addition, a few gaps in the related results of previous approaches are illustrated and fixed.


The disclosure provides a system for controlling motion of a vehicle in a group of vehicles. In one embodiments, the system includes a communication interface, a vehicle platform for travelling among the group of vehicles, and an electronic processor. The electronic processor is configured to determine a local virtual tracking error signal and a controller state signal. The electronic processor is also configured to determine a self-navigation input control signal based on the local virtual tracking error signal and the controller state signal. The self-navigation input control signal is for navigating the vehicle platform when travelling as a member of the group of vehicles. A trajectory of an exosystem is based on a boundedness condition. The trajectory of the exosystem includes external disturbances and a trajectory of a leader vehicle of the group of vehicles. The vehicle communicates with other vehicles in the group of vehicles via a fixed augmented directed connected communication graph topology. Each vehicle in the group of vehicles is stabilizable. Each vehicle in the group of vehicles satisfies a transmission zero condition. Design matrices of the vehicle satisfy an internal model principle.


The disclosure also provides a method for controlling motion of a vehicle in a group of vehicles. In one embodiment, the method includes determining, with an electronic processor of the vehicle, a local virtual tracking error signal and a controller state signal. The method also includes determining, with the electronic processor, a self-navigation input control signal based on the local virtual tracking error signal and the controller state signal. The self-navigation input control signal is for navigating a vehicle platform of the vehicle when travelling as a member of the group of vehicles. A trajectory of an exosystem is based on a boundedness condition. The trajectory of the exosystem includes external disturbances and a trajectory of a leader vehicle of the group of vehicles. The vehicle communicates with other vehicles in the group of vehicles via a fixed augmented directed connected communication graph topology. Each vehicle in the group of vehicles is stabilizable. Each vehicle in the group of vehicles satisfies a transmission zero condition. Design matrices of the vehicle satisfy an internal model principle.


Other aspects of the invention will become apparent by consideration of the detailed description and accompanying drawings.





BRIEF DESCRIPTION OF THE DRAWINGS


FIG. 1 is an augmented directed graph of a multi-agent system, in accordance with some embodiments.



FIG. 2 are graphs that represent example output responses of a multi-agent system with a leader having nonlinear dynamics, in accordance with the first embodiment.



FIG. 3 are graphs that represent example output responses of a multi-agent system with a leader having linear dynamics, in accordance with the first embodiment.



FIG. 4 is a graph that represents an example output response of a multi-agent system with dynamic state feedback, in accordance with the second embodiment.



FIG. 5 is a graph that represents an example output response of a multi-agent system with dynamic output feedback, in accordance with the second embodiment.



FIG. 6 is a block diagram of a vehicle agent device, in accordance with some embodiments.



FIG. 7 is a flow chart of a method for controlling motion of a vehicle in a group of vehicles, in accordance with some embodiments.





DETAILED DESCRIPTION

Before any embodiments of the disclosure are explained in detail, it is to be understood that the invention is not limited in its application to the details of construction and the arrangement of components set forth in the following description or illustrated in the following drawings. The invention is capable of other embodiments and of being practiced or of being carried out in various ways.


In what follows of this document, two embodiments are presented. The first embodiment should be regarded as an early preliminary result performed by the inventors, where the second embodiment captures the first embodiment as a special case and in a more mathematically-elegant fashion.


First Embodiment

A standard notation is used in the first embodiment. Specifically, R denotes the set of real numbers, custom character denotes the set of complex numbers, Rn denotes the set of n×1 real column vectors, Rn denotes the set of n×m real matrices, In denotes the n×n identity matrix, 1n denotes the n×1 vector of all ones, and ≙ denotes equality by definition. In addition, (⋅)T for transpose, (⋅)−1 for inverse, det(⋅) for determinant ker(⋅) for kernel, ρ(⋅) for spectral radius, ∥⋅∥ for any norm in Rn, |⋅|2 for the Euclidean norm, diag(⋅) for block diagonal operator, and ⊗ for the Kronecker product. Furthermore, the following notation from [33] is adopted. If a, b∈Rn, then the statement “a≤b” is equivalent to “ak≤bk” for all k=1, . . . , n. If “a≤b” and T∈Rn×n is a nonnegative matrix (i.e., all elements of T are nonnegative), than Ta≤Tb. Finally, the space custom characterp for 1≤p<∞ is defined as the set of all piecewise continuous functions u: [0 ∞)→Rm such that ∥u(t)∥custom characterp=(∫0∥u(t)∥p dt)1/p<∞ and extended space custom characterpe is defined using ur (i.e., truncated u) instead of u [9].


Next, the graph theoretical notation used in the first embodiment, which is based on [15]. In particular, consider a time-invariant directed graph custom character=(V, E), where V={v1, . . . , vN} is a nonempty finite set of N nodes and E⊂V×V is a set of edges. Each node in V corresponds to a follower agent in the network. There is an edge rooted at node vj and ended at vi, i.e. (vj, vi)∈E if and only if vi receives information from vj. A=[aij]∈RN×N denotes the adjacency matrix which describes the graph structure, that is aij=1⇔(vj, vi)∈E and aij=0 otherwise. Repeated edges and self loops are not allowed, that is aii=0, ∀i∈custom character with custom character={1, . . . , N}. The set of neighbors of node vi is denoted as Ni={j|(vj, vi)∈E}. In-degree and Laplacian matrices are defined as D=diag (d1, . . . , dN) with















d
i

=




J


N
i





a
ij
















and L=D−A, respectively. Thus L has zero row sums (i.e., L1N=0). A directed path from node vi to node vj is a sequence of successive edges in the form of {(vivp), (vp, vq), . . . , (vr, vj)}. A directed graph is said to have a spanning tree if there is a root node such that it has directed paths to all other nodes in the graph. Augmented time-invariant directed graph is defined as custom character=(V, Ē), where V={v0, v1, . . . , vN} is the set of N+1 nodes, including the leader node and all follower nodes, and Ē⊂V×V is the set of edges such that E⊂Ē.


Problem Formulation. Consider a system of N follower agents with heterogeneous linear time-invariant dynamics, subject to external disturbances, exchanging information amount each other using their local measurement according to a fixed and directed communication graph topology custom character. For example, the dynamics of follower agent i∈custom character can be given by

{dot over (x)}i(t)=Aixi(t)+Biui(t)+δi(t), xi(0)=xi0, t≥0,  (101)
yi(t)=Cixi(t),  (102)

with state xi(t)∈custom characterni, input ui(t)∈custom characterni, unknown external disturbance δi(t)∈custom characterni, and output yi(t)∈custom characterl. In addition, consider a leader node with unknown dynamics, where the output of this leader, y0(t)∈custom characterl, is available to a nonempty proper subset of follower agents.


In contrast to unforced linear time-invariant exosystems that are studied in the literature (e.g., see [7,30]), the leader node is a command generator for the set of follower agents with dynamics given by (101) and (102). From this perspective, the dynamics of the leader node can have any (for example, linear or nonlinear) dynamics with any dimension provided that it has a unique solution and it can even be a static system. Similarly, external disturbances cannot be generated by known unforced linear time-invariant exosystem.


To state the objective considered here, the follow equation is defined

ei(t)≙yi(t)−y0(t),  (103)

as the output tracking error between the output of each follower agent and the output of the leader. In particular, considering the heterogeneous multiagent system, subject to unknown external disturbances, given by equations (101) and (102) together with the output of the leader y0 (t) having unknown dynamics, the objective is to establish a distributed control architecture ui (t) for all agents i in custom character such that the output tracking error given by equation (103) becomes uniformly ultimately bounded. If, in addition, the external disturbances and the output of the leader are constant or they converge to constant vectors, then the output of each follower agent asymptotically converges to the output of the leader y0(t) (i.e., asymptotic synchronization). This invention makes the following assumptions to achieve this objective.


ASSUMPTION 1A. There exist αi≥0 and β≥0 such that

∥δi(t)∥2≤αi<∞, ∀t≥0, ∀i∈custom character,  (104)
{dot over (y)}0(t)∥2≤β<∞, ∀t≥0.  (105)


ASSUMPTION 2A. The augmented graph custom character has a spanning tree with the root node being the leader node.


ASSUMPTION 3A. There exist K1i and K2i such that Ami and CiAmi−1Bmi are nonsingular for all i∈custom character, where Ami ≙Ai−BiK1i∈Rni×ni and Bmi ≙−BiK2i∈Rni×l.


ASSUMPTION 4A. The pair (Ai, Bi) is stabilizable for all i∈custom character.


ASSUMPTION 5A. Each follower agent satisfies rank










[




A
i




B
i






C
i



0



]

=


n
i

+

l
.






(
106
)







Note that Assumption 1A is standard and it even allows the external disturbances and the leader to have unbounded signals provided that their derivatives are bounded. Furthermore, from Remark 3.2 in [15], Assumption 2A can be a necessary condition for the cooperative tracking problem considered in this disclosure. It should also be noted that Assumption 3A shows uniform ultimate boundedness of the output tracking error given by equation (103). Specifically, by applying Lemma 2.5.2 in [1], necessary conditions for Assumption 3A are stated as l≤ni, l≤mi, rank (Ci)=l (i.e., Ci is full row rank), rank (K2i)=1 (i.e., K2i is full column rank), and rank (Bi)≥l. In the proposed approach B, Assumptions 1A, 2A, and 3A and a global sufficient condition are used to achieve the objective stated above. In addition, Assumptions 4A and 5A are further utilized to achieve the same objective, but with an agent-wise local sufficient condition.


Distributed Control Architecture. Based on the stated objective and assumptions in the foregoing discussions, the proposed distributed control architecture is presented. For this purpose, first recall that the leader node is observed from a nonempty proper subset of nodes in graph custom character. If all follower agents observe the leader, independent controller can be designed for each follower even though the controller architecture proposed herein it still applicable. In particular, if node vi observes the leader node v0, then there exists an edge (v0, vi) with weighting gain ki>0. Next, each node i in custom character has access to its own state xi(t) and relative output error, that is, (yi(t)−yj(t)) for all j∈custom characteri. Similar to [30], the local virtual output tracking error can be defined as











e
vi



(
t
)




=






1


d
i

+

k
i





[





j


N
i






a
ij



(



y
i



(
t
)


-


y
j



(
t
)



)



+


k
i



(



y
i



(
t
)


-


y
0



(
t
)



)



]


.





(
107
)







Departing from results in [30,7], an auxiliary dynamics (compensator) with a pair G1i, G2i is defined that incorporates an l-copy internal model of the exosystem, which has unforced linear time-invariant dynamics. This is due to the fact that the leader dynamics is assumed to be unknown and the external disturbances are not due to the exosystem in this invention. Instead, the following auxiliary dynamics are utilized which represents the integration of the local virtual output tracking error, to address the stated objective in the previous section

żi(t)=evi(t), zi(0)=zi0, t≥0.  (108)

Note that equation (108) can be viewed to have the pair (0, Il) incorporating an l-copy internal model, but it does not necessarily match with the dynamics of the leader and the spectral properties of the external disturbances unless the output of the leader and the external disturbances are generated by an unforced linear time-invariant exosystem yielding constant output and disturbances. Building on the above definitions, the local cooperative controller considered in this invention has the form

ui(t)=−K1ixi(t)−K2izi(t).  (109)


The Proposed Approach: B): Global and Local Sufficient Stability Conditions. Considering the objective, assumptions, and the proposed distributed control architecture in previous sections, a global sufficient condition is established for the uniform ultimate boundedness of the output tracking error, where it is shown the conditions when this result reduces to asymptotic synchronization. Based on a converse theorem for linear time-invariant systems, we then derive an agent-wise local sufficient condition by utilizing a small gain theorem and stabilizability and detectability of global multiagent system dynamics, which is shown that both of them are independent of graph topology except one necessary condition for cooperative tracking problem.


Global Sufficient Stability Condition. In order to express the closed-loop dynamics of follower agents in a compact form, let x(t)≙[x1T(t), . . . , xNT(t)]T ∈Rn, δ(t)≙[δ1T(t), . . . , δNT(t)]T ∈Rn, where ni=1Nni,z(t)≙[z1T(t), . . . , zNT(t)]T ∈RNl, Am≙diag (Am1, . . . , AmN), and Bm ≙diag (Bm1, . . . , BmN). From equations (101) and (109), it is given

{dot over (x)}(t)=Amx(t)+Bmz(t)+δ(t), x(0)=x0, t≥0.  (110)

Local virtual output tracking error given by equation (107) can also be equivalently written as











e

v

i




(
t
)


=


1


d
i

+

k
i





[



(


d
i

+

k
i


)




y
i



(
t
)



-




j


N
i






a

i

j





y
j



(
t
)




-


k
i




y
0



(
t
)




]






(
111
)







Now, using equation (102), equation (111) can be further rewritten as











e

v

i




(
t
)


=



C
i




x
i



(
t
)



-


1


d
i

+

k
i





[





j


N
i






a

i

j





y
j



(
t
)




-


k
i




y
0



(
t
)




]







(
112
)







To express the auxiliary dynamics of all followers in a compact form, let












y

0

a




(
t
)




=





1
N




y
0



(
t
)




,



R

N

l



,

K


=




diag


(


k
1

,





,





k
N


)



,





F


=




diag


(


1


d
1

+

k
1



,





,





1


d
N

+

k
N




)



,













and C≙diag (C1, . . . , CN) (F is finite matrix here by Assumption 2A). From equations (102), (108), and (112), it is given

ż(t)=[(IN−FA)⊗Il]Cx(t)−(FK⊗1l)y0a(t),
z(0)=z0, t>0.  (113)


By letting y(t)≙[y1T(t), . . . , yNT(t)]T∈RNl, the output equation of all follower agents is given by

y(t)=Cx(t).  (114)


Finally, let η(t)≙[xT(t)zT(t)]T, ω(t)≙[δT (t),y0aT(t)]T,








A
g



=




[




A
m




B
m







[


(


I
N

-
FA

)



I
t


]


C



0



]


,


B
g



=




[




I

n
_




0




0




-
FK



I
t





]







and Cg≙[C, 0]. Using equations (110), (113), and (114), the closed-loop dynamics of follower agents together with their auxiliary dynamics can be compactly represented as

{dot over (η)}(t)=Agη(t)+Bgω(t), η(0)=η0, t≥0,  (115)
y(t)=Cgη(t).  (116)


In order to state Theorem 1A, which provides the global sufficient condition for uniformly ultimately bounded output tracking error, we require two lemmas. The first lemma shows that CgAg−1[Iñ,0]T=0 and y0a(t) is inherently in the kernel of CgAg−1[0, −FK⊗Il]T+INl. The second lemma provides an inequality to upper bound the output tracking error.


LEMMA 1A. If Assumptions 2A and 3A hold, then

CgAg−1Bgω(t)=y0a(t).  (117)


Proof. Starting by proving that Ag is nonsingular. From Proposition 2.8.7 in [1], it is known that Am and −[(IN−FA)⊗Il]CAm−1Bm are nonsingular, then Ag is nonsingular. Based on Assumption 3A, it can be observed that the first sufficient condition, Am being nonsingular, is satisfied. Further, −[(IN−FA)⊗Il]CAm−1Bm is nonsingular. For this purpose, first note that Am−1=diag(Am1−1, . . . , AmN−1) by at least Assumption 3A and Lemma 2.8.2 in [1]. Then, CAm−1Bm=diag(C1Am1−1Bm1, . . . , CNAmN−1BmN), and hence, CAm−1Bm is nonsingular by Assumption 3A. Furthermore, the theorem in [25] is applicable here because of Assumption 2A. This theorem implies that IN−FA is nonsingular, and hence, (IN−FA)⊗Il is nonsingular by Proposition 7.1.7 in [1]. Since CAm−1Bm and (IN−FA)⊗Il are nonsingular matrices, the second sufficient condition, −[(IN−FA)⊗Il]CAm−1Bm is nonsingular, is satisfied. Thus, Ag is nonsingular.


Next, let







A
g

-
1




=





[




M
1




M
2






M
3




M
4




]

.






Using the definition of Ag−1, Bg, Cg, and ω(t), the following equation is obtained

CgAg−1Bg(t)=−CM1δ(t)+CM2(FK⊗Il)y0a(t).  (118)


From Proposition 2.8.7 in [1],

M1=Am−1+Am−1Bm(−[(IN−FA)⊗Il]CAm−1Bm)−1
x[(IN−FA)⊗Il]CAm−1
=Am−1−Am−1Bm(CAm−1Bm)−1
x[(IN−FA)⊗Il]−1[(IN−FA)⊗Il]CAm−1
=Am−1−Am−1Bm(CAm−1Bm)−1CAm−1  (119)
M2=−Am−1Bm(−[(IN−FA)⊗Il]CAm−1Bm)−1,
=Am−1Bm(CAm−1Bm)−1[(IN−FA)−1⊗Il].  (120)


Now, inserting M1 and M2 into equation (118), the following equation is obtained

CgAg−1Bbω(t)=[(IN−FA)−1⊗Il](FK⊗Il)y0a(t),
=[(IN−FA)−1FK⊗Il](1N⊗y0(t)
=(IN−FA)−1FK1N⊗y0(t).  (121)


Note that F is nonsingular by Assumption 2A and F−1=D+K, t then F−1(IN−FA)1N=(D+K−A)1N=(L+K)1N=K1N. Thus, (IN−FA)−1FK1N=1N (i.e., each row of (IN−FA)−1FK has a sum equal to 1). Using the foregoing property, the equality given in equation (117) from equation (121) is obtained.


LEMMA 2A. If Assumptions 2A and 3A hold, then

ei(t)∥2≤∥Cg2∥ζ(t)∥2, ∀i∈custom character,  (122)

where ζ(t)≙η(t)+Ag−1Bgω(t) is the assistant state.


Proof. From the proof of Lemma 1, it is known that Ag is nonsingular under Assumptions 2A and 3A. Using equation (116) and the assistant state ζ(t), the following is obtained

y(t)=Cg(ζ(t)−Ag−1Bgω(t)).  (123)


Now, from Lemma 1A, equation (123) can be rewritten as

y(t)=Cgζ(t)+y0a(t).  (124)


Using the fact that ∥ei(t)∥2≤∥y(t)−y0a(t)∥2, ∀i∈custom character, we have

ei(t)∥2≤∥Cgζ(t)∥2, ∀i∈custom character  (125)

Then, equation (122) immediately follows from equation (125).


THEOREM 1A. Consider the heterogeneous multiagent system given by equations (101) and (102) together with the output of the leader y0(t). In addition, consider the local cooperative controller given by equation (109) along with equations (107) and (108). Let Assumptions 1A, 2A, and 3A hold. If Ag is Hurwitz, then

ei(t)∥2≤max{ce−λt∥ζ02,b}, ∀t≥0, ∀i∈custom character,  (126)

where ζ0=ζ(0), c=2c∥Cg2, and







b
=



c
_







A
g

-
1




B
b




2





N






β
2


+

a
2





 

λ



,





with c and λ are being positive constants that satisfy ∥eAt22<ce−λt and a2i=1Nai2.


Proof. The time derivative of ζ(t) can be expressed as

{dot over (ζ)}(t)={dot over (η)}(t)+Ag−1Bg{dot over (ω)}(t).  (127)


Inserting equation (115) into equation (127) and using η(t)=ζ(t)−Ag−1Bgω(t), equation (127) can be rewritten as

{dot over (ζ)}(t)=Agζ(t)+Ag−1Bgω(t), ζ(0)=ζ0, t≥0.  (128)

Then, the solution of (128) can be written as

ζ(t)=eAgtζ0+∫0teAg(t−τ)Ag−1Bg{dot over (ω)}(τ)dτ.  (129)


Since Ag is Hurwitz, there exist positive constants c and λ such that ∥eAgt2≤ce−λt. Furthermore, let the eigenvalues of Ag be labeled in a nondecreasing order, that is, Re(λ1)≤Re(λ2)≤ . . . ≤Re(λn+Nl). For example, λ<−Re(λn+Nl) from Section 8.3 in [6]. Using the bound on the state transition matrix,

∥ζ(t)∥2≤ce−λt02+∫0tce−λ(t−τ)∥Ag−1Bg2∥{dot over (ω)}(τ)∥2dτ.  (130)


Note that ∥{dot over (ω)}(t)∥22=∥{dot over (y)}0a(t)∥22+∥{dot over (δ)}(t)∥22. Moreover, {dot over (y)}0a(t)=1N⊗{dot over (y)}0(t), and hence, ∥{dot over (y)}0a(t)∥22=N∥{dot over (y)}0(t)∥22 and ∥{dot over (δ)}(t)∥22i=1N∥{dot over (δ)}(t)∥22. Based on Assumption 1A, ∥{dot over (w)}(t)∥2≤√{square root over (Nβ22)} for all t≥0. Thus, the upper bound of the assistant state is given by














ζ


(
t
)




2





ce


-
λ






t







ζ
0



2


+



c






A
g

-
1




B
g




2



 

λ






N






β
2


+

a
2






,



t

0.






(
131
)







From equation (131) and Lemma 2A,















e
i



(
t
)




2





ce


-
λ






t







C
g



2






ζ
0



2


+



c





C
g



2







A
g

-
1




B
g




2



 

λ






N






β
2


+

a
2






,



t

0


,



i





ϵ






𝒩
.







(
132
)







Now, it follows from equation (132) and the fact that a1(t)+a2≤max{2a1(t), 2a2}, ∀t≥0, which holds for any a1(t)≥0 and a2≥0, (26) holds.


The following corollary is now immediate.


COROLLARY 1A. If the external disturbance of a follower agent is time-varying (i.e., ∃i∈N such that ai>0) or the leader has time-varying output (i.e., β>0), equation (126) implies that there exists T≥0 such that

ei(t)∥2ce−λt∥ζ02, ∀T≥t≥0, ∀i∈custom character,  (133)
ei(t)∥2≤b, ∀t≥T, ∀i∈custom character.  (134)


Proof By Assumption 3A, it is possible that ∥Ci2≠0 for all i∈custom character, and hence, ∥Cg2>0. Furthermore, rank (Bg)≥n+l by Assumption 2A. Recall that Ag is nonsingular, then rank (Ag)=rank(Ag−1)=n+Nl. From Sylvester's inequality, rank (Ag−1Bg)≥n+l, and hence, ∥Ag−1Bg2>0. Finally, since c and A are positive constants, it follows that b>0 as β>0 or a>0.


From equation (126), either c∥ζ02>b or c∥ζ02≤b at t=0 is obtained. In the former case, owing to the continuity of e−λt, there is T>0 such that ce−λt∥ζ02=b. Then, it can be readily shown that






T
=



λ

-
1




ln


(






ζ
0



2


λ







A
g

-
1




B
g




2





N


β
2


+

a
2





)



>

0
.







Thus, equations (133) and (134) are satisfied. In the latter case, equations (133) and (134) are satisfied with T=0 trivially.


REMARK 1A. Theorem 1A shows that the ultimate bound b of the output tracking error of each follower agent is associated with the bound on the time derivative of δ(t) and y0(t) (i.e., α and β). For example, as α and β decrease (respectively, increase), b decreases (respectively, increases). If, in addition, each follower agent is subject to constant external disturbance and the leader has constant output (i.e., α=0 and β=0), it is clear from (26) that b=0, and hence, the output tracking error of each follower agent goes to zero asymptotically (i.e., limt→∞ei(t)=0, ∀i∈custom character).


REMARK 2A. Since the solution of linear time-invariant systems are known, we use this advantage in the stability analysis conducted in Theorem 1A and Corollary 1A. On the other hand, for uniform ultimate boundedness, one can also apply Lyapunov-like theorems such as Theorem 4.18 in [9] and Theorem 4.5 in [11] (e.g., see [31]), or resort to the final value theorem (e.g., see [36]).


From Remark 1A, it is known that output tracking error of each follower agent converges to zero when each follower agent is subject to constant external disturbance and leader has constant output. The following intuitive question now arises: Does the output tracking error of each follower agent still converge to zero if the external disturbances and the output of the leader converge to constant vectors? Since Theorem 1A is not used to answer this question, the following corollary may now be useful.


COROLLARY 2A. Let Assumptions 1A, 2A, and 3A hold. If Ag is Hurwitz, limt→∞δi(t)=δi*∈custom characterni, ∀i∈custom character and limt→∞y0(t)=r*∈custom characterli* for all i∈custom character and r* are finite), and {dot over (δ)}i(t) for all i∈custom character and {dot over (y)}0(t) is uniformly continuous on [0, ∞), then limt→∞ei(t)=0, ∀i∈custom character.


Proof. Since the assistant system in equation (128) is linear time-invariant and Ag is Hurwitz, equation (128) is input-to-state stable. Since limt→∞y0(t)=r* and {dot over (y)}0(t) is uniformly continuous on [0, ∞), limt→∞{dot over (y)}0(t)=0 from l number of independent applications of Barbalat's lemma. Thus, limt→∞{dot over (y)}0a(t)=0. Similarly, limt→∞{dot over (δ)}(t)=0. It now follows from the derivation given after Definition 4.6 in [5] that limt→∞{dot over (ω)}(t)=0 implies limt→∞ζ(t)=0 owing to the input-to-state stability of equation (128). Finally, limt→∞ei(t)=0, ∀i∈custom character, follows from Lemma 2A.


REMARK 3A. It is clear from the proof of Corollary 2A that if Ag is Hurwitz, limt→∞{dot over (δ)}i(t)=0, ∀i∈custom character, and limt→∞{dot over (y)}0(t)=0, then limt→∞ei(t)=0, ∀i∈custom character. That is, asymptotic synchronization can be achieved even if the external disturbances and the output of the leader do not converge to constant vectors. For example, ln(t+1) does not have a limit but its derivative 1/(t+1) tends to zero as t→∞.


Agent-wise Local Sufficient Stability Condition. The main purpose of this section is to derive agent-wise local sufficient condition that provides Hurwitz Ag. For this purpose, the stabilizability and detectability of the global dynamics given by equations (115) and (116) is established. Then, agent-wise local sufficient condition are derived, which provides input-output stability of the global dynamics, by applying a version of the small gain theorem from Theorem 6.2.2.12 in [32]. The input-output stability of global dynamics by itself may not imply that Ag is Hurwitz. Therefore, stabilizability and detectability of finite-dimensional linear time-invariant systems should be carefully tracked to rule out the possibility of unstable hidden modes, and hence, conclude from input-output stability that system matrix is Hurwitz.


For the sake of completeness, a well-known converse theorem (e.g., see Corollary 9.1.80 in [2]) is restated in Theorem 2A using global dynamics given by equations (115) and (116). For finite-dimensional linear time-invariant systems, custom character2 stability and uniform bounded-input, bounded-output stability are equivalent notions of input-output stability and are used interchangeably in the literature (e.g., see Remark 2 in [13]). Based on Theorem 2A, the derived agent-wise local sufficient condition for input-output stability provides Hurwitz Ag.


THEOREM 2A [2]. Suppose that the pair (Ag, Bg) is stabilizable and the pair (Ag, Cg) is detectable. If the linear time-invariant system given by equations (115) and (116) is custom character2 stable, then Ag is Hurwitz.


Stabilizability and Detectability of Global Multiagent System Dynamics. In order to derive an agent-wise local sufficient condition, Theorem 2A is used. Thus, it is first needed to establish the stabilizability of the pair (Ag, Bg) and the detectability of the pair (Ag, Cg). These are given in Lemma 3A and Lemma 4A, respectively.


LEMMA 3A. If Assumptions 2A and 3A hold, then the pair (Ag, Bg) is controllable.


Proof. Define the following matrix











Q


(
κ
)




=
Δ



[





A
m

-

κ






I

n
_







B
m




I

n
_




0






[


(


I
N

-
FA

)



I
l


]


C





-
κ







I
Nl




0




-
FK



I
l





]


,




(
135
)








where κ∈custom character. By Popov-Belevitch-Hautus test for controllability, the pair (Ag, Bg) is controllable if and only if rank (custom character(κ))=n+Nl, ∀κ∈custom character. Note that rank (In)=n and rank (−κINl)=Nl if κ≠0. Thus, rank (custom character(κ))=n+Nl if κ≠0. Furthermore, Ag−κIn+Nl=Ag if κ=0. Recall from Lemma 1A that Ag is nonsingular by Assumptions 2A and 3A. Thus, rank (custom character(0))=n+Nl. Now, considering them together (i.e., κ≠0 and κ=0), it is established that rank (custom character(κ))=n+Nl, ∀κ∈custom character.


LEMMA 4A. If Assumptions 2A and 3A hold and Ami is Hurwitz for all i∈custom character, then the pair (Ag, Cg) is detectable.


Proof. An example goal is to show that if ω(t)≡0 and y(t)≡0, then η(t)→0 as t→∞. For this purpose, first let ω(t)≡0, then rewrite equations (115) and (116) as two interconnected systems given by

{dot over (x)}(t)=Amx(t)+Bmz(t), x(0)=x0, t≥0,  (136)
y(t)=Cx(t),  (137)
and
ż(t)=[(IN−FA)⊗Ii]y(t), z(0)=z0, t≥0  (138)


One can show detectability of the pair (Ag, Cg) from Popov-Belevitch-Hautus test for detectability (i.e., Theorem 16.5 in [6]). For that proof, detectability counterpart of custom character(κ) needs to be represented as a multiplication of two matrices and Corollary 2.5.10 in [1] should be applied. Since the presented proof requires less space, it is preferred.


Next, let y(t)≡0. Then, from equation (138), ż(t)≡0, and hence, z(t)≡z0. To show that z0=0, a contradiction argument as follows can be used. Suppose z0≠0 and take Laplace transform of equations (136) and (137). Thus

Y(s)=C(sI−Am)−1BmZ(s)+C(sI−Am)−1x0,  (139)

where Z(s)=1/sz0. Since Ami is Hurwitz for all i∈custom character, Am is Hurwitz. Therefore, a final value theorem as follows can be applied:















lim






y


(
t
)




t




=


lim





sY


(
s
)



s

0



,







=




lim





C


s

0





(

sI
-

A
m


)


-
1




B
m



z
0


+

s



C


(

sI
-

A
m


)



-
1




x
0




,






=


-
C



A
m

-
1




B
m




z
0

.









(
140
)







It is explained in Lemma 1A that CAm−1Bm is nonsingular due to the Assumption 3A. Thus, it implies that ker(−CAm−1Bm)={0}. Since z0≠0, limt-∞y(t)≠0 that is a contradiction to the fact that y(t)≡0; therefore, z0=0.


Until now, it has been established that if ω(t)≡0 and y(t)≡0, then z(t)≡0. To conclude that η(t)→0 as t→∞, it should be shown that x(t)→0 as t→∞. Note that z(t) and y(t) are the input and the output of the system equations (136) and (137), respectively. Recall that Am is Hurwitz, y(t)≡0, and z(t)≡0. Thus, from equations (136) and (137), x(t)→0 as t→∞.


REMARK 4A. Stabilizability (controllability implies stabilizability) and detectability of the global dynamics given by equations (115) and (116) do not require any information from graph topology (except the necessary condition given in Assumption 2A). Compared to stabilizability, detectability of the global dynamics is established if Ami is also Hurwitz for all i∈custom character. By Assumption 4A, notice that there always exists Kli such that Ami is Hurwitz for all i∈custom character.


A Small Gain Analysis. In this subsection, a version of the small gain theorem given in [32] is used, which is proposed for large-scale systems, to establish the finite gain custom character2 stability of the global dynamics in equations (115) and (116). By applying Theorem 2A, the agent-wise local sufficient condition for stability of Ag can be determined.


Define ξi(t)≙[xiT(t),ziT(t)]T for i∈custom character and consider the dynamics of each follower given by equations (101) and (108) with equation (112)

{dot over (ξ)}i(t)=Āiξi(t)+Biui(t)+Bfivi(t), ξi(0)=ξi0, t≥0,  (141)

where









A
¯

i

=

[





A
i


0







C
i


0




]


,



B
¯

i

=

[




B
i





0



]


,


B

f

i


=

[





ϕ
i

-
1




I

n
i





0




0



-

I
l





]


,







v
i



(
t
)


=


[



ϕ
i




δ
i
T



(
t
)



,






μ
i
T



(
t
)



]

T


,



μ
i



(
t
)


=


1


d
i

+

k
i



[





j


N
i






a

i

j





y
j



(
t
)




+


k
i




y
0



(
t
)




]


,





and a positive constant ϕi is introduced to have control over Bfi, which affects the gain of the follower agents. Using equation (109), the definition of Ami and Bmi and recalling equation (102), the dynamics of each follower can equivalently be represented as

{dot over (ξ)}i(t)=Afiξi(t)+Bfivi(t), ξi(0)=ξi0, t≥0  (142)
yi(t)=Cfiξi(t),  (143)

where







A

f

i


=

[




A

m

i





B

m

i







C
i



0



]






and Cfi=[Ci 0]. The transfer matrix of the system equations (142) and (143), which is denoted by gi(s), satisfies

gi(s)=Cfi(sI−Afi)−1Bfi.  (144)


Assumptions 4A and 5A ensure the stabilizability of the pair (Āi,Bi)i for all i∈custom character. This can be verified by Lemma 1.26 in [8] with (G1, G2)=(0,Il) and D=0. Therefore, K1i and K2i can always be chosen such that Afi is Hurwitz. If Afi is Hurwitz for all i∈custom character, then it is concluded from Corollary 5.2 in [9] that for all i∈custom character, the system given in equations (142) and (143) is custom characterp stable with finite gain for any p∈[1, ∞]. For example, for p=2, it follows from Theorem 5.4 in [9] that the gain of the system satisfies











γ
i

=



sup

ω










g
i



(

j





ω

)




2


<



,



i


𝒩
.







(
145
)







Conversely, equations (142) and (143) are stabilizable and detectable for all i∈custom character when Am is Hurwitz for all i∈custom characterand Assumption 3A holds. Specifically, since rank (Bfi)=ni+l, the pair (Afi, Bfi) is controllable from controllability matrix test (i.e., Theorem 12.1 in [6]). Furthermore, by following the similar steps in the proof of Lemma 4A, it can be shown that if Ami is Hurwitz, the pair (Afi, Cfi) is detectable under Assumption 3A. Therefore, if all poles of gi(s) have negative real part (γi is finite) for all i∈custom character, then Afi is Hurwitz for all i∈custom character.


THEOREM 3A. Consider Assumptions 2A and 3A. Let Ami be Hurwitz for all i∈custom character. If

ρ(Γ)ρ(FA)<1,  (146)

then Ag is Hurwitz, where Γ≙diag (γ1, . . . , γN).


Proof. It is first shown that equations (115) and (116) are custom character2 stable with finite gain. This part of the proof can be regarded as an application of Theorem 6.2.2.12 in [32]. Since F is finite, which is owing to Assumption 2A, and A is finite, then F is finite from (46). Therefore, under the stated assumptions and conditions, Afi is Hurwitz for all i∈custom character, and hence, equations (142) and (143) are custom character2 stable with finite gain γi given by equation (145) for all i∈custom character. Now, the following inequality is determined,

y(t)∥custom character2≤γi∥v(t)∥custom character2, ∀τ∈[0,∞), ∀i∈custom character  (147)


Using the definition vi(t), Minkowski's inequality, and letting y0(t)∈custom character2 and δi(t)∈custom character2 for all i∈custom character, the inequality from equation (147) is given by















y

i

τ




(
t
)






2






γ
i







ϕ
i




δ
i



(
t
)







2



+


γ
i







1


d
i

+

k
i








j


N
i






a

i

j





y

j

τ




(
t
)









2



+






γ
i







1


d
i

+

k
i





k
i




y
0



(
t
)







2





,



τ


[

0
,






)



,



i


𝒩
.







(
148
)







Let pτ≙[∥y(t)∥custom character2, . . . , ∥y(t)∥custom character2]T, y0≙∥y0(t)∥custom character21N, δ≙[∥δ1(t)∥custom character2]T, and Φ≙diag(ϕ1, . . . , ϕN). From equation (148),

pτ≤ΓΦδ+ΓFApτ+ΓFKy0, ∀τ∈[0,∞),  (149)

where equation (149) can also be written as

(IN−ΓFA)pτ≤ΓΦδ+ΓFKy0, ∀τ∈[0,∞).  (150)


Note that Γ is positive-definite diagonal matrix ρ(Γ)=max1≤i≤N γi, and FA is nonnegative matrix. Then, the following inequality is obtained from Lemma 8 in [7]

ρ(ΓFA)≤ρ(Γ)ρ(FA).  (151)

Since equation (146) holds, we have the following from equation (151)

ρ(ΓFA)<1.  (152)


From Lemma 6.2.1.8 and Lemma 6.2.1.9 in [32], it is known that IN−ΓFA has an inverse that has all nonnegative elements because ΓFA is nonnegative matrix and equation (152) holds. Since (IN−ΓFA)−1 is nonnegative matrix, both sides of equation (150) are can be multiplied by (IN−ΓFA)−1. Thus,

pτ≤(IN−ΓFA)−1ΓΦδ+(IN−ΓFA)−1ΓFKy0, ∀τ∈[0,∞).  (153)


Since the right hand side of equation (153) is independent oft it is concluded from Lemma 2.1.12 in [32] that yi(t)∈custom character2 for all i∈custom character. Hence, equation (153) directly implies that there exists γ such that

y(t)∥custom character2γ∥ω(t)∥custom character2.  (154)


It follows from equation (154) that equation (115) and (116) are custom character2 stable with finite gain. Since Assumptions 2A and 3A hold and Ami is Hurwitz for all ∈custom character, equations (115) and (116) are stabilizable and detectable from Lemma 3A and Lemma 4A. Therefore, Ag is Hurwitz from Theorem 2A.


REMARK 5A. If Assumptions 2A, 3A, 4A, and 5A hold, K1i, K2i are designed such that Ami is Hurwitz for all i∈custom character, and the sufficient condition given by equation (146) is satisfied, then Ag is Hurwitz from Theorem 3A. In addition to the foregoing assumptions, if Assumption 1A holds, then uniformly ultimately bounded output tracking error between output of each follower and the output of the leader is achieved by Theorem 1A. Similar to [7,16, 30], Theorem 3A provides agent-wise local condition with a clear link between input-output stability and internal stability of global dynamics given by equations (115) and (116). In contrast to the distributed output regulation problems considered in [27,7,16,29,30], the controller design described herein does not depend on the dynamics of an exosystem.


REMARK 6A. Since ρ(Γ)=max1≤i≤N γi, the sufficient condition given in equation (146) basically implies γiρ(FA)<1, ∀i∈custom character. Therefore, Theorem 3A provides agent-wise local sufficient condition for controller design. If the sufficient condition given by equation (146) in Theorem 3A is replaced with equation (152), it is clearly seen that Theorem 3A is still valid and equations (152) decreases conservatism in sufficient condition. However, equations (152) does not provide agent-wise local sufficient condition anymore. It can be an alternative global sufficient condition, which is together with Hurwitz Ami for all i∈custom character, to the one given in Theorem 1A which states that Ag is Hurwitz. It is also noted that the sufficient condition in equation (146) can be satisfied by solving algebraic Riccati equation (e.g., see Lemma 9 in [7]) or linear matrix inequality (e.g., see Theorem 6 in [30]).


For acyclic directed graphs, derived distributed criterion for controller design is not only agent-wise but also graph-wise local except for the necessary condition given by Assumption 2A. It is shown in the next result.


COROLLARY 3A. Consider Assumptions 2A and 3A. Let Ami be Hurwitz and γi is finite for all i∈custom character. If the directed graph custom character is acyclic (i.e., contains no loop), then Ag is Hurwitz.


Proof. Similar to [34,29], the nodes in custom character can be relabeled such that i>j if (vivj)∈E since custom character is acyclic. Then, the adjacency matrix A of the directed custom character is lower triangular with zero diagonal entries. In this case, FA is also lower triangular matrix with zero diagonal entries. Thus, ρ(FA)=0 and sufficient condition given by equation (146) in Theorem 3A is automatically satisfied. It now follows from Theorem 3A that Ag is Hurwitz.


REMARK 7A. For acyclic graph, obtaining Hurwitz Ag is reduced to designing Hurwitz Ami together with any finite γi is finite for all i∈custom character if Assumptions 2A, 3A, 4A, and 5A hold. In terms of being agent-wise and graph-wise local, this result is consistent with the results in [34,29] which are obtained by applying similarity transformation.


To illustrate the performance of the proposed distributed controller architecture described in this embodiment, the following two numerical examples are presented. The first example has nonlinear leader dynamics and the second one has linear leader dynamics. For both examples, five follower agents are considered with the following system, input, and output matrices:








A

1
,
4
,
5


=

[




-
1



1





0
.
5



0



]


,


B

1
,
4
,
5


=

[



1



0
.
4





0



0
.
2




]


,


C

1
,
4
,
5


=

[




1
.
2




-

0
.
4






0


1



]


,






A

2
,
3


=

[



0


1


0



-

0
.
5






0


2


1


0




0


0


0


1





0
.
2



0


0


0



]


,






B

2
,
3


=

[



0


0


1




2


0


0




0



-
1



0




3


0


0



]


,












C

2
,
3



=

[



1


0


0


0




0


1


0


0



]


,





and the augmented graph custom character shown in FIG. 1. Dynamics of follower agents satisfy Assumptions 4A and 5A. It is also clear from FIG. 1 that Assumption 2A is satisfied. In the simulations, weighting gains are equal to 1 (i.e., k1,2=1).


Now, linear quadratic theory is used to design K1i and K2i. In particular, Q1,4,5=diag(10,1,10,1) and R1,1=diag(1,1) are used to penalize ξ1,4,5(t) and u1,4,5(t), respectively. Similarly, Q2,3=diag(1,1,1,1,2,2) and R2,3=diag(1,1,1) are used to penalize ξ2,3(t) and u2,3(t), respectively. With these design parameters, Assumption 3A is satisfied and Ami is Hurwitz for all i∈custom character. For the given graph, ρ(FA)=0.6334. Letting ϕi=100 for all i∈custom character, then ρ(Γ)=1.0156. Thus, Ag is Hurwitz from Theorem 3A. In the simulations, initial conditions for the follower agents are as follows

x10=[1 0.6]T, x20=[−1 0 −0.2 0]T, x30=[−0.8 −0.4 0 0]T, x40=[0.6 0]T, x50=[0 0.5]T.


EXAMPLE 1. In this example, the dynamics of the leader is nonlinear and has the form:

{dot over (x)}01(t)=−x01(t)−x013(t)+x02(t), x01(0)=x010, t≥0,
{dot over (x)}02(t)=−x01(t)−x02(t)+x03(t), x02(0)=x020, t≥0,
{dot over (x)}01(t)−x033(t)+u0(t), x03(0)=x030, t≥0
y01(t)=x01(t),
y02(t)=x02(t).


This leader dynamics is from the exercise problems given in [10]. Regarding x03(t) as a control input to the state model that consists x01(t) and x02(t), given leader dynamics can be evaluated as a cascade system. From Example 4.25 in [9], scalar system, which has x03(t) state, is input-to-state stable. One can show that state model that consists of x01(t) and x0z(t) is also input-to-state stable by Theorem 4.19 in [9]. Therefore, given leader dynamics, which is a cascade system, is input-to-state stable. In other words, if u0(t) is piecewise continuous and bounded, then x01(t), x02(t), and x03(t) are bounded. Note that {dot over (y)}01(t)={dot over (x)}10(t) and {dot over (y)}02(t)={dot over (x)}02(t). Thus, {dot over (y)}0(t) is bounded. In the simulation, the input of leader is taken as u0(t)=








{






0
.




1


t

,





0

t
<

1

0

0


,








0.





1

t

-

2


sin


(

0.1

t

)




e


-

0
.
0



1


(

t
-

1

0

0


)





,






1

0

0


t
<

2

0

0


,







2

0

,




t


2

0

0











and hence, there exists β that satisfies equation (105). Moreover, follower agents are subject to external disturbances, which satisfy equation (104), as follows: δ1(t)=[−0.2, 1−e−0.02t]T, δ2(t)=[0.1 cos(0.1t) 0, 0, −0.1]T, δ3(t)=[0, 0, 0.05 sin(4t), 0]T, δ4(t)=[0.5, 0.4]T, and δ5(t)=[0.01t 0]T, t≥0. Thus, Assumption 1A holds for this example.



FIG. 2 illustrates that the output of each follower agent closely tracks the output of the leader. That is, output tracking error is uniformly ultimately bounded as expected by Theorem 1A.


EXAMPLE 2. The dynamics of the leader is now given by the following linear system










x
.

0



(
t
)


=



[



0


1






-

0
.
2



5




-
1




]




x
0



(
t
)



+


[



0






0
.
2


5




]




u
0



(
t
)





,







x
0



(
0
)


=




x
00

,

t
>
0

,



y
0



(
t
)


=






[



1


0





-

0
.
5




1



]




x
0



(
t
)




,








where u0(t)=1, for t≥0. Since the leader has linear time-invariant dynamics and its system matrix is Hurwitz, we have input-to-state stable leader dynamics. Furthermore, by applying final value theorem, the steady-state value of the output is found to be equal to [1−0.5]T (i.e., r*=[1−0.5]T). In addition, note that {dot over (y)}0(t) is uniformly continuous since ÿ0(t) is bounded owing to the boundedness of x0(t), u0(t), and {dot over (u)}0(t). Furthermore, external disturbances are given as follows: δ1(t)=[0.2 −0.5+e−0.5t]T, δ2(t)=[0.3 0 −0.3e−0.2t sin(t) 0]T, δ3(t)=[0 0.3 0 −0.2]T, δ4(t)=[0.5 0.1e−0.4t sin(4t)]T, δ5(t)=[1−e−0.3t 0]T. Similar to y0(t), given disturbances satisfy the conditions in Corollary 2A. Thus, Corollary 2A guarantees asymptotic synchronization and this fact is demonstrated in FIG. 3.


It is worth noting that the distributed controller gains are selected without using any information from leader dynamics and external disturbances. Same controller gains are used for the examples which are presented in this disclosure.


References Related to the First Embodiment

  • [1] D. S. Bernstein. Matrix mathematics: Theory, facts, and formulas. Princeton University Press, 2009.
  • [2] F. M. Callier and C. A. Desoer. Linear system theory. Springer-Verlag, 1991.
  • [3] W. Cao, J. Zhang, and W. Ren. Leader-follower consensus of linear multi-agent systems with unknown external disturbances. Systems & Control Letters, 82:64-70, 2015.
  • [4] B. A. Francis and W. M. Wonham. The internal model principle of control theory. Automatica, 12(5), 1976.
  • [5] W. M. Haddad and V. Chellaboina. Nonlinear dynamical systems and control: A Lyapunov-based approach. Princeton University Press, 2008.
  • [6] J. P. Hespanha. Linear systems theory. Princeton University Press, 2009.
  • [7] C. Huang and X. Ye. Cooperative output regulation of heterogeneous multi-agent systems: an H criterion. IEEE Transactions on Automatic Control, 59(1):267-273, 2014.
  • [8] J. Huang. Nonlinear output regulation problem: Theory and applications. SIAM, 2004.
  • [9] H. K. Khalil. Nonlinear systems. Prentice Hall, 2002.
  • [10] H. K. Khalil. Introduction to nonlinear systems analysis and nonlinear feedback control. Module 8. International Graduate School on Control—European Embedded Control Institute, Yildiz Technical University, 2015.
  • [11] H. K. Khalil. Nonlinear control. Pearson, 2015.
  • [12] E. Kofman. Non conservative ultimate bound estimation in LTI perturbed systems. Automatica, 41:1835-1838, 2005.
  • [13] N. Kottenstette, M. J. McCourt, M. Xia, V. Gupta, and P. J. Antsaklis. On relationships among passivity, positive realness, and dissipativity in linear systems. Automatica, 50(4), 2014.
  • [14] E. Lavretsky and K. A. Wise. Robust and adaptive control with aerospace applications. Springer-Verlag, 2013.
  • [15] F. L. Lewis, H. Zhang, K. Hengster-Movric, and A. Das. Cooperative control of multi-agent systems: Optimal and adaptive design approaches. Springer, 2014.
  • [16] Y. Li, X. Wang, J. Xiang, and W. Wei. Synchronised output regulation of leader-following heterogeneous networked systems via error feedback. International Journal of Systems Science, 47(4), 2015.
  • [17] Z. Li, Z. Duan, and F. L. Lewis. Distributed robust consensus control of multi-agent systems with heterogeneous matching uncertainties. Automatica, 50(3), 2014.
  • [18] Z. Li, X. Liu, W. Ren, and L. Xie. Distributed tracking control for linear multiagent systems with a leader of bounded unknown input. IEEE Transactions on Automatic Control, 58(2):518-523, 2013.
  • [19] M. Mesbahi and M. Egerstedt. Graph theoretic methods in multiagent networks. Princeton University Press, 2010.
  • [20] H. Modares, S. P. Nageshrao, G. A. Delgado Lopes, R. Babuska, and F. L. Lewis. Optimal model-free output synchronization of heterogeneous systems using off-policy reinforcement learning. Automatica, 71:334-341, 2016.
  • [21] R. Olfati-Saber, J. A. Fax, and R. M. Murray. Consensus and cooperation in networked multi-agent systems. Proceedings of the IEEE, 95(1):215-233, 2007.
  • [22] Z. Peng, D. Wang, and H. Zhang. Cooperative tracking and estimation of linear multi-agent systems with a dynamic leader via iterative learning. International Journal of Control, 87(6), 2014.
  • [23] W. Ren and Y. Cao. Distributed coordination of multi-agent networks: Emergent problems, models, and issues. Springer-Verlag, 2011.
  • [24] S. B. Sarsilmaz and T. Yucelen. On control of heterogeneous multiagent systems with unknown leader dynamics. In ASME Dynamic Systems and Control Conference, 2017.
  • [25] P. N. Shivakumar and K. H. Chew. A sufficient condition for nonvanishing of determinants. Proceedings of the American Mathematical Society, 43(1):63-66, 1974.
  • [26] S. Skogestad and I. Postlethwaite. Multivariable feedback control: Analysis and design. Wiley, 2005.
  • [27] Y. Su and J. Huang. Cooperative output regulation of linear multiagent systems. IEEE Transactions on Automatic Control, 57(4), 2012.
  • [28] Y. Tang. Leader-following coordination problem with an uncertain leader in a multi-agent system. IET Control Theory and Applications, 8(10), 2014.
  • [29] F. Adib Yaghmaie, F. L. Lewis, and R. Su. Output regulation of heterogeneous linear multi-agent systems with differential graphical game. International Journal of Robust and Nonlinear Control, 26:2256-2278, 2016.
  • [30] F. Adib Yaghmaie, F. L. Lewis, and R. Su. Output regulation of linear heterogeneous multi-agent systems via output and state feedback. Automatica, 67:157-164, 2016.
  • [31] D. Tran and T. Yucelen. On control of multiagent formations through local interactions. In IEEE Conference on Decision and Control, 2016.
  • [32] M. Vidyasagar. Input-output analysis of large-scale interconnected systems: Decomposition, well-posedness, and stability. Springer-Verlag, 1981.
  • [33] M. Vidyasagar. Nonlinear systems analysis. Prentice Hall, 1993.
  • [34] X. Wang, Y. Hong, J. Huang, and Z.-P. Jiang. A distributed control approach to a robust output regulation problem for multiagent linear systems. IEEE Transactions on Automatic Control, 55(12):2891-2895, 2010.
  • [35] P. Wieland, R. Sepulchre, and F. Allgower. An internal model principle is necessary and sufficient for linear output synchronization. Automatica, 47(5), 2011.


    [36] T. Yucelen and E. J. Johnson. Control of multivehicle systems in the presence of uncertain dynamics. International Journal of Control, 86(9):1540-1553, 2013.
  • [37] K. Zhou, J. C. Doyle, and K. Glover. Robust and optimal control. Prentice-Hall, 1996.


Second Embodiment

A standard notation is used in the second and third embodiments. Specifically, custom character, custom charactern, and custom charactern×m respectively denote the sets of all real numbers, real column vectors, and n×m matrices; 1n and In respectively denote the n×1 vector of all ones and the n×n identity matrix; and “≙” denotes equality by definition. In this disclosure, all real matrices are defined over the field of complex numbers. In this disclosure, write (⋅)T for the transpose and ∥⋅∥2 for the (induced) two norm of a matrix; σ(⋅) for the spectrum and ρ(⋅) for the spectral radius of a square matrix; (⋅)−1 for the inverse of a nonsingular matrix; and ⊗ for the Kronecker product. Finally, diag(A1, . . . , An) is a block-diagonal matrix with entries (A1, . . . , An) on its diagonal. Definition 4.4.4 in [1] is followed for the spectrum.


Next, the graph theoretical notation used in the second and third embodiments, which is based on [9], is concisely stated. In particular, consider a fixed (i.e., time-invariant) directed graph custom character=(custom character, custom character), where custom character={v1, . . . , vN} is a nonempty finite set of N nodes and custom charactercustom character×custom character is a set of edges. Each node in custom character corresponds to a follower agent. There is an edge rooted at node vj and ended at vi (i.e. (vj, vi))∈custom character if and only if vi receives information from vj. custom character=[aij]∈custom characterN×N denotes the adjacency matrix, which describes the graph structure; that is aij>0⇔(vj, vi)∈custom character and aij=0 otherwise. Repeated edges and self loops are not allowed; that is aii=0, ∀i∈custom characterwith custom character={1, . . . , N}. The set of neighbors of node vi is denoted as Ni={j|(vj, vi)∈custom character}. In-degree matrix is defined as custom character=diag(d1, . . . , dn) with dij∈Niaij. A directed path from node vi to node vj is a sequence of successive edges in the form of {(vi, vp), (vp, vq), . . . , (vr, vj)}. If vi=vj, then the directed path is called a loop. A directed graph is said to have a spanning tree if there is a root node such that it has directed paths to all other nodes in the graph. A fixed augmented directed graph is defined as custom character=(custom character, custom character), where custom character={v0, v1, . . . , vN} is the set of N+1 nodes, including the leader node v0 and all nodes in custom character, and custom character=custom charactercustom character′ is the set of edges with custom character′ consisting of some edges in the form of (v0, vi), i∈custom character.


The concept of internal model introduced next slightly modifies Definition 1.22 and Remark 1.24 in [5].


Definition 1. Given any square matrix A0, a triple of matrixes (M1, M2, M3) is said to incorporate a p-copy internal model of the matrix A0 if











M
1

=


T


[




S
1




S
2





0



G
1




]




T

-
1




,


M
2

=



T


[




S
3






G
2




]




M
3


=

T


[




S
4





0



]




,




(
201
)








or

M1=G1, M2=G2, M3=0,  (202)

where Sl, l=1, 2, 3, 4, is any matrix with appropriate dimension, T is any nonsingular matrix with an appropriate dimension, the zero matrix in M3 has as many rows as those of G1, and

G1=diag(β1, . . . ,βp), G2=diag(σ1, . . . , σp),

where for l=1, . . . , p, βlcustom charactersl×sl and σ1ϵRsl satisfy the following conditions:


a) The pair (βl, σl) is controllable.


b) The minimal polynomial of A0 equals the characteristic polynomial of βl.


Problem Formulation. Consider a system of N (follower) agents with heterogeneous linear time-invariant dynamics subject to external disturbances over a fixed directed communication graph topology custom character. The dynamics of agent i∈custom charactercan be given by

{dot over (x)}i(t)=Aixi(t)+Biui(t)+δi(t), xi(0)=xi0, t≥0,
yi(t)=Cixi(t)+Diui(t),

with state xi(t)∈custom characterni, input ui(t)∈custom charactermi, output yi(t)∈custom characterp, and external disturbance δi(t)=Eδiδ(t)∈custom characterni, where δ(t)∈custom characterqδ is a solution to the unknown disturbance dynamics with an initial condition. In addition, the reference trajectory to be tracked is denoted by y0(t)=Rrr0(t)∈custom characterp, where r0(t)∈custom characterqr is a solution to the unknown leader dynamic with an initial condition.


Let ω(t)≙[r0T(t),δT(t)]Tcustom characterq be the solution of the unknown exosystem, where q=qr+qδ. Instead of assuming that the exosystem has an unforced linear time-invariant dynamics with a known system matrix (for example, see [14, 4, 16]), this disclosure considers that the exosystem has (partially or completely) unknown dynamics. From this perspective, the exosystem can represent any (for example, linear or nonlinear) dynamics provided that its solution is unique and satisfies the conditions given later in Assumptions 1B and 2B.


Define Ei≙[0 Eδi] and R≙[Rr 0]. Furthermore, let ei(t)≙yi(t)−y0(t) be the tracking error. The state of each agent and its tracking error can be defined as

{dot over (x)}i(t)=Aixi(t)+Biui(t)+Eiω(t), xi(O)=xi0, t≥0,  (203)
ei(t)=Cixi(t)+Diui(t)−Rω(t).  (204)

In this disclosure, the tracking error ei(t) is available to a nonempty proper subset of agents. If all agents observe the leader, decentralized controller can be designed for each agent even though the distributed controllers described herein are still applicable. In particular, if node vi observes the leader node v0, then there exists an edge (v0, vi) with weighting gain ki>0; otherwise ki=0. Each agent has also access to the relative output error; that is, yi(t)−yj(t) for all j∈Ni. Similar to [16], the local virtual tracking error can be defined as











e

v

i




(
t
)




=
Δ





1


d
i

+

k
i





[





j


N
i






a

i

j




(



y
i



(
t
)


-


y
j



(
t
)



)



+


k
i



(



y
i



(
t
)


-


y
0



(
t
)



)



]


.





(
205
)







Next, three classes of distributed control laws are defined based on additional available information for each agent.


1) Dynamic State Feedback. If each agent has full access to its own state xi(t), then the dynamic state feedback control law can be defined as

ui(t)=K1ixi(t)+K2izi(t),  (206)
żi(t)=G1izi(t)+G2ievi(t), zi(0)=zi0, t≥0,  (207)

where zi(t)∈custom characterz1i is the controller state and the quadruple (K1i, K2i, G1i, G2i) is described later herein.


2) Dynamic Output Feedback with Local Measurement. If each agent has local measurement output ymi(t)∈custom characterpi of the form

ymi(t)=Cmixi(t)+Dmiui(t),  (208)

then the dynamic output feedback control law with local measurement is given by

ui(t)=Kizi(t),  (209)
żi(t)=M1izi(t)+M2ievi(t)+M3iymi(t), zi(0)=zi0, t≥0,  (210)

where zi(t)∈custom characterz2i is the controller state and the quadruple (Ki, M1i, M2i, M3i) is described later herein.


3) Dynamic Output Feedback. If each agent does not have additional information; that is, the local virtual tracking error evi(t) is the only available information to it, then the dynamic output feedback control law is given by

ui(t)=Kizi(t),  (211)
żi(t)=M1izi(t)+M2ievi(t), zi(0)=zi0, t≥0,  (212)

where zi(t)∈custom characterz3i is the controller state and the triple (Ki, M1i, M2i) is described later herein.


This disclosure makes the following first and second assumptions before define the problem.


ASSUMPTION 1B. A0custom characterq×q has no eigenvalues with negative real parts.


ASSUMPTION 2B. There exist k>0 such that

A0ω(t)−{dot over (ω)}(t)∥2≤k<∞, ∀t≥0,

where {dot over (ω)}(t) is a piecewise continuous function in time. The definition given in page 650 of [7] is followed.


Assumption 1B is standard in linear output regulation theory (for example, see Remark 1.3 in [5]). Assumption 2B is required to show the ultimate boundedness of the tracking error and it automatically holds if the exosystem has an unforced linear time-invariant dynamics with the system matrix A0.


Based on a definition of the linear cooperative output regulation problem in [14, 4], the problem considered in this disclosure can be defined as follows.


Definition 2. Given the system in equations (203) and (204) together with the exosystem, which satisfies Assumptions 1B and 2B, and the fixed augmented directed graph, find a distributed control law of the form of equations (206) and (207), or equation (209) and (210), or equations (211) and (212) such that:


a) The resulting closed-loop system matrix is Hurwitz.


b) The tracking error ei(t) is ultimately bounded with ultimate bound b for all initial conditions of the closed-loop system and for all i∈custom character; that is, there exists b>0 and for each initial condition of the closed-loop system, there is T≥0 such that ∥ei(t)∥2≤b, ∀t≥T, ∀i∈custom character.


c) If limt→∞A0ω(t)−{dot over (ω)}(t)=0, then for all initial conditions of the closed-loop system limt→∞ei(t)=0, ∀i∈custom character.


This disclosure makes the following addition assumptions to solve this problem.


ASSUMPTION 3B. The fixed augmented directed graph custom character has a spanning tree with the root node being the leader node.


ASSUMPTION 4B. The pair (Ai, Bi) is stabilizable for all i∈custom character.


ASSUMPTION 5B. For all λ∈σ(A0),








rank




[





A
i

-

λ






I

n
i







B
i






C
i




D
i




]

=


n
i

+
p


,



i


𝒩
.







ASSUMPTION 6B. The triple (G1i, G2i, 0) incorporates a p-copy internal model of A0 for all i∈custom character.


ASSUMPTION 7B. The pair (Ai, Cmi) is detectable for all i∈custom character.


ASSUMPTION 8B. The pair (Ai, Ci) is detectable for all i∈custom character.


Assumption 3B is natural to solve the stated problem (for example, see Remark 3.2 in [9]). Similar to Assumption 1B, Assumptions 4B, 5B, 6B, 7B, and 8B are standard in linear output regulation theory (for example, see Chapter 1 of [5]). Assumptions 1B, 2B, 3B, 4B, 5B, and 6B can be used for dynamic state feedback. To utilize some results from dynamic state feedback in the absence of full state information, each agent requires the estimation of its own state. For this purpose, Assumption 7B and Assumption 8B are included for dynamic output feedback with local measurement and dynamic output feedback, respectively.


Solvability of the Problem


For the three different distributed control laws introduced previously herein, the solvability of the problem given in Definition 2 can be investigated. First, property a) of Definition 2 is assumed and it is shown, under mild conditions, that properties b) and c) of Definition 2 are satisfied. Second, an agent-wise local sufficient condition (i.e., distributed criterion) is provided for property a) of Definition 2 (i.e., the stability of the closed-loop system matrix) under standard assumptions.


Before describing the solvability of the problem for each distributed control law, the following definitions are presented that are used throughout this description to express the closed-loop systems in compact forms, some results related to the communication graph topology, and a key lemma about the solvability of matrix equations, which play a role on the solvability of the problem.


Define the following matrices: Φ≙diag(Φ1, . . . , ΦN), Φ=A, B, C, D, E; Φm≙diag (Φm1, . . . , ΦmN), Φm=Cm,Dm; Kl≙diag(Kl1, . . . , KlN), l=1, 2; A0a=IN⊗A0, and Ra=IN⊗R. Further let x(t)≙[x1T(t), . . . , xNT(t)]Tcustom charactern, where ni=1Nni; e(t)≙[e1T(t), . . . eNT(t)]Tcustom characterNp, ev(t)≙[ev1T(t), . . . evNT(t)]Tcustom characterNp, and ωa(t)≙1N⊗ω(t)∈RNq.


Observing yi(t)−yj(t)=ei(t)−ej(t) and recalling dij∈Niaij, equation (205) can be equivalently written as











e

v

i




(
t
)


=



e
i



(
t
)


-


1


d
i

+

k
i








j


N
i






a

i

j






e
j



(
t
)


.









(
213
)







Let









=
Δ



diag






(


1


d
1

+

k
1







,





,





1


d
N

+

k
N




)







and custom character≙(INcustom charactercustom character)⊗Ip. Here, in should be noted that d1+k1>0, ∀i∈custom character by Assumption 3B; hence, custom character is well-defined. From equation (213), we have

ev(t)=custom charactere(t).  (214)


Similar to Lemma 3.3 in [9], the following lemma is for INcustom charactercustom character.


Lemma 1B. Under Assumption 3B, INcustom charactercustom character is non-singular. In addition, all its eigenvalues have positive real parts.


Proof. Under Assumption 3B, INcustom character satisfies conditions of the theorem in [13]. Thus, it is nonsingular. Since the singularity is eliminated, all the eigenvalues of INcustom charactercustom character have positive real parts by the Gershgorin circle theorem (see, for example, Fact 4.10.17 in [1]).


Remark 1B. Since INcustom charactercustom character is nonsingular under Assumption 3B, so is custom character by Proposition 7.1.7 in [1]. Then, it is clear from equation (214) that ei(t) is bounded for all i∈custom character if and only if evi(t) is bounded for all i∈custom character; limt→∞ei(t)=0, ∀i∈custom character if and only if limt→∞evi(t)=0, ∀i∈custom character.


Now looking at the spectral radius of custom charactercustom character.


Lemma 2B. Under Assumption 3B, ρ(custom charactercustom character)<1.


Proof. By Lemma 1B, all the eigenvalues of INcustom charactercustom character have positive real parts under Assumption 3B. This directly implies from the Fact 6.2.1 in [17] that the leading principal minors of INcustom charactercustom character are all positive as INcustom charactercustom character is a square matrix whose off-diagonal elements are all nonpositive. Since custom charactercustom character is a nonnegative square matrix and the leading principal minors of INcustom charactercustom character are all positive, ρ(custom charactercustom character)<1 from Lemma 6.2.1.8 in [17].


Next, a lemma is described that extends the field of application of Lemma 1.27 in [5] to heterogeneous (in dynamics and dimension) linear time-invariant multiagent systems over general fixed directed graph communication graph topologies.


Lemma 3B. Let Assumptions 1B and 3B hold. Suppose the triple (M1, M2, M3) incorporates an N p-copy internal model of A0a. If







A
C



=




[




A
^




B
^








M
2


W


C
^


+


M
3




C
^

m


+





M
1

+


M
2


W


D
^


+


M
3




D
^

m






]






is Hurwitz, where Â, {circumflex over (B)}, Ĉ, Ĉm, {circumflex over (D)}, and {circumflex over (D)}m are any matrices with appropriate dimensions, then the matrix equations

XA0a=ÂX+{circumflex over (B)}Z+Ê,  (215)
ZA0a=M1Z+M2custom character(ĈX+{circumflex over (D)}Z+{circumflex over (F)})+M3(ĈmX+{circumflex over (D)}mZ),  (216)

have unique solutions X and Z for any matrices Ê and {circumflex over (F)} of appropriate dimensions. Furthermore, X and Z satisfy

0=ĈX+{circumflex over (D)}Z+{circumflex over (F)}.  (217)

In other words, the conclusion is that the matrix equations

XcA0a=AcXc+Bc,  (218)
0=CcXc+Dc,  (219)

have a unique solution Xc, where








X
c

=

[



X




Z



]


,


B
c

=

{




E
^







M
2


W


F
^





}


,


C
c

=

[


C
^



D
^


]


,


D
c

=


F
^

.






Proof. Note that equations (215) and (216) (respectively, equation (217)) can be equivalently written as equation (218) (respectively, equation (219)). Note also that σ(A0a)=σ(A0). Since Assumption 1B holds and A, is Hurwitz, A0a and Ac have no eigenvalues in common. Thus, the Sylvester equation in equation (218) has a unique solution X, =[XT ZT]T by the first part of Proposition A.2 in [5]. In addition, we show that X and Z also satisfy equation (217). To this end, let y≙ĈX+{circumflex over (D)}Z+{circumflex over (F)}. Since the triple (M1, M2, M3) incorporates an N p-copy internal model of A0a, it has the form given by equation (201) or (202). If it takes the form equation (201), let [{circumflex over (θ)}T θT]T≙T−1Z, where θ has as many rows as those of G1. Pre-multiplying equation (216) by T−1 and using the foregoing definitions,

θA0a=G1θ+G2custom characterγ  (220)


Note that if the triple (M1, M2, M3) takes the form of equation (202), equation (216) already satisfies equation (220), where θ=Z. Let γ≙custom characterγ; then equation (220) is in the form of (1.74) in [5]. Hence, γ=0 by the proof of Lemma 1.27 in [5]. By Remark 1B, custom character is nonsingular under Assumption 3B. As a consequence, γ=0 implies γ=0. This completes the proof of this lemma.


Dynamic State Feedback


Let








z


(
t
)




=
Δ





[



z
1
T



(
t
)


,





,

z
N
T


]

T






n
_


z
1





,





wherein nz1i=1N nz1i, and Gl≙diag(Gl1, . . . , GlN), l=1,2. Inserting equation (206) into equations (203) and (204), and using the above definitions, equations (203), (207), and (204) can be compactly written as

{dot over (x)}(t)=(A+BK1)x+BK2z+Eωa(t), x(0)=x0, t≥0,  (221)
ż(t)=G1z(t)+G2ev(t), z(0)=z0, t≥0,  (222)
e(t)=(C+DK1)x(t)+DK2z(t)−Raωa(t).  (223)


Next, insert equation (223) into equation (214) and replace the obtained expression with the one in equation (222). Define








x
g



(
t
)




=
Δ





[



x
T



(
t
)


,


z
T



(
t
)



]

T







n
_

+


n
_


z
1




.







Then, the closed-loop system defined by equations (203), (204), (205), (206), and (207) becomes

{dot over (x)}g(t)=Agxg(t)+Bgωa(t), xg(0)=xg0, t≥0  (224)
e(t)=Cgxg(t)+Dgωa(t),  (225)

where








A
g

=

[




A
+

B


K
1











B


K
2








G
2



W


(

C
+

D


K
1



)












G
1

+


G
2



WKDK
2






]


,


B
g

=

[



E






-

G
2




WR
a





]


,






C
g

=

[

C
+

D


K
1


D


K
2



]


,


D
g

=

-


R
a

.







Theorem 1B. Let Assumptions 1B, 2B, 3B, and 6B hold. If Ag is Hurwitz, then the distributed dynamic state feedback control given by equations (206) and (207) can be used in solving the problem in Definition 2.


Proof. By the definition of A0a minimal polynomials for A0a and A0 are the same. Thus, the triple (G1, G2, 0) incorporates an N p-copy internal model of A0a under Assumption 6B. Let (M1, M2, M3)≙(G1,G2,0). Let also Â≙A+BK1, {circumflex over (B)}≙BK2, Ĉ≙C+DK1, Ĉm≙0, {circumflex over (D)}≙DK2, {circumflex over (D)}m≙0, Ê≙E, and {circumflex over (F)}≙−Ra. Then, the quadruple (Ag, Bg, Cg, Dg) takes the form of (Ac, Bc, Cc, Dc) in Lemma 3B. In addition, Ag is Hurwitz and Assumptions 1B and 3B hold. Hence, Lemma 3B is applicable and it implies that the matrix equations

XgA0a=AgXg=Bg,  (226)
0=CgXg=Dg,  (227)

have a unique solution Xg. Additional discussion on the solvability of equations (226) and (227) are described later in the disclosure.


Under Assumption 2B, ∥A0aωa(t)−{dot over (ω)}a(t)∥2Nk, ∀t≥0 since ∥A0aωa(t)−{dot over (ω)}a(t)∥22=N∥A0ω(t)−{dot over (ω)}(t)∥22. Let xg(t)−Xg ωa(t). Let xg(t)−Xg ωa(t). Then, using the definition of xg(t) and equations (226) and (227), equations (224) and (225) can be rewritten as

{dot over (x)}g(t)=Agxg(t)+Xg(A0aωa(t)−ωa(t)), xg(0)=xg0, t≥0,  (228)
e(t)=Cgxg(t).  (229)

Now, the solution of equation (228) can be written as

xg(t)=eAgtxg0+∫0teAg(t−τ)Xg(A0aωa(r)−{dot over (ω)}a(τ))dτ.

Since Ag is Hurwitz, there exist c>0 and α>0 such that ∥eAgt2≤ce−at, ∀t≥0 (for example, see Lecture 8.3 in [3]). Owing to this bound and the bound on ∥A0aωa(t)−{dot over (ω)}a(t)∥2, the following inequality exists













x
¯

g



(
t
)




2




c


e


-
a


t








x
¯


g

0




2


+



c




x
g




2

α




N

κ





,



t


0
.








Using the fact ∥ei(t)∥2≤e(t)∥2, ∀i∈custom character and observing ∥e(t)∥2≤∥Cg2xg(t)∥2 from equation (229),

ei(t)∥2≤ce−αt∥Cg2xg02+bζ, ∀t>0, ∀i∈custom character.

Where b′=c∥Cg2∥Xg2N−1. For a given ∈>0, we have either c∥Cg2xg02>∈ or c∥Cg2xg02≤∈. In the former case, it can be readily shown that ce−at∥Cg2xg02≤∈, ∀t≥T with






T
=



α

-
1




ln


(


c




C
g




2





x
_


g





0





2

ϵ

)



>

0
.







In the latter case, the foregoing inequality may hold for all t≥0.


Thus, ei(t) may be ultimately bounded with the ultimate bound b≙b′+∈ for all xg0, which is also true for all xg0, and for all i∈custom character.


If limt→∞A0ω(t)−{dot over (ω)}(t)=0, then limt→∞A0ωa(t)−{dot over (ω)}a(t)=0. Since Ag is Hurwitz and the system in equations (228) is linear time-invariant when A0aωa(t)−{dot over (ω)}a(t) is viewed as an input to the system, equation (228) is input-to-state stable with respect to this piecewise continuous input (for example, see Chapter 4.9 in [7]). Thus, limt→∞A0ωa(t)−{dot over (ω)}a(t)=0 implies limt→∞xg(t)=0 for all xg0 (for example, see Exercise 4.58 in [7]). Finally, it follows from equation (229) that for all xg0 limt→∞ei(t)=0, ∀i∈custom character.


Remark 2B. The ultimate bound b of the tracking error for each agent can be associated with the bound k in Assumption 2B. For example, as k decreases (respectively, increases), b decreases (respectively, increases). To elucidate the role of Assumptions 1B and 2B in practice, the following example scenarios may be considered:


a) When the piecewise continuity and boundedness of {dot over (ω)}(t) are the only information that is available to a control designer, the triple (0,Ip,0) incorporating a p-copy internal model of A0=0 is quite natural; hence, equation (207) can become a distributed integrator. Moreover, Xg in b can be explicitly expressed in terms of Ag and Bg; that is, Xg=−Ag−1Bg by equation (226).


b) When the piecewise continuity and boundedness of {dot over (ω)}(t), the boundedness of (t), and some frequencies in ω(t) are available to a control designer, the triple (G1i, G2i, 0) incorporating a p-copy internal model of A0, which includes these frequencies and zero eigenvalues, is an alternative to the pure distributed integrator.


Remark 3B. As it is shown in Theorem 1B, asymptotic synchronization can be achieved when limt→∞A0ω(t)−ω(t)=0. Next provided are sufficient conditions to check this condition can be determined as follows. If A0=0 holds, limt→∞{dot over (ω)}(t)=0 can replace limt→∞A0ω(t)−{dot over (ω)}(t)=0; hence ω(t)≡ω* (ω* is finite) in place of a), and limt→∞ω(t)=ω* and {dot over (ω)}(t) is uniformly contouring in place of b). If one of the following conditions holds


a) {dot over (ω)}(t)=A0ω(t), ω(0)=ω0, t≥0;


b) limt→∞eA0tω0−ω(t)=0 and A0eA0tω0−{dot over (ω)}(t) is uniformly continuous on [0,∞),


then limt→∞A0ω(t)−{dot over (ω)}(t)=0. Note that a) may imply b). From Barbalat's lemma given by Lemma 8.2 in [8], b) may imply that limt→∞A0eA0tω0−{dot over (ω)}(t)=0. Thus, limt→∞A0ω(t)−{dot over (ω)}(t)=A0 limt→∞ω(t)−eA0tω0+limt→∞A0eA0tω0−{dot over (ω)}(t)=0. In general, asymptotic synchronization results in the literature (for example, see [14, 4, 16]) are obtained under condition a). This disclosure covers all (or most) class of functions generated under condition a).


To obtain an agent-wise local sufficient condition that assures property a) of Definition 2 under some standard assumptions, let,









ξ
i



(
t
)




=






[



x
i
T



(
t
)


,


z
i
T



(
t
)



]

T






n
i

+

n

z

1

i







,



μ
i



(
t
)




=





1


d
i

+

k
i








j


N
i










a
ij




e
j



(
t
)






,











A
¯

i



=




[




A
i



0






G

2

i




C
i





G

1

i





]


,



B
¯


i



=




[




B
i







G

2

i




D
i





]


,



B
¯

fi



=




[



0





-

G

2

i






]


,





and Ci ≙[Ci 0]. Furthermore, considering equations (203), (207), (213), and (204) when ω(t)≡0. We now have

{dot over (ξ)}i(t)=Āiξi(t)+Biui(t)+Bfiμi(t), ξi(0)=ξi0, t≥0,  (230)
ei(t)=Ciξi(t)+Diui(t).  (231)

Next, define the matrices








A
fi



=




[





A
i

+


B
i



k

1

i








B
i



k

2

i









G

2

i




(


C
i

+


D
i



K

1

i




)







G

1

i


+


G

2

i




D
i



K

2

i




)




]


,






C
fi≙[Ci+DiK1iDiK2i].


Using equations (206), (230), and (231) can be written as

ξi(t)=Afiξi(t)+Bfiμi(t), ξi(0)=ξi0, t≥0,  (232)
ei(t)=Cfiξi(t).  (233)


Let, in addition, Ψf≙diag(Ψfi, . . . , ΨfN), Ψ=A, B, C and ξ(t)≙[ξ1T(t), . . . , ξNT(t)]T. Then, equations (232) and (233) can be written in the compact form given by

{dot over (ξ)}(t)=Afξ(t)+Bf(custom charactercustom characterIp){tilde over (w)}(t), ξ(0)=ξ0, t≥0,  (234)
{tilde over (z)}(t)=Cfξ(t),  (235)

where e(t)={tilde over (w)}(t)={tilde over (z)}(t). Observe that the system in equations (234) and (235) can take the form of equation (212) in [4]. Therefore, Theorem 2 in [4] is supposed to be used immediately. However, its statement is not correct as it is written. A counterexample is described further later in the disclosure.


This paragraph uses the notation and the terminology from [4]. Readers are referred to (12), Theorem 1, and Theorem 2 in [4]. It should be noted that Theorem 2 relies on Theorem 1 and this theorem is derived by means of Theorem 11.8 and Lemma 11.2 in [19]. According to the mentioned results and Chapter 5.3, which is devoted to the notion of internal stability for the system of interest, in [19], it is clear that the following condition should be added to the hypothesis of Theorem 1: Let the realization of T(s) given by (12) be stabilizable and detectable. With this modification, not only the gap in Theorem 1, but also the one in Theorem 2 is filled.


It is understood that the system in equations (234) and (235) is stabilizable and detectable if Af is Hurwitz. Thus, the new condition is satisfied if Afi is Hurwitz for all i∈custom character.


Remark 4B. Assumptions 4B, 5B, and 6B can ensure the stability of the pair EQN for all i∈custom character. Therefore, K1i and K2i can be chosen such that Afi is Hurwitz for all i∈custom character.


Remark 4B. Assumptions 4B, 5B, and 6B ensure the stabilizability of the pair (Āi, Bi) for all i∈custom character by Lemma 1.25 in [5]. Therefore, K1i and K2i can always be chosen such that Afi is Hurwitz for all i∈custom character.


Let gfi(s)≙Cfi(sI−Afi)−1Bfi. We now state the following theorem for the dynamic state feedback case.


Theorem 2B. Let Assumption 3B hold and Afi be Hurwitz for all i∈custom character. If

gfiρ(custom charactercustom character)<1, ∀i∈custom character,  (236)

where ∥gfi is the H norm of gfi(s), then Ag is Hurwitz.


Proof. It follows from Theorem 2 in [4] and the above discussion.


Remark 5B. The inequality given by equation (236) is an agent-wise local sufficient condition; that is, it paves the way for independent controller design for each agent. For the connection between this condition and an algebraic Riccati equation (respectively, linear matrix inequality), this disclosure refers to Lemma 9 in [4] (respectively, Theorem 6 in [16]). Moreover, it is understood from Lemma 2B that ρ(custom charactercustom character)<1 under Assumption 3B. Therefore, we can restate Theorem 2B by replacing equation (236) with ∥gfi≤1, ∀i∈custom character. In this statement, although the condition becomes more conservative, it is not only agent-wise local but also graph-wise local except Assumption 3. Finally, it should be noted that if the graph custom character considered in Theorem 2B contains no loop (i.e., acyclic), then the nodes in custom character can be relabeled such that i>j when (vj, vi)∈custom character. Thus, A is similar to a lower triangular matrix with zero diagonal entries, so is custom charactercustom character. This implies that ρ(custom charactercustom character)=0; hence, Theorem 2B does not require the condition given by equation (236) anymore. In terms of being agent-wise and graph-wise local, this special case is consistent with the result in [18].


Dynamic Output Feedback with Local Measurement


Let









z
i



(
t
)




=
Δ





[




x
^

i
T



(
t
)


,



z
¯

i
T



(
t
)



]

T






ϵ








n

z

2

i






,





where xi(t) is the estimate of the state xi(t), Ki≙[K1i K2i], and equation (209) have the form given by

ui(t)=K1i{circumflex over (x)}i(t)+K2izi(t).  (237)

To estimate the state xi(t), the following local Luenberger observer is employed

{circumflex over ({dot over (x)})}i(t)=Ai{circumflex over (x)}i(t)+Biui(t)+Hi(ymi(t)−Cmi{circumflex over (x)}i(t)−Dmiui(t)), {circumflex over (x)}i(0)={circumflex over (x)}i0, t≥0,  (238)

where H; is the observer gain matrix. Using equation (237), equation (238) can be written as

{circumflex over ({dot over (x)})}i(t)=(Ai+BiK1i−Hi(Cmi+DmiK1i)){circumflex over (x)}i(t)+Hiymi(t)+(Bi−HiDmi)K2izi(t), {circumflex over (x)}i(0)={circumflex over (x)}i0, t≥0,  (239)

Let also zi(t) evolve according to the dynamics given by

żi(t)=Glizi(t)+G2ievi(t), zi(0)=zi0, t≥0,  (240)

By equations (239) and (240), one can define the triple (M1i, M2i, M3i) in equation (210) as










M

1

i




=




[





A
i

+


B
i



K

1

i



-


H
i



(


C
mi

+


D
mi



K

1

i




)







(


B
i

-


H
i



D
mi



)



K

2

i







0



G

1

i





]














M

2

i




=




[



0





G

2

i





]


,


M

3

i




=





[




H
i





0



]

.






(
241
)







Using equation (208), equation (239) can be rewritten as

{circumflex over ({dot over (x)})}i(t)=HiCmixi(t)+(Ai+BiK1i−HiCmi){circumflex over (x)}i(t)+BiK2izi(t), {circumflex over (x)}i(0)=xi0, t≥0,  (242)


Next, define {circumflex over (x)}(t)≙[{circumflex over (x)}iT(t), . . . , {circumflex over (x)}NT(t)]Tcustom charactern, z(t)≙[ziT(t), . . . , zNT(t)]T and H≙diag (H1, . . . , HN). Inserting equation (237) into equations (203) and (204), using equation (242), equation (240), and the above definitions, equations (3), (10), and (4) can be compactly written as

{dot over (x)}(t)=Ax(t)+BK1{circumflex over (x)}(t)+BK2z(t)+a(t), x(0)=x0, t≥0,  (243)
{circumflex over ({dot over (x)})}(t)=HCmx(t)+(A+BK1−HCm){circumflex over (x)}(t)+BK2z(t), {circumflex over (x)}(0)=x0, t≥0,  (244)
ż(t)=G1z(t)+G2ev(t), z(0)=z0, t≥0,  (245)
e(t)=Cx(t)+DK1{circumflex over (x)}(t)+DK2z(t)−Raωa(t).  (246)


Now, insert equation (246) into equation (214) and replace the obtained expression with the one in equation (245). Let








η


(
t
)




=






[



x
T



(
t
)


,



x
^

T



(
t
)


,



z
_

T



(
t
)



]

T



ϵℝ


n
_

+


n
_


z
2






,





where nz2i=1N nz2i. Then, the closed-loop system of equations (203), (204), (205), (208), (209), and (210) can be represented as

η(t)=Aηη(t)+Bηωa(t), η(0)=η0, t≥0,  (247)
e(t)=Cηη(t)+Dηωa(t)  (248)

where








A
η

=

[



A



B


K
1





B


K
2







H


C
m





A
+

B


K
1


-

H


C
m






B


K
2








G
2



𝒲

C






G
2




𝒲

DK

1






G
1

+


G
2




𝒲

DK

2






]


,






B
η

=

[



E




0






-

G
2





𝒲

R

a





]


,


C
η

=

[



C



DK
1




DK
2




]


,


D
η

=

-


R
a

.







For the following result, define AHi≙Ai−HiCmi and AH≙A−HCm. By Assumption 7B, Hi can be chosen such that AHi is Hurwitz for all i∈custom character.


Theorem 3B. Let Assumptions 1B, 2B, 3B, and 6B hold. If Ag is Hurwitz and AHi is Hurwitz for all i∈custom character, then the distributed dynamic output feedback control with local measurement given by equations (209) and (210) can solve the problem in Definition 2.


Proof. Let K≙[K1 K2], Â≙A, {circumflex over (B)}≙BK, Ĉ≙C, Ĉm≙Cm, {circumflex over (D)}≙DK, {circumflex over (D)}m ≙DmK, Ê≙E, {circumflex over (F)}≙−Ra,








M
1



=




[




A
+

BK
1

-

H


(


C
m

+


D
m



K
1



)







(

B
-

HD
m


)



K
2






0



G
1




]


,













M
2



=




[



0





G
2




]


,


M
3



=





[



H




0



]

.






(
249
)







Then, observe that the quadruple (Aη, Bη, Cη, Dη) takes the form of (Ac, Bc, Cc, Dc) in Lemma 3B. Recall from the proof of Theorem 1B that the triple (G1, G2, 0) incorporates an N p-copy internal model of A0a under Assumption 6B. Thus, the triple (M1, M2, M3) may also incorporate an N p-copy internal model of A0a. It is given that Assumptions 1B and 3B hold. In order to apply Lemma 3B, it should be shown that An is Hurwitz under the conditions that Ag is Hurwitz and AHi is Hurwitz for all i∈custom character. To this end, the following elementary row and column operations are performed on Aη. First, subtract row 1 from row 2 and add column 2 to column 1. Second, interchange rows 2 and 3, and interchange columns 2 and 3. Thus, we obtain the matrix given by







A
η



=





[




A
+

B


K
1






B


K
2





B


K
1








G
2



W


(

C
+

D


K
1



)







G
1

+


G
2


WD


K
2







G
2



WDK
2






0


0



A
H




]

.





Considering the performed elementary row and column operations, one can verify that Aη is similar to Āη; hence, they have the same eigenvalues. Since Āη is upper block triangular, σ(Āη)=σ(Ag)∪σ(AH). Note that AH is Hurwitz as AHi is Hurwitz for all i∈custom character. It is also given that Ag is Hurwitz. Thus, Aη is Hurwitz. Then, the matrix equations

XηA0a=AηXη+Bη,
0=CηXη+Dη,

have a unique solution Xη by Lemma 3B.


Following similar steps to those in the proof of Theorem 1B, it can be shown under Assumption 2B that ei(t) is ultimately bounded with an ultimate bound for all η0 and for all i∈custom character. If, in addition, limt→∞A0ω(t)−{dot over (ω)}(t)=0 then for all η0 limt→∞ei(t)=0, ∀i∈custom character.


Since the condition on AHi is both agent-wise and graph-wise local, an agent-wise local sufficient condition that ensures property a) of Definition 2 can be determined by determining an agent-wise local sufficient condition, under standard assumptions, for the stability of Ag, which is already given in Theorem 2B.


Dynamic Output Feedback


Define zi(t), Ki, and ui(t) as previously described herein; that is, equation (211) has the form of equation (237). Since evi(t) is the only available information to each agent, the following distributed observer is considered instead of equation (239) to estimate the state xi(t)

{circumflex over ({dot over (x)})}i(t)=(Ai+BiK1i−Li(Ci+DiK1i)){circumflex over (x)}i+Lievi(t)+(Bi−LiDi)K2izi(t), {circumflex over (x)}i(0)=xi0, t≥0,  (250)

where Li is the observer gain matrix. Let zi(t) satisfy the dynamics in equation (240). Define the pair (M1i, M2i) in equation (212) by replacing the triple (Hi, Cmi, Dmi) in M1i (respectively, the zero matrix in M2i) given by equation (241) with (Li, Ci, Di) (respectively, Li).


Define {circumflex over (x)}(t) and {circumflex over (x)}(t) as described previously herein and L≙diag(L1, . . . , LN). Inserting equation (237) into equations (203) and (204), using equation (250), equation (240), and the above definitions, equations (203), (212), and (204) can be expressed by equation (243),

{circumflex over ({dot over (x)})}i(t)=(A+BK1−L(C+DK1)){circumflex over (x)}(t)+(B−LD)K2z(t)+Lev(t), {circumflex over (x)}(0)={circumflex over (x)}0, t≥0,  (251)

equation (245), and equation (246). Next, insert equation (246) into equation (214) and replace the obtained expression not only with the one in equation (245) but also with the one in equation (251). In addition, define η(t) as described previously herein. Then, the closed-loop system of equations (203), (204), (205), (211), and (212) can be expressed by equations (247) and (248) if the second row of Aη is replaced with

[Lcustom characterCA+BK1−L(C+DK1custom characterDK1)(B−LD+Lcustom characterD)K2]

and the second row of Bη is replace with −Lcustom characterRa.


Theorem 4B. Let Assumptions 1B, 2B, 3B, and 6B hold. If the resulting Aη is Hurwitz, then the distributed dynamic output feedback control given by equations (211) and (212) solves the problem in Definition 2.


Proof. Define K, Â, {circumflex over (B)}, Ĉ, {circumflex over (D)}, Ê, and {circumflex over (F)}, as in the proof of Theorem 3B. Let Ĉm≙0, {circumflex over (D)}m≙0, and M3≙0. Define also the pair (M1, M2) by replacing the triple (H, Cm, Dm) in M1 (respectively, the zero matrix in M2) given by equation (249) with (L, C, D) (respectively, L). Then, observe that the resulting quadruple (Aη, Bη, Cη, Dη) takes the form of (Ac, Bc, Cc, Dc) in Lemma 3B. By the same argument in the proof of Theorem 3B, the resulting triple (M1, M2, M3) incorporates an N p-copy internal model of A0a under Assumption 6B. Since, in addition, Assumptions 1B, 2B, and 3B hold and Aη is Hurwitz, the rest of the proof can be completed by following the steps given in the proof of Theorem 1B.


Now, a goal can be to determine an agent-wise local sufficient condition that assures property a) of Definition 2 under some standard assumptions. For this purpose, define μi(t) as in as described previously herein and let ζi(t)≙[xiT(t),{circumflex over (x)}iT(t),ziT(t)]T,








A
fi



=




[




A
i





B
i



K

1

i







B
i



K

1

i









L
i



C
i






A
i

+


B
i



K

1

i



-


L
i



C
i







B
i



K

1

i









G

2

i




C
i






G

2

i




C
i



K

1

i







G

1

i


+


G

2

i




D
i



K

2

i







]


,






B
Fi



=




[



0





-

L
i







-

G

2

i






]


,


C
Fi



=





[


C
i



D
i



K

1

i




D
i



K

2

i



]

.






Furthermore, consider equations (203), (212), (213), and (204) when ω(t)≡0. By inserting equation (211) into the considered equations, we have

{dot over (ζ)}i(t)=AFiζi(t)+BFiμi(t), ζi(0)=ζi0, t≥0,  (252)
ei(t)=CFtζi(t).  (253)


Remark 6B. Let ALi≙Ai−LiCi. By performing the elementary row and column operations given in the proof of Theorem 3B on AFi, it can be shown that σ(AFi)=σ(Afi)∪σ(ALi). Note that by Assumption 8B, Li can be chosen such that ALi is Hurwitz for all i∈custom character. In conjunction with Remark 4B, this shows that under Assumptions 4B, 5B, 6B, and 8B, it is possible to find K1i, K2i, and Li such that AFi is Hurwitz for all i∈custom character.


Let gFi(s)≙CFi(sI−AFi)−1BFi. For the dynamic output feedback case, the following theorem can be described.


Theorem 5B. Let Assumption 3B hold and AFi be Hurwitz for all i∈custom character. If

gFiρ(custom charactercustom character)<1, ∀i∈custom character.  (254)

then the resulting Aη is Hurwitz.


Proof. It follows by comparing equations (252) and (253) with equations (232) and (233).


To illustrate the performance of the proposed distributed controller architecture described in this embodiment, the following two numerical examples with different exosystems are presented. In particular, the first (respectively, second) example presents the distributed dynamic state (respectively, output) feedback control law. For both examples, we consider five agents with the following system, input, output, and direct feedthrough matrices








A
i

=

[




-
1



1





0
.
2



0



]


,


B
i

=

[



1




2



]


,


C
i

=

[



1


0



]


,


D
i

=

0
.
1


,

i
=
1

,
4
,
5
,






A
i

=

[



0


1


0




0


2


1




0


0


0



]


,


B
i

=

[



0


0




1


0




0


1



]


,


C
i

=

[



1


0


0.4



]


,


D
i

=
0

,

i
=
2

,
3
,





and the augmented graph custom character shown in FIG. 1. With this setup, each agent satisfies Assumptions 4B and 8B. It is also clear from FIG. 1 that Assumption 3B holds. In the simulations, each nonzero aij is set to 1 and ki=1, i=1,2. Moreover, initial conditions for the agents are given by x10=[1, 0.6]T, x20=[−0.5, 0, −0.2]T, x30=[−0.2, −0.3, 0]T, x40=[0.6, 0]T, x50=[0, 0.5]T, and the controllers of all agents are initialized at zero.


Example 3. In this example, the disturbance δ(t) and the trajectory of the leader r0(t) satisfy the following dynamics









δ
.



(
t
)


=



[



0




0
.
0


1



0




0


0


0




0


0




-

0
.
0



5




]



δ


(
t
)



+

[



0




0






0
.
0


5




]



,


δ


(
0
)


=

[



0





-

0
.
2






0



]


,

t

0

,






{dot over (r)}
0(t)=−t03(t)+u0(t), r0(0), t≥0,


respectively, where








u
0



(
t
)


=

{





0.1

t

,





0

t
<
100

,








0.1

t

-

2


sin


(

0.1

t

)




e


-
0.01



(

t
-
100

)





,





100

t
<
200

,







14
+

sin


(

0.05


(

t
-
200

)


)



,




t

200.









By the solution of the disturbance dynamics with the given initial condition, {dot over (δ)}(t) is bounded. Since u0(t) is piecewise continuous and bounded, r0(t) is bounded by Example 4.25 in [7]; hence, {dot over (r)}0(t) is piecewise continuous and bounded. Clearly, {dot over (ω)}(t) is piecewise continuous and bounded. Furthermore, the exosystem affects the state of each agent and its tracking error through matrices








E

δ
1


=

[



0


1


0




0


0


0



]


,






E

δ
4


=

[




0
.
1



0


0




0


0



-

0
.
1





]


,






E

δ
5


=

[



0


0


0





-

0
.
1





-

0
.
2




0



]


,






E

δ
2


=

[



0


0


1




0


0


0




0


0



0
.
5




]


,






E

δ
3


=

[



0



-
0.5



0




0


0



-
1





0


0.4


0



]


,


R
r

=
1





Suppose the piecewise continuity and boundedness of {dot over (ω)}(t) are the only information that is known about the exosystem. As it is suggested in part a) of Remark 2B, let A0=0 and (G1i, G2i)=(0,1) for all i∈custom character. Thus, Assumptions 1B, 2B, 5B, and 6B hold. With the following controller parameters

K1i=−[1.19600.9611], K2i=−1.4142, i=1,4,5,








K

1
i


=

-

[





4
.
2


3

2

8





5
.
3


9

0

4





1
.
4


0

3

8







1
.
2


6

0

4





1
.
4


0

3

8





1
.
7


1

1

5




]



,






K

2

i


=

-

[





1
.
2


7

8

8







1
.
3


6

5

5




]



,

i
=
2

,
3
,




Afi is Hurwitz for all i∈custom character and the condition given by equation (236) is satisfied. Thus, Ag is Hurwitz by Theorem 2B. As Theorem 1B promises, ultimately bounded tracking error is observed in FIG. 4.


Example 2. The disturbance and the trajectory of the leader satisfy

{dot over (δ)}(t)=e−0.1t, δ(0)=1, t≥0










r
.

0



(
t
)


=



[



0



0
.
5






-

0
.
5




0



]




r
0



(
t
)



+

[




t


e

-
t




sin


(
t
)








2


e

-
t






]



,







r
0



(
0
)


=

[




-
1





1



]


,

t

0

,





respectively. Moreover, Eδ1=[1 0]T, Eδ2=[0 1 0]T, Eδ3=[−1.5 0 0.3]T, Eδ4=[0 2]T, Eδ5=[0.2 −0.2]T, and Rr=[1 0].


Suppose the unforced parts of the given dynamics are available to the control designer and the forcing terms are known to be piecewise continuous and convergent to zero. Then, let








A
0

=

[



0



0
.
5



0





-

0
.
5




0


0




0


0


0



]


,





and








G

1

i


=

[



0


1


0




0


0


1




0




-

0
.
2



5



0



]


,






G

2

i


=

[



0




0




1



]


,



i






ϵ𝒩
.







Hence, Assumptions 1B, 5B, and 6B hold. In addition, limt→∞A0ω(t)−{dot over (ω)}(t)=0. Note that Assumption 2B automatically holds since A0ω(t)−{dot over (ω)}(t) is piecewise continuous and convergent. With the following controller parameters

K1i=−[5.1794 0.7932], Li=[17 80.2]T,
K2i=−[2 5.4458 10.3182], i=1,4,5,








K

1

i


=

-

[





6
.
1


9

1

6





5
.
7


6

8

6





1
.
7


8

3

5







3
.
9


2

9

9





1
.
7


8

3

5





2
.
4


2

8

2




]



,






L
i[−187 756 600]T,








K

2

i


=

-

[





0
.
4


5

1

3





0
.
9


1

7

3





3
.
3


8

3

9







0
.
8


9

2

4





2
.
2


2

8

5





5
.
6


3

7

7




]



,




AFi is Hurwitz for all i∈custom character and the condition given by equation (254) is satisfied. Thus, Aη is Hurwitz by Theorem 5B. Furthermore, it is guaranteed by Theorem 4B that limt→∞ei(t)=0, ∀i∈custom character and this fact is demonstrated in FIG. 5.


Solvability of Equations (226) and (227)


Section III in [4] also studies the solvability of the matrix equations in equations (226) and (227), which correspond to the matrix equations given by (6) in [4], with an alternative approach. Specifically, the last paragraph of Section III in [4] lists three sufficient conditions based on Remark 3.8 of [6] to guarantee that these matrix equations have a unique solution. However, it cannot be guaranteed as it claimed in [4]. This subsection aims to present the gaps between the conditions and the existence of a unique solution to the matrix equations, propose appropriate modifications that fill these gaps, and explain the motivation behind the disclosed approach. For this purpose, the first focus is on Definition 3.7 and Remark 3.8 in [6] to fix a problem in [6]. Then, the conditions listed in [4] are revisited to point out the missing one. Finally, a motivational example is provided and the difference between the approach in [4] and the one in this disclosure is highlighted.


In this paragraph, the notation and the terminology in [6] are adopted and readers are referred to (3.5), (3.6), (3.8), Definition 3.7, and Remark 3.8 in [6]. The problem in [6] is that the conditions of Remark 3.8 do not ensure the stabilizability of the pair given by (3.8). Moreover, this problem is directly transferred to [4]. To illustrate this point, the following system, input, output, and direct feedthrough matrices of the plant; and system matrix of the exosystem are considered







A
=

[



1


2




1


0



]


,





B
=

[



2




0



]


,






C=[0.5 −0.5], D=0, A1=0.


It can be easily checked that the plant and the exosystem above satisfy the first and the second condition of Remark 3.8. Note that m(s)=s is the minimal polynomial of A1. Then, choose the pair (β11) in (3.6) as follows








β
1

=

[



0


1




0


1



]


,






σ
1

=


[



0




1



]

.






It is obvious that the pair (β11) is controllable and the minimal polynomial of A1 divides the characteristic polynomial of β1. Thus, the pair (G1, G2)≙(β11) incorporates a 1-copy internal model of A1 according to Definition 3.7. Now investigated the stabilizability of the pair in equation (3.8). This pair is not controllable by the controllability matrix test (for example, see Theorem 12.1 in [3]) and the eigenvalues of the first matrix of this pair are −1, 0, 1, and 2. The eigenvector test for stabilizability (for example, see Theorem 14.1 in [3]) reveals that unstable eigenvalue 1 is the uncontrollable mode; that is, the pair in (3.8) is not stabilizable. Hence, there do not exist K1 and K2 such that Ac defined in (3.5) is Hurwitz. This counterexample to Remark 3.8 is obtained due to the fact that the constructed G1 violates Property 1.5 in [5]. In fact, J. Huang (personal communications, Jun. 9, 2018) recognizes the problem in Remark 3.8; hence, he adds Property 1.5 as a condition to Lemma 1.27 of [5]. Also noted is that the proof of Lemma 1.26 in [5] is still valid even if Assumption 1.1 in [5] is removed from the hypotheses of Lemma 1.26.


In this disclosure, Definition 1 modifies the second property of Definition 1.22 given after (1.58) in [5]. This modification guarantees that Property 1.5 in [5] automatically holds if Assumption 5B holds. Based on the foregoing discussions, it is clear that Remark 4B is true.


The following two paragraphs adopt the notation and the terminology from [4]. Readers are referred to (5), (6), (7), (8), (10), Definition 2, Lemma 2, Section IIB, and Section III in [4]. It is shown in Section III that if the matrix equations in (8) have solutions X1i and X2i for i=1, . . . , N, then the ones in (7) have solutions X1=diag(X11, . . . , X1N) and X2=diag(X21, . . . , X2N); that is, the matrix equations in (6) has a solution X=[X1TX2T]T. Furthermore, it is claimed that if the three conditions listed in the last paragraph of Section III hold, then the matrix equations in (8) have unique solutions X1i and X2i for i=1, . . . , N. In section II.B, S is assumed to have no strictly stable modes. However, these conditions do not guarantee the unique solutions. For, consider A1=0, B1=1, C1=1, D1=0, S=0, R=1, P1=1, F1=0, and G1=1. It can be easily checked that the listed conditions are satisfied and Property 1.5 in [5] is not violated. Choose K1=0 and H1=0. From the first matrix equation in (8), we get 1=0, which is a contradiction. Next, the problem in the claim is pointed out. First, observe that the matrix equations in (8) can be equivalently written as the matrix equations given by (1.70) and (1.71) in [5]. Then, Lemma 1.27 in [5], one can note that the following condition is missed in the claim: Ã; given after (10) is Hurwitz for i=1, . . . , N. After the suggested modification above, Ki and Hi can always be chosen such that Ãi is Hurwitz under the listed conditions. It can be shown that this condition, together with the assumption on S, ensures that zero matrices are the unique solutions to the off-block-diagonal matrix equations in (7) by adding Gc((Cc=DcKc)X1+DcHcX2−Rc) to the left side of the second equation in (7) that gives an equivalent form of (7) and applying the first part of Proposition A.2 in [5]. In conclusion, if the assumption on S holds, the third condition in the list holds for i=1, . . . , N, and Ãi is Hurwitz for i=1, . . . , N, then the matrix equations in (6) has a unique solution X.


According to Lemma 2B, the problem in Definition 2 is solved if the assumption on S holds, Al given after (5) is Hurwitz, and the matrix equations in (6) have a unique solution X. Although the approach utilized during the derivation of the listed conditions does not take into account the assumption on Al, which is required to solve the problem in Definition 2, one may wonder the answer of the following question: Let the listed conditions hold and Al be Hurwitz. Then, can it be concluded that Ãi is Hurwitz for i=1, . . . , N? The answer is no. That is, the missing condition cannot be satisfied by assuming that the listed conditions hold and Al is Hurwitz. To clarify this point, consider the system parameters of the agents, the system matrix of the exosystem, and the adjacency matrix of custom character*.








A
1

=

[




-
1



1




1


0



]


,






B
1

=

[



1



0
.
5





0




0
.
2


5




]


,






C
1

=

[

1




-

0
.
5


]


,






D
1

=
0

,






A
2

=

[



0


1


0




0


0


1




0


0


0



]


,






B
2

=

[



0




0




1



]


,






C
2

=

[



1


0


0



]


,






D
2

=
0

,






A
3=1, B3=−1, C3=1, D3=0, S=0,







Q
*

=


[



1


0


0


0





0
.
5



0


0



0
.
5





0



0
.
5



0



0
.
5





0



0
.
5




0
.
5



0



]

.





Choose (Fi,Gi)=(0,1), i=1,2,3. It can be easily checked that the listed conditions are satisfied and Property 1.5 in [5] is not violated. One can also obtain W, which is required to construct Ai, from custom character*. Then, choose the remaining parameters of the controllers as follows








K
1

=

[





2
.
6


7

5

2





9
.
6


6

2

4







-
1



0
.
6


7

5

2





-
2



4
.
6


6

2

4




]


,






H
1

=

[




-

6
.
4







6
.
4




]


,






K
2=−[104.56 57.936 14.828], H2=−80, K3=0.8, H3=1.


With this setup, it can be verified that 3 is not Hurwitz even though A is Hurwitz.


Based on the previous example, the following question arises: Is the missing condition in [4] necessary to ensure that the matrix equations given by (6) in [4] have a unique solution? In fact, this question is the motivation behind the key lemma (i.e., Lemma 3B) of this disclosure and the answer is no. In contrast to Section III in [4], the approach in Lemma 3B does not decompose matrix equations, which consist of the overall dynamics of the multiagent system, into matrix equations, which deal with the dynamics of each agent separately; hence, the missing condition in [4] is not required in Lemma 3B. Furthermore, not only dynamic stated feedback but also dynamic output feedback with local measurement and dynamic output feedback effectively utilize Lemma 3B to solve the stated problem in Definition 2 (see Theorems 1B, 3B, and 4B).


Since the proof of Theorem 1 and the statement of Theorem 4 in [16] use the approach in Section III of [4], the description in this subsection will also be helpful for the readers of the results in [16].


On Theorem 2 in [2]


In this subsection, the notion and the terminology in [4] are adopted and readers are referred to (5), (10), (15), and Theorem 2 in [4]. Now, consider the system parameters of the agent, the system matrix of the exosystem, and the adjacency matrix of custom character* given by








A
1

=

[



1


0


0




0


1


0




0


0



-
1




]


,






B
1

=

I
3


,






C
1

=

[



1


0


0




0


1


0



]


,






D
1

=
0

,





S
=
0

,






Q
*

=


[



1


0




1


0



]

.






Choose (F1, G1)=(0, I2) and








K
1

=

[




-
2



0


0




0



-
2



0




0


0


2



]


,






H
1

=


[




-
1



0




0



-
1





0


0



]

.






Note that W=1 from custom character*; hence, Al given after (5) is nothing but Ã1 given after (10). With this setup, it can be verified that T1(s) given before Theorem 2 is stable and the condition in (15) is automatically satisfied, but Al is not Hurwitz. This counterexample is obtained because the realization of T1(s) is neither stabilizable nor detectable.


The above setup also applies to Theorem 5 in [16] since it relies on Theorem 2 and its conditions are satisfied. It should be noted that although Assumptions 1-4 in [16] and Property 1.5 in [5] are not listed in the hypothesis of Theorem 5 in [16], this counterexample does not violate them.


References Related to the Second Embodiment

  • [1] D. S. Bernstein. Matrix mathematics: Theory, facts, and formulas. Princeton University Press, 2009.
  • [2] H. Cai, F. L. Lewis, G. Hu, and J. Huang. The adaptive distributed observer approach to the cooperative output regulation of linear multi-agent systems. Automatica, 75:299-305, 2017.
  • [3] J. P. Hespanha. Linear systems theory. Princeton University Press, 2009.
  • [4] C. Huang and X. Ye. Cooperative output regulation of heterogeneous multi-agent systems: An H1 criterion. IEEE Transactions on Automatic Control, 59(1):267-273, 2014.
  • [5] J. Huang. Nonlinear output regulation: Theory and applications. SIAM, 2004.
  • [6] J. Huang and C.-F. Lin. On a robust nonlinear servomechanism problem. IEEE Transactions on Automatic Control, 39(7):1510-1513, 1994.
  • [7] H. K. Khalil. Nonlinear systems. Prentice Hall, 2002.
  • [8] E. Lavretsky and K. A. Wise. Robust and adaptive control with aerospace applications. Springer, 2013.
  • [9] F. L. Lewis, H. Zhang, K. Hengster-Movric, and A. Das. Cooperative control of multi-agent systems: Optimal and adaptive design approaches. Springer, 2014.
  • [10] Li, X. Wang, J. Xiang, and W. Wei. Synchronised output regulation of leader-following heterogeneous networked systems via error feedback. International Journal of Systems Science, 47(4), 2015.
  • [11] S. B. Sarsilmaz and T. Yucelen. On control of heterogeneous multiagent systems with unknown leader dynamics. In ASME Dynamic Systems and Control Conference, 2017.
  • [12] S. B. Sarsilmaz and T. Yucelen. On control of heterogeneous multiagent systems: A dynamic measurement output feedback approach. In American Control Conference, 2018.
  • [13] P. N. Shivakumar and K. H. Chew. A sufficient condition for nonvanishing of determinants. Proceedings of the American Mathematical Society, 43(1):63-66, 1974.
  • [14] Y. Su and J. Huang. Cooperative output regulation of linear multi-agent systems. IEEE Transactions on Automatic Control, 57(4):1062-1066, 2012.
  • [15] Y. Su and J. Huang. Cooperative output regulation of linear multiagent systems by output feedback. Systems & Control Letters, 61(12):1248-1253, 2012.
  • [16] F. Adib Yaghmaie, F. L. Lewis, and R. Su. Output regulation of linear heterogeneous multi-agent systems via output and state feedback. Automatica, 67:157-164, 2016.
  • [17] M. Vidyasagar. Input-output analysis of large-scale interconnected systems: Decomposition, well-posedness, and stability. Springer-Verlag, 1981.
  • [18] X. Wang, Y. Hong, J. Huang, and Z.-P. Jiang. A distributed control approach to a robust output regulation problem for multiagent linear systems. IEEE Transactions on Automatic Control, 55(12):2891-2895, 2010.
  • [19] K. Zhou, J. C. Doyle, and K. Glover. Robust and optimal control. Prentice Hall, 1996.


Third Embodiment


FIG. 1 is a representation of a multiagent system having six agents of a distributed control architecture for spatial control of the multiagent system. Referring to FIG. 1, a multiagent system 100 comprises a group of six agents including a capable agent 110 and five other agents 120.


In one embodiment, the capable agent 110 and the other agents 120 represent six respective vehicles comprising a capable vehicle 110 and other vehicles 120 in a group of vehicles of the multiagent system 100. Arrows between the vehicles represent an example of peer-to-peer communication paths for exchanging navigation information among the six vehicles of the multiagent system 100. The dynamics of each of the agents is represented, for example, by linearized equations of translation dynamics described in the first and second embodiments.



FIG. 6 is a block diagram of a vehicle agent device 300. Referring to FIG. 6, there is shown a vehicle agent device 300 that includes, among other things, communication interfaces 320, agent sensor devices 322, navigation logic 324, a vehicle navigation command generator 328, vehicle navigation controllers 330, a GNSS receiver 332, a graphical user interface 334, an electronic processor 338, a memory 340, a camera 350, a microphone 352, a display 354, a speaker 356, a network interface 364, user interfaces 366 and an input/output (I/O) interface 370.


The vehicle agent device 300 may be similar or substantially the same as a capable agent 110 or one of the other agents 120 in the multiagent system having a distributed control architecture, as described with respect to FIG. 1. Other agent devices 120 of a group may be referred to as following agent devices 120. The vehicle agent device 300 may comprise a single device or may comprise a plurality of devices connected to or integrated within a vehicle platform. The vehicle platform may comprise any suitable autonomous underwater, ground, aerial, or space vehicle that is operable to travel as a member of a group having distributed control architecture as described herein. A vehicle platform may be configured to transport passengers or may be an unmanned vehicle platform. The vehicle platform may be controlled by the vehicle agent device 300 to travel as a member of a group or swarm of vehicles that communicate group course direction information via peer-to-peer communications among a plurality of vehicles that travel as members of the group.


A group of the vehicles may be referred to as a swarm and may comprise a plurality of vehicles that travel over time in a one dimensional spatial system, a two dimensional spatial system, or a three dimensional spatial system to reach a common destination. The common destination may be a fixed destination or one that changes position over time. For example, a group may pursue or follow a target vehicle or an object. The direction of travel of the group may change over time. The direction of travel of the group may be initiated by a vehicle agent device 300 that functions as a capable agent 110. The direction of travel of the group may further be implemented autonomously by one or more vehicle agent devices that function as other agents 120 of the group.


In one embodiment, a vehicle platform may comprise an automobile that travels over land or roads to pursue a moving target vehicle. In another embodiment, the vehicle platform may comprise flying crafts such as planes or drones that travel in the air or space towards a moving target. In another embodiment, the vehicle platform may comprise boats or submarines that travel in or under water. The moving target may travel in the air, space, water, or along the ground. In another embodiment, a plurality of vehicles of a group may travel towards a fixed destination. For example, vehicles traveling in a group may carry passengers, goods, or materials to a common fixed destination. However, the disclosure is not limited to any specific type of vehicle platform or mode of travel, and any suitable vehicle platform or mode of travel may be controlled by the vehicle agent device 300 to be a member of a group of vehicles.


The vehicle agent device 300 shown in FIG. 3 includes elements that enable the vehicle agent device 300 to function as a capable agent device 110 and as another agent device 120. The one or more other agent devices 120 of a group may be referred to as following agent devices 120. A group may comprise one or more capable agent devices 110 and one or more following agent devices 120. For example, a group may have one capable agent device 110 and one hundred following agent devices 120, or two capable agent devices and five hundred following agent devices 120 may comprise a group. However, the disclosure is not limited to a specific number of, or ratio of, capable agent devices 110 and following agent devices 120. Furthermore, although the vehicle agent device 300 shown in FIG. 3 includes elements that enable the vehicle agent device 300 to function as a capable agent device 110 and as another agent device 120, in some embodiments, the vehicle agent device 300 may include only features needed for functioning as a following agent 120. Alternatively the vehicle agent device 300 may include only features needed for functioning as a capable agent 120.


A vehicle agent device 300 that operates as a capable agent 110 may be referred to as a capable agent device. A vehicle agent device 300 that operates as one of the other agents 120 may be referred to as a following agent device. A vehicle agent device 300 may refer to either or both of a capable agent device or a following agent device.


The navigation logic 324 may determine a direction for a course of travel in one, two, or three dimensions for a vehicle platform connected to the vehicle agent device 300. For example, the navigation logic 324 may determine when, where, and how a connected vehicle platform should change its spatial orientation and to what degree its spatial orientation should change in its course of travel within a group. In one embodiment, the navigation logic 324 may utilize location information from the GNSS receiver 332 and may utilize navigation mapping software and USGS data, for example, to determine the course of travel. The navigation mapping software may track the location of the vehicle agent device 300 based on the GNSS location information. In some embodiments, the navigation logic 324 may utilize information received via one or more of the agent sensor devices 322 to determine the course of travel. For example, the agent sensor devices 322 may include ultrasonic sensors, infrared (IR) sensors, cameras with vision processing, a light detection and ranging system (LIDAR), peer-to-peer wireless radio communication, a sound navigation and ranging (SONAR) system, and the like. For example, the information from the agent sensor devices 322 may provide the location or relative location of one or more other vehicles or objects that the vehicle agent device 300 and its connected platform vehicle are following. The one or more other vehicles or objects may be a target of interest, such as a vehicle or object that the group as a whole is following or following. Moreover, the one or more other vehicles or objects sensed by the agent sensor devices 322 may include one or more other vehicles of a group that the vehicle agent device 300 is a member of and travelling among. This form of sensing of the other vehicles of the group may be referred to peer-to-peer communication among vehicles of the group, and may enable a plurality of vehicle agent devices 300 of the group to each autonomously determine their own direction of travel and/or speed.


A capable agent device may serve as a leader of a group of vehicles, and may receive or determine its own navigation parameters, for example, for direction of travel and speed, in a variety of ways. In one embodiment, when the vehicle agent device 300 functions as a capable agent device, the navigation logic 324 may determine a course direction for the capable agent device based on instructions received via a wireless interface of the communication interfaces 320, from an external control station 360 (see FIG. 1). In this regard, the external control station 360 may be external to the group of vehicles such that the vehicle agent device 300 is a member of, and may be a stationary external control station or a moving external control station. Furthermore, communications between the capable agent device and the external control station 360 are external to the peer-to-peer communications that occur among the members of the group that the capable agent device is a member of.


A capable agent device may communicate with the external control station 360 based on any suitable wireless technology or protocols. For example, the external communications may be transmitted via wireless wide area, local area, or personal area networks. Furthermore, the external communications may be implemented using, without limitation, cellular, satellite, Wi-Fi, Bluetooth, two-way radio, half duplex radio, and military or public safety communication systems. The external control station 360 may communicate direction information only with capable agent devices, and may not provide direction information to any of the following agent devices. Furthermore, the external control station 360 and the capable agent device may not have a global knowledge of the state of the group that the vehicle agent device 300 is a member of and travelling among. For example, the location and/or direction of travel of all of the following agent devices may not be known to, and are not communicated by the external control station 360, the capable agent device, or the following agent devices. Instead, the direction of travel of following agent devices travelling among a group, depend on one or more of peer-to-peer communication, local measurements, and autonomous navigation determination. In other words, the external control station 360 may provide navigation control information only to the capable agent device.


In some embodiments, the vehicle platform connected to a capable agent device may be a piloted vehicle. The pilot of the vehicle platform may provide navigation control input to the capable agent device via the graphical user interface 334 or by piloting the vehicle platform of the capable agent device. The pilot input may be received in addition to, or in place of, the information received from the external control station 360. The pilot's and/or external control station 360 control input may communicated peer-to-peer to one or more following agent devices, and may be further propagated peer-to-peer throughout the group.


In some embodiments, the navigation logic 324 may determine navigation parameters for group vehicle navigation commands based on input from one or more agent sensor devices 322 and location information received from the GNSS receiver 332 (for example, data received from vehicle sensors). For example, the navigation and parameters may be based on sensor information received while the vehicle platform and the vehicle agent device 300 are trained on the target of interest and encounter objects or obstacles along a course traveled while tracking the target of interest. Alternatively, the navigation parameters for group vehicle navigation commands may be based on program instructions stored in the memory 340 and/or data from the agent sensor devices 322 received while the capable agent travels along a programmed course.


In some embodiments, there may be more than one capable agent device that functions to lead a group of following vehicles. The multiple capable agent devices may not communicate with each other. However, if the multiple capable agent devices propagate different peer-to-peer commands to the following agent devices of the group, for example, for different navigation scaling factors, the scale factors may eventually reach a consensus value in the group, for example, an average of the different scale factors.


The vehicle navigation command generator 328 may generate commands to the vehicle platform connected to the vehicle agent device 300. The commands may be generated utilizing information received from the group navigation logic. The commands may control the direction of travel of the vehicle platform and the distance of the vehicle platform from any other vehicle agent vehicle platforms that are travelling as a member of the same group. The speed of following agent devices of a group may depend directly or indirectly on the speed of the capable agent device of the group. The speed of the capable agent device may be controlled by the external control station 360, a pilot, or the speed of an object or vehicle that the capable agent device is following.


The vehicle navigation command generator 328 may generate navigation commands for peer-to-peer communication to one or more neighboring following agent devices. The navigation commands for neighboring following agent devices are communicated between agents and may include an agent's current position, scaling factor, rotation angle (to control the group's orientation), and a local integral state vector (to construct the local control).


The vehicle navigation controllers 330 may comprise one or more control interfaces to, for example, steering, elevation, speed, or braking systems in the vehicle platform that the vehicle agent device 300 is connected to. The vehicle navigation controllers 330 may communicate the generate group vehicle navigation commands to the steering, elevation, speed, or braking systems in order for the vehicle platform to perform as a member of the group and perform group tasks. In some embodiments, the vehicle navigation controller 330 controls the movement of the vehicle agent device 300 based on the self-navigation input control signal described herein.


In some embodiments, the electronic processor 338 may be communicatively coupled to, the I/O interface 370, one or more the communication interfaces 320, the agent sensor devices 322, the navigation logic 324, the vehicle navigation command generator 328, the vehicle navigation controllers 330, the GNSS receiver 332, the graphical user interface 334, the electronic processor 338, the memory 340, the camera 350, the microphone 352, the display 354, the speaker 356, the network interface 364 and the user interfaces 366.


The memory 340 may store program instructions 346 that when executed by the electronic processor 338 may cause the electronic processor 338 to perform or support functions of the vehicle agent device 300 according to the embodiments.


In various embodiments, electronic processor 338 may be a uniprocessor system including one electronic processor 338, or a multiprocessor system including several electronic processors 338 (for example, two, four, eight, or another suitable number). Electronic processors 338 may be any suitable processor capable of executing instructions. For example, in various embodiments, the electronic processors 338 may implement any of a variety of instruction set architectures (ISAs), such as the x86, PowerPC, SPARC, or MIPS ISAs, or any other suitable ISA. In multiprocessor systems, each of the electronic processors 338 may commonly, but not necessarily, implement the same ISA.


In some embodiments, at least one electronic processor 338 may be a graphics processing unit. A graphics processing unit or GPU may be considered a dedicated graphics-rendering device. Modern GPUs may be very efficient at manipulating and displaying computer graphics, and their highly parallel structure may make them more effective than typical CPUs for a range of complex graphical algorithms. For example, a graphics processor may implement a number of graphics primitive operations in a way that makes executing them much faster than drawing directly to the screen with a host central processing unit (CPU). In various embodiments, the image processing methods disclosed herein may, at least in part, be implemented by program instructions configured for execution on one of, or parallel execution on two or more of, such GPUs. The GPU(s) may implement one or more application programmer interfaces (APIs) that permit programmers to invoke the functionality of the GPU(s). Suitable GPUs may be commercially available from vendors such as NVIDIA Corporation, ATI Technologies (AMD), and others.


The memory 340 may be configured to store program instructions 346 and/or program data (for example, capable agent data 342 and navigation data 344) accessible by the electronic processor 338 and/or by the navigation logic, 324, the vehicle navigation command generator 328, and/or the vehicle navigation controllers 330, among other elements of the vehicle agent device 300. In various embodiments, the memory 340 may be implemented using any suitable memory technology, such as static random access memory (SRAM), synchronous dynamic RAM (SDRAM), nonvolatile/Flash-type memory, or any other type of memory. In the illustrated embodiment, program instructions and data implementing desired functions, such as those described above for various embodiments, are shown stored within the memory 340 as program instructions 346 and data storage 342 and 344. In other embodiments, program instructions and/or data may be received, sent or stored upon different types of computer-accessible media or on similar media separate from the memory 340 or vehicle agent device 300. Moreover, in some embodiments, a database that is accessible via the network interface 364 may store, among other things, data for implementing desired functions, such as those described above for various embodiments. Generally speaking, a computer-accessible medium may include storage media or memory media such as magnetic or optical media, for example, disk or CD/DVD-ROM coupled to the vehicle agent device 300 via the I/O interface 370. Program instructions and data stored via a computer-accessible medium may be transmitted by transmission media or signals such as electrical, electromagnetic, or digital signals, which may be conveyed via a communication medium such as a network and/or a wireless link, such as may be implemented via network interface 364.


In one embodiment, I/O interface 370 may be configured to coordinate I/O traffic between electronic processor 338, memory 340, one or more of the communication interfaces 320, the agent sensor devices 322, the navigation logic 324, the vehicle navigation command generator 328, the vehicle navigation controllers 330, the GNSS receiver 332, the graphical user interface 334, and any peripheral devices in the vehicle agent device 300, including network interface 364 or other peripheral interfaces, such as the camera 350, microphone 352, display 345, speaker 356, and user interfaces 366. In some embodiments, I/O interface 370 may perform any necessary protocol, timing or other data transformations to convert data signals from one component (for example, the memory 340) into a format suitable for use by another component (for example, electronic processor 338). In some embodiments, I/O interface 370 may include support for devices attached through various types of peripheral buses, such as a variant of the Peripheral Component Interconnect (PCI) bus standard or the Universal Serial Bus (USB) standard, for example. In some embodiments, the function of I/O interface 370 may be split into two or more separate components, such as a north bridge and a south bridge, for example. In addition, in some embodiments some or all of the functionality of I/O interface 370, such as an interface to memory 340, may be incorporated directly into electronic processor 338.


The network interface 364 may be configured to allow data to be exchanged between the vehicle agent device 300 and other devices attached to a network, such as computer systems, a database, or between nodes of the vehicle agent device 300. In various embodiments, network interface 364 may support communication via wired or wireless general data networks, for example: via telecommunications/telephony networks such as voice networks or digital fiber communications networks; via storage area networks such as Fiber Channel SANs, or via any other suitable type of network and/or communications protocol.


The user interfaces may support, in some embodiments, one or more of display terminals, keyboards, keypads, touchpads, scanning devices, voice or optical recognition devices, or any other devices suitable for entering or retrieving data by one or more vehicle agent device 300. Multiple user input/output devices may be present in the vehicle agent device 300 or may be distributed on various nodes of the vehicle agent device 300. In some embodiments, similar input/output devices may be separate from the vehicle agent device 300 and may interact with one or more nodes of the vehicle agent device 300 through a wired or wireless connection, such as over network interface 364.


Those skilled in the art will also appreciate that, while various items are illustrated as being stored in memory or on storage while being used, these items or portions of them may be transferred between memory and other storage devices for purposes of memory management and data integrity. Alternatively, in other embodiments some or all of the software components may execute in memory on another device and communicate with the illustrated vehicle agent device 300 via inter-device communication. Some or all of the system components or data structures may also be stored (for example, as instructions or structured data) on a computer-accessible medium or a portable article to be read by an appropriate drive, various examples of which are described above. In some embodiments, instructions stored on a computer-accessible medium separate from the vehicle agent device 300 may be transmitted to the vehicle agent device 300 via transmission media or signals such as electrical, electromagnetic, or digital signals, conveyed via a communication medium such as a network and/or a wireless link. Various embodiments may further include receiving, sending or storing instructions and/or data implemented in accordance with the foregoing description upon a computer-accessible medium. Accordingly, the present embodiments may be practiced with other system configurations.



FIG. 7 is a flow chart of a method 400 for controlling motion of a vehicle in a group of vehicles. At block 405, the electronic processor 338 of the vehicle (for example, the vehicle agent device 300) determines a local virtual tracking error signal. For example, the electronic processor 338 may determine the local virtual tracking error signal in accordance with equation (205) described in the second embodiment.


At block 410, the electronic processor 338 of the vehicle determines a controller state signal. For example, the electronic processor 338 may determine the controller state signal in accordance with the controller state zi(t) described in the second embodiment.


At block 415, the electronic processor 338 of the vehicle determines self-navigation input control signal based on the local virtual tracking error signal and the controller state signal. For example, the electronic processor 338 may determine the self-navigation input control signal (for example, ui(t)) in accordance with equation (206), equation (208), and/or equation (211) described in the second embodiment.


In some embodiments, the electronic processor 338 receives data from a plurality of vehicle sensors (for example, the agent sensor devices 322). In some embodiments, the electronic processor 338 determines a local measurement output signal (for example, ymi(t)) based on the data received from the plurality of vehicle sensors. For example, the electronic processor 338 may determine the local measurement output signal based on equation (208) described in the second embodiment. In some embodiments, the electronic processor 338 determines the self-navigation input control signal based on the local measurement output signal. For example, the electronic processor 338 may determine the self-navigation input control signal based on equations (208) and (209) described in the second embodiment.


In some embodiments, the electronic processor 338 determines the location virtual tracking error signal based on a tracking error signal, a relative output error signal, or both. In some embodiments, the electronic processor 338 receives the tracking error signal (for example, ei(t)) from a capable agent vehicle. In some embodiments, the electronic processor 338 receives the relative output error signal (for example, yi(t)−yj(t)) from a neighboring agent vehicle. In some embodiments, the electronic processor 338 receives data from a neighboring vehicle (for example a position or a heading) and determine the relative output error signal (for example, yi(t)−yj(t)) based at least in part on the data received from the neighboring agent vehicle.


In some embodiments, the electronic processor 338 receives determines a self-state signal (for example, xi(t)) based on equations (206) and (207) described in the second embodiment.


In this disclosure, the cooperative output regulation problem of heterogeneous linear time-invariant multiagent systems over fixed directed communication graph topologies is described. Among other things, this disclosure provides a new definition of the linear cooperative output regulation problem (see Definition 2), which allows a broad class of functions to be tracked and rejected by a network of agents, and focused on an internal model based distributed control approach. For the three different distributed control laws (i.e., dynamic state feedback, dynamic output feedback with local measurement, and dynamic output feedback), global and local sufficient conditions are determined (see Theorems 1B, 2B, 3B, 4B, and 5B).


The approach in this disclosure is relevant, for example, to the linear cooperative output regulation problem with an internal model based distributed dynamic state feedback control law. This disclosure considers not only dynamic state feedback but also dynamic output feedback with local measurement and dynamic output feedback, where the output feedback stabilizability is not assumed. To prove the existence of a unique solution to the matrix equations that is important for the solvability of the problem, previously systems decompose these matrix equations, which consist of the overall dynamics of the multiagent system, into matrix equations, which deal with the dynamics of each agent separately. In contrast, Lemma 3B as described herein, which is also applicable to dynamic output feedback cases, guarantees that these matrix equations have a unique solution without the need to decompose them.


Various features and advantages are set forth in the following claims.

Claims
  • 1. A system for controlling motion of a vehicle in a group of vehicles, the system comprising a communication interface;a vehicle platform for travelling among the group of vehicles; andan electronic processor configured to determine a local virtual tracking error signal,determine a controller state signal, anddetermine a self-navigation input control signal based on the local virtual tracking error signal and the controller state signal,wherein the self-navigation input control signal is for navigating the vehicle platform when travelling as a member of the group of vehicles, wherein a trajectory of an exosystem is based on a boundedness condition, wherein the trajectory of the exosystem including external disturbances and a trajectory of a leader vehicle of the group of vehicles, wherein the vehicle communicates with other vehicles in the group of vehicles via a fixed augmented directed connected communication graph topology, wherein each vehicle in the group of vehicles is stabilizable, wherein each vehicle in the group of vehicles satisfies a transmission zero condition, wherein design matrices of the vehicle satisfy an internal model principle,wherein the self-navigation input control signal, ui(t), is defined by: ui(t)=Kizi(t),wherein Ki is a controller gain and zi(t), is the controller state signal at time, t, and wherein the controller state signal, zi(t), is defined by at least one of: żi(t)=M1izi(t)+M2ievi(t)wherein M1i and M2i are design matrices and evi(t) is the local virtual tracking error signal at the time, t,wherein the controller gain, Ki, is selected such that an agent-wise local gain condition is less than one, and wherein a locally-controlled agent matrix of the vehicle is Hurwitz, or żi(t)=M1izi(t)+M2ievi(t)+M3iymi(t),wherein M1i, M2i, and M3i are design matrices, evi(t) is the local virtual tracking error signal at the time, t, and ymi(t) is a local measurement output signal at the time, t,wherein the controller gain, Ki, is selected such that an agent-wise local gain condition is less than one, wherein a locally-controlled agent matrix of the vehicle is Hurwitz, and wherein an observer gain, Hi, is selected such that the locally-observed agent matrix on the vehicle is Hurwitz.
  • 2. The system of claim 1, wherein each vehicle of the group of vehicles is detectable based on an output matrix.
  • 3. The system of claim 1, wherein the controller state signal, zi(t), is defined by zi(t)=M1izi(t)+M2ievi(t)+M3iymi
  • 4. A method for controlling motion of a vehicle in a group of vehicles, the method comprising determining, with an electronic processor of the vehicle, a local virtual tracking error signal;determining, with the electronic processor, a controller state signal; anddetermining, with the electronic processor, a self-navigation input control signal based on the local virtual tracking error signal and the controller state signal,wherein the self-navigation input control signal is for navigating a vehicle platform of the vehicle when travelling as a member of the group of vehicles, wherein a trajectory of an exosystem is based on a boundedness condition, wherein the trajectory of the exosystem including external disturbances and a trajectory of a leader vehicle of the group of vehicles, wherein the vehicle communicates with other vehicles in the group of vehicles via a fixed augmented directed connected communication graph topology, wherein each vehicle in the group of vehicles is stabilizable, wherein each vehicle in the group of vehicles satisfies a transmission zero condition, wherein design matrices of the vehicle satisfy an internal model principle,wherein the self-navigation input control signal, ui(t), is defined by: ui(t)=Kizi(t),wherein Ki is a controller gain and zi(t) is the controller state signal at time, t, and wherein the controller state signal, zi(t), is defined by at least one of: żi(t)=M1izi(t)+M2ievi(t)wherein M1i and M2i are design matrices and evi(t) is the local virtual tracking error signal at the time, t,wherein the controller gain, Ki, is selected such that an agent-wise local gain condition is less than one, and wherein a locally-controlled agent matrix of the vehicle is Hurwitz, or żi(t)=M1izi(t)+M2ievi(t)+M3iymi(t),wherein M1i, M2i, and M3i are design matrices, evi(t) is the local virtual tracking error signal at the time, t, and ymi(t) is a local measurement output signal at the time, t,wherein the controller gain, Ki, is selected such that an agent-wise local gain condition is less than one, wherein a locally-controlled agent matrix of the vehicle is Hurwitz, and wherein an observer gain, Hi, is selected such that the locally-observed agent matrix on the vehicle is Hurwitz.
  • 5. The method of claim 4, wherein each vehicle of the group of vehicles is detectable based on an output matrix.
  • 6. The method of claim 4, wherein the controller state signal, zi(t), is defined by żi(t)=M1izi(t)+M2ievi(t)+M3iymi(t),
  • 7. The method of claim 4, further comprising determining, with the electronic processor, a self-state signal; anddetermining, with the electronic processor, the self-navigation input control signal based on the self-state signal.
  • 8. A system for controlling motion of a vehicle in a group of vehicles, the system comprising a communication interface;a vehicle platform for travelling among the group of vehicles; andan electronic processor configured to determine a local virtual tracking error signal,determine a controller state signal,determine a self-state signal, anddetermine a self-navigation input control signal based on the local virtual tracking error signal, the controller state signal, and the self-state signal,wherein the self-navigation input control signal is for navigating the vehicle platform when travelling as a member of the group of vehicles, wherein a trajectory of an exosystem is based on a boundedness condition, wherein the trajectory of the exosystem including external disturbances and a trajectory of a leader vehicle of the group of vehicles, wherein the vehicle communicates with other vehicles in the group of vehicles via a fixed augmented directed connected communication graph topology, wherein each vehicle in the group of vehicles is stabilizable, wherein each vehicle in the group of vehicles satisfies a transmission zero condition, wherein design matrices of the vehicle satisfy an internal model principle,wherein the self-navigation input control signal, u1(t), is defined by: ui(t)=K1ixi(t)+K2izi(t),wherein K1i and K2i are controller gains, xi(t) is the self-state signal at time, t, and zi(t) is the controller state signal at the time, t, wherein the controller state signal, zi(t), is defined by: żi(t)=G1izi(t)+G2ievi(t)wherein G1i and G2i are design matrices, and evi(t) is the local virtual tracking error signal at the time, t.
  • 9. The system of claim 8, wherein the vehicle platform comprises an underwater, ground, aerial, or space vehicle.
  • 10. The system of claim 8, wherein the group of vehicles travel over time in a one-dimensional spatial system, a two-dimensional spatial system, or a three-dimensional spatial system.
  • 11. The system of claim 8, wherein the internal model principle comprises an N p-copy internal model principle.
  • 12. The system of claim 8, wherein the group of vehicles travel over time to reach a common destination.
  • 13. The system of claim 12, wherein the common destination comprises a fixed destination or a destination that changes over time.
CROSS-REFERENCE TO RELATED APPLICATIONS

This application is a non-provisional of and claims benefit of U.S. Provisional Application No. 62/540,813, filed on Aug. 3, 2017, the entire contents of which are incorporated herein by reference.

STATEMENT REGARDING FEDERALLY SPONSORED RESEARCH OR DEVELOPMENT

This invention was made with government support CMMI1657637 awarded by the National Science Foundation. The Government has certain rights in the invention

US Referenced Citations (5)
Number Name Date Kind
6691151 Cheyer et al. Feb 2004 B1
7036128 Julia et al. Apr 2006 B1
20070203693 Estes Aug 2007 A1
20170329348 Li Nov 2017 A1
20180101169 Applewhite Apr 2018 A1
Non-Patent Literature Citations (31)
Entry
Yucelen et al., “Control of multivehicle systems in the presence of uncertain dynamics,” International Journal of Control, 2013, 86(9):1540-1553 (Year: 2013).
Adib Yaghmaie et al., “Output regulation of heterogeneous linear multi-agent systems with differential graphical game,” International Journal of Robust and Nonlinear Control, 2016, 26:2256-2278.
Adib Yaghmaie et al., “Output regulation of linear heterogeneous multi-agent systems via output and state feedback,” Automatica, 2016, 67:157-164.
Cai et al., “The adaptive distributed observer approach to the cooperative output regulation of linear multi-agent systems,” Automatica, 2017, 75:299-305.
Cao et al., “Leader-follower consensus of linear multi-agent systems with unknown external disturbances,” Systems & Control Letters, 2015, 82:64-70.
Fiedler et al., “On matrices with non-positive off-diagonal elements and positive principal minors,” Czechoslovak Mathematical Journal, 1962, 12(3):382-400.
Francis et al., “The internal model principle of control theory,” Automatica, 1976, 12(5).
Huang et al., “Cooperative output regulation of heterogeneous multi-agent systems: an H∞ criterion,” IEEE Transactions on Automatic Control, 2014, 59(1):267-273.
Huang et al., “On a robust nonlinear servomechanism problem,” IEEE Transactions on Automatic Control, 1994, 39(7):1510-1513.
Kofman, “Non conservative ultimate bound estimation in LTI perturbed systems,” Automatica, 2005, 41:1835-1838.
Kottenstette et al., “On relationships among passivity, positive realness, and dissipativity in linear systems,” Automatica, 2014, 50(4): 18 pages.
Li et al., “Distributed robust consensus control of multi-agent systems with heterogeneous matching uncertainties,” Automatica, 2014, 50(3): 883-889.
Li et al., “Distributed tracking control for linear multiagent systems with a leader of bounded unknown input,” IEEE Transactions on Automatic Control, 2013, 58(2):518-523.
Li et al., “Synchronised output regulation of leader-following heterogeneous networked systems via error feedback,” International Journal of Systems Science, 2016, 47(4): 755-764.
Modares et al., “Optimal model-free output synchronization of heterogeneous systems using off-policy reinforcement learning,” Automatica, 2016, 71:334-341.
Moylan et al., “On the stability and well-posedness of interconnected nonlinear dynamical systems,” IEEE Transactions on Circuits and Systems, 1980, 27(11):1097-1101.
Olfati-Saber et al., “Consensus and cooperation in networked multi-agent systems,” Proceedings of the IEEE, 2007, 95(1):215-233.
Peng et al., “Cooperative tracking and estimation of linear multi-agent systems with a dynamic leader via iterative learning,” International Journal of Control, 2014, 87(6): 1163-1171.
Sarsilmaz et al., “On control of heterogeneous multiagent systems with unknown leader dynamics,” ASME Dynamic Systems and Control Conference, 2017.
Sarsilmaz et al., “On control of heterogeneous multiagent systems: A dynamic measurement output feedback approach,” in American Control Conference, 2018, 6 pages.
Shivakumar et al., “A sufficient condition for nonvanishing of determinants,” Proceedings of the American Mathematical Society, 1974, 43(1):63-66.
Sontag, “Input to state stability: Basic concepts and results,” Nonlinear and Optimal Control Theory, 2006, pp. 163-220.
Sontag, “The ISS philosophy as a unifying framework for stability-like behavior,” Nonlinear Control in the Year 2000, 2000, pp. 443-468.
Su et al., “Cooperative output regulation of linear multi-agent systems by output feedback,” Systems & Control Letters, 2012, 61(12):1248-1253.
Su et al., “Cooperative output regulation of linear multiagent systems,” IEEE Transactions on Automatic Control, 2012, 57(4): 1062-1066.
Tang, “Leader-following coordination problem with an uncertain leader in a multi-agent system,” IET Control Theory and Applications, 2014, 8(10): 773-781.
Tran et al., “On control of multiagent formations through local interactions,” in IEEE Conference on Decision and Control, 2016.
Wang et al., “A Distributed Control Approach to a Robust Output Regulation Problem for Multi-Agent Linear Systems,” IEEE Transactions on Automatic Control, 2010, 55(12): 2891-2895.
Wieland et al., “An internal model principle is necessary and sufficient for linear output synchronization,” Automatica, 2011, 47(5): 1068-1074.
Willems, “The generation of Lyapunov functions for input-output stable systems,” SIAM Journal on Control, 1971, 9(1):105-134.
Yucelen et al., “Control of multivehicle systems in the presence of uncertain dynamics,” International Journal of Control, 2013, 86(9):1540-1553.
Provisional Applications (1)
Number Date Country
62540813 Aug 2017 US