The embodiments herein generally relate to coalition formation techniques, and more particularly to target detection and subsequent task completion in a coalition of devices.
Recent advancements in communication and computation systems allow deployment of large teams of Unmanned Aerial Vehicles (UAVs) to cooperatively accomplish complex missions that often cannot be performed by a single. Several features including task coordination and reliable communications are typically required to enable interoperability within the heterogeneous airborne networks, particularly for autonomous operations and providing on-board data processing during the mission. These heterogeneous autonomous vehicle systems could provide a great flexibility to complete compound tasks which are distributed in time and space.
The task allocation problem in multi-agent systems is defined as the process of allocating a set of tasks to groups of agents to ensure timely and efficient task completions, noting the individual capabilities of the agents. Previous solutions to the complex problem of task assignment have involved using different approaches including mixed integer linear programming, dynamic network flow optimization, market-based strategy, finite state machine, and multiple-choice knapsack problem. While in the majority of the aforementioned solutions, the tasks are centrally assigned to the agents by a base station that has complete knowledge about the tasks and often the agents' capabilities; in many dynamic systems, tasks may appear at unpredictable locations and times (e.g., target detection in military settings, as well as search and rescue operations).
Hence, a priori knowledge about these tasks is not always available to the base station. Even when such centralized task allocation techniques exist, they often require computationally intensive solutions even in homogeneous networks and are not easily scalable to systems with a large number of tasks or agents. Noting the computation and communication capabilities of modern devices, the agents can be considered as smart entities with decision-making capabilities. Such cognitive capability may facilitate the implementation of distributed task allocation mechanisms by allowing the agents to observe the environment and monitor the operation of other agents and properly respond to the observed situations.
Coalition formation game is a class of games, in which the players cooperate with each other by forming various sub-groups called coalitions. This class of games has been recently used in various applications such as task assignment in multi-agent systems, and communication networks. In conventional solutions of coalition formation for task allocation, the objective is to enhance the efficiency of the formed groups in task performance noting the different capabilities available at the coalition members. Therefore, the agents often consider a solution to be optimal when it maximizes the total utilities of the group in executing the existing tasks with minimum resources. In these solutions, it is assumed that all the agents are fully trustable, and they are obligated to cooperate with one another by utilizing their initially claimed resources to complete the tasks. However, this assumption is far too optimistic since cooperation is not an inherent characteristic of cognitive but potentially self-interested agents. Accordingly, a new solution to cooperative coalition formation is desirable.
In view of the foregoing, an embodiment herein provides an integrated decision making and communication system comprising a memory to store a list of resources necessary to execute a mission; a transceiver to send and receive data between communicatively linked devices; and a processor to identify a set of available resources capable of executing the mission based on the data received from the devices; compare the list of resources necessary to execute the mission from the memory with the set of available resources; and identify a combination of the devices to execute the mission based on the comparison of the list of resources necessary to execute the mission and the set of available resources.
The processor may instruct the transceiver to send messages to the identified combination of the devices to execute the mission. The system may further comprise a first device containing the memory, the transceiver, and the processor to autonomously store, transceive, and process the data without control by a centralized base station; and a plurality of other functionally similar devices communicatively linked to the first device. The processor of the first device groups the plurality of other functionally similar devices based on possible combinations of devices comprising resources capable of executing the mission. The processor of the first device selects the group of the plurality of other functionally similar devices comprising a most efficient utilization of the available resources to execute the mission in a predetermined period of time.
The transceiver of the first device may transmit a proposal to the plurality of other functionally similar devices to request the plurality of other functionally similar devices to combine their available resources to execute the mission; and receives responses from the plurality of other functionally similar devices of whether to they agree to utilize their available resources to execute the mission. The devices may comprise unmanned aerial vehicles (UAVs).
Another embodiment provides a machine-readable storage medium comprising computer-executable instructions that when executed cause a processor of a first UAV to determine resources used to execute a mission; generate a set of available resources capable of executing the mission from a group of UAVs in communication with the first UAV; compare the resources used for executing the mission with the set of available resources; and identify a combination of the UAVs to assist in executing the mission based on the comparison of the resources used for executing the mission and the set of available resources.
In the machine-readable storage medium, wherein the instructions, when executed, further cause the processor to identify the group of UAVs; and transmit a request to the group of UAVs to send to the first UAV the available resources for each of the UAVs in order to generate the set of available resources capable of executing the mission. In the machine-readable storage medium, wherein the instructions, when executed, further cause the processor to generate the set of available resources capable of executing the mission based on an assessment of which combination of the available resources from the group of UAVs best maximizes an efficiency in executing the mission.
In the machine-readable storage medium, wherein the instructions, when executed, further cause the processor to filter out any UAVs from the group of UAVs based on a past performance of executing other missions. In the machine-readable storage medium of claim 8, wherein the instructions, when executed, further cause the processor to filter out any UAVs from the group of UAVs based on a real-time assessment of a performance of any of the UAVs utilization of its available resources in executing the mission.
In the machine-readable storage medium, wherein the instructions, when executed, further cause the processor to filter out any UAVs from the group of UAVs that decline to offer available resources in executing the mission. In the machine-readable storage medium, wherein the instructions, when executed, further cause the processor to select the combination of the UAVs to assist in executing the mission; and transmit a coalition message to the selected UAVs to execute the mission. In the machine-readable storage medium, wherein the instructions, when executed, further cause the processor to generate the set of available resources capable of executing the mission based on assigning a predetermined value to each of the available resources provided by each of the UAVs; and determine which combination of UAVs corresponds to the predetermined value of the available resources to complete the mission without overspending the available resources.
Another embodiment provides a method of autonomously assessing a cooperative performance of a task, the method comprising operating a first UAV to search for a target; upon locating the target, assessing a requirement for the first UAV to perform the task relating to the target; transmitting a first electronic message to other UAVs capable of assisting the first UAV in performing the task; receiving at least one second electronic message from the other UAVs identifying available resources from each of the other UAVs to perform the task; and determining which combination of the other UAVs is best capable of assisting the first UAV in performing the task based on a predefined criterion.
The method may further comprise selecting the combination of the other UAVs that is best capable of assisting the first UAV in performing the task; and transmitting a third electronic message to the other UAVs requesting the selected combination of the other UAVs to assist the first UAV in performing the task. The method may further comprise performing the task using the first UAV with the selected combination of the other UAVs. The method may further comprise selecting the combination of the other UAVs based on any of a time required to perform the task, resource allocation capable of being provided by the other UAVs, a trust factor associated with the other UAVs, and a likelihood of performance of the task based on the available resources from each of the other UAVs. Each of the other UAVs may comprise a different available resource to assist the first UAV in performing the task.
These and other aspects of the embodiments herein will be better appreciated and understood when considered in conjunction with the following description and the accompanying drawings. It should be understood, however, that the following descriptions, while indicating preferred embodiments and numerous specific details thereof, are given by way of illustration and not of limitation. Many changes and modifications may be made within the scope of the embodiments herein without departing from the spirit thereof, and the embodiments herein include all such modifications.
The embodiments herein will be better understood from the following detailed description with reference to the drawings, in which:
Embodiments of the disclosed invention, its various features and the advantageous details thereof, are explained more fully with reference to the non-limiting embodiments that are illustrated in the accompanying drawings and detailed in the following description. Descriptions of well-known components and processing techniques are omitted to not unnecessarily obscure what is being disclosed. Examples may be provided and when so provided are intended merely to facilitate an understanding of the ways in which the invention may be practiced and to further enable those of skill in the art to practice its various embodiments. Accordingly, examples should not be construed as limiting the scope of what is disclosed and otherwise claimed.
According to various examples, the embodiments herein provide a leader-follower coalition system in a UAV network that identifies the required resources necessary to carry out a task; e.g., perform a mission. The mission may be any suitable type of mission including military tasks related to targets, search-and-rescue and surveillance, etc. A leader UAV sends a call out to other UAVs in the network asking them to form a coalition based on the available resources that the other UAVs have to assist in completing the task/mission. The leader UAV determines which group of other UAVs can most efficiently complete the task/mission based on comparing the available resources of the other UAVs and the required resources necessary to complete the task/mission. The leader UAV then selects the group of UAVs that meets the efficiency criteria.
Referring now to the drawings, and more particularly to
In some examples, the various devices and processors described herein and/or illustrated in the figures may be embodied as hardware-enabled modules and may be configured as a plurality of overlapping or independent electronic circuits, devices, and discrete elements packaged onto a circuit board to provide data and signal processing functionality within a computer. An example might be a comparator, inverter, or flip-flop, which could include a plurality of transistors and other supporting devices and circuit elements. The modules that are configured with electronic circuits process computer logic instructions capable of providing digital and/or analog signals for performing various functions as described herein. As used herein, the modules may refer to any component or set of components that perform the functionality attributed to the module, and may include a physical processor to execute processor-readable instructions. Additionally, the modules may include the processor, the processor readable instructions, circuitry, hardware, storage media, other components, and combinations thereof. The various functions can further be embodied and physically saved as any of data structures, data paths, data objects, data object models, object files, database components. For example, the data objects could be configured as a digital packet of structured data. The data structures could be configured as any of an array, tuple, map, union, variant, set, graph, tree, node, and an object, which may be stored and retrieved by computer memory and may be managed by processors, compilers, and other computer hardware components. The data paths can be configured as part of a computer CPU that performs operations and calculations as instructed by the computer logic instructions. The data paths could include digital electronic circuits, multipliers, registers, and buses capable of performing data processing operations and arithmetic operations (e.g., Add, Subtract, etc.), bitwise logical operations (AND, OR, XOR, etc.), bit shift operations (e.g., arithmetic, logical, rotate, etc.), complex operations (e.g., using single clock calculations, sequential calculations, iterative calculations, etc.). The data objects may be configured as physical locations in computer memory and can be a variable, a data structure, or a function. In the embodiments configured as relational databases (e.g., such Oracle® relational databases), the data objects can be configured as a table or column. Other configurations include specialized objects, distributed objects, object-oriented programming objects, and semantic web objects, for example. The data object models can be configured as an application programming interface for creating HyperText Markup Language (HTML) and Extensible Markup Language (XML) electronic documents. The models can be further configured as any of a tree, graph, container, list, map, queue, set, stack, and variations thereof. The data object files are created by compilers and assemblers and contain generated binary code and data for a source file. The database components can include any of tables, indexes, views, stored procedures, and triggers.
The system 10 comprises a transceiver 30 to send and receive data 35 between communicatively linked devices 401 . . . 40x. As used herein, the subscript x refers to any positive integer. In an example, the transceiver 30 may be an electronic device, circuit, or module that is part of a cellular system, and which enables the exchange of data 35 from among networked devices; e.g., devices 401 . . . 40x. The data 35 may comprise any type of qualitative and quantitative data capable of being transmitted through wired or wireless transfer protocols, and the data 35 may be encrypted.
The communication that occurs in the system 10 may occur through wired communication or wirelessly, in some examples, in an integrated network of the devices 401 . . . 40x. In some examples, the devices 401 . . . 40x may be operatively linked via one or more electronic communication links. For example, such electronic communication links may be established, at least in part, via a communication network such as the Internet and/or other communication networks.
The system 10 further comprises a processor 45. In some examples, the processor 45 may comprise a central processing unit (CPU) of the one or more devices 401 . . . 40x. In other examples the processor 45 may be a discrete component independent of other processing components in the one or more devices 401 . . . 40x. In other examples, the processor 45 may be a microprocessor, microcontroller, hardware engine, hardware pipeline, and/or other hardware-enabled device suitable for receiving, processing, operating, and performing various functions required by the one or more of the devices 401 . . . 40x. Furthermore, the processor 45 may be configured to execute modules by software, hardware, firmware, or combinations thereof, or other mechanisms for configuring processing capabilities. In some examples, the processor 45 may be programmable and may be updated in real-time. In accordance with the embodiments herein, the memory 15, transceiver 30, and processor 45 are contained within any of the devices 401 . . . 40x. In some examples, each of the devices 401 . . . 40x comprise their own memory 15, transceiver 30, and processor 45 and are each capable of performing the functions associated with these mechanisms as described herein.
The processor 45 is configured to identify a set of available resources 50 capable of executing the mission 25 based on the data 35 received from the devices 401 . . . 40x. For example, the transceiver 30 may receive the data 35 from the devices 401 . . . 40x which contains all of the collective available resources 50 from the devices 401 . . . 40x. This data 35 is directly or indirectly transmitted to the processor 45 either by wired or wireless transmission. The processor 45 analyzes the data 35 to determine which of the available resources 50 are capable of executing the mission 25. Moreover, the processor 45 is configured to compare the list of resources 20 necessary to execute the mission 25 from the memory 15 with the set of available resources 50. In this regard, the processor 45 retrieves the list of resources 20 from the memory 15 and compares it with the set of available resources 50 provided by the devices 401 . . . 40x. Accordingly, the processor 45 is to determine the resources 20 that are required to execute the mission 25 and compares it with the set of available resources 50 from the devices 401 . . . 40x to determine whether the devices 401 . . . 40x are capable of providing the necessary resources 50 to accomplish the mission 25. Since the list of resources 20 indicate what is necessary (e.g., which resources are required) to perform the mission 25, the processor 45 compares the data 35 containing the available resources 50 with the list of resources 20.
As an alternative to the static configuration described above, in an example of a dynamic or real-time configuration, when the mission 25 changes and the list of resources 20 necessary to perform the mission 25 also changes, or when the devices 401 . . . 40x begin utilizing and/or exhausting the available resources 50 in performing the mission 25, then the transceiver 30 may continue to receive data 35 from the devices 401 . . . 40x and transmit the data 35 to the processor 45 to allow the processor 45 to compare the changing list of resources 20 with the changing set of available resources 50.
Furthermore, in either the static or dynamic configuration, the processor 45 is configured to identify a combination of the devices 40a to execute the mission 25 based on the comparison of the list of resources 20 necessary to execute the mission 25 and the set of available resources 50. In this regard, the combination of devices 40a may be any combination of the devices 401 . . . 40x based on the ability to execute the mission 25 in view of the set of available resources 50 of the devices 401 . . . 40x and the list of resources 20 necessary to execute the mission 25. In one example, the combination of devices 40a may be only one of the devices 401 . . . 40x and in another example, the combination of devices 40a may be the entire set of devices 401 . . . 40x, and in still another example, the combination of devices 40a may be only some of the devices 401 . . . 40x.
The system 10 may comprise a plurality of other functionally similar devices 402 . . . 40x communicatively linked to the first device 401. In this regard, the devices 402 . . . 40x are functionally similar to the first device 401. Additionally, the other functionally similar devices 402 . . . 40x may each comprise a similar memory 15, transceiver 30, and processor 45 as the first device 401. Accordingly, any of the devices 401 . . . 40x may be considered as the leader device in the coalition of the devices 401 . . . 40x. The first device 401 together with the other functionally similar devices 402 . . . 40x may collectively form the coalition of devices 401 . . . 40x such that each of the devices 401 . . . 40x may be communicatively linked with each other. As such, communicatively linking the devices 401 . . . 40x may include wired or wireless linking of the devices 401 . . . 40x, and may further include direct or indirect connections therebetween. Regardless, of the type of connection between the devices 401 . . . 40x, the devices 401 . . . 40x are configured to send/receive data 35 and messages 55 to one another autonomously; i.e., without control by a centralized base station 60.
As used in the nomenclature herein, vectors and matrices are represented by lower-case and upper-case underlined letters, respectively. The notations (.)*, (.)T and (.)H demonstrate the conjugate, transpose and conjugate transpose (Hermitian) operations, respectively. The real value and imaginary values are shown by {.} and ℑ{.}. The diagonal matrix A=diag{a} is a matrix whose diagonal elements are the elements of the vector a. Finally, the function γL,ϵ(x) for x>0 is defined as:
for a given value of L and small value of ∈, giving: γL,ϵ(x)=γL,O(x).
A heterogeneous system of NUAVs, U={U1, U2, . . . , UN} is considered, where the UAVs (e.g., UAVs 701 . . . 70x) can form various coalitions 70a to accomplish the dynamic tasks in the network 75. Suppose that the ith UAV is assigned with a capability vector r=[ri1, ri2 . . . , riN
A coalition of users, named Sk, is a non-empty subset of UAVs, Sk⊂U that handles task k. It is assumed that each coalition of UAVs (e.g., coalition of UAVs 70a) will handle one target 80 at a time. Moreover, a sparse distribution of targets 80 in the environment is considered, hence noting the potential distances among the formed coalitions 70a, it is assumed that each UAV (e.g., UAVs 701 . . . 70x) can be only a member of one of these coalitions 70a at a certain time. Hence, the coalitions 70a are non-overlapping, meaning that Sk∩Sl=ϕ. The coalition of all UAVs, U is the grand coalition and a coalition that only contains one UAV is called a singleton coalition. Each coalition Sk is associated with a vector of available resources which is shown by Rk=[Rk1, Rk2, . . . , RkN
It is assumed that the UAVs 701 . . . 70x operate in an unpredictable environment, where targets 80 of various types with different resource requirements appear in random time and locations and move freely afterwards. Therefore, a base station 60 is not aware of the potential targets 80. It is assumed that all the UAVs have the capability of searching for new targets in a limited geographical field, and the target detection is performed by the UAVs independent (i.e., autonomous) of the base station 60. The procedure of prosecuting a target 80 is defined as a compound task to be accomplished by a coalition, such that the term task can refer to a set of multiple sub-tasks required to prosecute a target 80. The tasks can differ inherently based on the characteristics of their corresponding targets 80; hence the number, duration, and location of tasks vary over time. Since each task is associated to a target 80, the terms target and task may be used interchangeably herein.
When a UAV (e.g., UAV 701) detects a task k, it determines the type and amount of the resources 20 required to carry out this task; i.e., Γk=[τk1, τk2, . . . , τkN
Another criterion in coalition formation from the perspective of the leaders (e.g., UAV 701) is the deadline to complete a task. Hence, the leaders (e.g., UAV 701) take into account the traveling time of the UAVs 701 . . . 70x to the task in order to choose the members of the coalition 70a. For this purpose, δi,k is defined as the traveling time of the ith member of the coalition to target k location that should be less than or equal to a preferred threshold fδ(ρ) as a function of field radius ρ.
One aspect of the embodiments herein is to identify the reliable UAVs (e.g., UAVs in coalition 70a) in the network 75 by the leaders (e.g., UAV 701) to join the coalition(s) and filter out any UAVs (e.g., UAV 71) with selfish behavior. Such networks 75 may be prone to suffer from both malicious/intruding UAVs as well as selfish ones. Accordingly, the technique provided by the embodiments herein prevent potential selfish behavior of legitimate non-altruistic UAVs in not spending their resources after joining a coalition to encourage cooperation among them. This is facilitated by defining a cumulative cooperation credit for each UAV that indicates its cooperative behavior in terms of resource sharing during task completion over the course of time. On one hand, since the leaders (e.g., UAV 701) prefer to select the potential followers (e.g., UAVs 702 . . . 70x) with higher credits, the UAVs 702 . . . 70x are motivated to maintain a good cooperative credit, as further described below. On the other hand, the follower UAVs 702 . . . 70x similar to other types of cognitive agents may act selfishly and avoid consuming their resources after joining a coalition 70a. The incentive behind this behavior is to save their limited available resources for future missions to earn more monetary benefits.
The coalition formation process may occur as follows: First, the coalition leader (e.g., UAV 701) broadcasts a proposal 65 to form a coalition 70a. Then, the UAVs who possess at least one of the required resources 20 can provide a response 66 to this request by reporting their available resources 50 as well as their current positions. This may be referred to as the bid process. The coalition leader (e.g., UAV 701) then evaluates all the bids by assessing the resources offered by the volunteer UAVs (e.g., UAVs 702 . . . 70x), their estimated arrival time, their cooperative credits as well as the provided QoSs for the relaying services during the formation process to determine if a coalition 70a can be formed to complete the encountered task. If a coalition 70a cannot be formed, the coalition leader (e.g., UAV 701) informs the UAVs (e.g., UAVs 702 . . . 70x), otherwise the coalition leader (e.g., UAV 701) provides the selected UAVs in the coalition 70a with information about the tasks (e.g., required resources and location of the target 80). The UAVs 71 that are not selected by the coalition leader (e.g., UAV 701) go back to the search mode. When a UAV receives multiple coalition formation requests, it considers two factors of expected increase in its cooperative credit based on information it received from the leaders (e.g., UAV 701) about required resources 20 for the tasks as well as its distance to the targets 80 in order to decide which coalition 70a to join. The coalition formation process is summarized in
Amplify-and-Forward (AF) is a widely used relaying method in communication networks due to its simplicity and low-complexity. Beamforming is a technique based on AF relaying, where a set of relay nodes amplify and shift the phase of a transmitter's signal and rebroadcast it such that they add up constructively, while interfering signals add up destructively. In order to relay the message from the target 80 to the ground or base station 60, a beamforming scenario is provided in which each UAV in the coalition 70a multiplies the received target's signal to a complex weight number and rebroadcasts it (e.g., AF relaying). A common assumption of knowledge of Channel State Information (CSI) in the system 10 is followed in this technique. It is assumed that
is the vector of instantaneous channel coefficients between the target and the coalition members, where NS=|S|−1. Likewise,
the vector of channel coefficients between the coalition members and the base station 60. Here, the index 0 represents the leader and the indices i=1, 2, . . . , NS represent the members of the coalition 70a. It is assumed that the leader (e.g., UAV 701) has the knowledge of instantaneous reciprocal channel vectors; i.e., hTU and hUB) and is responsible for calculating the optimum beamforming and notifying the members of its coalition 70a.
If ST denotes the transmit signal, the vector of the received signal at the coalition can be written as:
x=h
TU
s
T
+n (1)
where n˜(0, Σ1/2), with diagonal covariance matrix Σ, is an additive zero mean Gaussian noise vector at the coalition members. It is assumed that each UAV 701 . . . 70x is aware of its local noise characteristics and performs the whitening transformation process before the beamforming. Hence, the whitened received signal at the UAVs 701 . . . 70x can be written as:
{tilde over (x)}=Σ
−1/2
x=Σ
−1/2
h
TU
s
T+Σ−1/2n, (2)
The ith coalition member, i=0, 1, . . . , NS, multiplies the received target signal by a complex weight wi* and then relays it. The broadcasted signal by the members of the coalition 70a can be written as follows, assuming w=[w0, w1, . . . , wN
t=W
H
{tilde over (x)}=W
HΣ−1/2hTUsT+WHΣ−1/2n (3)
where W=diag(w). The received signal at the base station 60 is:
where, HUB=diag(hUB) and v˜M(0, σ2) is white Gaussian noise at the base station receiver. Therefore, the Signal-to-Noise Ratio (SNR) at the base station 60 can be expressed by:
where k=HUBHΣ−1/2hTU. Each UAV 701 . . . 70x (including the leader UAV 701) has a limited energy to forward the target's signal. Using Equation (3), the power consumption at each member of the coalition 70a for relaying the target's signal can be written as:
P
i,R=[wH]i[Σ−1/2]i,i[hTU]i[hTUH]i[Σ−1/2]i,i[w]i+[wH]i[w]i (6)
where i=0, 1, 2, . . . , NS. It is assumed that the individual transmission power of UAV i is below a certain threshold, denoted by pimax.
To monitor the cooperative behavior of the UAVs 701 . . . 70x, a cumulative cooperative credit is defined for each UAV 701 . . . 70x based on the amount of resources 501 . . . 50x that it utilizes for a specific task. It is assumed that the credit of all UAVs 701 . . . 70x are initialized to an equal initial credit Ci(0)=C>0, i=1, 2, . . . , N. After completing a task, each UAV's credit is calculated using the following steps:
First, the credit of UAV i at time n is updated as:
Here, the change in credit Ci(n) is defined by:
where τk=Σj=1N
denotes the effective resource contribution of each UAV 701 . . . 70x.
Second, if all {tilde over (C)}i(n) for I=1, 2, . . . , N are equal, then the process sets Ci(n)=C to reset the credits. Otherwise:
where {tilde over (C)}max=maxi=1, 2, . . . , N{{tilde over (C)}i(n)} and {tilde over (C)}min=mini=1, 2, . . . , N{{tilde over (C)}i(n)}.
Therefore, the credits in each step are scaled to values within [0,C] range. After the completion of each task, the updated credits are broadcasted by the coalition leader (e.g., UAV 701) to be used for future coalition formations. For the sake of simplicity, the superscripts (n) are dropped hereafter.
In the coalition formation method provided by the embodiments herein, first the leaders (e.g., UAV 701) select their coalition members (e.g., UAVs 70a) among the available candidates (e.g., UAV 701 . . . 70x) to maximize their corresponding coalition's utility function to enhance the efficiency in completing their encountered task noting the resource constraints. The coalition value for the kth leader is defined in such a way to: i) assure the existence of required resources 20 to handle a task, ii) avoid over-spending the resources 50 on a specific task, iii) guarantee the timely completion of the task, iv) provide the required quality of communication to relay target's message, and also v) select the reliable UAVs as follows:
for a large enough positive value of L. Here, the design parameters α1, α2, and α3>0 represent the importance of the credit and quality of communication compared to the resource optimization goal for the leader UAV 701. Also, SNRThr represents the minimum required SNRS
In order to find the optimal coalition which maximizes the leader's utility function in Equation (10), a search over all 2L
The coalition formation process may comprise:
When the optimal coalitions from the leaders' perspective are formed, the formation requests will be sent out to the selected UAVs. If a potential follower receives multiple requests from different leaders, it prefers to join the coalition which benefits it the most. The utility function of the followers is defined as:
v
p(U|i,Sk)=ΔCi−α4δi,k, (11)
where ΔCi and δi,k are the expected credit (knowing the required resources for the encountered task) and traveling time to task k if user Ui joins the coalition Sk, respectively. Here, α4 is a design parameter that indicates the importance of the traveling cost compared to the change in credit.
The coalition formation process includes a series of merge-and-split coalition formation steps. After each stage of merge-and-split coalition formation, the leader (e.g., UAV 701) sends out the requests 55 and collects the followers' responses. If all responses are affirmative, the coalition formation process stops; otherwise, a new merge-and-split is executed that keeps the current members with positive responses, and evaluates the new available UAVs.
As such, in order to show the stability of the process, it is sufficient to show that: i) the number of sequential rounds of the process is finite, and ii) each merge-and-split stage is stable. The first is ensured, because at each round, the members with no interest in joining the formed coalition 70a are excluded, and therefore after at most ms iterations the process stops, where ms is the number of UAVs with a negative response to join a coalition 70a. The second condition is also satisfied since the formed coalitions by the leaders (e.g., UAV 701) are Dhp-stable. This is due to the fact that the only type of allowed membership changes are based on single or possibly multiple merge-and-splits (i.e., a UAV or a group of them are only allowed to leave a partition by means of merges or splitting).
The optimization from the leaders' perspective is to maximize the coalition value in Equation (10) by searching over the coalitions (i.e., S) and determine the optimum beamforming scheme; i.e., w to optimize SNRB. For each coalition, the optimal value of Equation (5), SNRSopt can be obtained via the following inner optimization problem:
A bisection method may be used to solve the fractional Quadratically Constrained Quadratic Program (QCQP)s such as the optimization problem in Equation (12) that can be rewritten as:
where t is an auxiliary variable. In the bisection method, given the value of t≥0, the following feasibility check problem is investigated:
Find w[wH]i([Σ−1/2]i,i[hTU]i[hTUH]i[Σ−1/2]i,i+1)[w]i≤Pimax,i=0,1, . . . ,NcwHkkHw≥twHHUBHHUBw+tσ2. (14)
If the problem described in Equation (14) is feasible, then the optimal solution of the problem in Equation (13); i.e., SNRSopt, is less than or equal to t; otherwise SNRSopt≥t. The variable t is up limited by tup which is obtained in the following Lemma:
Lemma: tup=kHQ−1k is an up limit for objective in optimization problem Equation (12). The optimization problem becomes feasible if and only if the matrix kkH−tHUBHHUB is non-negative definite. Since the matrix Q=HUBHHUB is positive semi-definite, the matrix Q−1/2(kkH−tHUBHHUB)Q−1/2=Q−1/2kkHQ−1/2−tI must be non-negative definite. That results in t≤λmax(Q−1/2kkHQ−1/2)=kHQ−1k.
Using the above Lemma and by assuming δ as the absolute precision of the final objective function value, the iteration's complexity order of the bisection method can be written as
The feasibility problem of Equation (14) can be solved using the Semi-Definite Programming (SDP) relaxation method. One point to notice is whether wopt is an optimal solution to the feasibility problem of Equation (14), then for any arbitrary real number θ, {tilde over (w)}=woptej<θ is also an optimal solution. Therefore, it is possible to assume that wHHUBΣ−1/2hTU is a non-negative real number. By considering this constraint, the feasibility problem of Equation (14) can be rewritten as the following SOCP problem for a given t≥0:
Find w[wH]i([Σ−1/2]i,i[hTU]i[hTUH]i[Σ−1/2]i,i+1)[w]i≤Pimax,i=0,1, . . . ,NcwHk≥√{square root over (twHHUBHHUBw+tσ2)}{wHHUBHΣ−1/2hTU}≥0I{wHHUBHΣ−1/2|hTU}=0. (15)
The feasibility check problem in Equation (15) can be effectively solved using the cvx convex optimization toolbox. By considering the cubic complexity order of the SOCP method, the complexity order of the communication optimization can be expressed as
Processor 45 may fetch, decode, and execute computer-executable instructions 115, 120, 125, 130, 135, 140, 145, 150, 155, 160, 165, 170, 175, and 180 to enable execution of locally-hosted applications for controlling action of the first UAV 701. As an alternative or in addition to retrieving and executing instructions, processor 45 may include one or more electronic circuits including a number of electronic components for performing the functionality of one or more of the instructions 115, 120, 125, 130, 135, 140, 145, 150, 155, 160, 165, 170, 175, and 180.
The machine-readable storage medium 110 may be any electronic, magnetic, optical, or other physical storage device that stores computer-executable instructions 115, 120, 125, 130, 135, 140, 145, 150, 155, 160, 165, 170, 175, and 180. Thus, the machine-readable storage medium 110 may be, for example, Random Access Memory, an Electrically-Erasable Programmable Read-Only Memory, volatile memory, non-volatile memory, flash memory, a storage drive (e.g., a hard drive), a solid-state drive, optical drive, any type of storage disc (e.g., a compact disc, a DVD, etc.), and the like, or a combination thereof. In one example, the machine-readable storage medium 110 may include a non-transitory computer-readable storage medium.
In an example, the processor 45 of the first UAV 701 executes the computer-executable instructions 115, 120, 125, 130, 135, 140, 145, 150, 155, 160, 165, 170, 175, and 180. In
In
As shown in
In
As indicated in
The method 200 in
As shown in
As shown in
In this regard, the first UAV 701 may consider several of these predefined criterion, among others, to create the coalition or combination of UAVs 70a to perform the task (i.e., mission 25). The time T may be set based on a desired time established by the first UAV 701, base station 60, or system programmer associated with the system 10. The resource allocation may include the available resources 502 . . . 50x associated with the other UAVs 702 . . . 70x as well as the available resources 501 of the first UAV 701 and what the optimum resource allocation is amongst the available resources 501 . . . 50x to satisfy the required or necessary resources 20 without overspending (i.e., overusing, duplication, or otherwise wasting) the available resources 501 . . . 50x or with only a minimal amount of overspending of the available resources 501 . . . 50x. The trust factor may involve the reputation of the other UAVs 702 . . . 70x, which may be based on their respective performance(s) on prior missions 25a, and the extent that the first UAV 701 can trust that the other UAVs 702 . . . 70x will be able to perform the current task (i.e., mission 25). The likelihood of performance of the task (i.e., mission) may involve the first UAV 701 determining that the task (i.e., mission 25) will likely be successfully accomplished based on the available resources 501 . . . 50x and a comparison to the required or necessary resources 20 for performing the task (i.e., mission 25).
Furthermore, each of the other UAVs 702 . . . 70x may comprise a different available resource 502 . . . 50x compared to one another to assist the first UAV 701 in performing the task (i.e., mission 25). Alternatively, the other UAVs 702 . . . 70x may comprise the same or an overlapping set of available resource 502 . . . 50x compared to one another to assist the first UAV 701 in performing the task (i.e., mission 25). Moreover, the first UAV 701 may comprise available resources 501 that are the same, different, or overlapping to the available resources 502 . . . 50x associated with any of the other UAVs 702 . . . 70x.
In order to validate the performance of the method 200, a simulated experiment of two leader (i.e., two UAVs 701) and six follower (i.e., six other UAVs 702 . . . 70x) is considered. The UAVs 701 . . . 70x are uniformly located in a 3 region. It is assumed that the leaders (UAVs 701) do not carry out the required resources 20 to perform the identified tasks (i.e., missions 25) individually and they call out to form coalitions of UAVs 70a. Five types of resources are considered and the amount of each resource at each UAV 701 . . . 70x as well as the required resources 20 for each task (i.e., mission 25) are generated randomly. The traveling time is generated proportional to the distance of the UAVs 701 . . . 70x to the corresponding target 80. All communication channels are generated by zero mean Gaussian variables with variance proportional to the inverse distance between the corresponding transmitter and receiver antennas.
Table 1 shows the comparison of available resources in coalition S1 versus the required resources to complete the task encountered by this coalition. As shown in Table 1, the summation of available resources in coalition S1 are higher than the required resources for task 1 which guarantees that this task can be completed by the members of coalition S1. Moreover, to evaluate the efficiency of the coalition formation method provided by the embodiments herein in a resource constraint network, an Efficiency Factor is defined as E.F.=Σj=1N
In
The leader-follower coalition formation method 200 may be utilized for distributed task allocation and optimizing the cooperative communication between the detected targets 80 and the base station 60 in a heterogeneous network 75 of UAVs 701 . . . 70x. A reputation-based approach is provided to monitor the cooperative behavior of UAVs 701 . . . 70x and filter out the selfish UAVs 71 who have not accumulated sufficient collaboration credits. The systems 10, 100 and method 200 enable optimizing several factors including the timely completion of the tasks and preserving the available resources 50 in the network 75 from the perspective(s) of the leader(s) (i.e., device 401 and/or UAV 701), while it benefits follower devices 402 . . . 40x and/or UAVs 702 . . . 70x by lowering their travel times to join the coalitions or combinations of devices 40a or UAVs 70a. The simulation results show the convergence of the method 200 in forming stable coalitions of devices 40a or UAVs 70a with high resource efficiency factors to complete the encountered tasks (i.e. missions 25).
The foregoing description of the specific embodiments will so fully reveal the general nature of the embodiments herein that others can, by applying current knowledge, readily modify and/or adapt for various applications such specific embodiments without departing from the generic concept, and, therefore, such adaptations and modifications should and are intended to be comprehended within the meaning and range of equivalents of the disclosed embodiments. It is to be understood that the phraseology or terminology employed herein is for the purpose of description and not of limitation. Those skilled in the art will recognize that the embodiments herein can be practiced with modification within the spirit and scope of the appended claims.
This application claims the benefit of U.S. Provisional Patent Application No. 62/642,631 filed on Mar. 14, 2018, which is incorporated herein by reference in its entirety.
The invention described herein may be manufactured and used by or for the Government of the United States for all government purposes without the payment of any royalty.
Number | Date | Country | |
---|---|---|---|
62642631 | Mar 2018 | US |