BRIEF DESCRIPTION OF THE DRAWINGS
The subject matter which is regarded as the invention is particularly pointed out and distinctly claimed in the claims at the conclusion of the specification. The foregoing and other objects, features, and advantages of the invention are apparent from the following detailed description taken in conjunction with the accompanying drawings in which:
FIG. 1 is a pictorial diagram of a an exemplary data stream processing environment with distributed data sources that provide data streams incorporating features of the present invention;
FIG. 2 is a pictorial diagram of a an exemplary data stream processing environment with distributed data sources that provide data streams incorporating features of the present invention
FIG. 3 is a pictorial diagram of a an exemplary tree data stream processing environment with distributed data sources that provide data streams incorporating features of the present invention shown in FIG. 2;
FIG. 4 is a pictorial diagram of a an exemplary data stream processing environment with distributed data sources that provide multiple data streams incorporating features of the present invention shown in FIG. 2;
FIG. 5 is a pictorial diagram of another exemplary tree data stream processing environment with distributed data sources that provide data streams incorporating features of the present invention;
FIG. 6 is a pictorial representation of a three-tier sensor network incorporating features of the present invention shown in FIG. 1;
FIG. 7 is a graphical representation of two prior art solutions compared with a solution set provided by a method of the present invention;
FIG. 8 is another graphical representation of two prior art solutions compared with a solution set provided by a method of the present invention;
FIG. 9 is pseudo code which may be utilized to implement a method and system of the present invention shown in FIG. 1;
FIG. 10 is pseudo code which may be utilized to implement a method and system of the present invention shown in FIG. 1;
FIG. 11 is pseudo code which may be utilized to implement a method and system of the present invention shown in FIG. 1;
FIG. 12 is pseudo code which may be utilized to implement a method and system of the present invention shown in FIG. 1;
FIG. 13 is enabling description of features of the present invention shown in FIG. 1;
FIG. 14 is enabling description of features of the present invention shown in FIG. 1; and
FIG. 15 a pictorial representation of a data processing system which may be utilized to implement a method and system of the present invention.
DETAILED DESCRIPTION OF THE INVENTION
Referring to FIG. 1, there is shown an exemplary data stream processing environment with distributed data sources that provide data streams. The environment may consist of any suitable number of nodes, 1-10, (sensors and edge servers), capable of executing multiple stream-oriented operators. These nodes 1-10, which may be geographically dispersed, are organized into a network 15 and collectively provide a rich set of query processing services for multiple concurrent stream-based applications.
Referring also FIG. 15, there is shown a block diagram of a node incorporating features of the present invention shown in FIG. 1. It will be seen that in FIG. 15 there is shown input devices keypad 120, and input device 150. Also shown is display 130 and antenna 110. Node 100 includes a processor 210 that is coupled to the antenna 110, keypad 120, display 130, and input device 150. Input device 150 may be any suitable physical sensor that monitors the environment and collects data. Antenna 110 may be any suitable network connection such as a local area network (LAN), a wide area network (WAN) and/or any suitable network connection.
In addition, processor 210 is also coupled to a memory 230. Memory 230 may include any combination of volatile and non-volatile memory. Processor 210 takes input from keypad 120, antenna 110, and memory 230, and generates appropriate output on display 130, antenna 110, and/or memory 230. For clarity the FIG. 15 diagram shows only the most commonly-known components and features that allow a complete description of the preferred embodiments of the present invention.
Still referring to FIG. 1 memory 230 also contains database 215. Database 215 may contain a data transmission cost ft module for cost associated with transmission of the sensor data; a data processing cost fp module for cost associated with data processing of the sensor data; and a data storage cost fs module for cost associated with data storage of the sensor data; and a data query Q minimization module for minimizing cost associated with data transmission cost, data processing cost, and data storage cost.
As used herein, a node includes any device for providing data output. However, one skilled in the art will recognize that the teachings herein may be used with a variety of data devices other than the node shown in FIG. 15. Accordingly, nodes such as shown in FIG. 15 are merely illustrative of certain embodiments for data processing resources.
The sensor network 15 is represented by a directed graph GN (V,E), as illustrated in FIG. 1. The edges of the network 15 correspond to communication links which could be wired or wireless. Also, if (i, j) ε E, then (j,i) ε E. That is, communication is feasible in both directions for every pair of connected nodes 1-10. Also, |V|=N, and there are at least three types of nodes in the network: e.g., sensor nodes, relay nodes, and a fusion center.
Sensor nodes: Sensor nodes may be any suitable physical sensor that monitors the environment and collects data. They may or may not have the capability to process data. Sensor nodes may be, for example, a thermometer, smoke detector, or a camera. It will be appreciated that a sensor node may be any suitable device that responds to a physical stimulus such as heat, light, sound, pressure, magnetism, or motion, and transmits a resulting impulse or signal. It will also be understood that any suitable sensor node will contain the logic and resources necessary for the dynamic reallocation of data processing resources for the efficient distributed processing of sensor data.
Relay nodes: Nodes that receive and forward data. In general, relay nodes have the capability of processing data.
Fusion center: The node to which users make queries and obtain responses based on sensor data.
These nodes have varying computation and data transmission capabilities. They may also have storage space to cache data. Further, each node only has local knowledge about its neighboring nodes. Let ()i denote the set of the neighbors of node i such that for each j ε Φi, (i,j) ε E or (j,i) ε E.
Discrete Stream-Based Data Generation
Sensor nodes monitor and collect data about the environment by sampling it periodically. Further, time may be discretized, i.e., each sensor takes one sample during a time frame. Then, a snapshot of the whole environment may taken by the network at each time frame or any suitable time frame. Note that it does not require strict synchronization of the various nodes in the network. Each node simply needs to index the snapshots in sequence. For clarity, the following description focuses on a single snapshot. However, it will be appreciated that any suitable number of snapshots may be used.
Query Graph
Still referring to FIG. 1, the network 15 supports a set of queries. Each query Q is represented by a query graph GQ, consisting of operators and data objects. Such a graph can be built either directly by the user, or derived by a query planner. In general, the query graphs are logical graphs depicting the interaction between various operators and data objects. The query graphs presented herein are decoupled from the physical network topology so as to focus more on the semantics and work flow relations and better describe embodiments of the present invention.
It will be understood, however, that an underlying physical network will contain the logic and resources necessary for optimizing dynamic reallocation of data processing resources for efficient distributed processing of sensor data in accordance with embodiments of the invention presented herein.
A query graph is a directed acyclic graph (DAG) GQ=(Vo, Vd, E) that describes how a response to the query is obtained logically. Here, Vo is a set of operators, Vd is a set of data objects. In words, a query graph consists of data objects and operators, where the operators are essentially programs which aggregate or process existing data to generate new data.
Referring to FIG. 2, there is shown a query graph. For example, Operator-7 (item 277) generates intermediated data-7 (item 27) and requires as inputs data-1 (item 21) and data-2 (item 22). Likewise, Operator-6 (item 266) generates intermediate data-6 (item 26) and requires as inputs data-3 and data-4, items 23 and 24, respectively. Operator 8 (item 288 generates final data-8 (item 28) and requires as input intermediate data-7 and data-8, items 27 and 28, respectively. Similarly, Operator-9 generates final data-9 (item 29) and requires as input intermediate data-6 (item 26) and data-5 (item 25). Let |Vd|=M. In a preferred embodiment the data objects are divided into three classes:
- (1) Sensor data (raw data): Data generated by the sensor nodes, e.g., data-1 (item 21), data-2 (item 22), data-3 (item 23), data-4 (item 24), and data-5 (item 25) in FIG. 2. It will be appreciated that any suitable number of sensor nodes generating sensor data may be used;
- (2) Final data: Answers to the query, e.g., data-8 (item 28) and data-9 (item 29) in FIG. 2. The final data needed to be delivered to fusion center 201; and
- (3) Intermediate data: Data that are neither sensor data nor final data, for example data-6 (item 26) or data-7 (item 27) in FIG. 2.
For purposes of description clarity, only one data object generated by each operator is shown. Thus the operator can be assigned the same label as the data generated. It will be understood that each operator may require multiple data objects as input, and each data object may be requested as input by multiple operators.
As previously mentioned, one snapshot of the environment is created at each time slot. Also queries may be generated at a lower rate and that they are applied only to a fraction of the data generated, therefore, for a given query Q with query graph GQ, each snapshot will be evaluated by GQ with probability q.
Note the following notations:
- Δk: The set of data objects in Vd that are needed to generate data object k. For example, Δ7={1, 3} in FIG. 2. We refer to the data objects in Δk as children of data k.
- Dk: The number of bits of data k.
The sensor network GN is given and a query Q (with query graph GQ) is posed at the fusion center (FIG. 2, item 201). In accordance with an embodiment of the present invention a method for efficiently obtaining the final answer to the query is provided. It will be appreciated that some prior art solutions transmit all sensor data to the fusion center, and let the fusion center do the processing and aggregation. But this can be very expensive in the case that high volume data sources are involved, and the final answer could be very simple. In a preferred embodiment, one method is to allow partial processing and aggregation within the network (FIG. 1, item 15), where the data are combined or filtered in the network (FIG. 1, item 15) so as to reduce the number of transmissions.
Query Processing Scheme
A query processing scheme (Π,R,Ω) in accordance with an embodiment of the present invention solves the following three problems.
- (1)Operator Placement (Π): Let Π denote the operator placement matrix such that Π(k,i)=1 if data k is generated at node i. That is, the operator generating data k is placed at node i. For clarity, assume each operator can be placed at only one node, thus Π(k, j)=0 for all j=i. Although redundant placement may be beneficial, especially when data k is requested by multiple upstream operators.
- (2)Routing (R): Give a placement of operators, route the data from the node it is generated to the node it is required as input. For example, all data in Δk (children of data k) need to be transmitted to node i as these are required input data in order to generate date k.
- (3)Data Caching (Ω): Here, Ω is the caching matrix, Ω(k,i)=1 if data k is cached at node i.
Routing and operator placement clearly impact the efficiency of the query processing. It may be less obvious why one should be concerned with data caching. Consider two simple caching schemes:
- (1)PULL: Cache the raw data where they are generated. Transmit and process relevant data only when a query is acquired.
- (2)PUSH: Transmit and process all raw data for all queries and deliver results to the fusion center.
The disadvantage of PULL is that it requires the fusion center to signal the sensor nodes each time that data is needed. This incurs a communication cost with each query; the advantage is that the data are only transmitted when a query is made.
The disadvantage of PUSH is that it is always transmitted to the fusion center. In the absence of a query, a communication cost is always incurred. This may limit the total number of queries that can be supported. The advantage is that no additional transmission is required at the time of a query.
Thus, the choice of PULL or PUSH depends on q. If q=1, i.e., we know for sure that a query is made on each snapshot, PUSH is the best choice. Similarly, if q is zero or close to zero, PULL is best. On the other hand, when q lies between zero and one, it may be more beneficial to cache data at nodes in the middle of the network. This may produce a cost savings over that of PUSH or PULL. Thus, choosing the proper location at which to cache the (intermediate or final) data should also be considered when designing efficient query processing schemes.
Cost Metrics
A query process scheme (R, Π, Ω) is given for query Q with query graph GQ on sensor network GN. Let fcost (GN,GQ,R,Π,Ω) denote the cost of this scheme; it accounts for three different costs.
- (1) Transmission cost ft(GN,GQ,R,Π,Ω): The cost used to transmit the data. Let rt(i,j) denote the cost to transmit one unit of data over link (i,j). In general, rt(i,j)=rt(j, i).
- (2) Processing cost fp (GN,GQ,R,Π,Ω): The cost of processing and aggregation. Let rp(k,i) denote the cost to generate data k at node i. For the sensor data, rp(k, i)=0 if data k is pinned to node i; and rp(k,i)=∞ otherwise.
- (3) Storage cost fs (GN,GQ,R,Π,Ω): The cost used to store/cache the data. Let rs(i) denote the cost of storing one unit data at node i. If there is no storage space available at node i, rs(i)=∞.
Thus, the total cost of scheme (R, Π, Ω) is fcost (GN,GQ,R,Π,Ω)=(ft+fp+fs) (GN,GQ,R,Π,Ω).
Minimum Cost Optimization
An embodiment of the present invention determines the query process (R*, Π*, Ω* ) that solves:
arg min fcost(GN,GQ,R,Π,Ω). (Problem 1)
Consider a special case where the holding cost is zero and q=1. In this case, PUSH is the best caching scheme and the optimization problem simplifies to:
(R*,Π*)=arg min fcost(GN,GQ,R,Π).
Even for this case, the optimization problem is known to be NP-Complete as known in the prior art. It is known in the art that the optimization problem can be solved using a centralized algorithm when the query graph has tree structure with complexity O (MN 2): Recall that N is the number of nodes in the network and M is the number of data items in the query graph.
However, while centralized algorithms known in the art solves the optimization problem, it has disadvantages solved by embodiments of the present invention. For example, the prior art requires the fusion center (FIG. 2, item 201) to know the entire topology of the network. Furthermore, when the network topology changes, say due to the failure of a node, the whole scheme needs to be re-calculated. Thus, in a preferred embodiment of the present invention the nodes (FIG. 1, Items 1-10) only need to know the information of their neighbors and the optimal scheme is obtained by exchanging information between neighboring nodes.
Referring to FIG.3, there is shown a tree structured query graph. Distributed methods, in accordance with embodiments of the present invention, which solve the general optimization problem with complexity O(M 2N 3) are provided herein. Furthermore, these methods allow the optimal scheme to be re-calculated very efficiently as the network topology changes. After investigating the tree structure, heuristic solutions will be provided for general structured queries.
Tree Structured Queries
In this section, embodiments of the present invention provide solutions to the optimization problem for the case that the query has a tree structure, e.g., FIG. 3. For a first example a special case where q=1 and the holding cost is zero is considered. Then the method, in accordance with embodiments of the present invention is extended to the case where q ε (0, 1) and the holding cost is not zero.
Minimum Cost Routing and Operator Placement
First, let q=1 and the holding cost is zero. From the description above, it is will be understood that the best caching scheme is PUSH since every snapshot will be queried. Thus, (R*,Π*)=arg min fcost (GN,GQ,R,Π).
During the process of solving the optimal placement in a distributed fashion, each node, i, maintains for each data item k the following information: Cp(k,i), Fp(k,i), Πp (k,i) and I (k,i). In particular, Cp (k,i) is the current cost of the best query process scheme to obtain data k at node i; Fp(k,i) is the neighbor that transmits data k to node i in the current scheme; Πp(k,i)=1 if data k is generated at node i in the current scheme, and =0 otherwise; and I (k,i)=1 if the state of data k has been updated, but not been distributed to node i's neighbors, and =0 otherwise.
The distributed optimal placement method is given in Method 1 shown in FIG. 9. First the states of the data are initialized according to lines 1-4, and then, the following two steps are executed repeatedly:
- (i) Lines 6-10: If I (k,i)=1, which means data k's states at node i have been updated but not distributed to node i's neighbors, then node i sends data k's states to all its neighbors and I (k,i) is set to zero.
- (ii) Lines 11-19: According to the information received from its neighbors, node i checks whether it should update the states of its data. For each data k, node i first checks the cost is reduced by obtaining data k from its neighbors (lines 13-16). Then, node i checks whether it is better to generate data k at node i (lines 17-19). Let I (k,i)=1 if data k's states are updated.
The method terminates when no additional information is exchanged between nodes. Note that the method in this embodiment is synchronous, nodes have to wait for the completion of step (i) to execute step (ii), and wait for the completion of step (ii) to execute step (i).
Method 1 shown in FIG. 9, solves the optimization problem with M (N−1) iterations. For understanding, note the following definitions:
- (1)Routing path α (k,i,j): A path over which data k flows from node j to node i.
- (2) Data collection tree β (k,i) rooted at node i: A tree over which data k flows to node i. If data k is the sensor data generated at sensor j, then β (k,i)=α (k,i,j). If data k is an intermediate data or the final data, and generated at node j, then β (k,i)=α (k,i,j) ∪ (∪m εΔkβ (m,j)).
- (3)Depth H (β): The maximum path length from the root to a leaf. If data k is the sensor data, H (β(k,i)) is the number of hops path β (k,i) tales; otherwise if data k is an intermediate data or the final data, and generated at node j,
H(β(k,i))=H(α(k,i,j))+max H(β(m,j)).
Referring to FIG. 13, there is shown a theorem and corresponding proof that, in accordance with embodiments of the present invention, Cp0(k,i)=Γp0(k,i), and that
Cp(k,i)=Γpn(k,i)
for data k such that Δk=/0. Then, by induction on n and on the query tree (from the leafs to the roots), we conclude that equation (1) shown in FIG. 13 holds for all n, k and i. Since there are N nodes and M data objects, in the worst case,
H(β(Final Data,Fusion Center))=M(N−1).
Further, there are N nodes in the network, and at each step, each nodes send at most MN messages. Thus, the number of messages exchanged is O(M2N 3). Method 1 shown in FIG. 9, embeds the query graph into the sensor network (FIG. 1, item 15). Each participating node (items 1-10) knows the identities of the neighbors from which to obtain data, but not the nodes to which it delivers data. The network then executes the following trace back process.
- (i) Generate message (Data k*) at the fusion center, e.g., FIG. 2, item 201, where data k* is the final data.
- (ii) If node i receives message (data k) from node j, it needs to transmit data k to node j whenever data k is available. Furthermore,
- (a) If Π(k,i)=1, node i sends message (data m) to node Fp (m,i) for each m ε Δk, and node i needs to generate data k whenever data k's children are available.
- (b) If Π(k,i) 6=1, node i sends message (data k) to node Fp (k,i).
It will be readily understood by those skilled in the art that each node will know how to transmit and process data according to (R*,Π*) after the trace back process terminates.
Distributed Caching
In the previous subsection, the example focused on the case where q=1 corresponding to the situation where every data snapshot is queried. In this subsection, an example of a more realistic case, i.e., where q<1 is provided; the user does not query every snapshot, but only those data in which it is interested. As described earlier, PUSH is the optimal scheme for q=1. But for q<1, caching can further reduce the cost. Now, any caching scheme must account for the fact that different nodes have different amounts of storage space. For example, a sensor node, e.g., items 21-25 shown in FIG. 2, may only have limited storage space; on the other hand, a relay node, items 7-8 shown in FIG. 2 and the fusion center, item 201 may have substantial storage. Thus, a preferred embodiment of the present invention caches data at nodes with substantial storage, and associates different holding costs to different nodes to reflect this fact. Also, when cache data in the middle of the network, the nodes caching the data need to be notified each time a query is generated. For purposes of discussion, assume the size of the signaling message needed to do this is Dq.
In this subsection, a distributed method that determines the minimum cost scheme is provided in accordance with embodiments of the present invention. First, each node i needs to maintain for each data k the following information: Cp (k,i), Fp (k,i), Πp(k,i), Cs (k,i), Fs (k, j), Πs (k,i), Ω(k,i) and I (k,i). In particular, Cp (k,i), Fp (k,i) and Πp (k,i) are associated with the current best processing scheme to obtain data k at node i; while Cs (k,i), Fs (k,i), Πs(k,i), and Ω(k,i) are associated with the current minimum cost scheme involving caching, where Ω(k,i) represents the caching decision with Ω(k,i)=1 if data k is cached at node i, and 0 otherwise.
It will be understood by those skilled in the art that Γpn (k,i)+rp (k,i) is the minimum cost to cache data k at node i, among all data collection trees β (k,i) such that H (β (k,i))≦n. So Cp (k,i), Fp(k,i) and Πp (k,i) are used to check whether caching data k at node i is better than pulling data k from other nodes when the query is injected.
If the final data, i.e., data from data-8, item 28 and/or data-9, item 29, shown in FIG. 2, is not pushed to the fusion center 201, the sensors or the nodes caching the data each time a query is injected are notified. Thus, to obtain the minimum cost query processing scheme, the minimum cost path to transmit the required signaling messages is required.
Referring to FIG. 11, there is shown Method 3 for finding the minimum cost path to transmit the required signaling message. Thus, node i needs to maintain for each node j (not just for j ε Φi) the following information: R(i,j), G(i,j) and Ib (i,j). In particular R(i,j) is the current cost to transmit one unit of data from node i to node j; G(i,j) is the next hop node i should take to reach node j; and Ib(i,j)=1 if the information has been updated, but not distributed to node i's neighbors.
Still referring to FIG. 11, the distributed method to obtain the minimum cost scheme is given as Method 3. Similar to Method 1 shown in FIG. 9, Method 3 first initializes the network states according to Method 2 shown in FIG. 10. After that, Method 3 repeats following two steps:
- (i) Lines 3-8: If node i's state has been updated, node i needs to send the updates to its neighbors.
- (ii) Lines 9-31: Node i updates its state according to the information received from its neighbors. It first uses the distributed Bellman-Ford algorithm, or any suitable method, to update R(i,j), G(i,j) and Ib (i,j) (lines 10-13). Then, node i updates the cost related to caching (lines 17-19 and lines 23-25). Finally, for each data k, node i compares the current scheme with three different schemes according to the updated information received: (1) Obtain data k from node j(j ε Φi) when the query is injected (lines 20-22); (2) Compute data k at node j when the query is injected (lines 26-28); (3) Cache data k at node j (lines 29-31). If one of three schemes is better, node i chooses that scheme and updates data k's state.
FIG. 14 shows that Method 3 shown in FIG. 11 converges to the optimal scheme within N+M(N−1) iterations.
Similar to Method 1, FIG. 9, the following trace back process is provided:
- (i) Generate message (Data k*, Pull) at the fusion center, where data k is the final data.
- (ii) If node i receives message (Data k, Pull) from node j, then
- (a) If Πs (k,i)=1 and Ω(k,i)=1, node i sends message (data in, Push) to node Fs(m,i) for each m ε Δk, and needs to generate data k whenever data k's children are available.
- (b) If Πs(k,i)=1 and Ω(k,i)=0, node i sends message (data m, Pull) to node Fs(m,i) for each m ε Δk, and needs to request data m from node Fs (m,i) for each m ε Δk when there is a request for data k at node i.
- (c) If Πs (k,i)=0 and Ω(k,i)=1, node i sends message (data k, Push) to node Fs(k,i).
- (d) If Πs (k,i)=0 and Ω(k,i)=0, node i sends message (data k, Pull) to node Fs(k,i), and needs to request data k from node Fs (k,i) when there is a request for data k at node i.
If node i receives message (Data k, PUSH) from node j, it needs to transmit data k to node j whenever data k is available. Furthermore,
- (a) If Πs(k,i)=1, node i sends message (data in, Push) to node Fs (m,i) for each m ε Δk, and needs to generate data k whenever data k's children are available.
- (b) If Πs (k,i)=0, node i sends message (data k, Push) to node Fs (k,i).
Asynchronous Methods
Methods 1 and 3 shown in FIG. 9 and FIG. 11, respectively, execute in a synchronous way. For example, referring to FIG. 9, nodes begin to update their states (lines 11-19) only after all nodes finish distributing the updated information to their neighbors (lines 6-10); similarly, nodes begin to send out their updated information only after all nodes finish updating their states. This requires that all nodes be fully synchronous and know when other nodes finish updating and finish sending out updated information.
In this embodiment, Method 4, the methods are operated in an asynchronous way. Thus, each node updates its states right after it receives information from its neighbors; and sends updated messages to its neighbors right after its states are updated. For example, an asynchronous version of Method 1 is shown in FIG. 12, Method 4, and we can obtain the asynchronous version of Method 3 similarly. When the methods execute asynchronously, the methods are fully distributed.
Cost and Topology Change
Communication, computation, and storage costs may change over time. Furthermore, links can fail. In this section the methods shown above are modified to accommodate such changes.
An example of a change is where one of the costs decreases, e.g., node i's storage cost decreases due to the addition of storage space to node i. It will be understood that it suffices to re-initiate execution of the methods (node i first updates its states according to the decrease in cost and then distributes the updated information to its neighbors, and then all nodes begin to exchange information). The method will converge to the new optimal scheme in a finite number of steps.
Now consider the case of either a cost increase or a link failure (the cost is infinity). Simply executing the method will result in a “count to infinity” problem understood in the prior art. It may take an infinite number of iterations for the method to converge to the new optimal solution when a link fails, or B iterations to converge if the cost increase is B. Alternate methods may be used in these situations. Once such method includes the following two parts:
- (1)Freeze: When the cost of a node or a link increases, it sends out freeze messages. All data collection trees containing that node or that link are frozen except for modifying the costs. Then, the costs are re-calculated according to the cost change. For example, consider Method 1 shown in FIG. 9, suppose Πs (k,i)=1, and node i's processing cost increases by δ, then
- (i) Node i first sets Cp (k,i)=Cp(k,i)+δ. Then it checks the data h from the leaves to the root along the query tree. If Π(h,i)=1 and Cp(m,i) changes for any m ε Δh, node i re-calculates Cp(h,i). If Cp(k,i) increases for any data k, node i distributes a freeze message (Data h, Cp(k,i), Freeze) to its neighbors. Also, all states of node i are frozen, and can only be modified when freeze messages received.
- (ii) If node i receives (Data As, Cp (k, j), Freeze) from node j, it first sets Cp(k,i)=Cp (k, j)+Dkrt (j,i) if Fp (k,i)=j. Then, it checks and up dates all data h's states as in (i). If any Cp (k,i) increases, node i distributes freeze message (Data As, Cp(k,i), Freeze) to its neighbors. The states of node i are frozen, and can only be modified when a freeze message received.
- (2)Unfreeze: When the states of all nodes are updated according to the cost change, the nodes are unfrozen. After unfreezing, those nodes whose states are updated request their neighbors to send states to them, and then all nodes begin to exchange their information using the distributed algorithms. The algorithm terminates when there is no more information exchanged among nodes. Using this Freeze-Unfreeze scheme, the new optimal scheme can be obtained in finite steps.
Extensions
It will be appreciated that the distributed methods presented thus far solve the minimum cost query processing problem for tree-structured queries. Alternated embodiments of the present invention extend the methods to handle more general query graphs and to account for finite capacity on certain nodes.
General Queries
In this subsection, queries with more general underlying topologies are addressed. As described earlier, Problem 1 is NP-Complete for the case of a general query graph. We propose a number of simple heuristics to handle this.
Heuristic 1: For a general query graph that is not a tree, such as in FIG. 2, where one data may have more than one parent. One alternate embodiment temporarily removes some edges so that the resulting graph becomes a tree or multiple trees. The methods presented thus far apply the distributed methods on each of the new trees to obtain the minimum cost scheme. Since each operator generates exactly one data object, it is only necessary to remove edges from data objects to operators, e.g. the edge from data-6, item 26 to operator 9, item 299 in FIG. 2.
Denote the optimal assignment π* for the original problem with total cost C*. Similarly, denote π′ and C′ the assignment and total cost corresponding to the new trees and assert that C′≦C*. This is because the optimal placement π* is clearly feasible for the new trees, however, C′ does not need to include the transmission cost on the removed edges, thus must be smaller than C*. Now adding back to C′ the transmission cost Δ of the removed edges, yields a feasible solution to the original problem. Hence C*≦C′+Δ. Therefore, if the query graph is tree-like such that one only needs to remove a limited number of edges to have tree structures, the distributed method is suitable for tree-like structures.
Heuristic 2: There exists one set of queries for which we can easily adapt our earlier methods for trees to obtain the optimal solution. Transforming the graph shown in FIG. 2 to the graph in FIG. 4 by replacing the edges from data data-6, item 26 to operator 8, item 288 and operator 9, item 299 by a subgraph consisting of a three nodes, copy operator 41 and two data items, 6a, item 26A and 6b, item 26B. Edges are added from data-6, item 26 to the copy operator 41, from the copy operator 41 to data items, 6a, item 26A and 6b, item 26B, and from 6a, item 26A to operator 8, item 288 and from data 6b, item 26B to operator 9, item 299. A computation cost of zero is associated with the copy operator 41.
Next we reverse the edges from operator 9, item 299 to data-9, item 29, copy operator 41 to data-6b, item 26B, and from 6b to operator 9, item 299. The resulting graph is then of a tree structure shown in FIG. 5. Now apply Method 1 shown in FIG. 9 with the following modification. During tracing and deciding whether or not to add a physical link, say (i,j), to a path corresponding to an edge within the query graph that has been reversed, use a communication cost of rt(j,i)Dk rather than rt(i,j)Dk. This accounts for the fact that the data will actually travel in the reverse direction. In the case of Method 2, shown in FIG. 10, the modification is similar, in fact it corresponds to switching the signaling costs with the data costs for the edges of the query graph that have been reversed as well as the directions of the links traversed, i.e., rt (j, i)Dk rather than rt (i, j)Dk. Last, in both cases the back tracking phase must be modified to account for the reversed edges properly.
Note that the transformation that was described earlier is not special to the query graph illustrated in FIG. 2. It can be applied to any suitable query graph whose maximum node out degree is two. The resulting scheme is optimal over the class of all schemes that do not allow operators to be replicated. This approach also applies for query graphs with higher out degree. For example, a query graph that has a data object with out degree three. In this example there are three different orderings, each of which requires two copy operators. In order to obtain the optimal solution, each combination has to be treated separately and the one achieving the minimum cost identified after all three instances have been solved. It will be understood that as the out degree grows, the computational requirements grows exponentially.
Finite Capacity Constraints
So far we have assumed that there are no constraints on the computation capacity and storage space. However, it will be appreciated that in some scenarios there may have constraints on the total amount of capacity available at certain nodes. In this section, we provide some heuristics to address the problem.
In accordance with one embodiment of the present invention, the method increases the cost coefficients at the nodes that are severely capacity constrained. Consider, for example, a sensor node k has little space for storage, making the cost of storing a unit of data there extremely high will discourage the distributed methods presented herein to store a data object at node k.
Another solution is to perform a form of distributed bin packing. For example, after an initial run of the distributed method, it is found that the assignment violates the capacity constraint associated with node k. An embodiment of the distributed method in accordance with the present invention can then take out the minimum number of assigned items to meet the capacity constraints and re-allocate them to nearby nodes that have residual capacity.
A third approach in accordance with embodiments of the present invention is to introduce cost functions so that the cost is extremely high or infinite whenever reaching the fall capacity of a given node. Thus, dynamic network rate control is established. It will be appreciated that the cost functions may be linear or non-linear.
The following sections highlight the advantages of the embodiments of the invention described above. A first comparison highlights tree-structured queries and shows that the distributed method in accordance with embodiments of the invention presented herein can significantly reduce the cost of query processing.
Referring to FIG. 6 there is shown a three-tier sensor network 60. The highest tier is the fusion center 61, the middle tier is formed by relay nodes 62. Each relay node is responsible for several sensor nodes. The sensor nodes form the lowest tier 63. In simulations, the number of the relay nodes and the number of the sensors who connect to the same relay node are randomly chosen from [0,L]. L is varied from 5 to 20 to investigate the performance of the algorithms for the networks with different size.
Three-tier tree-structured query graphs are randomly generated. Each sensor data has probability ½ to be involved in the query, so there are approximately L2/2 sensor data in each query graph. To investigate general queries, first generate two non-overlap tree-structured queries, and then randomly add 10% additional linlks to connect the two trees. It is thus a tree-like query graph, and use Heuristic 1 to obtain the placement solution.
The communication cost per unit data of each link is randomly chosen from [0, 10], and the computation cost of each operator and the storage cost per unit data are chosen to so that the nodes in the higher tier have lower cost, and are approximately of the order of 1/10 of the communication cost. Embodiments of the invention are compared with the following two algorithms:
- 1. Simple-Push: The sensors transmit the data to the relay nodes after the data are generated. The relay node will process the data if possible and then transmit to the fusion center.
- 2. Simple-Pull: Fusion center signals the sensor nodes when the query is injected. Then the data are transmitted and processed as the Simple-Push.
Choosing L from 5 to 20 and for each fixed L, the simulation is executed 100 times, and at each time, a sensor network and a tree-structure query are randomly generated. Referring to FIG. 7, there is shown the average costs of the two algorithms described above (items 71 and 72). Further the cost of Method 3 (item 73) of an embodiment of the present invention is approximately ½ of the cost of the Simple-Pull (item 72) and ⅓ of the cost of the Simple-Push (item 71)
Referring now to FIG. 8 there is shown Heuristic 1 as described above (item 83) compared with Simple-Pull 82 Simple-Push 81 and a lower bound 84 of the query processing cost as discussed earlier. Observe that Heuristic 1 (item 83 ) is very close to the optimal solution
The capabilities of the present invention can be implemented in software, firmware, hardware or some combination thereof.
As one example, one or more aspects of the present invention can be included in an article of manufacture (e.g., one or more computer program products) having, for instance, computer usable media. The media has embodied therein, for instance, computer readable program code means for providing and facilitating the capabilities of the present invention. The article of manufacture can be included as a part of a computer system or sold separately.
Additionally, at least one program storage device readable by a machine, tangibly embodying at least one program of instructions executable by the machine to perform the capabilities of the present invention can be provided.
The diagrams depicted herein are just examples. There may be many variations to these diagrams described therein without departing from the spirit of the invention. The flow diagrams depicted herein are also just examples. There may be many variations to these diagrams or the steps (or operations) described therein without departing from the spirit of the invention. For instance, the steps may be performed in a differing order, or steps may be added, deleted or modified. All of these variations are considered a part of the claimed invention.
While the preferred embodiment to the invention has been described, it will be understood that those skilled in the art, both now and in the future, may make various improvements and enhancements which fall within the scope of the claims which follow. For example, any suitable IP protocol may be used. Such as, for example, Ipv4 or Ipv6. In addition, the previously described WECM or any suitable connection manager may be used. These claims should be construed to maintain the proper protection for the invention first described.