SYSTEM AND METHOD TO DETECT ATTACKS ON MOBILE WIRELESS NETWORKS BASED ON NETWORK CONTROLLABILITY ANALYSIS

Information

  • Patent Application
  • 20170318034
  • Publication Number
    20170318034
  • Date Filed
    March 18, 2016
    8 years ago
  • Date Published
    November 02, 2017
    7 years ago
Abstract
Described is a system for detecting attacks of misinformation on communication networks. Network controllability metrics on a graphical representation of a communication network are computed. Changes in the network controllability metrics are detected, and attack of misinformation on the communication network are detected based on the detected changes in the network controllability metrics.
Description
BACKGROUND OF INVENTION
(1) Field of Invention

The present invention relates to a system for detecting attacks on nodes of wireless networks and, more particularly, to a system for detecting attacks on nodes of wireless networks based on network controllability analysis.


(2) Description of Related Art

Due to the dynamic nature of mobile wireless network topology, mobile wireless networks use protocols that are built on a model of implicit trust and sharing of control information, which makes them particularly hard to defend against attacks of misinformation. Existing security solutions for mobile adhoc networks detect attacks at the level of networks throughput statistics (e.g., at layer 2 and 3 of the 7-layer network stack) by anomaly detection. They look for protocol violations; hence, they are specific to certain protocols or known attack signatures. Additionally, current network protocol stacks secure the transmission between pairs of nodes, hut they can't avoid reliance on the information from other nodes (i.e., they can't avoid “network insider” attacks). A compromised node can send bad information to subvert the operation of the network (e.g., by advertising itself as the fastest route to get to every other node in the network, but throwing away every packet it gets, called a blackhole attack). This kind of attack does not violate protocol, so it is hard to detect with conventional techniques.


Furthermore, current research in the detection of misbehaving nodes in mobile wireless networks is still predominantly focused on adapting and optimizing conventional network defense strategies that concentrate on behaviors at the lower layers of the networking stack (see the List of incorporated Literature References, Literature Reference Nos. 3-9). Research on strategies such as signature detection, statistical anomaly detection, and specification-based detection have proven effective for specific attack and network scenarios, but applicability to more general scenarios has proven elusive. What has been missing is a higher level behavioral analysis of the entire networking stack and applications on each node and on the network as a whole. It is this perspective that recent research in network science and information dynamics can now provide through the formulation and analysis of the graph-theoretic network-of-networks (NoN) model (see Literature Reference Nos. 10-12). Although NoN has been widely applied to the study of the dynamics of social networks, its application to cyber-security has only recently been recognized following breakthroughs of methods for modeling both logical and physical networks in NoN (see Literature Reference No. 13), where connectivity and dynamics are fundamentally different. The extension of this ground breaking work to the challenging environment of mobile wireless networks, particularly under real-world assumptions of scale and complexity, has yet to be studied.


Thus, a continuing need exists for a system that can detect sources of misinformation in a holistic way by analyzing changes in applications and their dependencies with the lower networking layers.


SUMMARY OF THE INVENTION

The present invention relates to a system for detecting attacks on nodes of wireless networks and, more particularly, to a system for detecting attacks on nodes of wireless networks based on network controllability analysis. The system comprises one or more processors and a memory having instructions such that when the instructions are executed, the one or more processors perform multiple operations. A plurality of network controllability metrics on a representation of a communication network comprising a plurality of nodes are computed. Changes in the plurality of network controllability metrics are detected, the detected changes are used to detect attacks of misinformation on the communication network.


In another aspect, the representation includes network topology, network dependencies, and application dependencies within the communication network.


In another aspect, the plurality of network controllability metrics are computed as a function of a pattern of communication between a plurality of nodes of the communication network during a given time window.


In another aspect, given a set of examples of network controllability metric data representing a baseline behavior and a set of examples of network controllability metric data representing an attack behavior, a machine learning classifier determines a threshold for attack detection based on differences between the baseline behavior and the attack behavior.


In another aspect, each network controllability metric is represented as a diode in a diode pattern panel, wherein network controllability metrics displaying attack behavior, as determined by the threshold for attack detection, are highlighted in the diode pattern panel.


In another aspect, upon detection of an attack of misinformation on the communication network, the system performs a mitigation action.


In another aspect, the mitigation action comprises isolating an attacking node from the rest of the communication network.


In another aspect, the mitigation action comprises informing every other node in the communication network to ignore anything that the attacking node transmits, and not to send anything to, or through, the attacking node.


In another aspect, features representing each of the plurality of network controllability metrics are output. Each feature is then converted into a binary indication of whether a value is anomalous or not anomalous, and the binary indication is used to detect changes in the plurality of network controllability metrics.


In another aspect, the representation is, a graphical representation of network topology, network dependencies, and application dependencies within the communication network.


In another aspect, the plurality of network controllability metrics are computed on a graphical representation of a pattern of communication between a plurality of nodes of the communication network during a given time window.


In another aspect, the present invention also comprises a method for causing a processor to perform the operations described herein.


Finally, in yet another aspect, the present invention also comprises a computer program product comprising computer-readable instructions stored on a non-transitory computer-readable medium that are executable by a computer having a processor for causing the processor to perform the operations described herein.





BRIEF DESCRIPTION OF THE DRAWINGS

The objects, features and advantages of the present invention will be apparent from the following detailed descriptions of the various aspects of the invention in conjunction with reference to the following drawings, where:



FIG. 1 is a block diagram depicting the components of a system for detecting attacks on wireless networks according to some embodiments of the present disclosure;



FIG. 2 is an illustration of a computer program product according to some embodiments of the present disclosure;



FIG. 3 is an illustration of construction of the Exploitation Network (X net) according to some embodiments of the present disclosure;



FIG. 4A is an illustration of results from attack detection and attribution in a 25 node baseline scenario using network controllability metrics according to sonic embodiments of the present disclosure;



FIG. 4B is an illustration of results from attack detection and attribution in a 25 node attack behavior scenario using network controllability metrics according to some embodiments of the present disclosure;



FIG. 5A is an illustration of use of a support vector machine (SVM) to find a threshold to classify attack behavior based on network controllability metrics according to some, embodiments of the present disclosure;



FIG. 5B is an illustration of the SVM learning to find a plane in feature hyperspace that can separate examples of baseline performance from attack behavior according to some embodiments of the present disclosure;



FIG. 6A is an illustration of a diode pattern of 35 network metrics for baseline activity according to some embodiments of the present disclosure;



FIG. 6B is an illustration of a diode pattern of35 network metrics during a hypertext transfer protocol (HTTP) flooding attack according to some embodiments of the present disclosure;



FIG. 7A is an illustration of a diode pattern of 35 network metrics for baseline activity according to some embodiments of the present disclosure;



FIG. 7B is an illustration of a diode pattern of 35 network metrics during a drop-all attack according to some embodiments of the present, disclosure;



FIG. 8A is an illustration of a diode pattern of 35 network metrics for baseline activity according to some embodiments of the present disclosure;



FIG. 8B is an illustration of a diode pattern of 35 network metrics during a reset-all attack according to some embodiments of the present disclosure;



FIG. 9 is an illustration of a summary panel of diode patterns of 35 network metrics in three different layers for baseline, drop-all, and reset-all attacks according to some embodiments of the present disclosure; and



FIG. 10 is an illustration depicting a relationship between modules of the Xnet model according to some embodiments of the present disclosure.





DETAILED DESCRIPTION

The present invention relates to a system for detecting attacks on nodes of wireless networks and, more particularly, to a system for detecting attacks on nodes of wireless networks based on network controllability analysis. The following description is presented to enable one of ordinary skill in the art to make and use the invention and to incorporate it in the context of particular applications. Various modifications, as well as a variety of uses in different applications will be readily apparent to those skilled in the art, and the general principles defined herein may be applied to a wide range of aspects. Thus, the present invention is not intended to be limited to the aspects presented, but is to be accorded the widest scope consistent with the principles and novel features disclosed herein.


In the following detailed description, numerous specific details are set forth in order to provide a more thorough understanding of the present invention. However, it will be apparent to one skilled in the art that the present invention may be practiced without necessarily being limited to these specific details. In other instances, well-known structures and devices are shown in block diagram form, rather than in detail, in order to avoid obscuring the present invention.


The reader's attention is directed to all papers and documents which are filed concurrently with this specification and which are open to public inspection with this specification, and the contents of all such papers and documents are incorporated herein by reference. All the features disclosed in this specification, (including any accompanying claims, abstract, and drawings) may be replaced by alternative features serving the same, equivalent or similar purpose, unless expressly stated otherwise. Thus, unless expressly stated otherwise, each feature disclosed is one example only of a generic series of equivalent or similar features.


Furthermore, any element in a claim that does not explicitly state “means for” performing a specified function, or “step for” performing a specific function, is not to be interpreted as a “means” or “step” clause as specified in 35 U.S.C. Section 112, Paragraph 6. In particular, the use of “step of” or “act of” in the claims herein is not intended to invoke the provisions of 35 U.S.C. 112, Paragraph 6.


Please note, if used, the labels left, right, front, back, top, bottom, forward, reverse, clockwise and counter-clockwise have been used for convenience purposes only and are not intended to imply any particular fixed direction. Instead, they are used to reflect relative locations and/or directions between various portions of an object. As such, as the present invention is changed, the above labels may change their orientation.


Before describing the invention in detail, first a list of incorporated literature references as used in the description is provided. Next, a description of various principal aspects of the present invention is provided. Finally, specific details of the present invention are provided to give an understanding of the specific aspects.


(1) LIST OF INCORPORATED LITERATURE REFERENCES

The following references are incorporated and cited throughout this application. For clarity and convenience, the references are listed herein as a central resource for the reader. The following references are hereby incorporated by reference as though fully included herein. The references are cited in the application by referring to the corresponding literature reference number, as follows:

  • 1. Y.-Y. Liu, J.-J. Slotine, and A.-L. Barabási, “Controllability of complex networks,” Nature, vol. 473, pp. 167-173, 2011.
  • 2. Y.-Y. Liu, J.-J. Slotine, and A.-L. Barabási “The observability of complex systems,” PNAS, vol. 110, no. 7, pp. 2460-2465, 2013.
  • 3. J.-P. Hubaux, L. Buttyán, and S. Capkun, “The quest for security in mobile ad hoc networks,” in Proceedings of the 2nd ACM international symposium on Mobile ad hoc networking & computing. ACM, 2001, pp. 146-155.
  • 4. S. Marti, T. J. Giuli, K. Lai, M. Baker et al., “Mitigating routing misbehavior in mobile ad hoc networks,” in International Conference on Mobile Computing and Networking: Proceedings of the 6th annual international conference on Mobile computing and networking, vol. 6, no. 11, 2000, pp. 255-265.
  • 5. H. Yang, J. Shu, X. Meng, and S. Lu, “Scan: self-organized network-layer security in mobile ad hoc networks,” IEEE Journal on Selected Areas in Communications, vol. 24, no. 2, pp. 261-273, 2006.
  • 6. Y. Zhang and W. Lee, “Security in mobile ad-hoc networks,” in Ad Hoc Networks. Springer, 2005, pp. 249-268.
  • 7. K. Govindan and P. Mohapatra. “Trust computations and trust dynamics in mobile adhoc networks: a survey,” Communications Surveys & Tutorials, IEEE, vol. 14, no. 2, pp. 279-298, 2012.
  • 8. A. Jøsang, R. Ismail, and C. Boyd, “A survey of trust and reputation systems for online service provision,” Decision support systems, vol. 43, no. 2, pp. 618-644, 2007.
  • 9. P. Michiardi and R. Molva, “Core: a collaborative reputation mechanism to enforce node cooperation in mobile ad hoc networks,” in Proceedings of the IFIP TC6/TC11 Sixth Joint Working Conference on Communications and Multimedia Security: Advanced Communications and Multimedia Security, 2002, pp. 107-121.
  • 10. S. Noel, M. Elder, S. Jajodia, P. Kalapa, S. O'Hare, and K. Prole, “Advances in topological vulnerability analysis,” in Conference For Homeland Security, 2009. CATCH'09, Cybersecurity Applications & Technology, IEEE, 2009, pp. 124-129.
  • 11. T. Karagiannis, K. Papagiannaki, and M. Faloutsos, “Blinc: multilevel traffic classification in the dark,” in ACM SIGCOMM Computer Communication Review, vol. 35, no. 4. ACM, 2005, pp. 229-240.
  • 12. S. Noel and S. Jajodia, “Understanding complex network attack graphs through clustered adjacency matrices,”in Computer Security Applications Conference, 21st Annual, IEEE, 2005, pp. 1-10.
  • 13. M. Kurant and P. Thiran, “Layered complex networks,” Physical review letters, vol. 96, no. 13, p. 138701, 2006.
  • 14. Borgatti, S and M. Everett, “A graph-theoretic perspective on centrality,” Social Networks, 28(4), 2006.


(2) PRINCIPAL ASPECTS

The present invention has three “principal” aspects. The first is a system for detecting attacks on wireless networks. The system is typically in the form of a computer system operating software or in the form of a “hard-coded” instruction set. This system may be incorporated into a wide variety of devices that provide different functionalities. The second principal aspect is a method, typically in the form of software, operated using a data processing system (computer). The third principal aspect is a computer program product. The computer program product generally represents computer-readable instructions stored on a non-transitory computer-readable medium such as an optical storage device, e.g., a compact disc (CD) or digital versatile disc (DVD), or a magnetic storage device such as a floppy disk or magnetic tape. Other, non-limiting examples of computer-readable media include hard disks, read-only memory (ROM), and flash-type memories. These aspects will be described in more detail below.


A block diagram depicting an example of a system (i.e., computer system 100) of the present invention is provided in FIG. 1. The computer system 100 is configured to perform calculations, processes, operations, and/or functions associated with a program or algorithm. In one aspect, certain processes and steps discussed herein are realized as a series of instructions (e.g., software program) that reside within computer readable memory units and are executed by one or more processors of the computer system 100. When executed, the instructions cause the computer system 100 to perform specific actions and exhibit specific behavior, such as described herein.


The computer system 100 may include an address/data bus 102 that is configured to communicate information. Additionally, one or more data processing units, such as a processor 104 (or processors), are coupled with the address/data bus 102. The processor 104 is configured to process information and instructions. In an aspect, the processor 104 is a microprocessor. Alternatively, the processor 104 may be a different type of processor such as a parallel processor, or a field programmable gate array.


The computer system 100 is configured to utilize one or more data storage units. The computer system 100 may include a volatile memory unit 106 (e.g., random access memory (“RAM”), static RAM, dynamic RAM, etc.) coupled with the address/data unit 106 wherein a volatile memory unit 106 is configured to store information and instructions for the processor 104. The computer system 100 further may include a non-volatile memory unit 108 (e.g., read-only memory (“ROM”), programmable ROM (“PROM”), erasable programmable ROM (“EPROM”), electrically erasable programmable ROM “EEPROM”), flash memory, etc.) coupled with the address/data bus 102, wherein the non-volatile memory unit 108 is configured to store static information and instructions for the processor 104. Alternatively, the computer system 100 may execute instructions retrieved from an online data storage unit such as in “Cloud” computing. In an aspect, the computer system 100 also may include one or more interfaces, such as an interface 110, coupled with the address/data bus 102. The one or more interfaces are configured to enable the computer system 100 to interface with other electronic devices and computer systems. The communication interfaces implemented by the one or more interfaces may include wireline (e.g., serial cables, modems, network adaptors, etc.) and/or wireless e.g., wireless modems, wireless network adaptors, etc.) communication technology.


In one aspect, the computer system 100 may include an input device 112 coupled with the address/data bus 102, wherein the input device 112 is configured to communicate information and command selections to the processor 100. In accordance with one aspect, the input device 112 is an alphanumeric input device, such as a keyboard, that may include alphanumeric and/or function keys. Alternatively, the input device 112 may be an input device other than an alphanumeric input device. For example, the input device 112 may include one or more sensors, such as a camera for video or still images, a microphone, or a neural sensor. Other example input devices 112 may include an accelerometer, a GPS sensor, or a gyroscope.


In an aspect, the computer system 100 may include a cursor control device 114 coupled with the address/data bus 102, wherein the cursor control device 114 is configured to communicate user input information and/or command selections to the processor 100. In an aspect, the cursor control device 114 is implemented using a device such as a mouse, a track-ball, a track-pad, an optical tracking device, or a touch screen. The foregoing notwithstanding, in an aspect, the cursor control device 114 is directed and/or activated via input from the input device 112, such as in response to the use of special keys and key sequence commands associated with the input device 112. In an alternative aspect, the cursor control device 114 is configured to be directed or guided by voice commands.


In an aspect, the computer system 100 further may include one or more optional computer usable data storage devices, such as a storage device 116, coupled with the address/data bus 102. The storage device 116 is configured to store information and/or computer executable instructions. In one aspect, the storage device 116 is a storage device such as a magnetic or optical disk drive (e.g., hard disk drive (“HDD”), floppy diskette, compact disk read only memory (“CD-ROM”), digital versatile disk (“DVD”)). Pursuant to one aspect, a display device 118 is coupled with the address/data bus 102, wherein the display device 118 is configured to display video and/or graphics. In an aspect, the display device 118 may include a cathode ray tube (“CRT”), liquid crystal display (“LCD”), field emission display (“FED”), plasma display, or any other display device suitable for displaying video and/or graphic images and alphanumeric characters recognizable to a user.


The computer system 100 presented herein is an example computing environment in accordance with an aspect. However, the non-limiting example of the computer system 100 is not strictly limited to being a computer system. For example, an aspect provides that the computer system 100 represents a type of data processing analysis that may be used in accordance with various aspects described herein. Moreover, other computing systems may also be implemented. Indeed, the spirit and scope of the present technology is not limited to any single data processing environment. Thus, in an aspect, one or more operations of various aspects of the present technology are controlled or implemented using computer-executable instructions, such as program modules, being executed by a computer. In one implementation, such program modules include routines, programs, objects, components and/or data structures that are configured to perform particular tasks or implement particular abstract data types. In addition, an aspect provides that one or more aspects of the present technology are implemented by utilizing one or more distributed computing environments, such as where tasks are performed by remote processing devices that are linked through a communications network, or such as where various program modules are located in both local and remote computer-storage media including memory-storage devices.


An illustrative diagram of a computer program product (i.e., storage device) embodying the present invention is depicted in FIG. 2. The computer program product is depicted as floppy disk 200 or an optical disk 202 such as a CD or DVD. However, as mentioned previously, the computer program product generally represents computer-readable instructions stored on any compatible non-transitory computer-readable medium. The term “instructions” as used with respect to this invention generally indicates a set of operations to be performed on a computer, and may represent pieces of a whole program or individual, separable, software modules. Non-limiting examples of “instruction” include computer program code (source or object code) and “hard-coded” electronics (i.e. computer operations coded into a computer chip). The “instruction” is stored on any non-transitory computer-readable medium, such as in the memory of a computer or on a floppy disk, a CD-ROM, and a flash drive. In either event, the instructions are encoded on a non-transitory computer-readable medium.


(3) SPECIFIC DETAILS OF THE INVENTION

Described is a system to detect attacks on nodes of wireless networks. It leverages the system described in U.S. application Ser. No. 14/625,988 (incorporated herein by reference in its entirety), which abstracts the details of the network stack and the physical layer into a mathematical representation of the relationships between network elements and services called the eXploitation network (Xnet). Leveraging of Xnet makes it possible to go beyond conventional methods for wireless networks.


Existing security solutions for mobile adhoc networks detect attacks at the level of network throughput statistics (i.e., at layer 2 and 3 of the 7-layer network stack) by anomaly detection. They look for protocol violations; hence, they are specific to certain protocols or known at-tack signatures. The technique according to some embodiments of the present disclosure analyzes network behavior with a holistic approach, from layer 2 to layer 7, which means that it looks at changes in applications and their dependencies with the lower networking layers. In particular, it analyzes network state based on “network controllability” analysis, which computes the minimal set of nodes (referred to as driver nodes) that is required to control the state of the entire network, and how that set changes over time. This process of analyzing a wireless network is distinct from any previously described methods of analysis.


Due to the dynamic nature of mobile wireless network topology, current techniques use protocols that are built on a model of implicit trust and sharing of control information, which makes them particularly hard to defend against attacks of misinformation. For instance, current network protocol stacks secure the transmission between pairs of nodes, but they can't avoid reliance on the information from other nodes (i.e., they can't avoid “network insider” attacks). A compromised node can send bad information to subvert the operation of the net-work (e.g., by advertising itself as the fastest route to get to every other node in the network, but throwing away every packet it gets, called a blackhole attack). This kind of attack does not violate protocol, so it is hard to detect with conventional techniques. The approach described herein can detect sources of misinformation in a holistic way, especially when multiple nodes are compromised. This technique can identify dynamic structure dependency changes in Xnet that can signal suspicious nodes.


Broadly speaking, the system according to embodiments of the present disclosure falls into a class of intrusion detection systems (IDS). Current approaches include the following. Signature detection finds specific attack patterns known a priori, but this is ineffective against unknown attacks. With anomaly detection, effective classifiers are hard to construct due to network dynamics and have low to moderate accuracy. An immunology intrusion detection system learns to identify behaviors that are foreign, but this approach is protocol specific, hard to formulate, and has a high overhead. Extended finite state machine (FSM) models detect explicit violations in protocol state transitions, but this is protocol and implementation specific.


No other approach uses graph-theoretic and information dynamics analysis to identify misbehaving nodes. Rather than looking for specific attack signatures that are protocol specific or based on low-level network statistics, the invention described herein looks at a higher level of behavior.


As described above, the eXploitation Network (Xnet) is a hierarchical model of a network (a network of networks) that provides three different views of the network, linked together by directional links. The network may be wired or wireless, and the topology may change dynamically. That is, nodes in the network can move, changing their pattern of connectivity to other nodes (i.e., MANET: Mobile AdHoc Network). Its nodes include the physical radios communicating on the network as well as conceptual nodes that represent applications and network services. Edges between nodes are created whenever one of these nodes sends data to another (just the start and end node, not the intermediate nodes that forward the message datagrams). An edge exists until the message reaches its destination.


As depicted in FIG. 10, the Xnet model includes at least four unique modules, including the Xnet Dynamics (XD)) module 1000, the Xnet Controllability/Observability (XCO) module 1002, the Xnet Evolvability (XE) module 1004, and (4) the Reliability Estimation (RE) module 1006. In various embodiments, different numbers of modules may be used to perform the same or similar functions. The XD module 1000 identifies unreliable nodes based on the dynamics of social networks (with no dependency on protocol) to indicate the presence of malicious or damaged nodes altering control and data plane information in the network. The XCO module 1002 identifies the optimal set of nodes required to passively monitor (observability) or actively probe (controllability) a suspected source of misinformation. These techniques require significantly fewer nodes (i.e., lower overhead than the prior art) to form a consensus on whether a suspected source of misinformation is malicious without compromising accuracy (increased probability of detection, lowered probability of false alarms). The XE module 1004 simulates a progression of failures to predict which nodes are most likely to be attacked next or should have trust reassessed. Finally, the RE module 1006 fuses cross-layer and cross-plane (control and data plane) information to identify suspicious nodes and improve reputation-based trust management. The unified trust metric is computed in a hybrid approach in which nodes combine normalized confidence and trust values based on direct experience and recommendations of other nodes. Such a hybrid approach avoids a centralized point of failure, ensures scalability, and renders the computation resilient to attacks targeting such computations. These modules are described in further detail below.


All modules communicate by annotations on Xnet. The XD module 1000 identifies nodes that appear to be misbehaving. The RE module 1006 gets a minimal set of driver and observer nodes from the XCO module 1002 for the suspect nodes. The RE module 1006 uses the driver nodes to do active probing on the suspect nodes, and the observer nodes update a trust metric with the results. The XE module 1004 simulates a spread of compromised nodes


The RE module 1006 formalizes and quantifies trust using a model that relies on local computations based on direct interactions with neighbors and also by incorporating recommendations (and experiences) of other nodes. A formal subjective logic and trust model is leveraged for principled combination of evidence about how trustworthy a node is. Resilience to attacks is gained by adopting a hybrid distributed approach to compute trust, avoiding a single point of failure, and the approach is agnostic to control and/or data plane statistics being used. When the RE module's 1006 trust in a node falls below a certain level, it performs active probing on the node. To do that most efficiently the XCO module 1002 computes a minimal set of driver nodes to issue the challenges and observer nodes to observe the results.


The system also employs a two-pronged approach to discover sources of misinformation in the network, employing information dynamics identification of suspicious changes in Xnet dependencies, as well as trends in the appearance of such compromised nodes. First the XD module 1000 uses a unique information dynamic spectrum framework to predict system instability at critical transitions in complex systems, by analyzing Xnet time series data. This marks nodes for further inspection by the RE module 1006. Second, the XE module 1004 tracks trends in misbehaving nodes, and matches against simulations of contagion and cascading failures. The XE module 1004 will emit a confidence measure as to whether there is a pattern, and if so, the RE module 1006 can focus monitoring and testing resources on predicted next nodes to be attacked. System Administrators can use this information to focus preventative measures.


Network controllability analysis, described in further detail below, expands the scope of analysis beyond the node's immediate neighborhood to data based on indirect observations inferred from the direct data that it collects. For example, by monitoring the characteristics of the packets that a node handles it can infer architectural and dynamical properties of the larger network, such as the network size and dimension, and the dynamics of the communication patterns between nodes and reachability and connectivity.


The system described herein can be implemented in a wide variety of mobile wireless networks, non-limiting examples of which include mobile military and law enforcement networks (e.g., soldier-to-soldier, sensor-to-sensor, ground and aerial vehicle-to-vehicle); commercial vehicle-to-vehicle and vehicle-to-infrastructure networks (e.g., DSRC V2V/V2I, WiFi, active safety, infotainment); commercial mesh networks (metropolitan rooftop, WiMAX); and wireless infrastructure ISPs, cellular companies (e.g., extended data capacity). The system will significantly improve the security of these and other related networks, which currently rely predominantly on packet-level encryption to reduce the probability of external intrusion but do not detect or prevent “network insider” attacks. Specific details regarding the system are described in further detail below.


(3.1) Concept of Operation


(3.1.1) Initialization Stage


During initialization, network administrators may configure each physical node of the network with compatible networking stacks, host and network services, applications, and other software necessary for the mission, including the proposed suite of modules with supporting configuration data. Then Xnet, the hierarchical representation of a communications network, may created, such as in the form of data tables that describe the applications and services that are running on the network, their inter-dependencies, and the observable characteristics of their behavioral dynamics under normal operation (e.g., node degree, traffic flow characteristics, topology). A Network Controllability (NC) code module (such as that referred to as XCO in U.S. patent application Ser. No. 14/625,988) receives the Application Dependency (AppDep) and Network Dependency (NetDep) graph from Xnet. For further details regarding Xnet, refer to U.S. patent application Ser. No. 14/625,988, which is hereby incorporated by reference in its entirety.


(3.1.2) Network Updates


While the analysis is in operation, public domain tools, such as NSDMiner (a technique for automatically discovering: network service dependencies from passively observed network traffic) and Ettercap (an open source network security tool for attacks on local area networks (LANs)), are used to read the headers on message packets and infer the ultimate start and destination of the messages. These inferred events are identified by a start and end time, and a start node and destination node. As each event Ei is received, it is added to the Xnet 300 graph as an edge between the identified start node and destination node. Any event that did not start before or at the start of Ei and end after the end of Ei is removed. Then, a controllability analysis is performed on that graph.


The term “graph” in the context above refers to the abstract mathematical representation of the relationship between communicating entities in a physical network. Furthermore, in this context, “node” means an element in the graph. However, in another context “node” may reference a physical radio in the network. The term “network” most often refers to a physical network.



FIG. 3 depicts the construction of Xnet 300. The baseline Exploitation Network (Xnet 300) database is loaded into the network at initialization. In this context, the network is a physical radio network. Each physical radio node gets all or a portion of the Xnet database, where the Xnet database is the physical instantiation of the abstract graph of Xnet 300. An application (AppDep) dependency graph 302 and a network (NetDep) dependency graph 304, and their interdependencies (represented by dashed lines), are established a priori using expert domain knowledge or by automated inference using public domain tools, such as NSDMiner and Ettercap. Interdependencies between the AppDep dependency graph 302, the NetDep dependency graph 304, and the network topology (NetTopo) dependency graph 306 are based on the software configuration in the network. Significantly, the “nodes” on the left side of FIG. 3 (Entity/Relationship Network of Networks Analysis) represent physical radio nodes, while the “nodes” depicted in the Xnet 300 represent abstract nodes in the graph.


(3.2) Network Controllability


Network controllability analysis determines the minimal set of nodes required to control the global state of the network. In an embodiment of the present disclosure, a maximum matching algorithm (see Literature Reference Nos. 1 and 2 for a description of the maximum matching algorithm) is employed to compute controllability. The minimum number of inputs required to control the network (ND, or number of driver nodes) is given by the total number of nodes minus the number of nodes in the maximum matching set These nodes (that are members of the minimal set of nodes required to control the global state of the network) are called “driver nodes”. Once the Xnet 300 is constructed, many standard network science algorithms may be computed on the Xnet 300 representation. Non-limiting examples of such algorithms (metrics) are listed below in Table 1 below. For instance, different types of centrality measurements (e.g., degree, closeness, betweenness (see Literature Reference No. 14 for a description of the aforementioned measurements)) can be used as such an algorithm or metric. Network controllability metrics are computed on a graphical representation of a pattern of communication between nodes during a time window, where the network events contained in the graph start before or at the start of a particular network event and end before the end of that particular network event A unique aspect of the approach described in the present disclosure is to analyze the wireless network activity by looking at the change in global and local controllability metrics, such as those listed in Table 1 below, over time. Table 1 includes examples of controllability metrics used for attack detection and attribution.









TABLE 1







Global metrics








N
number of nodes


E
number of edges


ND
number of driver nodes = total # nodes minus the cardinality of



the maximum matching set


<k>
mean degree


nD
fraction of driver nodes = ND/N


ns
fraction of source nodes with in-degree 0.


ne
fraction of external dilations (a sink node; always a destination,



never a source)


ni
fraction of internal dilations, which is driver nodes ND that are



not solely sources or sinks.


nIc
fraction of type-I critical nodes. Its removal will increase ND.


nIr
fraction of type-I redundant nodes. Its removal will decrease ND.


nIo
fraction of type-I ordinary nodes. Its removal will not change



ND.


nIIc
fraction of type-II critical nodes. They are always be driver



nodes.


nIIr
fraction of type-II redundant nodes. They will never be driver



nodes.


nIIo
fraction of type-II ordinary nodes. They are neither critical nor



redundant.


lc
fraction of critical links. They belong to all maximum



matchings.


lr
fraction of redundant links. They do not belong to any maximum



matching.


lo
fraction of ordinary link. They are neither critical nor



redundant.


<lcc>
average local clustering coefficient (undirected)


gcc
global clustering coefficient (undirected)


<lccd>
average local clustering coefficient (directed)


gccd
global clustering coefficient (directed)


<BC>
average betweenness centrality (undirected)


<BCd>
average betweenness centrality (directed)


<CL>
average closeness centrality (undirected)


<CLd>
average closeness centrality (directed)


<AC>
average authority centrality


<HC>
average hub centrality


<Cc>
average control centrality







Local metrics








Cc(i)
control centrality of node i


BC(i)
betweenness centrality of node i


CL(i)
closeness centrality of each node i


AC(i)
authority centrality of each node i


HC(i)
hub centrality of each node i


BCd (i)
(directed) betweenness centrality of each node i


CLd (i)
(directed) closeness centrality of each node i









(3.3) Attack Detection and Attribution Using Controllability Analysis



FIGS. 4A and 4B illustrate two metrics computed for a baseline 25 node scenario (in FIG. 4A) and for a flooding attack in an Army Research Lab 25 node scenario (in FIG. 4B). The metrics are ne (fraction of eternal dilations) in the top rows of FIGS. 4A and 4B and AC(i) (authority centrality of each node) in the bottom rows of FIGS. 4A and 4B. The results shown are from a flooding attack in transmission control protocol (TCP) traffic from 20% of the nodes in the network to a single node, starting at 100 seconds and lasting 130 seconds. Background traffic in this example was generated by a public domain program called MGEN developed by the Naval Research Laboratory (NRL) PROTocol Engineering Advanced Networking (PROTEAN) Research Group. MGEN provides the ability to perform IP network performance tests and measurements using TCP and user datagram protocol (UDP)/Internet protocol (IP) traffic. Here, the network metrics in hypertext transfer protocol (HTTP) traffic are shown. When the flooding attack occurs (shown in FIG. 4B), both the global network metric ne and the local network metric AC(i) display abnormal behavior compared to the baseline performance shown in FIG. 4A. The abnormality is apparent in the absence of metric values greater than zero in the simulation between 100 and 225 seconds. The next paragraph describes how such a noisy graph can be smoothed to make the metric a definitive signal when the smoothed values reach zero.


Note that in FIGS. 4A and 4B, as in most metric plots, the metric values can vary in a noisy way, so it is necessary to smooth the graph by same technique, such as a median filter. Then, a threshold can be selected such that there is a clear difference between the attack behavior and the baseline behavior. For example, in FIGS. 4A and 4B, both metrics actually go to zero around time 100 seconds for both the baseline (FIG. 4A) and attack (FIG. 4B). However, the baseline gap is quite short. The smoothing filter should be configured so as to smooth over such a short time gap. An automated machine learning system can be used to discover appropriate thresholds, given examples of smoothed baseline and attack metric data. In an embodiment of the present disclosure, a support vector machine (SVM) was used for this purpose, although there are many other machine learning methods that could be applied. A SVM can learn to find a plane in feature hyperspace that can separate examples of baseline performance (FIG. 4A) from attack behavior (FIG. 4B), as depicted in FIG. 5B.



FIG. 5A illustrates the training process 500 and the subsequent online classification/detection process 502. A non-limiting example of the use of a SVM to find a threshold to classify baseline vs. attack behavior based on network controllability metrics on network communication activity is shown. Baseline activity is captured by running the network in the absence of attacks. XAE 504 is an Xnet Analytics Engine, which turns the raw network packet data of training scenarios 506 to an Xnet graph. The Xnet graph contains the NC module that extracts feature vectors 508 from the Xnet graph, which are the controllability metrics (currently 35 metrics), such as those listed in Table 1 above. The feature vectors 508 will most conveniently be captured offline and stored as one vector of all metric values for each time window, resulting in a matrix when the feature vectors 508 for various time windows are captured and combined. Additionally, examples are provided of attacks by performing attacks on the baseline scenarios, and again running them through XAE 504 to extract feature vectors 508. Then, the SVM (i.e., svm_learn 510) is trained by presenting each feature vector 508 along with a binary vector indicating, for each time period, whether an attack is present or not, resulting, in a trained classifier model 512. Once the SVM (i.e., svm_learn 510) is trained, it can be run during live online network operation (live online data 511) and will indicate when an attack is occurring in the classification/detection process 502. Specifically, during normal online operation, the XAE system 514 is used to extract sampled features 516 from current raw network packet data which, along with the trained model 512, is input to the SVM which can then be used to classify (i.e., svm_classify 518) the sampled features 516 and make a prediction 520 regarding whether an attack is present (i.e., good) or not (i.e., bad). The features that are output by XAE (508 during training and 516 when online testing) are one from each of the metrics in Table 1, smoothed as described above, and turned into a binary indication of whether the value is anomalous or not anomalous. This could be visualized as a visual panel of dots or diodes depicting a specific pattern to indicate whether an attack is present or not, and what kind of attack it is.



FIG. 5B depicts how the SVM learns to find a plane 520 in such a feature space 522 from an input space 524. The plane 520 can separate examples of baseline performance 526 from examples of attack behavior 528. An SVM is applied using a known kernel Φ 530 (e.g., see equation in FIG. 5B). The kernel is a similarity function over pairs of data points (i.e., between a labeled training set point and an unlabeled test point). Training is done by presenting examples of attacks and examples of baseline (without attacks). The SVM learns to separate attack situations from baseline by finding weights that can be described as defining a hyperplane separating baseline from attacks. Subsequently, one applies the trained model and uses the similarity function (kernel Φ 530) to classify the new unlabeled inputs as more similar to the attacks or the baseline points. In FIG. 5B, each circle represents a data point. Specifically, each data point is a value of the current 35-element feature vector.


Users can view each network metric as a “diode”, and the 35 network metrics can be displayed in a panel, such as those shown in FIG. 6A through FIG. 9. When an attack occurs, a particular set of diodes will light up or change colors. This pattern can be used for efficient attack detection and attribution. The network metrics in Table 1 can be applied to different networking protocol layers (e.g., UDP, TCP, HTTP) and the resulting binary “anomaly/no-anomaly” outputs for each of the protocol layers can be displayed in separate panels. Different layers (i.e., different network protocols) might yield different patterns. FIG. 9 illustrates separate panels for HTTP, TCP, and connections layers of the network. Combining all diode patterns from different layers enables one to perform attack detection and attribution more accurately.



FIGS. 6A and 6B show an example of a diode pattern for attack detection and attribution using all the 35 network metrics, where each diode (circle) represents a network metric. Attribution during a network attack means identifying the attacking nodes. Specifically, FIG. 6A depicts 35 network metrics for baseline activity, and FIG. 6B depicts 35 network metrics during an HTTP flooding attack. A flooding attack causes nodes to broadcast messages, effectively using up the network bandwidth so that legitimate messages cannot get through. Those network metrics displaying abnormal behavior when the attack occurs are highlighted. In FIG. 6B (and similar figures), global and local metrics are represented by pattern filled circles 600 and solid filled circles 602, respectively.



FIG. 7A illustrates 35 network metrics for baseline activity, and FIG. 7B illustrates 35 network metrics during a drop-all attack. In a dropping attack, a node advertises itself as the shortest path to everywhere and then drops any packets it is asked to route to other nodes.



FIG. 8A illustrates 35 network metrics for baseline activity, and FIG. 8B depicts 35 network metrics during a reset-all attack. A reset attack is a man-in-the-middle attack where the attackers are destroying active TCP connections that they are aware of by sending forged TCP reset packets to the involved parties. This causes both of the participants in the TCP connection to believe that the other terminated the TCP connection.


The seven outlined nodes in each of FIGS. 6B, 7B, and 8B represent local metrics identified in Table 1 above. The other nodes represent global metrics. The different patterns in FIGS. 6B, 7B, and 8B reflects the fact that each attack affects the network differently. Each metric measures a different aspect of network activity, so the patterns made in the panel of metrics is significantly indicative of different attacks. That is why it is useful to employ many metrics.



FIG. 9 summarizes results of attack detection and attribution for all the three attack models: flooding, drop-all and reset-all, using three different layers: HTTP, TCP, and IP connections. All three layers are considered to be sublayers of NetDep (element 304) in FIG. 3.


Mobile wireless networks are experiencing widespread use in applications such as mobile vehicle-to-vehicle networks, user-to-user networks, sensor-to-sensor networks, vehicle-to-infrastructure networks, commercial mesh networks, wireless infrastructure Internet service providers (ISPs), and cellular companies. The system according to embodiments of the present disclosure will significantly improve the security of these and other related networks, which currently rely predominantly on packet-level encryption to reduce the probability of external intrusion but do not detect or prevent “network insider” attacks.


In an embodiment, after identifying the presence of misinformation in the network, the system performs an operation to attribute who is responsible for the attack. After attributing the attack to an entity, the system can take actions to mitigate the attack. A non-limiting example of a mitigation action would be to isolate the attacking node (i.e., physical radio). For example, the action can include informing every other node in the network to simply ignore anything that the attacking node transmits, and not to send anything to, or through, the attacking node.


Implementation of the system described herein takes the form of a set of algorithms that provides rapid and accurate detection and prediction of sources of misinformation in the control plane of a wireless network. The algorithms/modules are protocol agnostic characteristics of the tool that will enable its transition into a wide variety of network security systems, including both wireless and wired networks. Furthermore, the inherent scalability of the approach makes it well-suited to operate effortlessly in much larger networks.


Finally, while this invention has been described in terms of several embodiments, one of ordinary skill in the art will readily recognize that the invention may have other applications in other environments. It should be noted that many embodiments and implementations are possible. Further, the following claims are in no way intended to limit the scope of the present invention to the specific embodiments described above. In addition, any recitation of “means for” is intended to evoke a means-plus-function reading of an element and a claim, whereas, any elements that do not specifically use the recitation “means for”, are not intended to be read as means-plus-function elements, even if the claim otherwise includes the word “means”. Further, while particular method steps have been recited in a particular order, the method steps may occur in any desired order and fall within the scope of the present invention.

Claims
  • 1. A system for detecting attacks of misinformation on communication networks, the system comprising: one or more processors and a non-transitory memory having instructions encoded, thereon such that when the instructions are executed, the one or more processors perform operations of:computing a plurality of network controllability metrics on a representation of a communication network comprising a plurality, of nodes;detecting changes in the plurality of network controllability metrics; andusing the detected changes to detect attacks of misinformation on the communication network.
  • 2. The system as set forth in claim 1, wherein the representation includes network topology, network dependencies, and application dependencies within the communication network.
  • 3. The system as set forth in claim 1, wherein the plurality of network controllability metrics are computed as a function of a pattern of communication between a plurality of nodes of the communication network during a given time window.
  • 4. The system as set forth in claim 1, wherein given a set of examples of network controllability metric data representing a baseline behavior and a set of examples of network controllability metric data representing an attack behavior, a machine learning classifier determines a threshold for attack detection based on differences between the baseline behavior and the attack behavior.
  • 5. The system as set forth in claim 4, wherein each network controllability metric is represented as a diode in a diode pattern panel, wherein network controllability metrics displaying attack behavior, as determined by the threshold for attack detection, are highlighted in the diode pattern panel.
  • 6. A computer-implemented method for detecting attacks of misinformation on communication networks, comprising: an act of causing one or more processors to execute instructions stored on a non-transitory memory such that upon execution, the one or more processors perform operations of:computing a plurality of network controllability metrics on a representation of a communication network comprising a plurality of nodes;detecting changes in the plurality of network controllability metrics; andusing the detected changes to detect attacks of misinformation on the communication network.
  • 7. The method as set forth in claim 6, wherein the representation includes network topology, network dependencies, and application dependencies within the communication network.
  • 8. The method as set forth in claim 6, wherein the plurality of network controllability metrics are computed as a function of a pattern of communication between a plurality of nodes of the communication network during a given time window.
  • 9. The method as set forth in claim 6, wherein given a set of examples of network controllability metric data representing a baseline behavior and a set of examples of network controllability metric data representing an attack behavior, a machine learning classifier determines a threshold for attack detection based on differences between the baseline behavior and the attack behavior.
  • 10. The method as set forth in claim 9, wherein each network controllability metric is represented as a diode in a diode pattern panel, wherein network controllability metrics displaying attack behavior, as determined by the threshold for attack detection, are highlighted in the diode pattern panel.
  • 11. A computer program product for detecting attacks of misinformation on communication networks, the computer program product comprising: computer-readable instructions stored on a non-transitory computer-readable medium that are executable by a computer having one or more processors for causing the processor to perform operations of:computing a plurality of network controllability metrics on a representation of a communication network comprising a plurality of nodes;detecting changes in the plurality of network controllability metrics; andusing the detected changes to detect attacks of misinformation on the communication network.
  • 12. The computer program product as set forth in claim 11, wherein the representation includes network topology, network dependencies, and application dependencies within the communication network.
  • 13. The computer program product as set forth in claim 11, wherein the plurality of network controllability metrics are computed as a function of a pattern of communication between a plurality of nodes of the communication network during a given time window.
  • 14. The computer program product as set forth in claim 11, wherein given a set of examples of network controllability metric data representing a baseline behavior and a set of examples of network controllability metric data representing an attack behavior, a machine learning classifier determines a threshold for attack detection based on differences between the baseline behavior and the attack behavior.
  • 15. The computer program product as set forth in claim 14, wherein each network controllability metric, is represented as a diode in a diode pattern panel, wherein network controllability metrics displaying attack behavior, as determined by the threshold for attack detection, are highlighted in the diode pattern panel.
  • 16. The system as set forth in claim 1, wherein upon detection of an attack of misinformation on the communication network, the one or more processors further perform an operation of performing a mitigation action.
  • 17. The system as set forth in claim 16, wherein the mitigation action comprises isolating an attacking node from the rest of the communication network.
  • 18. The system as set forth in claim 17, wherein the mitigation action comprises informing every other node in the communication network to ignore anything that the attacking node transmits, and not to send anything to, or through, the attacking node.
  • 19. The system as set forth in claim 1, wherein the one or more processors further perform operations of: outputting features representing each of the plurality of network controllability metrics;converting each feature into a binary indication of whether a value is anomalous or not anomalous: andusing the binary indication to detect changes in the plurality of network controllability metrics.
  • 20. The system as set forth in claim 1, wherein the representation is a graphical representation of network topology, network dependencies, and application dependencies within the communication network
  • 21. The system as set forth in claim 1, wherein the plurality of network controllability metrics are computed on a graphical representation of a pattern of communication between a plurality of nodes of the communication network during a given time window.
CROSS-REFERENCE TO RELATED APPLICATIONS

This is a Continuation-in-Part Application of U.S. application Ser. No. 14/625,988, filed on Feb. 19, 2015, entitled, “System and Method for Determining Reliability of Nodes in Mobile Wireless Network,” which is a Non-Provisional Patent Application of U.S. Provisional Application No. 61/941,893, filed on Feb. 19, 2014, entitled, “System and Method to Quantify Reliability of Nodes in Mobile Wireless Networks,” the entirety of which are incorporated by reference. U.S. application Ser. No. 14/625,988 is also a Continuation-in-Part Application of U.S. application Ser. No. 14/209,314, filed on Mar. 13, 2014, entitled, “Predicting System Trajectories Toward Critical Transitions,” which is a Continuation-in-Part Application of U.S. application Ser. No. 13/904,945, filed on May 29, 2013, entitled, “Detection and Identification of Directional Influences Using Dynamic Spectrum,” the entirety of which are incorporated herein by reference. U.S. application Ser. No. 14/209,314 is a Non-Provisional Patent Application of U.S. Provisional Application No. 61/784,167, filed on Mar. 14, 2013, entitled, “Predicting System Trajectories Toward Critical Transitions,” the entirety of which are incorporated herein by reference. U S. application Ser. No. 13/904,945 is a Continuation-in-Part Application of U.S. application Ser. No. 13/748,223, filed on Jan. 23, 2013, entitled, “Early Warning Signals of Complex Systems,” which is a Non-Provisional Patent Application of U.S. Provisional Application No. 61/589,634, filed on Jan. 23, 2012, entitled, “Early Warning Signals of Complex Systems,” and U.S. Provisional Application No. 61/589,646, filed on Jan. 23, 2012, entitled, “System and Method for Cyber Infrastructure Protection from Insider Threats,” the entirety of which are incorporated herein by reference. U.S. application Ser. No. 13/904,945 is also a Non-Provisional Patent Application of U.S. Provisional Application No. 61/694,510, filed on Aug. 29, 2012, entitled, “Detection and Identification of Directional. Influences Using Dynamic Spectrum,” the entirety of which are incorporated herein by reference. This is ALSO a Non-Provisional Patent Application of U.S. Provisional Patent Application No. 62/135,142 filed Mar. 18, 2015, entitled, “System and Method to Detect Attacks on Mobile Wireless Networks Based on Network Controllability Analysis,” the entirety of which is incorporated herein by reference. This is ALSO Non-Provisional Patent Application of U.S. Provisional Patent Application No. 62/135,136 filed Mar. 18, 2015, entitled, “System and Method to Detect Attacks on Mobile Wireless Networks Based on Motif Analysis,” the entirety of which is incorporated herein by reference.

GOVERNMENT LICENSE RIGHTS

This invention was made with government support under U.S. Government Contract Number AFRL FA8750-14-C-0017. The government has certain rights in the invention.

Provisional Applications (7)
Number Date Country
61941893 Feb 2014 US
61784167 Mar 2013 US
61589634 Jan 2012 US
61589646 Jan 2012 US
61694510 Aug 2012 US
62135142 Mar 2015 US
62135136 Mar 2015 US
Continuation in Parts (4)
Number Date Country
Parent 14625988 Feb 2015 US
Child 15075058 US
Parent 14209314 Mar 2014 US
Child 14625988 US
Parent 13904945 May 2013 US
Child 14209314 US
Parent 13748223 Jan 2013 US
Child 13904945 US