1. Technical Field
The present invention relates to computer security and, more particularly, to modeling instances and targets for in-progress attacks using probabilistic game theory.
2. Description of the Related Art
A large increase in the frequency of cybersecurity attacks has prompted industry and academia to find new ways to respond to the threat. Defensive mechanisms have been proposed in an attempt to detect and prevent attackers from reaching their targets, e.g., servers that store high-value data. In practice, large networks can have hundreds of high-value servers, each one a possible target of attack, thus making it difficult to determine the goal of a targeted attacker and to respond appropriately.
In an enterprise network, which may include hundreds of thousands of network entities such as laptops, desktop computers, and servers, the network entities can be categorized into different classes. For example, an entity may be a web server, an SQL server, a user terminal, etc. In a strongly connected network, the removal of a small number of connections will not partition the network into isolated parts. At present, however, detection and response systems do not provide adequate insight to system operators as to how best to respond to a strategic attacker. In real-life networks, targets are numerous and easily reachable, making existing approaches that assume a small target set impractical to use.
A method for determining cyber-attack targets includes collecting and storing network event information from a plurality of sensors to extract information regarding an attacker; forming an attack scenario tree that encodes network topology and vulnerability information including a plurality of paths from known compromised nodes to a set of potential targets; calculating a likelihood for each of the plurality of paths using a processor; and calculating a probability distribution for the set of potential targets to determine which potential targets are most likely pursued by the attacker.
A method for determining cyber-attack targets includes collecting and storing network event information from a plurality of sensors to extract information regarding an attacker; forming an attack scenario tree that encodes network topology and vulnerability information including a plurality of paths from known compromised nodes to a set of potential targets; calculating a probability distribution over a set of nodes and node vulnerability types already accessed by the attacker using a processor; determining a network graph edge to remove that minimizes a defender's expected uncertainty over the potential targets; and removing the determined network graph edge.
A method for determining cyber-attack targets includes collecting and storing network event information from a plurality of sensors to extract information regarding an attacker; forming an attack scenario tree that encodes network topology and vulnerability information including a plurality of paths from known compromised nodes to a set of potential targets; calculating a likelihood for each of the plurality of paths using a processor; and calculating a probability distribution for the set of potential targets to determine which potential targets are most likely pursued by the attacker; calculating a probability distribution over a set of nodes and node vulnerability types already accessed by the attacker; determining a network graph edge to remove which minimizes a defender's expected uncertainty over the potential targets; and removing the determined network graph edge.
A system for determining cyber-attack target includes a network monitor module configured to collect network event information from sensors in one or more network nodes; a processor configured to extract information regarding an attacker from the network event information, to form an attack scenario tree that encodes network topology and vulnerability information including a plurality of paths from known compromised nodes to a set of potential targets, to calculate a likelihood for each of the plurality of paths, and to calculate a probability distribution for the set of potential targets to determine which potential targets are most likely pursued by the attacker.
A system for determining cyber-attack targets includes a network monitor module configured to collect network event information from sensors in one or more network nodes; a processor configured to extract information regarding an attacker from the network event information, to form an attack scenario tree that encodes network topology and vulnerability information including a plurality of paths from known compromised nodes to a set of potential targets, to calculate a probability distribution over a set of nodes and node vulnerability types already accessed by the attacker, and to determine a network graph edge to remove that minimizes a defender's expected uncertainty over the potential targets; and a network management module configured to remove the determined network graph edge.
A system for determining cyber-attack target includes a network monitor module configured to collect network event information from sensors in one or more network nodes; a processor configured to extract information regarding an attacker from the network event information, to form an attack scenario tree that encodes network topology and vulnerability information including a plurality of paths from known compromised nodes to a set of potential targets, to calculate a likelihood for each of the plurality of paths, and to calculate a probability distribution for the set of potential targets to determine which potential targets are most likely pursued by the attacker, to calculate a probability distribution over a set of nodes and node vulnerability types already accessed by the attacker, and to determine a network graph edge to remove that minimizes a defender's expected uncertainty over the potential targets; and a network management module configured to remove the determined network graph edge.
These and other features and advantages will become apparent from the following detailed description of illustrative embodiments thereof, which is to be read in connection with the accompanying drawings.
The disclosure will provide details in the following description of preferred embodiments with reference to the following figures wherein:
The present principles employ game theory to predict attacker targets. Using a probabilistic model of attacker behavior, the interactions between a network defender and attacker are modeled, allowing the defender to anticipate future steps of the attack and identify the most likely attack targets based on the observed network events. The present principles use attack scenario trees which represent the possible sequences of high-level attack steps that can be executed at different nodes of the network. This approach differs from the attack-response trees used previously, which represent attack steps within a single network host. Attack scenario trees can be constructed based on past incident reports.
The interaction between the defender and the attacker is modeled as a two-player Stackelberg game. The defender can use the model to further decrease uncertainty about attack target predictions by blocking specific network paths (and indirectly any attack steps that traverse those paths) and influencing the attacker to reveal their intentions while conducting the attack. This allows defenders the benefit of proactively blocking future attack steps.
Referring now to
In the present example, an attacker 102 may approach network 100 from one of three externally available systems: web server 104, file server 106, and an email or web client 108. Each of these points of attack has an associated vulnerability. For example, web server 104 may be vulnerable to an SQL injection attack 116, and file server 106 may be vulnerable to a respective file server attack 114. Compromising either of these nodes may give access to active directory (AD) server 110, allowing the attacker 102 to gain access to AD credentials 120. The attacker 102 can then use the AD credentials to install a remote access tool 122 on a target device 112. As an alternative approach, the attacker could stage a phishing attack 118 at a web or email client 108, allowing the attacker 102 to steal a user's credentials 124. Either approach will allow the attacker to obtain access 126 to data or services. It should be noted that any given node may be an AND node, where all of the nodes leading up to it must be reached before accessing the AND node, an OR node where any one input node may be reached, or the nodes may implement any other condition or combination of conditions.
The present principles allow a defender to monitor the attack in-progress and provide probabilistic information regarding likely attack paths and targets. By blocking the attacker's access to particular connections, the attack scenario tree can be trimmed and certainty regarding the attacker's goals can be increased.
To formalize the model, the network 100 may be represented as a graph G=V,E, where the nodes V correspond to the services and machines in the network (for example, a web server 104, an SQL server 109, user machines 112), and the edges E correspond to the connections between them. Each node νεV belongs to a certain type θ(ν), where θ:V→Θ. The node types incorporate the information about node vulnerabilities that can be exploited by the attacker 102. The attack scenario trees are thus constructed of elements of the type set Θ. The set of all attack scenario trees known to the defender is denoted by S={s}.
Cyber attacks can be assessed from either the point of view of the defender or from the point of view of the attacker. To the defender, the attacks are paths in the network graph G from the attacker's starting point νaεV to a target tεV. From the point of view of the attacker, an attack is a path from one of the leaves to the root of an attack scenario tree sεS. Since s is composed of node types θ(ν) and not of specific nodes, each path in an attack scenario tree s can correspond to multiple paths in G as long as the sequence of node types in G matches the types in scenario s. Suppose I(s) is the set of all paths in G corresponding to a scenario s=θq
Referring now to
It can be assumed that the attacker always has control over the starting node 202 νaεV. As the attacker advances towards one of the target nodes 206 t, the set of active nodes a⊂V which are controlled by the attacker expands until tεA. This process of attacker's expansion over the nodes in V corresponds to a simultaneous expansion of the set ⊂Θ of active node types in the attack scenario tree until includes the root of the tree. Inferring a probability distribution over the possible sets A and helps in predicting the attack targets.
As noted above, the interaction between the defender and the attacker is modeled as a two-player Stackelberg game. A Stackelberg two-player game models strategic interaction between two intelligent agents, designated the leader and the follower. Each player has a finite set of actions to choose from. The leader's set of actions is marked as Al, and the follower's set of actions is marked as Aƒ. A pair of actions (al, aƒ) chosen by the players is called the outcome of the game. The players' utilities are functions of the outcome. The leader's utility function is denoted by uƒ(al, aƒ) and the follower's utility function by ul(al, aƒ). The game proceeds as follows. First, the leader chooses (or commits to) a mixed strategy having a probability distribution over the actions in Al. Then, the follower observes the distribution and chooses a best-response which, generally, can be a probability distribution over Aƒ. There is always a pure-strategy best response for the follower. In other words, one of the optimal best-responses is a degenerate distribution. Moreover, an optimal mixed strategy for the leader can be computed in polynomial time using linear programming techniques.
For the purpose of computing an optimal leader's strategy to commit to, the follower's preferences can be represented by choosing an action for each distribution over the leader's actions, instead of using the follower's utility function uƒ. The follower's preferences are written as a mapping ƒ:σ(Al)→Aƒ, where the operator σ(·) denotes a set of distributions over a given finite set.
The present principles provide an extension to the described Stackelberg model in which the follower's preferences are described by a mapping ƒ:θ(Al)→(Aƒ). In other words, if the leader commits to a mixed strategy slεσ(Al) then the follower plays a mixed strategy ƒ(sl). This extension is called herein a probabilistic Stackelberg model.
A probabilistic Stackelberg model can be used to describe irrational behavior of the follower. The present principles provide a probabilistic Stackelberg model in which the function ƒ is a linear mapping from the vector of probabilities describing the leader's mixed strategy to the vector of probabilities describing the follower's mixed strategy.
Table 1 shows the potential increase in utility that the leader can achieve by considering a probabilistic follower model rather than assuming that the follower is perfectly rational and optimizing a known utility function is shown. In the following two-player normal-form game, the leader is the row player and the follower is the column player. The leader has two strategies U,D, and the follower has two strategies L,R. Each cell in the table shows the leader's and the follower's utility for the corresponding choice of actions. If the follower is perfectly rational and always chooses the action that maximizes its utility, the optimal strategy to commit to for the leader is
As a result, the follower best-responds with R, and the leader gets a utility of approximately 3.5.
Note that the follower best-responds with L if the follower plays U with probability 0.5 or higher, and the follower plays D otherwise. However, if the follower is not perfectly rational, the leader's optimal strategy may be different. For example, consider the case in which the follower is actually playing accordingly to a quantal response model. In a quantal response model, each strategy is played with a positive probability proportional to eλuf. Setting λ=5 and assuming that the leader is playing the Stackelberg strategy
the leader can get an even higher utility by deviating to play D more frequently, because the follower will still play R with a relatively high probability after such deviation. This example demonstrates the potential benefits of using a probabilistic follower model, whether derived from the follower's utility function or defined directly as a probability function on the set of the defender's actions.
Referring now to
Here, p(oq
l(π1:i|1:j)=max(l(π1:i−1|1:j−1)p(φj|pi),
l(π1:i|1:j−1))
Using l, block 308 computes a probability distribution over the targets t ε V under the assumption that the attacker will follow one of the attack scenario trees in S according to the estimated distribution over the active node types and the active network nodes A. The distribution over the attack targets is estimated using Monte Carlo simulation:
Here, P[t|A] is estimated by simulating the attack steps according to the corresponding attack scenario tree starting with active node types . The method of
As will be appreciated by one skilled in the art, aspects of the present invention may be embodied as a system, method or computer program product. Accordingly, aspects of the present invention may take the form of an entirely hardware embodiment, an entirely software embodiment (including firmware, resident software, micro-code, etc.) or an embodiment combining software and hardware aspects that may all generally be referred to herein as a “circuit,” “module” or “system.” Furthermore, aspects of the present invention may take the form of a computer program product embodied in one or more computer readable medium(s) having computer readable program code embodied thereon.
Any combination of one or more computer readable medium(s) may be utilized. The computer readable medium may be a computer readable signal medium or a computer readable storage medium. A computer readable storage medium may be, for example, but not limited to, an electronic, magnetic, optical, electromagnetic, infrared, or semiconductor system, apparatus, or device, or any suitable combination of the foregoing. More specific examples (a non-exhaustive list) of the computer readable storage medium would include the following: an electrical connection having one or more wires, a portable computer diskette, a hard disk, a random access memory (RAM), a read-only memory (ROM), an erasable programmable read-only memory (EPROM or Flash memory), an optical fiber, a portable compact disc read-only memory (CD-ROM), an optical storage device, a magnetic storage device, or any suitable combination of the foregoing. In the context of this document, a computer readable storage medium may be any tangible medium that can contain, or store a program for use by or in connection with an instruction execution system, apparatus, or device.
A computer readable signal medium may include a propagated data signal with computer readable program code embodied therein, for example, in baseband or as part of a carrier wave. Such a propagated signal may take any of a variety of forms, including, but not limited to, electro-magnetic, optical, or any suitable combination thereof. A computer readable signal medium may be any computer readable medium that is not a computer readable storage medium and that can communicate, propagate, or transport a program for use by or in connection with an instruction execution system, apparatus, or device.
Program code embodied on a computer readable medium may be transmitted using any appropriate medium, including but not limited to wireless, wireline, optical fiber cable, RF, etc., or any suitable combination of the foregoing. Computer program code for carrying out operations for aspects of the present invention may be written in any combination of one or more programming languages, including an object oriented programming language such as Java, Smalltalk, C++ or the like and conventional procedural programming languages, such as the “C” programming language or similar programming languages. The program code may execute entirely on the user's computer, partly on the user's computer, as a stand-alone software package, partly on the user's computer and partly on a remote computer or entirely on the remote computer or server. In the latter scenario, the remote computer may be connected to the user's computer through any type of network, including a local area network (LAN) or a wide area network (WAN), or the connection may be made to an external computer (for example, through the Internet using an Internet Service Provider).
Aspects of the present invention are described below with reference to flowchart illustrations and/or block diagrams of methods, apparatus (systems) and computer program products according to embodiments of the invention. It will be understood that each block of the flowchart illustrations and/or block diagrams, and combinations of blocks in the flowchart illustrations and/or block diagrams, can be implemented by computer program instructions. These computer program instructions may be provided to a processor of a general purpose computer, special purpose computer, or other programmable data processing apparatus to produce a machine, such that the instructions, which execute via the processor of the computer or other programmable data processing apparatus, create means for implementing the functions/acts specified in the flowchart and/or block diagram block or blocks.
These computer program instructions may also be stored in a computer readable medium that can direct a computer, other programmable data processing apparatus, or other devices to function in a particular manner, such that the instructions stored in the computer readable medium produce an article of manufacture including instructions which implement the function/act specified in the flowchart and/or block diagram block or blocks. The computer program instructions may also be loaded onto a computer, other programmable data processing apparatus, or other devices to cause a series of operational steps to be performed on the computer, other programmable apparatus or other devices to produce a computer implemented process such that the instructions which execute on the computer or other programmable apparatus provide processes for implementing the functions/acts specified in the flowchart and/or block diagram block or blocks.
The flowchart and block diagrams in the Figures illustrate the architecture, functionality, and operation of possible implementations of systems, methods and computer program products according to various embodiments of the present invention. In this regard, each block in the flowchart or block diagrams may represent a module, segment, or portion of code, which comprises one or more executable instructions for implementing the specified logical function(s). It should also be noted that, in some alternative implementations, the functions noted in the block may occur out of the order noted in the figures. For example, two blocks shown in succession may, in fact, be executed substantially concurrently, or the blocks may sometimes be executed in the reverse order, depending upon the functionality involved. It will also be noted that each block of the block diagrams and/or flowchart illustration, and combinations of blocks in the block diagrams and/or flowchart illustration, can be implemented by special purpose hardware-based systems that perform the specified functions or acts, or combinations of special purpose hardware and computer instructions.
As the defender observes network events and collects information about the progress and location of attacker 102, the defender can modify the network 100 in order to force the attacker 102 to reveal the intended target 112 of the attack. In real-world applications, one way for the defender to react to an ongoing cyberattack is to block network connections over a certain port. Translated into the present model, the defender has the ability to block a network edge in response to the observed network events. Since there are likely multiple ways from the attacker's starting node 202 to the any target 206 in a realistic computer network 100, blocking one edge will not prevent the attack. However, the defender may improve the future predictions about the attacker's intended target by carefully choosing a single graph edge to block.
Referring now to
In one scenario, the attacker has only reached node s 202, and all attack paths going from the bottom of the graph to the top are possible. Additionally, the attacker chooses the next possible node to attack uniformly at random. At first, the probability distribution over the targets is (1/3,1/3,1/3). Note that the distribution will remain the same if the attacker chooses node b as the next node to attack. However, if the attacker chooses node a, then it is certain that t1 is the target of the attack. If the attacker chooses node c, targets t2 and t3 are two equally likely targets.
Given a probability distribution p over the targets (e.g., as calculated in
In the above example, the entropy which represents the uncertainty over the attack target, given the history of attacked nodes, is computed as follows:
Referring now to
Using the present principles provides a significant improvement in terms of the defender's uncertainty as to the attacker's potential targets. Moreover, by blocking paths in concordance with the probabilistic Stackelberg model of the present principles, the defender is able to further reduce the uncertainty by influencing the attacker's next move. Furthermore, the present principles are scalable, with good performance in attack paths of length 3, 4, and 5—it should be noted that practical attack paths rarely exceed four steps in length. Simulations run with over 1000 nodes and with attack path lengths of five were able to complete in a matter of seconds, making the present principles practical for real-world application.
Referring now to
The defender administration terminal 602 processes the data provided by sensors 604 and determines likely targets for the attacker. The defender terminal 602 also calculates an optimal defender response to reduce the uncertainty in the attacker's targets. Defender terminal 602 includes a processor 608 and memory 610 to collect and utilize the sensor data using network monitor 614. The network monitor 614 collects data from sensors 604 and processes that data from potentially heterogeneous sources into a usable form. The network monitor may, for example, parse logs provided by sensors 604 to find suspicious or abnormal entries. The processor 608 uses the sensor data provided by network monitor 614 and stored in memory 610 to produce the most probable targets and response.
Once the processor 608 calculates an optimal response, the network control module 612 executes the response using a network management interface 606. The network management 606 interface may represent any appropriate form of network management, including for example a simple network management protocol (SNMP) device. In this manner, the defender administration terminal 602 can disconnect links in a network 100, or take similar network-level measures that prevent the attacker to proceed along the chosen network links (e.g., the defender could choose to enable a firewall system on that link, instead of disconnecting the link). The changed network topology forces the attacker along different paths, and the attacker's response to the defender's action substantially reduces the uncertainty regarding the attacker's intentions.
Having described preferred embodiments of a system and method (which are intended to be illustrative and not limiting), it is noted that modifications and variations can be made by persons skilled in the art in light of the above teachings. It is therefore to be understood that changes may be made in the particular embodiments disclosed which are within the scope of the invention as outlined by the appended claims. Having thus described aspects of the invention, with the details and particularity required by the patent laws, what is claimed and desired protected by Letters Patent is set forth in the appended claims.
Number | Name | Date | Kind |
---|---|---|---|
7013395 | Swiler et al. | Mar 2006 | B1 |
7162741 | Eskin et al. | Jan 2007 | B2 |
7530105 | Gilbert et al. | May 2009 | B2 |
7568232 | Mitomo et al. | Jul 2009 | B2 |
7702606 | Heidenreich et al. | Apr 2010 | B1 |
7813739 | Teo et al. | Oct 2010 | B2 |
7917393 | Valdes et al. | Mar 2011 | B2 |
8046835 | Herz | Oct 2011 | B2 |
8108188 | Johnson | Jan 2012 | B2 |
8549641 | Jakobsson | Oct 2013 | B2 |
20020059078 | Valdes et al. | May 2002 | A1 |
20070226796 | Gilbert et al. | Sep 2007 | A1 |
20070240215 | Flores et al. | Oct 2007 | A1 |
20080005555 | Lotem et al. | Jan 2008 | A1 |
20080082380 | Stephenson | Apr 2008 | A1 |
20080109272 | Sheopuri et al. | May 2008 | A1 |
20090099987 | Tambe et al. | Apr 2009 | A1 |
20090119239 | Tambe et al. | May 2009 | A1 |
20090133122 | Koo et al. | May 2009 | A1 |
20090287623 | Martin et al. | Nov 2009 | A1 |
20090307772 | Markham et al. | Dec 2009 | A1 |
20090307777 | He et al. | Dec 2009 | A1 |
20100058456 | Jajodia et al. | Mar 2010 | A1 |
20100117889 | Grube et al. | May 2010 | A1 |
20100235285 | Hoffberg | Sep 2010 | A1 |
20100313021 | Xu et al. | Dec 2010 | A1 |
20110055925 | Jakobsson | Mar 2011 | A1 |
20110061104 | Sarraute Yamada et al. | Mar 2011 | A1 |
20110231937 | Lippmann et al. | Sep 2011 | A1 |
20110288692 | Scott | Nov 2011 | A1 |
20120180126 | Liu et al. | Jul 2012 | A1 |
20120222089 | Whelan et al. | Aug 2012 | A1 |
20130013551 | Mathews et al. | Jan 2013 | A1 |
20130019317 | Whelan et al. | Jan 2013 | A1 |
20130055404 | Khalili | Feb 2013 | A1 |
Entry |
---|
Sun et al. “Towards adaptive anomaly detection in cellular mobile networks”. In 2006 3rd IEEE Consumer Communications and Networking Conference, CCNC 2006, by Sun, Bo, Chen, Zhi, Wang, Ruhai, Yu, Fei, Leung, Victor C. M., 666-670. Institute of Electrical and Electronics Engineers Computer Society, Oct. 9, 2006. |
Dewri, R., et al. “Optimal Security Hardening Using Multi-Objective Optimization on Attack Tree Models of Networks” Proceedings of the 14th ACM Conference on Computer and Communications Security. Oct. 2007. pp. 204-213. |
Kiekintveld, C., et al. “Approximation Methods for Infinite Bayesian Stackelberg Games: Modeling Distributional Payoff Uncertainty” Proceedings of 10th International Conference on Autonomous Agents and Multiagent Systems (AAMAS). May 2011. (8 pages). |
Zhu, M., et al. “Stackelberg-Game Analysis of Correlated Attacks in Cyber-Physical Systems” American Control Conference. Jun. 2011. pp. 4063-4068. |
Zonouz, S., et al. “RRE: A Game-Theoretic Intrusion Response and Recovery Engine” Proceedings of the IEEE/IFIP International Conference on Dependable Systems and Networks (DSN). Jun. 2009. pp. 429-438. |
Non-Final Office Action issued on May 13, 2014 for U.S. Appl. No. 13/487,774. (30 Pages). |
Number | Date | Country | |
---|---|---|---|
20130318615 A1 | Nov 2013 | US |