Field of the Invention
The present invention relates generally to network security and more particularly to detecting malicious software operating in computers and other digital devices.
Related Art
Malicious software, or malware for short, may include any program or file that is harmful by design to a computer. Malware includes computer viruses, worms, Trojan horses, adware, spyware, and any programming that gathers information about a computer or its user or otherwise operates without permission. The owners of the computers are often unaware that these programs have been added to their computers and are often similarly unaware of their function.
Malicious network content is a type of malware distributed over a network via Web sites, e.g., servers operating on a network according to an HTTP standard or other well-known standard. Malicious network content distributed in this manner may be actively downloaded and installed on a computer, without the approval or knowledge of its user, simply by the computer accessing the Web site hosting the malicious network content (the “malicious Web site”). Malicious network content may be embedded within objects associated with Web pages hosted by the malicious Web site. Malicious network content may also enter a computer on receipt or opening of email. For example, email may contain an attachment, such as a PDF document, with embedded malicious executable programs. Furthermore, malicious content may exist in files contained in a computer memory or storage device, having infected those files through any of a variety of attack vectors.
Various processes and devices have been employed to prevent the problems associated with malicious content. For example, computers often run antivirus scanning software that scans a particular computer for viruses and other forms of malware. The scanning typically involves automatic detection of a match between content stored on the computer (or attached media) and a library or database of signatures of known malware. The scanning may be initiated manually or based on a schedule specified by a user or system administrator associated with the particular computer. Unfortunately, by the time malware is detected by the scanning software, some damage on the computer or loss of privacy may have already occurred, and the malware may have propagated from the infected computer to other computers. Additionally, it may take days or weeks for new signatures to be manually created, the scanning signature library updated and received for use by the scanning software, and the new signatures employed in new scans.
Moreover, anti-virus scanning utilities may have limited effectiveness to protect against all exploits by polymorphic malware. Polymorphic malware has the capability to mutate to defeat the signature match process while keeping its original malicious capabilities intact. Signatures generated to identify one form of a polymorphic virus may not match against a mutated form. Thus polymorphic malware is often referred to as a family of virus rather than a single virus, and improved anti-virus techniques to identify such malware families is desirable.
Another type of malware detection solution employs virtual environments to replay content within a sandbox established by virtual machines (VMs) that simulates or mimics a target operating environment. Such solutions monitor the behavior of content during execution to detect anomalies and other activity that may signal the presence of malware. One such system sold by FireEye, Inc., the assignee of the present patent application, employs a two-phase malware detection approach to detect malware contained in network traffic monitored in real-time. In a first or “static” phase, a heuristic is applied to network traffic to identify and filter packets that appear suspicious in that they exhibit characteristics associated with malware. In a second or “dynamic” phase, the suspicious packets (and typically only the suspicious packets) are replayed within one or more virtual machines. For example, if a user is trying to download a file over a network, the file is extracted from the network traffic and analyzed in the virtual machine using an instance of a browser to load the suspicious packets. The results of the analysis constitute monitored behaviors of the suspicious packets, which may indicate that the file should be declared malicious. The two-phase malware detection solution may detect numerous types of malware and, even malware missed by other commercially available approaches. Through its dynamic execution technique, the two-phase malware detection solution may also achieve a significant reduction of false positives relative to such other commercially available approaches. Otherwise, dealing with a large number of false positives in malware detection may needlessly slow or interfere with download of network content or receipt of email, for example. This two-phase approach has even proven successful against many types of polymorphic malware and other forms of advanced persistent threats.
In some instances, malware may take the form of a “bot,” a contraction for software robot. Commonly, in this context, a bot is configured to control activities of a digital device (e.g., a computer) without authorization by the digital device's user. Bot-related activities include bot propagation to attack other computers on a network. Bots commonly propagate by scanning nodes (e.g., computers or other digital devices) available on a network to search for a vulnerable target. When a vulnerable computer is found, the bot may install a copy of itself, and then continue to seek other computers on a network to infect.
A bot may, without the knowledge or authority of the infected computer's user, establish a command and control (CnC) communication channel to send outbound communicates to its master (e.g., a hacker or herder) or a designated surrogate and to receive instructions back. Often the CnC communications are sent over the Internet, and so comply with the Hypertext Transfer Protocol (HTTP) protocol. Bots may receive CnC communication from a centralized bot server or another infected computer (peer to peer). The outbound communications over the CnC channel are often referred to as “callbacks,” and may signify that bots are installed and ready to act. Inbound CnC communications may contain instructions directing the bot to cause the infected computers (i.e., zombies) to participate in organized attacks against one or more computers on a network. For example, bot-infected computers may be directed to ping another computer on a network, such as a bank or government agency, in a denial-of-service attack, often referred to as a distributed denial-of-service attack (DDoS). In other examples, upon receiving instructions, a bot may (a) direct an infected computer to transmit spam across a network; (b) transmit information regarding or stored on the infected host computer; (c) act as a keylogger and record keystrokes on the infected host computer, or (d) search for personal information (such as email addresses contained in an email or a contacts file). This information may be transmitted to one or more other infected computers to the bot's master.
Further enhancement to effective detection of malware callbacks while avoiding false positives is desirable of course, particularly as malware developers continue to create new exploits, including more sophisticated bots and botnets, having potentially serious consequences.
Techniques may automatically detect bots or botnets running in a computer or other digital device by detecting command and control communications, called “call-backs,” from malicious code that has previously gained entry into the digital device. Callbacks are detected using an approach employing both a set of high quality indicators and a set of supplemental indicators. The high quality indicators are selected since they provide a strong correlation with callbacks, and may be sufficient for the techniques to determine that the network outbound communications actually constitute callbacks. If not, the supplemental indicators may be used in conjunction with the high quality indicators to declare the outbound communications as callbacks.
Detecting callbacks as described herein as a keystone of malicious attack and exploit analysis may permit embodiments of the invention to detect disparate forms of malware, and even families of polymorphic virus that use the same communication mechanisms to obtain instructions and other communications in furtherance of their nefarious purposes.
The invention will be more fully understood with reference to the following detailed description in conjunction with the drawings, of which:
Generally speaking, a bot is a type of (or part of) an active infiltration attack, often installing or operating in a two-step process. The first step is the initial infection, which may be a typically small package of malicious code (malware) whose function is to compromise the infected device. The second step involves that malware obtaining instructions as to malicious activity it is to perform, including possibly downloading additional malware, e.g., over the Internet or sending messages or data from the infected computer. This second step often involves establishing a CnC channel over which it may send a message providing its status or requesting CnC communications (instructions). This is called a “callback,” and the exchange of such communications may be referred to as callback activity.
The CnC may use an undocumented entry point, or subvert and use a documented entry point to request instructions over a CnC channel, which are often transmitted over the same or other channels. Often the CnC channel is established via a non-standard port provided by the operating system of the infected device. In so doing, the bot bypasses normal security and authentication mechanisms, and thereby achieves unauthorized egress from the computer. A hallmark of bots is the manner by which they utilize egress points for callbacks is designed to remain undetected to the digital device's user and system/network administrators.
Embodiments of the invention provide a computer implemented method for detecting callbacks from malicious code in network communications. The method includes generating a set of high quality indicators and a set of supplemental indicators associated with each of the network communications. The high quality indicators have a strong correlation with callbacks, and the supplemental indicators having a lower correlation with callbacks than the high quality indicators. The method also includes classifying each of the network communications as to whether each constitutes a callback using the high quality indicators if sufficient to determine that the associated network communication constitutes a callback, and, otherwise, using the supplemental indicators in conjunction with the high quality indicators.
In some embodiments, the method may be practiced to generate the high quality indicators, which may entail the steps of (i) extracting at least one of a destination URL, destination IP address, and destination domain from each network communication; (ii) determining a reputation indicator associated with each of the at least one destination URL, destination IP address, and destination domain; and, thereupon, (ii) including each of the reputation indicators in the set of high quality indicators used to classify the network communication. For performing the classification, these embodiments may also assign a weight and a score to each of the reputation indicators.
In some embodiments, the method may be practiced to generate the supplemental indicators, which may entail the steps of (i) inspecting packet headers of each of the network communications to identify one or more protocol anomalies; and (ii) evaluating each of the identified protocol anomalies by assigning a weight to each reflecting its correlation with callback activity and non-callback activity, as well as an overall score(s) for the supplemental indicators.
In another aspect of these embodiments, a malware name may be identified and associated with discovered callbacks. This entails forming a malware marker from each network communication constituting a callback; and performing a database look-up using the malware marker to identify a malware name associated therewith. The malware name so identified may (i) have the same malware marker as the callback, in which case these embodiments may declare the callback by that name; (ii) have a high correlation with the malware marker but not the same malware marker, in which case these embodiments may classify the callback as associated with a family related to the malware name; (iii) not have a high correlation with any malware name in the database, in which case these embodiments may declare that a new malware has been discovered.
While the foregoing description pertains to embodiments of the invention practicing a computer implemented method, embodiments may constitute systems, apparatus, or computer program products as well, as will be apparent from the following description.
Throughout this specification, reference is made to HTTP, communications, protocols and protocol anomalies. HTTP is an application layer protocol widely used for data communications for the World Wide Web. The Request for Comment (RFC) 2616: Hypertext Transfer Protocol—HTTP/1.1 specification sets out the semantics and other requirements for HTTP communications. HTTP resources are identified and located on a network by Uniform Resource Locators (URLs). Employing a client-server computing model, HTTP provides data communication for example between one or more Web browsers running on computers or other electronic devices constituting the clients, and an application running on a computer or other electronic device hosting a Web site constituting the server. HTTP is a request-response protocol. For example, a user clicks on a link on their Web browser, which sends a request over the Internet to a web server hosting the Web site identified in the request. The server may then send back a response containing the contents of that site, including perhaps text and images for display by the user's browser.
The HTTP specification defines fields of HTTP headers, which are components of HTTP messages used in both requests and responses, and define the operating parameters of an HTTP communication or transaction. The header fields are transmitted after the request or response line, which is the first line of a message. As noted, the HTTP semantics are well defined, for example: Header fields are colon-separated, name-value pairs in clear-text string format. Each field is terminated by a carriage return (CR) and line feed (LF) character sequence. The end of the header fields is indicated by an empty field, resulting in the transmission of two consecutive CR-LF pairs. Variations from the specified semantics constitute anomalies. Also, the HTTP specification allows users to define their own fields and content, though often practice and convention dictate how those fields are used and what content may be expected. Variations from those conventions may also be deemed anomalies. Finally, sometimes malware authors will insert content into the fields, such as malware names or other tell tail malware descriptors or indicators, which serve as strong evidence of malicious activity. These too will be deemed anomalies for purposes of this specification.
For communication, an HTTP header is added to an HTTP message, and placed in a TCP/UDP message (sometimes more than one TCP/UDP message per HTTP message), which, in turn, is encapsulated (as payload) in an IP Datagram, which is encapsulated (as payload) in a Layer 2 Frame, which is sent as a signal over the transmission medium as a string of binary numbers. Each Layer 2 Frame has, in order, a Layer 2 header, an IP header, a TCP or UDP header, a HTTP header, HTTP data, etc., and finally a Layer 2 footer. Taking this explanation one step further, the IP layer includes in its header the information necessary for the packet to find its way to its final destination. More specifically, for computer-to-computer communication across networks, a source device forms packets for transmission by placing the IP address of the destination computer in the IP header of each packet involved in a communication session. The data packets are encapsulated as noted above and placed on the network and routed across the network to the destination having the specified IP address. In this specification, reference will be made to “packets,” which shall be used in a broad sense to include, without limitation, messages, datagrams, frames and, of course, packets, unless the context requires otherwise. Accordingly, packet capture techniques may yield the HTTP header, IP address of the destination of an IP packet as well as domain identifiers from the URL of HTTP headers included in the IP packets.
Callback Detection and Analysis System
The network interface 102 is configured to receive “outbound” communications, such as communications containing HTTP packets, sent from one or more computing devices. The network interface 102 may include a network tap 103 adapted to make a copy of the outbound communications, as further described hereinbelow.
The pre-processor or pre-processing engine 104 is configured to receive the outbound communications, or in some embodiments a copy thereof, and to inspect the outbound communications to determine whether they should be submitted for further analysis by the analyzer 108.
The analyzer or analyzing engine 108 is configured to perform an analysis on the outbound communications received from the pre-processor 104. The analysis may take the form of static analysis, as opposed to dynamic analysis involving execution as may be carried out in a virtual environment as described hereinbelow. The analyzer 108 includes a recommender 110 and a supplemental influencer generator 112. The purpose and operation of these two components will be described at some length.
The classifier or classification engine 114 is configured to receive the results generated by both the recommender 110 and the supplemental influencer generator 112 for the purpose of classifying whether each of the outbound communications constitutes a command and control communication of a malicious nature. The classifier 114 uses both a set of high quality indicators and a set of supplemental indicators for assessing each outbound communication. The high quality indicators provide a strong correlation between outbound communications and callbacks, and may be sufficient for the techniques to determine that the outbound communications constitute callbacks. If not, the supplemental indicators may be used in conjunction with the high quality indicators to declare the outbound communications as callbacks. The classifier assigns scores to the high quality indicators and supplemental indicators, and uses the scores in ascertaining whether to classify each outbound communication as constituting a callback.
The report generator or reporting engine 116 is configured to generate an alert and in some embodiments also a detailed report based on the output results of the analyzer 108 and classifier 114. It also may generate a set of generic indicators (high quality and supplemental) which can be used to detect similar callbacks in future. In some embodiments, the alert and/or report may include a common name or label of a malware identified by the report generator 116. The furnished name is selected based on it having a high correlation with the associated outbound communication. In other words, the outbound communication may have characteristics associated with a known malware, the report generator 116 will discover the known malware name, and the alert and/or report will present its name to guide actions to be taken, e.g., of a remedial nature. In some cases, the callback detection and analysis system 100 will have discovered such a strong correlation with characteristics of a known malware that the communication will be deemed associated with that same malware; and, in other cases, the callback detection and analysis system 100 will have discovered a sufficiently high correlation with the known malware that the communication will be deemed a member of the same family as the named malware.
The user interface 118 is configured for providing the alert and/or report from the report generator 116, e.g., to a user or administrator. The administrator may be a network administrator or a security operations technician responsible for dealing with exploits.
High quality indicators represent features or characteristics of the outbound communications that have high probative value in classifying whether the outbound communications are command and control communications. Consequently, when identified for the outbound communications, the high quality indicators have a high correlation with those associated with command and control communications. For example, the high quality indicators may include negative reputation of the domains, URLs, or IP addresses associated with the outbound communications.
Supplemental indicators represent features or characteristics of the outbound communications that have lower probative value (compared with the high quality indicators) in classifying whether the outbound communications are command and control communications. Consequently, when identified for the outbound communications, the supplemental indicators have lower (though positive) correlations with those associated with command and control communications. For example, the supplemental indicators may include select protocol anomalies in the outbound communications.
Returning to
Then, in step 166, logic generates an alert and/or report providing details regarding the outbound communications, including whether the outbound communications constitute command and control communications associated with malware. In some cases, the alert and/or report may also provide a name or label associated with the malware.
The description of embodiments of the invention will next deal with certain terms of art, for which a short digression may aid understanding. As is well known in the art, the term “domain” or “domain name” refers to a collection or string of characters that uniquely signify a domain within the Internet. A domain name is a significant part of a URL (short for “Uniform Resource Locator), an Internet address used by Web browsers to locate a resource on the Internet. The resource can be any type of file stored on a server, such as a Web page, a text file, a graphics file, a video file or an application program.
As is also well known, a URL contains at least three elements: (i) the type of protocol used to access the file (e.g., HTTP for a Web page); the domain name or IP address of the server where the file resides; and, optionally, the pathname to the file (i.e., a description of the file's location). For example, the URL given by http://www.acme.com/patent instructs a browser to use the HTTP protocol, go to the “www.acme.abc.com” web server to access the file named “patent”. The domain name itself is structured hierarchically, with the top level domain (or “TLD”) in this example being “.com”. Other commonly used TLDs include .net and .org. In addition to these, there are TLDs for countries such as .US, .AU and .UK. There are also TLDs for schools, the military and government agencies, namely, .edu, .mil and .gov. The term “Second level domain” refers to the string immediately to the left of that dot. In the above example, the second level domain is “acme”. Third level domain in this example refers to “www”. Often, the domain names will specify well-known company names; or perhaps it is better described as domains encapsulate or refer to host names, and the host names often correspond to company names. Consider the example: www.google.com. The second level domain here is “google”, a domain registration currently owned by Google, Inc. Consequently, as can be understood from the above examples, a URL can usually be parsed to indicate at least some of the following: a host name, a host's IP address, a country, a company name and an organization's name.
Returning to the figures,
The similarity detector 204 (sometimes referred to as a duplicity checker) is configured to determine whether the callback detecting and analyzing system 100 (
The pre-filter or pre-filtering engine 206 is configured to obtain the domain name from the outbound communication, to access a database stored in repository 210 of “whitelisted” domains and determine whether the domain of the current outbound communication matches any of the entries of whitelisted domains. The whitelist of domains is a collection of domains believed to be “safe,” i.e., free of malware. Safe domains may include those of well-known companies, organizations, schools, and government agencies and departments. Lists of such safe domains are commercially available, publically available on the Internet, or may be compiled for these purposes through various means.
The pre-processor 200 generates communication candidates deserving of further analysis. Those that have already been processed in earlier testing and found to be either malware or safe, as determined by the similarity detector 204, need not be further analyzed. Similarly, those that correspond to any of the whitelisted domains, as determined by the pre-filter 206, need not be further analyzed.
The HQI generator 302 is configured to select and store indicators discovered in the outbound communications under test having a high correlation with command and control communications. The HQI generator 302 includes a reputation checker 310, an “other” strong indicator (OSI) detector 312, and an indicator repository 314. The reputation checker 310 is configured to check the reputation of, e.g., the domain, IP address, or URL, or a combination of two or more of the foregoing, as extracted by the extractor 202 (
The HQI evaluator or evaluation engine 304 is configured to assign weights and scores to the discovered HQI, and pass the scores to the classifier 114 (
In step 354, logic generates HQI based on the received IP addresses. The IP address-based HQI may include, for example, indicators based on information regarding reputation, etc. For example, for purposes of generating HQI related to reputation, the logic looks-up the received IP address, if available, in the indicator database to obtain information specifying a reputation associated therewith, if such information is available, and generates an IP address reputation indicator based on the information obtained from the indicator database.
In step 356, logic generates HQI based on the received domain. The domain-based HQI may include, for example, indicators based on, for example, information regarding reputation, information from a publically available database, such as the database called WHOIS, information regarding TLD's, information regarding traffic rates or rank for that domain, etc. For example, for purposes of generating HQI related to reputation, the logic looks-up the received domain, if available, in the indicator database to obtain information specifying a reputation associated therewith, if such information is available, and generates a domain reputation indicator based on the information obtained from the indicator database.
In step 358, logic generates HQI based on a received URL. The URL-based HQI may include, for example, indicators based on information regarding reputation, number of parameters in the headers, name of each parameter, etc. For example, for purposes of generating HQI related to reputation, the logic looks-up the received URL in the indicator database to obtain information specifying a reputation associated therewith, if such information is available, and generates an URL reputation indicator based on the information obtained from the indicator database. In step 362, logic evaluates the generated HQI, assigns weights to each, develops an overall HQI store for the outbound communication and stores the HQI stores. Then, the logic provides the HQI scores to the classifier 114 (
The SI evaluator or evaluation engine 410 is configured to assign weights and scores to the discovered SI, and pass the scores to the classifier 114 (
A further word must be added regarding the high quality indicators and supplemental indicators. Since malware evolves as malware writers devise alternative exploits and seek to evade detection, the indicators used in embodiments of the invention will likely also evolve. Certain indicators may be regarded as HQI and will need to later be used as SI, or vice versa. Indeed, certain indicators used for HQI or SI may need to be dropped in their entirety in the future, and other indicators may take their place. Accordingly, the indicators described herein as usefully employed by the various embodiments should be regarded as examples.
Consequently, it can now be understood that the high quality indicators may be used alone to determine whether or not the outbound communication constitutes a callback, but, even if they fail to indicate that a callback is present, the supplemental indicators may be used to influence the classification or decision. Clearly, the HQI and SI can now be seen as aptly named.
At this point, it is worth emphasizing, discovery that the outbound communication constitutes a callback indicates that the source of the outbound communication is infected with malware, such as a bot, and this may be a serious condition requiring immediate attention.
Returning to
Reputation
It can now be seen that aspects of the foregoing embodiments relate to assessing reputation information based on URL, IP address, and/or domain metadata. Features that may provide an indication of reputation may include:
Additionally, embodiments may involve using information or indicia of a reputation based at least in part on the corporate or business identity associated with the URL, domain or IP address. The corporate reputation may be based at least in part on one or more of the following: Better Business Bureau rating, and ranking of the corporation (e.g., in the Fortune 1000, Fortune 500, Fortune 100), corporate address, how long the company has been in existence, how long its Web site has been in existence, whether the corporation has an IP address in a range of addresses with a poor reputation, whether the corporation is associated with spamming or a spammer, Web site popularity rank, etc.
The foregoing reputation information may be collected in a database and/or be available through reputation Web sites, such as those associated with Better Business Bureau online, TrustE, P3P, Hackersafe certification, Fortune 1000, Hoovers, D&B, Yellow Pages, DMOZ/The Open Directory Project, Yahoo, credit card certified online merchants, or the like. Further information of reputation indicia may be had with reference to United States Patent Application 2013/0014020, filed Sep. 15, 2012, and entitled “Indicating Web site Reputations during Web site Manipulation of User Information,” whose disclosure is incorporated herein by reference.
Databases and Machine Learning
The databases stored in various repositories described above may store data of a dynamic nature, which is subject to change as more information is obtained regarding malware, for instance. Various databases are described above as having data that may be developed using principles of machine learning. Machine learning refers to a process or system that can learn from data, i.e., be trained to distinguish between “good” and “bad”, or in this case, between malicious and non-malicious, and classify samples under test accordingly or develop indicators having a high correlation to those that are malicious. Core principals of machine learning deal with representation and generalization, that is, representation of data instances (e.g., reputation or anomaly information), and functions performed on those instances (e.g., weighting and scoring). Generalization is the property that the process or system uses to apply what it learns on a learning set of known (or “labeled”) data instances to unknown (or “unlabeled”) examples. To do this, the process or system must extract learning from the labeled set that allows it to make useful predictions in new and unlabeled cases. For example, weighting of indicators (e.g., reputation or anomalies), as practiced in some embodiments described above, may entail machine learning to assure proper weights are assigned to the appropriate indicators of a current outbound communication to reflect their correlation with known malware. The data for assigning the weights may need to be updated from time to time, whether on an aperiodic or periodic basis, e.g., every three or six months, to reflect changes in malware then identified. Similarly, the data used for scoring as described above, may also need to be updated from time to time for the same reason. One way of updating the data, in either case, is to use machine learning, for example, in a malware forensic lab, to develop the appropriate data to adjust the weights and scores, and, for that matter, the thresholds and databases used in the described embodiments.
Controller Architecture
The memory 720 may include, for example, RAM and/or ROM. The storage system 730 also permanently or temporarily stores data. The storage system 730 may include, for example, one or more hard drives and/or flash drives, or other form of mass storage. The storage in memory 720 and storage 730 is not to be regarded as being transitory in nature. The repositories 130 (
The controller 700 may also have a communication network interface 740, an input/output (I/O) interface 750, and a user interface 760. The communication network interface 740 may be coupled with a communication network 772 via a communication medium 770. The communications network interface 740 may communicate with other digital devices (not shown) via the communications medium 770. The communication interface 740 may include a network tap 940 (
The I/O interface 750 may include any device that can receive input from or provide output to a user. The I/O interface 750 may include, but is not limited to, a flash drive, a compact disc (CD) drive, a digital versatile disc (DVD) drive, or other type of I/O peripheral (not separately shown). The user interface 760 may include, but is not limited to a keyboard, mouse, touchscreen, keypad, biosensor, display monitor or other human-machine interface (not separately shown) to allow a user to control the controller 700. The display monitor may include a screen on which is provided a command line interface or graphical user interface.
In various embodiments of the invention, a number of different controllers (for example, each of a type as illustrated and described for controller 700 may be used to implement the invention. For example, a separate controllers may be used for each of the pre-processor 104, analyzer 108, classifier 114, and report generator 116 of
In some embodiments, a malware detection system or station (see
Computer System with Malicious Content Detection System
Referring to
Herein, according to this embodiment of the invention, first MCD system 8101 is an electronic device that is adapted to (i) intercept data traffic that is routed over a public communication network 830 or a private communication network 845 between at least one server device 840 and at least one client device 850 and (ii) monitor, in real-time, content within the data traffic. For purposes of detecting callbacks in the data traffic, the MCD system 8101 intercepts and monitors data traffic outbound via private network 845 from at least one client device 850. For purposes of detecting malicious content headed to the at least one client device 850, the MCD system 8101 intercepts and monitors ingress traffic en route via public network 830 (e.g., the Internet) to the at least one client device 850.
More specifically, first MCD system 8101 may be configured to inspect content received via communication network 830, 845 and identify malware using at least two approaches. The first MCD system 8101 may implement the method described above in conjunction with
As noted, the communication network 830 may include a public computer network such as the Internet, in which case an optional firewall 855 (represented by dashed lines) may be interposed between communication network 830 and client device 850. Alternatively, the communication network 830 may be a private computer network such as a wireless telecommunication network, wide area network, or local area network, or a combination of networks. Likewise, the private network 845 may be a private computer network such as a wireless telecommunication network, wide area network, or local area network, or a combination of networks.
The first MCD system 8101 is shown as being coupled with the communication network 830 (behind the firewall 855) and with private network 845 via a network interface 860. The network interface 860 operates as a data capturing device (referred to as a “tap” or “network tap”) that is configured to receive data traffic propagating to/from the client device 850 and provide content from the data traffic to the first MCD system 8101. In general, the network interface 860 receives and copies the content that is received from and provided to client device 850 normally without an appreciable decline in performance by the server device 840, the client device 850, or the communication network 830. The network interface 860 may copy any portion of the content, for example, any number of data packets.
It is contemplated that, for any embodiments where the first MCD system 8101 is implemented as an dedicated appliance or a dedicated computer system, the network interface 860 may include an assembly integrated into the appliance or computer system that includes network ports, network interface card and related logic (not shown) for connecting to the communication networks 830 and 845 to non-disruptively “tap” data traffic and provide a copy of the data traffic to the heuristic module 870. In other embodiments, the network interface 860 can be integrated into an intermediary device in the communication path (e.g. firewall 855, router, switch or other network device) or can be a standalone component, such as an appropriate commercially available network tap. In some embodiments, also, the network interface 860 may be contained within the first MCD system 8101. In virtual environments, a virtual tap (vTAP) can be used to copy traffic from virtual networks.
Referring still to
The first MCD system 8101 may also include components for detecting malware in a two-stage malware detection approach, including a static analysis employing heuristics and a dynamic analysis employing replaying (i.e., executing) the network traffic while observing its behavior to detect malware. For this, the first MCD system 8101 includes a heuristic engine 870, a heuristics database 875, a scheduler 880, a storage device 885, an analysis engine 890 and a reporting module 895. Also, heuristic engine 870, scheduler 880 and/or analysis engine 890 may be software modules executed by a processor that receives the suspicious content, performs malware analysis and is adapted to access one or more non-transitory storage mediums operating as heuristic database 875, storage device 885 and/or reporting module 895. In some embodiments, the heuristic engine 870 may be one or more software modules executed by a processor, and the scheduler 880 and the analysis engine 890 may be one or more software modules executed by a different processor, where the two processors are possibly located at geographically remote locations, and communicatively coupled for example via a network.
In general, the heuristic engine 870 serves as a filter to permit subsequent malware analysis only on a portion of incoming content, which effectively conserves system resources and provides faster response time in determining the presence of malware within analyzed content. As illustrated in
Thereafter, according to one embodiment of the invention, the heuristic module 870 may be adapted to transmit at least a portion of the metadata or attributes of the suspicious content, which identify attributes of the client device 850, to the analysis engine 890 for dynamic analysis. Such metadata or attributes are used to identify the VM instance needed for subsequent malware analysis. For instance, the analysis engine 890 may be adapted to use the metadata to identify the desired software profile. Alternatively, the analysis engine 890 may be adapted to receive one or more data packets from the heuristic engine 870 and analyze the packets to identify the appropriate software profile. In yet other embodiment of the disclosure, the scheduler 880 may be adapted to receive software profile information, in the form of metadata or data packets, from the network interface 860 or from the heuristic module 870 directly.
The scheduler 880 may retrieve and configure a VM instance to mimic the pertinent performance characteristics of the client device 850. In one example, the scheduler 880 may be adapted to configure the characteristics of the VM instance to mimic only those features of the client device 850 that are affected by the data traffic copied by the network interface 860. The scheduler 880 may determine the features of the client device 850 that are affected by the content by receiving and analyzing the data traffic from the network interface 860. Such features of the client device 850 may include ports that are to receive the content, certain device drivers that are to respond to the content, and any other devices coupled to or contained within the client device 850 that can respond to the content. Alternatively, the heuristic engine 870 may determine the features of the client device 850 that are affected by the data traffic by receiving and analyzing the content from the network interface 860. The heuristic engine 870 may then transmit the features of the client device to the scheduler 880 and/or analysis engine 890.
The storage device 885 may be configured to store one or more VM disk files forming a VM profile database, where each VM disk file is directed to a different software profile for a VM instance. In one example, the VM profile database may store a VM disk file associated with a single VM instance that can be configured by the scheduler 880 to mimic the performance of a client device 850 on the communication network 830. Alternatively, as shown in
The analysis engine 890 is adapted to execute multiple VM instances to simulate the receipt and/or execution of different data flows of “suspicious” content by the client device 850 as well as different operating environments. Furthermore, the analysis engine 890 analyzes the effects of such content upon the client device 850. The analysis engine 890 may identify the effects of malware by analyzing the simulation of the effects of the content upon the client device 850 that is carried out on each VM instance. Such effects may include unusual network transmissions, unusual changes in performance, and the like. This detection process is referred to as a dynamic malicious content detection.
The analysis engine 890 may flag the suspicious content as malware according to the observed behavior of the VM instance. The reporting module 895 may issue alerts indicating the presence of malware, and using pointers and other reference information, identify what message(s) (e.g. packet(s)) of the “suspicious” content may contain malware. Additionally, the server device 840 may be added to a list of malicious network content providers, and future network transmissions originating from the server device 840 may be blocked from reaching their intended destinations, e.g., by firewall 855.
The embodiments discussed herein are illustrative. As these embodiments are described with reference to illustrations, various modifications or adaptations of the methods and/or specific structures described may become apparent to those skilled in the art. For example, aspects of the embodiments may be performed by executable software, such as a program or operating system. For example, embodiments of the local analyzer may be implemented in an operating system. Of course, the operating system may incorporate other aspects instead of or in addition to that just described, as will be appreciated in light of the description contained in this specification. Similarly, a utility or other computer program executed on a server or other computer system may also implement the local analyzer or other aspects. Noteworthy, these embodiments need not employ a virtual environment, but rather test for callback activity during normal execution of the operating system, utility or program within a computer system.
It should be understood that the operations performed by the above-described illustrative embodiments are purely exemplary and imply no particular order unless explicitly required. Further, the operations may be used in any sequence when appropriate and may be partially used. Embodiments may employ various computer-implemented operations involving data stored in computer systems. These operations include physical manipulation of physical quantities. Usually, though not necessarily, these quantities take the form of electrical, magnetic, or optical signals capable of being stored, transferred, combined, compared, and otherwise manipulated.
Any of the operations described herein are useful machine operations. The present invention also relates to a device or an apparatus for performing these operations. The apparatus may be specially constructed for the required purpose, or the apparatus may be a general-purpose computer selectively activated or configured by a computer program stored in the computer. In particular, various general-purpose machines may be used with computer programs written in accordance with the teachings herein, or it may be more convenient to construct a more specialized apparatus to perform the required operations, or multiple apparatus each performing a portion of the operations. Where apparatus or components of apparatus are described herein as being coupled or connected to other apparatus or other components, the connection may be direct or indirect, unless the context requires otherwise.
The present invention may be embodied as computer readable code on a computer readable medium. The computer readable medium is any data storage device that can store data, which can be thereafter read by a computer system. Examples of the computer readable medium include hard drives, flash drives, read-only memory, random-access memory, CD-ROMs, CD-Rs, CD-RWs, magnetic tapes, and other optical and non-optical data storage devices. The computer readable medium can also be distributed over a network-coupled computer system so that the computer readable code is stored and executed in a distributed fashion. The computer readable medium can also be distributed using a switching fabric, such as used in compute farms.
The terms “logic”, “module”, “engine” and “unit” are representative of hardware, firmware or software that is configured to perform one or more functions. As hardware, these components may include circuitry such as processing circuitry (e.g., a microprocessor, one or more processor cores, a programmable gate array, a microcontroller, an application specific integrated circuit, etc.), receiver, transmitter and/or transceiver circuitry, semiconductor memory, combinatorial logic, or other types of electronic components. When implemented in software, the logic, modules, engines, and units may be in the form of one or more software modules, such as executable code in the form of an executable application, an operating system, an application programming interface (API), a subroutine, a function, a procedure, an applet, a servlet, a routine, source code, object code, a script, a shared library/dynamic load library, or one or more instructions. These software modules may be stored in any type of a suitable non-transitory storage medium, or transitory storage medium (e.g., electrical, optical, acoustical or other form of propagated signals such as carrier waves, infrared signals, or digital signals). Examples of non-transitory storage medium may include, but are not limited or restricted to a programmable circuit; a semiconductor memory; non-persistent storage such as volatile memory (e.g., any type of random access memory “RAM”); persistent storage such as non-volatile memory (e.g., read-only memory “ROM”, power-backed RAM, flash memory, phase-change memory, etc.), a solid-state drive, hard disk drive, an optical disc drive, or a portable memory device. As firmware, the executable code is stored in persistent storage. Software is operational when executed by processing circuitry. Execution may be in the form of direct execution, emulation, or interpretation.
The term “computerized” generally represents that any corresponding operations are conducted by hardware in combination with software and/or firmware.
Lastly, the terms “or” and “and/or” as used herein are to be interpreted as inclusive or meaning any one or any combination. Therefore, “A, B or C” or “A, B and/or C” mean “any of the following: A; B; C; A and B; A and C; B and C; A, B and C.” An exception to this definition will occur only when a combination of elements, functions, steps or acts are in some way inherently mutually exclusive.
It will be appreciated by those of ordinary skill in the art that modifications to and variations of the above-described embodiments of a system and method of detecting callbacks and associated malware may be made without departing from the inventive concepts disclosed herein. Accordingly, the specification and drawings are to be regarded as illustrative rather than restrictive, and the invention should not be viewed as limited except as by the scope and spirit of the appended claims. It will be recognized that the terms “comprising,” “including,” and “having,” as used herein, are specifically intended to be read as open-ended terms of art.
Number | Name | Date | Kind |
---|---|---|---|
4292580 | Ott et al. | Sep 1981 | A |
5175732 | Hendel et al. | Dec 1992 | A |
5440723 | Arnold et al. | Aug 1995 | A |
5657473 | Killean et al. | Aug 1997 | A |
5842002 | Schnurer et al. | Nov 1998 | A |
5978917 | Chi | Nov 1999 | A |
6088803 | Tso et al. | Jul 2000 | A |
6094677 | Capek et al. | Jul 2000 | A |
6269330 | Cidon et al. | Jul 2001 | B1 |
6279113 | Vaidya | Aug 2001 | B1 |
6298445 | Shostack | Oct 2001 | B1 |
6357008 | Nachenberg | Mar 2002 | B1 |
6424627 | Sorhaug et al. | Jul 2002 | B1 |
6484315 | Ziese | Nov 2002 | B1 |
6487666 | Shanklin et al. | Nov 2002 | B1 |
6493756 | O'Brien et al. | Dec 2002 | B1 |
6550012 | Villa et al. | Apr 2003 | B1 |
6775657 | Baker | Aug 2004 | B1 |
6832367 | Choi et al. | Dec 2004 | B1 |
6895550 | Kanchirayappa et al. | May 2005 | B2 |
6898632 | Gordy et al. | May 2005 | B2 |
6907396 | Muttik et al. | Jun 2005 | B1 |
6981279 | Arnold et al. | Dec 2005 | B1 |
7007107 | Ivchenko et al. | Feb 2006 | B1 |
7028179 | Anderson et al. | Apr 2006 | B2 |
7043757 | Hoefelmeyer et al. | May 2006 | B2 |
7069316 | Gryaznov | Jun 2006 | B1 |
7080408 | Pak et al. | Jul 2006 | B1 |
7093002 | Wolff et al. | Aug 2006 | B2 |
7093239 | van der Made | Aug 2006 | B1 |
7100201 | Izatt | Aug 2006 | B2 |
7159149 | Spiegel et al. | Jan 2007 | B2 |
7231667 | Jordan | Jun 2007 | B2 |
7240364 | Branscomb et al. | Jul 2007 | B1 |
7240368 | Roesch et al. | Jul 2007 | B1 |
7287278 | Liang | Oct 2007 | B2 |
7308716 | Danford et al. | Dec 2007 | B2 |
7356736 | Natvig | Apr 2008 | B2 |
7386888 | Liang et al. | Jun 2008 | B2 |
7392542 | Bucher | Jun 2008 | B2 |
7418729 | Szor | Aug 2008 | B2 |
7428300 | Drew et al. | Sep 2008 | B1 |
7441272 | Durham et al. | Oct 2008 | B2 |
7448084 | Apap et al. | Nov 2008 | B1 |
7458098 | Judge et al. | Nov 2008 | B2 |
7464404 | Carpenter et al. | Dec 2008 | B2 |
7464407 | Nakae et al. | Dec 2008 | B2 |
7467408 | O'Toole, Jr. | Dec 2008 | B1 |
7480773 | Reed | Jan 2009 | B1 |
7487543 | Arnold et al. | Feb 2009 | B2 |
7496960 | Chen et al. | Feb 2009 | B1 |
7496961 | Zimmer et al. | Feb 2009 | B2 |
7519990 | Xie | Apr 2009 | B1 |
7523493 | Liang et al. | Apr 2009 | B2 |
7530104 | Thrower et al. | May 2009 | B1 |
7540025 | Tzadikario | May 2009 | B2 |
7565550 | Liang et al. | Jul 2009 | B2 |
7603715 | Costa et al. | Oct 2009 | B2 |
7607171 | Marsden et al. | Oct 2009 | B1 |
7639714 | Stolfo et al. | Dec 2009 | B2 |
7644441 | Schmid et al. | Jan 2010 | B2 |
7676841 | Sobchuk et al. | Mar 2010 | B2 |
7698548 | Shelest et al. | Apr 2010 | B2 |
7707633 | Danford et al. | Apr 2010 | B2 |
7779463 | Stolfo et al. | Aug 2010 | B2 |
7784097 | Stolfo et al. | Aug 2010 | B1 |
7832008 | Kraemer | Nov 2010 | B1 |
7849506 | Dansey et al. | Dec 2010 | B1 |
7869073 | Oshima | Jan 2011 | B2 |
7877803 | Enstone et al. | Jan 2011 | B2 |
7904959 | Sidiroglou et al. | Mar 2011 | B2 |
7908660 | Bahl | Mar 2011 | B2 |
7930738 | Petersen | Apr 2011 | B1 |
7937761 | Bennett | May 2011 | B1 |
7996556 | Raghavan et al. | Aug 2011 | B2 |
7996836 | McCorkendale et al. | Aug 2011 | B1 |
7996905 | Arnold et al. | Aug 2011 | B2 |
8006305 | Aziz | Aug 2011 | B2 |
8010667 | Zhang et al. | Aug 2011 | B2 |
8020206 | Hubbard et al. | Sep 2011 | B2 |
8028338 | Schneider et al. | Sep 2011 | B1 |
8045094 | Teragawa | Oct 2011 | B2 |
8045458 | Alperovitch et al. | Oct 2011 | B2 |
8069484 | McMillan et al. | Nov 2011 | B2 |
8087086 | Lai et al. | Dec 2011 | B1 |
8171553 | Aziz et al. | May 2012 | B2 |
8201246 | Wu et al. | Jun 2012 | B1 |
8204984 | Aziz et al. | Jun 2012 | B1 |
8220055 | Kennedy | Jul 2012 | B1 |
8225288 | Miller et al. | Jul 2012 | B2 |
8225373 | Kraemer | Jul 2012 | B2 |
8233882 | Rogel | Jul 2012 | B2 |
8234709 | Viljoen et al. | Jul 2012 | B2 |
8239944 | Nachenberg et al. | Aug 2012 | B1 |
8286251 | Eker et al. | Oct 2012 | B2 |
8291499 | Aziz et al. | Oct 2012 | B2 |
8307435 | Mann et al. | Nov 2012 | B1 |
8307443 | Wang et al. | Nov 2012 | B2 |
8312545 | Tuvell et al. | Nov 2012 | B2 |
8321936 | Green et al. | Nov 2012 | B1 |
8321941 | Tuvell et al. | Nov 2012 | B2 |
8365286 | Poston | Jan 2013 | B2 |
8370938 | Daswani et al. | Feb 2013 | B1 |
8370939 | Zaitsev et al. | Feb 2013 | B2 |
8375444 | Aziz et al. | Feb 2013 | B2 |
8381299 | Stolfo et al. | Feb 2013 | B2 |
8402529 | Green et al. | Mar 2013 | B1 |
8510827 | Leake et al. | Aug 2013 | B1 |
8510842 | Amit et al. | Aug 2013 | B2 |
8516593 | Aziz | Aug 2013 | B2 |
8528086 | Aziz | Sep 2013 | B1 |
8539582 | Aziz et al. | Sep 2013 | B1 |
8549638 | Aziz | Oct 2013 | B2 |
8561177 | Aziz et al. | Oct 2013 | B1 |
8566946 | Aziz et al. | Oct 2013 | B1 |
8584094 | Dadhia et al. | Nov 2013 | B2 |
8584234 | Sobel et al. | Nov 2013 | B1 |
8584239 | Aziz et al. | Nov 2013 | B2 |
8595834 | Xie et al. | Nov 2013 | B2 |
8627476 | Satish et al. | Jan 2014 | B1 |
8635696 | Aziz | Jan 2014 | B1 |
8713681 | Silberman et al. | Apr 2014 | B2 |
8850571 | Staniford et al. | Sep 2014 | B2 |
8881282 | Aziz et al. | Nov 2014 | B1 |
8898788 | Aziz et al. | Nov 2014 | B1 |
8935779 | Manni et al. | Jan 2015 | B2 |
8984638 | Aziz et al. | Mar 2015 | B1 |
8990939 | Staniford et al. | Mar 2015 | B2 |
8990944 | Singh et al. | Mar 2015 | B1 |
8997219 | Staniford et al. | Mar 2015 | B2 |
9009822 | Ismael et al. | Apr 2015 | B1 |
9009823 | Ismael et al. | Apr 2015 | B1 |
9027135 | Aziz | May 2015 | B1 |
9071638 | Aziz et al. | Jun 2015 | B1 |
9104867 | Thioux et al. | Aug 2015 | B1 |
9106694 | Aziz et al. | Aug 2015 | B2 |
9118715 | Staniford et al. | Aug 2015 | B2 |
9159035 | Ismael et al. | Oct 2015 | B1 |
9171160 | Vincent et al. | Oct 2015 | B2 |
9176843 | Ismael et al. | Nov 2015 | B1 |
9189627 | Islam | Nov 2015 | B1 |
9195829 | Goradia et al. | Nov 2015 | B1 |
9197664 | Aziz et al. | Nov 2015 | B1 |
9223972 | Vincent et al. | Dec 2015 | B1 |
9225740 | Ismael et al. | Dec 2015 | B1 |
9241010 | Bennett et al. | Jan 2016 | B1 |
9251343 | Vincent | Feb 2016 | B1 |
20010005889 | Albrecht | Jun 2001 | A1 |
20010047326 | Broadbent et al. | Nov 2001 | A1 |
20020018903 | Kokubo et al. | Feb 2002 | A1 |
20020038430 | Edwards et al. | Mar 2002 | A1 |
20020091819 | Melchione et al. | Jul 2002 | A1 |
20020144156 | Copeland, III | Oct 2002 | A1 |
20020162015 | Tang | Oct 2002 | A1 |
20020166063 | Lachman et al. | Nov 2002 | A1 |
20020184528 | Shevenell et al. | Dec 2002 | A1 |
20020188887 | Largman et al. | Dec 2002 | A1 |
20020194490 | Halperin et al. | Dec 2002 | A1 |
20030074578 | Ford et al. | Apr 2003 | A1 |
20030084318 | Schertz | May 2003 | A1 |
20030115483 | Liang | Jun 2003 | A1 |
20030188190 | Aaron et al. | Oct 2003 | A1 |
20030200460 | Morota et al. | Oct 2003 | A1 |
20030212902 | Van Der Made | Nov 2003 | A1 |
20030237000 | Denton et al. | Dec 2003 | A1 |
20040003323 | Bennett et al. | Jan 2004 | A1 |
20040015712 | Szor | Jan 2004 | A1 |
20040019832 | Arnold et al. | Jan 2004 | A1 |
20040047356 | Bauer | Mar 2004 | A1 |
20040083408 | Spiegel et al. | Apr 2004 | A1 |
20040093513 | Cantrell et al. | May 2004 | A1 |
20040111531 | Staniford et al. | Jun 2004 | A1 |
20040165588 | Pandya | Aug 2004 | A1 |
20040236963 | Danford et al. | Nov 2004 | A1 |
20040243349 | Greifeneder et al. | Dec 2004 | A1 |
20040249911 | Alkhatib et al. | Dec 2004 | A1 |
20040255161 | Cavanaugh | Dec 2004 | A1 |
20040268147 | Wiederin et al. | Dec 2004 | A1 |
20050021740 | Bar et al. | Jan 2005 | A1 |
20050033960 | Vialen et al. | Feb 2005 | A1 |
20050033989 | Poletto et al. | Feb 2005 | A1 |
20050050148 | Mohammadioun et al. | Mar 2005 | A1 |
20050086523 | Zimmer et al. | Apr 2005 | A1 |
20050091513 | Mitomo et al. | Apr 2005 | A1 |
20050091533 | Omote et al. | Apr 2005 | A1 |
20050114663 | Cornell et al. | May 2005 | A1 |
20050125195 | Brendel | Jun 2005 | A1 |
20050157662 | Bingham et al. | Jul 2005 | A1 |
20050183143 | Anderholm et al. | Aug 2005 | A1 |
20050201297 | Peikari | Sep 2005 | A1 |
20050210533 | Copeland et al. | Sep 2005 | A1 |
20050238005 | Chen et al. | Oct 2005 | A1 |
20050265331 | Stolfo | Dec 2005 | A1 |
20060010495 | Cohen et al. | Jan 2006 | A1 |
20060015715 | Anderson | Jan 2006 | A1 |
20060021054 | Costa et al. | Jan 2006 | A1 |
20060031476 | Mathes et al. | Feb 2006 | A1 |
20060047665 | Neil | Mar 2006 | A1 |
20060070130 | Costea et al. | Mar 2006 | A1 |
20060075496 | Carpenter et al. | Apr 2006 | A1 |
20060095968 | Portolani et al. | May 2006 | A1 |
20060101516 | Sudaharan et al. | May 2006 | A1 |
20060101517 | Banzhof et al. | May 2006 | A1 |
20060117385 | Mester et al. | Jun 2006 | A1 |
20060123477 | Raghavan et al. | Jun 2006 | A1 |
20060143709 | Brooks et al. | Jun 2006 | A1 |
20060150249 | Gassen et al. | Jul 2006 | A1 |
20060161983 | Cothrell et al. | Jul 2006 | A1 |
20060161987 | Levy-Yurista | Jul 2006 | A1 |
20060161989 | Reshef et al. | Jul 2006 | A1 |
20060164199 | Gilde et al. | Jul 2006 | A1 |
20060173992 | Weber et al. | Aug 2006 | A1 |
20060179147 | Tran et al. | Aug 2006 | A1 |
20060184632 | Marino et al. | Aug 2006 | A1 |
20060191010 | Benjamin | Aug 2006 | A1 |
20060221956 | Narayan et al. | Oct 2006 | A1 |
20060236393 | Kramer et al. | Oct 2006 | A1 |
20060242709 | Seinfeld et al. | Oct 2006 | A1 |
20060251104 | Koga | Nov 2006 | A1 |
20060288417 | Bookbinder et al. | Dec 2006 | A1 |
20070006288 | Mayfield et al. | Jan 2007 | A1 |
20070006313 | Porras et al. | Jan 2007 | A1 |
20070011174 | Takaragi et al. | Jan 2007 | A1 |
20070016951 | Piccard et al. | Jan 2007 | A1 |
20070033645 | Jones | Feb 2007 | A1 |
20070038943 | FitzGerald et al. | Feb 2007 | A1 |
20070064689 | Shin et al. | Mar 2007 | A1 |
20070094730 | Bhikkaji et al. | Apr 2007 | A1 |
20070143827 | Nicodemus et al. | Jun 2007 | A1 |
20070156895 | Vuong | Jul 2007 | A1 |
20070157180 | Tillmann et al. | Jul 2007 | A1 |
20070157306 | Elrod et al. | Jul 2007 | A1 |
20070171824 | Ruello et al. | Jul 2007 | A1 |
20070174915 | Gribble et al. | Jul 2007 | A1 |
20070192500 | Lum | Aug 2007 | A1 |
20070192858 | Lum | Aug 2007 | A1 |
20070198275 | Malden et al. | Aug 2007 | A1 |
20070240218 | Tuvell et al. | Oct 2007 | A1 |
20070240219 | Tuvell et al. | Oct 2007 | A1 |
20070240220 | Tuvell et al. | Oct 2007 | A1 |
20070240222 | Tuvell et al. | Oct 2007 | A1 |
20070250930 | Aziz et al. | Oct 2007 | A1 |
20070271446 | Nakamura | Nov 2007 | A1 |
20080005782 | Aziz | Jan 2008 | A1 |
20080028463 | Dagon et al. | Jan 2008 | A1 |
20080072326 | Danford et al. | Mar 2008 | A1 |
20080077793 | Tan et al. | Mar 2008 | A1 |
20080080518 | Hoeflin et al. | Apr 2008 | A1 |
20080098476 | Syversen | Apr 2008 | A1 |
20080120722 | Sima et al. | May 2008 | A1 |
20080134178 | Fitzgerald et al. | Jun 2008 | A1 |
20080134334 | Kim et al. | Jun 2008 | A1 |
20080141376 | Clausen et al. | Jun 2008 | A1 |
20080184373 | Traut et al. | Jul 2008 | A1 |
20080189787 | Arnold et al. | Aug 2008 | A1 |
20080215742 | Goldszmidt et al. | Sep 2008 | A1 |
20080222729 | Chen et al. | Sep 2008 | A1 |
20080263665 | Ma et al. | Oct 2008 | A1 |
20080295172 | Bohacek | Nov 2008 | A1 |
20080301810 | Lehane et al. | Dec 2008 | A1 |
20080307524 | Singh et al. | Dec 2008 | A1 |
20080320594 | Jiang | Dec 2008 | A1 |
20090007100 | Field et al. | Jan 2009 | A1 |
20090013408 | Schipka | Jan 2009 | A1 |
20090031423 | Liu et al. | Jan 2009 | A1 |
20090036111 | Danford et al. | Feb 2009 | A1 |
20090044024 | Oberheide et al. | Feb 2009 | A1 |
20090044274 | Budko et al. | Feb 2009 | A1 |
20090083369 | Marmor | Mar 2009 | A1 |
20090083855 | Apap et al. | Mar 2009 | A1 |
20090089879 | Wang et al. | Apr 2009 | A1 |
20090094697 | Provos et al. | Apr 2009 | A1 |
20090109973 | Ilnicki | Apr 2009 | A1 |
20090125976 | Wassermann et al. | May 2009 | A1 |
20090126015 | Monastyrsky et al. | May 2009 | A1 |
20090126016 | Sobko et al. | May 2009 | A1 |
20090133125 | Choi et al. | May 2009 | A1 |
20090158430 | Borders | Jun 2009 | A1 |
20090187992 | Poston | Jul 2009 | A1 |
20090193293 | Stolfo et al. | Jul 2009 | A1 |
20090199296 | Xie et al. | Aug 2009 | A1 |
20090228233 | Anderson et al. | Sep 2009 | A1 |
20090241187 | Troyansky | Sep 2009 | A1 |
20090241190 | Todd et al. | Sep 2009 | A1 |
20090265692 | Godefroid et al. | Oct 2009 | A1 |
20090271867 | Zhang | Oct 2009 | A1 |
20090300761 | Park et al. | Dec 2009 | A1 |
20090328185 | Berg et al. | Dec 2009 | A1 |
20090328221 | Blumfield et al. | Dec 2009 | A1 |
20100017546 | Poo et al. | Jan 2010 | A1 |
20100043073 | Kuwamura | Feb 2010 | A1 |
20100054278 | Stolfo et al. | Mar 2010 | A1 |
20100058474 | Hicks | Mar 2010 | A1 |
20100064044 | Nonoyama | Mar 2010 | A1 |
20100077481 | Polyakov et al. | Mar 2010 | A1 |
20100083376 | Pereira et al. | Apr 2010 | A1 |
20100115621 | Staniford et al. | May 2010 | A1 |
20100132038 | Zaitsev | May 2010 | A1 |
20100154056 | Smith et al. | Jun 2010 | A1 |
20100192223 | Ismael et al. | Jul 2010 | A1 |
20100251104 | Massand | Sep 2010 | A1 |
20100281102 | Chinta et al. | Nov 2010 | A1 |
20100281541 | Stolfo et al. | Nov 2010 | A1 |
20100281542 | Stolfo et al. | Nov 2010 | A1 |
20100287260 | Peterson et al. | Nov 2010 | A1 |
20110025504 | Lyon et al. | Feb 2011 | A1 |
20110041179 | Stahlberg | Feb 2011 | A1 |
20110047594 | Mahaffey et al. | Feb 2011 | A1 |
20110047620 | Mahaffey et al. | Feb 2011 | A1 |
20110078794 | Manni et al. | Mar 2011 | A1 |
20110093951 | Aziz | Apr 2011 | A1 |
20110099633 | Aziz | Apr 2011 | A1 |
20110113231 | Kaminsky | May 2011 | A1 |
20110145920 | Mahaffey et al. | Jun 2011 | A1 |
20110167494 | Bowen et al. | Jul 2011 | A1 |
20110247072 | Staniford et al. | Oct 2011 | A1 |
20110265182 | Peinado et al. | Oct 2011 | A1 |
20110307954 | Melnik et al. | Dec 2011 | A1 |
20110307955 | Kaplan et al. | Dec 2011 | A1 |
20110307956 | Yermakov et al. | Dec 2011 | A1 |
20110314546 | Aziz et al. | Dec 2011 | A1 |
20120079596 | Thomas et al. | Mar 2012 | A1 |
20120084859 | Radinsky et al. | Apr 2012 | A1 |
20120117652 | Manni et al. | May 2012 | A1 |
20120174186 | Aziz et al. | Jul 2012 | A1 |
20120174218 | McCoy et al. | Jul 2012 | A1 |
20120198279 | Schroeder | Aug 2012 | A1 |
20120210423 | Friedrichs et al. | Aug 2012 | A1 |
20120222121 | Staniford et al. | Aug 2012 | A1 |
20120278886 | Luna | Nov 2012 | A1 |
20120297489 | Dequevy | Nov 2012 | A1 |
20120330801 | McDougal et al. | Dec 2012 | A1 |
20130036472 | Aziz | Feb 2013 | A1 |
20130047257 | Aziz | Feb 2013 | A1 |
20130097706 | Titonis et al. | Apr 2013 | A1 |
20130160130 | Mendelev et al. | Jun 2013 | A1 |
20130160131 | Madou et al. | Jun 2013 | A1 |
20130227691 | Aziz et al. | Aug 2013 | A1 |
20130246370 | Bartram et al. | Sep 2013 | A1 |
20130263260 | Mahaffey et al. | Oct 2013 | A1 |
20130291109 | Staniford et al. | Oct 2013 | A1 |
20130298243 | Kumar et al. | Nov 2013 | A1 |
20140007238 | Magee et al. | Jan 2014 | A1 |
20140053260 | Gupta et al. | Feb 2014 | A1 |
20140053261 | Gupta et al. | Feb 2014 | A1 |
Number | Date | Country |
---|---|---|
2439806 | Jan 2008 | GB |
WO-0206928 | Jan 2002 | WO |
WO-0223805 | Mar 2002 | WO |
WO-2007-117636 | Oct 2007 | WO |
WO-2008041950 | Apr 2008 | WO |
WO-2011084431 | Jul 2011 | WO |
WO-2012145066 | Oct 2012 | WO |
Entry |
---|
Cisco, Configuring the Catalyst Switched Port Analyzer (SPAN) (“Cisco”) (1992-2003). |
Reiner Sailer, Enriquillo Valdez, Trent Jaeger, Roonald Perez, Leendert van Doorn, John Linwood Griffin, Stefan Berger., sHype: Secure Hypervisor Appraoch to Trusted Virtualized Systems (Feb. 2, 2005) (“Sailer”). |
Excerpt regarding First Printing Date for Merike Kaeo, Designing Network Security (“Kaeo”) (2005). |
The Sniffers's Guide to Raw Traffic available at: yuba.stanford.edu/˜casado/pcap/section1.html, (Jan. 6, 2014). |
NetBIOS Working Group. Protocol Standard for a NetBIOS Service on a TCP/UDP transport: Concepts and Methods. STD 19, RFC 1001, Mar. 1987. |
“Network Security: NetDetector—Network Intrusion Forensic System (NIFS) Whitepaper”, (“NetDetector Whitepaper”). |
“Packet”, Microsoft Computer Dictionary, Microsoft Press, (Mar. 2002), 1 page. |
“When Virtual is Better Than Real”, IEEEXplore Digital Library, available at, http://ieeexplore.ieee.org/xpl/articleDetails.jsp?reload=true&arnumber=990073 (Dec. 7, 2013). |
Abdullah, et al., Visualizing Network Data for Intrusion Detection, 2005 IEEE Workshop on Information Assurance and Security, pp. 100-108. |
Adetoye, Adedayo , et al., “Network Intrusion Detection & Response System”, (“Adetove”) (Sep. 2003). |
Aura, Tuomas, “Scanning electronic documents for personally identifiable information”, Proceedings of the 5th ACM workshop on Privacy in electronic society. ACM, 2006. |
Baecher, “The Nepenthes Platform: An Efficient Approach to collect Malware”, Springer-verlag Berlin Heidelberg, (2006), pp. 165-184. |
Bayer, et al., “Dynamic Analysis of Malicious Code”, J Comput Virol, Springer-Verlag, France., (2006), pp. 67-77. |
Boubalos, Chris , “extracting syslog data out of raw pcap dumps, seclists.org, Honeypots mailing list archives”, available at http://seclists.org/honeypots/2003/q2/319 (“Boubalos”), (Jun. 5, 2003). |
Chaudet, C. , et al., “Optimal Positioning of Active and Passive Monitoring Devices”, International Conference on Emerging Networking Experiments and Technologies, Proceedings of the 2005 ACM Conference on Emerging Network Experiment and Technology, CoNEXT '05, Toulousse, France, (Oct. 2005), pp. 71-82. |
Cohen, M.I. , “PyFlag—An advanced network forensic framework”, Digital investigation 5, Elsevier, (2008), pp. S112-S120. |
Costa, M. , et al., “Vigilante: End-to-End Containment of Internet Worms”, SOSP '05, Association for Computing Machinery, Inc., Brighton U.K., (Oct. 23-26, 2005). |
Crandall, J.R. , et al., “Minos:Control Data Attack Prevention Orthogonal to Memory Model”, 37th International Symposium on Microarchitecture, Portland, Oregon, (Dec. 2004). |
Deutsch, P. , “Zlib compressed data format specification version 3.3” RFC 1950, (1996). |
Distler, “Malware Analysis: An Introduction”, SANS Institute InfoSec Reading Room, SANS Institute, (2007). |
Dunlap, George W. et al., “ReVirt: Enabling Intrusion Analysis through Virtual-Machine Logging and Replay”, Proceeding of the 5th Symposium on Operating Systems Design and Implementation, USENIX Association, (“Dunlap”), (Dec. 9, 2002). |
Filiol, Eric , et al., “Combinatorial Optimisation of Worm Propagation on an Unknown Network”, International Journal of Computer Science 2.2 (2007). |
Goel, et al., Reconstructing System State for Intrusion Analysis, Apr. 2008 SIGOPS Operating Systems Review, vol. 42 Issue 3, pp. 21-28. |
Kaeo, Merike , “Designing Network Security”, (“Kaeo”), (Nov. 2003). |
Kim, H. , et al., “Autograph: Toward Automated, Distributed Worm Signature Detection”, Proceedings of the 13th Usenix Security Symposium (Security 2004), San Diego, (Aug. 2004), pp. 271-286. |
Krasnyansky, Max , et al., Universal TUN/TAP driver, available at https://www.kernel.org/doc/Documentation/networking/tuntap.txt (2002) (“Krasnyansky”). |
Kreibich, C. , et al., “Honeycomb-Creating Intrusion Detection Signatures Using Honeypots”, 2nd Workshop on Hot Topics in Networks (HotNets-11), Boston, USA, (2003). |
Kristoff, J. , “Botnets, Detection and Mitigation: DNS-Based Techniques”, NU Security Day, (2005), 23 pages. |
Marchette, David J., “Computer Intrusion Detection and Network Monitoring: A Statistical Viewpoint”, (“Marchette”), (2001). |
Margolis, P.E. , “Random House Webster's ‘Computer & Internet Dictionary 3rd Edition’”, ISBN 0375703519, (Dec. 1998). |
Moore, D. , et al., “Internet Quarantine: Requirements for Containing Self-Propagating Code”, INFOCOM, vol. 3, (Mar. 30-Apr. 3, 2003), pp. 1901-1910. |
Morales, Jose A., et al., ““Analyzing and exploiting network behaviors of malware.””, Security and Privacy in Communication Networks. Springer Berlin Heidelberg, 2010. 20-34. |
Natvig, Kurt , “SandboxII: Internet”, Virus Bulletin Conference, (“Natvig”), (Sep. 2002). |
Newsome, J. , et al., “Dynamic Taint Analysis for Automatic Detection, Analysis, and Signature Generation of Exploits on Commodity Software”, In Proceedings of the 12th Annual Network and Distributed System Security, Symposium (NDSS '05), (Feb. 2005). |
Newsome, J. , et al., “Polygraph: Automatically Generating Signatures for Polymorphic Worms”, In Proceedings of the IEEE Symposium on Security and Privacy, (May 2005). |
Nojiri, D. , et al., “Cooperation Response Strategies for Large Scale Attack Mitigation”, DARPA Information Survivability Conference and Exposition, vol. 1, (Apr. 22-24, 2003), pp. 293-302. |
Peter M. Chen, and Brian D. Noble , “When Virtual is Better Than Real, Department of Electrical Engineering and Computer Science”, University of Michigan (“Chen”). |
Silicon Defense, “Worm Containment in the Internal Network”, (Mar. 2003), pp. 1-25. |
Singh, S. , et al., “Automated Worm Fingerprinting”, Proceedings of the ACM/USENIX Symposium on Operating System Design and Implementation, San Francisco, California, (Dec. 2004). |
Spitzner, Lance , “Honeypots: Tracking Hackers”, (“Spizner”), (Sep. 17, 2002). |
Thomas H. Ptacek, and Timothy N. Newsham “Insertion, Evasion, and Denial of Service: Eluding Network Intrusion Detection”, Secure Networks, (“Ptacek”), (Jan. 1998). |
Venezia, Paul , “NetDetector Captures Intrusions”, InfoWorld Issue 27, (“Venezia”), (Jul. 14, 2003). |
Whyte, et al., “DNS-Based Detection of Scanning Works in an Enterprise Network”, Proceedings of the 12th Annual Network and Distributed System Security Symposium, (Feb. 2005), 15 pages. |
Williamson, Matthew M., “Throttling Viruses: Restricting Propagation to Defeat Malicious Mobile Code”, ACSAC Conference, Las Vegas, NV, USA, (Dec. 2002), pp. 1-9. |
AltaVista Advanced Search Results. “attack vector identifier”. Http://www.altavista.com/web/results?Itag=ody&pg=aq&aqmode=aqa=Event+Orch- estrator . . . , (Accessed on Sep. 15, 2009). |
AltaVista Advanced Search Results. “Event Orchestrator”. Http://www.altavista.com/web/results?Itag=ody&pg=aq&aqmode=aqa=Event+Orch- esrator . . . , (Accessed on Sep. 3, 2009). |
Hjelmvik, Erik , “Passive Network Security Analysis with NetworkMiner”, (IN)Secure, Issue 18, (Oct. 2008), pp. 1-100. |
IEEE Xplore Digital Library Sear Results for “detection of unknown computer worms”. Http//ieeexplore.ieee.org/searchresult.jsp?SortField=Score&SortOrder=desc- &ResultC . . . , (Accessed on Aug. 28, 2009). |
King, Samuel T., et al., “Operating System Support for Virtual Machines”, (“King”) (2003). |
Liljenstam, Michael , et al., “Simulating Realistic Network Traffic for Worm Warning System Design and Testing”, Institute for Security Technology studies, Dartmouth College (“Liljenstam”), (Oct. 27, 2003). |