Fuzzy hash of behavioral results

Description

FIELD

Embodiments of the disclosure relate to the field of network security. More specifically, one embodiment of the disclosure relates to a system, apparatus, and method for classifying a suspect object in a malware system using a fuzzy hash of behaviors of the suspect object and clusters of previously classified objects.

GENERAL BACKGROUND

Over the last decade, malicious software (malware) has become a pervasive problem for Internet users. In some situations, malware is a program, file, or digital data object that is embedded within downloadable content and designed to adversely influence (i.e., attack) normal operations of a computer. Examples of different types of malware may include bots, computer viruses, worms, Trojan horses, spyware, adware, or any other programming that operates within the computer without permission.

For instance, content may be embedded with objects associated with a web page hosted by a malicious web site. By downloading this content, malware causing another web page to be requested from a malicious web site may be unknowingly installed on the computer. Similarly, malware may also be installed on a computer upon receipt or opening of an electronic mail (email) message. For example, an email message may contain an attachment, such as a Portable Document Format (PDF) document, with embedded executable malware. Also, malware may exist in files infected through any of a variety of attack vectors, which are uploaded from the infected computer onto a networked storage device such as a file share.

As development of malware has progressed, hackers have developed malware that share similarities with other malware objects, but maintain some dissimilarities. Accordingly, these “similar” malware objects may be in the same malware family, but traditional malware and anti-virus protection systems may fail to properly classify each object in the family as malware based on these differences. For example, traditional malware detection and classification techniques may employ a direct comparison of a suspect object with known malware objects in an attempt to reveal an exact match. However, if the suspected malware object has not been previously detected and analyzed (e.g., zero-day malware threats), these direct comparison techniques will fail to classify the object as malware even if “similar” objects have been previously classified as malware. Accordingly, traditional malware classification and analysis techniques may prove inaccurate and inefficient as these techniques do not accommodate for small difference between malware objects within a family of malware.

BRIEF DESCRIPTION OF THE DRAWINGS

Embodiments of the invention are illustrated by way of example and not by way of limitation in the figures of the accompanying drawings, in which like references indicate similar elements and in which:

FIG. 1 is an exemplary block diagram of a communication system deploying a plurality of malicious content detection (MCD) systems according to one embodiment of the invention.

FIG. 2 is an exemplary block diagram of a MCD system according to one embodiment of the invention.

FIG. 3 is a diagram of a method for classifying objects using fuzzy hashes of previously classified objects according to one embodiment of the invention.

FIG. 4A shows an example user interface for entering information for a suspect object according to one embodiment of the invention.

FIG. 4B shows the example user interface of FIG. 4A after a warning message has been returned to a user according to one embodiment of the invention.

FIG. 5 is a diagram of a set of objects assigned to a set of clusters according to one embodiment of the invention.

FIG. 6 is a diagram of a set of objects known as malware, known as non-malware, or with an unknown status and assigned to a set of clusters according to one embodiment of the invention.

FIG. 7A is a diagram of a suspect object being added to a preexisting cluster according to one embodiment of the invention.

FIG. 7B is a diagram of a suspect object being added to a new cluster according to one embodiment of the invention.

DETAILED DESCRIPTION
I. Overview

In one embodiment of the invention, a communication system is provided that includes a plurality of malicious content detection (MCD) systems communicatively coupled to a management system via a network. Each of the MCD systems may detonate, execute, open, or otherwise process a suspected malware object such that the suspect object conducts/performs a set of behaviors. These behaviors are collected and recorded such that further analysis with objects previously analyzed and assigned to clusters may be performed. In one embodiment, the recorded behavior data for the suspect object may be used to generate a fuzzy hash. A fuzzy hash allows the comparison of objects to determine similarity of the objects instead of necessarily a direct match. In comparison, traditional hashing techniques only allow a comparison of objects to determine an exact match. By allowing the determination of “similar” objects, fuzzy hashes afford a greater leniency in classification and categorization of objects that might be slightly different but otherwise share important characteristics.

As alluded to above, the fuzzy hash of the suspect object is compared against fuzzy hashes of one or more objects in one or more clusters. In one embodiment, machine learning may be utilized to determine a “similar” object in a cluster. Upon detection of a “similar” object, the suspect object may be associated with the cluster and classified based on information attached to the cluster. For example, the suspect object may be classified as malware, non-malware, or with an unknown status based on the classification of objects within the cluster. In some embodiments, the suspect object may be assigned a malware family name associated with the cluster.

As described above, fuzzy hash techniques may be used to group “similar” objects in clusters for further analysis and classification. This similarity matching provides 1) greater flexibility in analyzing potential malware objects, which may share multiple characteristics and behaviors but are also slightly different from previously classified objects, 2) a more efficient technique for classifying/assigning attributes to objects (e.g., malware family names), and 3) increase accuracy in identifying malware.

II. Terminology

In the following description, certain terminology is used to describe features of the invention. For example, in certain situations, the terms “logic” and “engine” are representative of hardware, firmware or software that is configured to perform one or more functions. As hardware, logic may include circuitry such as processing circuitry (e.g., a microprocessor, one or more processor cores, a programmable gate array, a microcontroller, an application specific integrated circuit, etc.), wireless receiver, transmitter and/or transceiver circuitry, semiconductor memory, combinatorial logic, or other types of electronic components.

As software, logic may be in the form of one or more software modules, such as executable code in the form of an executable application, an application programming interface (API), a subroutine, a function, a procedure, an applet, a servlet, a routine, source code, object code, a shared library/dynamic load library, or one or more instructions. These software modules may be stored in any type of a suitable non-transitory storage medium, or transitory storage medium (e.g., electrical, optical, acoustical or other form of propagated signals such as carrier waves, infrared signals, or digital signals). Examples of non-transitory storage medium may include, but is not limited or restricted to a programmable circuit; a semiconductor memory; non-persistent storage such as volatile memory (e.g., any type of random access memory “RAM”); persistent storage such as non-volatile memory (e.g., read-only memory “ROM”, power-backed RAM, flash memory, phase-change memory, etc.), a solid-state drive, hard disk drive, an optical disc drive, or a portable memory device. As firmware, the executable code is stored in persistent storage.

The term “content” generally refers to information transmitted over a network as one or more messages, namely a grouping of information that comprises a header and a payload, such as any of the following: a packet; a frame; a stream being a sequence of packets or frames; an Asynchronous Transfer Mode “ATM” cell; or any other series of bits having a prescribed format. An “object” may be construed as a portion of the content, namely information within one or more of the messages. The “payload” is generally defined as including the data associated with the message such as text, executable software, an image, audio, video, a Uniform Resource Locator (URL), or other types of digital data. The “header” is generally defined as a part of the message that includes control information. However, the specific types of control information depend on the content/object type.

For network traffic, such as data transmitted in accordance with a Hypertext Transfer Protocol (HTTP), HyperText Markup Language (HTML) protocol, the header may include source and destination Internet Protocol (IP) addresses (e.g., IPv4 or IPv6 addressing) and/or source and destination port information.

Another example of content or objects includes email, which may be transmitted using an email protocol such as Simple Mail Transfer Protocol (SMTP), Post Office Protocol version 3 (POP3), or Internet Message Access Protocol (IMAP4). A further example of content or objects includes an Instant Message, which may be transmitted using Session Initiation Protocol (SIP) or Extensible Messaging and Presence Protocol (XMPP) for example. Yet another example of content or objects includes one or more files that are transferred using a data transfer protocol such as File Transfer Protocol (FTP) for subsequent storage on a file share. Where the content or object is email, Instant Message or a file, the header may include the sender/recipient address, the sender/recipient phone number, or a targeted network location of the file, respectively.

The term “malware” is directed to software that produces an undesirable behavior upon execution, where the behavior is deemed to be “undesirable” based on customer-specific rules, manufacturer-based rules, or any other type of rules formulated by public opinion or a particular governmental or commercial entity. This undesired behavior may include a communication-based anomaly or an execution-based anomaly that (1) alters the functionality of an electronic device executing that application software in a malicious manner; (2) alters the functionality of an electronic device executing that application software without any malicious intent; and/or (3) provides an unwanted functionality which is generally acceptable in other context.

The term “transmission medium” is a communication path between two or more systems (e.g. any electronic devices with data processing functionality such as, for example, a security appliance, server, mainframe, computer, netbook, tablet, smart phone, router, switch, bridge or router). The communication path may include wired and/or wireless segments. Examples of wired and/or wireless segments include electrical wiring, optical fiber, cable, bus trace, or a wireless channel using infrared, radio frequency (RF), or any other wired/wireless signaling mechanism.

The term “computerized” generally represents that any corresponding operations are conducted by hardware in combination with software and/or firmware.

Lastly, the terms “or” and “and/or” as used herein are to be interpreted as inclusive or meaning any one or any combination. Therefore, “A, B or C” or “A, B and/or C” mean “any of the following: A; B; C; A and B; A and C; B and C; A, B and C.” An exception to this definition will occur only when a combination of elements, functions, steps or acts are in some way inherently mutually exclusive.

As this invention is susceptible to embodiments of many different forms, it is intended that the present disclosure is to be considered as an example of the principles of the invention and not intended to limit the invention to the specific embodiments shown and described.

III. General Architecture

Referring to FIG. 1, an exemplary block diagram of a communication system 100 deploying a plurality of malware content detection (MCD) systems 110₁-110_N(N>1, e.g. N=3) communicatively coupled to a management system 120 via a network 125 is shown. In general, management system 120 is adapted to manage MCD systems 110₁-110_N. For instance, management system 120 may be adapted to cause one or more clusters of objects, each of which comprise information representative of prior detected and classified objects, to be shared among some or all of the MCD systems 110₁-110_Nfor use in malware checks. Such sharing may be conducted automatically or manually uploaded by an administrator. Also, such sharing may be conducted freely among the MCD systems 110₁-110_Nor subject to a subscription basis.

Herein, according to the embodiment illustrated in FIG. 1, a first MCD system 110₁is an electronic device that is adapted to analyze information associated with network traffic routed over a communication network 130 between at least one server device 140 and at least one client device 150.

The communication network 130 may include a public computer network such as the Internet, in which case an optional firewall 155 (represented by dashed lines) may be interposed between communication network 130 and the client device 150. Alternatively, the communication network 130 may be a private computer network such as a wireless telecommunication network, wide area network, or local area network, or a combination of networks.

The first MCD system 110₁is shown as being coupled with the communication network 130 (behind the firewall 155) via a network interface 160. The network interface 160 operates as a data capturing device (referred to as a “tap” or “network tap”) that is configured to receive network traffic propagating to/from the client device 150 and provide content from the network traffic to the first MCD system 110₁.

In general, the network interface 160 receives and duplicates the content that is received from and provided to client device 150 normally without an appreciable decline in performance by the server device 140, the client device 150, or the communication network 130. The network interface 160 may duplicate any portion of the content, for example, one or more files or objects that are part of a data flow or part of the payload contained within certain data packets, or the like.

In some embodiments, the network interface 160 may capture metadata from network traffic intended for the client device 150. This metadata may be used, at least in part, to deconstruct a corresponding file. For instance, the metadata may include keys that can be used to de-obfuscate a file or object.

It is contemplated that, for any embodiments where the first MCD system 110₁is implemented as an dedicated appliance or a dedicated computer system, the network interface 160 may include an assembly integrated into the appliance or computer system that includes network ports, a network interface card and related logic (not shown) for connecting to the communication network 130 to non-disruptively “tap” network traffic propagating through firewall 155 and provide a copy of the network traffic to the dynamic analysis engine 190. In other embodiments, the network interface 160 can be integrated into an intermediary device in the communication path (e.g., firewall 155, router, switch or other network device) or can be a standalone component, such as an appropriate commercially available network tap. In virtual environments, a virtual tap (vTAP) can be used to duplicate files from virtual networks.

Referring still to FIG. 1, first MCD system 110₁may include a scheduler 180, a storage device 185, a dynamic analysis engine 190, and a clustering and reporting module 195. In some embodiments, the network interface 160 may be contained within the first MCD system 110₁. Also, the dynamic analysis engine 190 and the clustering and reporting module 195 may be software modules executed by a processor that receives content and performs a dynamic scan analysis on objects within the content, which may involve accessing one or more non-transitory storage mediums operating as the storage device 185. In some embodiments, the dynamic analysis engine 190 may be one or more software modules, where such software modules are executed by a processor within the MCD system 110₁. The clustering and reporting module 195 may be one or more software modules executed by the same or a different processor, where these different processors are possibly located at geographically remote locations, located within the same processor package (e.g. different processor cores) and/or communicatively coupled for example via a network.

Herein, in one embodiment, the static analysis engine 175 may serve as a filter to permit subsequent malware analysis of one or more objects that may represent only on a portion of incoming content, which effectively conserves system resources and provides faster response time in determining the presence of malware within the analyzed content. As shown in FIG. 1, the static analysis engine 175 receives the copy of incoming content from the network interface 160 and applies heuristics to determine if any object(s) of the content are “suspicious”. The heuristics applied by the static analysis engine 175 may be based on data and/or rules stored in a database (not shown). Also, the static analysis engine 175 may examine the image of the captured content without executing or opening the captured content.

For example, the static analysis engine 175 may examine objects such as metadata or certain attributes of the captured content to determine whether a certain portion of the captured object matches (e.g., a high level of correlation with) a predetermined pattern of attributes that is associated with a malicious attack. According to one embodiment of the disclosure, the static analysis engine 175 flags objects from one or more data flows as suspicious after applying this heuristic analysis.

Thereafter, according to one embodiment of the invention, the static analysis engine 175 may be adapted to transmit at least an object of the suspicious content to the dynamic analysis engine 190. The portion of the object(s), such as some metadata for example, may identify attributes of the runtime environment in which the suspicious content should be processed and, on occasion, of the client device(s) 150 to which the suspicious content was being sent. Such metadata or attributes are used to identify a configuration of a virtual machine (VM) needed for subsequent malware analysis. In another embodiment of the disclosure, the dynamic analysis engine 190 may be adapted to receive one or more messages (e.g., data packets) from the static analysis engine 175 and analyze the message(s) to identify the software profile information associated with the needed VM.

For instance, as an illustrative example, the suspicious object(s) under test may include a portion of an email message that was generated, under control of Windows® 7 Operating System, using a Windows® Outlook 2010, version 1. Upon determining that the object includes suspicious content, such as an attachment for example, static analysis engine 175 provides software profile information to scheduler 180 to identify a particular configuration of VM needed to conduct dynamic analysis of the suspicious object. According to this illustrative example, the software profile information would include (1) Windows® 7 Operating System (OS); (2) Windows® Outlook 2010, version 1; and perhaps (3) an Adobe® reader if the attachment is a Portable Document Format (PDF) document.

The static analysis engine 175 supplies the software profile information to the scheduler 180, which determines whether any of the VM disk files within storage device 185 feature a software profile supporting the above-identified configuration of OS and one or more applications or a suitable alternative.

The dynamic analysis engine 190 is adapted to execute multiple VMs, to simulate the receipt and processing of different types of “suspicious” objects as well as different operating environments. Furthermore, the dynamic analysis engine 190 monitors and analyzes the activities and other behaviors of such objects during processing in the VM. The behaviors may include those expected and/or not expected during processing of that type of object. Unexpected behaviors can be considered anomalous behaviors. Examples of anomalous behaviors may include unusual network transmissions, opening certain ports to retrieve data, unusual changes in performance, and the like. This detection process is referred to as a dynamic malicious content detection.

The dynamic analysis engine 190 may flag the suspicious object as malware according to the observed behavior of the VM. In response to detecting anomalous behaviors, the dynamic analysis engine 190 may provide information to the cluster and reporting module 195 to conduct further analysis with objects previously analyzed and assigned to clusters, as described below.

Referring now to FIG. 2, an exemplary block diagram of logic associated with MCD system 110₁is shown. MCD system 110₁comprises one or more processors 200 that are coupled to communication interface logic 210 via a first transmission medium 220. Communication interface logic 210 enables communications with other MCD systems 110₂-110_N, management system 120 and/or cloud computing services 135 of FIG. 1. According to one embodiment of the disclosure, communication interface logic 210 may be implemented as a physical interface including one or more ports for wired connectors. Additionally, or in the alternative, communication interface logic 210 may be implemented with one or more radio units for supporting wireless communications with other electronic devices.

Processor(s) 200 is(are) further coupled to persistent storage 230 via transmission medium 225. According to one embodiment of the disclosure, persistent storage 230 may include static analysis engine 175, dynamic analysis engine 190, graphical user interface (GUI) logic 271, configuration logic 273, and clustering and reporting module 195, which comprises behavior analysis logic 231, sanitization logic 233, fuzzy hashing logic 235, comparison logic 237, and malware score logic 239. Of course, when implemented as hardware, engine 190 and logic 231, 233, 235, 237, 239, 271, and 273 would be implemented separately from persistent storage 230.

Turning now to FIG. 3, a method for classifying objects 300 will now be described. Each operation of the method 300 may be performed by one or more components of the MCD system 110₁. For example, the operation of method 300 may be performed by the dynamic analysis engine 190 in conjunction with the clustering and reporting module 195 of the MCD system 110₁. In other embodiments, the operations of method 300 may be performed in full or in part by other components of the communication system 100.

The method 300 may commence at operation 301 with receipt of a suspect object to be classified. The suspect object may be intercepted by the network interface 160 and passed to the MCD system 110₁for analysis. In another embodiment, an anti-malware system running on the client device 150 may periodically and without direct provocation by the user intercept and transmit objects to the MCD system 110₁for processing and analysis. This independent interception and analysis of objects allows the client device 150 to maintain an automatic examination of potential malware objects received without direct interaction by a user.

In another embodiment, a user of the client device 150 may submit a suspect object through a user interface. The interface may be generated by GUI logic 271 and served to the client device 150 by configuration logic 273 of the MCD system 110₁. In this fashion, the MCD system 110₁may operate as a web-server to deliver data and a user interface to the client device 150.

FIG. 4A shows a web-interface 400 for submitting a suspect object to the MCD system 110₁for analysis according to one embodiment. In this example interface 400, a user may direct a web browser running on the client device 150 to view the web-interface 400. The user may thereinafter enter the address/location of a suspect object into the web-interface 400 using the address input field 401 and the “BROWSE” button 403. The entered address indicates the location of the suspect object in storage on the client device 150 or on a remote device (e.g., stored on a website). After selection of a suspect object, the user may submit the suspect object for analysis by selecting the “SCAN” button 405 in the web-interface 400. The suspect object may be transmitted from the client device 150 such that it is received by the MCD 110₁for processing as described above at operation 301.

In one embodiment, a suspect object may be any digital data structure. For example, a suspect object may be a file (e.g., PDF document), a component of a file, a component of a web page, an image, a series of captured network/web traffic that is capable of being replayed, etc. As described above, a user of the client device 150 may manually determine that an object is suspected to be malware or the client device 150 may automatically classify the object as potential/suspected malware and transmit the suspect object to the MCD system 110₁.

Referring back to FIG. 3, although described in relation to receiving a single suspect object, in other embodiments, the method 300 may be used in relation to multiple suspect objects received simultaneously or in rapid succession. For example, the method 300 may be used to analyze multiple suspect objects received from the client device 150 or other devices on the network 130. The suspect objects may be processed by the method 300 separately to determine whether each received suspect object is malware based on comparisons with previously generated clusters of objects using fuzzy hashes as described in greater detail below.

Following receipt of a suspect object, operation 303 determines behaviors of the suspect object using the behavior analysis logic 231. The determined behaviors characterize the suspect object such that a comparison can be performed with other previously classified objects in one or more object clusters as will be described in further detail below.

In one embodiment, the behaviors may be determined/detected at operation 303 after the suspect object has been detonated, namely processed (e.g. executed, opened or otherwise activated), by the dynamic analysis engine 190. For example, dynamic analysis engine 190 may detonate the suspect object such that operations associated with the suspect object are performed. For instance, in one embodiment the suspect object may be a PDF file. In this embodiment, the dynamic analysis engine 190 may detonate the PDF file by opening the file using an Adobe® Reader or other appropriate document reader.

In one embodiment, one or more virtual machines with various profiles that simulate the client device 150 may be used during detonation of the suspect object. These profiles may be software to be run by a virtual machine to process a suspect object. For example, the profiles may include an operating system and one or more suitable computer applications that are associated with the client device 150. For instance, an Adobe® Reader may be included in a virtual machine such that a suspect object, which is a PDF file, may be detonated by the virtual machine. Use of virtual machines ensures that detonation of the suspect object is controlled and will not result in infection of the client device 150 while still simulating the computing environment of the client device 150 to generate behavior data that describes the suspect object.

As noted above, detonation of the suspect object produces behavior data that describes the suspect object such that a comparison may later be performed with other objects. This behavior data may be detected and collected at operation 303 using the behavior analysis logic 231. The behavior data may include, for example, details regarding data generated by the suspect object during detonation, data accessed by the suspect object (both locally and from remote systems) during detonation, known exploits in the suspect object, etc.

In one embodiment, operation 305 may scrub the behavior data detected and collected at operation 303 to remove data that does not identify the suspect object. This scrubbing operation may be performed using the sanitization logic 233. In one embodiment, scrubbing the behavior data includes removing a subset of process identifiers of processes called by the suspect object during detonation, values written to, deleted from, or modified to a registry by the suspect object during detonation such that only the path of these operations is retained, and names of files generated, modified, and/or deleted by the suspect object during detonation such that only a path in an associated file system is retained. This removed/scrubbed data may be discarded at operation 305 as it does not identify the suspect object in relation to other objects and may be considered superfluous.

After the behavior data has been scrubbed at operation 305 to generate scrubbed behavior data, the method 300 may perform two analyses: 1) an analysis to associate the suspect object with a cluster of previously stored/analyzed objects and 2) an analysis to generate a malware score, which describes the probability that the suspect object is malware. The analyses may be run concurrently or asynchronously. In one embodiment, the results of the first analysis (i.e., cluster association) may be used to modify the malware score generated by the second analysis. Each of these analyses will be described in greater detail below.

Beginning with the first analysis of the suspect object, at operation 307 a fuzzy hash for the suspect object may be generated based on the scrubbed behavior data using the fuzzy hashing logic 235. A fuzzy hash allows the comparison of objects to determine similarity of the objects instead of necessarily a direct match. In comparison, traditional hashing techniques only allow a comparison of objects to determine an exact match. By allowing the determination of “similar” objects, fuzzy hashes afford a greater leniency in classification and categorization of objects that might be slightly different but otherwise share important characteristics. Through the utilization of a fuzzy hash, similar objects may be determined through a comparison of hash values within the fuzzy hash as will be described in greater detail below.

In one embodiment, a fuzzy hash is constructed by running a hashing algorithm over blocks of the scrubbed behavior data for an object. In one embodiment, an MD5 hash may be performed on successive blocks of scrubbed behavior data to produce a plurality or a stream of hash values. For example, the scrubbed behavior data may be separated into N equal sized blocks, where N is greater than or equal to two (e.g., 1024 byte blocks). A hash value is produced for each of the N blocks to generate exactly N hash values. In one embodiment, the scrubbed behavior data may be separated into blocks corresponding to segments of data that represent discrete behaviors detected at operation 303. Accordingly, in this embodiment, each block represents a single detected behavior associated with the suspect object.

Although described in relation to use of an MD5 hash for generation of the fuzzy hash, in other embodiments other hashing techniques/methods may be used. For example, in other embodiments a SHA, SWIFFT, and/or HAVAL hash may be used to generate the fuzzy hash for the suspect object at operation 307.

Following the generation of a fuzzy hash for the suspect object at operation 307, operation 309 may compare the fuzzy hash for the suspect object with one or more fuzzy hashes of other previously stored/classified objects associated with clusters. This comparison may be performed using the comparison logic 237 of the MCD system 110₁. In one embodiment, the previously stored clusters of objects are stored locally on the MCD system 110₁in the storage device 185 or a separate data store (e.g. part of persistent storage 230 of FIG. 2). In other embodiments, the previously stored clusters of objects may be stored in cloud computing services 135 or the management system 120. In these embodiments, the management system 120 may distribute clusters of objects to MCD systems 110₁-110₃as needed or the MCD systems 110₁-110₃may directly access the clusters of objects over the network 125 for analysis of other objects received by these MCD systems 110₁-110₃.

As shown in FIG. 5, each previously stored object 501 may be associated with one or more behaviors. Similar to the behaviors of the suspect object, the behaviors of the previously stored objects 501 characterize the dynamic actions, operations, and activities of the objects 501 during detonation. A fuzzy hash may be associated with each object 501 based on these behaviors in a similar fashion as described above in relation to operation 307. Each of the previously stored objects 501 may be associated with a cluster 503 based on a similarity of fuzzy hashes for each respective object 501. For example, as shown in FIG. 5, objects 501₁and 501₂are associated with the cluster 503₁. This association indicates that the fuzzy hashes of objects 501₁and 501₂are “similar” and may be considered in the same family. In one embodiment, similarity may be described in terms of the number of matching hash values between the respective fuzzy hashes of objects 501₁and 501₂. For example, the fuzzy hashes of two objects may be compared to determine a similarity measure. The similarity measure may describe the percentage or number of matching hash values between the two fuzzy hashes. In one embodiment, a similarity measure above a predefined similarity threshold indicates that the objects are similar. Since the objects 501₁and 501₂are in the same cluster 503₁, the comparison of fuzzy hashes for objects 501₁and 501₂would yield a similarity measure above the predefined similarity threshold. In contrast, the comparison of fuzzy hashes for objects 501₁and 501₃would yield a similarity measure below the predefined similarity threshold since these objects 501₁and 501₃are associated with different clusters (e.g., clusters 503₁and 503₂respectively).

Referring back to FIG. 3 and returning to operation 309, the fuzzy hash of the suspect object is compared with one or more fuzzy hashes of previously stored objects associated with clusters. Using the example set of objects 601 in FIG. 6, the fuzzy hash of the suspect object may be compared against the fuzzy hashes of one or more of the objects 601₁-601₁₅. In one embodiment, operation 309 compares the fuzzy hash of the suspect object with the fuzzy hash of at least one object 601 in each cluster 603. Each comparison yields a separate similarity measure that describes the similarity of the suspect object and each respective comparison object 601.

In one embodiment, operation 309 may utilize statistical and machine learning to determine whether the suspect object is similar to an object in a cluster. Machine learning refers to a process or system that can learn from data, i.e., be trained to distinguish between “good” and “bad”, or in this case, between similar objects and non-similar objects. The core of machine learning deals with representation and generalization, that is, representation of data objects (e.g., the anomalies and other analytical results, which can be collectively represented by features/behaviors of the objects), and functions performed on those objects (e.g., weighting and probability formulas). Generalization is the property that the process or system uses to apply what it learns on a learning set of known (or “labeled”) data objects to unknown (or “unlabeled”) examples. To do this, the process or system must extract learning from the labeled set that allows it to make useful predictions in new and unlabeled cases.

For machine learning, the MCD system 110₁may operate in a training mode and in an operational mode. In a training mode, the MCD system 110₁employs threat heuristics training logic to subject known samples (e.g., labeled samples) of similar objects and known samples of non-similar objects to calibrate threat heuristics logic for probability scoring and/or decision making of objects. To accomplish this, the threat heuristics training logic may submit similar and non-similar objects to analyzers. In some embodiments, the threat heuristics training logic may employ a special forensics system. In alternative embodiments, the threat heuristics training logic may test the similar and non-similar objects each time it processes a different suspect object, or it may store the results of prior tests for use for future processing of objects. The threat heuristics training logic may assign a probability score (e.g., a similarity measure) to each of the possible patterns resulting from testing the similar and non-similar objects. These probability scores and classification labels are indicative of whether a set of objects are similar. In one embodiment, the machine learning routines and operations described above may be performed by the learning module 187 shown in FIG. 1 based on inputs from the storage device 185 and/or the clustering and reporting module 195.

Referring back again to FIGS. 2-3, at operation 311, the set of similarity measures generated at operation 309 may be compared against the predefined similarity threshold to determine whether the suspect object is “similar” to a previously stored object in a preexisting cluster. This comparison may be performed by the comparison logic 237 and reveals whether the suspect object is within the same family as objects within a cluster. As noted above, if a similarity measure is above the predefined similarity threshold, the suspect object is “similar” to the corresponding object. However, if the similarity measure is below the predefined similarity threshold, the suspect object is not “similar” to the corresponding object. In one embodiment, the predefined similarity threshold may be set by an analyst, network administrator, and/or subscriber.

Upon determining that a similarity measure is above the predefined similarity threshold, the method moves to operation 313. At operation 313, the suspect object is associated with the cluster of the object with which the generated similarity measure exceeded the predefined similarity threshold. For example, as shown in FIG. 7A, the fuzzy hash of the suspect object may be added to cluster 603₂. In this example, the similarity measure between the suspect object and one or more of the objects 601₅-601₈is above the predefined similarity threshold.

In one embodiment, association with a cluster may be used to further describe the suspect object. For example, association with a cluster may be used to 1) determine a malware family name for the suspect object and/or 2) determine whether the suspect object is malware, non-malware, or has an unknown status.

As shown in FIG. 7A, the objects 601₁-601₄in the cluster 603₁were determined to be non-malware (indicated by the lack of shading for these objects 601₁-601₄). This classification determination may be based on a previous dynamic or static analysis of the objects 601₁-601₄. In this case, if the suspect object had been associated with the cluster 603₁, the suspect object would be classified as non-malware.

In comparison, the objects 601₅-601₈in the cluster 603₂were determined to be malware (indicated by shading of these objects 601₅-601₈) and associated with the malware family name “MalBot”. Again, this classification determination may be based on a previous dynamic or static analysis of the objects 601₅-601₈using both comparisons with locally stored objects and objects stored remotely. Since the suspect object has been associated with the cluster 603₂in the example provided above, the suspect object is classified malware and associated with the malware family name “MalBot”.

In some instances, a status of a set of objects in a cluster may not yet be known. For example, in the cluster 603₄shown in FIG. 7A the status of these objects 601₁₂-601₁₆cannot yet be determined as malware or non-malware (indicated by dashed border for these objects 601₁₂-601₁₆). Accordingly, if the suspect object had been associated with the cluster 603₄, the suspect object would be classified with an unknown status. Grouping objects with unknown status may later be useful when a classification and/or malware family name may be assigned to these objects.

In some embodiments, association of an object with a cluster may only be informative and not provide classification information. For example, the cluster 603₃may include several objects 601₉-601₁₁that have been classified as malware and associated with the malware family name “DataStealer”. However, association with cluster 603₃may only yield an association with a malware family name associated with the cluster 603₃(e.g., “DataStealer”) instead of also a classification for the newly added object. This failure to yield classification information for new objects may be based on a number of false positive malware classifications associated with the cluster 603₃or another threshold that indicates an unreliable classification.

In one embodiment, operations 309 and 311 may be first performed in relation to clusters of objects stored locally on the MCD system 110₁(i.e., in the storage device 185). Following a failure to locate a locally stored cluster with a “similar” object to the suspect object, the operations 309 and 311 may be performed for clusters of objects stored on other devices. For example, the operations 309 and 311 may be performed on clusters of objects stored on a cloud server located in the cloud computing services 135 in response to a failure to locate a local cluster with a “similar” object.

Returning to operation 311 of FIG. 3, upon determining that similarity measures generated for the suspect object at operation 309 are not above the predefined similarity threshold for any local or remote clusters of objects, the method 300 moves to operation 315 to create a new cluster for the suspect object. For example, FIG. 7B, shows the suspect object added to new cluster 603₅. In this example, the suspect object is not similar to any of the objects 601₁-601₁₆based on compared fuzzy hashes and accordingly is not part of these families of objects. Instead, the suspect object is the first member of a new family defined by the cluster 603₅.

Following generation of a new cluster for the suspect object at operation 315, operation 317 may transmit the new cluster to the MCD systems 110₂and 110₃and/or the management system 120. In one embodiment, the management system 120 may receive the new cluster from the MCD system 110₁and propagate this new cluster to the MCD systems 110₂and 110₃using the network 125. The MCD systems 110₂and 110₃may utilize this new cluster for future analysis of other objects intercepted or otherwise received from the client device 150 or other devices on the network 130.

As described above, objects intercepted or otherwise received from the client device 150 may be compared using fuzzy hashes to determine similarity. Upon determination of similarity, the received/suspect object may be associated with a corresponding cluster and inherit attributes of the cluster. These attributes may include 1) classification as malware, non-malware, or an unknown status and/or 2) a malware family name. By utilizing fuzzy hash comparisons with previously stored and classified objects, the method 300 provides an efficient technique for classifying newly received objects based on familial similarities.

In one embodiment, the results of the method 300 may be transmitted from the clustering and reporting module 195 to the dynamic analysis engine 190. In this embodiment, the results of the method 300 may be used to supplement the analysis results produced by the dynamic analysis engine 190 to increase the accuracy in identifying suspicious objects as malware.

As noted above, the method 300 may conduct a separate analysis following operation 305 to generate a preliminary malware score, which describes the probability that the suspect object is malware. For example, the preliminary malware score may fall between 0.0 and 1.0. In one embodiment, operation 319 compares the scrubbed behavior data of the suspect object with known malware behaviors using the malware score logic 239 shown in FIG. 2. These known malware behaviors may be cultivated after dynamic analysis of known malware objects by the MCD 110₁, another device on the network 125 (e.g., the MCDs 110₂and 110₃or the management system 120), and/or a remote device (e.g., device located within the cloud computing services). In one embodiment, the known malware behaviors are stored in the storage device 185 and describe unexpected, anomalous, and/or malicious actions that are characteristic of malware. Examples of anomalous behaviors may include unusual network transmissions, opening certain ports to retrieve data, unusual changes in performance, and the like.

The comparison at operation 319 yields a preliminary malware score based on the number of similarities between the scrubbed behavior data and the known malware behavior. For example, when multiple behaviors described in the scrubbed behavior data match behaviors in the known malware behaviors, operation 319 may yield a high preliminary malware score (e.g., 0.9), which indicates a high probability the suspect object is malware. In contrast, when few behaviors described in the scrubbed behavior data match behaviors in the known malware behaviors, operation 319 may yield a low preliminary malware score (e.g., 0.1), which indicates a low probability the suspect object is malware. In one embodiment, this comparison at operation 319 may be performed using machine learning and statistical analysis similar to that described above in relation to operation 309.

In one embodiment, the preliminary malware score may be used at operation 321 to generate a final malware score based on the suspect object's association with a cluster at operations 313 or 315. For example, when the suspect object is associated with a cluster that classifies the suspect object as malware, the preliminary malware score from operation 319 may be increased to generate a final malware score that is greater that the preliminary malware score from operation 319. This increase indicates a higher probability that the suspect object is malware than originally computed at operation 319. Conversely, when the suspect object is associated with a cluster that classifies the suspect object as non-malware or with an unknown status, the preliminary malware score from operation 319 may be decreased to generate the final malware score. This decrease indicates a lower probability that the suspect object is malware than originally computed at operation 319. By generating a final malware score that reflects the probability that a suspect object is malware based on both a comparison with known malware behaviors and clusters of classified objects, operation 321 creates a more robust determination of the likelihood that the suspect object is malware.

At operation 323, the final malware score generated at operation 321 may be transmitted along with the classification and naming information assigned to the suspect object at operations 313 or 315 to a user of the client device, a subscriber of a malware detection service, a network administrator, or another entity. The transmission may be made using an email message, a popup message, or any other message transmission technique. For example, the user interface 400 may be updated to reflect the classification of the suspect object as shown in FIG. 4B.

As described above, the method for classifying objects 300 may utilize fuzzy hash techniques to group “similar” objects in clusters for future analysis. This similarity matching allows greater flexibility in analyzing potential malware objects, which may share multiple characteristics and behaviors but are also slightly different from previously classified objects. These clusters of objects may be continually updated and shared between the MCD systems 110₁-110_Nas new objects are processed by the method 300 such that a robust set of object clusters are maintained for future detection and remediation of families of malware threats.

Claims

1. A computerized method for classifying objects in a malware system, comprising: receiving, by a malicious content detection (MCD) system from a client device, an object to be classified;detecting behaviors of the received object, wherein the behaviors are detected after processing the received object;generating a fuzzy hash for the received object based on the detected behaviors, the generating of the fuzzy hash comprises (i) obtaining a reduced amount of data associated with the detected behaviors by retaining a portion of the data associated with the detected behaviors that corresponds to one or more operations conducted during processing of the received object, and removing metadata associated with the one or more operations conducted during the processing of the received object, the metadata including at least one or more identifiers of processes called during the processing of the received object, and (ii) performing a hash operation on the reduced amount of data associated with the detected behaviors;comparing the fuzzy hash for the received object with a fuzzy hash of an object in a preexisting cluster to generate a similarity measure;associating the received object with the preexisting cluster in response to determining that the similarity measure is above a predefined threshold value;creating a new cluster for the received object in response to determining that the similarity measure is below the predefined threshold value; andreporting, by the MCD system, results of either (i) the associating of the received object with the preexisting cluster or (ii) the creating of the new cluster.
2. The computerized method of claim 1, wherein the received object is at least one of a file, a uniform resource locator, a web object, a capture of network traffic for a user over time, and an email message.
3. The computerized method of claim 1, wherein the removed metadata associated with the corresponding operations includes metadata associated with one or more of (1) network calls, (2) modifications to a registry, (3) modifications to a file system, or (4) an application program interface call.
4. The computerized method of claim 1, further comprising: generating a preliminary malware score for the received object based on a comparison of the reduced amount of data associated with the detected behaviors with data associated with known malware behaviors, wherein the preliminary malware score indicates the probability the received object is malware; andgenerating a final malware score for the received object based on the cluster the received object is associated,wherein the final malware score is greater than the preliminary malware score when the received object is associated with a cluster of objects classified as malware and the final malware score is less than the preliminary malware score when the received object is associated with a cluster of objects classified as non-malware.
5. The computerized method of claim 1, wherein the removing of the metadata associated with the one or more operations comprises removing data that does not identify the received object.
6. The computerized method of claim 5, wherein the removing of the metadata further comprises removing at least a portion of values written to a registry by the received object.
7. The computerized method of claim 1, further comprising: transmitting, by the MCD system, the new cluster or the preexisting cluster with the newly associated received object to another MCD system.
8. The computerized method of claim 1, further comprising: classifying the received object as malware, non-malware, or with an unknown status to match a classification of the preexisting cluster, when the received object is assigned to the preexisting cluster.
9. The computerized method of claim 1, further comprising: assigning a malware family name to the received object to match a malware family name of the preexisting cluster, when the received object is assigned to the preexisting cluster.
10. The computerized method of claim 1, wherein the generating of the fuzzy hash further comprises at least one of (a) retaining one or more image paths in an associated file system corresponding to a location of a file that is generated or modified during the processing of the received object or (b) removing a file name prior to performing the hash operation on the data associated with the detected behaviors.
11. The computerized method of claim 1, wherein the generating of the fuzzy hash comprises retaining only the one or more image paths corresponding to operations conducted during processing of the received object as part of the data associated with the detected behaviors.
12. A non-transitory storage medium including instructions that, when executed by one or more hardware processors, performs a plurality of operations, comprising: detecting behaviors of a received object, wherein the behaviors are detected after processing the received object;generating a fuzzy hash for the received object based on the detected behaviors, the generating of the fuzzy hash comprises (i) obtaining a reduced amount of data associated with the detected behaviors by retaining a portion of the data associated with the detected behaviors that corresponds to one or more operations conducted during processing of the received object, and removing metadata associated with the one or more operations conducted during the processing of the received object, the metadata including at least one or more identifiers of processes called during the processing of the received object metadata, and (ii) performing a hash operation on the reduced amount of data associated with the detected behaviors;comparing the fuzzy hash for the received object with a fuzzy hash of an object in a preexisting cluster to generate a similarity measure;associating the received object with the preexisting cluster in response to determining that the similarity measure is above a predefined threshold value;creating a new cluster for the received object in response to determining that the similarity measure is below the predefined threshold value; andreporting results of either (i) the associating of the received object with the preexisting cluster or (ii) the creating of the new cluster.
13. The non-transitory storage medium of claim 12, wherein the received object is one of a file, a uniform resource locator, a web object, a capture of network traffic for a user over time, and an email message.
14. The non-transitory storage medium of claim 12, wherein the removed metadata associated with the one or more operations includes metadata associated with one or more of (1) network calls, (2) modifications to a registry, (3) modifications to a file system, or (4) an application program interface call.
15. The non-transitory storage medium of claim 12 further includes instructions that, when executed by the one or more hardware processors, perform a plurality of operations comprising: generating a preliminary malware score for the received object based on a comparison of the reduced amount of data associated with the detected behaviors with data associated with known malware behaviors, wherein the preliminary malware score indicates the probability the received object is malware; andgenerating a final malware score for the received object based on the cluster the received object is associated,wherein the final malware score is greater than the preliminary malware score when the received object is associated with a cluster of objects classified as malware and the final malware score is less than the preliminary malware score when the received object is associated with a cluster of objects classified as non-malware.
16. The non-transitory storage medium of claim 12, wherein the removing of the metadata associated with the one or more operations comprises removing metadata that does not identify the received object.
17. The non-transitory storage medium of claim 12, wherein the removing of the metadata associated with the one or more operations further comprises removing at least a portion of values written to a registry by the received object.
18. The non-transitory storage medium of claim 12 further includes instructions that, when executed by the one or more hardware processors, perform operations comprising: classifying the received object as malware, non-malware, or with an unknown status to match a classification of the preexisting cluster, when the received object is assigned to the preexisting cluster.
19. The non-transitory storage medium of claim 12 further includes instructions that, when executed by the one or more hardware processors, perform operations comprising: assigning a malware family name to the received object to match a malware family name of the preexisting cluster, when the received object is assigned to the preexisting cluster.
20. The non-transitory storage medium of claim 12 including instructions that, when executed by one or more hardware processors, perform an operation of generating of the fuzzy hash that includes one or more operations comprising at least retaining one or more image paths in an associated file system corresponding to a location of a file that is generated or modified during the processing of the received object, or removing the file name prior to performing the hash operation on the data associated with the detected behaviors.
21. The non-transitory storage medium of claim 12 including instructions that, when executed by one or more hardware processors, perform an operation of generating of the fuzzy hash that includes one or more operations comprising retaining one or more image paths corresponding to operations conducted during processing of the received object as part of the data associated with the detected behaviors.
22. A system comprising: one or more hardware processors;a memory including one or more software modules that, when executed by the one or more hardware processors: detect behaviors of a received object, wherein the behaviors are detected after processing the received object;generate a fuzzy hash for the received object based on a portion of the detected behaviors, the generating of the fuzzy hash comprises (i) obtaining a reduced amount of data associated with the detected behaviors by retaining a portion of the data associated with the detected behaviors that corresponds to one or more operations conducted during processing of the received object, and removing metadata associated with the one or more operations conducted during the processing of the received object, the metadata including at least one or more identifiers of processes called during the processing of the received object, and (ii) performing a hash operation on the reduced amount of data associated with the detected behaviors;compare the fuzzy hash for the received object with a fuzzy hash of an object in a preexisting cluster to generate a similarity measure;associate the received object with the preexisting cluster in response to determining that the similarity measure is above a predefined threshold value;create a new cluster for the received object in response to determining that the similarity measure is below the predefined threshold value; andreport results of either (i) an association of the received object with the preexisting cluster or (ii) a creation of the new cluster.
23. The system of claim 22, wherein the one or more hardware processors, when executing the software modules, further: classify the received object as malware, non-malware, or with an unknown status to match a classification of the preexisting cluster, when the received object is assigned to the preexisting cluster.
24. The system of claim 22, wherein the one or more hardware processors, when executing the software modules, further: assign a malware family name to the received object to match a malware family name of the preexisting cluster, when the received object is assigned to the preexisting cluster.
25. The system of claim 22, wherein the memory including the one or more software modules that, when executed by the one or more hardware processors, generate the fuzzy hash for the received object based on the portion of the detected behaviors by retaining one or more image paths in an associated file system corresponding to a location of a file that is generated or modified during the processing of the received object or removing a file name prior to conducting the hash operation on the data associated with the detected behaviors.
26. The system of claim 22, wherein the memory including the one or more software modules that, when executed by the one or more hardware processors, generate the fuzzy hash for the received object based on the portion of the detected behaviors that comprises only the one or more image paths corresponding to operations conducted during processing of the received object.

US Referenced Citations (514)

Number	Name	Date	Kind
4292580	Ott et al.	Sep 1981	A
5175732	Hendel et al.	Dec 1992	A
5440723	Arnold et al.	Aug 1995	A
5490249	Miller	Feb 1996	A
5657473	Killean et al.	Aug 1997	A
5842002	Schnurer et al.	Nov 1998	A
5978917	Chi	Nov 1999	A
6088803	Tso et al.	Jul 2000	A
6094677	Capek et al.	Jul 2000	A
6108799	Boulay et al.	Aug 2000	A
6118382	Hibbs et al.	Sep 2000	A
6269330	Cidon et al.	Jul 2001	B1
6272641	Ji	Aug 2001	B1
6279113	Vaidya	Aug 2001	B1
6298445	Shostack	Oct 2001	B1
6357008	Nachenberg	Mar 2002	B1
6417774	Hibbs et al.	Jul 2002	B1
6424627	Srhaug et al.	Jul 2002	B1
6442696	Wray et al.	Aug 2002	B1
6484315	Ziese	Nov 2002	B1
6487666	Shanklin et al.	Nov 2002	B1
6493756	O'Brien et al.	Dec 2002	B1
6550012	Villa et al.	Apr 2003	B1
6700497	Hibbs et al.	Mar 2004	B2
6775657	Baker	Aug 2004	B1
6831893	Ben Nun et al.	Dec 2004	B1
6832367	Choi et al.	Dec 2004	B1
6895550	Kanchirayappa et al.	May 2005	B2
6898632	Gordy et al.	May 2005	B2
6907396	Muttik et al.	Jun 2005	B1
6941348	Petry et al.	Sep 2005	B2
6971097	Wallman	Nov 2005	B1
6981279	Arnold et al.	Dec 2005	B1
6995665	Appelt et al.	Feb 2006	B2
7007107	Ivchenko et al.	Feb 2006	B1
7028179	Anderson et al.	Apr 2006	B2
7043757	Hoefelmeyer et al.	May 2006	B2
7069316	Gryaznov	Jun 2006	B1
7080407	Zhao et al.	Jul 2006	B1
7080408	Pak et al.	Jul 2006	B1
7093002	Wolff et al.	Aug 2006	B2
7093239	van der Made	Aug 2006	B1
7096498	Judge	Aug 2006	B2
7100201	Izatt	Aug 2006	B2
7107617	Hursey et al.	Sep 2006	B2
7159149	Spiegel et al.	Jan 2007	B2
7213260	Judge	May 2007	B2
7231667	Jordan	Jun 2007	B2
7240364	Branscomb et al.	Jul 2007	B1
7240368	Roesch et al.	Jul 2007	B1
7243371	Kasper et al.	Jul 2007	B1
7249175	Donaldson	Jul 2007	B1
7287278	Liang	Oct 2007	B2
7308716	Danford et al.	Dec 2007	B2
7328453	Merkle, Jr. et al.	Feb 2008	B2
7346486	Ivancic et al.	Mar 2008	B2
7356736	Natvig	Apr 2008	B2
7386888	Liang et al.	Jun 2008	B2
7392542	Bucher	Jun 2008	B2
7418729	Szor	Aug 2008	B2
7428300	Drew et al.	Sep 2008	B1
7441272	Durham et al.	Oct 2008	B2
7448084	Apap et al.	Nov 2008	B1
7458098	Judge et al.	Nov 2008	B2
7464404	Carpenter et al.	Dec 2008	B2
7464407	Nakae et al.	Dec 2008	B2
7467408	O'Toole, Jr.	Dec 2008	B1
7478428	Thomlinson	Jan 2009	B1
7480773	Reed	Jan 2009	B1
7487543	Arnold et al.	Feb 2009	B2
7496960	Chen et al.	Feb 2009	B1
7496961	Zimmer et al.	Feb 2009	B2
7519990	Xie	Apr 2009	B1
7523493	Liang et al.	Apr 2009	B2
7530104	Thrower et al.	May 2009	B1
7540025	Tzadikario	May 2009	B2
7565550	Liang et al.	Jul 2009	B2
7568233	Szor et al.	Jul 2009	B1
7584455	Ball	Sep 2009	B2
7603715	Costa et al.	Oct 2009	B2
7607171	Marsden et al.	Oct 2009	B1
7639714	Stolfo et al.	Dec 2009	B2
7644441	Schmid et al.	Jan 2010	B2
7657419	van der Made	Feb 2010	B2
7676841	Sobchuk et al.	Mar 2010	B2
7694150	Kirby	Apr 2010	B1
7698548	Shelest et al.	Apr 2010	B2
7707633	Danford et al.	Apr 2010	B2
7712136	Sprosts et al.	May 2010	B2
7730011	Deninger et al.	Jun 2010	B1
7739740	Nachenberg et al.	Jun 2010	B1
7779463	Stolfo et al.	Aug 2010	B2
7784097	Stolfo et al.	Aug 2010	B1
7832008	Kraemer	Nov 2010	B1
7836502	Zhao et al.	Nov 2010	B1
7849506	Dansey et al.	Dec 2010	B1
7854007	Sprosts et al.	Dec 2010	B2
7869073	Oshima	Jan 2011	B2
7877803	Enstone et al.	Jan 2011	B2
7904959	Sidiroglou et al.	Mar 2011	B2
7908660	Bahl	Mar 2011	B2
7930738	Petersen	Apr 2011	B1
7937761	Benett	May 2011	B1
7949849	Lowe et al.	May 2011	B2
7996556	Raghavan et al.	Aug 2011	B2
7996836	McCorkendale et al.	Aug 2011	B1
7996904	Chiueh et al.	Aug 2011	B1
7996905	Arnold et al.	Aug 2011	B2
8006305	Aziz	Aug 2011	B2
8010667	Zhang et al.	Aug 2011	B2
8020206	Hubbard et al.	Sep 2011	B2
8028338	Schneider et al.	Sep 2011	B1
8042184	Batenin	Oct 2011	B1
8045094	Teragawa	Oct 2011	B2
8045458	Alperovitch et al.	Oct 2011	B2
8069484	McMillan et al.	Nov 2011	B2
8087086	Lai et al.	Dec 2011	B1
8171553	Aziz et al.	May 2012	B2
8176049	Deninger et al.	May 2012	B2
8176480	Spertus	May 2012	B1
8201246	Wu et al.	Jun 2012	B1
8204984	Aziz et al.	Jun 2012	B1
8214905	Doukhvalov et al.	Jul 2012	B1
8220055	Kennedy	Jul 2012	B1
8225288	Miller et al.	Jul 2012	B2
8225373	Kraemer	Jul 2012	B2
8233882	Rogel	Jul 2012	B2
8234640	Fitzgerald et al.	Jul 2012	B1
8234709	Viljoen et al.	Jul 2012	B2
8239944	Nachenberg et al.	Aug 2012	B1
8260914	Ranjan	Sep 2012	B1
8266091	Gubin et al.	Sep 2012	B1
8286251	Eker et al.	Oct 2012	B2
8291499	Aziz et al.	Oct 2012	B2
8307435	Mann et al.	Nov 2012	B1
8307443	Wang et al.	Nov 2012	B2
8312545	Tuvell et al.	Nov 2012	B2
8321936	Green et al.	Nov 2012	B1
8321941	Tuvell et al.	Nov 2012	B2
8332571	Edwards, Sr.	Dec 2012	B1
8365286	Poston	Jan 2013	B2
8365297	Parshin et al.	Jan 2013	B1
8370938	Daswani et al.	Feb 2013	B1
8370939	Zaitsev et al.	Feb 2013	B2
8375444	Aziz et al.	Feb 2013	B2
8381299	Stolfo et al.	Feb 2013	B2
8402529	Green et al.	Mar 2013	B1
8464340	Ahn et al.	Jun 2013	B2
8479174	Chiriac	Jul 2013	B2
8479276	Vaystikh et al.	Jul 2013	B1
8479291	Bodke	Jul 2013	B1
8510827	Leake et al.	Aug 2013	B1
8510828	Guo et al.	Aug 2013	B1
8510842	Amit et al.	Aug 2013	B2
8516478	Edwards et al.	Aug 2013	B1
8516590	Ranadive et al.	Aug 2013	B1
8516593	Aziz	Aug 2013	B2
8522348	Chen et al.	Aug 2013	B2
8528086	Aziz	Sep 2013	B1
8533824	Hutton et al.	Sep 2013	B2
8539582	Aziz et al.	Sep 2013	B1
8549638	Aziz	Oct 2013	B2
8555391	Demir et al.	Oct 2013	B1
8561177	Aziz et al.	Oct 2013	B1
8566946	Aziz et al.	Oct 2013	B1
8584094	Dahdia et al.	Nov 2013	B2
8584234	Sobel et al.	Nov 2013	B1
8584239	Aziz et al.	Nov 2013	B2
8595834	Xie et al.	Nov 2013	B2
8627476	Satish et al.	Jan 2014	B1
8635696	Aziz	Jan 2014	B1
8682054	Xue et al.	Mar 2014	B2
8682812	Ranjan	Mar 2014	B1
8689333	Aziz	Apr 2014	B2
8695096	Zhang	Apr 2014	B1
8713631	Pavlyushchik	Apr 2014	B1
8713681	Silberman et al.	Apr 2014	B2
8726392	McCorkendale et al.	May 2014	B1
8739280	Chess et al.	May 2014	B2
8776229	Aziz	Jul 2014	B1
8782792	Bodke	Jul 2014	B1
8789172	Stolfo et al.	Jul 2014	B2
8789178	Kejriwal et al.	Jul 2014	B2
8793787	Ismael et al.	Jul 2014	B2
8805947	Kuzkin et al.	Aug 2014	B1
8806647	Daswani et al.	Aug 2014	B1
8832829	Manni et al.	Sep 2014	B2
8850570	Ramzan	Sep 2014	B1
8850571	Staniford et al.	Sep 2014	B2
8881234	Narasimhan et al.	Nov 2014	B2
8881282	Aziz et al.	Nov 2014	B1
8898788	Aziz et al.	Nov 2014	B1
8935779	Manni et al.	Jan 2015	B2
8984638	Aziz et al.	Mar 2015	B1
8990939	Staniford et al.	Mar 2015	B2
8990944	Singh et al.	Mar 2015	B1
8997219	Staniford et al.	Mar 2015	B2
9009822	Ismael et al.	Apr 2015	B1
9009823	Ismael et al.	Apr 2015	B1
9027135	Aziz	May 2015	B1
9071638	Aziz et al.	Jun 2015	B1
9104867	Thioux et al.	Aug 2015	B1
9106694	Aziz et al.	Aug 2015	B2
9118715	Staniford et al.	Aug 2015	B2
20010005889	Albrecht	Jun 2001	A1
20010047326	Broadbent et al.	Nov 2001	A1
20020018903	Kokubo et al.	Feb 2002	A1
20020038430	Edwards et al.	Mar 2002	A1
20020091819	Melchione et al.	Jul 2002	A1
20020095607	Lin-Hendel	Jul 2002	A1
20020116627	Tarbotton et al.	Aug 2002	A1
20020144156	Copeland	Oct 2002	A1
20020162015	Tang	Oct 2002	A1
20020166063	Lachman, III et al.	Nov 2002	A1
20020169952	DiSanto et al.	Nov 2002	A1
20020184528	Shevenell et al.	Dec 2002	A1
20020188887	Largman et al.	Dec 2002	A1
20020194490	Halperin et al.	Dec 2002	A1
20030074578	Ford et al.	Apr 2003	A1
20030084318	Schertz	May 2003	A1
20030101381	Mateev et al.	May 2003	A1
20030115483	Liang	Jun 2003	A1
20030188190	Aaron et al.	Oct 2003	A1
20030191957	Hypponen et al.	Oct 2003	A1
20030200460	Morota et al.	Oct 2003	A1
20030212902	Van Der Made	Nov 2003	A1
20030229801	Kouznetsov et al.	Dec 2003	A1
20030237000	Denton et al.	Dec 2003	A1
20040003323	Bennett et al.	Jan 2004	A1
20040015712	Szor	Jan 2004	A1
20040019832	Arnold et al.	Jan 2004	A1
20040047356	Bauer	Mar 2004	A1
20040083408	Spiegel et al.	Apr 2004	A1
20040088581	Brawn et al.	May 2004	A1
20040093513	Cantrell et al.	May 2004	A1
20040111531	Staniford et al.	Jun 2004	A1
20040117478	Triulzi et al.	Jun 2004	A1
20040117624	Brandt et al.	Jun 2004	A1
20040128355	Chao et al.	Jul 2004	A1
20040165588	Pandya	Aug 2004	A1
20040236963	Danford et al.	Nov 2004	A1
20040243349	Greifeneder et al.	Dec 2004	A1
20040249911	Alkhatib et al.	Dec 2004	A1
20040255161	Cavanaugh	Dec 2004	A1
20040268147	Wiederin et al.	Dec 2004	A1
20050005159	Oliphant	Jan 2005	A1
20050021740	Bar et al.	Jan 2005	A1
20050033960	Vialen et al.	Feb 2005	A1
20050033989	Poletto et al.	Feb 2005	A1
20050050148	Mohammadioun et al.	Mar 2005	A1
20050086523	Zimmer et al.	Apr 2005	A1
20050091513	Mitomo et al.	Apr 2005	A1
20050091533	Omote et al.	Apr 2005	A1
20050091652	Ross et al.	Apr 2005	A1
20050108562	Khazan et al.	May 2005	A1
20050114663	Cornell et al.	May 2005	A1
20050125195	Brendel	Jun 2005	A1
20050149726	Joshi et al.	Jul 2005	A1
20050157662	Bingham et al.	Jul 2005	A1
20050183143	Anderholm et al.	Aug 2005	A1
20050201297	Peikari	Sep 2005	A1
20050210533	Copeland et al.	Sep 2005	A1
20050229254	Singh	Oct 2005	A1
20050238005	Chen et al.	Oct 2005	A1
20050240781	Gassoway	Oct 2005	A1
20050262562	Gassoway	Nov 2005	A1
20050265331	Stolfo	Dec 2005	A1
20050283839	Cowburn	Dec 2005	A1
20060010495	Cohen et al.	Jan 2006	A1
20060015416	Hoffman et al.	Jan 2006	A1
20060015715	Anderson	Jan 2006	A1
20060015747	Van de Ven	Jan 2006	A1
20060021029	Brickell et al.	Jan 2006	A1
20060021054	Costa et al.	Jan 2006	A1
20060031476	Mathes et al.	Feb 2006	A1
20060047665	Neil	Mar 2006	A1
20060070130	Costea et al.	Mar 2006	A1
20060075496	Carpenter et al.	Apr 2006	A1
20060095968	Portolani et al.	May 2006	A1
20060101516	Sudaharan et al.	May 2006	A1
20060101517	Banzhof et al.	May 2006	A1
20060117385	Mester et al.	Jun 2006	A1
20060123477	Raghavan et al.	Jun 2006	A1
20060143709	Brooks et al.	Jun 2006	A1
20060150249	Gassen et al.	Jul 2006	A1
20060161983	Cothrell et al.	Jul 2006	A1
20060161987	Levy-Yurista	Jul 2006	A1
20060161989	Reshef et al.	Jul 2006	A1
20060164199	Gilde et al.	Jul 2006	A1
20060173992	Weber et al.	Aug 2006	A1
20060179147	Tran et al.	Aug 2006	A1
20060184632	Marino et al.	Aug 2006	A1
20060191010	Benjamin	Aug 2006	A1
20060221956	Narayan et al.	Oct 2006	A1
20060236393	Kramer et al.	Oct 2006	A1
20060242709	Seinfeld et al.	Oct 2006	A1
20060248519	Jaeger et al.	Nov 2006	A1
20060248582	Panjwani et al.	Nov 2006	A1
20060251104	Koga	Nov 2006	A1
20060288417	Bookbinder et al.	Dec 2006	A1
20070006288	Mayfield et al.	Jan 2007	A1
20070006313	Porras et al.	Jan 2007	A1
20070011174	Takaragi et al.	Jan 2007	A1
20070016951	Piccard et al.	Jan 2007	A1
20070033645	Jones	Feb 2007	A1
20070038943	FitzGerald et al.	Feb 2007	A1
20070064689	Shin et al.	Mar 2007	A1
20070074169	Chess et al.	Mar 2007	A1
20070094730	Bhikkaji et al.	Apr 2007	A1
20070101435	Konanka et al.	May 2007	A1
20070128855	Cho et al.	Jun 2007	A1
20070142030	Sinha et al.	Jun 2007	A1
20070143827	Nicodemus et al.	Jun 2007	A1
20070156895	Vuong	Jul 2007	A1
20070157180	Tillmann et al.	Jul 2007	A1
20070157306	Elrod et al.	Jul 2007	A1
20070168988	Eisner et al.	Jul 2007	A1
20070171824	Ruello et al.	Jul 2007	A1
20070174915	Gribble et al.	Jul 2007	A1
20070192500	Lum	Aug 2007	A1
20070192858	Lum	Aug 2007	A1
20070198275	Malden et al.	Aug 2007	A1
20070208822	Wang et al.	Sep 2007	A1
20070220607	Sprosts et al.	Sep 2007	A1
20070240218	Tuvell et al.	Oct 2007	A1
20070240219	Tuvell et al.	Oct 2007	A1
20070240220	Tuvell et al.	Oct 2007	A1
20070240222	Tuvell et al.	Oct 2007	A1
20070250930	Aziz et al.	Oct 2007	A1
20070256132	Oliphant	Nov 2007	A2
20070271446	Nakamura	Nov 2007	A1
20080005782	Aziz	Jan 2008	A1
20080028463	Dagon et al.	Jan 2008	A1
20080032556	Schreier	Feb 2008	A1
20080040710	Chiriac	Feb 2008	A1
20080046781	Childs et al.	Feb 2008	A1
20080066179	Liu	Mar 2008	A1
20080072326	Danford et al.	Mar 2008	A1
20080077793	Tan et al.	Mar 2008	A1
20080080518	Hoeflin et al.	Apr 2008	A1
20080086720	Lekel	Apr 2008	A1
20080098476	Syversen	Apr 2008	A1
20080120722	Sima et al.	May 2008	A1
20080134178	Fitzgerald et al.	Jun 2008	A1
20080134334	Kim et al.	Jun 2008	A1
20080141376	Clausen et al.	Jun 2008	A1
20080181227	Todd	Jul 2008	A1
20080184373	Traut et al.	Jul 2008	A1
20080189787	Arnold et al.	Aug 2008	A1
20080201778	Guo et al.	Aug 2008	A1
20080209557	Herley et al.	Aug 2008	A1
20080215742	Goldszmidt et al.	Sep 2008	A1
20080222729	Chen et al.	Sep 2008	A1
20080263665	Ma et al.	Oct 2008	A1
20080295172	Bohacek	Nov 2008	A1
20080301810	Lehane et al.	Dec 2008	A1
20080307524	Singh et al.	Dec 2008	A1
20080313738	Enderby	Dec 2008	A1
20080320594	Jiang	Dec 2008	A1
20090003317	Kasralikar et al.	Jan 2009	A1
20090007100	Field et al.	Jan 2009	A1
20090013408	Schipka	Jan 2009	A1
20090031423	Liu et al.	Jan 2009	A1
20090036111	Danford et al.	Feb 2009	A1
20090037835	Goldman	Feb 2009	A1
20090044024	Oberheide et al.	Feb 2009	A1
20090044274	Budko et al.	Feb 2009	A1
20090064332	Porras et al.	Mar 2009	A1
20090077666	Chen et al.	Mar 2009	A1
20090083369	Marmor	Mar 2009	A1
20090083855	Apap et al.	Mar 2009	A1
20090089879	Wang et al.	Apr 2009	A1
20090094697	Provos et al.	Apr 2009	A1
20090113425	Ports et al.	Apr 2009	A1
20090125976	Wassermann et al.	May 2009	A1
20090126015	Monastyrsky et al.	May 2009	A1
20090126016	Sobko et al.	May 2009	A1
20090133125	Choi et al.	May 2009	A1
20090144823	Lamastra et al.	Jun 2009	A1
20090158430	Borders	Jun 2009	A1
20090172815	Gu et al.	Jul 2009	A1
20090187992	Poston	Jul 2009	A1
20090193293	Stolfo et al.	Jul 2009	A1
20090199296	Xie et al.	Aug 2009	A1
20090228233	Anderson et al.	Sep 2009	A1
20090241187	Troyansky	Sep 2009	A1
20090241190	Todd et al.	Sep 2009	A1
20090265692	Godefroid et al.	Oct 2009	A1
20090271867	Zhang	Oct 2009	A1
20090300415	Zhang et al.	Dec 2009	A1
20090300761	Park et al.	Dec 2009	A1
20090328185	Berg et al.	Dec 2009	A1
20090328221	Blumfield et al.	Dec 2009	A1
20100005146	Drako et al.	Jan 2010	A1
20100011205	McKenna	Jan 2010	A1
20100017546	Poo et al.	Jan 2010	A1
20100031353	Thomas et al.	Feb 2010	A1
20100037314	Perdisci et al.	Feb 2010	A1
20100043073	Kuwamura	Feb 2010	A1
20100054278	Stolfo et al.	Mar 2010	A1
20100058474	Hicks	Mar 2010	A1
20100064044	Nonoyama	Mar 2010	A1
20100077481	Polyakov et al.	Mar 2010	A1
20100083376	Pereira et al.	Apr 2010	A1
20100115621	Staniford et al.	May 2010	A1
20100132038	Zaitsev	May 2010	A1
20100154056	Smith et al.	Jun 2010	A1
20100180344	Malyshev et al.	Jul 2010	A1
20100192223	Ismael et al.	Jul 2010	A1
20100220863	Dupaquis et al.	Sep 2010	A1
20100235831	Dittmer	Sep 2010	A1
20100251104	Massand	Sep 2010	A1
20100281102	Chinta et al.	Nov 2010	A1
20100281541	Stolfo et al.	Nov 2010	A1
20100281542	Stolfo et al.	Nov 2010	A1
20100287260	Peterson et al.	Nov 2010	A1
20100299754	Amit et al.	Nov 2010	A1
20100306173	Frank	Dec 2010	A1
20110004737	Greenebaum	Jan 2011	A1
20110025504	Lyon et al.	Feb 2011	A1
20110041179	Stahlberg	Feb 2011	A1
20110047594	Mahaffey et al.	Feb 2011	A1
20110047597	Mahaffey et al.	Feb 2011	A1
20110047620	Mahaffey et al.	Feb 2011	A1
20110055907	Narasimhan et al.	Mar 2011	A1
20110078794	Manni et al.	Mar 2011	A1
20110093426	Hoglund	Apr 2011	A1
20110093951	Aziz	Apr 2011	A1
20110099620	Stavrou et al.	Apr 2011	A1
20110099633	Aziz	Apr 2011	A1
20110113231	Kaminsky	May 2011	A1
20110145918	Jung et al.	Jun 2011	A1
20110145920	Mahaffey et al.	Jun 2011	A1
20110145934	Abramovici et al.	Jun 2011	A1
20110167493	Song et al.	Jul 2011	A1
20110167494	Bowen et al.	Jul 2011	A1
20110173460	Ito et al.	Jul 2011	A1
20110219449	St. Neitzel et al.	Sep 2011	A1
20110219450	McDougal et al.	Sep 2011	A1
20110225624	Sawhney et al.	Sep 2011	A1
20110225655	Niemela et al.	Sep 2011	A1
20110247072	Staniford et al.	Oct 2011	A1
20110265182	Peinado et al.	Oct 2011	A1
20110289582	Kejriwal et al.	Nov 2011	A1
20110302587	Nishikawa et al.	Dec 2011	A1
20110307954	Melnik et al.	Dec 2011	A1
20110307955	Kaplan et al.	Dec 2011	A1
20110307956	Yermakov et al.	Dec 2011	A1
20110314546	Aziz et al.	Dec 2011	A1
20120023593	Puder et al.	Jan 2012	A1
20120054869	Yen et al.	Mar 2012	A1
20120066698	Yanoo	Mar 2012	A1
20120079596	Thomas et al.	Mar 2012	A1
20120084859	Radinsky et al.	Apr 2012	A1
20120110667	Zubrilin et al.	May 2012	A1
20120117652	Manni et al.	May 2012	A1
20120121154	Xue et al.	May 2012	A1
20120124426	Maybee et al.	May 2012	A1
20120174186	Aziz et al.	Jul 2012	A1
20120174196	Bhogavilli et al.	Jul 2012	A1
20120174218	McCoy et al.	Jul 2012	A1
20120174227	Mashevsky	Jul 2012	A1
20120198279	Schroeder	Aug 2012	A1
20120210423	Friedrichs et al.	Aug 2012	A1
20120222121	Staniford et al.	Aug 2012	A1
20120255015	Sahita et al.	Oct 2012	A1
20120255017	Sallam	Oct 2012	A1
20120260304	Morris	Oct 2012	A1
20120260342	Dube et al.	Oct 2012	A1
20120266244	Green et al.	Oct 2012	A1
20120278886	Luna	Nov 2012	A1
20120297489	Dequevy	Nov 2012	A1
20120330801	McDougal et al.	Dec 2012	A1
20130014259	Gribble et al.	Jan 2013	A1
20130036472	Aziz	Feb 2013	A1
20130047257	Aziz	Feb 2013	A1
20130074185	McDougal et al.	Mar 2013	A1
20130086684	Mohler	Apr 2013	A1
20130097699	Balupari	Apr 2013	A1
20130097706	Titonis et al.	Apr 2013	A1
20130111587	Goel et al.	May 2013	A1
20130111591	Topan	May 2013	A1
20130117852	Stute	May 2013	A1
20130117855	Kim et al.	May 2013	A1
20130139264	Brinkley et al.	May 2013	A1
20130160125	Likhachev et al.	Jun 2013	A1
20130160127	Jeong et al.	Jun 2013	A1
20130160130	Mendelev et al.	Jun 2013	A1
20130160131	Madou et al.	Jun 2013	A1
20130167236	Sick	Jun 2013	A1
20130174214	Duncan	Jul 2013	A1
20130185789	Hagiwara et al.	Jul 2013	A1
20130185795	Winn et al.	Jul 2013	A1
20130185798	Saunders et al.	Jul 2013	A1
20130191915	Antonakakis et al.	Jul 2013	A1
20130196649	Paddon et al.	Aug 2013	A1
20130227691	Aziz et al.	Aug 2013	A1
20130246370	Bartram et al.	Sep 2013	A1
20130263260	Mahaffey et al.	Oct 2013	A1
20130291109	Staniford et al.	Oct 2013	A1
20130298243	Kumar et al.	Nov 2013	A1
20140053260	Gupta et al.	Feb 2014	A1
20140053261	Gupta et al.	Feb 2014	A1
20140130158	Wang et al.	May 2014	A1
20140137180	Lukacs et al.	May 2014	A1
20140165203	Friedrichs	Jun 2014	A1
20140169762	Ryu	Jun 2014	A1
20140179360	Jackson et al.	Jun 2014	A1
20140328204	Klotsche et al.	Nov 2014	A1
20140337836	Ismael	Nov 2014	A1
20140351935	Shao et al.	Nov 2014	A1
20150047034	Burnham	Feb 2015	A1
20150096025	Ismael	Apr 2015	A1
20150135262	Porat	May 2015	A1

Foreign Referenced Citations (12)

Number	Date	Country
102811213	Dec 2012	CN
2439806	Jan 2008	GB
2490431	Oct 2012	GB
WO-0206928	Jan 2002	WO
WO-0223805	Mar 2002	WO
WO-2007-117636	Oct 2007	WO
WO-2008041950	Apr 2008	WO
WO-2011084431	Jul 2011	WO
2011112348	Sep 2011	WO
2012075336	Jun 2012	WO
WO-2012145066	Oct 2012	WO
2013067505	May 2013	WO

Non-Patent Literature Citations (76)

Entry
Using Fuzzy Pattern Recognition to Detect Unknown Malicious Executables Code; Boyun Zhang, L. Wang and Y. Jin (Eds.): FSKD 2005, LNAI 3613, pp. 629-634, 2005. Springer-Verlag Berlin Heidelberg 2005.
“Measuring Similarity of Malware Behavior”, Martin Apel, Christian Bockermann and Michael Meier, (University of Dortmund), The 5th LCN Workshop on Security in Communications Networks (SICK 2009) Zürich, Switzerland; Oct. 20-23, 2009.
IEEE Xplore Digital Library Sear Results for “detection of unknown computer worms”. Http//ieeexplore.ieee.org/searchresult.jsp?SortField=Score&SortOrder=desc&ResultC. . . , (Accessed on Aug. 28, 2009).
AltaVista Advanced Search Results. “Event Orchestrator”. Http://www.altavista.com/web/results?ltag=ody&pg=aq&aqmode=aqa=Event+Orchesrator. . . , (Accessed on Sep. 3, 2009).
AltaVista Advanced Search Results. “attack vector identifier”. Http://www.altavista.com/web/results?ltag=ody&pg=aq&aqmode=aqa=Event+Orchestrator. . . , (Accessed on Sep. 15, 2009).
Cisco, Configuring the Catalyst Switched Port Analyzer (SPAN) (“Cisco”), (1992-2003).
Reiner Sailer, Enriquillo Valdez, Trent Jaeger, Roonald Perez, Leendert van Doorn, John Linwood Griffin, Stefan Berger., sHype: Secure Hypervisor Appraoch to Trusted Virtualized Systems (Feb. 2, 2005) (“Sailer”).
Excerpt regarding First Printing Date for Merike Kaeo, Designing Network Security (“Kaeo”), (2005).
The Sniffers's Guide to Raw Traffic available at: yuba.stanford.edu/˜casado/pcap/section1.html, (Jan. 6, 2014).
NetBIOS Working Group. Protocol Standard for a NetBIOS Service on a TCP/UDP transport: Concepts and Methods. STD 19, RFC 1001 Mar. 1987.
“Network Security: NetDetector—Network Intrusion Forensic System (NIFS) Whitepaper”, (“NetDetector Whitepaper”), (2003).
“Packet”, Microsoft Computer Dictionary, Microsoft Press, (Mar. 2002), 1 page.
“When Virtual is Better Than Real”, IEEEXplore Digital Library, available at, http://ieeexplore.ieee.org/xpl/articleDetails.jsp?reload=true&arnumber=990073, (Dec. 7, 2013).
Abdullah, et al., Visualizing Network Data for Intrusion Detection, 2005 IEEE Workshop on Information Assurance and Security, pp. 100-108.
Adetoye, Adedayo , et al., “Network Intrusion Detection & Response System”, (“Adetove”), (Sep. 2003).
Aura, Tuomas, “Scanning electronic documents for personally identifiable information”, Proceedings of the 5th ACM workshop on Privacy in electronic society. ACM, 2006.
Baecher, “The Nepenthes Platform: An Efficient Approach to collect Malware”, Springer-verlag Berlin Heidelberg, (2006), pp. 165-184.
Bayer, et al., “Dynamic Analysis of Malicious Code”, J Comput Virol, Springer-Verlag, France., (2006), pp. 67-77.
Boubalos, Chris , “extracting syslog data out of raw pcap dumps, seclists.org, Honeypots mailing list archives”, available at http://seclists.org/honeypots/2003/q2/319 (“Boubalos”), (Jun. 5, 2003).
Chaudet, C. , et al., “Optimal Positioning of Active and Passive Monitoring Devices”, International Conference on Emerging Networking Experiments and Technologies, Proceedings of the 2005 ACM Conference on Emerging Network Experiment and Technology, CoNEXT '05, Toulousse, France, (Oct. 2005), pp. 71-82.
Cohen, M.I. , “PyFlag—An advanced network forensic Framework”, Digital Investigation 5, Elsevier, (2008), pp. S112-S120.
Costa, M. , et al., “Vigilante: End-to-End Containment of Internet Worms”, SOSP '05, Association for Computing Machinery, Inc., Brighton U.K., (Oct. 23-26, 2005).
Crandall, J.R. , et al., “Minos:Control Data Attack Prevention Orthogonal to Memory Model”, 37th International Symposium on Microarchitecture, Portland, Oregon, (Dec. 2004).
Deutsch, P. , “Zlib compressed data format specification version 3.3” RFC 1950, (1996).
Distler, “Malware Analysis: An Introduction”, SANS Institute InfoSec Reading Room, SANS Institute, (2007).
Dunlap, George W. , et al., “ReVirt: Enabling Intrusion Analysis through Virtual-Machine Logging and Replay”, Proceeding of the 5th Symposium on Operating Systems Design and Implementation, USENIX Association, (“Dunlap”), (Dec. 9, 2002).
Filiol, Eric , et al., “Combinatorial Optimisation of Worm Propagation on an Unknown Network”, International Journal of Computer Science 2.2 (2007).
Goel, et al., Reconstructing System State for Intrusion Analysis, Apr. 2008 SIGOPS Operating Systems Revies, vol. 42 Issue 3, pp. 21-28.
Hjelmvik, Erik , “Passive Network Security Analysis with Network Miner”, (IN)Secure, Issue 18, (Oct. 2008), pp. 1-100.
Kaeo, Merike , “Designing Network Security”, (“Kaeo”), (Nov. 2003).
Kim, H. , et al., “Autograph: Toward Automated, Distributed Worm Signature Detection”, Proceedings of the 13th Usenix Security Symposium (Security 2004), San Diego, (Aug. 2004), pp. 271-286.
Krasnyansky, Max , et al., Universal TUN/TAP driver, available at https://www.kernel.org/doc/Documentation/networking/tuntap.txt(2002) (“Krasnyansky”).
Kreibich, C. , et al., “Honeycomb-Creating Intrusion Detection Signatures Using Honeypots”, 2nd Workshop on Hot Topics in Networks (HotNets-11), Boston, USA, (2003).
Warning System Design and Testing, Institute for Security Technology studies, Dartmouth College, (“Liljenstam”), (Oct. 27, 2003).
Marchette, David J., “Computer Intrusion Detection and Network Monitoring: A Statistical Viewpoint”, (“Marchette”), (2001).
Margolis, P.E. , “Random House Webster's ‘Computer & Internet Dictionary 3rd Edition’”, ISBN 0375703519, (Dec. 1998).
Moore, D. , et al., “Internet Quarantine: Requirements for Containing Self-Propagating Code”, Infocom, vol. 3, (Mar. 30-Apr. 3, 2003), pp. 1901-1910.
Morales, Jose A., et al., ““Analyzing and exploiting network behaviors of malware.””, Security and Privacy in Communication Networks. Springer Berlin Heidelberg, 2010. 20-34.
Natvig, Kurt , “SANDBOXII: Internet”, Virus Bulletin Conference, (“Natvig”), (Sep. 2002).
Newsome, J. , et al., “Dynamic Taint Analysis for Automatic Detection, Analysis and Signature Generation of Exploits on Commodity Software”, In Proceedings of the 12th Annual Network and Distributed System Securty, Symposium (NDSS '05), (Feb. 2005).
Newsome, J. , et al., “Polygraph: Automatically Generating Signatures for Polymorphic Worms”, In Proceedings of the IEEE Symposium on Security and Privacy, (May 2005).
Nojiri, D. , et al., “Cooperation Response Strategies for Large Scale Attack Mitigation”, DARPA Information Survivability Conference and Exposition, vol. 1, (Apr. 22-24, 2003), pp. 293-302.
Silicon Defense, “Worm Containment in the Internal Network”, (Mar. 2003), pp. 1-25.
Singh, S. , et al., “Automated Worm Fingerprinting”, Proceedings of the ACM/USENIX Symposium on Operating System Design and Implementation, San Francisco, California, (Dec. 2004).
Spitzner, Lance , “Honeypots: Tracking Hackers”, (“Spizner”), (Sep. 17, 2002).
Thomas H. Ptacek, and Timothy N. Newsham Denial , “Insertion, Evasion, and Denial of Service: Eluding Network Intrusion Detection”, Secure Networks, (“Ptacek”), (Jan. 1998).
Venezia, Paul , “NetDetector Captures Intrusions”, InfoWorld Issue 27, (“Venezia”), (Jul. 14, 2003).
Whyte, et al., “DNS-Based Detection of Scanning Works in an Enterprise Network”, Proceedings of the 12th Annual Network and Distributed System Security Symposium, (Feb. 2005), 15 pages.
Williamson, Matthew M., “Throttling Viruses: Restricting Propagation to Defeat Malicious Mobile Code”, ACSAC Conference, Las Vegas, NV, USA, (Dec. 2002), pp. 1-9.
“White Paper Advanced Threat Protection 1-16 Solution”, URL:http://cdn2.hubspot.net/hub/237610/file-232929750-pdf/White—Papers/WP-—Seculert—Solution.pdf, dated Jul. 28, 2013.
Apostolopoulos, George; hassapis, Constantinos; “V-eM: A cluster of Virtual Machines for Robust, Detailed, and High-Performance Network Emulation”, 14th IEEE International Symposium on Modeling, Analysis, and Simulation of Computer and Telecommunication Systems, Sep. 11-14, 2006, pp. 117-126.
Baldi, Mario; Risso, Fulvio; “A Framework for Rapid Development and Portable Execution of Packet-Handling Applications”, 5th IEEE International Symposium Processing and Information Technology, Dec. 21, 2005, pp. 233-238.
Cisco “Intrusion Prevention for the Cisco ASA 5500-x Series” Data Sheet (2012).
Clark, John, Sylvian Leblanc,and Scott Knight. “Risks associated with usb hardware trojan devices used by insiders.” Systems Conference (SysCon), 2011 IEEE International. IEEE, 2011.
FireEye Malware Analysis & Exchange Network, Malware Protection System, FireEye Inc., 2010.
FireEye Malware Analysis, Modern Malware Forensics, FireEye Inc., 2010.
FireEye v.6.0 Security Target, pp. 1-35, Version 1.1, FireEye Inc., May 2011.
Gibler, Clint, et al. AndroidLeaks: automatically detecting potential privacy leaks in android applications on a large scale. Springer Berlin Heidelberg, 2012.
Gregg Keizer: “Microsoft's HoneyMonkeys Show Patching Windows Works”, Aug. 8, 2005, XP055143386, Retrieved from the Internet: URL:https://web.archive.org/web/20121022220617/http://www.informationweek- .com/microsofts-honeymonkeys-show-patching-wi/167600716 [retrieved on Sep. 29, 2014].
Heng Yin et al, Panorama: Capturing System-Wide Information Flow for Malware Detection and Analysis, Research Showcase @ CMU, Carnegie Mellon University, 2007.
Idika et al., A-Survey-of-Malware-Detection-Techniques, Feb. 2, 2007, Department of Computer Science, Purdue University.
Isohara, Takamasa, Keisuke Takemori, and Ayumu Kubota. “Kernel-based behavior analysis for android malware detection.” Computational intelligence and Security (CIS), 2011 Seventh International Conference on. IEEE, 2011.
Jiyong Jang et al. “BitShred” Computer and Communications Security, ACM, dated (p. 309-320) Oct. 17, 2011.
Kevin A Roundy et al: “Hybrid Analysis and Control of Malware”, Sep. 15, 2010, Recent Advances in Intrusion Detection, Springer Berlin Heidelberg, Berlin, Heidelberg, pp. 317-338, XP019150454 ISBN:978-3-642-15511-6.
Leading Colleges Select FireEye to Stop Malware-Related Data Breaches, FireEye Inc., 2009.
Li et al., A VMM-Based System Call Interposition Framework for Program Monitoring, Dec. 2010, IEEE 16th International Conference on Parallel and Distributed Systems, pp. 706-711.
Lindorfer, Martina, Clemens Kolbitsch, and Paolo Milani Comparetti. “Detecting environment-sensitive malware.” Recent Advances in Intrusion Detection. Springer Berlin Heidelberg, 2011.
Lok Kwong et al: “DroidScope: Seamlessly Reconstructing the OS and Dalvik Semantic Views for Dynamic Android Malware Analysis”, Aug. 10, 2012, XP055158513, Retrieved from the Internet: URL:https://www.usenix.org/system/files/conference/usenixsecurity12/sec12- -final107.pdf [retrieved on Dec. 15, 2014].
Mori, Detecting Unknown Computer Viruses, 2004, Springer-Verlag Berlin Heidelberg.
Oberheide et al., CloudAV.sub.-N-Version Antivirus in the Network Cloud, 17th USENIX Security Symposium USENIX Security '08 Jul. 28-Aug. 1, 2008 San Jose, CA.
PCT/US14/55958 filed Sep. 16, 2014 International Search Report and Written Opinion, dated May 1, 2015.
U.S. Pat. No. 8,171,553 filed Apr. 20, 2006, Inter Parties Review Decision dated Jul. 10, 2015.
U.S. Pat. No. 8,291,499 filed Mar. 16, 2012, Inter Parties Review Decision dated Jul. 10, 2015.
Wahid et al., Characterising the Evolution in Scanning Activity of Suspicious Hosts, Oct. 2009, Third International Conference on Network and System Security, pp. 344-350.
Yuhei Kawakoya et al: “Memory behavior-based automatic malware unpacking in stealth debugging environment”, Malicious and Unwanted Software (Malware), 2010 5th International Conference on, IEEE, Piscataway, NJ, USA, Oct. 19, 2010, pp. 39-46, XP031833827, ISBN:978-1-4244-8-9353-1.
Zhang et al., The Effects of Threading, Infection Time, and Multiple-Attacker Collaboration on Malware Propagation, Sep. 2009, IEEE 28th International Symposium on Reliable Distributed Systems, pp. 73-82.

Related Publications (1)

	Number	Date	Country
	20150096023 A1	Apr 2015	US

Fuzzy hash of behavioral results

Information

Patent Number

Date Filed

Date Issued

Inventors

Original Assignees

Examiners

Agents

CPC

Field of Search

US

CPC

International Classifications