Nefarious individuals attempt to compromise computer systems in a variety of ways. As an example, unsuspecting users may download or otherwise obtain software that purports to be useful (e.g., a game) when in fact, the software contains malicious elements (i.e., is or contains “malware”). Unfortunately, malware authors are crafting increasingly sophisticated malware that is increasingly able to evade detection. Accordingly, there exists an ongoing need for improved techniques to detect malware and prevent its harm.
Various embodiments of the invention are disclosed in the following detailed description and the accompanying drawings.
The invention can be implemented in numerous ways, including as a process; an apparatus; a system; a composition of matter; a computer program product embodied on a computer readable storage medium; and/or a processor, such as a processor configured to execute instructions stored on and/or provided by a memory coupled to the processor. In this specification, these implementations, or any other form that the invention may take, may be referred to as techniques. In general, the order of the steps of disclosed processes may be altered within the scope of the invention. Unless stated otherwise, a component such as a processor or a memory described as being configured to perform a task may be implemented as a general component that is temporarily configured to perform the task at a given time or a specific component that is manufactured to perform the task. As used herein, the term ‘processor’ refers to one or more devices, circuits, and/or processing cores configured to process data, such as computer program instructions.
A detailed description of one or more embodiments of the invention is provided below along with accompanying figures that illustrate the principles of the invention. The invention is described in connection with such embodiments, but the invention is not limited to any embodiment. The scope of the invention is limited only by the claims and the invention encompasses numerous alternatives, modifications and equivalents. Numerous specific details are set forth in the following description in order to provide a thorough understanding of the invention. These details are provided for the purpose of example and the invention may be practiced according to the claims without some or all of these specific details. For the purpose of clarity, technical material that is known in the technical fields related to the invention has not been described in detail so that the invention is not unnecessarily obscured.
Appliance 102 can be configured to work in cooperation with one or more virtual machine servers (112, 124) to perform malware analysis. As one example, data appliance 102 can be configured to provide a copy of malware 130 (transmitted to client 104 by host 120) to one or more of the virtual machine servers for real-time analysis. As another example, service 122 can provide a list of signatures of known-malicious files to appliance 102 as part of a subscription. Those signatures can be generated by service 122 in conjunction with the techniques described herein.
An example of a virtual machine server is a physical machine comprising commercially available server-class hardware (e.g., a multi-core processor, 4+ Gigabytes of RAM, and one or more Gigabit network interface adapters) that runs commercially available virtualization software, such as VMware ESXi, Citrix XenServer, or Microsoft Hyper-V. The virtual machine servers may be separate from, but in communication with, data appliance 102, as shown in
A virtual machine (VM) can be used to execute a candidate malware application (e.g., in a VM sandbox environment). Behavior profiling using various heuristic-based analysis techniques and network traffic analysis associated with the application can be performed. As one example, suppose a malicious user of host 120 attempts to send a user of 104 an email message that includes as an attachment a malicious executable (e.g., having a .exe extension). Data appliance 102 can determine whether a signature for the executable is present on data appliance 102. A signature, if present, can indicate that the executable is known to be safe, and can also indicate that the executable is known to be malicious. If no signature for the executable is found, data appliance 102 is configured to provide the executable to a virtual machine server, such as virtual machine server 112, for analysis.
Virtual machine server 112 is configured to execute the executable in one or more virtual machines 114-116. If the executable is ultimately determined to be malicious, appliance 102 can automatically block the file download to client 104 (and any other applicable clients) based on the analysis result. Further, a signature can be generated and distributed (e.g., to other data appliances) to automatically block future file transfer requests to download the file determined to be malicious.
As will be described in more detail below, traffic analysis can be performed on communications associated with the executable and determinations as to whether or not the executable is malicious based, in part, on that traffic analysis. In some embodiments the analysis (also referred to herein as “post-analysis”) is performed by the VM server (e.g., VM server 112). In other embodiments, the analysis is performed at least in part by appliance 102. In yet other embodiments, the analysis is performed at least in part by cloud security service 122 (e.g., where virtual machine server 124 is used). The malware analysis functionality illustrated in
Appliance 102 can take a variety of forms. For example, appliance 102 can be a dedicated device or set of devices. The functionality provided by appliance 102 can also be integrated into or executed as software on a general purpose computer, a computer server, a gateway, and/or a network/routing device. For example, in some embodiments services provided by data appliance 102 are instead (or in addition) provided to client 104 by software executing on client 104.
Whenever appliance 102 is described as performing a task, a single component, a subset of components, or all components of appliance 102 may cooperate to perform the task. Similarly, whenever a component of appliance 102 is described as performing a task, a subcomponent may perform the task and/or the component may perform the task in conjunction with other components. In various embodiments, portions of appliance 102 are provided by one or more third parties. Depending on factors such as the amount of computing resources available to appliance 102, various logical components and/or features of appliance 102 may be omitted and the techniques described herein adapted accordingly. Similarly, additional logical components/features can be added to appliance 102 as applicable.
Suppose, when executing a copy of candidate malware, virtual machine 114 attempts to access a remote server 134 using an encrypted session protocol, such as SSL. Network processor 306 is configured to receive packets from virtual machine 114, and provide the packets to data plane 304 for processing. Flow 308 identifies the packets as being part of a new session and creates a new session flow. Subsequent packets will be identified as belonging to the session based on a flow lookup. If applicable, SSL decryption is applied by SSL decrypter 310. Otherwise, processing by SSL decrypter 310 is omitted. Application identification module 312 is configured to determine what type of traffic the session involves. For example, application identification module 312 can recognize a GET request in the received data and conclude that the session requires an HTTP decoder. For each type of protocol, there exists a corresponding decoder 314. In some embodiments, the application identification is performed by an application identification module (e.g., APP-ID engine). Based on the determination made by application identification module 312, the packets are sent to an appropriate decoder 314. Decoder 314 is configured to assemble packets (e.g., which may be received out of order) into the correct order, perform tokenization, and extract out information. Also as shown, signatures 320 are received and stored in the management plane 302. As needed, packets are re-encrypted by SSL encryption module 316 and then provided to forward module 318 for transmission to server 134.
At 404, the candidate malware application is executed in one or more virtual machines. For example, the candidate malware application can be executed in virtual machine 114. In some embodiments, the virtual machine is configured to include a set of personal user data or other information (e.g., an address book and one or more documents) usable in performing malware analysis. Any network traffic generated as a result of executing the application can be captured (e.g., using pcap) and stored (e.g., in storage 210). As another example, the candidate malware can be executed in virtual machines 126-128 and the network traffic stored by service 122.
At 406, traffic analysis is performed on the captured network traffic. And, at 408, a determination is made as to whether or not the application is malicious based at least in part on the traffic analysis and an application type of the candidate malware application. Additional detail regarding the processing performed at 406 and 408 of process 400 follows.
Suppose the candidate malware application is a screen-sharing utility. The application is executed in a virtual machine (404) and network traffic associated with the execution of the application is captured. Specifically, traffic associated with the virtual machine is parsed by system 132 and application identification module 312 is used to determine that the application is a screen-sharing utility. Other examples of types of applications include (but are not limited to) web browsing applications, FTP applications, email applications, and IRC applications.
When an application is executed in a virtual machine, the application may take certain expected actions (observable via traffic analysis). In the example of the malicious screen-sharing utility, examples of expected actions could include attempting to connect to a known screen-sharing server and/or using a known screen-sharing protocol. An application may also take certain unexpected actions. In the example of the malicious screen-sharing utility, examples of unexpected actions include attempting to transmit a copy of the address book to host 120 and/or attempting to send out emails.
Based on analysis of the collected network traffic, and also based on the type of application (e.g., as determined by module 312), a determination can be made as to whether the application is malicious or not. For example, as a screen-sharing application, establishing a connection to a known screen-sharing site/using a screen-sharing protocol could be considered expected (benign) behavior. For a different type of application (e.g., a word processing application) the same observed traffic (establishing a connection to a known screen-sharing site/using a screen-sharing protocol) could be considered highly suspicious. Similarly, a legitimate screen-sharing application would be very unlikely to need to transmit a user's address book. The fact that the application executing in the virtual machine has attempted to take such an action can be treated as being suspicious accordingly. In this example, no human user is interacting with the application while it is executing in the virtual machine. Thus, even if the application being analyzed was an email client, it should not be sending out emails (i.e., because there is no human drafting an email and clicking on a send button).
In various embodiments, heuristics are applied to determine whether the traffic, in the aggregate, is sufficiently suspicious to indicate that the application is malicious. In some embodiments, traffic analysis of known-good applications is performed and stored as a baseline for comparison against evaluation of unknown applications. If a given application is determined to be suspicious, additional actions can be taken to confirm whether or not the application should be treated as malicious or not. For example, a suspicious application can be quarantined and additional processing can be performed in other virtual machines. Other types of analysis can also be performed, such as static analysis. If a determination is made that an application is malicious (or, in some embodiments, suspicious) appropriate actions can be taken. As one example, a signature for the executable can be generated (e.g., as an MD5 hash-based signature) and stored in data appliance 102 and/or propagated to other data appliances as applicable. As another example, instead of or in addition to generating a signature, an alert can be generated that instructs data appliance 102 not to provide the executable to client 104.
Additional examples of network traffic which, if attributed to a specific application executing in a virtual machine, can be indicative of malware are as follows:
Although the foregoing embodiments have been described in some detail for purposes of clarity of understanding, the invention is not limited to the details provided. There are many alternative ways of implementing the invention. The disclosed embodiments are illustrative and not restrictive.
Number | Name | Date | Kind |
---|---|---|---|
5983348 | Ji | Nov 1999 | A |
7409718 | Hong et al. | Aug 2008 | B1 |
7496963 | Shipp | Feb 2009 | B2 |
7568233 | Szor et al. | Jul 2009 | B1 |
7649838 | Fishteyn et al. | Jan 2010 | B2 |
7664855 | Freed et al. | Feb 2010 | B1 |
7779472 | Lou | Aug 2010 | B1 |
7870610 | Mitchell et al. | Jan 2011 | B1 |
7958555 | Chen et al. | Jun 2011 | B1 |
8011010 | Michael et al. | Aug 2011 | B2 |
8141132 | Oliver et al. | Mar 2012 | B2 |
8201246 | Wu et al. | Jun 2012 | B1 |
8209680 | Le et al. | Jun 2012 | B1 |
8225317 | Chiueh et al. | Jul 2012 | B1 |
8260914 | Ranjan | Sep 2012 | B1 |
8291468 | Chickering | Oct 2012 | B1 |
8316440 | Hsieh et al. | Nov 2012 | B1 |
8321936 | Green et al. | Nov 2012 | B1 |
8359651 | Wu et al. | Jan 2013 | B1 |
8370938 | Daswani et al. | Feb 2013 | B1 |
8402543 | Ranjan et al. | Mar 2013 | B1 |
8407324 | McDougal | Mar 2013 | B2 |
8438639 | Lee et al. | May 2013 | B2 |
8443363 | Brennan, III et al. | May 2013 | B1 |
8443449 | Fan et al. | May 2013 | B1 |
8464341 | Cohen | Jun 2013 | B2 |
8479295 | Sahita et al. | Jul 2013 | B2 |
8484732 | Chen et al. | Jul 2013 | B1 |
8484739 | Seshadri | Jul 2013 | B1 |
8510827 | Leake et al. | Aug 2013 | B1 |
8516591 | Fly et al. | Aug 2013 | B2 |
8521667 | Zhu et al. | Aug 2013 | B2 |
8539577 | Stewart et al. | Sep 2013 | B1 |
8566946 | Aziz | Oct 2013 | B1 |
8572740 | Mashevsky et al. | Oct 2013 | B2 |
8578481 | Rowley | Nov 2013 | B2 |
8584239 | Aziz et al. | Nov 2013 | B2 |
8646088 | Pistoia et al. | Feb 2014 | B2 |
8656491 | Daswani et al. | Feb 2014 | B1 |
8677487 | Balupari et al. | Mar 2014 | B2 |
8683584 | Daswani et al. | Mar 2014 | B1 |
8707441 | Cidambi et al. | Apr 2014 | B1 |
8756691 | Nachenberg | Jun 2014 | B2 |
8763125 | Feng | Jun 2014 | B1 |
8838570 | English | Sep 2014 | B1 |
9003526 | El-Moussa | Apr 2015 | B2 |
20040030913 | Liang et al. | Feb 2004 | A1 |
20040107416 | Buban et al. | Jun 2004 | A1 |
20050177602 | Kaler et al. | Aug 2005 | A1 |
20060168024 | Mehr et al. | Jul 2006 | A1 |
20070050850 | Katoh et al. | Mar 2007 | A1 |
20070192857 | Ben-Itzhak | Aug 2007 | A1 |
20070261112 | Todd et al. | Nov 2007 | A1 |
20080016552 | Hart | Jan 2008 | A1 |
20080177755 | Stern et al. | Jul 2008 | A1 |
20080177994 | Mayer | Jul 2008 | A1 |
20080209562 | Szor | Aug 2008 | A1 |
20080229393 | Congdon | Sep 2008 | A1 |
20080256633 | Arnold et al. | Oct 2008 | A1 |
20080263658 | Michael et al. | Oct 2008 | A1 |
20080263659 | Alme | Oct 2008 | A1 |
20080320594 | Jiang | Dec 2008 | A1 |
20090019547 | Palliyil et al. | Jan 2009 | A1 |
20090055928 | Kang et al. | Feb 2009 | A1 |
20090144826 | Piccard | Jun 2009 | A2 |
20090150419 | Kim et al. | Jun 2009 | A1 |
20090241190 | Todd et al. | Sep 2009 | A1 |
20090282485 | Bennett | Nov 2009 | A1 |
20090288167 | Freericks et al. | Nov 2009 | A1 |
20100043072 | Rothwell | Feb 2010 | A1 |
20100077481 | Polyakov et al. | Mar 2010 | A1 |
20100107252 | Mertoguno | Apr 2010 | A1 |
20100115586 | Raghavan et al. | May 2010 | A1 |
20100154059 | McNamee et al. | Jun 2010 | A1 |
20100162350 | Jeong et al. | Jun 2010 | A1 |
20100175132 | Zawadowskiy et al. | Jul 2010 | A1 |
20100281458 | Paladino | Nov 2010 | A1 |
20110041179 | St Hlberg | Feb 2011 | A1 |
20110055923 | Thomas | Mar 2011 | A1 |
20110090911 | Hao et al. | Apr 2011 | A1 |
20110099620 | Stavrou et al. | Apr 2011 | A1 |
20110167495 | Antonakakis et al. | Jul 2011 | A1 |
20110173698 | Polyakov et al. | Jul 2011 | A1 |
20110185425 | Lee et al. | Jul 2011 | A1 |
20110252474 | Ward et al. | Oct 2011 | A1 |
20110271342 | Chung et al. | Nov 2011 | A1 |
20110296412 | Banga et al. | Dec 2011 | A1 |
20110296486 | Burch et al. | Dec 2011 | A1 |
20120084860 | Cao et al. | Apr 2012 | A1 |
20120089700 | Safruti et al. | Apr 2012 | A1 |
20120096549 | Amini et al. | Apr 2012 | A1 |
20120117650 | Nachenberg | May 2012 | A1 |
20120117652 | Manni et al. | May 2012 | A1 |
20120192274 | Odom et al. | Jul 2012 | A1 |
20120233691 | Jiang | Sep 2012 | A1 |
20120240224 | Payne et al. | Sep 2012 | A1 |
20120255018 | Sallam | Oct 2012 | A1 |
20120255019 | McNamee et al. | Oct 2012 | A1 |
20120255021 | Sallam | Oct 2012 | A1 |
20120255031 | Sallam | Oct 2012 | A1 |
20120278889 | El-Moussa | Nov 2012 | A1 |
20120291131 | Turkulainen et al. | Nov 2012 | A1 |
20130014259 | Gribble et al. | Jan 2013 | A1 |
20130047147 | McNeill | Feb 2013 | A1 |
20130055394 | Beresnevichiene et al. | Feb 2013 | A1 |
20130091350 | Gluck | Apr 2013 | A1 |
20130091570 | McCorkendale et al. | Apr 2013 | A1 |
20130104230 | Tang et al. | Apr 2013 | A1 |
20130145002 | Kannan et al. | Jun 2013 | A1 |
20130145008 | Kannan et al. | Jun 2013 | A1 |
20130152200 | Alme et al. | Jun 2013 | A1 |
20130227165 | Liu | Aug 2013 | A1 |
20130246685 | Bhargava et al. | Sep 2013 | A1 |
20130298184 | Ermagan et al. | Nov 2013 | A1 |
20130298192 | Kumar et al. | Nov 2013 | A1 |
20130298230 | Kumar et al. | Nov 2013 | A1 |
20130298242 | Kumar et al. | Nov 2013 | A1 |
20130298243 | Kumar et al. | Nov 2013 | A1 |
20130298244 | Kumar et al. | Nov 2013 | A1 |
20130326625 | Anderson et al. | Dec 2013 | A1 |
20140059641 | Chapman et al. | Feb 2014 | A1 |
20140096131 | Sonnek et al. | Apr 2014 | A1 |
20140283037 | Sikorski et al. | Sep 2014 | A1 |
20140380474 | Paithane et al. | Dec 2014 | A1 |
20150058984 | Shen et al. | Feb 2015 | A1 |
Number | Date | Country |
---|---|---|
2012134584 | Oct 2012 | WO |
2013067505 | May 2013 | WO |
2013067508 | May 2013 | WO |
2013134206 | Sep 2013 | WO |
Entry |
---|
Davidoff et al., “Chapter 12: Malware Forensics,” Network Forensics: Tracking Hackers Through Cyberspace, Pearson Education Inc., Jun. 2012, 60 pages. |
Ligh et al., “Chapter 5: Researching Domains and IP Addresses,” Malware Analyst's Cookbook, John Wiley & Sons, 2011, 38 pages. |
van der Heide et al., “DNS Anomaly Detection,” System and Network Engineering Research Group, University of Amsterdam, Feb. 6, 2011, 20 pages. |
Abu Rajab et al., “A Multifaceted Approach to Understanding the Botnet Phenonmenon,” Proceedings of the 6th ACM SIGCOMM conference on Internet measurement, 2006, 12 pages. |
Schechter et al., “Fast Detection of Scanning Worm Infections,” Recent Advances in Intrusion Detection: 7th International Symposium RAID 2004 Proceedings, 2004, 24 pages. |
Sikorski et al., “Chapter 14: Malware-Focused Network Signatures,” Practical Malware Anlaysis, No Starch Press, Feb. 2012, 13 pages. |
Chen et al., “Chapter 4: Guarding Against Network Intrusions,” Network and System Security, Elsevier Inc., 2009, 5 pages. |
Author Unknown, “FireEye Malware Analysis”, FireEye.com, FireEye, Inc., 2010. |
Author Unknown, “Hybrid Sandboxing for Detecting and Analyzing Advanced and Unknown Malware”, Blue Coat Systems, Inc., 2014. |
Lindorfer et al., “Detecting Enviroment-Sensitive Malware”, Recent Advances in Intrusion Detection. Springer Berlin Heidelberg, 2011. |
Singh et al., “Hot Knives Through Butter: Evading File-based Sandboxes”, FireEye, Inc., Feb. 2014. |
Author Unknown, “Multi-Vector Virtual Execution (MVX) Engine”, FireEye, Inc., http://www.fireeye.com/products-and-solutions/virtual-execution-engine.html, 2014. |
Wagener et al., “An Instrumented Analysis of Unknown Software and Malware Driven by Free Libre Open Source Software”, Signal Image Technology and Internet Based Systems, 2008. SITIS'08. IEEE International Conference on. IEEE, 2008. |
Lau et al., “Measuring Virtual Machine Detection in Malware using DSD Tracer”, Sophoslabs, Journal in Computer Virology, 2008. |
Sun et al. “Malware Virtualization-resistant behavior detection”, 2011 IEEE, pp. 912-917. |
Zang et al., “Botnet Detection Through Fine Flow Classifcation”, CSE Dept. Technical Report No. CSE11-001, p. 1-17, Jan. 31, 2011. |
Landecki, Grzegorz, Detecting Botnets, Linux Journal, Jan. 1, 2009. |
Karasaridis, Anestis et al., Wide-scale Botnet Detection and Characterization, Dec. 14, 2010. |
Author Unknown, Advanced Persistent Threats (APT), What's an APT? A Brief Definition, Damballa, Dec. 14, 2010. |
Author Unknown, Executive Overview, The Command Structure of the Aurora Botnet, Damballa, Mar. 2010. |
Strayer, W Timothy et al. Detecting Botnets with Tight Command and Control, BBN Technologies, Nov. 2006. |
Ramachandran, Anirudh et al., Revealing Botnet Membership Using DNSBL Counter-Intelligence, 2006. |
Goebel, Jan et al., Rishi: Identify Bot Contaminated Hosts by IRC Nickname Evaluation, Apr. 2007. |
Gu, Guofei et al., BotSniffer: Detecting Botnet Command and Control Channels in Network Traffic, Feb. 2008. |
Gu, Guofei et al., BotHunter: Detecting Malware Infection Through IDS-Driven Dialog Correlation, Aug. 2007. |
Gu, Guofei et al., BotMiner: Clustering Analysis of Network Traffic for Protocol-and Structure-Independent Botnet Detection, Jul. 2008. |
Royal, Paul, Analysis of the Kraken Botnet, Damballa, Apr. 9, 2008. |
Livadas, Carl et al., Using Machine Learning Techniques to Identify Botnet Traffic, BBN Technologies, Nov. 2006. |
Binkley, James R. et al., An Algorithm for Anomaly-based Botnet Detection, Jul. 2006. |
Yen, Ting-Fang et al., Traffic Aggregation for Malware Detection, Jul. 2008. |
Author Unknown, Anatomy of a Targeted Attack, Damballa, Dec. 3, 2008. |
Author Unknown, Layer 8, How and Why Targeted Attacks Exploit Your Users, Damballa, Nov. 2011. |
Author Unknown, Targeted Attacks for Fun and Profit, An Executed Guide to a New and Growing Enterprise Threat, Damballa, Oct. 13, 2008. |
Author Unknown, AV, IDS/IPS and Damballa's Response to Targeted Attacks, A Technology Comparison, Damballa, Nov. 2008. |
Author Unknown, Updated on the Enemy, A Deconstruction of Who Profits From Botnets, Damballa, May 13, 2009. |
Author Unknown, A Day in the Life of a BotArmy, Damballa, 2008. |
Ollmann, Gunter, Botnet Communication Topologies, Understanding the Intricacies of Bonet Command and Control, Damballa, 2009. |
Ollmann, Gunter, The Botnet vs. Malware Relationship, The One to one Botnet Myth, Damballa, 2009. |
Author Unknown, Closed Window, How Failsafe Enhancements Dramatically Limit Opportunities for Malware Armies and other Targeted Attacks, Damballa, Sep. 23, 2009. |
Author Unknown, Damballa's In-The-Cloud Security Model, Enterprise Protection Moves Beyond the Network Perimeter, Damballa, Aug. 24, 2008. |
Ollmann, Gunter, Extracting CnC from Malware, The Role of malware Sample Analysis in Botnet Detection, Damballa, 2009. |
Ollmann, Gunter, The Opt-IN Botnet Generation, Hacktivism and Centrally Controlled Protesting, Social Networks, Damballa, 2010. |
Ollmann, Gunter, Serial Variant Evasion Tactics, Techniques Used to Automatically Bypass Antivirus Technologies, Damballa, 2009. |
Author Unknown, Damballa: A Different Approach, Targeted Attacks Requires a New Solution, Damballa, Sep. 23, 2008. |
Author Unknown, Trust Betrayed, What to Do When a Targeted Attack Turns Your Networks Against You, Damballa, 2008. |
Author Unknown, How to Be a Hero in the War Against BotArmies, Damballa, 2008. |
Giroire, Frederic et al., Exploiting Temporal Persistence to Detect Convert Botnet Channels, 2009. |
Russ White, “High Availability in Routing”, Mar. 2004, Cisco Systems, vol. 7, Issue 1, pp. 2-14. |
Yadav et al., “Detecting Algorithmically Generated Malicious Domain Names”, Nov. 2010. |
Dittrich et al., P2P as Botnet Command and Control; A Deeper Insight, 2008 3rd International Conference on Malicious and Unwanted Software (MALWARE), Oct. 2008, IEEE, vol. 10, pp. 41-48. |