Nefarious individuals attempt to harm computer systems in a variety of ways. As one example, such individuals may embed or otherwise include malicious software (“malware”) in email attachments and transmit (or cause the malware to be transmitted) to unsuspecting users. When executed, the malware compromises the victim's computer. Some types of malware will instruct a compromised computer to communicate with a remote host. For example, malware can turn a compromised computer into a “bot” in a “botnet,” receiving instructions from and/or reporting data to a command and control (C&C) server under the control of the nefarious individual. Such compromised computers can be used to perform a variety of tasks (e.g., initiating attacks against other systems). Unfortunately, attackers continue to adapt their techniques to evade detection. Accordingly, there exists an ongoing need for improved approaches to detecting malicious computer activities and preventing harm to computer systems.
Various embodiments of the invention are disclosed in the following detailed description and the accompanying drawings.
The invention can be implemented in numerous ways, including as a process; an apparatus; a system; a composition of matter; a computer program product embodied on a computer readable storage medium; and/or a processor, such as a processor configured to execute instructions stored on and/or provided by a memory coupled to the processor. In this specification, these implementations, or any other form that the invention may take, may be referred to as techniques. In general, the order of the steps of disclosed processes may be altered within the scope of the invention. Unless stated otherwise, a component such as a processor or a memory described as being configured to perform a task may be implemented as a general component that is temporarily configured to perform the task at a given time or a specific component that is manufactured to perform the task. As used herein, the term ‘processor’ refers to one or more devices, circuits, and/or processing cores configured to process data, such as computer program instructions.
A detailed description of one or more embodiments of the invention is provided below along with accompanying figures that illustrate the principles of the invention. The invention is described in connection with such embodiments, but the invention is not limited to any embodiment. The scope of the invention is limited only by the claims and the invention encompasses numerous alternatives, modifications and equivalents. Numerous specific details are set forth in the following description in order to provide a thorough understanding of the invention. These details are provided for the purpose of example and the invention may be practiced according to the claims without some or all of these specific details. For the purpose of clarity, technical material that is known in the technical fields related to the invention has not been described in detail so that the invention is not unnecessarily obscured.
A firewall generally protects networks from unauthorized access while permitting authorized communications to pass through the firewall. A firewall is typically a device, a set of devices, or software executed on a device that provides a firewall function for network access. For example, a firewall can be integrated into operating systems of devices (e.g., computers, smart phones, or other types of network communication capable devices). A firewall can also be integrated into or executed as one or more software applications on various types of devices, such as computer servers, gateways, network/routing devices (e.g., network routers), and data appliances (e.g., security appliances or other types of special purpose devices), and in various implementations, certain operations can be implemented in special purpose hardware, such as an ASIC or FPGA.
Firewalls typically deny or permit network transmission based on a set of rules. These sets of rules are often referred to as policies (e.g., network policies or network security policies). For example, a firewall can filter inbound traffic by applying a set of rules or policies to prevent unwanted outside traffic from reaching protected devices. A firewall can also filter outbound traffic by applying a set of rules or policies (e.g., allow, block, monitor, notify or log, and/or other actions can be specified in firewall rules or firewall policies, which can be triggered based on various criteria, such as are described herein). A firewall can also filter local network (e.g., intranet) traffic by similarly applying a set of rules or policies.
Security devices (e.g., security appliances, security gateways, security services, and/or other security devices) can include various security functions (e.g., firewall, anti-malware, intrusion prevention/detection, Data Loss Prevention (DLP), and/or other security functions), networking functions (e.g., routing, Quality of Service (QoS), workload balancing of network related resources, and/or other networking functions), and/or other functions. For example, routing functions can be based on source information (e.g., IP address and port), destination information (e.g., IP address and port), and protocol information.
A basic packet filtering firewall filters network communication traffic by inspecting individual packets transmitted over a network (e.g., packet filtering firewalls or first generation firewalls, which are stateless packet filtering firewalls). Stateless packet filtering firewalls typically inspect the individual packets themselves and apply rules based on the inspected packets (e.g., using a combination of a packet's source and destination address information, protocol information, and a port number).
Application firewalls can also perform application layer filtering (e.g., application layer filtering firewalls or second generation firewalls, which work on the application level of the TCP/IP stack). Application layer filtering firewalls or application firewalls can generally identify certain applications and protocols (e.g., web browsing using HyperText Transfer Protocol (HTTP), a Domain Name System (DNS) request, a file transfer using File Transfer Protocol (FTP), and various other types of applications and other protocols, such as Telnet, DHCP, TCP, UDP, and TFTP (GSS)). For example, application firewalls can block unauthorized protocols that attempt to communicate over a standard port (e.g., an unauthorized/out of policy protocol attempting to sneak through by using a non-standard port for that protocol can generally be identified using application firewalls).
Stateful firewalls can also perform state-based packet inspection in which each packet is examined within the context of a series of packets associated with that network transmission's flow of packets. This firewall technique is generally referred to as a stateful packet inspection as it maintains records of all connections passing through the firewall and is able to determine whether a packet is the start of a new connection, a part of an existing connection, or is an invalid packet. For example, the state of a connection can itself be one of the criteria that triggers a rule within a policy.
Advanced or next generation firewalls can perform stateless and stateful packet filtering and application layer filtering as discussed above. Next generation firewalls can also perform additional firewall techniques. For example, certain newer firewalls sometimes referred to as advanced or next generation firewalls can also identify users and content. In particular, certain next generation firewalls are expanding the list of applications that these firewalls can automatically identify to thousands of applications. Examples of such next generation firewalls are commercially available from Palo Alto Networks, Inc. (e.g., Palo Alto Networks' PA Series firewalls). For example, Palo Alto Networks' next generation firewalls enable enterprises to identify and control applications, users, and content—not just ports, IP addresses, and packets—using various identification technologies, such as the following: APP-ID for accurate application identification, User-ID for user identification (e.g., by user or user group), and Content-ID for real-time content scanning (e.g., controlling web surfing and limiting data and file transfers). These identification technologies allow enterprises to securely enable application usage using business-relevant concepts, instead of following the traditional approach offered by traditional port-blocking firewalls. Also, special purpose hardware for next generation firewalls (implemented, for example, as dedicated appliances) generally provides higher performance levels for application inspection than software executed on general purpose hardware (e.g., such as security appliances provided by Palo Alto Networks, Inc., which use dedicated, function specific processing that is tightly integrated with a single-pass software engine to maximize network throughput while minimizing latency).
Advanced or next generation firewalls can also be implemented using virtualized firewalls. Examples of such next generation firewalls are commercially available from Palo Alto Networks, Inc. (e.g., Palo Alto Networks' VM Series firewalls, which support various commercial virtualized environments, including, for example, VMware® ESXi™ and NSX™, Citrix® Netscaler SDX™, KVM/OpenStack (Centos/RHEL, Ubuntu®), and Amazon Web Services (AWS)). For example, virtualized firewalls can support similar or the exact same next-generation firewall and advanced threat prevention features available in physical form factor appliances, allowing enterprises to safely enable applications flowing into, and across their private, public, and hybrid cloud computing environments. Automation features such as V1\4 monitoring, dynamic address groups, and a REST-based API allow enterprises to proactively monitor VM changes, dynamically feeding that context into security policies, thereby eliminating the policy lag that may occur when VMs change.
In the example environment shown in
Although illustrated as a single element in
Appliance 112 can take a variety of forms. For example, appliance 112 can comprise a dedicated device or set of devices. The functionality provided by appliance 112 can also be integrated into or executed as software on a general purpose computer, a computer server, a gateway, and/or a network/routing device. In some embodiments, services provided by data appliance 112 are instead (or in addition) provided to client 104 by software executing on client 104.
In the example shown in
In various embodiments, appliance 112 is configured to work in cooperation with a security platform (e.g., platform 102). As one example, platform 102 can provide to appliance 112 a set of signatures of known-malicious files (e.g., as part of a subscription). If a signature for malware 130 is included in the set, appliance 112 can prevent the transmission of malware 130 to client 104 accordingly. As another example, platform 102 can provide to appliance 112 a list of known malicious domains, allowing appliance 112 to block traffic between network 110 and (for example) C&C server 150. The list of malicious domains can also help appliance 112 determine when one of its nodes has been compromised. For example, if client 104 attempts to contact C&C server 150, such attempt is a strong indicator that client 104 has been compromised by malware (and remedial actions should be taken accordingly, such as quarantining client 104 from communicating with other nodes within network 110).
In various embodiments, data appliance 112 includes a DNS module 114, which is configured to receive (e.g., from security platform 102) a list of domains (e.g., a list of attack domains and/or a list of NXNS domains) for which queries (e.g., made by client device 104), if observed (e.g., within network 110), are problematic. DNS module 114 can also be configured to send (e.g., to platform 102) DNS query data (e.g., logs of DNS requests made by clients such as clients 104-108). DNS module 114 can be integrated into appliance 112 (as shown in
An embodiment of a data appliance is shown in
Functionality described herein as being performed by data appliance 112 can be provided/implemented in a variety of ways. For example, data appliance 112 can be a dedicated device or set of devices. The functionality provided by data appliance 112 can also be integrated into or executed as software on a general purpose computer, a computer server, a gateway, and/or a network/routing device. In some embodiments, at least some services described as being provided by data appliance 112 are instead (or in addition) provided to a client device (e.g., client device 104) by software executing on the client device (e.g., endpoint protection application 180).
Whenever data appliance 112 is described as performing a task, a single component, a subset of components, or all components of data appliance 112 may cooperate to perform the task. Similarly, whenever a component of data appliance 112 is described as performing a task, a subcomponent may perform the task and/or the component may perform the task in conjunction with other components. In various embodiments, portions of data appliance 112 are provided by one or more third parties. Depending on factors such as the amount of computing resources available to data appliance 112, various logical components and/or features of data appliance 112 may be omitted and the techniques described herein adapted accordingly. Similarly, additional logical components/features can be included in embodiments of data appliance 112 as applicable. One example of a component included in data appliance 112 in various embodiments is an application identification engine which is configured to identify an application (e.g., using various application signatures for identifying applications based on packet flow analysis). For example, the application identification engine can determine what type of traffic a session involves, such as Web Browsing—Social Networking; Web Browsing —News; SSH; and so on.
As shown, data appliance 112 comprises a firewall, and includes a management plane 132 and a data plane 134. The management plane is responsible for managing user interactions, such as by providing a user interface for configuring policies and viewing log data. The data plane is responsible for managing data, such as by performing packet processing and session handling.
Network processor 136 is configured to receive packets from client devices, such as client device 108, and provide them to data plane 134 for processing. Whenever flow module 138 identifies packets as being part of a new session, it creates a new session flow. Subsequent packets will be identified as belonging to the session based on a flow lookup. If applicable, SSL decryption is applied by SSL decryption engine 140. Otherwise, processing by SSL decryption engine 140 is omitted. Decryption engine 140 can help data appliance 112 inspect and control SSL/TLS and SSH encrypted traffic, and thus help to stop threats that might otherwise remain hidden in encrypted traffic. Decryption engine 140 can also help prevent sensitive content from leaving enterprise network 110. Decryption can be controlled (e.g., enabled or disabled) selectively based on parameters such as: URL category, traffic source, traffic destination, user, user group, and port. In addition to decryption policies (e.g., that specify which sessions to decrypt), decryption profiles can be assigned to control various options for sessions controlled by the policy. For example, the use of specific cipher suites and encryption protocol versions can be required.
Application identification (APP-ID) engine 142 is configured to determine what type of traffic a session involves. As one example, application identification engine 142 can recognize a GET request in received data and conclude that the session requires an HTTP decoder. In some cases, e.g., a web browsing session, the identified application can change, and such changes will be noted by data appliance 112. For example, a user may initially browse to a corporate Wiki (classified based on the URL visited as “Web Browsing—Productivity”) and then subsequently browse to a social networking site (classified based on the URL visited as “Web Browsing—Social Networking”). Different types of protocols have corresponding decoders 144.
Based on the determination made by application identification engine 142, the packets are sent to an appropriate decoder 144. Decoder 144 is configured to assemble packets (which may be received out of order) into the correct order, perform tokenization, and extract out information. Decoder 144 also performs signature matching to determine what should happen to the packet. As needed, SSL encryption engine 146 can re-encrypt decrypted data. Packets are forwarded using a forward module 148 for transmission (e.g., to a destination).
As also shown in
Security platform 202 receives DNS query information (e.g., passive DNS data) from a variety of sources (208-212), using a variety of techniques. Sources 208-212 collectively provide platform 202 with approximately five billion unique records each day. An example of a record is:
The record indicates that, on Jan. 1, 2022, a DNS query was made for the site “abc.com” and at that time, the response provided was the IP address “199.181.132.250” (an “Address record” or “A record”). As used throughout the Specification, references to an “A record” can include both IPv4 (A) address records and IPv6 (AAAA) address records, based, for example, on implementation. In some cases, additional information can also be included. For example, an IP address associated with the requestor may be included in the passive DNS, or may be omitted (e.g., due to privacy reasons). Another example of a record is:
The record indicates that, on Jan. 2, 2022, a DNS query was made for the site “xyz.abc.com” and at that time, the response provided (also referred to as a “referral response” or “Nameserver (NS) record”) was to query the nameserver at ns.abc.com for more information about “xyz.abc.com.”
Source 208 is a real-time feed of globally collected passive DNS. An example of such a source is Farsight Security Passive DNS. In particular, records from source 208 are provided to platform 202 via an nmsgtool client, which is a utility wrapper for the libnmsg API that allows messages to be read/written across a network. Every 30 minutes, a batch process 216 (e.g., implemented using python) loads records newly received from source 208 into an Apache Hadoop cluster (HDFS) 214.
Source 210 is a daily feed of passive DNS associated with malware. An example of such a source is the Georgia Tech Information Security Center's Malware Passive DNS Data Daily Feed. Records from source 210 are provided to platform 202 as a single file via scp and then copied into HDFS 214 (e.g., using copyFromLocal on the file location 218 (e.g., a particular node in a cluster configured to receive data from source 210)).
As previously mentioned, appliance 112 can collect DNS queries made by clients 104-108 and provide passive DNS data to platform 202. In some embodiments, appliances such as appliance 112 directly provide the passive DNS information to platform 202. In other embodiments, appliance 112 (along with many other appliances) provides the passive DNS information to an intermediary, which in turn provides the information to platform 202. In the example shown in
As shown in
Among other features, the DNS system was designed to be highly available, fault tolerant, and provide quick responses. Unfortunately, these features can also be leveraged by a malicious individual for nefarious purposes. Suppose client device 302 has been compromised (e.g., malware 130 has compromised client device 302) and an attacker would like to use client device 302 to help perpetrate a DDoS attack. The attacker has registered the domain, “attacker.com,” and causes client device 302 to make a DNS query for the domain, “000.attacker.com” which does not exist (is an NXNS domain). In this scenario, the same sequence of events S2-S6 occurs as in the benign scenario (e.g., querying root 306, querying an applicable TLD nameserver (ns1.com), and querying an applicable SLD nameserver (ns.attacker.com)). However, in this scenario, SLD nameserver 308 is malicious (and under the control of the attacker). Its response, rather than being an IP address, is to instruct resolver 304 to query a set of SLD nameservers that are not under the control of the attacker (which is allowed by the DNS recursive protocol). In this example, the instructions refer to nonexistent victim nameservers.
In a typical, benign DNS resolution scenario, a handful of legitimate nameservers will be provided as answers (e.g., ns1.edu and ns2.edu or ns1.panw.edu and ns2.panw.edu). At each step along the way, querying all of the returned nameservers simultaneously represents a low communication overhead while getting the benefits of, e.g., quick responses, high availability, and fault tolerance. However, the DNS protocol allows for many more nameservers to be provided in a response. This can be exploited by the operator of attacker.com who could, for example, cause nameserver 308 to respond with hundreds of randomly generated NXNS domains purporting to belong to the victim's DNS hierarchy. Following the DNS protocol, resolver 304 will then query all of the NXNS domains (e.g., 111.victim.com, 222.victim.com, etc.) in parallel, flooding the traffic between resolver 304 and victim.com's legitimate nameservers. The packet amplification factor can be as high as 1620×, subjecting both the resolver and the victim nameservers to DDoS attacks (particularly if many different compromised clients are coordinated by the attacker to simultaneously make such NXNS queries). In addition to targeting SLD nameservers, this type of attack can also be used against TLD and root nameservers (e.g., by attack server 308 varying the different components included in its response to queries).
One way to mitigate some of the harm that NXNS attacks can pose is to modify resolver behavior (e.g., to limit the number of nameserver domains contacted per referral response). While this approach might be taken by large cloud public DNS resolvers (e.g., provided by Google or CloudFlare) with teams of sophisticated engineers, smaller organizations (e.g., a typical enterprise or small business entity) using edge resolvers or relying on their ISPs' resolvers might not be in a position to modify resolver behavior, lack the technical acumen to do so, or even know that such an action should be taken. An alternate approach is for data appliance 112 and/or security platform 102 to provide protection, e.g., to a resolver operating on behalf of network 110. Further, by identifying attack domains (using embodiments of techniques described herein), additional protections can be achieved (e.g., by identifying compromised client devices and taking remedial action).
Out-of-Bailiwick is a term that explains whether a response in which the nameserver is answering is authoritative for an ancestor of the owner name in the response. Using the example data shown in
Returning to
Although the foregoing embodiments have been described in some detail for purposes of clarity of understanding, the invention is not limited to the details provided. There are many alternative ways of implementing the invention. The disclosed embodiments are illustrative and not restrictive.
This application claims priority to U.S. Provisional Patent Application No. 63/340,312 entitled IDENTIFY AND BLOCK DOMAINS USED FOR NXNS-BASED DDOS ATTACKS filed May 10, 2022 which is incorporated herein by reference for all purposes.
Number | Date | Country | |
---|---|---|---|
63340312 | May 2022 | US |