The term computer “hacker” is increasingly used to refer to persons who engage in malicious or illegal activities to gain access to, or attack computer systems without authorization. Such activities by hackers have evolved far beyond those that were simply pranks or caused some minor inconveniences into a key component of highly organized criminal enterprises in which billions of dollars can be made each year.
Hackers often seek to launch attacks on computer systems in an automated manner by using large networks called “botnets” of compromised computers called “bots” (i.e., software robots) or “drones.” While bots can be supported by hosts that bypass most local Internet content regulation (so called “bullet-proof hosts”), bots are primarily found in computers used by innocent home users who are completely unaware that their systems have been taken over and are being used for illegitimate purposes. Botnets are thought to be organized in herds as large as one hundred thousand to a half million or more bots that can be geographically spread over many countries.
Botnets can employ both active and passive attacks. In an active attack, a botnet originates attacking traffic such as spam, adware, or denial of service (“DoS”) traffic which is sent over a network such as the Internet to its victims. In a passive attack, bots function as servers which, when accessed by a user, serve malware such as viruses, rootkits, trojan horses etc., typically using HTTP (Hypertext Transfer Protocol).
Reputation services have been established to address the problem of automated attacks and other hacker activities by compiling black lists of URLs (Uniform Resource Locators) and IP (Internet Protocol) addresses of known adversaries. A variety of technologies such as mail relay servers, firewalls, and unified threat management gateways can query the reputation service through an online connection to decide whether to accept traffic from, or send traffic to, a given computer on the Internet.
Current reputation services often run their own laboratories that are equipped with a variety of tools which are used to scan the Internet to locate adversaries and establish the reputation. These tools include web crawlers, honeypots (passive, dummy data or network sites that appear to contain information of value to attract attackers), honey monkeys (virtual computers that visit websites and seek code designed to attack a computer), virtual machines, and other global sensors.
Reputation services face several significant challenges that can affect their use and success in combating hackers. For example, reputation services must reliably detect and confirm adversaries that are deployed in vast numbers all over the world. Hackers can also change URLs and IP addresses of bots frequently, so reputation services must be able to dynamically respond with equal speed to detect them and not block legitimate users who might reuse the same URL or IP address a few hours later. This problem of false positives in which URLs and IP addresses of innocent (i.e., non-malicious) computers are wrongly identified as adversaries can cause significant disruptions to users and result in high costs to service providers to resolve disputes and restore services.
In addition, reputation services need to successfully deal with hackers who are increasingly targeting attacks on small sets of customer or enterprise networks that frequently go unobserved by the technologies of existing reputation services.
This Background is provided to introduce a brief context for the Summary and Detailed Description that follow. This Background is not intended to be an aid in determining the scope of the claimed subject matter nor be viewed as limiting the claimed subject matter to implementations that solve any or all of the disadvantages or problems presented above.
An automated arrangement for detecting adversaries is provided in which assessments of detected adversaries are reported to a reputation service from security devices, such as unified threat management (“UTM”) systems, in deployed enterprise networks. By using actual deployed networks, the number of available sensors can be very large to increase the scope of the adversary detection, while still observing real attacks and threats including those that are targeted to small sets of customers. The reputation service performs a number of correlations and validations on the received assessments to then return a reputation back to the security device that can be used for blocking adversaries.
In various illustrative examples, the assessment includes a URL or IP address of the adversary plus a severity level (e.g., low, medium, high, critical) of the incident associated with the attack or malware. The reputation service verifies that the assessment comes from a authenticated known sources, to make it expensive for a hacker to misuse assessments to damage the service through DoS attacks or the filing of false reports. Authentication may be performed using a certificate to sign assessments or other cryptographic methods.
Each reported assessment is assigned a time-to-live (“TTL”) value that sets the length of the time the assessment is valid to deal with an adversary changing IP addresses and URLs of bots under its control. If, after a TTL expires, the same adversary is detected again, another assessment is sent where the TTL is increased, for example, using an algorithm that increases the TTL value with each recurring detection (e.g., from an initial TTL value of 4 hours to 8 hours, then 16 hours, and so forth upon each detection recurrence).
The reputation service establishes fidelity (i.e., confidence level) of a reputation according to the number of enterprises or customers reporting the same adversary. Only when multiple, distinct enterprises report the same adversary in valid assessments (i.e., those with an unexpired TTL) will the reputation have sufficient fidelity to be sent back to the reporting enterprises to be used to actually block adversaries.
Certain types of hosts such as large proxies and share web hosting sites are included in a list of known exclusions and are not subjected to blocking. In addition, privacy concerns may be addressed by hashing the adversary's URL or IP address in the assessments reported by the security devices in the network to thereby mask the identity of particular adversaries that affect particular enterprises or customers.
This Summary is provided to introduce a selection of concepts in a simplified form that are further described below in the Detailed Description. This Summary is not intended to identify key features or essential features of the claimed subject matter, nor is it intended to be used as an aid in determining the scope of the claimed subject matter.
Similar reference numerals indicate similar elements in the drawings.
An illustrative botnet 132 comprises a herd 137 of compromised hosts, such as home computers with broadband connectivity. Botnet 132 typically uses the Internet 125 in its attempts to attack hosts inside the customer networks 105. Botnet herd 137 can include any number of compromised hosts and could number in the several hundred thousands or even millions. Botnet 132 is typically spread over the globe and can thus be very difficult to deactivate.
Details of botnet 132 and herd 137 are shown in
A payloads runner 221 includes a number of functional modules to carry out the malicious purposes of the botnet 132. These functional modules include a keylogger 227 that may be used for identity theft, a spam proxy 230 for sending spam, a back-end phishing web server 235 for hosting spoofed web sites, a DoS flooder 241 for supporting DoS for extortion or other attacks, and a click fraud module 246 that provides automated click throughs on web advertising in order to fraudulently gain per-click advertising revenue or reduce advertising costs by manipulating auctions.
An update engine 250 is arranged to keep the compromised host updated in order to dynamically reconfigure the host as may be required to conduct a variety of nefarious activities (it is believed that botnet herders lease botnets to various “customers” on a time-share basis). A control agent 253 pulls commands from an IRC (Internet Relay Chat) server 265 on a bullet-proof host or hosted by another drone in the botnet 132. A command and control center 261 provides a centralized command post that is either supported by a bullet-proof host or another drone in the botnet 132.
Returning back to
In accordance with the principles of the present adversary detection arrangement, UTMs 121 are arranged to communicate with a reputation service 140. UTMs 121 report assessments of detected adversaries via telemetry that it uploads to the reputation service 140. As UTMs 121, or other security products having similar adversary detection functionalities, are commonly utilized by enterprises, businesses, and corporations, they can thus number in the hundreds of thousands to provide a very broad array of distributed adversary detection sensors. In addition, consumer products such as network routers and set top televisions terminals etc., may also be arranged to incorporate detection functionalities and thus be used to report assessments. It is emphasized that benefits of the present arrangement can be substantially realized even with a relatively small number of UTMs (e.g., a few dozen). In addition, assessments may be optionally collected from security products deployed in other networks, including home networks and other known resources, as collectively identified as third parties feeds 146 in
As shown in
There is a possibility that a hacker might create false assessments in an attempt to damage the credibility of the reputation service 140, or launch a DoS attack, for example on a legitimate website and falsely accuse it of participating in malicious attacks. To prevent reporting of such false assessments, the reputation service 140 authenticates the UTMs 121 making the reports through use of a unique customer identification or security certificate to prove that the reporting UTM is legitimate and not a hacker. In addition, if false information is discovered by the reputation service, it can be traced back to a source UTM, and all further assessments sent from that UTM will be disregarded.
As indicated by reference numeral 325, an assessment, in this illustrative example, includes data fields which contain the URL or IP address of the detected adversary, a severity of the incident associated with the attack by the adversary, and a time-to-live (“TTL”) value. The severity (e.g., low, medium, high, critical) describes the seriousness of an incident that is associated with the attack, which as noted above, can be both active and passive. For example, a host in customer network 105 (
Each reported assessment has an assigned TTL value that defines the time interval over which the assessment is valid. Once the TTL expires, the assessment is no longer valid. The TTL is utilized in recognition that hackers can often quickly change the URL or IP addresses of the bots in their botnets. By using the TTL, the possibility is lessened for blocking a legitimate user who subsequently uses a URL or IP address after it has been abandoned by the hacker.
In this illustrative example, the default initial TTL value is four hours. However, if a UTM 121 detects the same adversary on a recurring basis, the TTL value in its subsequent reported assessments will be extended in time. Various types of algorithms may be used to extend the TTL value according to the needs of a specific application of the present adversary detection arrangement. For example, an exponentially increasing or geometrically increasing algorithm can be applied to double each TTL value with each recurring detection (e.g., 4, 8, 16 hours . . . ).
As noted above, the reputation service 140 uses the collected assessments from the UTMs 121 to generate the reputation 318. To do this, the reputation service 140 correlates the collected assessments to derive a fidelity (i.e., level of confidence) that will be associated with the reputation 318. In some implementations, such reputation fidelity can be reported back to the UTMs 121 with various levels, for example, low, medium, or high fidelity. Alternatively, fidelity can be arranged in a binary fashion (i.e., a reputation has sufficient fidelity to be relied upon, or has no fidelity). In either case, in accordance with the principles herein, a set or predetermined amount of fidelity must be present before a reputation may be used by a UTM to block traffic.
Various techniques or algorithms may be used to establish fidelity and the local rules governing assessment use may vary, but the general principle applied by all is that multiple, distinct UTMs (i.e., UTMs operating on different customer networks) must corroborate an adversary so that no single assessment is used to generate a reputation at the reputation service 140. For example, one technique would be to require a minimum number, such as 10, valid (i.e., having unexpired TTLs) assessments identifying the same adversary received from distinct UTMs, in order for the reputation service 140 to generate a reputation 318 having high fidelity. In this example, only high fidelity reputations are allowed to be used by a UTM 121 to block an adversary. In another example, a low fidelity reputation is generated when between one and five assessments identifying the same adversary are received from distinct UTMs. But in this example, a given UTM 121 might apply a different local rule to block the adversary associated with the low fidelity reputation, but only if the UTM also detects the adversary. Thus, corroboration from an outside source, even if it is low fidelity, is sufficient evidence when combined with the UTM's own observations to warrant taking a blocking action. It may also be desirable to use a fidelity algorithm that adjusts according to the severity of the reported incidents. For example, high or critical severity incidents that have the potential to cause greater harm might require fewer assessments from multiple, distinct sources to generate a high-fidelity reputation than when the severity is lower. Another illustrative technique is to increase the fidelity of a reputation when multiple different types of attacks are launched from the same adversary. For example, a hacker (or an entity leasing a botnet) might use a botnet for spam at one given time, and then for a DoS attack, followed by a identity theft attack. In this case, the reputation service 140 can assign greater fidelity to the reputation for the source initiating these multiple types of attack, even if such multiple attacks are directed to a smaller number of nodes (e.g., UTMs 121) which would result in a fewer number of reported assessments to the reputation service 140.
By requiring correlation of assessments from multiple distinct sources before issuing a reputation, the present arrangement prevents a hacker from simply installing a pirated UTM in order to influence a reputation. This provides a measure of security that may be used in addition to the use of authentication of assessment sources. Should a hacker attempt to provide assessments including fraudulent or misleading data, the fact that such assessments are not corroborated by other UTMs 121 can be used as a justification for revoking the certificate for the pirated UTM.
In some applications of the present arrangement, a UTM 121 sends a request 330 to the reputation service when it encounters an unknown URL or IP address to check the reputation of the URL or IP address before allowing access to the URL or accepting traffic from the IP address. In other applications, reputations are generated and sent to the UTMs 121 whenever a sufficient number of assessments are collected and correlated by the reputation service 140 irrespective of an explicit request.
As indicated by reference numeral 331 in
The fidelity field contains a fidelity indicator such as low, medium, high etc. The fidelity field can be optionally eliminated in some implementations. It is typically not used in cases where reputations are binary in nature and are thus only generated and sent when the derived fidelity reaches a predetermined threshold (and which would make reporting the actual fidelity value somewhat meaningless). The optionally-utilized TTL value in a reputation may similarly vary according to the requirements of a specific application. For example, a TTL value for the reputation 318 might be selected to be equal to the largest TTL value contained in a received assessment and a UTM 121 should block the URL or IP address only so long as the TTL remains valid. In other illustrative examples, no TTL is used and the reputation stays valid until it is explicitly revoked.
For the second special case of shared web hosting sites like MySpace® and MSN Hotmail™, the known exclusion list 422 includes a white list of URLs associated with shared web hosting sites that will not be blocked since those sites are shared by many users, including legitimate users. In some implementations, it may be possible to block certain sites by path, but not by domain of the shared web hosting site.
It is possible that some customers operating networks that engage in the present arrangement with a reputation service may have privacy concerns and not want to disclose information on incidents and attacks. Accordingly, as an optional feature, instead of reporting actual URLs and IP addresses in an assessment 306 (
Although the subject matter has been described in language specific to structural features and/or methodological acts, it is to be understood that the subject matter defined in the appended claims is not necessarily limited to the specific features or acts described above. Rather, the specific features and acts described above are disclosed as example forms of implementing the claims.
This application claims the benefit of U.S. Provisional Patent Application Ser. No. 60/923,663, filed Apr. 16, 2007, entitled “Detection of Adversaries through Collection and Correlation of Assessments,” which is incorporated herein by reference in its entirety.
Number | Name | Date | Kind |
---|---|---|---|
5097469 | Douglas | Mar 1992 | A |
5497460 | Bailey et al. | Mar 1996 | A |
5892903 | Klaus | Apr 1999 | A |
5982753 | Pendleton et al. | Nov 1999 | A |
6088804 | Hill et al. | Jul 2000 | A |
6415313 | Yamada et al. | Jul 2002 | B1 |
6415329 | Gelman et al. | Jul 2002 | B1 |
6711137 | Klassen et al. | Mar 2004 | B1 |
6772214 | McClain et al. | Aug 2004 | B1 |
6807156 | Veres | Oct 2004 | B1 |
6859829 | Parupudi et al. | Feb 2005 | B1 |
6934290 | Story | Aug 2005 | B2 |
6938080 | Kahveci et al. | Aug 2005 | B1 |
6941384 | Aiken et al. | Sep 2005 | B1 |
6993584 | Border | Jan 2006 | B2 |
6996631 | Aiken et al. | Feb 2006 | B1 |
7007089 | Freedman | Feb 2006 | B2 |
7007302 | Jagger et al. | Feb 2006 | B1 |
7080141 | Beekelmans et al. | Jul 2006 | B1 |
7082467 | Border | Jul 2006 | B2 |
7085925 | Hanna et al. | Aug 2006 | B2 |
7152105 | McClure et al. | Dec 2006 | B2 |
7174454 | Roskind | Feb 2007 | B2 |
7203753 | Yeager et al. | Apr 2007 | B2 |
7430611 | Aiken et al. | Sep 2008 | B2 |
7596097 | McCowan et al. | Sep 2009 | B1 |
7774361 | Nachenberg et al. | Aug 2010 | B1 |
7817668 | Igarashi et al. | Oct 2010 | B2 |
20020147837 | Heller | Oct 2002 | A1 |
20020150041 | Reinshmidt et al. | Oct 2002 | A1 |
20030014665 | Anderson et al. | Jan 2003 | A1 |
20030018585 | Butler et al. | Jan 2003 | A1 |
20030018930 | Mora et al. | Jan 2003 | A1 |
20030023876 | Bardsley et al. | Jan 2003 | A1 |
20030028614 | Jeon | Feb 2003 | A1 |
20030043802 | Yazaki et al. | Mar 2003 | A1 |
20030126227 | Zimmerman et al. | Jul 2003 | A1 |
20030159069 | Choi et al. | Aug 2003 | A1 |
20030163729 | Buchegger | Aug 2003 | A1 |
20030177213 | Wallace | Sep 2003 | A1 |
20030217173 | Butt et al. | Nov 2003 | A1 |
20040044912 | Connary et al. | Mar 2004 | A1 |
20040083408 | Spiegel et al. | Apr 2004 | A1 |
20040109477 | Lee et al. | Jun 2004 | A1 |
20040122926 | Moore et al. | Jun 2004 | A1 |
20040187032 | Gels et al. | Sep 2004 | A1 |
20040250114 | Parekh et al. | Dec 2004 | A1 |
20050052998 | Oliver et al. | Mar 2005 | A1 |
20050091355 | Keohane et al. | Apr 2005 | A1 |
20050259633 | Kato | Nov 2005 | A1 |
20050283527 | Corrado et al. | Dec 2005 | A1 |
20060004918 | Lubeck et al. | Jan 2006 | A1 |
20060042483 | Work et al. | Mar 2006 | A1 |
20060095970 | Rajagopal et al. | May 2006 | A1 |
20060129382 | Anand et al. | Jun 2006 | A1 |
20060153200 | Filsfils et al. | Jul 2006 | A1 |
20060182036 | Sasagawa et al. | Aug 2006 | A1 |
20060200487 | Adelman et al. | Sep 2006 | A1 |
20060212931 | Shull et al. | Sep 2006 | A1 |
20060224709 | Johnson et al. | Oct 2006 | A1 |
20060242268 | Omernick et al. | Oct 2006 | A1 |
20060253458 | Dixon et al. | Nov 2006 | A1 |
20060253578 | Dixon et al. | Nov 2006 | A1 |
20060253580 | Dixon et al. | Nov 2006 | A1 |
20060253581 | Dixon et al. | Nov 2006 | A1 |
20060253582 | Dixon et al. | Nov 2006 | A1 |
20060253583 | Dixon et al. | Nov 2006 | A1 |
20060253584 | Dixon et al. | Nov 2006 | A1 |
20060265760 | Daemke et al. | Nov 2006 | A1 |
20060272018 | Fouant | Nov 2006 | A1 |
20060294579 | Khuti et al. | Dec 2006 | A1 |
20070005787 | Igarashi et al. | Jan 2007 | A1 |
20070064626 | Evans | Mar 2007 | A1 |
20070078699 | Scott et al. | Apr 2007 | A1 |
20070094725 | Borders | Apr 2007 | A1 |
20070127393 | Car | Jun 2007 | A1 |
20070169194 | Church et al. | Jul 2007 | A1 |
20080025218 | Liu | Jan 2008 | A1 |
Number | Date | Country |
---|---|---|
02093334 | Nov 2002 | WO |
2006074294 | Jul 2006 | WO |
Entry |
---|
International Search Report from corresponding PCT Application No. PCT/US2008/057742 dated Jul. 23, 2008, 3 pages. |
Allen, et al., “State of the Practice of Intrusion Detection Technologies”, Date: Jan. 2000, pp. 1-242. |
Larson., “Aspects of Adapting Data Collection to Intrusion Detection”. pp. 1-128, 2006. |
Agarwal et al., “An Infrastructure for Passive Network Monitoring of Applicaiton Data Streams”, http://congo.postech.ac.kr/PAPER/MONITORING/PAM/2003/agarval.pdf, Year 2003. |
Balakrishnan et al., “TCP Behavior of a Busy Internet Server: Analysis and Improvements”, http://nms.ics.mit.edu/hari/papers/infocom98.ps.gz, Year 1998. |
Padmanabhan et al., “Network Tomography Using Passive End-to-End Measurements”, http://research.microsoft.com/padmanab/papers/dimacs2002.pdf, Year 2002. |
Paxson Vern, “End-to-End Internet Packet Dynamics”, Date: Jun. 1999, vol. 7, http://delivery.acm.org/10.1145/320000/312234/00779192.pdf?key1=312234&key2=0458187611&coll-GUIDE$dl=GUIDE&CEID=10427251$cetoken=17506759. |
Bace et al., “NIST Special Publication on Intrusion Detection Systems” 2007, downloaded at http://www.21cfrpart11.com/files/library/reg—guid—docs/nist—intrusiondetectionsys.pdf pp. 1-51. |
Alunkal, Beulah Kurian, “Grid Eigen Trust a Framework for Computing Reputation in Grids”, Date: Dec. 2003, Chicago, Illinois (93 pages). |
Baker, et al., “Local Control over Filtered WWW access”, http://cm.bell-labs.com/who/bsb/papers/signet/Overview.html, Date: May 7, 2007 (10 pages). |
Kostoulas, Dionysios Konstantinos, “Distibuted reputation-based mechansims for trust propagation in large groups”, Date: 2002, Urbana, Illinois (93 pages). |
Lik Mui et al., “A computational Trust Model of Confidence and Reputation”, Proceedings of the 35th Annual Hawaii International Conference on System Sciences, pp. 2431-2439, Jan. 2002 (10 pages total). |
Extended European Search Report issued Nov. 15, 2013 in connection with corresponding EP Patent Application No. 08732611.2 (10 pages total). |
Josang, et al., “Simulating the Effect of Reputation Systems on e-Markets”, Trust Management, First International Conference, ITRUST 2003, proceedings May 28-30, 2003, Heraklion, Crete, Greece May 30, 2003 (16 pages total). |
Number | Date | Country | |
---|---|---|---|
20080256619 A1 | Oct 2008 | US |
Number | Date | Country | |
---|---|---|---|
60923663 | Apr 2007 | US |