A portion of the disclosure of this patent document contains material which is subject to copyright protection. The copyright owner has no objection to the facsimile reproduction by anyone of the patent document or the patent disclosure, as it appears in the Patent and Trademark Office patent file or records, but otherwise reserves all copyright rights whatsoever.
This application is related to the following patent application, which is hereby incorporated by reference in its entirety:
U.S. patent application Ser. No. 13/545,796, entitled “SYSTEM AND METHOD FOR SUPPORTING SUBNET MANAGEMENT PACKET (SMP) FIREWALL RESTRICTIONS IN A MIDDLEWARE MACHINE ENVIRONMENT”, inventors Bjorn Dag Johnsen, Roy Arntsen and Lars Paul Huse, filed Jul. 10, 2012.
The present invention is generally related to computer systems and software such as middleware, and is particularly related to supporting a middleware machine environment.
The interconnection network plays a beneficial role in the next generation of super computers, clusters, and data centers. High performance network technology, such as the InfiniBand (IB) technology, is replacing proprietary or low-performance solutions in the high performance computing domain, where high bandwidth and low latency are the key requirements. For example, IB installations are used in supercomputers such as Los Alamos National Laboratory's Roadrunner, Tex. Advanced Computing Center's Ranger, and Forschungszcntrum Juelich's JuRoPa.
IB was first standardized in October 2000 as a merge of two older technologies called Future I/O and Next Generation I/O. Due to its low latency, high bandwidth, and efficient utilization of host-side processing resources, it has been gaining acceptance within the High Performance Computing (HPC) community as a solution to build large and scalable computer clusters. The de facto system software for IB is OpenFabrics Enterprise Distribution (OFED), which is developed by dedicated professionals and maintained by the OpenFabrics Alliance. OFED is open source and is available for both GNU/Linux and Microsoft Windows.
Described herein are systems and methods for providing switch based subnet management packet (SMP) traffic protection in a middleware machine environment. The middleware machine environment includes a network switch that operates to receive at least one SMP destined for a subnet management agent (SMA) components. The network switch can check whether the at least one SMP includes a correct management key, and prevent the at least one SMP from being forwarded to the destined SMA when at least one SMP does not include the correct management key. Furthermore, the network switch can specify a different management key for each external port and can enforce separate restrictions on ingress and egress SMP traffics at a particular external port.
Described herein is a system and method for providing a middleware machine or similar platform. In accordance with an embodiment of the invention, the system comprises a combination of high performance hardware, e.g. 64-bit processor technology, high performance large memory, and redundant InfiniBand and Ethernet networking, together with an application server or middleware environment, such as WebLogic Suite, to provide a complete Java EE application server complex which includes a massively parallel in-memory grid, that can be provisioned quickly, and can scale on demand. In accordance with an embodiment, the system can be deployed as a full, half, or quarter rack, or other configuration, that provides an application server grid, storage area network, and InfiniBand (IB) network. The middleware machine software can provide application server, middleware and other functionality such as, for example, WebLogic Server, JRockit or Hotspot JVM, Oracle Linux or Solaris, and Oracle VM. In accordance with an embodiment, the system can include a plurality of compute nodes, IB switch gateway, and storage nodes or units, communicating with one another via an IB network. When implemented as a rack configuration, unused portions of the rack can be left empty or occupied by fillers.
In accordance with an embodiment of the invention, referred to herein as “Sun Oracle Exalogic” or “Exalogic”, the system is an easy-to-deploy solution for hosting middleware or application server software, such as the Oracle Middleware SW suite, or Weblogic. As described herein, in accordance with an embodiment the system is a “grid in a box” that comprises one or more servers, storage units, an IB fabric for storage networking, and all the other components required to host a middleware application. Significant performance can be delivered for all types of middleware applications by leveraging a massively parallel grid architecture using, e.g. Real Application Clusters and Exalogic Open storage. The system delivers improved performance with linear I/O scalability, is simple to use and manage, and delivers mission-critical availability and reliability.
M_Key Protection Model
In the IB fabric 100, a secure HCA firmware implementation in HCA 121-124 can keep the type and identity of various fabric nodes well defined. Each of the HCA 121-124 can implement a subnet management agent (SMA) components 131-134, each of which can be associated with an M_Key 141-144. Furthermore, the connected switches A-B 103-104 can be controlled by the fabric administrator 110. Thus, any rogue SMA implementation 131-134 may not compromise the fabric administrator 110 defined M_Key 102 values that are used in the IB subnet/fabric 100.
Additional descriptions of various embodiments of using secure HCA firmware implementation in a middleware machine platform are provided in U.S. patent application Ser. No. 13/487,973, entitled “SYSTEM AND METHOD FOR PROVIDING SECURE SUBNET MANAGEMENT AGENT (SMA) IN AN INFINIBAND (IB) NETWORK”, filed Jun. 4, 2012, which application is herein incorporated by reference.
Furthermore, the fabric administrator 110 can ensure that new M_Key values 102 for the IB subnet/fabric 100 are installed out-of-band on switches 103-104 (as well as for the relevant subnet manager instances 101). Additionally, the fabric administrator 110 can ensure that there is infinite M_Key 102 lease time on the switches 103-104. Thus, the host based software 161-164, e.g. a host based subnet manager on different hosts 111-114 (including an operating system 151-154), can not hijack the control of any switch 103-104 in the IB subnet/fabric 100.
In accordance with an embodiment of the invention, a single M_Key 102 value (or a single set of M_Key values) can be used for various nodes in the in the IB subnet/fabric 100 based on the IB specification defined access restrictions. The correct value for a current M_Key 102 may need to be specified before either reading or updating the M_Key 102, since the secure HCA firmware can ensure that the “read protected” M_Key assigned to the local HCA 121-124 is not exposed to local host based software.
Additionally, local software 161-164 on different hosts 111-114 may be able to hi-jack the HCA port by setting up its own M_Key value, in the case when the current M_Key value for HCA ports is defined at run-time. Also, the host local software 161-164 may make the HCA port unmanageable for the designated subnet manager 101, e.g., before the designated subnet manager 101 sets up any M_Key 102 for the HCA 121-124.
In accordance with an embodiment of the invention, a designated subnet manager 101 can ignore any HCA ports with un-known M_Key value and leave the corresponding link not initialized. The only impact of a hijacked HCA port M_Key can be that the HCA port may not be operational, and the designated subnet manager 101 can prevent host based software from communicating via this HCA port using normal communication, i.e. non-SMP/VL15 based communication.
Furthermore, when host software 111-114 compromises the local HCA M_Key value, the offending host software may be able to bring the HCA port to an operational state with activated local identifiers (LIDs) and partition membership. In such a case, if the switch port on a switch 103-104 that connects to the HCA 121-124 is controlled by a different M_Key value that is not known to the host software 111-114 that has compromised the local HCA M_Key value, then the offending host software 111-114 may not be able to bring the link to a full operational state that allows normal data traffic.
In accordance with an embodiment of the invention, the IB fabric 100 can prevent direct route SMPs between the various hosts 111-114 in order to avoid various potentially threatening scenarios. In one scenario, a host, e.g. host 111, can use the direct route SMPs to hijack the M_Key of the HCA port on a remote host, e.g. 112, after the remote host 112 and/or the remote HCA 122 are reset. This can cause the remote HCA 122 port to become inaccessible from the SM 101 and thereby prevent the remote host 112 from participating in normal IB communication, i.e. a denial of service (DoS) attack. In another scenario, when two hosts, e.g. host 111 and host 114, are compromised by hackers, the cooperating administration in the IB fabric 100 that depends on direct route SMPs may allow the two compromised hosts to exchange information using direct route SMPs.
The IB fabric 100 can support the cooperating administration for exchanging information between different hosts 111-114 without depending on direct route SMPs. For example, the administrators for the hosts can access a shared web-page on the Internet instead of relying on direct route SMPs in the IB fabric 100. From a fabric security perspective, leaving direct route SMPs as a security hole on the IB fabric may be considered a worse situation than allowing both host administrators to access a shared web-page on the Internet.
In accordance with an embodiment of the invention, the HCA ports may be set up with finite lease time on M_Keys 102, e.g. due to a high availability concern with the subnet manager(s) 101 that maintains the M_Key 102 lease period. Thus, the M_Keys 102 can expire without the associated link going down. Consequently, the state of the HCA 121-124, e.g. the partition membership, may be updated while links are still in active mode and the LID routes for the involved port are still operational. Then, the IB fabric 100 without M_Key protection may mistakenly allow normal IB traffic between a hi-jacked host and the hosts in other partitions.
Furthermore, if the M_Keys 102 expire before the links going down, both the local HCA, e.g. HCA 121, and any remote HCA, e.g. HCA 124, may be hi-jacked and the partition membership may be modified. If the associated switch ports, e.g. on switches 103-104, are not set up to perform partition enforcement, then the traffic with non-solicited partition membership can reach any other node in the fabric.
Additionally, a subnet manager 101 within the IB fabric 100 can depend on a designated virtual lane (VL), e.g. the VL15 buffering, in order to correctly monitor and control the IB fabric 100 and negotiate with other subnet managers in the IB fabric 100. Since the VL15 buffering within the IB fabric 100 is a shared resource, the uncontrolled use of SMPs from any host can represent a DoS attack. This can affect subnet manager 101 operations, since the M_Key protection within the IB fabric 100 may prevent the hosts from changing any SMA state on any node. Thus, there is a need to provide SMP traffic protection in the IB fabric 100.
In accordance with an embodiment of the invention, the M_Key 102 can be created and managed by fabric administrators 110 and stored in secured memory on switches A-B 103-104 and/or HCAs 121-124. A microprocessor on a switch A-B 103-104 or a HCA 121-124 can access the memory for reading out the M_Key 102 or writing the M_Key 102 into the memory.
Switch Based Proxy M_Key Protection
In accordance with an embodiment of the invention, M_Key checking of SMPs can be performed in intermediate switch nodes in an IB fabric to ensure that the local switch M_Key setting can protect remote HCA ports that do not have an M_Key set up.
A subnet management agent (SMA) component 206, which is implemented in firmware 204 on the HCA 202, can communicate with other nodes in the IB fabric 200 via the switch port 211. Also, the designated subnet manager 208 in the IB fabric 200 can use a particular switch port, e.g. port 211, for both sending direct route SMP requests to any SMA 206 and receiving direct route SMP responses from the SMA 206.
The switch 201 can prevent un-intended SMP traffic occurring in the IB fabric 200, without depending on requiring all HCAs to have trusted firmware with SMP control. For example, the switch 201 can filter direct route SMP traffic that is not consistent with a fabric policy for the IB fabric 200.
The switch 201 can use a filtering scheme to prevent a remote HCA port 202 from being hijacked by an intruder. The filtering scheme can be based on identifying any direct route SMP request that targets setting an SMA 206 attribute. Additionally, the filtering scheme can perform the same M_Key check for all direct route SMP requests independently of the destination, and may require that the direct route SMP requests include the correct M_Key 209 for the local switch 201 independently of which destination the SMP is targeting.
A single M_Key 209 can be used in an IB fabric 200, which includes the switch 201 and the HCA ports 202 that directly connects to the switch 201. If the host stack software 205 is able to compromise the M_Key 219 that protects the local HCA 202, then the host stack software 205 may also compromise the M_Key 209 protecting the local switch 201, since all SMP traffic in the IB fabric 200 includes the local switch M_Key 209.
In accordance with an embodiment of the invention, the switch 201 implementation can specify an optional M_Key value for each external port, e.g. M_Keys 221-226 for external ports 211-216. Additionally, the switch 201 implementation can ensure that any SMP sent out from a switch port, e.g. switch port 211, and any SMP received from this port all have an M_Key value that matches the M_Key 221 specified for the switch port 211. Furthermore, the network switch 201 can enforce separate restrictions on SMPs 220 sending from an external port 214 and SMPs 210 receiving at the external port 214.
Using this mechanism, a legitimate subnet manager 208 can ensure that all potentially untrusted remote HCA ports or other potentially untrusted remote ports may only be allowed to send out SMPs, when the correct local M_Key associated with a switch port is specified. Also, attempts to access a remote port may need to have the subnet manager 208 defined M_Key for that port independently of whether the remote port M_Key has been hi-jacked or not. Additionally, this mechanism can also specify an SMP rate that defines how fast the remote port can generate SMP, in order to prevent or reduce the chance of SMP based DoS attacks from an untrusted port in the fabric.
Additional descriptions of various embodiments of using SMP filtering in a middleware machine platform are provided in U.S. Pat. No. 7,398,394, entitled “SYSTEM AND METHOD FOR AUTHENTICATING NODES IN A COMMUNICATION NETWORK”, filed Jun. 2, 2004 and issued Jul. 8, 2008, which application is herein incorporated by reference.
In accordance with an embodiment of the invention, a filtering scheme can prevent illegal host-host based direct route SMP traffic based on declaring the switch ports as either “trusted” or “not trusted.” The determination of whether a switch port, or the entity connecting to the switch port, is trusted or not can be based on either explicit policy input to the local switch 201, or automatic authentication of the remote port.
As shown in
In accordance with an embodiment of the invention, the ability to use a single M_Key 209, or a single set of M_Keys, throughout an IB subnet/fabric 200 depends on whether all nodes in the IB fabric 200 are trusted and do not expose the M_Key in use to any entity that does not have the required privileges. A pre-requisite for a subnet manager 208 to include the current M_Key in a request is that the subnet manager 208 can be assured that the target and any intermediate agent may not compromise the integrity of the M_Key. In one example, such trustfulness can be established prior to including the current M_Key in any SMP. Thus, the fabric configurations may require all nodes in the IB fabric, including the HCAs, to be authenticated (or declared) as trusted before any M_Key based communication can takes place, instead of assuming that all SMA instances are trustful a priori.
In accordance with an embodiment of the invention, a mechanism for sending and receiving vendor based SMPs via the switch management interface can be provided. Such a mechanism enables a switch embedded authentication mechanism to operate as part of the switch local software and thereby operate in concert with embedded subnet managers and the embedded switch driver and SMA stack.
The present invention may be conveniently implemented using one or more conventional general purpose or specialized digital computer, computing device, machine, or microprocessor, including one or more processors, memory and/or computer readable storage media programmed according to the teachings of the present disclosure. Appropriate software coding can readily be prepared by skilled programmers based on the teachings of the present disclosure, as will be apparent to those skilled in the software art.
In some embodiments, the present invention includes a computer program product which is a storage medium or computer readable medium (media) having instructions stored thereon/in which can be used to program a computer to perform any of the processes of the present invention. The storage medium can include, but is not limited to, any type of disk including floppy disks, optical discs, DVD, CD-ROMs, microdrive, and magneto-optical disks, ROMs, RAMs, EPROMs, EEPROMs, DRAMs, VRAMs, flash memory devices, magnetic or optical cards, nanosystems (including molecular memory ICs), or any type of media or device suitable for storing instructions and/or data.
The foregoing description of the present invention has been provided for the purposes of illustration and description. It is not intended to be exhaustive or to limit the invention to the precise forms disclosed. Many modifications and variations will be apparent to the practitioner skilled in the art. The embodiments were chosen and described in order to best explain the principles of the invention and its practical application, thereby enabling others skilled in the art to understand the invention for various embodiments and with various modifications that are suited to the particular use contemplated. It is intended that the scope of the invention be defined by the following claims and their equivalence.
This application claims priority to U.S. Provisional Patent Application No. 61/506,557, entitled “SYSTEM AND METHOD FOR USING UNICAST AND MULTICAST FLOODING MECHANISMS TO PROVIDE EoIB GATEWAY vNICs” filed Jul. 11, 2011, and U.S. Provisional Patent Application No. 61/645,517, entitled “SYSTEM AND METHOD FOR PROVIDING SECRET MANAGEMENT KEY IN A MIDDLEWARE MACHINE ENVIRONMENT” filed May 10, 2012, which applications are herein incorporated by reference.
Number | Name | Date | Kind |
---|---|---|---|
5758083 | Singh et al. | May 1998 | A |
6012100 | Frailong | Jan 2000 | A |
6038233 | Hamamoto et al. | Mar 2000 | A |
6098098 | Sandahl et al. | Aug 2000 | A |
6148336 | Thomas et al. | Nov 2000 | A |
6282678 | Snay | Aug 2001 | B1 |
6286038 | Reichmeyer | Sep 2001 | B1 |
6308148 | Bruins et al. | Oct 2001 | B1 |
6314531 | Kram | Nov 2001 | B1 |
6343320 | Fairchild et al. | Jan 2002 | B1 |
6658579 | Bell et al. | Dec 2003 | B1 |
6697360 | Gai | Feb 2004 | B1 |
6772320 | Raj | Aug 2004 | B1 |
6826694 | Dutta et al. | Nov 2004 | B1 |
6941350 | Frazier | Sep 2005 | B1 |
6981025 | Frazier et al. | Dec 2005 | B1 |
7023795 | Hwu | Apr 2006 | B1 |
7113995 | Beukema et al. | Sep 2006 | B1 |
7290277 | Chou et al. | Oct 2007 | B1 |
7398394 | Johnsen et al. | Jul 2008 | B1 |
7409432 | Recio et al. | Aug 2008 | B1 |
7636772 | Kirby et al. | Dec 2009 | B1 |
7721324 | Jackson | May 2010 | B1 |
7792987 | Vohra | Sep 2010 | B1 |
7843906 | Chidambaram et al. | Nov 2010 | B1 |
7860006 | Kashyap et al. | Dec 2010 | B1 |
7894440 | Xu et al. | Feb 2011 | B2 |
8054832 | Shukla et al. | Nov 2011 | B1 |
8149834 | Nielsen | Apr 2012 | B1 |
8391289 | Yalagandula | Mar 2013 | B1 |
8645524 | Pearson | Feb 2014 | B2 |
20020016858 | Sawada et al. | Feb 2002 | A1 |
20020133620 | Krause | Sep 2002 | A1 |
20030005039 | Craddock | Jan 2003 | A1 |
20030051026 | Carter | Mar 2003 | A1 |
20040013088 | Gregg | Jan 2004 | A1 |
20040028047 | Hou et al. | Feb 2004 | A1 |
20040037279 | Zelig et al. | Feb 2004 | A1 |
20040078709 | Beukema | Apr 2004 | A1 |
20040123142 | Dubal et al. | Jun 2004 | A1 |
20040168089 | Lee | Aug 2004 | A1 |
20050071709 | Rosenstock | Mar 2005 | A1 |
20050100033 | Arndt et al. | May 2005 | A1 |
20050163115 | Dontu | Jul 2005 | A1 |
20050198371 | Smith | Sep 2005 | A1 |
20060230219 | Njoku | Oct 2006 | A1 |
20070022479 | Sikdar et al. | Jan 2007 | A1 |
20070038703 | Tendjoukian et al. | Feb 2007 | A1 |
20070073882 | Brown et al. | Mar 2007 | A1 |
20070140266 | Njoku | Jun 2007 | A1 |
20070162520 | Petersen et al. | Jul 2007 | A1 |
20070280104 | Miyoshi | Dec 2007 | A1 |
20080049764 | Solomon | Feb 2008 | A1 |
20080137528 | O'Toole et al. | Jun 2008 | A1 |
20080159277 | Vobbilisetty | Jul 2008 | A1 |
20080267183 | Arndt | Oct 2008 | A1 |
20080301256 | McWilliams et al. | Dec 2008 | A1 |
20090003317 | Kasralikar | Jan 2009 | A1 |
20090019505 | Gopalakrishnan et al. | Jan 2009 | A1 |
20090073895 | Morgan et al. | Mar 2009 | A1 |
20090080328 | Hu et al. | Mar 2009 | A1 |
20090216910 | Duchesneau | Aug 2009 | A1 |
20090222558 | Xu et al. | Sep 2009 | A1 |
20090234974 | Arndt | Sep 2009 | A1 |
20090262741 | Jungck et al. | Oct 2009 | A1 |
20100008291 | LeBlanc et al. | Jan 2010 | A1 |
20100054129 | Kuik et al. | Mar 2010 | A1 |
20100103837 | Jungck et al. | Apr 2010 | A1 |
20100107162 | Edwards et al. | Apr 2010 | A1 |
20100118868 | Dabagh | May 2010 | A1 |
20100138532 | Glaeser et al. | Jun 2010 | A1 |
20100228961 | Burns | Sep 2010 | A1 |
20100257269 | Clark | Oct 2010 | A1 |
20100275199 | Smith et al. | Oct 2010 | A1 |
20100287548 | Zhou et al. | Nov 2010 | A1 |
20100306408 | Greenberg et al. | Dec 2010 | A1 |
20110023108 | Geldermann et al. | Jan 2011 | A1 |
20110131447 | Prakash et al. | Jun 2011 | A1 |
20110239268 | Sharp et al. | Sep 2011 | A1 |
20110246669 | Kanada et al. | Oct 2011 | A1 |
20110268117 | Davis | Nov 2011 | A1 |
20110299537 | Saraiya | Dec 2011 | A1 |
20110317703 | Dunbar et al. | Dec 2011 | A1 |
20120103837 | Wall et al. | May 2012 | A1 |
20120131225 | Chiueh et al. | May 2012 | A1 |
20120147894 | Mulligan et al. | Jun 2012 | A1 |
20120173757 | Sanden | Jul 2012 | A1 |
20120265976 | Spiers et al. | Oct 2012 | A1 |
20120278804 | Narayanasamy et al. | Nov 2012 | A1 |
20120291028 | Kidambi et al. | Nov 2012 | A1 |
20120307826 | Matsuoka | Dec 2012 | A1 |
20120314706 | Liss et al. | Dec 2012 | A1 |
20120320929 | Subramanian et al. | Dec 2012 | A9 |
20120331127 | Wang et al. | Dec 2012 | A1 |
20120331142 | Mittal et al. | Dec 2012 | A1 |
20130036136 | Horii | Feb 2013 | A1 |
20130077492 | Scaglione | Mar 2013 | A1 |
20130232492 | Wang | Sep 2013 | A1 |
20140115584 | Mudigonda et al. | Apr 2014 | A1 |
20140223431 | Yoshimura et al. | Aug 2014 | A1 |
Number | Date | Country |
---|---|---|
1 128 607 | Aug 2001 | EP |
2160068 | Mar 2010 | EP |
2012037518 | Mar 2012 | WO |
Entry |
---|
International Search Report dated Sep. 23, 2013 for Application No. PCT/US2013/040639, 10 pages. |
Lee, M. et al., “Security Enhancement in Infiniband Architecture,” Proceedings of the 19th IEEE International Parallel and Distributed Processing Symposium, Denver, Colorado, Apr. 4-8, 2005, Piscataway, New Jersey, Apr. 4, 2005, 18 pages. |
Sun Infiniband Dual Port 4x QDR PCIe ExpressModule and Low Profile Host Channel Adapters M2, Frequently Asked Questions, Sep. 21, 2010, http://www.oracle.com/us/products/servers-storage/networking/infiniband/sun-qdr-ib-hcas-faq-172841.pdf, retrieved on Sep. 11, 2012, 4 pages. |
International Search Report dated Sep. 26, 2013 for Application No. PCT/US2013/040656, 10 pages. |
European Patent Office, International Searching Authority, International Search Report and Written Opinion dated Oct. 11, 2012 for Application No. PCT/US2012/046225, 11 pages. |
European Patent Office, International Searching Authority, International Search Report and Written Opinion dated Nov. 30, 2012 for Application No. PCT/US2012/046219, 18 pages. |
Number | Date | Country | |
---|---|---|---|
20130019303 A1 | Jan 2013 | US |
Number | Date | Country | |
---|---|---|---|
61506557 | Jul 2011 | US | |
61645517 | May 2012 | US |