1. Technical Field
The present invention relates in general to the field of computers, and in particular to a multiple blade server housed in a server chassis. Still more particularly, the present invention relates to a method and system for rerouting data traffic to and from one of the server blades in the server chasses when an uplink is lost.
2. Description of the Related Art
Server blade computers offer high-density server boards (blades) in a single server blade chassis (blade center chassis). A typical server blade computer is illustrated in
As shown in
A major drawback of systems such as STP is that they are relatively slow, taking as long as 30-60 seconds to re-route the data. Such a delay may be unacceptable to some applications, and may result in packet loss or disruption of end-to-end sessions.
The present invention, therefore, addresses the need for a method and system to quickly redirect traffic from a server blade to different access switches that provide data communication to a network. Each access switch has external ports directed upstream towards the network, and correlated internal ports directed downstream towards the server blade. The server blade has a primary upstream interface associated with a first access switch and a secondary failover upstream interface associated with a second access switch. In the event that the first access switch loses an upstream data signal or connection to one of its upstream external ports, one or more VLAN-determined corresponding downstream internal ports in the first access switch is disabled, thus causing the primary upstream interface in the server blade to failover to the secondary failover upstream interface and its associated second access switch.
The above, as well as additional objectives, features, and advantages of the present invention will become apparent in the following detailed written description.
The novel features believed characteristic of the invention are set forth in the appended claims. The invention itself, however, as well as a preferred mode of use, further purposes and advantages thereof, will best be understood by reference to the following detailed description of an illustrative embodiment when read in conjunction with the accompanying drawings, where:
a illustrates dual Network Interfaced Cards (NICs) in a server blade in a server blade chassis;
b depicts a multiple layered switching pathway between server blades and the network via an Ethernet Switch Module (ESM);
c illustrates internal and external ports in the ESM shown in
d depicts a Virtual Local Area Network (VLAN) table for use by the ESM to determine which internal ports should be turned off in response to a specific external port being disconnected;
e shows an exemplary VLAN table as depicted in
With reference now to
Server blade 204 has multiple Network Interface Cards (NICs), shown in exemplary form as NIC 206a and NIC 206b. NICs 206 connect, preferably via chassis mid-plane wiring 210, to internal ports 212 in an Ethernet Switch Module (ESM) 214. As shown, each NIC 206 preferably connects to a different ESM 214. Each ESM 214 connects, via an external port 216, to an external uplink 218, which connects to the network 106.
NICs 206a and 206b are backups to each other. That is, if an upstream signal (from ESM 214a) is lost, then NIC 206a sendsa signal to a NIC select logic 208 (and/or alternatively to NIC 206b) instructing all data packets being communicated to and from server blade 204 to be routed via NIC 206b to ESM 214b. If and when upstream communication is re-established with NIC 206a, then NIC 206a sends a signal to NIC select logic 208 (and/orNIC 206b) that all future data packets to and from server blade 204 are to be again communicated via NIC 206a to ESM 214a.
With reference to ESM 214a in the example just described, assume that the link between external port 216a and external uplink 218 is broken. Such a link could be broken by a failure in external port 216a, a cable used to connect external port 216a and external uplink 218 could simply be inadvertently unplugged, or an uplink device such as external uplink 218 could fail due to being powered off, being defective, etc. In such a scenario in the prior art, NIC 206a would continue to communicate with internal port 212a, since the connection between NIC 206a and internal port 212a would still be intact, but data packets would not be able to reach external uplink 218. According to the present invention, however, the loss of the external port 216a connection results in internal port 212a being disabled, resulting in NIC 206a sensing no signal, to cause NIC 206b to take over.
Note that while in the preferred embodiment ESM 214a and ESM 214b are different intermediate switches, in an alternate embodiment they may be the same switch. If ESM 214a and 214b are the same switching unit, then NIC 206b is coupled to one or more internal (downstream/server blade facing) ports in the ESM 214 that are associated with a different external (upstream/network facing) port that has not lost its upstream data link. These internal and external ports are depicted in greater detail in
With reference now to
Referring now to
In an alternative embodiment, however, rather than disabling all internal ports 1-14, only selected internal ports are disabled according to which internal port was carrying traffic to the disconnected external port. As shown in
In another preferred embodiment, determining which internal port is associated with a particular external port is done with the aid of a Virtual Local Area Network (VLAN) table 228, as shown in
The alternate embodiment reference in
When a data packet is received in the ESM 214, the ESM 214 reads a tag in a header of the data packet, which tells the ESM 214 which external port should receive the data packet. With reference then to
Using the VLAN table 228 allows internal ports to be selectively disabled, just as described in the system of
Referring now to
As described in block 310, the internal ports are disabled, causing the internal port-attached downstream devices to be disconnected. If and when the ESM detects that the original external port is again in communication with the external uplink (query block 312), then the internal ports are again enabled for connection with the downstream devices (server blades).
Note that while the present invention has been described as operating within an ESM, it can also be implemented (exclusively or concurrently) in the other switches shown in
While the present invention has been described as implicitly causing NIC failover, this can be explicitly caused as well. That is, as described above, the back-up failover NIC in the server blade is activated when it receives a signal indicating a loss of communication between the primary NIC and the upstream link. This signal may be in the form of a message in a packet, such as in a header of a data packet being directed to the primary NIC in the computer system, or the signal may be an electrical signal on a pathway (such as setting a line to the computer system high or low). This signal, whether in the packet header or on a line as described, is defined as an encoded message, which instructs the primary NIC to failover to the backup NIC.
Thus, the upstream link can send the encoded message directly to either the primary or backup NIC, telling them that the primary NIC's upstream pathway has been broken. Similarly, while the system has been described for exemplary purposes as using NIC's, any similar such interface device may be used in accordance with the present invention.
While the ports have been described as being “external” and “internal,” it should be understood that these terms are used to describe the respective “upstream” and “downstream” characteristics of these ports. Thus, a reference to an “upstream” port is understood in a preferred embodiment as being towards a network, and a reference to a “downstream” port is understood in the preferred embodiment as being towards a processor.
It should be understood that at least some aspects of the present invention may alternatively be implemented in a program product. Programs defining functions on the present invention can be delivered to a data storage system or a computer system via a variety of signal-bearing media, which include, without limitation, non-writable storage media (e.g., CD-ROM), writable storage media (e.g., a floppy diskette, hard disk drive, read/write CD ROM, optical media), and communication media, such as computer and telephone networks including Ethernet. It should be understood, therefore in such signal-bearing media when carrying or encoding computer readable instructions that direct method functions in the present invention, represent alternative embodiments of the present invention. Further, it is understood that the present invention may be implemented by a system having means in the form of hardware, software, or a combination of software and hardware as described herein or their equivalent.
While the invention has been particularly shown and described with reference to a preferred embodiment, it will be understood by those skilled in the art that various changes in form and detail may be made therein without departing from the spirit and scope of the invention.
The present application is related to the following co-pending U.S. patent application filed on even date herewith, and incorporated herein by reference in its entirety: Ser. No. 10/______, (RPS920050011US1), entitled “NETWORK SWITCH LINK FAILOVER IN A REDUNDANT SWITCH CONFIGURATION.”