The present disclosure relates to modular information handling systems. More specifically, embodiments of the disclosure provide systems and methods for monitoring and management of chassis components in a distributed architecture information handling system chassis.
As the value and use of information continues to increase, individuals and businesses seek additional ways to process and store information. One option available to users is information handling systems. An information handling system generally processes, compiles, stores, and/or communicates information or data for business, personal, or other purposes thereby allowing users to take advantage of the value of the information. Because technology and information handling needs and requirements vary between different users or applications, information handling systems may also vary regarding what information is handled, how the information is handled, how much information is processed, stored, or communicated, and how quickly and efficiently the information may be processed, stored, or communicated. The variations in information handling systems allow for information handling systems to be general or configured for a specific user or specific use such as financial transaction processing, airline reservations, enterprise data storage, or global communications. In addition, information handling systems may include a variety of hardware and software components that may be configured to process, store, and communicate information and may include one or more computer systems, data storage systems, and networking systems.
Existing server architectures either provide a single monolithic server capable of running one operating system (or a single hypervisor running multiple virtualized operating systems) and input/output (“I/O”) resources at a time, or bulky blade server chassis providing multiple servers and I/O control modules in a single chassis. A system chassis with multiple information handling systems with various peripheral and I/O capabilities common to the chassis as a whole may provide advantages, as it allows a blade server chassis in a small form factor, thereby providing a blade server chassis with a size comparable to the size of a monolithic server. Implementation of a system chassis with multiple information handling systems with various peripheral and I/O capabilities common to the chassis as a whole presents numerous challenges.
In accordance with the teachings of the present disclosure, the disadvantages and problems associated with traditional approaches to monitoring and management in an information handling system chassis may be reduced or eliminated.
In accordance with embodiments of the present disclosure, a method for use in a chassis having a plurality of slots each configured to receive a module having one or more information handling systems wherein each slot is configured to electrically and communicatively couple the module to other components of the chassis may be provided. The method may include receiving, by a controller integral to the chassis, a signal from a sensor integral to the chassis, the signal indicative of a measured physical quantity of the sensor. The method may also include communicating, from the controller to a remote access controller integral to at least one module disposed in the plurality of slots, sensor information in conformity with the signal.
In accordance with these and other embodiments of the present disclosure, a method for use in a chassis having a plurality of slots each configured to receive a module having one or more information handling systems wherein each slot is configured to electrically and communicatively couple the module to other components of the chassis may be provided. The method may include receiving, by a controller integral to the chassis from a remote access controller integral to the module, a firmware image for application to an information handling resource integral to the chassis. The method may also include performing a firmware update of the information handling resource by applying the firmware image to the information handling resource.
In accordance with these and other embodiments of the present disclosure, a method for use in a chassis having a plurality of slots each configured to receive a module having one or more information handling systems wherein each slot is configured to electrically and communicatively couple the module to other components of the chassis may be provided. The method may include receiving, by a controller integral to the chassis from a remote access controller integral to the module, a request for allocation of storage media dedicated to the controller as storage for the remote access controller. The method may also include, responsive to the request, allocating a portion of the storage media to the remote access controller, and communicating an identifier of the allocated portion of the storage media to the remote access controller, such that the remote access controller may mount the allocated portion as a logical storage medium of the remote access controller.
Technical advantages of the present disclosure may be readily apparent to one skilled in the art from the figures, description and claims included herein. The objects and advantages of the embodiments will be realized and achieved at least by the elements, features, and combinations particularly pointed out in the claims.
It is to be understood that both the foregoing general description and the following detailed description are examples and explanatory and are not restrictive of the claims set forth in this disclosure.
A more complete understanding of the present embodiments and advantages thereof may be acquired by referring to the following description taken in conjunction with the accompanying drawings, in which like reference numbers indicate like features, and wherein:
Preferred embodiments and their advantages are best understood by reference to
For the purposes of this disclosure, an information handling system may include any instrumentality or aggregate of instrumentalities operable to compute, classify, process, transmit, receive, retrieve, originate, switch, store, display, manifest, detect, record, reproduce, handle, or utilize any form of information, intelligence, or data for business, scientific, control, entertainment, or other purposes. For example, an information handling system may be a personal computer, a personal digital assistant (PDA), a consumer electronic device, a network storage device, or any other suitable device and may vary in size, shape, performance, functionality, and price. The information handling system may include memory, one or more processing resources such as a central processing unit (CPU) or hardware or software control logic. Additional components of the information handling system may include one or more storage devices, one or more communications ports for communicating with external devices as well as various input and output (I/O) devices, such as a keyboard, a mouse, and a video display. The information handling system may also include one or more busses operable to transmit communication between the various hardware components.
For the purposes of this disclosure, computer-readable media may include any instrumentality or aggregation of instrumentalities that may retain data and/or instructions for a period of time. Computer-readable media may include, without limitation, storage media such as a direct access storage device (e.g., a hard disk drive or floppy disk), a sequential access storage device (e.g., a tape disk drive), compact disk, CD-ROM, DVD, random access memory (RAM), read-only memory (ROM), electrically erasable programmable read-only memory (EEPROM), and/or flash memory; as well as communications media such as wires, optical fibers, microwaves, radio waves, and other electromagnetic and/or optical carriers; and/or any combination of the foregoing.
In this disclosure, the term “information handling resource” may broadly refer to any component system, device or apparatus of an information handling system, including without limitation processors, busses, memories, input-output devices and/or interfaces, storage resources, network interfaces, motherboards, electro-mechanical devices (e.g., fans), displays, and power supplies.
Each drawer 104 may include an interface connector 118 configured to electrically couple to a midplane 108, thus providing electrical coupling between information handling resources carried on the various drawers 104 to each other and/or one or more networks or devices external to chassis 100. Midplane 108 may comprise any system, device, or apparatus configured to interconnect information handling resources of chassis 100 with each other. Accordingly, midplane 108 may include slots, pads, and/or other connectors configured to receive corresponding electrical connectors of information handling resources in order to electrically couple information handling systems disposed in drawers 104 and/or information handling resources to each other.
A chassis management controller (CMC) 112 may be communicatively coupled to midplane 108 and may comprise any system, device, or apparatus configured to facilitate management and/or control of components of chassis 100, information handling systems modularly coupled within, and/or one or more of its component information handling resources. CMC 112 may be configured to issue commands and/or other signals to manage and/or control information handling systems coupled to slots 106 and/or information handling resources of chassis 100. CMC 112 may comprise a microprocessor, microcontroller, DSP, ASIC, field programmable gate array (“FPGA”), EEPROM, or any combination thereof.
In addition or alternatively, CMC 112 may also provide a management console for user/administrator access to these functions. For example, CMC 112 may provide for communication with a user interface (e.g., user interface 116), permitting a user to interact with CMC 112 and configure control and management of components of chassis 100 by CMC 112. As another example, CMC 112 may implement Web Services Management (“WS-MAN”) or another suitable management protocol permitting a user to remotely access a CMC 112 to configure chassis 100 and its various information handling resources. In such embodiments, a CMC 112 may interface with a network interface separate from a traditional network interface of chassis 100, thus allowing for “out-of-band” control of chassis 100, such that communications to and from CMC 112 are communicated via a management channel physically isolated from an “in band” communication channel with the traditional network interface. Thus, for example, if a failure occurs in chassis 100 that prevents an administrator from interfacing with chassis 100 via a traditional network interface and/or user interface 116 (e.g., operating system failure, power failure, etc.), the administrator may still be able to monitor and/or manage chassis 100 (e.g., to diagnose problems that may have caused failure) via CMC 112. In the same or alternative embodiments, CMC 112 may allow an administrator to remotely manage one or more parameters associated with operation of chassis 100 and its various information handling resources (e.g., power usage, processor allocation, memory allocation, security privileges, etc.).
One or more air movers 110 may be communicatively coupled to CMC 112, and may include any mechanical or electro-mechanical system, apparatus, or device operable to move air and/or other gasses. In some embodiments, an air mover 110 may comprise a fan (e.g., a rotating arrangement of vanes or blades which act on the air). In other embodiments, an air mover 110 may comprise a blower (e.g., a centrifugal fan that employs rotating impellers to accelerate air received at its intake and change the direction of the airflow). In these and other embodiments, rotating and other moving components of an air mover 110 may be driven by a motor. The rotational speed of such motor may be controlled by one or more control signals communicated from CMC 112. In operation, an air mover 110 may cool information handling systems and information handling resources of chassis 100 by drawing cool air into chassis 100 from outside chassis 100, expel warm air from inside chassis 100 to the outside of chassis 100, and/or move air across one or more heatsinks (not explicitly shown) internal to chassis 100 to cool one or more information handling systems and/or information handling resources. Although
As shown in
A user interface 116 may include any system, apparatus, or device via which a user may interact with chassis 100 and its various components by facilitating input from a user allowing the user to manipulate chassis 100 and output to a user allowing chassis 100 to indicate effects of the user's manipulation. For example, user interface 116 may include a display suitable for creating graphic images and/or alphanumeric characters recognizable to a user, and may include, for example, a liquid crystal display, a cathode ray tube, a plasma screen, and/or a digital light processor projection monitor. In certain embodiments, such a display may be an integral part of chassis 100 and receive power from one or more power supplies 114 of chassis 100, rather than being coupled to chassis 100 via a cable. In some embodiments, such display may comprise a touch screen device capable of receiving user input, wherein a touch sensor may be mechanically coupled or overlaid upon the display and may comprise any system, apparatus, or device suitable for detecting the presence and/or location of a tactile touch, including, for example, a resistive sensor, capacitive sensor, surface acoustic wave sensor, projected capacitance sensor, infrared sensor, strain gauge sensor, optical imaging sensor, dispersive signal technology sensor, and/or acoustic pulse recognition sensor. In these and other embodiments, user interface 116 may include other user interface elements (e.g., a keypad, buttons, and/or switches placed in proximity to a display) allowing a user to provide input to chassis 100. In these and other embodiments, user interface 116 may include one or more visual indicators, such as light-emitting diodes, for example, for communicating information to a user. User interface 116 may be coupled to CMC 112 and/or other components of chassis 100, and thus may allow a user to configure various information handling systems and/or information handling resources of chassis 100.
As shown in
Intermediate member 206 may be constructed from steel, aluminum, plastic, and/or any other suitable material. Although intermediate member 206 may have any suitable size and/or shape, intermediate member 206 is depicted in the embodiments of
Carrier member 208 may be constructed from steel, aluminum, plastic, and/or any other suitable material. Although carrier member 208 may have any suitable size and/or shape, carrier member 208 is depicted in the embodiments of
In some embodiments, face 210 may be substantially equal in width to the width of carrier member 208 and substantially equal to the height of carrier member 208. In these and other embodiments, face 210 may include handles, pull tabs, and/or other features allowing a person to pull on face 210 in order to translate chassis drawer 104A from a closed position to an open position in a direction generally parallel to the depth of top 214 and bottom 216. In these and other embodiments, face 210 may include a grill, vent, and/or other opening allowing gaseous fluid to enter and/or exit through face 210.
As shown in
Each of the various bays 212 defined by drawer 104A may include one or more electrical components for coupling an information handling resource (e.g., a hard disk drive) inserted into such bay 212 to other information handling resources of chassis 100. For example, a backplane (not explicitly shown) may couple a modular information handling resource disposed in a bay 212 to interface connector 118A, which, as described above, may in turn be coupled to midplane 108. In some embodiments, the various information handling resources may be coupled to interface connector 118A such that when chassis drawer 104A is drawn open relative to chassis 100, such information handling resources maintain electrical conductivity to interface connector 118A and interface connector 118A may maintain electrical conductivity to midplane 108, thus permitting insertion or removal of an information handling resource without affecting operation of other information handling resources carried by chassis drawer 104A. In such embodiments, interface connector 118A may only be decoupled from midplane 108 when the entirety of chassis drawer 104A is removed from chassis 100.
In the particular chassis drawer 104B depicted in
As shown in
Although
As shown in
A processor 402 may include any system, device, or apparatus configured to interpret and/or execute program instructions and/or process data, and may include, without limitation, a microprocessor, microcontroller, digital signal processor (“DSP”), application specific integrated circuit (“ASIC”), or any other digital or analog circuitry configured to interpret and/or execute program instructions and/or process data. In some embodiments, processor 402 may interpret and/or execute program instructions and/or process data stored in a memory or other computer-readable media accessible to processor 402.
A network interface 508 may comprise any suitable system, apparatus, or device operable to serve as an interface between an associated information handling system 502 and a private communication network internal to chassis drawer 104B formed by information handling systems 502 and in-band switch 516. Network interface 508 may enable an associated information handling system 502 to communicate using any suitable transmission protocol and/or standard. In some embodiments, network interface 508 may comprise one or more network interface cards, or “NICs.” In other embodiments, network interface 508 may comprise one or more local area network (LAN)-on-motherboard (LOM) devices. In these and other embodiments, network interface 508 may include a plurality of NICs, LOMs, or other network interface devices, in order to provide communication redundancy or robustness.
A RAC 512 may be implemented by, for example, a microprocessor, microcontroller, DSP, ASIC, EEPROM, or any combination thereof. RAC 512 may be configured to communicate with CMC 112. Such communication may be made, for example, via private management network fabric implemented using out-of-band switch 518. RAC 512 may be configured to provide out-of-band management facilities for management of an associated information handling system 502. Such management may be made by CMC 112 even if information handling system 502 is powered off or powered to a standby state. A RAC 512 may include a processor, memory, and network connection separate from the rest of its associated information handling system 502. In certain embodiments, A RAC 502 may include or may be an integral part of a baseboard management controller (BMC), Dell Remote Access Controller (DRAC) or an Integrated Dell Remote Access Controller (iDRAC).
In-band switch 516 may comprise any system, device, or apparatus configured to couple network interfaces 508 of information handling systems 502 to external network interface 530 and perform switching between network interfaces 508 and an external communication network communicatively coupled to external network interface 530 based on a network configuration of various ports (not explicitly shown) of network interfaces 508, in-band switch 516, and external network interface 530, as described in greater detail below. In-band switch 516 may comprise a PCIe switch, a generalized PC bus switch, an Infiniband switch, or other suitable switch.
Similarly, out-of-band switch 518 may comprise any system, device, or apparatus configured to couple RACs 512 of information handling systems 502 to CMC 112 and perform switching between RACs 512 and CMC 112 in order to provide for centralized management of individual information handling systems 502 via a management console coupled to CMC 112 (e.g., via CMC management port 528).
As shown in
In addition, as shown in
CMC management port 528 may serve as a network interface between CMC 112 and a remote management console configured to allow a user to remotely manage components of chassis 100 via an out-of-band network physically isolated from an in-band network coupled to external network interface 530. CMC management port 528 may communicate with such remote management console via any suitable management protocol or standard, including without limitation Intelligent Platform Management Interface (IPMI) and Simple Network Management Protocol (SNMP).
Sensor 532 may be coupled to midplane 108 and may include any suitable system, device, or apparatus that measures a physical quantity and converts it into a signal which can be read by a processor 402, RAC 512, and/or CMC 112. For example, a sensor 532 may include a temperature sensor (e.g., thermocouple, thermistor, thermostat, etc.), a speed sensor (e.g., a Hall effect sensor used to determine rotational speed of an air mover (e.g., an air mover 110), or any other suitable sensor for detecting a physical quantity associated with chassis 100 or a component thereof. Based on a measured physical quantity of a sensor 532, an information handling system 502 or a component thereof may take an action. As an example, in embodiments in which sensor 532 comprises a temperature sensor, an information handling system 502 may take corrective or remedial action in response to a sensed temperature exceeding a threshold, such as causing an increase in speed of an air mover 110, reducing power consumption of an information handling system 502, and/or other remedial action. For simplicity of exposition, only one sensor is depicted in
One or more information handling resources 534 may be communicatively coupled to midplane 108 or otherwise disposed in chassis 100 and may include one or more processors, service processors, basic input/output systems, buses, memories, I/O devices and/or interfaces, storage resources, network interfaces, motherboards, air movers, power supplies, and/or any other components and/or elements suitable for use in an information handling system. Such information handling resources 534 may also include air movers 110 and power supply 114 depicted in
Users of existing rack servers may be accustomed to monitoring and management of an information handling system chassis in which updates to information resources of a chassis are managed and applied by a RAC and which sensors are monitored by a RAC. However, in a distributed modular chassis system such as that depicted in
In addition, users of existing rack servers may be accustomed to monitoring and management of an information handling system chassis in which RACs include computer-readable media for storing data and instructions locally to the RAC (e.g., via a flash or secure digital (SD) card). However, in a distributed modular chassis system such as that depicted in
At step 602, CMC 112 may monitor one or more sensors 532 by receiving signals from the one or more sensors 532 indicative of the physical quantities measured by such sensors 532.
At step 604, CMC 112 may determine if a measured physical quantity of a sensor 532 has changed by more than a threshold amount. For example, for a temperature sensor, such threshold amount may be equal to five degrees Celsius. If such a change has occurred, method 600 may proceed to step 606. Otherwise, step 604 may repeat until such a change has occurred.
At step 606, responsive to a change of a measured physical quantity of a sensor 532 by more than the threshold amount, CMC 112 may communicate to a RAC 512 (e.g., via out-of-bound switch 518) present on a chassis drawer 104 disposed in chassis 100 an indication that such a change has occurred. Such indication may include an IPMI over LAN command or other suitable command communicated between CMC 112 and RAC 512 via out-of-band switch 518.
At step 608, responsive to receipt of the indication from CMC 112 that a change has occurred in a measured physical quantity of a sensor 532, RAC 512 may communicate a request to CMC 112 to retrieve (e.g., download) sensor information from CMC 112 and may receive the sensor information from CMC 112. Such download may be performed using trivial file transfer protocol (TFTP), IPMI over LAN, or other suitable protocol or standard for file transfer. RAC 512 may, alone or in concert with an associated processor 402, process the sensor information in order to model performance of chassis 100 components, initiate responsive action to a measured physical quantity, and/or perform any other action. Accordingly, a RAC 512 may model chassis 100 components in the same fashion as it would in a rack server, with CMC 112 serving as a proxy between sensors 532 and RAC 512.
Although
Method 600 may be implemented using CMC 112, a remote access controller 512, and/or any other system operable to implement method 600. In certain embodiments, method 600 may be implemented partially or fully in software and/or firmware embodied in computer-readable media and executable on a processor or controller.
At step 702, a user may upload to RAC 512 a firmware image for a chassis-level information handling resource 534. Such firmware image may include updated firmware for such information handling resource 534. A user may upload the firmware image to RAC 512 in any suitable manner, including without limitation, by issuing proper commands via user interface 116 or a keyboard-video-mouse interface (not explicitly shown) coupled to chassis 100 or a chassis drawer 104 including such RAC 512. Such file may be uploaded from a readily-removable computer-readable medium (e.g., flash drive, secure digital (SD) card, etc.), communicatively coupled to RAC 512 via a suitable external media interface (e.g., Universal Serial Bus port, SE port) of chassis 100 or a chassis drawer 104 including such RAC 512 (not explicitly shown). During upload, the user may also provide an indication regarding to which information handling resource 534 the firmware image is to be applied. Alternatively, the firmware image itself, or metadata associated therewith, may indicate to which information handling resource 534 the firmware image is to be applied.
At step 704, RAC 512 may communicate to a CMC 112 (e.g., via out-of-bound switch 518) an indication that such a firmware update is to be applied to one or more information handling resources 534. Such indication may include an IPMI over LAN command or other suitable command communicated between RAC 512 and CMC 112 via out-of-band switch 518.
At step 706, responsive to receipt of the indication from RAC 512 that a firmware update is available, CMC 112 may retrieve the firmware image from RAC 512. Such transfer may be performed using trivial file transfer protocol (TFTP), IPMI over LAN, or other suitable protocol or standard for file transfer.
At step 708, responsive to receipt of the firmware image, CMC 112 may perform the firmware update by applying the firmware image to the appropriate information handling resource 534. During such firmware update, CMC 112 may, from time-to-time communicate an indication to RAC 512 regarding the status of the update. Such indication may include an IPMI over LAN command or other suitable command communicated between RAC 512 and CMC 112 via out-of-band switch 518. Accordingly, a RAC 512 may perform firmware updates and manage chassis 100 components in the same fashion as it would in a rack server, with CMC 112 serving as an intermediary between information handling resources 534 and RAC 512.
Although
Method 700 may be implemented using CMC 112, a remote access controller 512, and/or any other system operable to implement method 700. In certain embodiments, method 700 may be implemented partially or fully in software and/or firmware embodied in computer-readable media and executable on a processor or controller.
At step 802, a RAC 512 may determine if storage local to RAC 512 (e.g., within the chassis drawer 104 comprising the RAC 512) is present and available. If local storage is not available, method 800 may proceed to step 804. Otherwise, method 800 may end.
At step 804, in response to storage local to RAC 512 being available, RAC 512 may send a request to CMC 112 (e.g., via out-of-band switch 518) to allocate a portion of storage media 524 as storage for RAC 512. Such request may include an IPMI over LAN command or other suitable command communicated between RAC 512 and CMC 112 via out-of-band switch 518.
At step 806, responsive to the request from RAC 512, CMC 112 may determine if a portion of storage media 524 is available responsive to the request from RAC 512. If a portion of storage media 524 is available, method 800 may proceed to step 808. Otherwise, method 800 may end.
At step 808, CMC 112 may allocate a portion of storage within storage media 524 to RAC 512. At step 810, CMC 112 may create a share (e.g., a Network File System (NFS) share) comprising the allocated portion. At step 812, CMC 112 may communicate the share name or other identifying information for the created share to RAC 512. Such communication may include an IPMI over LAN command or other suitable command communicated between CMC 112 and RAC 512 via out-of-band switch 518.
At step 814, in response to receipt of the share name, RAC 512 may mount the share as a logical storage medium of RAC 512. Accordingly, should RAC 512 require file storage I/O, it may issue the appropriate I/O commands to the mounted share, and a portion of storage media 524 may serve as virtual storage for RAC 512.
Similarly, portions of storage media 524 may also be allocated for a plurality of RACs 512 present in chassis 100 using methods similar to those described with respect to method 800.
Although
Method 800 may be implemented using CMC 112, a remote access controller 512, and/or any other system operable to implement method 800. In certain embodiments, method 800 may be implemented partially or fully in software and/or firmware embodied in computer-readable media and executable on a processor or controller.
Although the present disclosure has been described in detail, it should be understood that various changes, substitutions, and alterations can be made hereto without departing from the spirit and the scope of the disclosure as defined by the appended claims.