The present invention relates to a management method and system, and more particularly to a method and system for managing a plurality of server units.
Computing devices normally operate within a range of parameters, with the operation performance being monitored by sensors. The parameters being monitored may include the temperature and humidity within a computer case, voltages of semiconductor components, the velocity of cooling fans, etc. A management method is often used to collect and analysis monitor data reported from sensors measuring the operation and performance of the computing device. Furthermore, the management method enables the performance of administrative tasks on the computing device such that certain corrective action can be taken when the parameters stray beyond specified limits.
One such management method has been realized using Intelligent Platform Management Interface (IPMI), which is a set of computer interface specifications for a network of computing devices. An IPMI includes mainly a baseboard management controller (BMC) and other controller components. The BMC is a specialized microcontroller embedded on the motherboard of a computer and independent of the operating system thereof. Although typically configured in the baseboard of the computing device so as to monitor the physical state thereof, the BMC communicates with the control unit of remote manager system through a different protocol than the computing system does with the control unit. Furthermore, since the BMC and the computing system operate independently, they use different LAN ports. This means the information that the BMC transmits to the remote manager platform and that transmitted by the computing device are also independent. Therefore, problems arise when a remote manager system manage multiple computing devices using multiple BMCs, wherein the remote manager system fails to recognize a BMC and a computing device being on the same server. For instance, when one of the computing devices goes through a power failure, complex mutual communication between the BMCs and the computing devices has to be performed before the remote manager system can determine which BMC is to be controlled to reboot the failed computing device.
Accordingly, one of the objectives of the present invention is to provide a method and system for managing a plurality of server units, which enable a remote management platform to recognize a BMC and a computing device being on the same server.
In order to achieve the aforementioned objective, one embodiment of the present invention provides a method for managing a plurality of server units. Each server unit has a computing device and a baseboard management controller (BMC) connected thereto. The control unit interfaces with each of the computing devices through a first protocol, and interfaces with each of the BMCs through a second protocol different from the first protocol. The BMC and the computing device that are on the same server unit each have a server information corresponding to each other. The method includes: (a) sending a command through the second protocol via the control unit to each BMC to obtain the server information thereof; (b) determining, using the control unit, whether the server information of each computing device stored in the control unit corresponds to the server information obtained from one of the BMCs; and (c) mapping one of the computing devices to one of the BMCs when the server information of said computing device corresponds to the server information obtained from said BMC.
Another embodiment of the present invention provides a system for managing a plurality of server units including a plurality of server units and a control unit. The plurality of server units each includes a computing device and a baseboard management controller (BMC) connected thereto. The BMC and the computing device that are on the same server unit each have a server information corresponding to each other. The control unit interfaces with each of the computing devices through a first protocol and interfaces with each of the BMCs through a second protocol different from the first protocol. The control unit is configured to send a command through the second protocol to each BMC to obtain the server information thereof, determine whether the server information of each computing device stored in the control unit corresponds to the server information obtained from one of the BMCs, and map one of the computing devices to one of the BMCs when the server information of said computing device corresponds to that of said BMC.
In order to further the understanding of the present disclosure, reference is made to the following detailed description illustrating the embodiments and examples of the present disclosure. The description is for illustrative purpose only and is not intended to limit the scope of the claim.
The aforementioned illustrations and following detailed descriptions are exemplary for the purpose of further explaining the scope of the present disclosure. Other objectives and advantages related to the present disclosure will be illustrated in the subsequent descriptions and appended drawings.
Referring to
As shown in
With reference to
The present invention does not limit the first protocol and the second protocol to a specific type. For instance, when the plurality of computing devices 20 run under Windows operating system, the control unit 1 can interface with the computing devices 20 through standard WMI interfaces, and when the plurality of computing devices 20 run under Linux operating system, the control unit 1 can interface with the computing devices 20 through standard SSH interfaces. Moreover, in the present embodiment, the second protocol is the IPMI interface.
Moreover, with reference to
In the present embodiment, the BMC 21 and the computing device 20 that are on the same server unit (2a, 2b, 2c) have a server information corresponding to each other. To be specific, in the present embodiment, the server information can be a MAC address of said server unit 2. Furthermore, in the present embodiment, the server information of the BMC is the MAC address of the computing device 20 that is stored in the BMC prior to the performance of step S100. Therefore, “corresponding to each other” mentioned above means, for a computing device 20 and a BMC that are on the same server unit, the server information of the computing device 20 and the server information of the BMC 21 are the same. For instance, referring to
As mentioned in the background section, a BMC is typically embedded on the baseboard of a computing device. Since the server information is one that the computing device 20 and the BMC 21 on the same server unit share, in other embodiments, the server information can also be the serial number of the baseboard of the computing devices 20. Alternatively, in yet another embodiment, the server information of the BMC 21 can be the UUID of the computing device 20 that is on the same server unit as the BMC 21.
With reference to
Specifically, in this embodiment, the control unit 1 sends an IPMI OEM command to each BMC 21 to obtain the server information thereof. In step S102, the control unit 1 compares the server information of each computing device 20 stored therein with the retrieved server information of each BMC 21. Upon finding server information existing in one of the computing devices 20 and one of the BMCs 21 at the same time, the control unit 1 maps said computing device 20 to said BMC 21. The control unit 1 will keep doing so until it has checked the server information of all the computing devices 20 stored therein.
The way of mapping said computing device 20 and to said BMC 21 can be, but not limited to adding a tag indicating the association of said computing device 20 with said BMC 21. In other words, by mapping one computing device 20 to one BMC 21 that share the same server information, the method of the present invention enables the control unit 1 to recognize that said computing device 20 and said BMC 21 are on the same server unit 2a, 2b or 2c.
With reference to
More specifically, where the control unit 1 has not yet built management relationship with the computing devices 20, or where the control unit 1 does not possess information that can help recognize the BMC 21 on the same server unit as another computing device 20, the control unit 1 performs step 200 and step 202. Moreover, in step S202, the control unit 1 stores the server information of the computing device 20 of the server unit 2a in the BMC 21 of the server unit 2a, and does the same for the server unit 2b and the server unit 2c.
Afterwards, referring to
In this embodiment, the aforementioned step S104 of mapping one of the computing devices 20 to one of the BMCs 21 when the server information of said computing device 20 corresponds to that obtained from said BMC 21 is alternatively realized by step S208.
Specifically, the table built in step S208 is exemplarily shown as Table 1 below. It should be noted that, Table 1 is for illustration purpose only, and should not be deemed as a limitation to the present embodiment.
In Table 1, the computing devices 20 shown in
With the above-described technical solutions, the method and system for managing a plurality of server units enable the control unit 1 to recognize a BMC 21 and a computing device 20 being on the same server 2a, 2b or 2c. In this way, the method and system for managing a plurality of server units of the present invention can enhance the collaboration between the BMC and the computing device.
Specifically, a remote management module, such as the control unit 1 in the present embodiment can integrate the monitor information sent from the BMCs 21 more efficiently. Additionally, when one of the computing devices 20 fails to operate, whether due to BIOS failure or operating system failure, the control unit 1 can refer to Table 1 and determine which BMC 21 is on the same server unit as the computing device 20 that is failing. The control unit 1 can subsequently demand said BMC 21 to take corresponding corrective actions, e.g. rebooting. On the contrary, when one of the BMCs 21 needs an update, the control unit 1 can refer to Table 1 and determine which computing device 20 is on the same server unit as the BMC 21 that needs updating. Afterwards, the control unit 1 can send commands to said computing device 20 to perform the update.
The descriptions illustrated supra set forth simply the embodiments of the instant disclosure; however, the characteristics of the instant disclosure are by no means restricted thereto. All changes, alterations, or modifications conveniently considered by those skilled in the art are deemed to be encompassed within the scope of the instant disclosure delineated by the following claims.