This non-provisional application claims priority under 35 U.S.C. § 119(a) on Patent Application No(s). 201910239492.2 filed in China, R.O.C. on Mar. 27, 2019, the entire contents of which are hereby incorporated by reference.
The disclosure relates to a server system and a management method thereto, more particularly to a server system and a management method thereto based on a rack management controller.
As the era of “big data” is arrived, since the server has the advantages such as the powerful ability of calculation and the large memory for saving data and is able to supply the services for a plurality of external computing terminals through internet, there are more and more industries depend the server on dealing with large amount of data.
Generally, the physical characteristics (for example, the temperature, voltage and power supplement for each of the elements on the mainboard) of the computing node and the storage node of the server is monitored by the baseboard management controller (BMC). Also, the baseboard management controller sends the collected data to the rack management controller (RMC). Additionally, some kinds of the servers are able to directly monitor aforementioned characteristics by the rack management controller through the switch. Hence, the structures of said servers are simpler, and the cost is also lower since the baseboard management controller is not required to be configured.
However, since the structure of aforementioned server is limited by the factors such as the configuration of the elements and the specification of the switch, there are only one port connected between the switch, the rack management controller and each of the nodes. As a result, when one of the nodes or ports is damaged, the server is unable to switch the node through other ports or connect to another node via other ports, thereby the operating calculation is seriously affected.
For these reasons, it still needs a server system and a management method thereto to improve aforementioned problems.
According to one or more embodiment of this disclosure, a server system comprises: a plurality of computing nodes and a plurality of storage nodes configured to operate after the plurality of computing nodes and the plurality of storage nodes are actuated; a switch electrically connected to the plurality of computing nodes through a plurality of first ports respectively, and the switch electrically connected to the plurality of storage nodes through a plurality of second ports respectively; and a rack management controller electrically connected to the plurality of computing nodes, the plurality of storage nodes and the switch, with the rack management controller controlling the switch to connect at least a part of the plurality of computing nodes to at least a part of the plurality of storage nodes according to a demand of hardware resource when the rack management controller receives the demand of hardware resource.
According to one or more embodiment of this disclosure, a management method for a server system comprises: actuating a plurality of computing nodes and a plurality of storage nodes by a rack management controller; and controlling a switch to connect at least a part of the plurality of computing nodes to at least a part of the plurality of storage nodes by the rack management controller according to a demand of hardware resource when the rack management controller receives the demand of hardware resource.
The present disclosure will become more fully understood from the detailed description given hereinbelow and the accompanying drawings which are given by way of illustration only and thus are not limitative of the present disclosure and wherein:
In the following detailed description, for purposes of explanation, numerous specific details are set forth in order to provide a thorough understanding of the disclosed embodiments. It will be apparent, however, that one or more embodiments may be practiced without these specific details. In other instances, well-known structures and devices are schematically shown in order to simplify the drawings.
Please refer to
Please continue to refer to
Please continue to refer to
Please continue to refer to
Hereinbefore, the plurality of computing node 11 may comprise a complex programmable logic device (CPLD), a real-time clock (RTC), a temperature sensor, a field-replaceable unit (FRU) or other elements which are able to collect data or supply additional function for the computing node 11 in practice. It is worth mentioning that the rack management controller 14 of this disclosure is able to collect the information (such as the temperature, voltage and firmware version of the CPLD) from each computing node 11 through the switch 13 without the baseboard management controller (BMC). Hence, the server system of this disclosure not only makes the structure of the server simpler, but also reduces the cost for maintaining the server.
On the other hand, when the computing node 11 is implemented by the complex programmable logic device (CPLD), the operating state of the computing node 11 is monitored by the rack management controller 14. Generally, in the structure of the server “monitoring each of the computing nodes by the BMC” mentioned in the conventional art, the complex programmable logic device (CPLD) is monitored by the BMC. Hence, the firmware of the complex programmable logic device (CPLD) supports the in-band updating in the conventional art. However, in an embodiment of this disclosure, the firmware of the complex programmable logic device (CPLD) supports both out-of-band updating and in-band updating. Particularly, the out-of-band updating is able to be performed as the firmware of the CPLD is sent to the switch 13 through the high speed topology network of the serial attached SCSI (SAS). On the other hand, the in-band updating is able to be performed as the firmware of the CPLD is sent to the switch 13 through the ports of the rack management controller 14; wherein the port may be implemented by the port whose hardware specification supports the I2C protocol. Additionally, in an implementation in this embodiment, aforementioned serial attached SCSI (SAS) may be implemented by the SAS 3.0. However, aforementioned serial attached SCSI (SAS) is also able to be implemented by other versions of SAS according to different transmission rate for different configurations, but this embodiment is not limited thereto. For these reasons, the server system disclosed by this disclosure improves the convenient to update the firmware of the CPLD, and the user can select the way to update the firmware of the CPLD flexibly.
Additionally, aforementioned serial attached SCSI (SAS) is a technique for computer hubs, wherein the main function thereof is transmitting data for the peripheral parts of the computer (such as the hard drive, CD-ROM, etc.). On the other hand, aforementioned SAS is a specification of the serial attached SCSI, wherein the SAS supports 2.5-inch hard drive and the SAS is adapted for the point-to-point serial protocol. The SAS 3.0 mentioned hereinbefore is the third-generation SAS, wherein the SAS 3.0 is able to provide a transmission rate of 12.0 Gbps (12000 Mbps) for each driver in the array.
Please refer to
Please refer to
Please refer to the step S14: when the rack management controller determines the connected computing node is unable to carry the computational load which the demand of hardware resource requires, the rack management controller controls the switch to connect another one of the plurality of computing nodes to the one of the plurality of storage nodes. Particularly, when the computing node selected by the rack management controller current is unable to carry aforementioned computational load, the rack management controller needs to select another computing node according to current computational load from the computing nodes other than the one connected to the storage node in order to supply enough hardware resource for the current computational load. For these reasons, when the computational load of the server is increased suddenly (such as online shopping network congestion due to special festivals, or increased network traffic caused by special events hosted in the online games), the rack management controller is able to select more computing nodes currently through the switch according to the difference of the computational load. On the other hand, when the operating computing node or the corresponding port is damaged suddenly, the rack management controller is able to select other workable computing nodes or ports currently through the switch, and makes the current calculation can be continued.
In view of the above description, this disclosure provides a server system and management method thereto. The switch of the server system is able to connect the rack management controller to each of the nodes via a plurality of ports. When one of the ports is damaged, the rack management controller, via the switch and another port, is able to connect to the node originally connected to the damaged port and required during the calculation. Furthermore, the rack management controller is able to control the number of the operating computing nodes and storage nodes based on the difference of the demand of hardware resource. As a result, an efficient and flexible structure and method to manage the server system is provided by this disclosure, and the problems mentioned in the related art are able to be improved.
The embodiments depicted above and the appended drawings are exemplary and are not intended to be exhaustive or to limit the scope of the present disclosure to the precise forms disclosed. Many modifications and variations are possible in view of the above teachings.
Number | Date | Country | Kind |
---|---|---|---|
201910239492.2 | Mar 2019 | CN | national |