This application is based on and claims the benefit of China Application No. 201210084484.3 filed Mar. 27, 2012 the entire disclosure of which is incorporated by reference herein.
1. Field of the Invention
The present invention relates to a cloud system, in particular relates to a cloud system and a method for monitoring operation states of a physical machine, and compulsorily ejecting a physical machine from a cabinet immediately when the abnormal states occur during operation. 2. Description of Related Art
In recent years, as the semiconductor industry develops rapidly, the capabilities of a computer are growing more and more powerful. As the internet becomes more popular, cloud systems which provide servers at the service end for replacing computers at the client end is regarded as the future trend of computer technologies.
As mentioned above, when one of the physical machine 12 in the cloud computing data center 1 is damaged and expected to be replaced, it is difficult to an administrator to correctly identify the damaged physical machine 12 among a great many of physical machines 12. Accordingly, a system for administrating a cloud computing data center 1 is provided in the market, wherein a physical machine 12 is damaged, the administrator is automatically informed of the floor and location of the computing data center 1 where the damaged physical machine 12 is located, and further informed of the location of the cabinet in the computing data center 1 where the damaged floor physical machine 12 is located. Thus, the administrator is allowed to look for the damaged physical machine 12 onsite according to the location data and replace the damaged physical machine 12.
As mentioned previously, each physical machine 12 has identical exterior. If there are tens or hundreds of cabinet 11 in a computing data center 1, also each cabinet 11 has tens or hundreds of physical machines 12, it is still a difficult task to promptly identify exact location of the damaged physical machine 12 to the administrator according to the location data mentioned above. Not only the required time for replacing a damaged physical machine 12 is long, also the miss operation of replacing the damaged physical machine 12 may occur and lead to irreparable errors.
It is desired to offer innovative technologies to provide exact location data to administrators when a physical machine 12 in the cloud computing data center 1 is expected to be replaced. Not only the exact location data is provided to the administrators, the physical machine 12 expected to be replaced is directly ejected from the cabinet 11. When an administrator arrives on the computing data center 1, the physical machine 12 can be quickly identified and replaced and miss operation of replacing the physical machine 12 is avoided.
The objective of the present invention is to provide a cloud system and a method for monitoring and handling abnormal states of physical machines in the cloud system. Administrators are allowed to monitor operation states of a plurality of physical machines in a cloud computing data center via a management terminal, and compulsorily ejecting the physical machine having abnormal operation states from the cabinet.
In order to achieve the above, each physical machine of the cloud system respectively executes a daemon program. The daemon program monitors the operation states of physical machines, and provides the operation states to a management terminal of the cloud system. When the management terminal determines that any physical machine is having abnormal operation states, the management terminal provides a control instruction to the cabinet of the physical machine having abnormal operation states. The physical machine having abnormal operation states is compulsorily ejected from the cabinet.
Compare with the related art, the advantage of the present invention is the daemon program executed in each physical machine continuously to monitor each number data of each physical machine, and further determines the operation states of physical machines. Administrators remotely control the management terminal, and receive the operation states of all physical machines in the cloud computing data center from the user interface of the management terminal. When a physical machine having abnormal operation states is required to be replaced, the physical machine is compulsorily ejected from the cabinet. Thus, when administrators arrive on the cloud computing data center to replace the damaged physical machine, the physical machine having abnormal operation states physical machine is ejected from cabinet and easily identified. The typical miss operation due to the identical exterior of all physical machines in a computing data center is accordingly avoided.
The features of the invention believed to be novel are set forth with particularity in the appended claims. The invention itself, however, may be best understood by reference to the following detailed description of the invention, which describes an exemplary embodiment of the invention, taken in conjunction with the accompanying drawings, in which:
Embodiments are provided in the following in order to further detail the implementations of the present invention in the summary. It should be noted that objects used in the diagrams of the embodiments are provided with proportions, dimensions, deformations, displacements and details are examples and the present invention is not limited thereto and identical components in the embodiments are the given same component numbers.
The present invention provides a cloud system and a method for monitoring and handling abnormal states of physical machines in the cloud system. The method for monitoring and handling abnormal states of physical machines in a cloud system is used in a management terminal of a cloud system (the management terminal 3 shown in
Next, the management terminal 3 generates a control instruction (the control instruction C1 shown in
Lastly, the cabinet 1 compulsorily ejects the physical machine 22 on the corresponding location from the cabinet 21 according to the content of the control instruction C1 (step S18). Thus, when the administrators arrive, the ejected physical machine 22 from the cabinet 21 can be quickly identified and replaced. The objective of the present invention is to provide a method such that administrators are allowed to quickly and precisely identify the physical machine 22 expected to be replaced. As a result, given the step S16 and the step S18 both manage to achieve the above objective, the method does not require to include the step S16 and the step S18 at the same time, and is not limited thereto.
As shown in
As shown in
The management terminal 3 has a monitor application program interface (API) 31 and a user interface 32. The management terminal 3 retrieves these record files F1 from the sharing storage pool P1 via the monitoring API 31, and display operation states of these physical machines 22 via the user interface 32 whereby administrators can check and analyze.
In the embodiment, the monitoring API 31 generates an abnormal events message or an abnormal state message to inform the administrators according to the analyzed results of the step S34. When the physical machine 22 has abnormal events, such as CPU usage is higher than 70%, the network traffic is higher than 10 M per second or the temperature is higher than 70 degree ° C., abnormal events message are generated accordingly. The monitoring API 31 determines the physical machine 22 is under abnormal states (for example CPU usage is up to 70% and longer than 5 minutes) and generates the abnormal state message when the physical machine 22 has abnormal events lasting for a predetermined time length. Thus, the management terminal 3 respectively provides different warning messages, or informs different administrators to address the issues according to the abnormal event message and the abnormal state message.
After the step S36, the management terminal 3 receives an external trigger by the administrator via the user interface 32 (step S38), generates the control signals C1 according to the trigger, and transmits the control signals C1 to the cabinet 21 with the physical machine 22 having abnormal operation states (step S40). Further, the management terminal 3 automatically generates the control instruction C1 after the abnormal event message or the abnormal state message is generated, and automatically transmits the control instruction C1 cabinet 21 with the physical machine 22 having abnormal operation states (step S42), but the application is not limited thereto. Thus, after the step S40 or S42, the cabinet 21 compulsorily ejects the physical machine 22 having abnormal operation states according to the control instruction C1, which is convenient to the administrators to locate and replace the damaged physical machine 22.
In the first embodiment, the execution efficiency of the predetermined daemon program 221 is insufficient to perform complicated computing tasks. The daemon program 221 is used for collecting and compiling statistics of the data in the physical machines 22. The analyzing and determining tasks are executed by the management terminal 3. Nonetheless, if the daemon program 221 is capable of performing complicated computing tasks, the daemon program 221 directly analyzes the operation states of the physical machine 22 for reducing the loading of the management terminal 3.
As shown in
In the embodiment, when the physical machine 22 has abnormal events (for example the CPU usage is higher than 70%), the daemon program 222 generates and transmits the abnormal event message, and generates and transmits the abnormal state message when the physical machine 22 is under abnormal states (for example the CPU usage is higher than 70% and lasting for 5 minutes). The daemon program 222 regards that the physical machine 22 is under abnormal states when the abnormal events occur and last for a predetermined time length.
As shown in
In addition, the cloud system network may be further installed with a database 4. The database 4 is connected to the physical machines 22 and the management terminal 3 via the network system. In the step S58, the daemon program 222 transmits and saves the abnormal message M1 transmitting in the database 4. The management terminal 3 periodically connects to the database 4, for accessing the abnormal message M1 in the database 4. Nonetheless, the above description includes preferred embodiments of the present invention and the scope of the invention is not limited thereto.
In the embodiment, the management terminal 3 receives external trigger by the administrators via the user interface 32 (step S64), generates the control signals C1 according to the trigger, and transmits the control signals C1 to the cabinet 21 with the physical machine 22 having abnormal operation states (step S66). The management terminal 3 automatically generates the control instruction C1 after receiving the abnormal message M1, and automatically transmits the control instruction C1 to the cabinet 21 with the physical machine 22 having the abnormal operation states (step S68). The cabinet 21 ejects the physical machine 22 having abnormal operation states from the cabinet 21 according to the content of the control instruction C1.
In the step S18, S40, S42, S66 and S68, the cabinet 21 receives the control instruction C1 via the control module 23. The control module 23 controls to move the tenon 213 on the corresponding location in the cabinet 21 according to the content of the control instruction C1 for ejecting the physical machine 22 on the corresponding location from the cabinet 21. In further details, the control module 23 controls the tenon 213 to depart the tenon receiving portion 223 on the housing of the physical machine 22, whereby the elastic component 212 at the back of the cabinet 21 pushes the physical machine 22 to eject from the socket. The above embodiments are preferred embodiments according to the present invention and are not limited thereto.
In further details, the cabinet 21 is installed with a coil circuit 214 on the corresponding location. When the control module 23 instructs to eject the physical machine 22, the coil circuit 214 is powered on to generate the magnetic force for attracting the tenon 213 (as shown in
As the skilled person will appreciate, various changes and modifications can be made to the described embodiments. It is intended to include all such variations, modifications and equivalents which fall within the scope of the invention, as defined in the accompanying claims.
Number | Date | Country | Kind |
---|---|---|---|
201210084484.3 | Mar 2012 | CN | national |