The present invention will become more fully understood from the detailed description given hereinbelow for illustration only, and thus is not limitative of the present invention, and wherein:
The present invention provides a redundant clock architecture that brings a high-end SMP system feature into a blade clustering system implementation, thereby enabling flexible system configuration.
Please refer to
Each blade module 20/30/40/50 may be considered an independent computer, generally implemented on a motherboard. The blade module 20/30/40/50 is embodied on a printed circuit board configured with various electrical components such as processor(s), system memory, bridge chip(s), I/O controllers, network interface controller(s), I/O connectors for expansion cards (all not shown), a clock multiplexer (MUX) 21/31/41/51 and a local clock 22/32/42/52. These components connect to each other through specific buses to perform data processing tasks. Each blade module 20/30/40/50 includes a dedicated operating system (OS) that directly monitors and manages the hardware components configured thereon, manages all kinds of computing resources and provides an operating environment for application programs. In the present invention, the dedicated OS of each blade module 20/30/40/50 is capable of implementing an SMP configuration.
The processors (not shown) of the blades 20, 30, 40, 50 are single-chip processors installed in dedicated processor sockets (not shown), each equipped with one or more computing cores. All the processors in the blade clustering system according to the present invention support various SMP configurations, such as 1, 2, 4 or 8 processor chips. That is, more than one SMP domain may exist in the blade clustering system.
The central clock 10, configured off-board or equipped on one of the blade modules 20, 30, 40, 50, generates and distributes a synchronized central clock signal to each of the blade modules 20, 30, 40, 50. In some specific cases, the central clock signal may be provided by the local clock of one of the blade modules 20, 30, 40, 50. To execute clustering tasks, the central clock 10 is controlled by the clustering management system.
The local clock 22/32/42/52 is an independent clock source configured on each blade module 20/30/40/50, generating and distributing an operative local clock signal while the blade clustering system is operating. The local clock 22/32/42/52 can also be used for standalone operation such as testing, debugging and troubleshooting. If the local clock 22/32/42/52 is capable of maintaining the same clock edge alignment, the blade clustering system can support a complete clock-fail-over feature with “single blade operation” (
The clock multiplexer 21/31/41/51 connects electrically with the central clock 10, the local clock 22/32/42/52 and the local clock consumers on each blade module 20/30/40/50. The central clock signal and the local clock signal are sent to the clock multiplexer 21/31/41/51.
The clock multiplexer 21/31/41/51 monitors the status of the clock signals and is capable of selecting a healthy clock signal from between the central clock signal and the local clock signal. By default, the clock multiplexer 21/31/41/51 may select the central clock signal. If the selected clock signal has a problem and the other is healthy, the clock multiplexer 21/31/41/51 switches over from the bad clock source to the other. A practical example of the clock multiplexer 21/31/41/51 is a select PLL, which is controlled by the clustering management system. The clustering management system of the blade clustering system monitors the clock status, controls the clock distribution path and takes the necessary actions to recover the blade clustering system.
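The selection rule described above can be illustrated with a minimal software sketch. It is not the hardware implementation of the select PLL; the names `Source`, `ClockMux` and `update`, and the boolean health flags, are hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass
from enum import Enum


class Source(Enum):
    CENTRAL = "central"
    LOCAL = "local"


@dataclass
class ClockMux:
    """Hypothetical model of one blade's clock multiplexer."""

    # The central clock signal is selected by default.
    selected: Source = Source.CENTRAL

    def update(self, central_ok: bool, local_ok: bool) -> Source:
        """Switch only when the selected source is bad and the other is healthy."""
        healthy = {Source.CENTRAL: central_ok, Source.LOCAL: local_ok}
        other = Source.LOCAL if self.selected is Source.CENTRAL else Source.CENTRAL
        if not healthy[self.selected] and healthy[other]:
            self.selected = other
        return self.selected


mux = ClockMux()
print(mux.update(central_ok=True, local_ok=True).value)   # stays on the default
print(mux.update(central_ok=False, local_ok=True).value)  # fails over
```

In this sketch the multiplexer does not switch back on its own once the original source recovers; whether a real design does so is left open by the description and is an assumption here.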
Please refer to
As a clustering system, the blade clustering system can use the central clock signal by default. Once the central clock 10 fails or has a problem, the multiplexer 21/31/41/51 switches over and each blade module 20/30/40/50 of the blade clustering system can use the local clock signal of its local clock 22/32/42/52, thereby keeping the blade module 20/30/40/50 running. If the local clocks 22/32/42/52 can keep the edges of the local clock signals aligned with those of the original central clock signal before it failed, the whole blade clustering system may maintain synchronized clustering operation. Conversely, if the local clock signals are not synchronized, each blade module 20/30/40/50 will still be running. As long as the cluster management system is still processing task scheduling/dispatching, the blade clustering system remains available for new tasks.
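The outcomes of a central clock failure described above can be summarized as a small decision function. This is a sketch under stated assumptions: the enum `ClusterState`, the function `cluster_state` and its three boolean inputs are hypothetical, and the last state (no dispatching) is inferred rather than stated in the description.

```python
from enum import Enum


class ClusterState(Enum):
    SYNC_CENTRAL = "synchronized clustering on the central clock"
    SYNC_LOCAL = "synchronized clustering on edge-aligned local clocks"
    UNSYNC_AVAILABLE = "blades run unsynchronized; new tasks still accepted"
    UNSYNC_UNAVAILABLE = "blades run unsynchronized; no task dispatching"


def cluster_state(central_ok: bool, locals_aligned: bool,
                  manager_dispatching: bool) -> ClusterState:
    """Classify the cluster after the multiplexers have settled."""
    if central_ok:
        return ClusterState.SYNC_CENTRAL        # default mode
    if locals_aligned:
        return ClusterState.SYNC_LOCAL          # fail-over keeps edge alignment
    if manager_dispatching:
        return ClusterState.UNSYNC_AVAILABLE    # blades run independently
    return ClusterState.UNSYNC_UNAVAILABLE      # inferred case, an assumption


print(cluster_state(central_ok=False, locals_aligned=False,
                    manager_dispatching=True).value)
```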
Please refer to
One solution is to utilize a synchronization module (not shown) for synchronizing the local clocks (22, 32)/(42, 52) in the same SMP/OS domain 06/07.
Another is to reboot the blade clustering system in “single-blade operation mode”. The clustering management system will cycle the power, change the SMP configuration and clock sources, and restart the system in “single-blade operation mode”. The blade clustering system will then remain usable without any repair or replacement. In the prior art, replacing or repairing hardware takes time. With the clock distribution architecture, the present invention gives the blade clustering system an opportunity to keep operating for a certain duration.
Using the central clock by default may be considered a “full-redundant mode”. If somehow the blade clustering system cannot operate under the single-blade operation mode, then the clustering management system will need to cycle the power, change the clock sources and restart the system in “single-blade operation mode” again.
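The two recovery options can be sketched as a planning routine for a hypothetical clustering management system. The `Blade` class, the `recover` function, the action strings and the treatment of single-blade operation mode as "one SMP domain per blade on its local clock" are all assumptions made for illustration; the description does not specify them.

```python
from dataclasses import dataclass


@dataclass
class Blade:
    blade_id: int
    smp_domain: int               # blades sharing a value form one SMP/OS domain
    clock_source: str = "central"


def recover(blades, has_sync_module):
    """Return the actions a hypothetical management system would take."""
    actions = []
    if has_sync_module:
        # Option 1: keep the SMP domains and align local clocks inside each one.
        domains = {}
        for blade in blades:
            blade.clock_source = "local"
            domains.setdefault(blade.smp_domain, []).append(blade.blade_id)
        for domain, members in sorted(domains.items()):
            actions.append(f"sync local clocks of blades {members} in domain {domain}")
        return actions
    # Option 2: reboot in single-blade operation mode.
    actions.append("cycle power")
    for blade in blades:
        blade.smp_domain = blade.blade_id  # assumption: one domain per blade
        blade.clock_source = "local"
    actions.append("set SMP configuration: one domain per blade")
    actions.append("set clock source: local")
    actions.append("restart in single-blade operation mode")
    return actions


blades = [Blade(20, 6), Blade(30, 6), Blade(40, 7), Blade(50, 7)]
for step in recover(blades, has_sync_module=False):
    print(step)
```

The example groups blades 20 and 30 into one domain and blades 40 and 50 into another, mirroring the pairing of local clocks (22, 32)/(42, 52) with the SMP/OS domains 06/07.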
For blades that need an external clock source as a centralized clock, testing those blades also relies on an extra clock source. The present invention provides flexibility for standalone testing, debugging and troubleshooting, since a blade can operate standalone on its local clock.
Changing the SMP configuration may require essential hardware changes and/or software/firmware configuration changes. The redundant clock distribution architecture of the present invention is one of the fundamentals for flexible system configuration.
The invention being thus described, it will be obvious that the same may be varied in many ways. Such variations are not to be regarded as a departure from the spirit and scope of the invention, and all such modifications as would be obvious to one skilled in the art are intended to be included within the scope of the following claims.
| Number | Date | Country |
|---|---|---|
| 60822399 | Aug 2006 | US |