This application relates to data processing systems and networking. More particularly, the present disclosure relates to methods and systems for server change management.
Server changes made to physical and/or virtual servers may be implemented through several different change mechanisms. The server changes may be implemented using such as with hard-coded programming in which the available server changes are predefined. Other change mechanisms may include using a workflow mechanism whereby a user can group operations together into a server change. Commands may be used with the workflow mechanism to enable flexibility. An example workflow can be saved and used as a template to enable some variable substitution and reuse with future server changes. Another change mechanism is the use of macros that can be user initiated or triggered based on events. Modifications to the macros may be made by saving the server change as a new macro.
The particular embodiments of the invention are illustrated by way of example, and not limitation, in the figures of the accompanying drawings, in which like reference numerals indicate the same or similar features unless otherwise indicated.
In the drawings,
Method and systems for server change management are described. A server system may include a physical server and a virtual server capable of being deployed on the physical server. A server change group may include one or more server change objects. Each server change object may represent a change to be made to at least one of the physical server or the virtual server. A change manager may receive a request for the change to the server system, select transitioning change objects from among the server change objects to transition the server system from a current state to an expected state after implementation of the change, and execute the transitioning change objects to implement the change to the server system.
In the following description, for purposes of explanation, numerous specific details are set forth in order to provide a thorough understanding of example embodiments. It will be evident, however, to one skilled in the art that the present invention may be practiced without these specific details
In an example embodiment, as the server personality is disembodied or separated from the physical structure, it may be possible to provision virtual servers on-demand out of industry standard components to enable differing configurations of virtual servers without reconfiguring the server. Each virtual server deployed on a physical server defines a state of a physical server. This may include the logical definitions and configuration information stored in and used by a VSS director (described by way of example in more detail below) to program a server fabric, an OS and applications of the virtual server. The state may be stored on a logical unit on a Storage Area Network 30, as described in more detail below. Thus, in
The system 10 is shown, by way of example, to include a switch group 12 including one or more switches 14, 16. The switch group 12 is connected, for example, via an InfiniBand link 18 (e.g., a switched fabric communications link) to one or more server pools 20. Each server pool 20.1-20.3 is shown to include a plurality of physical servers 22.1-22.n linked via one or more InfiniBand links 18 to the switch group 12. By way of example, three physical server pools 20.1-20.3 (on which the virtual servers are deployed) are shown in
When the link 18 is an InfiniBand link, each switch 14 may include an InfiniBand interface 24 to interface the server pools 20.1-20.3 to the switch group 12. The InfiniBand architecture or link may define a high speed network for interconnecting processing nodes and I/O nodes. In an InfiniBand network, processing nodes and I/O nodes are connected to the fabric by Host Channel Adapters (HCAs) and Target Channel Adapters (TCAs). Other links may be provided in addition to instead of, the InfiniBand link 18.
In this example embodiment, the physical server infrastructure may be stateless computational resources with CPUs and memory. For example, as shown in
The switch 14 communicates with a plurality of different networks (Local Area Networks, Wide Area Networks, or the like) via communication links 38, 40, 42. For example, the communication links 38, 40, 42 may be Ethernet connections and, each switch 14, 16 may include one or more Ethernet gateways 44. In the example system 10, the communication link 38 is shown to connect to a network 46 interconnecting a plurality of hosts 48.1-48.5. The hosts 48.1-48.5 may form part of another data network, or be any other network host.
The switch 14 also communicates via the communication link 40 to a network 50 which may, for example, be an enterprise network. The network 50 communicates with desktop computers 52.1-52.2 and a subnet 54 which, in turn, is connected to desktop computers 56.1-56.3. Further, the switch 14 connects via the communication link 42 to a network such as the Internet 58. The aforementioned networks are merely example networks and different configurations, and different numbers of networks and subnets may be provided that connect a wide range of network devices.
The system 10 may allow virtualization of servers deployed on physical servers to be managed by a management module 60. The management module 60 may be provided at the switch 14 or in other components. The management module 60 communicates with a VSS director 62 that controls the provisioning of the server pools 20.1-20.3.
In an example embodiment, the VSS director 62 communicates via a network 64 with the management module 60. The system 10 also includes a third party management module 65 that communicates with the VSS director 62 and/or with the management module 60 to manage the provisioning of virtual servers. In an example embodiment, the network 64 is an Ethernet network and, accordingly, the switch 14 may thus include one or more Ethernet ports 66. However, the various communication links linking the various components/devices in the system 10 are not restricted to InfiniBand connections, Ethernet connections, or the like. Any communication means may be provided to interconnect the various components.
As shown by way of example in
A change manager 602 of the system 600 may be used to manage the execution of a change to the virtual server system 10 in which the system 600 is deployed. For example, the change manager 602 may receive a request for a change to the virtual server system 10, select transitioning change objects from among the server change objects to transition the virtual server system 10 from a current state to an expected state after implementation of the change, and to execute the transitioning change objects to implement the change to the virtual server system 10.
The transitioning change objects may be a subset of the server change objects based on a current state of the virtual server system 10. The subset may reflect the change objects to be used for the change while excluding change objects from the server change objects that are unneeded to implement the change. For example, one or more of the server change objects may not be needed to run in order to provision a new virtual server within the virtual server system 10 depending on the current state of a physical server of the virtual server system 10.
In an example embodiment, a server change object may encapsulate a change. The change manager 602 may obtain a state of the change from the server change object. The change manager 602 may receive one or more server change objects directly from the server system and/or through a task manager 604.
The change manager 602 may provide an execution environment for a change. For example, a change executed by the change manager 602 may include assigning a physical server to a virtual server or turning on the power to a physical server. The change manager 602 may also perform other changes to the state of the virtual server system 10.
A task manager 604 may receive task objects from a provisioning director (e.g., the VSS director 62 of
In an example embodiment, the task manager 604 may include one or more job queues that may enable tasks to be run simultaneously. For example, a user may specify that the task manager 604 add tasks into a job queue and the task manager 604 may determine a time (e.g., a best time) to run the task or run the task immediately.
In an example embodiment, a change task may implement a change listener object. The change listener object may be configured to provide messages regarding execution of the change to an associated server change object (e.g., an associated transitioning object). The change task may optionally take the message received (e.g., as a change listener object to report back information on a server change) and add the messages to a task table 608 and a task history table 610 of a database 606. The task manager 604 may also write to the task table 608 and the task history table 610.
The database 606 may include one or more tables including the task table 608 and the task history table 610. However, other tables in the database 606 which may (or may not) be modified by the task manager 604 and/or the change manager 602 may also be provided.
The task table 608 may include information regarding tasks such as a nature of the task, when the task started, a running state of the task, an estimated time for completion of the task, current operation objects that are affected by the task, and the like. However, other information may also be provided.
The task history table 610 may include entries with timestamps, a reference to an associated task, and a text string. As each task runs, the tasks may update the task history table 610 with, for example, runtime information.
A job viewer 612 may access the database 606 to obtain information from the task table 608 and the task history table 610 to provide information to a user regarding system status such as existing tasks and a status of the existing tasks. In an example embodiment, the information may be messages (e.g., for the reporting of errors). However other types of messages may also be used. The job viewer 612 may optionally detail the status of a current operation such as a change to the server system.
A user may optionally interact with the system 600 through the job viewer 612. The user may select a job to get a history and a current status of a task. The user may also de-queue or cancel a task from the job viewer 612 by having the job viewer 612 communicate with the task manager 604.
In an example embodiment, the reserve process 702 may be performed to lock (or pre-lock) one or more resources of the virtual server system 10 (see
A resource may be unlocked by the release process 704 when a job is cancelled prior to completing the execute process 706 such that additional processes may be run on the resource. For example, the release process 704 may restore the resource to its status prior to the reserve process 702 being run on the resource.
When the change task is executed, the change task may internally call the execute process 706 by passing a server change object contained by the change task. The execute process 706 may then run the server change of the server change object. In an example embodiment, after the server change is complete, the resources of the virtual server system 10 that were previously locked may be automatically unlocked.
The use of the server change group 800 when suitably deployed may enable the system 600 to implement a more complex change to the virtual server system 10 that includes a number of underlying changes. For example, provisioning a virtual server on the virtual server system 10 (see
By way of example, the server change group 800 is described as being capable of effectuating a change in state to the virtual server system 10 (see
To effectuate changes to the virtual server system 10, the server change group 800 may include a completion status criterion 802, a plurality of change elements 804 (e.g., change objects and nested change groups), a description 806, and a continue-on-failure state 808.
The completion status criterion 802 may indicate a check against the states of completed changes associated with the plurality of change elements 804. In an example embodiment, if the completion status criterion 802 is not met, than the system 600 may reflect that the processing of a particular server change group 800 has failed.
For example, the completion status criterion 802 may be as follows:
The description 806 of the server change group 800 may provide information regarding the server change group 800. For example, the description 806 may identify the server change group 800, however, other types of descriptions may also be used.
The continue-on-failure state 808 may indicate whether the server change group 800 may continue to be executed when a requested server change associated with the plurality of change elements 804 has failed. In an example embodiment, the continue-on-failure state 808 may be set to enable (or cease) further processing of the server change group 800 when an element of plurality of change elements 804 has failed and/or the completion status criterion 802 has not been met.
The nested change group 904 may include a server change group 800 that includes the completion status criterion 802, the plurality of change elements 804, the description 806 and the continue-on-failure state 808 (see
The server change object 1000 may provide an interface for the change manager 602 to run a server change and may include a get state process 1002, a reserve process 1004, an execute process 1006, a get/set message process 1008, a description 1010, and a release process 1012.
The get state process 1002 may obtain a current state of a server change to a server system (e.g., the virtual server system 10 of
The reserve process 1004 may ready the server system for the execute process 1006. For example, the reserve process 1004 may function like the reserve process 702 (see
The execute process 1006 may run a server change. For example, the change manager 602 (see
The get/set message process 1008 may be used by the change and the change manager 602 to convey messages such as during execution of a requested server change. In an example embodiment, the messages may be error messages, however other messages such as pass/fail messages may also be provided. Errors may include, but are not limited to, the unavailability of a resource (e.g., a server object) within a server system (e.g., the virtual server system 10), why an operation failed, or the like.
In an example embodiment, a listener object (an example of which is described in greater detail below) may be passed as a reference to the execute process 1006, the reserve process 1004, and the release process 1012 to receive messages regarding the execute process 1006, the reserve process 1004, and the release process 1012 respectively.
The description 1010 may describe a server change being implemented by the server change object 1000. For example, when the change manager 602 is to execute a server change object 1000, the change manager 602 may get the description 1010 associated with the server change object 1000, run the execute process 1006 on the server change object 1000, and run the get state process 1002 to obtain a state of the execution of the server change object 1000. The job viewer 612 (see
An object may be unlocked by the release process 1012 when a job is cancelled prior to running the execute process 1006 such that additional processes may be run on the object. For example, the release process 1012 may restore an object of the virtual server system 10 to its status prior to the reserve process 1004 being run on the object.
The listener object 1100 may report back information on a server change to the system 600 (see
The add history process 1102 may store information regarding a task in the task history table 610 (see
The change manager 602 (see
In an example embodiment, both the change manager 602 and the server change object 1000 (see
The log process 1106 may store information in one or more logs. For example, the one or more logs may contain information regarding implementation of a server change that may be used for debugging purposes and the listener object 1100 may optionally correlate information received from a job into one or more logs.
In an example embodiment, the listener object 1100 may optionally be an argument for the execute processes 706, 1006, the reserve processes 702, 1004 and the release processes 704, 1012 (see
A requested change (e.g., requested by a user of the system 600 of
At decision block 1204, a determination may be made (such as by the change manager 602 of
If a requested change is to be implemented as a job, the change may be implemented as a change task through the task manager 604 (see
In an example embodiment, when task manager 604 runs a change task, change task may call the execute process 706 (see
If a determination is made at decision block 1204 not to implement the change as a job, the requested change may be implemented through the change manager 602 at block 1208. For example, implementing a change through the change manager 602 at block 1208 may include performing the change directly through the change manager 602 without the use of task manager 604 thereby making additional information regarding the execution of the change unavailable. Upon completion of the operations at block 1206 or block 1208, the method 1200 may terminate.
In an example embodiment, running a task through the task manager 604 at block 1206 may provide information regarding the task to the database 606 so that the information regarding the change may be provided to a user in the job viewer 612 (see
A request for a change to the virtual server system 10 may be received at block 1302.
One or more server change objects 1000 associated with the change may be accessed at block 1304. For example, a change group 800 (see
The change may be reserved and optimized within the server system at block 1306. For example, the change may be reserved by running the reserve process 1004 (see
At block 1308, the transitioning change objects may be selected from among the server change objects 1000 for server transitioning. For example, of the virtual server system 10 may be queried to select a number of transitioning change objects from among the number of server change objects 1000 to transition the virtual server system 10 from the current state to an expected state after implementation of the change.
The transitioning change objects may be executed at block 1310. For example, the change may be executed by running the execute process 1006 (see
The state of the change (e.g., of the plurality of server changes) may be extracted at block 1312. For example, the state of the change may be extracted by running the get state process 1002 (see
The extracted state may then be recorded at block 1314. The extracted state may be optionally stored by the add history process 1102 (see
Upon completion of the operations at block 1314, the method 1300 may terminate.
The transitioning change objects may be presented at block 1402. For example, the transitioning change objects 902, 1000 may be presented to a user and/or the server system to seek a confirmation for execution of the transitioning change objects.
At decision block 1404, a determination may be made whether to run the transitioning change objects. If the determination is made to run the transitioning change objects, the transitioning change objects may be executed at block 1406. If the determination is made not to run the transitioning change objects at decision block 1404, or upon completion of the operations at block 1406, the method 1400 may terminate.
In an example embodiment, the determination at decision block 1404 may be made automatically by the method 1400 and/or manually by a user.
A first transitioning change object may be selected at block 1502. An example embodiment of selecting a server change object 902, 1000 from the plurality of change elements 900 as a transiting change object is described in greater detail below.
A server change of a transitioning change object 902 may be performed at block 1504. For example, the server change may be performed by running the execute process 1006 (see
At decision block 1506, a determination may be made as to whether the server change of an executed change object was completed successfully. The determination as to whether the server change was completed successfully or not may optionally be based on a state obtained from the get state process 1002 (see
If the server change was determined to be successfully completed at decision block 1506, the method 1500 may proceed to decision block 1512. However, if the server change was determined not to be successfully completed at decision block 1506, the method 1500 may proceed to decision block 1508 (e.g., to determine whether the completion criterion has been met).
At decision block 1508, a determination may be made as to whether the completion status criterion 802 (see
A determination may be made at decision block 1510 as to whether the continue-on-failure state 808 (see
At decision block 1512, a determination may be made as to whether another transitioning change object is available (e.g., from the plurality of change elements 900). If another transitioning change object is available, an available transitioning change object may be selected at block 1514 and the method 1500 may return to block 1504. If another transitioning change object is not available at decision block 1512, the method 1500 may terminate.
A first element of a current change group 900 may be accessed at block 1602. At decision block 1604, a determination may be made as to whether the current element is a nested change group 904 (see
If the current element is not a nested change group 904, the current element may be the change object 902 of the current change group 900 and may be utilized (e.g., as a transitioning change object) at block 1608.
At decision block 1610, a determination may be made as to whether there is another element in the current change group 900. If there is another element (e.g., the nested change group 904 or the change object 902), the next element in change group 900 may be selected at block 1612 and the method 1600 may return to decision block 1604. If there is not another element, the method 1600 may proceed to decision block 1614.
A determination may be made at decision block 1614 as to whether there is a parent change group 900 (e.g., whether the current change group 900 is a nested change group 904). If there is a parent server change group 800, the parent server change group 800 may be selected as the current server change group 800 at block 1616 and the method 1600 may return to decision block 1610. If there is not a remaining parent server change group 800, the method 1600 may terminate.
In an example embodiment, the elements of a change group 900 may be accessed in a sequential, parallel and/or multithreaded environment. The embodiments are not restricted to these access methods.
The example computer system 1700 includes a processor 1702 (e.g., a central processing unit (CPU) a graphics processing unit (GPU) or both), a main memory 1704 and a static memory 1706, which communicate with each other via a bus 1708. The computer system 1700 may further include a video display unit 1710 (e.g., a liquid crystal display (LCD) or a cathode ray tube (CRT)). The computer system 1700 also includes an alphanumeric input device 1712 (e.g., a keyboard), a cursor control device 1714 (e.g., a mouse), a disk drive unit 1716, a signal generation device 1718 (e.g., a speaker) and a network interface device 1720.
The disk drive unit 1716 includes a machine-readable medium 1722 on which is stored one or more sets of instructions (e.g., software 1724) embodying any one or more of the methodologies or functions described herein. The software 1724 may also reside, completely or at least partially, within the main memory 1704 and/or within the processor 1702 during execution thereof by the computer system 1700, the main memory 1704 and the processor 1702 also constituting machine-readable media.
The software 1724 may further be transmitted or received over a network 1726 via the network interface device 1720.
While the machine-readable medium 1722 is shown in an example embodiment to be a single medium, the term “machine-readable medium” should be taken to include a single medium or multiple media (e.g., a centralized or distributed database, and/or associated caches and servers) that store the one or more sets of instructions. The term “machine-readable medium” shall also be taken to include any tangible medium that is capable of storing or encoding a set of instructions for execution by the machine and that cause the machine to perform any one or more of the methodologies of the present invention. The term “machine-readable medium” shall accordingly be taken to include, but not be limited to, solid-state memories, and optical and magnetic media.
Although example embodiments of the present invention have been described with reference to specific example embodiments, it will be evident that various modifications and changes may be made to these embodiments without departing from the broader spirit and scope of the invention. Accordingly, the specification and drawings are to be regarded in an illustrative rather than a restrictive sense.
This application claims the priority benefit of U.S. Provisional Application No. 60/805,855, filed 26 Jun. 2006, which is incorporated herein by reference.
Number | Name | Date | Kind |
---|---|---|---|
6308315 | Dice et al. | Oct 2001 | B1 |
6421711 | Blumenau et al. | Jul 2002 | B1 |
6499063 | Chessell et al. | Dec 2002 | B1 |
6631395 | Chessell | Oct 2003 | B1 |
6801949 | Bruck et al. | Oct 2004 | B1 |
7069296 | Moller et al. | Jun 2006 | B2 |
7076801 | Gong et al. | Jul 2006 | B2 |
7111300 | Salas et al. | Sep 2006 | B1 |
7213065 | Watt | May 2007 | B2 |
7356679 | Le et al. | Apr 2008 | B1 |
7366742 | Umbehocker et al. | Apr 2008 | B1 |
7379990 | Tsao | May 2008 | B2 |
7546354 | Fan et al. | Jun 2009 | B1 |
7706303 | Bose et al. | Apr 2010 | B2 |
7747816 | Nourmohamadian et al. | Jun 2010 | B1 |
20020001307 | Nguyen et al. | Jan 2002 | A1 |
20020165961 | Everdell et al. | Nov 2002 | A1 |
20020188870 | Gong et al. | Dec 2002 | A1 |
20030005028 | Dritschler et al. | Jan 2003 | A1 |
20030018927 | Gadir et al. | Jan 2003 | A1 |
20030217131 | Leslie et al. | Nov 2003 | A1 |
20040054725 | Moller et al. | Mar 2004 | A1 |
20040141005 | Banatwala et al. | Jul 2004 | A1 |
20040236633 | Knauerhase et al. | Nov 2004 | A1 |
20050120160 | Plouffe et al. | Jun 2005 | A1 |
20050262505 | Esfahany et al. | Nov 2005 | A1 |
20060041660 | Bishop et al. | Feb 2006 | A1 |
20060107087 | Sieroka et al. | May 2006 | A1 |
20060114917 | Raisch | Jun 2006 | A1 |
20060143617 | Knauerhase et al. | Jun 2006 | A1 |
20060294516 | Winner et al. | Dec 2006 | A1 |
20070027973 | Stein et al. | Feb 2007 | A1 |
20070067435 | Landis et al. | Mar 2007 | A1 |
20070094367 | Esfahany et al. | Apr 2007 | A1 |
20070234334 | Araujo, Jr. et al. | Oct 2007 | A1 |
20070234337 | Suzuki et al. | Oct 2007 | A1 |
20070250608 | Watt | Oct 2007 | A1 |
20070258388 | Bose | Nov 2007 | A1 |
20070283015 | Jackson et al. | Dec 2007 | A1 |
20070297428 | Bose et al. | Dec 2007 | A1 |
20090282404 | Khandekar et al. | Nov 2009 | A1 |
20100228840 | Bose et al. | Sep 2010 | A1 |
Entry |
---|
“U.S. Appl. No. 11/743,537, Preliminary Amendment filed Jun. 17, 2008”, 7 pgs. |
“U.S. Appl. No. 12/754,489, Final Office Action mailed Nov. 20, 2012”, 16 pgs. |
Number | Date | Country | |
---|---|---|---|
20070299906 A1 | Dec 2007 | US |
Number | Date | Country | |
---|---|---|---|
60805855 | Jun 2006 | US |