1. Technical Field
The present invention is directed to a method and apparatus for managing a computer system. More specifically, the present invention is directed to a procedure for dynamically reconfiguring resources of logical partitions.
2. Description of Related Art
Presently, many computer manufacturers design computer systems with partitioning capability. To partition a computer system is to divide the computer system's resources (i.e., memory devices, processors etc.) into groups; thus, allowing for a plurality of operating systems (OSs) to be concurrently executing on the computer system.
Partitioning a computer system may be done for a variety of reasons. Firstly, it may be done for consolidation purposes. Clearly consolidating a variety of computer systems into one by running multiple application programs that previously resided on the different computer systems on only one reduces (i) cost of ownership of the system, (ii) system management requirements and (iii) footprint size.
Secondly, partitioning may be done to provide production environment and test environment consistency. Once a new OS or application program has been tested and certified, the test partition may simply be designated as a production partition, with resources diverted to it as needed. This, in turn, may inspire more confidence that an application program that has been tested successfully will perform as expected.
Thirdly, partitioning a computer system may provide increased hardware utilization. For example, when an application program does not scale well across large numbers of processors, running multiple instances of the program on separate smaller partitions may provide better throughput.
Fourthly, partitioning a system may provide application program isolation. When application programs are running on different partitions, they are guaranteed not to interfere with each other. Thus, in the event of a failure in one partition, the other partitions will not be affected. Furthermore, none of the application programs may consume an excessive amount of hardware resources. Consequently, no application programs will be starved out of required hardware resources.
Lastly, partitioning provides increased flexibility of resource allocation. A workload that has resource requirements that vary over a period of time may be managed more easily if it is being run on a partition. That is, the partition may be easily altered to meet the varying demands of the workload.
Computer systems with partitioning capability generally use a partitioning management tool. The partitioning management tool is usually associated with the hardware of the system and interacts with a service processor embedded in the computer system. One particular partitioning management tool is a hardware management console (HMC). The HMC is used to configure and monitor partitions.
Presently, however, one resource is handled at a time when a partition is being reconfigured. This can be a rather time-consuming endeavor when a plurality of partitions each containing numerous resources have to be reconfigured.
Therefore, what is needed is a method, apparatus and system for handling multiple resources when a partition is being reconfigured.
The present invention provides a method, system and apparatus for dynamically reconfiguring partitions of a partitioned computer system. Here, dynamically reconfiguring a partition is defined as being able to reconfigure the partition while the partition is operational. Using a graphical user interface (GUI), a user may indicate a plurality of resources that are to be involved in a partition reconfiguration. However, the partition is actually reconfigured one resource at a time. If one of the resources is memory, then memory is allocated to the partition or de-allocated from the partition one logical memory block (LMB) at a time. In reconfiguring partitions, resources may be removed, added or moved from one partition to another. Moving resources from one partition to another entails removing the resources from one partition and adding it to the other partition. After a partition re-configuration, a resource allocation table is updated. All this is done transparently to the user.
The novel features believed characteristic of the invention are set forth in the appended claims. The invention itself, however, as well as a preferred mode of use, further objectives and advantages thereof, will best be understood by reference to the following detailed description of an illustrative embodiment when read in conjunction with the accompanying drawings, wherein:
With reference now to the figures,
In the depicted example, server 104 is connected to network 102 along with storage unit 106. In addition, clients 108, 110, and 112 are connected to network 102. These clients 108, 110, and 112 may be, for example, personal computers or network computers. In the depicted example, server 104 provides data, such as boot files, operating system images, and applications to clients 108, 110 and 112. Clients 108, 110 and 112 are clients to server 104. Network data processing system 100 may include additional servers, clients, and other devices not shown. In the depicted example, network data processing system 100 is the Internet with network 102 representing a worldwide collection of networks and gateways that use the TCP/IP suite of protocols to communicate with one another. At the heart of the Internet is a backbone of high-speed data communication lines between major nodes or host computers, consisting of thousands of commercial, government, educational and other computer systems that route data and messages. Of course, network data processing system 100 also may be implemented as a number of different types of networks, such as for example, an intranet, a local area network (LAN), or a wide area network (WAN).
Referring to
Peripheral component interconnect (PCI) bus bridge 214 connected to I/O bus 212 provides an interface to PCI local bus 216. A number of modems may be connected to PCI local bus 216. Typical PCI bus implementations will support four PCI expansion slots or add-in connectors. Communications links to network computers 108, 110 and 112 in
Those of ordinary skill in the art will appreciate that the hardware depicted in
The data processing system depicted in
With reference now to
An operating system runs on processor 302 and is used to coordinate and provide control of various components within data processing system 300 in
Those of ordinary skill in the art will appreciate that the hardware in
As another example, data processing system 300 may be a stand-alone system configured to be bootable without relying on some type of network communication interface, whether or not data processing system 300 comprises some type of network communication interface. As a further example, data processing system 300 may be a Personal Digital Assistant (PDA) device, which is configured with ROM and/or flash ROM in order to provide non-volatile memory for storing operating system files and/or user-generated data.
The depicted example in
The present invention provides an apparatus, system and method of handling multiple resources when configuring a partition. The invention may be local to client systems 108, 110 and 112 of
As shown, when a computer system is partitioned its resources are divided among the partitions. The resources that are not assigned to a partition are not used. More specifically, a resource may either belong to a single partition or not belong to any partition at all. If the resource belongs to a partition, it is known to and is only accessible to that partition. If the resource does not belong to any partition, it is neither known to nor is accessible to any partition.
The computer system ensures that the resources assigned to one partition are not used by another partition through an LPAR allocation table managed by the HMC. The LPAR allocation table is stored in the NVRAM (non-volatile random access memory) of a service processor.
Note that
The hypervisor 608 is stored in a system flash module in the server hardware. During system initialization, the hypervisor 608 is loaded into the first physical address region of system memory. The hypervisor program is trusted to create partition environments, and is the only program that can directly access special processor registers and translation table entries. Partition programs have no way to access the hypervisor's instructions or data, other than through controlled hypervisor service calls that are part of the processor architecture. These protections allow the hypervisor to perform its duties in a simple and rigorous manner, resulting in the confinement of each operating system to a very tight, inescapable box.
Once the partitions are configured, they can be reconfigured by reassigning resources from one partition to another or by assigning unallocated resources to a partition or partitions. In the present invention, the command drmgr (dynamic reconfiguration manager) is used to reconfigure the partitions. This command allows resources to be added or taken away from a partition dynamically (i.e., while the partition is operational). However, only one resource can be added or taken away from a partition at a time. The present invention allows, in the point of view of a user, multiple resources to be added or taken away from a partition at a time.
The HMC provides a user-friendly management interface for performing hardware platform operations, supporting both local and remote user accesses. An HMC user must have a valid user ID and password that has been set up by an HMC administrator. Each HMC user has an assigned role that determines the set of operations that the user is allowed to perform.
a) is a GUI that may be used to assign or re-assign CPU or CPUs when a partition is being reconfigured. Title bar 800 contains the name of the partition that is being reconfigured. As shown by operation area 805, a user may choose to move, add or remove CPU or CPUs from the partition. In this particular example, CPUs will be moved from the partition identified in title bar area 800 to the partition identified in box 850. The number of CPUs used in the partition is 4 (see numeral 815). The user is required to put the number of CPUs to be moved in box 820. Once this is completed, the user may assert ok button 835 to proceed with the re-configuration or button 840 to abort and close the session. If help is needed, the user may assert help button 845. Task timeout 830 is used to enter a number of minutes that may not be exceeded when the partition is being reconfigured (i.e., after the user has asserted the ok button 835).
b) is a GUI that is displayed when CPU or CPUs are to be added to a partition that is being reconfigured. Here the number of free CPUs (i.e., unassigned) that there are in the computer system is displayed (see numeral 820) and the user has to indicate the number of CPUs that are to be added (see numeral 825).
a) is a GUI that may be used to assign or re-assign memory space when a partition is being reconfigured. Title bar 900 contains the name of the partition that is being reconfigured. As shown by operation area 905, a user may choose to move, add or remove memory space from the partition. In this particular example, memory space is being moved from the partition identified in title bar area 900 to the partition identified in box 925. The amount of memory space used in the partition is 4 GB and 512 MB (see numeral 915). The user is required to put the amount of memory space to be moved in box 920. Once this is completed, the user may assert ok button 940 to proceed with the re-configuration or button 945 to abort and close the session. If help is needed, the user may assert help button 950. Task timeout 935 is used to enter a number of minutes that may not be exceeded when the partition is being reconfigured (i.e., after the user has asserted the ok button 940).
b) is a GUI that is displayed when memory space is to be added to a partition that is being reconfigured. Here, the amount of free memory space (i.e., unassigned) that there is in the computer system is displayed (see numeral 920) and the user has to indicate the amount of memory space that is to be added to the partition (see numeral 925).
a) is a GUI that may be used to assign or re-assign I/O adapters when a partition is being reconfigured. Title bar 1000 contains the name of the partition that is being reconfigured. As shown by operation area 1005, a user may choose to move, add or remove I/O adapters from the partition. In this particular example, I/O adapters are being moved from the partition identified in title bar area 1000 to the partition identified in box 1015. The I/O adapters used in the partition is shown in window 1025. The user is required to choose the I/O adapter(s) to be moved in box 1025. Once this is completed, the user may assert ok button 1035 to proceed with the re-configuration or button 1040 to abort and close the session. If help is needed, the user may assert help button 1045. Task timeout 1030 is used to enter a number of minutes that may not be exceeded when the partition is being reconfigured (i.e., after the user has asserted the ok button 1035).
b) is a GUI that is displayed when I/O adapter or adapters are to be added to a partition that is being reconfigured. Here, the free I/O adapters are shown in window 1020 and the user has to indicate the amount of memory space that is to choose.
During configuration or re-configuration, the HMC may send as many drmgr commands as there are resources to be added, moved or removed from a partition. For example, for each individual resource that is to be dynamically removed from a partition during a reconfiguration transaction, the HMC will issue an appropriate drmgr command to the target partition to release the resource. When the operating system running on that partition receives the command, it will obey the command and notify the hypervisor of its compliance with the request. The hypervisor will in turn update the LPAR allocation table in the NVRAM, marking each relinquished resource as being in a stopped state. The HMC will then be notified asynchronously of the new states of the resources for display to the user. Then the HMC will mark each resource that is in a stopped state as being unassigned.
Likewise, for each individual resource that is to be added to a partition, a drmgr command will be issued. However, before the command is issued, the HMC will scour the LPAR allocation table looking for unassigned resources or resources assigned to partitions that are not currently running. When the needed resource is found, the HMC will assign it to the target LPAR. After doing so, the appropriate drmgr command will be issued to the target LPAR enabling the LPAR to acquire the resource. In the case of a move transaction, the resource to be moved will first be removed from the appropriate partition and then added to the target partition.
Note that the above-described procedures apply to CPU and I/O slots reconfigurations. For memory reconfigurations, the allocation table in the NVRAM is first adjusted to the new desired memory allocation of the target LPAR. Then, a drmgr command is issued for each logical memory block (LMB) to be dynamically reconfigured.
As in the prior art, a user may enter the drmgr command manually at the command line to reconfigure a partition. When this is done, the user will have to manually update the LPAR allocation table.
The above-described procedure for reconfiguring LPARs has many advantages. For example, it allows a user to perform dynamic reconfiguration (DR) operations of multiple resources at once instead of doing so one at a time. In addition, the HMC is able to keep track of successful DR operations and unsuccessful ones in real-time. Furthermore, the move feature allows resources to be moved dynamically across partitions in a manner that is totally transparent to the user. Particularly, the remove and add aspects of the move feature are abstracted from the user.
If the reconfiguration involves either a CPU or I/O slot, then an appropriate drmgr command is issued to OS running in the target partition to release the resource. As mentioned before, when the OS has carried out the order, it will inform the hypervisor. The hypervisor then will mark the resource as being in a stopped state and notify the HMC. The HMC then will wait until it receives notification from the hypervisor. Once it receives the notification from the hypervisor, it will mark and display the resource as being unassigned. If it is the last resource to be removed, the process will end. If not the process will return to step 1125 (steps 1105, 1125-1150).
If the reconfiguration involves either a CPU or I/O slot, then the HMC will look for the appropriate resource that is either unassigned. When a resource is found, it will be assigned to the target LPAR and an appropriate drmgr command will be issued to the OS running in the LPAR to acquire the resource. After the OS has acquired the resource, the LPAR allocation table will be updated and the process ends if it is the last resource to be added. if not, the process returns to step 1225 (steps 1205, 1225-1245).
The description of the present invention has been presented for purposes of illustration and description, and is not intended to be exhaustive or limited to the invention in the form disclosed. Many modifications and variations will be apparent to those of ordinary skill in the art. The embodiment was chosen and described in order to best explain the principles of the invention, the practical application, and to enable others of ordinary skill in the art to understand the invention for various embodiments with various modifications as are suited to the particular use contemplated.
Number | Name | Date | Kind |
---|---|---|---|
6647508 | Zalewski et al. | Nov 2003 | B2 |
6725317 | Bouchier et al. | Apr 2004 | B1 |
6871264 | Soltis, Jr. | Mar 2005 | B2 |
Number | Date | Country |
---|---|---|
64-002145 | Jan 1989 | JP |
03-282828 | Dec 1991 | JP |
07-295841 | Nov 1995 | JP |
10-293695 | Nov 1998 | JP |
2000-172515 | Jun 2000 | JP |
2000-311129 | Nov 2000 | JP |
Number | Date | Country | |
---|---|---|---|
20040003063 A1 | Jan 2004 | US |