1. Field of the Invention
The present invention relates generally to data processing systems. More specifically, the present invention relates to a computer implemented method, data processing system, and computer usable program code for creating virtual machine images for software.
2. Description of the Related Art
In modern computing environments, it is becoming increasingly popular to distribute software solutions as a set of virtual machine (VM) images as opposed to the various single software components that make up the solutions. A virtual machine image is file representation of a virtual machine, the virtual machine devices, and all the installed software components. Virtualizers, like VMware® Server, instantiate and run virtual machines starting from their file-based representation or image. Distributing software solutions as virtual machine images is very appealing for both software vendors and customers because of the simplified, less error prone, deployment, and maintenance process.
For example, consider software solutions like the IT Service Management (ITSM) that comprises the components and supported topologies as shown in
Instead of distributing the sixteen different major components and required middleware components shown in table 302 in
However, current virtual machine technology is aimed at the process of creating, duplicating, and deploying virtual machines themselves without reference to the application(s) installed in the virtual machine images or to the dependencies among multiple virtual machines that fulfill various parts of a distributed application. Thus, it would be advantageous to have an improved method for managing software with minimal user intervention.
The different illustrative embodiments provide a computer implemented method, data processing system, and computer usable program code to create a set of virtual machine images for software. The illustrative embodiments retrieve a virtual software resource template. The illustrative embodiments copy metadata associated with the virtual software resource template. The illustrative embodiments modify the copy of the metadata to generate personalized metadata for each virtual machine image in the set of virtual machine images. The illustrative embodiments deploy the set of virtual machine images using the personalized metadata.
The novel features believed characteristic of the invention are set forth in the appended claims. The invention itself, however, as well as a preferred mode of use, further objectives and advantages thereof, will best be understood by reference to the following detailed description of an illustrative embodiment when read in conjunction with the accompanying drawings, wherein:
With reference now to the figures,
In the depicted example, server 104 and server 106 connect to network 102 along with storage unit 108. In addition, clients 110, 112, and 114 connect to network 102. These clients 110, 112, and 114 may be, for example, personal computers or network computers. In the depicted example, server 104 provides data, such as boot files, operating system images, and applications to clients 110, 112, and 114. Clients 110, 112, and 114 are clients to server 104 in this example. Network data processing system 100 may include additional servers, clients, and other devices not shown.
In the depicted example, network data processing system 100 is the Internet with network 102 representing a worldwide collection of networks and gateways that use the Transmission Control Protocol/Internet Protocol (TCP/IP) suite of protocols to communicate with one another. At the heart of the Internet is a backbone of high-speed data communication lines between major nodes or host computers, consisting of thousands of commercial, government, educational and other computer systems that route data and messages. Of course, network data processing system 100 also may be implemented as a number of different types of networks, such as for example, an intranet, a local area network (LAN), or a wide area network (WAN).
With reference now to
In the depicted example, data processing system 200 employs a hub architecture including north bridge and memory controller hub (NB/MCH) 202 and south bridge and input/output (I/O) controller hub (ICH) 204. Processing unit 206, main memory 208, and graphics processor 210 are connected to north bridge and memory controller hub 202. Graphics processor 210 may be connected to north bridge and memory controller hub 202 through an accelerated graphics port (AGP).
In the depicted example, local area network (LAN) adapter 212 connects to south bridge and I/O controller hub 204. Audio adapter 216, keyboard and mouse adapter 220, modem 222, read only memory (ROM) 224, hard disk drive (HDD) 226, CD-ROM drive 230, universal serial bus (USB) ports and other communications ports 232, and PCI/PCIe devices 234 connect to south bridge and I/O controller hub 204 through bus 238 and bus 240. PCI/PCIe devices may include, for example, Ethernet adapters, add-in cards and PC cards for notebook computers. PCI uses a card bus controller, while PCIe does not. ROM 224 may be, for example, a flash binary input/output system (BIOS).
Hard disk drive 226 and CD-ROM drive 230 connect to south bridge and I/O controller hub 204 through bus 240. Hard disk drive 226 and CD-ROM drive 230 may use, for example, an integrated drive electronics (IDE) or serial advanced technology attachment (SATA) interface. Super I/O (SIO) device 236 may be connected to south bridge and I/O controller hub 204.
An operating system runs on processing unit 206 and coordinates and provides control of various components within data processing system 200 in
As a server, data processing system 200 may be, for example, an IBM eServer™ pSeries® computer system, running the Advanced Interactive Executive (AIX®) operating system or Linux® operating system (eServer, pseries and AIX are trademarks of International Business Machines Corporation in the United States, other countries, or both while Linux is a trademark of Linus Torvalds in the United States, other countries, or both). Data processing system 200 may be a symmetric multiprocessor (SMP) system including a plurality of processors in processing unit 206. Alternatively, a single processor system may be employed.
Instructions for the operating system, the object-oriented programming system, and applications or programs are located on storage devices, such as hard disk drive 226, and may be loaded into main memory 208 for execution by processing unit 206. The processes for embodiments are performed by processing unit 206 using computer usable program code, which may be located in a memory such as, for example, main memory 208, read only memory 224, or in one or more peripheral devices 226 and 230.
Those of ordinary skill in the art will appreciate that the hardware in
In some illustrative examples, data processing system 200 may be a personal digital assistant (PDA), which is configured with flash memory to provide non-volatile memory for storing operating system files and/or user-generated data.
A bus system may be comprised of one or more buses, such as bus 238 or bus 240 as shown in
An exemplary embodiment provides for managing software by deploying, configuring, and instantiating a software solution distributed as virtual machine images. A virtual machine image consists of an operating system, middleware, and application components. Depending on the topology, some of these components communicate with other components in the same virtual machine image, while other components communicate with components in other images.
Turning back to the figures,
Table 304 shows the various topologies supported by the solution of table 302. The first column contains entries for each of the supported topologies. The second column contains entries for the names of the topology nodes or virtual machine images that comprise the supported topology entry in the first column. The third column contains entries for major components from table 302 that comprise the topology nodes in the entry of the second column. Following the name of each major component in the entry of the third column is the name of the specific topology node in the entry of the second column to which the major component belongs.
The connection between components A and B can be established and configured at the time of the creation of VM1402 while the connection between components B and D needs to be established and configured at the time of deployment.
In an exemplary embodiment, the virtual machine image is treated as a single software component with well identified access points. From this perspective, the specific components in a virtual machine, such as a Linux® operating system, a queue manager, and a database that is only used by the queue manager, is irrelevant. What is relevant is the designed role or capability of the virtual machine, such as a provider of a queue manager service and the access point(s) of the virtual machine. Some examples of access points are protocol, address, port, and any other parameter needed to connect to the virtual machine.
An exemplary embodiment provides for using metadata that describes hardware and software requirements of each virtual machine image, the services the virtual machine exposes with associated access points, the services that the virtual machine requires from other components in order to fulfill the role of the virtual machine in the overall system, and other related metadata. Since the metadata describes hardware and software requirements of each virtual machine image, the metadata may be considered personalized metadata.
An exemplary embodiment provides for a virtual software resource template (VSRT). A virtual software resource template is a collection of one or more freeze-dried software stack(s) with associated metadata. A freeze-dried software stack is a software stack comprised of pre-configured, pre-tuned, and hardened components, such as an operating system, an application server, a database, and a set of applications. A pre-tuned or hardened component is a component whose configuration, all the parameters that influence the behavior of the component at run time, has been already executed and the results stored in the virtual machine image. Examples of metadata information include, but are not limited to, information such as location of freeze-dried software stack volumes, hardware requirements such as CPU, memory, disk, network, and so forth, for hosting the virtual software resource template, external and internal virtual machine image dependencies, policies associated with capacity, configuration parameters for the solution, and so forth.
Virtual software resource templates (VSRTs) may be either simple or composite. A simple virtual software resource template, also called a base virtual software resource template, comprises a single software stack. That is, a base virtual software resource template comprises a virtual machine image and metadata for a single virtual machine. A composite virtual software resource template comprises multiple, finer grained software stacks. That is, a composite virtual software resource template comprises multiple virtual machine images and metadata for multiple virtual machines. Composite virtual software resource templates contain information about assembling a set of virtual software resource templates into a single distributed application or “solution”. Virtual software resource templates may be deployed on a set of physical machines either by running in a virtual container or directly on physical machines like a blade. A virtual container is a program capable of instantiating and running virtual software resource templates, such as VMware® Server. When virtual software resource templates are instantiated on a set of physical machines, one or more virtual software resources (VSRs) are created. A virtual software resource is a resource that is deployed, configured, and running.
An exemplary embodiment provides for a base virtual software resource template comprised of the following metadata information: an identifier, a name, a description, disk image information, hardware profile, software profile, capabilities, configuration, scale-out/clustering, intra-virtual software resource template dependencies, and external dependencies.
An identifier is a globally unique identifier that may act as a primary key for machine identification of the software stack contained in the virtual machine image. A name is a unique name for human identification of the software stack. The description is a short description of the functionality provided by the software stack. Disk image information includes, for example, the location of the disk images. A hardware profile is the virtual machine image hardware requirements. These include, for example, CPU, memory, disk, network, and so forth. This information is used for resource matching during the deployment phase and results in the virtual machine definition.
A software profile includes detailed information about the software installed on the disk images. A disk image is the file image of a virtual disk. A virtual machine can have different virtual disks much like a physical machine can have different hard disks, for example, the OS version and release information. This information is particularly useful when the virtual software resource template is being extended vertically by the application or when the virtual software resource template is being upgraded, for example, by applying fix-packs to the middleware and/or operating system. Fix-packs are fixes software vendors rollout to the bugs found in their software that needs to be applied by customers on top of their existing software installation for the software to work properly. Capabilities are metadata that express or explain the capabilities provided by the software stack, for example, a J2EE™ application server or a servlet container and so forth. Configuration information is metadata that defines data that is needed for the solution to be customized and configured. The configuration data is provided during the deployment of the virtual software resource template.
Scale-out/clustering information is metadata that identifies the policies associated with scale-out of virtual software resource templates. A computer “cluster” is a group of loosely coupled computers that work together closely so that in many respects they can be viewed as though they are a single computer. Clusters are usually deployed to improve speed and/or reliability over that provided by a single computer, while typically being much more cost-effective than single computers of comparable speed or reliability. This technique of grouping computers is called “clustering”. “Scale-out” in the exemplary embodiments is used to define machines, physical or virtual, that are clustered together to increase throughput and improve speed. The scale-out information is useful at runtime for optimal instantiation and placement of virtual machines. The intra-virtual software resource template dependencies are the dependencies between the various software stacks included in the composite virtual software resource template definition. For example, IT Service Management (ITSM) Command Line Interface (CLI) client software stack “uses” IT Service Management data server software stack. External dependencies are the dependencies between a software stack and external components. For example, a WebSphere® application server software stack might use a Lightweight Directory Access Protocol (LDAP) server for authentication.
An exemplary embodiment provides for a composite virtual software resource template comprised of the following metadata information: an identifier, a name, a version, a description, capabilities, capacity, cost of instantiation, and constituent virtual software resource templates. An identifier is a globally unique identifier that could act as a primary key for machine identification of the virtual software resource template. A name is a unique name for the human identification of a virtual software resource template. A version refers to the version of the virtual software resource template. The description is a short description of the functionality provided by the virtual software resource template. The description may function as a miniature README file for the solution. Capabilities are metadata that express or explain the capabilities provided by the virtual software resource template, for example, a J2EE™ application server, a servlet container, and so forth.
The capacity identifies the capacity of the virtual software resource template, provided that the hardware and software requirements are met. An example of capacity is the number of clients that the virtual software resource template is capable of serving. The cost of instantiation is metadata that identifies the cost associated with the instantiation of the virtual software resource template into a virtual software resource (VSR). Cost refers to the time taken to perform the function. The cost information can be used, for example, by a utility function to decide whether creating a new instance of a virtual software resource template in response to an increased workload of short duration would be worthwhile.
The constituent virtual software resource template's metadata comprises information about the virtual software resource template that make up the distributed application, such as, for example, IT Service Management (ITSM) data server, IT Service Management (ITSM) User Interface (UI) server, etc. The constituent virtual software resource template's metadata takes the form of one or more entries. Each entry comprises an identifier and a role. The identifier is the identifier of the base virtual software resource template. The role describes the role the software stack in the base virtual software resource template will play in the solution described by the composite virtual software resource template. For example, a WebSphere® application server software stack may play a role of a deployment manager or an application server or both. It should be noted that the constituent virtual software resource templates referred to in a composite virtual software resource template may themselves be either base or composite virtual software resource templates. Thus, composite virtual software resource templates may be nested.
Turning back to the figures,
Input 502 comprises various inputs including the logical application structure from the application developer, a logical topology, for example, a three (3) tier deployment of a typical web application versus two (2) tier deployment from a user who could be a domain expert, DB2 and other middleware base. VSRT factory 504 also receives base virtual software resource templates that are used as building blocks for other composite virtual software resource templates. For example, WebSphere® Application Server, DB2, and other middleware from VSRT repository 506. The inputs to VSRT factory 504 contain enough information to produce the virtual software resource template metadata required for a particular implementation.
VSRT deployment manager 508 is the consumer of the virtual software resource template metadata and virtual machine images. Apart from retrieving virtual software resource templates from VSRT repository 506, VSRT deployment manager 508 also receives input 510. Input 510 is user input, including input about the physical topology, information about the physical machines in the data center, and so forth. VSRT deployment manager 508 instantiates virtual software resources on the physical systems of data center 512. Virtual software resources are a set of virtual machines with configuration information associated with the software stack.
Next, for each virtual machine image in the virtual software resource template, a virtual machine name is chosen (step 708). The copied metadata is modified by adding the virtual machine names to the copied metadata (step 710). The virtual machine names may be specified manually, may be automatically chosen from a pre-specified list of available machine names, or may be generated dynamically, for example, based on the solution instance name and the role played by the base virtual software resource template.
The copied metadata is modified by adding an Internet Protocol (IP) address to the copied metadata for each virtual machine image (step 712). If dynamic IP addressing is not in use, an IP address is chosen. Next, utilizing the hardware profile information stored in the base virtual software resource template for each virtual machine image, a physical server is selected for each virtual machine image, and initial resource allocations, such as central processing unit, memory, and so forth, are specified and the information is augmented to the copied metadata (step 714). Next, the modified copy of the metadata is saved (step 716).
The virtual machine images are copied and modified as necessary (step 718). Modifications may include, but are not limited to, adding, to a given copy of a virtual machine image, information about the names and/or IP addresses that were chosen for the other virtual machines on which the given virtual machine will depend. Any other configuration parameters that vary from solution-instance to solution-instance are also specified at this time. The modified virtual machine images are deployed and started on the selected physical servers (step 720) and the operation ends. Because virtual machine images were previously configured with their dependency information, the solution instance is fully operational when it starts.
Thus, the exemplary embodiments provide for a simplified method of managing software. The software solution is managed by deploying, configuring, and instantiating the software solution distributed as virtual machine images.
The invention can take the form of an entirely hardware embodiment, an entirely software embodiment or an embodiment containing both hardware and software elements. In a preferred embodiment, the invention is implemented in software, which includes but is not limited to firmware, resident software, microcode, etc.
Furthermore, the invention can take the form of a computer program product accessible from a computer-usable or computer-readable medium providing program code for use by or in connection with a computer or any instruction execution system. For the purposes of this description, a computer-usable or computer readable medium can be any tangible apparatus that can contain, store, communicate, propagate, or transport the program for use by or in connection with the instruction execution system, apparatus, or device.
The medium can be an electronic, magnetic, optical, electromagnetic, infrared, or semiconductor system (or apparatus or device) or a propagation medium. Examples of a computer-readable medium include a semiconductor or solid state memory, magnetic tape, a removable computer diskette, a random access memory (RAM), a read-only memory (ROM), a rigid magnetic disk and an optical disk. Current examples of optical disks include compact disk—read only memory (CD-ROM), compact disk—read/write (CD-R/W) and DVD.
A data processing system suitable for storing and/or executing program code will include at least one processor coupled directly or indirectly to memory elements through a system bus. The memory elements can include local memory employed during actual execution of the program code, bulk storage, and cache memories which provide temporary storage of at least some program code in order to reduce the number of times code must be retrieved from bulk storage during execution.
Input/output or I/O devices (including but not limited to keyboards, displays, pointing devices, etc.) can be coupled to the system either directly or through intervening I/O controllers.
Network adapters may also be coupled to the system to enable the data processing system to become coupled to other data processing systems or remote printers or storage devices through intervening private or public networks. Modems, cable modem and Ethernet cards are just a few of the currently available types of network adapters.
The description of the present invention has been presented for purposes of illustration and description, and is not intended to be exhaustive or limited to the invention in the form disclosed. Many modifications and variations will be apparent to those of ordinary skill in the art. The embodiment was chosen and described in order to best explain the principles of the invention, the practical application, and to enable others of ordinary skill in the art to understand the invention for various embodiments with various modifications as are suited to the particular use contemplated.
Number | Name | Date | Kind |
---|---|---|---|
4674038 | Brelsford et al. | Jun 1987 | A |
5682535 | Knudsen | Oct 1997 | A |
5807790 | Gupta et al. | Sep 1998 | A |
6094528 | Jordan | Jul 2000 | A |
6510448 | Churchyard | Jan 2003 | B1 |
6633916 | Kauffman | Oct 2003 | B2 |
6734873 | Herf et al. | May 2004 | B1 |
6775699 | DeLuca et al. | Aug 2004 | B1 |
6802062 | Oyamada et al. | Oct 2004 | B1 |
6848106 | Hipp | Jan 2005 | B1 |
6917963 | Hipp et al. | Jul 2005 | B1 |
6941410 | Traversat et al. | Sep 2005 | B1 |
7013462 | Zara et al. | Mar 2006 | B2 |
7093247 | Ashworth et al. | Aug 2006 | B2 |
7127713 | Davis et al. | Oct 2006 | B2 |
7131122 | Lakhdhir | Oct 2006 | B1 |
7142848 | Owen et al. | Nov 2006 | B2 |
7150015 | Pace et al. | Dec 2006 | B2 |
7206827 | Viswanath et al. | Apr 2007 | B2 |
7313793 | Traut et al. | Dec 2007 | B2 |
7356679 | Le et al. | Apr 2008 | B1 |
7376693 | Neiman et al. | May 2008 | B2 |
7478387 | Abelite et al. | Jan 2009 | B2 |
7506336 | Ninan | Mar 2009 | B1 |
7506338 | Alpern et al. | Mar 2009 | B2 |
7587453 | Bhrara et al. | Sep 2009 | B2 |
7590983 | Neiman et al. | Sep 2009 | B2 |
7603443 | Fong et al. | Oct 2009 | B2 |
7676803 | Zhao et al. | Mar 2010 | B2 |
7730183 | Brown et al. | Jun 2010 | B2 |
7730480 | Isaacson | Jun 2010 | B2 |
7743363 | Brumme et al. | Jun 2010 | B2 |
7761865 | Stienhans et al. | Jul 2010 | B2 |
7779091 | Wilkinson et al. | Aug 2010 | B2 |
7865888 | Qureshi et al. | Jan 2011 | B1 |
7870153 | Croft et al. | Jan 2011 | B2 |
7908313 | Lurie et al. | Mar 2011 | B2 |
7949677 | Croft et al. | May 2011 | B2 |
7954087 | Zenz et al. | May 2011 | B2 |
8078824 | Sugumar et al. | Dec 2011 | B2 |
8108855 | Dias et al. | Jan 2012 | B2 |
8145751 | Creamer et al. | Mar 2012 | B2 |
8145760 | Dinda et al. | Mar 2012 | B2 |
8191060 | Malasky et al. | May 2012 | B2 |
20020065877 | Kowtko et al. | May 2002 | A1 |
20040010598 | Bales et al. | Jan 2004 | A1 |
20040015968 | Neiman et al. | Jan 2004 | A1 |
20040060048 | Abelite et al. | Mar 2004 | A1 |
20050041588 | Kim et al. | Feb 2005 | A1 |
20050050175 | Fong et al. | Mar 2005 | A1 |
20050191991 | Owen et al. | Sep 2005 | A1 |
20050268280 | Fildebrandt | Dec 2005 | A1 |
20060155674 | Traut et al. | Jul 2006 | A1 |
20060184936 | Abels et al. | Aug 2006 | A1 |
20060184937 | Abels et al. | Aug 2006 | A1 |
20060277542 | Wipfel | Dec 2006 | A1 |
20070006205 | Kennedy et al. | Jan 2007 | A1 |
20070046791 | Wang et al. | Mar 2007 | A1 |
20070089111 | Robinson et al. | Apr 2007 | A1 |
20070124731 | Neiman et al. | May 2007 | A1 |
20070157172 | Zenz et al. | Jul 2007 | A1 |
20070157185 | Semerdzhiev et al. | Jul 2007 | A1 |
20070162892 | Zenz et al. | Jul 2007 | A1 |
20070233881 | Nochta et al. | Oct 2007 | A1 |
20070234334 | Araujo et al. | Oct 2007 | A1 |
20070234337 | Suzuki et al. | Oct 2007 | A1 |
20070234356 | Martins et al. | Oct 2007 | A1 |
20070283314 | Browning et al. | Dec 2007 | A1 |
20070294676 | Mellor et al. | Dec 2007 | A1 |
20070300220 | Seliger et al. | Dec 2007 | A1 |
20070300221 | Hartz et al. | Dec 2007 | A1 |
20080034071 | Wilkinson et al. | Feb 2008 | A1 |
20080127169 | Malasky et al. | May 2008 | A1 |
20080155537 | Dinda et al. | Jun 2008 | A1 |
20080163194 | Dias et al. | Jul 2008 | A1 |
20080184225 | Fitzgerald et al. | Jul 2008 | A1 |
20080222234 | Marchand | Sep 2008 | A1 |
20090089860 | Forrester et al. | Apr 2009 | A1 |
20090113395 | Creamer et al. | Apr 2009 | A1 |
20090276771 | Nickolov et al. | Nov 2009 | A1 |
20100083218 | Bender | Apr 2010 | A1 |
20110119670 | Sugumar et al. | May 2011 | A1 |
20110119748 | Edwards et al. | May 2011 | A1 |
Number | Date | Country |
---|---|---|
2004005650 | Jan 2004 | JP |
2005174201 | Jun 2005 | JP |
2005292956 | Oct 2005 | JP |
03021516 | Mar 2003 | WO |
Number | Date | Country | |
---|---|---|---|
20080163171 A1 | Jul 2008 | US |