1. Field of the Invention
This invention relates in general to network storage systems, and more particularly to a method, apparatus and program storage device for providing automatic performance optimization of virtualized storage allocation within a virtualized storage subsystem.
2. Description of Related Art
In enterprise data processing arrangements, such as may be used in a company, government agency or other entity, information is often stored on servers and accessed by users over, for example, a network. The information may comprise any type of information that of programs and/or data to be processed. Users, using their personal computers, workstations, or the like (generally, “computers”) will enable their computers to retrieve information to be processed, and, in addition, to store information, for example, on remote servers.
Generally, servers store data in mass storage subsystems that typically include a number of disk storage units. Data is stored in units, such as files. In a server, a file may be stored on one disk storage unit, or alternatively portions of a file may be stored on several disk storage units. A server may service access requests from a number of users concurrently, and it will be appreciated that it will be preferable that concurrently serviced access operations be in connection with information that is distributed across multiple disk storage units, so that they can be serviced concurrently. Otherwise stated, it is generally desirable to store information in disk storage units in such a manner that one disk drive unit not be heavily loaded, or busy servicing accesses, and while others are lightly loaded or idle.
A computer network of a business may have multiple storage networks that are located remote from one another and a business user. The storage networks may also be hosted on different types of systems. To perform the job correctly, the business user may require fast and reliable access to the data contained in all of the storage networks. Information Technology (IT) employees must be able to provide high-speed, reliable access to the business users.
Storage area networks (SANs) are high-speed, high-bandwidth storage networks that logically connect the data storage devices to servers. The business user, in turn, is typically connected to the data storage devices through the server. SANs extend the concepts offered by traditional server/storage connections and deliver more flexibility, availability, integrated management and performance. SANs are the first IT solutions to allow users access to any information in the enterprise at any time. Generally the SAN includes management software for defining network devices such as hosts, interconnection devices, storage devices, and network attach server (NAS) devices. The SAN management software also allows links to be defined between the devices. Within SANs software can define virtual storage where data is stored across a number of storage disks while being characterized as a single virtual disk.
One important component in reaching this goal of providing high-speed, reliable access to the business users is to allow the SAN to be fully understood by those designing and maintaining the SAN. It is often difficult to quickly understand the SAN due to its complexity. Tools that allow the configuration of the SAN and virtual systems within the SAN to be understood and changed quickly are beneficial.
One of the advantages of a SAN is the elimination of the bottleneck that may occur at a server, which manages storage access for a number of clients. By allowing shared access to storage, a SAN may provide for lower data access latencies and improved performance. However, in a large storage network such as SAN attached storage, it is difficult for a storage administrator to know where to allocate an increment of storage so that the newly allocated space achieves the best possible performance, due to the complexity of the network which can include a number of virtualized storage subsystems, the complexity of analyzing workloads, and that physical storage attributes may be hidden from the application.
In the past, storage allocation for large storage environments has been performed manually. Storage management software that can allocate or recommend where to allocate storage based on a number of algorithms is available. Nevertheless, these algorithms do not actually attempt to satisfy production performance requirements within the constraints of available storage including virtual storage systems within SANs.
It can be seen that there is a need for a method, apparatus and program storage device for providing automatic performance optimization of virtualized storage allocation within a virtualized storage system.
To overcome the limitations in the prior art described above, and to overcome other limitations that will become apparent upon reading and understanding the present specification, the present invention discloses a method, apparatus and program storage device for providing automatic performance optimization of virtualized storage allocation within a virtualized storage subsystem.
An embodiment of the present invention includes a program storage device. The program storage devices comprises program instructions executable by a processing device to perform operations for managing storage allocation in a virtual storage system, the operations including defining workload profiles, determining performance characteristics of managed disks, determining relationships between the managed disks and resource groups and creating a virtual disk comprising a set of the managed disks based on resource groups that the managed disks are allocated to.
In another embodiment of the present invention, a device for managing storage allocation in a virtual storage system is provided. The device includes a memory for storing storage system information and a processor coupled to the memory. The processor is configured to provide a user interface for use in defining workload profiles and a virtual disk allocator for determining performance characteristics of managed disks, for determining relationships between managed disks and resource groups based on user defined or automated inputs and for creating a virtual disk comprising a set of the managed disk considering the resource groups to which the managed disks are allocated.
In another embodiment of the present invention, a method for managing storage allocation in a virtual storage system is provided. The method includes obtaining user defined workload profiles, determining performance characteristics of managed disks, determining relationships between managed disks and resource groups based on user defined or automated inputs, and creating a virtual disk comprising a set of the managed disk considering the resource groups to which the managed disks are allocated.
In another embodiment of the present invention, a volume provisioning advisor for managing storage allocation in a virtual storage system is provided. The volume provisioning advisor includes a user interface for use in defining workload profiles and a virtual disk allocator, operatively coupled to the user interface, the virtual disk allocator determining the performance characteristics of managed disks, determining relationships between managed disks and resource groups based on user defined or automated inputs and creating a virtual disk comprising a set of the managed disks considering the resource groups to which the managed disks are allocated.
In another embodiment of the present invention, another volume provisioning advisor is provided. This embodiment of a volume provisioning advisor includes means for obtaining user defined workload profiles, means for determining performance characteristics of managed disks, means for determining relationships between managed disks and resource groups based on user defined or automated inputs, and means for creating a virtual disk comprising a set of the managed disk considering the resource groups to which the managed disks are allocated.
In another embodiment of the present invention, another volume provisioning advisor is provided. This embodiment of a volume provisioning advisor includes means for obtaining user defined workload profiles, means for determining performance characteristics of managed disks and means for creating a virtual disk comprising a set of the managed disk considering the resource groups to which the managed disks are allocated.
These and various other advantages and features of novelty which characterize the invention are pointed out with particularity in the claims annexed hereto and form a part hereof. However, for a better understanding of the invention, its advantages, and the objects obtained by its use, reference should be made to the drawings which form a further part hereof, and to accompanying descriptive matter, in which there are illustrated and described specific examples of an apparatus in accordance with the invention.
Referring now to the drawings in which like reference numbers represent corresponding parts throughout:
In the following description of the embodiments, reference is made to the accompanying drawings that form a part hereof, and in which is shown by way of illustration the specific embodiments in which the invention may be practiced. It is to be understood that other embodiments may be utilized because structural changes may be made without departing from the scope of the present invention.
The present invention provides a method, apparatus and program storage device for providing automatic performance optimization of virtualized storage allocation within a virtualized storage subsystem.
The network shown in
In
As networks such as shown in
In the SAN 200 of
The virtual disk allocator 272 satisfies requests for storage within the network of storage elements in such a way as to meet the performance requirements specified with the request, or through a storage policy mechanism. The virtual disk allocator 272 can operate in environments such as IBM 2145 SAN Volume Controller (SAN VC), which is a virtualized storage subsystem. The virtual disk allocator 272 determines performance characteristics of managed disks. The virtual disk allocator 272 determines relationships between managed disks and resource groups based on user defined or automated input, and creates a virtual disk that includes a set of the managed disks, taking into consideration the resource groups, and the resource group storage resources such as cache and data paths, to which the managed disks are allocated.
The virtual disk allocator 272 extends the policy-based aspects to Open System Environments and automates the selection of storage elements within virtualized storage subsystems to meet performance requirements. Recommending the selected storage elements within the virtualized storage system allows for optimal usage of striped or composite volumes supported by the OS or Volume Manager software, or applications (such as database applications) which support the concept of striped volumes, such as DB2 and other database products The virtual disk allocator 272 also extends the notions of allocating storage taking into consideration long-term data usage patterns. The virtual disk allocator 272 incorporates various algorithms required to make intelligent choice of data placement.
The virtual disk allocator 272 may make determinations of which nodes, i.e., engines such as the virtualization engine 274, may access the data, and which managed disk groups (MDGs), groups of disks supporting a virtual disk, would compose the LUNs to be selected. Within the MDG is at least one managed disk, which is used by a virtualization engine 274 and volume manager 276 to stripe data within the virtual disk, which is comparable to logical disks in Enterprise Storage Systems (ESS). The virtual disk allocator 272 can thus select a LUN or a plurality of LUNs in multiple resource groups across multiple storage elements in order to meet the customer's desired level of performance.
The administrator may perform a calibration process 278 to discover the performance capabilities of the underlying disks. This would entail running specific tests to discover the performance parameters of those groups of disks, as opposed to merely using specific knowledge about the performance capabilities of the disks organized in specific configurations in resource groups.
It is almost impossible to make intelligent data placement decisions without having a rudimentary understanding of the application workload requirements, or at least making reasonable assumptions about those workloads. For example, if a user asks for 100 GB of storage, a light performance requirement might allow allocating a single 100 GB logical disk, whereas a high performance application might require allocating ten 100 GB logical disks across 10 disk arrays, and striping of data across those logical disks. Unfortunately, when most customers are asked what their workloads look like, they usually have no idea.
First, canned workload profiles may be provided 410. Referring to
Workload profiles may also be automatically created based on observations of a customer's workload 412. Since every customer's workload has unique attributes, better workload assumptions can be obtained by observing storage access patterns in the customer's environment. Referring to
Workload profiles may also be provided by intelligent software components 414. Referring to
One mechanism for determining the performance capabilities of managed disks includes a manual input approach 510. This approach is a simple, but less desirable approach to understanding managed disk performance capabilities. For example, a small number of managed disk performance profiles could be defined (e.g., “mirrored disk”, “RAID-5 array with 8 disks.” Each of these profiles would have specific default performance attributes. The administrator would then select the appropriate profile that matches the managed disk configuration. This approach would narrow some of the very wide differences in managed disk performance capability.
Another approach to understanding the performance capabilities of managed disks involves a managed disk configurator 512. Device-specific configurators could provide input on the performance characteristics for a finite set of “understood” disk controllers and the relationship between the managed disks and the storage controllers. For example, a FAStT900 configurator could provide a uniform technique of creating RAID arrays, LUNs, and managed disk on a subsystem, and then provide the managed disk performance assumptions for those managed disk. Configurators could also dynamically recognize the relationship between the managed disk and the storage controllers.
In another approach, controlled calibration 514 is used to run a specified I/O load against the managed disks while observing the behavior of managed disks. Ordinarily, controlled calibration would be executed before space is allocated on the managed disks, although it could be executed afterward if space were reserved.
In an uncontrolled calibration approach 516, the performance behavior of managed disks is analyzed as applications run normal workloads against managed disks. This approach would be used if managed disks were already assigned (and there is no free space reserved for calibration). It is also useful to determine if conditions have changed since the controlled calibration step (e.g., the managed disk is running in degraded condition). As an example, the uncontrolled calibration might observe that response times are consistently high at particular load points, assuming that this load point more accurately reflects the maximum throughput capability.
Another approach uses standardized interfaces 518 which may allow the SMIS specification to provide a conceptual model that identifies LUNs and the LUN's associations with physical resources, as well as static performance capabilities of those elements.
Other approaches similar to those mentioned above can be used to make allocation decisions. Similarly, combinations of the above-mentioned approaches can be chosen. For example, administrators might choose to run a controlled calibration step against the first storage controller of a particular type configured behind the SAN VC, and then manually assign attributes for subsequent controllers of that same type, e.g., “these managed disks look just like those managed disks”.
The workload parameters used by the virtual disk allocator 272 are selected based on their ability to accurately predict disk storage performance, and based on their general availability though data collection tools. The workload parameters used include maximum throughputs for each of the managed disks, maximum random reads and writes per second, maximum mixed reads and writes per second, maximum sequential reads and writes per second, maximum mixed sequential reads and writes per second and latency at low and maximum loads.
Mixed reads and writes are important to determine whether the component has bidirectional capability. The short block random operations are used to determine processing capabilities and costs, and the large block sequential metrics are used to determine data transfer capabilities and costs.
These maximum throughputs are then used to determine the overhead per read or write operation, and per read or write megabyte transferred. The overheads are then applied to the projected workload characteristics to determine projected utilizations of the managed disks.
Because managed disks have individual performance capabilities, and have different performance capabilities when taking into account the performance capabilities of other resources associated with those managed disks, the virtual disk allocator 272 defines resource groups, each having associated performance attributes for a set of managed disks that belong to the resource group. By taking into account the managed disk performance capabilities and the performance capabilities of their associated resources, the overall performance of the managed disks can be determined. These other resources might include the physical disks or controllers on which the managed disks reside. However, it is not necessary to identify all of the possible resources associated with the managed disks. Instead, the virtual disk allocator 272 will generalize associations with resources when defining a resource group.
Each resource group is given performance attributes (same as for managed disks). The performance capacity of resource groups can be specified manually or determined through a controlled benchmark procedure called controlled calibration. Calibration can be executed against managed disks identified and at a time identified by an administrator. The calibration may be executed against managed disks before they are assigned to virtual disks. This could also be contained within an operational procedure that automates creation of managed disks on specified storage controllers. The calibration would be executed against individual managed disks and all of the associated resource groups as long as all of the managed disks in the groups are not assigned. The technique used in calibration is similar to those used in conducting engineering measurements against competitive disk subsystems.
Because managed disks can belong to more than one resource group, calibration of resource groups allows optimization of data allocation decisions. However, it is not necessary to discover all of the potential relationships between managed disks and associated resources, and no impact results when suspected relationships do not exist. Various implementations of the virtual disk allocator 272 could build-in varying levels of sophistication in discovering the relationships.
Data structure 600 is used to provide a goal-oriented approach to storage allocation. As an example, the SAN VC presents a somewhat different model of storage than the traditional storage controller. The application server is presented with a view of virtual disks, comparable to logical disks in IBM Enterprise Storage Servers (ESS). Virtual disk allocator 272 creates a virtual disk by identifying a set of performance properties associated with a set of managed disks and resource groups, including throughput capabilities for specific workloads and estimating resource utilization derived from the identified set of performance properties associated with a set of managed disks and resource groups. A virtual disk is accessed, for example, through a SAN VC primary node, and data for the virtual disk is cached in that node, a function comparable to an ESS cluster. Referring to
Each of the managed disks 610, 620, 630 and 640 might be allocated as a portion of a single physical disk shared with other applications, or allocated as an entire RAID array consisting of several disks. In addition, storage administrators are given tremendous flexibility in selection of managed disk configurations. As a result of associations with other applications, activity on one managed disk can affect the performance of another managed disk, such as when they reside on the same storage device or behind the same storage controller. With reference to
In accordance with an embodiment of the invention, each managed disk is associated with one or more resource groups (RGs). RG 1660, RG 2670 and RG 3680 are defined by the virtual disk allocator 272. RG1660 includes managed disk 610 and resources 611, 612 and 613. RG2670 includes managed disk 620, 630 and a portion of managed disk 640, and has resources 613, 621, 622, 631, 632 and 641. RG3680 includes a portion of managed disk 640, and resources 632 and 641. As can be seen, resource 613 is shared between managed disk 610 and 620, and resources 632 and 641 are shared between managed disk 630 and 640. Because managed disks and associated resources are grouped into resource groups allowing observation of interactions within and between resource groups, enhanced performance assumptions about the capability and behavior of the managed disks can be made by virtual disk allocator 272.
For example, during the volume selection process, the virtual disk allocator 272 keeps track of utilizations of managed disks and resource groups, similar to the way it maintains those utilizations for the components of the configuration tree in VPA for ESS. When selecting potential managed disks for allocation, the effects are evaluated for all of the resource groups to which a managed disk belongs. The managed disk's utilization is then assumed to be the highest of all the associated RGs. This is similar to the technique of navigating up the configuration tree used by the current version of VPA. The virtual disk allocator 272 makes a recommendation for storage allocation that meets the capacity and the performance requirements specified.
Referring again to
An important aspect of the virtual disk allocator 272 involves the use of the capacity and performance structures to balance storage allocation across available resources. Where multiple choices are possible in virtual disk allocator 272, the capacity and performance structures may be used to bias allocation to one set of resources through the use of pseudo-random numbers. Several sample allocations can be selected in this fashion, and the best among the samples chosen for the answer. This technique prevents the algorithms from making the same recommendation in similar situations, increasing the spread of workloads across available resources. In this way, storage allocations will be biased toward elements in the network that are most capable of handling the specified workload.
The process illustrated with reference to
The foregoing description of the exemplary embodiment of the invention has been presented for the purposes of illustration and description. It is not intended to be exhaustive or to limit the invention to the precise form disclosed. Many modifications and variations are possible in light of the above teaching. It is intended that the scope of the invention be limited not with this detailed description, but rather by the claims appended hereto.