Storage systems having differentiated storage pools

Abstract
The systems and methods described herein include among other things, systems for providing a block level data storage service. More particularly, the systems and methods of the invention provide a block level data storage service that provides differentiated pools of storage on a single storage device. To this end, the systems and methods described herein leverage the different performance characteristics across the logical block name (LBN) space of the storage device (or devices). These different performance characteristics may be exploited to support two or more classes of storage on a single device.
Description
BACKGROUND OF THE INVENTION

The systems and methods described herein relate to data storage.


More sophisticated storage technologies, such as RAID, provide differentiated classes of storage. These storage classes may differ in regard to performance, reliability, error detection and other factors. The different storage classes allow the system administrator to store different types of data with different classes of storage.


Although existing systems can work quite well, they generally employ separate storage devices for each class of service. Thus, one storage device can be set to a certain level of performance, such as RAID Level 0, and this device (or devices) can store the appropriate data, such as Photoshop temporary files. Thus, these systems require dedicated devices for the different classes of storage.


Accordingly, there is a need in the art for storage systems that provide differentiated classes of storage without requiring class-dedicated storage devices.


SUMMARY OF THE INVENTION

The systems and methods described herein include among other things, systems for providing a block level data storage service. More particularly, the systems and methods of the invention provide, among other things, a block level data storage service that provides differentiated pools of storage on a single storage device. To this end, the systems and methods described herein leverage the different performance characteristics across the logical block name (LBN) space of the storage device (or devices). These different performance characteristics may be exploited to support two or more classes of storage on a single device.


More particularly, the systems and methods described herein include, in one aspect, systems for providing differentiated classes of storage. Such systems may comprise a storage device having a plurality of storage locations and a logical block name space for organizing the storage locations. A performance process may partition the storage locations into a plurality of regions that provide different levels of performance, and a mapping process may map the partitioned portions of the storage locations to a selected section of the logical block name space.


In certain embodiments, the performance process separates the plurality of storage locations into a plurality of categories being associated with a different level of service, which for example may be associated with a different RAID level of performance. However, those of skill in the art will know that other types of differentiated storage, other than RAID, may be employed, including storage systems that distinguish between media employed, cost and other features or parameters. Moreover, in some embodiments the underlying storage device is a single storage medium, although optionally, the mapping process may create multiple storage volumes at a selected level of performance, and the multiple storage volumes may be associated with one or more storage devices.


Optionally, the system may further comprise a load balancing mover process for moving data between different portions of the logical block name space. The load balancing mover process may include an adaptive data placement process for moving data between storage pools to thereby improve system performance. Further, an admin process may allow an administrator to move data between different storage pools. These systems may move, store and access data blocks and to this end may move data blocks that are organized as files, including files having a directory structure and hierarchy to provide a file system service. In alternate embodiments, the systems may include a process for providing a storage volume service.


In another embodiment, the systems provide storage systems supporting differentiated classes of storage. Such systems can include a storage device having a plurality of storage locations, a logical block name space for organizing the storage locations, and performance parameters of the storage locations that vary across the storage device. The system may further include a partitioning process for partitioning those storage locations into regions as a function variations in performance parameters. The partitioning process may select a fixed set of partitions as a function of a selected configuration of system components. Further, a performance process may associate partitions with different levels of performance, and a mapping process may map the identified partitions of the storage locations to a selected section of the logical block name space.


The systems and methods described herein may be realized in many different forms, including as part of a RAID controller, device driver, operating system, application program, or network service. In one particular embodiment the system is realized as a storage area network, that includes one or more servers executing computer code that configures the server, at least in part, to carry out the processes that provide storage systems supporting differentiated classes of storage.


In another aspect, the invention provides a process for providing differentiated classes of storage. Such methods may comprise the steps of providing one or more storage devices having a plurality of storage locations and a logical block name space for organizing the storage locations. The process may partition the storage locations into a plurality of regions providing different levels of performance, and may map the partitioned portions of the storage locations to a selected section of the logical block name space. Optionally, the process may include the step of separating the plurality of storage locations into a plurality of categories associated with a different level of service, and optionally separating the plurality of storage locations into a plurality of categories being associated with a different RAID level of performance. In one practice the mapping process associates different portions of the logical block name space to different respective levels of RAID.


Optionally, the process may include the step of load balancing by moving data between different portions of the logical block name space. Load balancing may include moving data between storage pools, including storage pools that have different levels of performance, to thereby improve overall system performance, or the performance of certain aspects of the system, such as the overall performance of one class of storage, or to provide better service for one class of data.


To this end, the process may allow an administrator to move data between different storage pools. This can be done through a user interface that provides the administrator with the ability to set parameters, including operating parameters, which will move data between different storage pools.


The mapping step may create multiple storage volumes at a selected level of performance.





BRIEF DESCRIPTION OF THE DRAWINGS

The following figures depict certain illustrative embodiments of the invention in which like reference numerals refer to like elements. These depicted embodiments are to be understood as illustrative of the invention and not as limiting in any way.



FIG. 1 depicts an individual storage device that has different performance characteristics based on a list of LBNs;



FIG. 2 illustrates RAID differentiated storage pools sharing the same storage devices;



FIG. 3 depicts differentiated storage pools by combining individual storage device performance and RAID performance characteristics;



FIG. 4 illustrates extent based and page based allocation of a storage service across differentiated storage pools; and



FIG. 5 depicts in more detail and as a functional block diagram one embodiment of a system according to the invention.





DESCRIPTION OF CERTAIN ILLUSTRATED EMBODIMENTS

The systems and methods described herein include systems for storing, organizing and managing resources. The systems and methods described below may employ a software architecture that builds certain of the processes described below into an operating system, into a device driver, or into a software process that operates on a peripheral device, such as a tape library, a RAID storage system or some other device. In any case, it will be understood by those of ordinary skill in the art that the systems and methods described herein may be realized through many different embodiments, and practices, and that the particular embodiment and practice employed will vary as a function of the application of interest and all these embodiments and practices fall within the scope hereof Moreover, it will be understood by those of ordinary skill in the art that the systems and methods described herein are merely exemplary of the kinds of systems and methods that may be achieved through the invention and that these exemplary embodiments may be modified, supplemented and amended as appropriate for the application at hand.



FIG. 1 depicts a storage device 10 and graphically portrays that different sections of the logical block name (LBN) can have different kinds of performance characteristics. Specifically, in the example depicted by FIG. 1, the LBN of the device 10 is divided into three different performance classes, which are shown respectively as Section 12, 14 and 16. Thus, FIG. 1 is an abstract representation of the entire name space on the disc 10. Although the use of these terms can vary somewhat, it is generally understood that the logical block name space is the full compliment of the adressable locations on the disc and it will be understood that a logical block will be understood as the smallest addressable space on a disc, such as the disc 10. Each logical block may be identified by a unique logical block name (or number), typically assigned in order starting from 0 at the beginning of the disc. Under the ISO 9660 standard, all data on a CD is addressed in terms of logical block numbers. Thus, the device depicted in FIG. 1 is generally understood to be a storage disc, such as a hard disk drive. However, the invention is not so limited. The systems and methods described herein may also be applied to compact discs, floppy disks, tape drive systems, and other similar kinds of data storage devices.


In any case, it can generally be understood that the logical block name space can be subdivided into different sections where each of these sections has different kinds of performance characteristics. The performance characteristics of interest might include the rate in which data can be accessed from that portion of the disc, the reliability of that portion of the disc, and how long it can take to access that portion of the memory device (which can be particularly relevant to digital linear tape storage).


In one practice, the systems and methods described herein include a performance measurement system that is capable of scanning an individual device, such as the depicted storage device 10, and measuring certain characteristics to determine how the device should be subdivided into different performance classes. In one embodiment, the performance measurement system makes experimental read and write operations across portions of the LBN and uses the collected experimental data to determine the performance characteristics of different logical block areas within the LBN space. The measuring process may then determine from these measured characteristics whether or not a collection of logical blocks can be aggregated into a section, such as one of the depicted sections 12, 14, or 16 wherein the aggregate logical blocks have common performance characteristics of interest and thereby provide a subspace within the LBN space that may be associated with a particular level of performance. Obviously, those of skill in the art can select the granularity over which the LBN space is subdivided into different sections and the granularity selected will depend largely on the application at hand and the number of storage classes desired or needed for the application of interest.


In other practices and embodiments, partitioning into performance regions will be determined and done at system design time, and fixed for a particular system design. That design process may involve experiment and measurement, but the product that results from the design process typically will not itself incorporate a performance measurement component. The embodiment and practice selected can turn on the application being addressed and the specifics of the product design. Doing measurement at “run time” may be useful if there will be widely varying configurations of the product, not all of which have been seen or analyzed in advance during the product design phase. Conversely, if product configurations are rigorously controlled and analyzed prior to approval, then the performance analysis may be done at that time and no measurement component is needed in the operational or delivered product.


Turning now to FIG. 2, one RAID system 20 is depicted wherein the RAID system 20 provides differentiated storage pools sharing the same storage devices depicted as devices 25A through 25D. Specifically, FIG. 2 depicts a RAID storage system 20 that includes a pool A 22 and a pool B 24. As shown in this embodiment, pool A 22 is set up as a RAID 10 storage pool and pool B 24 is set up as a RAID 50 storage pool. As known to those of skill in the art, RAID is a term commonly employed for the acronym redundant ray of inexpensive (or independent) disks. A RAID array is a collection of drives which collectively act as a single storage system, which can tolerate a failure of a drive without losing data and which can operate independently of each other. Although there are different types of RAID systems, in general, the UC Berkeley term “RAID” defined six RAID levels. Each level is a different way to spread data across the multiple drives in the system, such as the multiple systems 25A through 25D. This technique provides a compromise between cost and speed. The different levels have different meanings and each level is typically optimized for a different use or application. For example, RAID level 0 is not redundant and splits data across the different drives 25A through 25D resulting in higher data through-put. Since no redundant information is stored, the performance is very good, but the failure of any disk in the array results in data loss. This level is commonly referred to as striping. RAID level 1 is commonly referred to as mirroring with two hard drives. This level provides redundancy by duplicating all the data from one drive onto another drive. The performance of level 1 array is slightly better than a single drive, but if either drive fails, no data is lost. RAID level 10, such as the depicted pool A 22 employs, is a dual level array that employs multiple RAID 1 (mirrored) sets into a single array. Data is striped across all mirrored sets. RAID 1 utilizes several drives, provides better performance, and each drive is typically duplicated or mirrored. RAID 50, as employed by pool B 24, is a dual level array that employs multiple RAID 5 levels into a single array. This will be understood that the RAID system 20 depicted in FIG. 2 employs the four storage devices 25A through 25D to support two different classes of storage, each of which has different performance characteristics and different performance levels.


In one example system according to the invention, such as that depicted in FIG. 3, differentiated pools are created by combining the RAID levels depicted in FIG. 2 with the device performance variations depicted in FIG. 1. Specifically, FIG. 3 depicts a system 30 that comprises four storage devices 35A through 35D. Other storage devices may be added as needed. Similar to FIG. 1, the LBN space of the devices 35A through 35D is depicted abstractly as extending over the surface of the device and, as shown in FIG. 1, the devices 35A through 35D are subdivided into three subspaces, each LBN subspace being associated with a different performance level. Simultaneously, the system depicted in FIG. 3 associates different RAID levels with different performance subspaces. For example, the performance A LBN subspace is used to support a RAID 10 storage level, the performance B LBN subspace is employed to support a RAID 5 service level, and the performance C LBN subspace is employed to provide a RAID 50 performance level.


In the process of setting up the system 30, the storage system administrator can determine which of the performance LBN subspaces should be used to support a particular one of the RAID levels. Any suitable technique may be employed for choosing among the different subspaces. For example, if Region A is the region of the drives that has particularly good random access I/O performance, it will often be appropriate to allocate it to a RAID-10 set since RAID-10 is also characterized by good random access performance, especially random write performance; the characteristics of the two layers thus reinforce each other resulting in a “Pool A” that has excellent random write performance.



FIG. 4, depicts one example of an extent based or page based allocation operation of a storage service across the differentiated storage pools of the type depicted in FIG. 3. As depicted, the system 30 provides a Pool A that can store data that is frequently accessed. To this end, the Pool A subspace of the LBN is employed to support a RAID Level 10, which provides a certain amount of fault tolerance and provides respectable data throughput. By virtualizing the storage service across different classes of storage, while utilizing the same set of devices, as illustrated here, both optimal or substantially optimal performance and least physical device consumption may be obtained simultaneously. Multiple instances of the block storage service (“logical volumes”) may be created, all sharing the same set of underlying differentiated storage pools. Each is allocated portions of the pools according to its performance requirements.



FIG. 5 depicts an optional embodiment wherein the system employs an adaptive storage block data distribution process for distributing blocks of data across the different partitions of the data volume. One such suitable adaptive storage system is described in US Patent Application entitled ADAPTIVE STORAGE BLOCK DATA DISTRIBUTION U.S. Ser. No. 10/347,898 and naming G. Paul Koning et al. as inventor, the contents of which are herein incorporated by reference.


As described therein, each server 22 A, 22 B in the system 50 includes a routing table 52, a data mover process 54 and a request monitor process 58. The request monitor process 58 is capable of monitoring requests made to one of the servers 22 A or 22 B from the one or more clients 12 that are accessing the system 50.


The request may be associated with data blocks stored on a partition or somewhere on the volume of the storage device 30. The request monitor 58 can monitor the different requests that clients 12 make to the associated server or servers 22. Additionally, the request monitor 58 may communicate with other request monitor processes 58 running on the different equivalent servers 22 on the system 50. In this way, the request monitor 58 can generate a global view of the requests being forwarded by clients 12 to the partitioned block data storage system 50. By sharing this information, each server 22 may, through its associated request monitor process 58, develop a global understanding of the requests being serviced by the block data storage system 50.


As further shown in FIG. 5, the data mover process 54 employs the information stored within the routing table 52 for the purpose of determining whether a more efficient or reliable arrangement of data blocks may be achieved. To this end, the data mover process 54 comprises a computer program that applies an algorithm to the data collected by the request monitors 58. The data mover process 54 applies an algorithm that reviews the current distribution of data blocks and considers the current client 12 demand for these data blocks when determining whether a more efficient allocation of data blocks is available. In the embodiment depicted, this algorithm can determine whether a particular data block is more suited for a storage class that is different from the class presently used. Thus the adaptive process can move data blocks between and among the different storage classes available on the system, such as on the system 30. In this way, the system 50 can achieve the optimal or substantially optimal storage performance for the data. The storage performance achieved may be realized with systems that employ a single server, wherein the data mover process moves the data between different regions in the system. Thus, the systems and methods described herein may be employed with Storage Area Networks, including Storage Area Networks described in the above-referenced US Patent Application entitled Adaptive Storage Block Data Distribution.


Although FIGS. 1-5 depict exemplary systems as assemblies of functional block elements, it will be apparent to one of ordinary skill in the art that the systems and methods of the invention may be realized as computer programs or portions of computer programs that are capable of running on the servers to thereby configure the servers as systems according to the invention. Moreover, as discussed above, in certain embodiments, the systems of the invention may be realized as software components operating on a conventional data processing system such as a Unix workstation. In such embodiments, the system can be implemented as a C language computer program, or a computer program written in any high level language including C++, Fortran, Java or Basic. General techniques for such high level programming are known, and set forth in, for example, Stephen G. Kochan, Programming in C, Hayden Publishing (1983).


While the invention has been disclosed in connection with the preferred embodiments shown and described in detail, various modifications and improvements thereon will become readily apparent to those skilled in the art. For example, although the systems and methods of the invention may be employed with RAID differentiated storage, other differentiated storage systems and methods may also be used without departing from the scope hereof. Accordingly, the spirit and scope of the present invention is to be limited only by the following claims.

Claims
  • 1. An apparatus comprising: an interface to a storage device having a plurality of storage locations accessible using logical block names;a storage controller configured to determine a level of performance for the plurality of storage locations and to partition the plurality of storage locations into a plurality of regions as determined by their different levels of performance;the storage controller determining the different levels of performance by scanning the storage locations, and specifically by performing experimental read and write operations to determine the different levels of performance from experimental data collected in the experimental read and write operations;the storage controller further mapping the partitioned regions of the storage locations and aggregate the logical block names of the storage locations in the partitioned regions having an identical level of performance to a selected logical block name space; anda RAID controller, for assigning a first RAID level configuration to a first set of aggregated logical block names of the storage device, and assigning a second RAID level configuration to a second set of aggregated logical block names of the storage device, the first and second RAID level configurations being different from each other.
  • 2. The apparatus of claim 1 additionally comprising: a network interface to two or more clients;the apparatus thereby providing differentiated classes of storage having two or more differentiated RAID level configurations of the storage device to the two or more clients.
  • 3. The apparatus of claim 1 wherein the storage controller is further configured to employ the storage system to provide a file system service to multiple clients.
  • 4. The apparatus of claim 1 wherein the storage controller is further configured to provide a storage volume service to multiple clients.
  • 5. The apparatus of claim 4 wherein multiple storage volumes at a selected level of performance are provided by the storage device.
  • 6. The apparatus of claim 1 wherein the measured experimental data comprises data access time, a reliability of a storage location, or a combination thereof.
  • 7. The apparatus of claim 1 wherein the storage device is a single physical disk.
  • 8. The apparatus of claim 1 wherein the different levels of performance are determined apparatus is in manufacturing test.
  • 9. The apparatus of claim 1 wherein the different levels of performance are determined during field operation of the storage device.
  • 10. A method for providing differentiated classes of storage performance from a storage device, comprising: accessing the storage device using logical block names for organizing a plurality of the storage locations;determining a level of performance of the plurality of storage locations by partitioning the plurality of storage locations into a plurality of regions according to different levels of performance, wherein the different levels of performance are further determined by performing experimental read and write operations on the storage locations and determining the level of performance as a result of the experimental data collected from the experimental read and write operations;aggregating the logical block names of the storage locations in the partitioned regions having a given level of performance to a selected logical block name space;assigning a first RAID level configuration to a first set of such aggregated logical block names; having a first level of performance;assigning a second RAID level configuration to a second set of such aggregated logical block names having a second level of performance different from the first level of performance; andthe first RAID level configuration and second RAID level configuration being also different from one another; thereby providing differentiated classes of RAID level storage to one or more clients.
  • 11. The method of claim 10 further comprising: mapping partitioned regions of the storage locations to provide multiple storage volumes at a selected level of performance.
  • 12. The method of claim 10 wherein a measured a level of performance comprises a data access time, a reliability of a storage location, or a combination thereof.
  • 13. The method of claim 10 wherein the experimental read and write operations are performed when a storage controller that implements the method is designed.
  • 14. The method of claim 10 wherein the experimental read and write operations are performed during operation of a storage controller that implements the method.
  • 15. A network storage controller for providing differentiated classes of storage to multiple client data processors, comprising: a network interface to provide access by two or more client data processors;an interface to a storage device having a plurality of storage locations accessible using logical block names;the network storage controller further configured to: determine different levels of performance for the plurality of storage locations and to partition the plurality of storage locations into a plurality of regions as determined by the different levels of performance;further determine the different levels of performance by performing experimental read and write operations on the plurality of storage locations, and measuring data collected in response to the experimental read and write operations;aggregating the logical block names of the storage locations in the partitioned regions having an identical level of performance to a selected logical block name space; anda RAID controller, for assigning a first RAID level configuration to a first set of aggregated logical block names of the storage device, and assigning a second RAID level configuration to a second set of aggregated logical block names of the storage device, the first and second RAID level configurations being different from each other.
  • 16. The network storage controller of claim 15 wherein the data measured in response to the experimental read and write operations comprises data access time, reliability of a storage location, or a combination thereof.
  • 17. The network storage controller of claim 15 wherein the different levels of performance are determined during field operation of the network storage controller.
RELATED APPLICATIONS

This application is a continuation of U.S. application Ser. No. 10/761,884, filed Jan. 20, 2004, now U.S. Pat. No. 7,937,551 which claims priority to earlier filed U.S. Provisional Application No. 60/441,810, naming Eric R. Schott as an inventor, and having a filing date of Jan. 21, 2003. The entire teachings of the above application(s) are incorporated herein by reference.

US Referenced Citations (74)
Number Name Date Kind
5392244 Jacobson et al. Feb 1995 A
5774660 Brendel et al. Jun 1998 A
5774668 Choquier et al. Jun 1998 A
5951694 Choquier et al. Sep 1999 A
5978844 Tsuchiya et al. Nov 1999 A
6070191 Narcndran et al. May 2000 A
6108727 Boals et al. Aug 2000 A
6122681 Aditya et al. Sep 2000 A
6128279 O'Neil et al. Oct 2000 A
6141688 Bi et al. Oct 2000 A
6144848 Walsh et al. Nov 2000 A
6148414 Brown et al. Nov 2000 A
6189079 Micka et al. Feb 2001 B1
6195682 Ho et al. Feb 2001 B1
6199112 Wilson Mar 2001 B1
6212565 Gupta Apr 2001 B1
6212606 Dimitroff Apr 2001 B1
6226684 Sung et al. May 2001 B1
6275898 DeKoning Aug 2001 B1
6292181 Banerjee et al. Sep 2001 B1
6341311 Smith et al. Jan 2002 B1
6360262 Guenthner et al. Mar 2002 B1
6421723 Tawil Jul 2002 B1
6434683 West et al. Aug 2002 B1
6460083 Niwa et al. Oct 2002 B1
6473791 Al-Ghosein et al. Oct 2002 B1
6498791 Pickett et al. Dec 2002 B2
6598134 Ofek et al. Jul 2003 B2
6687731 Kavak Feb 2004 B1
6718436 Kim et al. Apr 2004 B2
6725253 Okano et al. Apr 2004 B1
6732171 Hayden May 2004 B2
6742059 Todd et al. May 2004 B1
6766348 Combs et al. Jul 2004 B1
6813635 Jorgenson Nov 2004 B1
6839802 Dimitri et al. Jan 2005 B2
6850982 Siegel Feb 2005 B1
6859834 Arora et al. Feb 2005 B1
6886035 Wolff Apr 2005 B2
6910150 Mashayekhi et al. Jun 2005 B2
6944777 Belani et al. Sep 2005 B1
6950848 Yousefi'zadeh Sep 2005 B1
6957433 Umberger et al. Oct 2005 B2
6985956 Luke et al. Jan 2006 B2
7043564 Cook et al. May 2006 B1
7076655 Griffin et al. Jul 2006 B2
7085829 Wu et al. Aug 2006 B2
7089293 Grosner et al. Aug 2006 B2
7127577 Koning et al. Oct 2006 B2
7937551 Schott May 2011 B2
20010039581 Deng et al. Nov 2001 A1
20020008693 Banerjee et al. Jan 2002 A1
20020009079 Jungck et al. Jan 2002 A1
20020035667 Bruning, III et al. Mar 2002 A1
20020059451 Haviv et al. May 2002 A1
20020065799 West et al. May 2002 A1
20020138551 Erickson Sep 2002 A1
20020194324 Guha Dec 2002 A1
20030005119 Mercier et al. Jan 2003 A1
20030074596 Mashayekhi et al. Apr 2003 A1
20030117954 De Neve et al. Jun 2003 A1
20030225884 Hayden Dec 2003 A1
20040030755 Koning et al. Feb 2004 A1
20040049564 Ng et al. Mar 2004 A1
20040080558 Blumenau et al. Apr 2004 A1
20040083345 Kim et al. Apr 2004 A1
20040103104 Hara et al. May 2004 A1
20040128442 Hinshaw et al. Jul 2004 A1
20040143637 Koning et al. Jul 2004 A1
20040143648 Koning et al. Jul 2004 A1
20040210724 Koning et al. Oct 2004 A1
20050010618 Hayden Jan 2005 A1
20050144199 Hayden Jun 2005 A2
20070106857 Koning et al. May 2007 A1
Foreign Referenced Citations (8)
Number Date Country
0 938 046 Aug 1999 EP
WO 9724668 Jul 1997 WO
WO 9953415 Oct 1999 WO
WO 0106367 Jan 2001 WO
WO 0138983 May 2001 WO
WO 0237943 May 2002 WO
WO 0244885 Jun 2002 WO
WO 02056182 Jul 2002 WO
Related Publications (1)
Number Date Country
20110208943 A1 Aug 2011 US
Provisional Applications (1)
Number Date Country
60441810 Jan 2003 US
Continuations (1)
Number Date Country
Parent 10761884 Jan 2004 US
Child 13099057 US