1. Field of Invention
The present invention relates generally to the art of cache allocation in a RAID controller.
2. Description of Related Art
RAID (Redundant Array of Independent Disks) is a storage system used to increase performance and provide fault tolerance. RAID is a set of two or more hard disks and a specialized disk controller that contains the RAID functionality. RAID improves performance by disk striping, which interleaves bytes or groups of bytes across multiple drives, so more than one disk is reading and writing simultaneously (e.g., RAID 0). Fault tolerance is achieved by mirroring or parity. Mirroring is 100% duplication of the data on two drives (e.g., RAID 1).
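The striping and mirroring described above can be illustrated with a minimal sketch. It assumes a stripe unit of one block and models each disk as a dictionary; the names `stripe_location` and `mirror_write` are hypothetical and are not part of any RAID controller interface.

```python
def stripe_location(logical_block: int, num_disks: int) -> tuple:
    """RAID 0 sketch: map a logical block to (disk index, block offset on that disk).

    Assumes a stripe unit of exactly one block (an illustrative simplification).
    """
    if num_disks < 2:
        raise ValueError("striping needs at least two disks")
    return logical_block % num_disks, logical_block // num_disks


def mirror_write(disks: list, block: int, data: bytes) -> None:
    """RAID 1 sketch: duplicate the same block on every disk of the mirror set."""
    for disk in disks:
        disk[block] = data


# Consecutive logical blocks land on different disks, so they can be
# read or written simultaneously.
locations = [stripe_location(b, 3) for b in range(6)]

# A mirrored write leaves an identical copy on both drives.
mirror = [{}, {}]
mirror_write(mirror, 7, b"payload")
```

With three disks, logical blocks 0, 1 and 2 map to disks 0, 1 and 2 at offset 0, and blocks 3, 4 and 5 map to the same disks at offset 1.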
A volume in storage is a logical storage unit, which is a part of one physical hard drive or one that spans several physical hard drives.
A cache is a form of memory staging area that is used to speed up data transfer between two subsystems in a computer. When the cache client (e.g., a CPU, a RAID controller, an operating system, or the like that accesses the cache) wants to access a datum in a slower memory, it first checks the faster cache. If a datum entry in the cache can be found with a tag matching that of the desired datum, the datum in the entry is used instead of accessing the slower memory, a situation known as a cache hit. The alternative, in which the cache is consulted and found not to contain a datum with the desired tag, is known as a cache miss; that is, a failure to find the required instruction or data item in the cache. On a cache miss, the item is read from the slower memory behind the cache (e.g., main memory, or secondary storage such as a hard drive), which increases the data latency. A prefetch brings data or instructions into a higher-speed storage or memory before they are actually processed.
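The hit, miss, and prefetch behavior described above may be sketched as follows. This is an illustration only: the class `TaggedCache`, its counters, and the use of a dictionary as the slower backing store are assumptions made for the example, and no eviction policy is modeled.

```python
class TaggedCache:
    """Minimal tag-matching cache in front of a slower backing store (hypothetical)."""

    def __init__(self, backing_store: dict):
        self.backing_store = backing_store  # stands in for the slower memory
        self.entries = {}                   # tag -> datum held in the fast cache
        self.hits = 0
        self.misses = 0

    def read(self, tag):
        if tag in self.entries:
            # Cache hit: the entry is used and the slower memory is not accessed.
            self.hits += 1
            return self.entries[tag]
        # Cache miss: fetch from the slower memory and keep a copy in the cache.
        self.misses += 1
        datum = self.backing_store[tag]
        self.entries[tag] = datum
        return datum

    def prefetch(self, tags):
        # Bring data into the cache before it is actually requested.
        for tag in tags:
            if tag in self.backing_store and tag not in self.entries:
                self.entries[tag] = self.backing_store[tag]


cache = TaggedCache({1: "block-1", 2: "block-2"})
first = cache.read(1)    # miss, served from the backing store
second = cache.read(1)   # hit, served from the cache
cache.prefetch([2])
third = cache.read(2)    # hit, because the block was prefetched
```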
A Storage Area Network (SAN) often connects multiple servers to a centralized pool of disk storage. A SAN can treat all the storage as a single resource, improving disk maintenance and backups. In some SANs, the disks themselves can copy data to other disks for backup without any computer processing overhead. A SAN allows data transfers between computers and disks at high peripheral channel speeds, with Fibre Channel as a typical high-speed transfer technology, as well as transfer by SSA (Serial Storage Architecture) and ESCON channels. SANs can be centralized or distributed; a centralized SAN connects multiple servers to a collection of disks, while a distributed SAN typically uses one or more Fibre Channel or SCSI switches to connect nodes. Over long distances, SAN traffic can be transferred over ATM, SONET or dark fiber. Another SAN option is IP storage, which enables data transfer via IP over fast Gigabit Ethernet locally or via the Internet.
A solid state disk or device (SSD) is a disk drive that uses memory chips instead of traditional rotating platters for data storage. SSDs are faster than regular disks because there is no seek latency, as there is no read/write head to move as in a traditional drive. SSDs are also more rugged than hard disks. SSDs may use non-volatile flash memory, or they may use volatile DRAM or SRAM memory backed up by a disk drive or UPS system in case of power failure, all of which are part of the SSD system. At present, a DRAM-based SSD has the highest performance, followed by a flash-based SSD and then a traditional rotating platter hard drive.
Turning attention to
There are, however, several disadvantages with the existing system of
What is lacking in the prior art is a method and apparatus for an improved system to allocate cache for a RAID SAN, such as taught in the present invention.
Accordingly, an aspect of the present invention is an improved apparatus and method to cache data in a RAID configuration.
A further aspect of the present invention is an apparatus and method of introducing a scalable cache repository in a RAID SAN.
Another aspect of the present invention is an apparatus and method of employing SSD for a RAID SAN cache.
A further aspect of the present invention is to make the cache in a RAID controller be scalable, depending on demand.
Thus the present invention enables a fast, scalable cache for a RAID controller in a RAID SAN.
The sum total of all of the above advantages, as well as the numerous other advantages disclosed and inherent from the invention described herein, creates an improvement over prior techniques.
The above described and many other features and attendant advantages of the present invention will become apparent from a consideration of the following detailed description when considered in conjunction with the accompanying drawings.
Detailed description of preferred embodiments of the invention will be made with reference to the accompanying drawings. Disclosed herein is a detailed description of the best presently known mode of carrying out the invention. This description is not to be taken in a limiting sense, but is made merely for the purpose of illustrating the general principles of the invention. The section titles and overall organization of the present detailed description are for the purpose of convenience only and are not intended to limit the present invention.
It should be understood that one skilled in the art may, using the teachings of the present invention, vary embodiments shown in the drawings without departing from the spirit of the invention herein. In the figures, elements with like numbered reference numbers in different figures indicate the presence of previously defined identical elements.
Turning attention to
The scalable cache repository 220 is scalable because more SSDs 226, 228 may be added if greater cache memory is desired, and the controller's cache can be increased dynamically as the SAN environment scales up. The SSDs may be hot-pluggable for field upgrade benefits. The capacity and percentage of reservation for file-cache and block-cache may be predefined to some predetermined level in the controller 205 itself, or equivalently may be set by a user through suitable software.
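As an illustration of the capacity and reservation settings just described, the following sketch models a repository whose total capacity grows as SSDs are added and whose file-cache and block-cache shares follow a configured percentage. The class name, field names, and gigabyte units are assumptions chosen for the example and do not describe an actual controller interface.

```python
from dataclasses import dataclass, field


@dataclass
class CacheRepositoryConfig:
    """Hypothetical reservation settings for a scalable cache repository."""

    file_cache_percent: int                               # share reserved for file-cache
    ssd_capacities_gb: list = field(default_factory=list)  # one entry per SSD

    def __post_init__(self):
        if not 0 <= self.file_cache_percent <= 100:
            raise ValueError("file_cache_percent must be between 0 and 100")

    def add_ssd(self, capacity_gb: int) -> None:
        # Models adding (e.g., hot-plugging) another SSD to scale the cache up.
        self.ssd_capacities_gb.append(capacity_gb)

    def total_gb(self) -> int:
        return sum(self.ssd_capacities_gb)

    def reservation_gb(self) -> dict:
        # The remainder after the file-cache share is reserved for block-cache.
        file_gb = self.total_gb() * self.file_cache_percent // 100
        return {"file": file_gb, "block": self.total_gb() - file_gb}


config = CacheRepositoryConfig(file_cache_percent=25, ssd_capacities_gb=[100, 100])
before = config.reservation_gb()
config.add_ssd(200)
after = config.reservation_gb()
```

In this sketch the percentage stays fixed while the absolute reservations grow with each added SSD.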
When a cache-miss is observed in
The controller 205 switches between the fixed cache 210 and the cache repository 220 dynamically, based on the IO demand. Once switching commences, the next prefetch is done to the cache repository 220 directly and not to the fixed local (RAM) cache 210. In the event there are limited or no prefetch actions on the cache repository 220, the controller 205 may switch back to the fixed local cache 210.
Turning attention now to
Thus, in
At decision diamond box 325 labeled “Controller Gets A Cache Hit?”, if the controller gets a cache hit, the system continues back to box 330 labeled “Process New IO Request” and the process continues from there. Otherwise, flow continues to the step box 340 labeled “The Controller Needs To Fetch The Data From The Hard Drive Storage”, and data is fetched from secondary memory comprising the hard drive(s).
From box 330, once the controller 205 uses the cache repository 220 rather than the fixed local cache 210, in response to increased IO demand, flow will continue to the step box 345 labeled “The Controller Now Uses Cache-Repository Directly For Pre-Fetching And Managing Cache-Hits”.
At this point, at box 345, the controller 205 finds the data needed at the cache repository 220 rather than fixed local cache 210, and henceforth uses the cache repository 220 directly for managing cache hits, bypassing the fixed local cache 210 (RAM). This bypassing of the fixed local cache continues until such time that activity on prefetch decreases below some predetermined threshold limit, which can be arbitrarily set. Thus at decision diamond step 350, labeled “Is Pre-Fetching Required After IO Demand Decreases?”, the controller 205 can dynamically switch back to the fixed local cache 210 (RAM) when not much activity is found on prefetch in the cache repository 220 as IO demand decreases below some predetermined but arbitrary level, as indicated by following the “No” branch of decision diamond 350 to the box 310. However, if IO demand increases or stays above the predetermined limit, the flow of the program for the present invention continues along the “Yes” branch of the decision diamond 350, to box 345, and the program continues as before.
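The dynamic switching described above can be sketched as a small state-transition function. This is a hedged illustration rather than the controller's actual logic: the function `next_cache`, the two limit parameters, and the numeric values are assumptions, since the text only requires that the thresholds be predetermined.

```python
FIXED_LOCAL = "fixed local cache 210"
REPOSITORY = "cache repository 220"


def next_cache(current: str, io_demand: int, prefetch_activity: int,
               demand_limit: int, prefetch_limit: int) -> str:
    """Return the cache the controller should use for the next interval.

    demand_limit and prefetch_limit are hypothetical, predetermined thresholds.
    """
    if current == FIXED_LOCAL:
        # Increased IO demand moves prefetching to the cache repository.
        return REPOSITORY if io_demand > demand_limit else FIXED_LOCAL
    # While on the repository, stay there until prefetch activity drops
    # below the threshold, then fall back to the fixed local (RAM) cache.
    return FIXED_LOCAL if prefetch_activity < prefetch_limit else REPOSITORY


# (io_demand, prefetch_activity) samples for four successive intervals.
samples = [(900, 0), (900, 50), (100, 2), (100, 2)]
state = FIXED_LOCAL
history = []
for io_demand, prefetch_activity in samples:
    state = next_cache(state, io_demand, prefetch_activity,
                       demand_limit=500, prefetch_limit=10)
    history.append(state)
```

Using separate thresholds for switching over and switching back is one possible design; a single IO demand limit, as the flow description suggests, would work equally well in this sketch.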
The RAID controller cache of the present invention is scalable as demand increases; the SSD used can be a RAID 1 volume created on the storage system, such as a SAN, using SSD drives. The SSD drives themselves may be hot-pluggable, allowing advantageous field upgrades. The SSDs themselves, depending on the model, may be as fast as DIMM memory modules. Further, any SSD failures can be recovered by GHS (Global Hot Spare) via a RAID 1 mechanism. A Global Hot Spare protects against drive failure: when a drive fails, the array controller reconstructs the data of the failed drive, from any RAID volume, volume group, or logical array managed by the array controller, on the Global Hot Spare. If the failed drive is replaced by a good drive, the array controller then copies the data of the Global Hot Spare to the good drive.
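The Global Hot Spare recovery for a RAID 1 volume may be sketched as below. Drives are modeled as dictionaries of blocks, and the function names `rebuild_onto_spare` and `copy_back` are hypothetical; a real array controller would perform these steps at the block-device level.

```python
def rebuild_onto_spare(surviving_mirror: dict, hot_spare: dict) -> None:
    """Reconstruct the failed drive's data on the spare from the surviving mirror."""
    hot_spare.clear()
    hot_spare.update(surviving_mirror)


def copy_back(hot_spare: dict, replacement: dict) -> None:
    """After the failed drive is replaced, copy the spare's data to the good drive."""
    replacement.clear()
    replacement.update(hot_spare)


# RAID 1 pair: drive_a and drive_b hold identical data; drive_b then fails.
drive_a = {0: b"alpha", 1: b"beta"}
spare = {}
rebuild_onto_spare(drive_a, spare)   # the spare now stands in for drive_b

new_drive_b = {}
copy_back(spare, new_drive_b)        # the replacement drive rejoins the mirror
```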
The advantages of the present invention include dynamically allocating the size of cache, using scalable and hot-swappable devices such as SSDs. Using SSDs also provides faster IO transactions and smaller latency than using traditional hard drive access. Consequently, a performance boost occurs with reduced latency, as IO requests to traditional hard drives are avoided as much as possible. The disadvantages include using SSD, which increases the cost of manufacturing. However, the cost of SSD drives has dropped over the last two years, and should continue to fall.
The present invention is intended for use in a SAN environment where there are block-caching requirements. The present invention can also fit in the middle of a file-caching SAN, where there are not as many OS/application variants. A file-caching SAN is a SAN where the hosts/initiators issue file system IO to the storage array and the page file/buffer is cached. A block-caching SAN is a SAN where there is a block storage array/controller. Such storage arrays have cache on their array controllers at the block level.
Although the present invention has been described in terms of the preferred embodiments above, numerous modifications and/or additions to the above-described preferred embodiments would be readily apparent to one skilled in the art.
It is intended that the scope of the present invention extends to all such modifications and/or additions and that the scope of the present invention is limited solely by the claims set forth below.