Storage management system and auto-RAID transaction manager for coherent memory map across hot plug interface

Abstract
A Redundant Array of Independent Disks (RAID) data storage system includes an AutoRAID memory transaction manager for a disk array controller that enables a consistent, coherent memory image of the data storage space to all processors across hot-plug interfaces. To external processes seeking to read or write data, the memory image looks the same across the hot-plug interface. The disk array controller has two identical controllers, each with its own non-volatile memory, to maintain redundant images of disk array storage space. A hot-plug interface interconnects the two controllers. Each controller has an AutoRAID memory transaction manager that enables sharing of cyclic redundancy check (CRC)-protected memory transactions over the hot-plug interface between the two controllers. The AutoRAID memory transaction managers also facilitate ordered execution of the memory transactions regardless of which controller originated the transactions. Mirrored read and write transactions are handled atomically across the hot-plug interface.
Description




FIELD OF THE INVENTION




This invention relates to data storage systems, such as hierarchical RAID (Redundant Array of Independent Disks) data storage systems. More particularly, this invention relates to disk array controllers for data storage systems that provide a coherent, consistent memory image to multiple processors across a hot-plug interface.




BACKGROUND OF THE INVENTION




Conventional disk array data storage systems have multiple storage disk drive devices that are arranged and coordinated to form a single mass storage system. The common design goals for mass storage systems include low cost per megabyte, high input/output performance, and high data availability. Data availability involves the ability to access data stored in the storage system while ensuring continued operation in the event of a disk or component failure. Data availability is often provided through the use of redundancy where data, or relationships among data, are stored in multiple locations on the storage system. In the event of failure, redundant data is retrieved from the operable portion of the system and used to regenerate the original data that is lost due to the component failure.




There are two common methods for storing redundant data on disk drives: mirror and parity. In mirror redundancy, data is duplicated and stored in two separate areas of the storage system. In parity redundancy, redundant data is stored in one area of the storage system, but the size of the redundant storage area is less than the remaining storage area used to store the original data.




RAID (Redundant Array of Independent Disks) storage systems are disk array systems in which part of the physical storage capacity is used to store redundant data. RAID systems are typically characterized as one of six architectures, enumerated under the acronym RAID. A RAID


0


architecture is a disk array system that is configured without any redundancy. Since this architecture is really not a redundant architecture, RAID


0


is often omitted from a discussion of RAID systems.




A RAID


1


architecture involves storage disks configured according to mirror redundancy. Original data is stored on one set of disks and a duplicate copy of the data is kept on separate disks. The RAID


2


through RAID


5


architectures all involve parity-type redundant storage. Of particular interest, a RAID


5


system distributes data and parity information across all of the disks. Typically, the disks are divided into equally sized address areas referred to as “blocks”. A set of blocks from each disk that have the same unit address ranges are referred to as “stripes”. In RAID


5


, each stripe has N blocks of data and one parity block which contains redundant information for the data in the N blocks.




In RAID


5


, the parity block is cycled across different disks from stripe-to-stripe. For example, in a RAID


5


system having five disks, the parity block for the first stripe might be on the fifth disk; the parity block for the second stripe might be on the fourth disk; the parity block for the third stripe might be on the third disk; and so on. The parity block for succeeding stripes typically “processes” around the disk drives in a helical pattern (although other patterns are possible). RAID


2


through RAID


4


architectures differ from RAID


5


in how they compute and place the parity block on the disks.




A hierarchical data storage system permits data to be stored according to different techniques. In a hierarchical RAID system, data can be stored according to multiple RAID architectures, such as RAID


1


and RAID


5


, to afford tradeoffs between the advantages and disadvantages of the redundancy techniques.




U.S. Pat. No. 5,392,244 to Jacobson et al., entitled “Memory Systems with Data Storage Redundancy Management”, describes a hierarchical RAID system that enables data to be migrated from one RAID type to another RAID type as data storage conditions and space demands change. This patent, which is assigned to Hewlett-Packard Company, describes a multi-level RAID architecture in which physical storage space is mapped into a RAID-level virtual storage space having mirror and parity RAID areas (e.g., RAID


1


and RAID


5


). The RAID-level virtual storage space is then mapped into an application-level virtual storage space, which presents the storage space to the user as one large contiguously addressable space. During operation, as user storage demands change at the application-level virtual space, data can be migrated between the mirror and parity RAID areas at the RAID-level virtual space to accommodate the changes. For instance, data once stored according to mirror redundancy may be shifted and stored using parity redundancy, or vice versa. The '244 patent is hereby incorporated by reference to provide additional background information.




With data migration, the administrator is afforded tremendous flexibility in defining operating conditions and establishing logical storage units (or LUNs). As one example, the RAID system can initially store user data according to the optimum performing RAID


1


configuration. As the user data approaches and exceeds 50% of array capacity, the disk array system can then begin storing data according to both RAID


1


and RAID


5


, and dynamically migrating data between RAID


1


and RAID


5


in a continuous manner as storage demands change. At any one time during operation, the data might be stored as RAID


1


or RAID


5


on all of the disks. The mix of RAID


1


and RAID


5


storage changes dynamically with the data I/O (input/output). This allows the system to optimize performance versus an increasing amount of user data.




SUMMARY OF THE INVENTION




This invention provides a mirrored memory disk array controller that enables a consistent, coherent memory image of the data storage space to all processors across hot-plug interfaces. To host processes seeking to read or write data, the memory image looks the same across the hot-plug interfaces.




In the described implementation, the disk array controller has two identical controllers, each with its own non-volatile memory, to maintain redundant images. A hot-plug interface interconnects the two controllers. Each controller has an AutoRAID memory transaction manager that enables sharing of cyclic redundancy check (CRC)-protected memory transactions over the hot-plug interface between the two controllers. The AutoRAID memory transaction managers also facilitate ordered execution of the memory transactions regardless of which controller originated the transactions. Mirrored read and write transactions are handled atomically across the hot-plug interface.











BRIEF DESCRIPTION OF THE DRAWINGS





FIG. 1

is a diagrammatic illustration of a host computer connected to a data storage system.





FIG. 2

is a block diagram of the host computer and data storage system. The data storage system is illustrated as a hierarchical RAID system.





FIG. 3

is a block diagram of the mirrored memory disk array controller in the data storage system, embodied with two controllers.





FIG. 4

is a block diagram of an AutoRAID memory transaction manager implemented in each controller of the disk array controller.











The same reference numbers are used throughout the figures to reference like components and features.




DETAILED DESCRIPTION OF THE INVENTION





FIG. 1

shows a computer system


20


having a host computer


22


connected to a data storage system


24


via an input/output (I/O) interface bus


26


. Host computer


22


is a general purpose computer that can be configured, for example, as a server or workstation. Computer


22


has a visual display monitor


28


, a central processing unit (CPU)


30


, a keyboard


32


, and a mouse


34


. Other data entry and output peripherals may also be included, such as a printer, tape, compact disc read only memory (CD-ROM), network interfaces, and so forth. In

FIG. 1

, the host computer


22


is coupled to a network


36


to serve data from the data storage system


24


to one or more clients (not shown).




The data storage system


24


holds user data and other information. In the preferred implementation, the data storage system


24


is a hierarchical Redundant Array of Independent Disks (RAID) system that is capable of storing data according to different redundancy schemes. The host computer


22


provides an interface for an administrator to configure the memory space in the RAID system


24


, run diagnostics, evaluate performance, and otherwise manage the RAID storage system.





FIG. 2

shows the host computer


22


and data storage system


24


in more detail. The computer


22


has a processor


40


, a volatile memory


42


(i.e., random access memory (RAM)), a keyboard


32


, a mouse


34


, a non-volatile memory


44


(e.g., ROM, hard disk, floppy disk, CD-ROM, etc.), and a display


28


. An administrator module


46


is stored in memory


44


and executes on processor


40


. The administrator module


46


provides management functions such as diagnostics, performance review, logical unit number (LUN) arrangement analysis, and capacity analysis. The administrator module


48


supports a storage manager graphical user interface (UI)


48


that presents a visual interface on the display


28


.




The data storage system


24


has a disk array


50


with multiple storage disks


52


, a disk array controller


54


, and a RAID management system


56


. The disk array controller


54


is coupled to the disk array


50


via one or more interface buses


58


, such as a small computer system interface (SCSI) bus. The RAID management system


56


is coupled to the disk array controller


54


via an interface protocol


60


. It is noted that the RAID management system


56


can be embodied as a separate component (as shown), or within the disk array controller


54


, or within the host computer


22


. The RAID management system


56


is preferably a software module that runs on a processing unit of the data storage system


24


, or on the processor


40


of the computer


22


.




The disk array controller


54


coordinates data transfer to and from the disk array


50


. The disk array controller


54


has two identical controllers or controller boards: a first disk array controller


54




a


and a second disk array controller


54




b


. The parallel controllers enhance reliability by providing continuous backup and redundancy in the event that one controller becomes inoperable. The parallel controllers


54




a


and


54




b


have respective mirrored memories


62




a


and


62




b


. The mirrored memories


62




a


and


62




b


are preferably implemented as battery-backed, non-volatile RAMs (NVRAMs). Although only dual controllers


54




a


and


54




b


are shown and discussed generally herein, aspects of this invention can be extended to other multi-controller configurations where more than two controllers are employed.




The mirrored memories


62




a


and


62




b


store several types of information. The mirrored memories


62




a


and


62




b


maintain duplicate copies of a cohesive memory map of the storage space in disk array


50


. This memory map tracks where data and redundancy information are stored on the disks


52


, and where available free space is located. The view of the mirrored memories is consistent across the hot-plug interface, appearing the same to external processes seeking to read or write data.




The mirrored memories


62




a


and


62




b


also maintain a read cache that holds data being read from the disk array


50


. Every read request is shared between the controllers. The mirrored memories


62




a


and


62




b


further maintain two duplicate copies of a write cache. Each write cache temporarily stores data before it is written out to the disk array


50


.




One particular implementation of a mirrored memory dual controller for a disk storage system is described in U.S. Pat. No. 5,699,510, entitled “Failure Detection System for a Mirrored Memory Dual Controller Disk Storage System,” which issued Dec. 16, 1997 in the names of Petersen et al. and is assigned to Hewlett-Packard Company. This patent is incorporated by reference for background information pertaining generally to dual controller disk storage systems.




The controller's mirrored memories


62




a


and


62




b


are physically coupled via a hot-plug interface


64


. In the absence of this invention, the hot-plug interface


64


introduces a possible problem in that single points of failure could corrupt both memories


62




a


and


62




b


. An aspect of this invention is to prevent such corruption, and to ensure that the two memories maintain correctly replicated images of the storage space. Generally, the controllers


62




a


and


62




b


monitor data transfers between them to ensure that data is accurately transferred and that transaction ordering is preserved (e.g., read/write ordering).





FIG. 3

shows the dual disk array controller in more detail. In addition to the controller boards


54




a


and


54




b


, the disk array controller also has two I/O modules


70




a


and


70




b


, a display


72


, and two power supplies


74




a


and


74




b


. The I/O modules


70




a


and


70




b


facilitate data transfer between respective controllers


54




a


and


54




b


and the host computer


22


. In one implementation, the I/O modules


70




a


and


70




b


employ fiber channel technology, although other bus technologies may be used. The power supplies


74




a


and


74




b


provide power to the other components in the disk array controller


54


(FIG.


2


), including the controllers


54




a


,


54




b


, the display


72


, and the I/O modules


70




a


,


70




b.






Each controller


54




a


,


54




b


has a converter


78




a


,


78




b


connected to receive signals from the host via respective I/O modules


70




a


,


70




b


. Each converter


78




a


and


78




b


converts the signals from one bus format (e.g., fiber channel) to another bus format (e.g., peripheral component interconnect (PCI)). A first PCI bus


80




a


,


80




b


carries the signals to an AutoRAlD memory transaction manager


82




a


,


82




b


, which handles all mirrored memory transaction traffic to and from the NVRAM


62




a


,


62




b


in the mirrored controller. The AutoRAID memory transaction manager maintains the memory map, computes parity, and facilitates cross-communication with the other controller. The AutoRAID memory transaction manager


82




a


,


82




b


is preferably implemented as an integrated circuit (IC), such as an application-specific integrated circuit (ASIC)).




The AutoRAID memory transaction manager


82




a


,


82




b


is coupled to the NVRAM


62




a


,


62




b


via a high-speed bus


84




a


,


84




b


and to other processing and memory components via a second PCI bus


86




a


,


86




b


. Each controller


54




a


,


54




b


has at least one CPU


88




a


,


88




b


and several types of memory connected to the PCI bus


86




a


and


86




b


. The memory includes dynamic RAM (DRAM)


90




a


,


90




b


, Flash memory


92




a


,


92




b


, and cache


94




a


,


94




b.






The AutoRAID memory transaction managers


82




a


and


82




b


are coupled to one another via a hot-plug interface


64


. The hot-plug interface


64


supports bi-directional parallel communication between the two AutoRAID memory transaction managers


82




a


and


82




b


at a data transfer rate commensurate with the NVRAM buses


84




a


and


84




b.






The hot-plug interface


64


is implemented using a point-to-point bus


96


and inter-integrated circuit (I


2


C) buses


98


. The point-to-point bus


96


is a multi-bit bus that transports memory transaction data between the mirrored memory controllers


54




a


and


54




b


. The I


2


C buses


98


carry microprocessor communications, peripheral communications, and so forth.




The AutoRAID memory transaction managers


82




a


and


82




b


employ a high-level packet protocol to exchange transactions in packets over the bus


96


of the hot-plug interface


64


. The AutoRAID memory transaction managers


82




a


and


82




b


perform error correction on the packets to ensure that the data is correctly transferred between the controllers.




The AutoRAID memory transaction managers


82




a


and


82




b


provide a memory image that is coherent across the hot plug interface


64


. The managers


82




a


and


82




b


also provide an ordering mechanism to support an ordered interface that ensures proper sequencing of memory transactions.





FIG. 4

shows the AutoRAID memory transaction manager in more detail. For discussion purposes, the AutoRAID memory transaction manager is generally referenced as number


82


, without the “a” and “b” designations. The AutoRAID memory transaction manager in

FIG. 4

is representative of either component


82




a


or


82




b


, as both are identically constructed.




The AutoRAID memory transaction manager


82


has a PCI interface


100


, an NVRAM interface


102


, an NWay interface or entity


104


, a mirror entity


106


, and a transbus entity


108


. The PCI interface


100


facilitates data transfer over the PCI bus


80


to and from the I/O modules


70




a


,


70




b


via the converters


78




a


,


78




b


and over PCI bus


86


to and from the CPUs


88




a


,


88




b


. The PCI interface


100


has a buffer


110


to temporarily hold transaction data being transferred to and from other entities in the memory transaction manager


82


.




The NVRAM interface


102


handles read and write transaction requests and facilitates data transfer to and from the local NVRAM


62


.




The NWay interface or entity


104


facilitates transfer of remote memory transactions over the hot-plug interface between the controllers. As noted above, the transactions are transferred in packets, which are error corrected to ensure that the transfers are valid. The NWay interface


104


has one or more buffers


112


to hold transaction data, a data memory


114


to hold packets being exchanged across the hot-plug interface with the other controller, and a CRC unit


116


to perform error correction on individual or groups of packets.




The mirror entity


106


handles local and mirrored memory transactions and orders them via the transbus entity


108


. The mirror entity accommodates such transactions as mirrored reads, mirrored writes, and local reads in a mirrored space. The mirror entity


106


has a buffer


118


and a data memory


120


. A transaction queue


122


is formed from the buffer


118


and data memory


120


.




The transbus entity


108


enables the different components


100


-


106


to issue commands. The transbus entity


108


, along with the transaction queue


122


in the mirror entity


106


, provide the ordering functionality that orders transactions received at the controller.




The transbus entity


108


controls a multiple access bus


124


that interconnects the four components


100


-


106


. The PCI interface


100


, the NWay interface


104


and the mirror entity


106


initiate transactions on the transbus entity


108


and all four components


100


-


106


receive transactions from the transbus entity


108


. The transbus entity


108


identifies the source of the request (i.e., a source identifier (ID)), an address of the access, and other attributes.




Data is moved between the four components


100


-


106


via callback buses


126


. There is one callback bus


126


for each component that acts as an originator of a transaction. The PCI interface


100


, the NWay interface


104


, and the mirror entity


106


have a callback bus


126


.




Several transactions are described below to exemplify operation of the disk array controller. These transactions will be described in the following order: mirrored read, local read, mirrored write, and concurrent writes. The transactions are described in the context of the dual controllers with reference to

FIGS. 3 and 4

. However, the transactions may be performed in other multi-controller implementations involving more than two controllers.




Mirrored Read




In a mirrored read request, the controllers


54




a


and


54




b


use the memory map in the mirrored memories


62


and


62




b


to locate the requested data on the disk array. The data is then stored temporarily in the read cache maintained in the mirrored memories


62




a


and


62




b


. Suppose that controller


54




a


receives a request from the host via I/O module


70




a


to read data from the disk array. The controller CPU


88




a


processes the request and generates a mirrored read request that involves reading data from the mirrored memories


62




a


and


62




b


. The mirrored read request is sent to the PCI interface


100


of the AutoRAlD memory transaction manager


82




a.






In this example, the first controller


54




a


and its AutoRAlD memory transaction manager


82




a


are referred to as the “local” controller and manager because they receive and originate the mirrored read request. The second controller


54




b


and its AutoRAID memory transaction manager


82




b


are referred to as the “remote” controller and manager. In this mirrored read transaction, the local and remote managers perform the following steps:




Step 1: The PCI interface


100


makes a request to the transbus entity


108


via the bus


124


for a particular piece of data. The request includes a memory address, the number of bytes, and an indication that it is a read operation, and so forth.




Step 2: The mirror entity


106


accepts the request from the transbus entity


108


as its own and places the request in the transaction queue


122


. The queue


122


maintains order over various requests that are being handled by the local manager


82




a.






Step 3: When the request moves to the top of the queue


122


, the mirror entity


106


makes a request to the transbus entity


108


for the NWay entity


104


to obtain the mirrored copy of data from the remote memory transaction manager


82




b


. The request from the mirror entity


106


includes the same information (i.e., memory address, byte number, etc.) as found in the original request made in Step 1 by the PCI interface.




Step 4: The NWay entity


104


packages the read request in a packet and computes a CRC value for the packet using the CRC unit


116


. The NWay entity


104


transfers the read request packet over the bus


96


to the corresponding NWay entity in the remote AutoRAID memory transaction manager


82




b.






Step 5: The remote NWay entity checks the CRC value to ensure valid transfer and makes a request via the remote transbus entity to the remote NVRAM interface to read the particular piece of data. The remote NWay entity returns an acknowledgement to the local NWay entity


104


indicating that the request was properly received and has been posted for execution.




Step 6: Concurrently with step 5, the local NWay entity


104


makes a request to the local NVRAM interface


102


via the transbus entity


108


to read the particular piece of data.




Step 7: The local and remote NVRAM interfaces concurrently retrieve the same piece of data from their respective NVRAMs


62




a


and


62




b


. The local NVRAM interface


102


places the data on the callback bus


126


. The remote NVRAM interface places the data on a callback bus that is being monitored by the remote mirror entity, which then makes a request to the remote NWay entity to transfer the data to the local manager


82




a


. The remote NWay entity packages the data, error corrects the packet and transfers the packet across the bus


96


to the local NWay entity


104


.




Step 8: The local NWay entity


104


checks the CRC value to ensure valid transfer and posts the data on its callback bus


126


.




Step 9: In the mean time, the local mirror entity


106


that originated the read request watches the callback buses


126


from the NVRAM interface


102


and the NWay entity


104


. When the local data is received from the NVRAM interface


102


and the remote copy is received from the NWay entity


104


, the mirror entity


106


compares the two versions to make sure they match. A match indicates that the data is valid.




Step 10: Assuming the data is valid, the mirror entity


106


returns the data via the callback bus


126


back to the PCI interface


100


. The PCI interface


100


is monitoring the callback buses because it does not know where the data is coming from (e.g., whether from the mirror entity or the NVRAM interface). The PCI interface


100


then transfers the results back to the CPU or to the converter for return to the host.




Local Read




Suppose that controller


54




a


receives a read request that does not require a read compare operation. The local controller


54




a


can read the data from its own memory without requesting the copy from the remote controller. In the local read transaction, the local AutoRAID memory transaction manager performs the following steps:




Step 1: The PCI interface


100


makes a request to the transbus entity


108


via the bus


124


for a piece of data on the local NVRAM


62




a


. The request includes a memory address, the number of bytes, and an indication that it is a read operation, and so forth.




Step 2: The mirror entity


106


accepts the request from the transbus entity


108


as its own and places the request in the transaction queue


122


. In this manner, the local reads are handled in order with other types of transactions, including the mirrored read transactions described above.




Step 3: When the request moves to the top of the queue


122


, the mirror entity


106


makes the local read request to the transbus entity


108


. The NVRAM interface


102


accepts the request and retrieves the data from the NVRAM


62




a


. The NVRAM interface


102


places the data on its callback bus


126


.




Step 4: The PCI interface


100


is watching the callback bus


126


for the returned data. The PCI interface


100


then transfers the results back to the CPU or to the converter for return to the host.




Mirrored Write




The controllers


54




a


and


54




b


handle a mirrored write request by (1) storing the data in the respective write caches maintained in the mirrored memories and (2) updating the memory map in the mirrored memories. Suppose that controller


54




a


receives a write request to write data to a particular location on the disk array. The write request is received at the PCI interface


100


from the PCI bus. The local and remote AutoRAID memory transaction managers then perform the following steps:




Step 1: The PCI interface


100


makes a write request to the transbus entity


108


via the bus


124


. The request includes a memory address, a pointer to the data to be written, an indication that it is a mirrored write operation, and so forth.




Step 2: The mirror entity


106


accepts the write request from the transbus entity


108


and places the request in the transaction queue


122


. Once again, the transaction queue


122


maintains order among the various memory transactions.




Step 3: When the request moves to the top of the queue


122


, the mirror entity


106


places the write request back on the transbus entity


108


for acceptance by the NWay entity


104


. At this point, the mirror entity


106


is finished with the process.




Step 4: The NWay entity


104


packages the write request in a packet and computes a CRC value for the packet using the CRC unit


116


. The NWay entity


104


transfers the write request packet over the bus


96


to the corresponding NWay entity in the remote AutoRAID memory transaction manager


82




b.






Step 5: The remote NWay entity checks the CRC value to ensure valid transfer and makes a request via the remote transbus entity to the remote NVRAM interface to read the particular piece of data. The remote NWay entity returns an acknowledgement to the local NWay entity


104


indicating that the request was properly received and has been posted for execution.




Step 6: Concurrently with step 5, the local NWay entity


104


makes a request to the local NVRAM interface


102


via the transbus entity


108


to write the data to the write cache in the mirrored memory


62




a.






Step 7: The local and remote NVRAM interfaces write the same data to the write caches in their respective NVRAMs


62




a


and


62




b


. The local NVRAM interface


102


puts an acknowledgement on the callback bus


126


when the write is completed.




Step 8: The local PCI interface


100


monitors the callback bus for the acknowledgement, and returns an acknowledgement of the mirrored write completion upon receipt of the acknowledgement from the NVRAM interface


102


.




Concurrent Writes The last exemplary transactions involve a situation in which the CPU


88




a


on the local controller


54




a


requests a write to a specific memory location and the CPU


88




b


on the remote controller


54




b


requests a write to the same memory location. In this situation, the mirrored memory disk array controller is designed to ensure that write requests are ordered and fully completed. To do this, the dual controllers


54




a


and


54




b


assume roles of master/slave or primary/secondary, whereby one controller is crowned with the tasks of ordering transactions and the other controller agrees to abide by this ordering. Determining which controller is the master or primary can be handled ahead of time, as part of a preconfiguration process, or can be negotiated by the controllers when such situations arise.




For purposes of illustration, suppose that local controller


54




a


is the master or primary controller and remote controller


54




b


is the slave or secondary controller. The write requests are received at the respective PCI interfaces of the master and slave AutoRAID memory transaction managers


82




a


and


82




b


from the PCI bus. The master and slave AutoRAID memory transaction managers


82




a


and


82




b


then perform the following steps:




Step 1: At each manager, the PCI interfaces make a write request to the transbus entities. The request includes a memory address, a pointer to the data to be written, an indication that it is a mirrored write operation, and so forth.




Step 2: The mirror entities accept the write requests from the transbus entities and place the requests in their transaction queues.




Step 3: When the requests move to the tops of the queues, the mirror entities place the write requests back on the transbus entities for acceptance by the NWay entities.




Step 4: The NWay entities transfer the write requests as CRC-protected packets over the bus


96


to the opposing NWay entities. Acknowledgements of receipt are also exchanged.




Step 5: Concurrently with step 4, the local and remote NWay entities also make requests to their respective NVRAM interfaces via the transbus entities to write the data to the write caches in the mirrored memories


62




a


and


62




b


. As a result of steps


4


and


5


, the two write transactions will be posted to the transbus entities within the AutoRAID memory transaction manager. The order of these write transactions is controlled by their order on the transbus entities of the “master” controller, which in this case is controller


54




a


. Accordingly, if the write request received locally at the master controller


54




a


is posted first to the local transbus entity


108


before the write request from the remote (or slave) controller


54




b


is posted, the locally received write request is processed first. Conversely, if the write request received locally at the master controller


54




a


is posted to the local transbus entity


108


after the write request from the remote (or slave) controller


54




b


is posted, the remotely received write request is processed first. The slave AutoRAID memory transaction manager abides by this ordering.




Step 6: The local and remote NVRAM interfaces write the data in the winning write request to the write caches in their respective NVRAMs


62




a


and


62




b


. The NVRAM interfaces place respective acknowledgements on their callback buses when the write is completed.




Step 7: The PCI interfaces monitor the callback buses for the acknowledgement, and return an acknowledgement of the mirrored write completion upon receipt.




Atomicity




The mirrored read and write transactions described above are handled atomically across the hot-plug interface. The controllers are designed to either perform the read and write operations for both NVRAMs, or to abort performance for both NVRAMs. In this manner, the images presented in the NVRAMs remain identical.




Summary




The mirrored memory disk array controller is advantageous because it maintains a coherent memory map across a hot-plug interface. The dual controllers, and particularly the AutoRAID memory transaction managers, support CRC-protected memory transactions over the interface, and facilitate ordered execution of the transactions.




Although the invention has been described in language specific to structural features and/or method steps, it is to be understood that the invention defined in the appended claims is not necessarily limited to the specific features or steps described. Rather, the specific features and steps are disclosed as preferred forms of implementing the claimed invention.



Claims
  • 1. A storage management system for a redundant array of independentdisks (RAID) data storage system, comprising: at least two controllers interconnected via a hot-plug interface, each controller having a non-volatile memory to store redundant images of storage space provided by the data storage system; and each controller further having a memory transaction manager that orders memory transactions involving its non-volatile memory and facilitates transmission and receipt of the memory transactions over the hot-plug interface.
  • 2. A storage management system as recited in claim 1, wherein the redundant images stored by the non-volatile memories of the controllers appear identically across the hot-plug interface to external processes seeking to read or write data.
  • 3. A storage management system as recited in claim 1, wherein each memory transaction manager exchanges the memory transactions in cyclic redundancy check (CRC)-protected packets to guarantee valid transfer.
  • 4. A storage management system as recited in claim 1, wherein each memory transaction manager comprises a transaction queue to order the memory transactions.
  • 5. A storage management system as recited in claim 1, wherein the memory transaction managers perform the memory transactions for respective ones of the non-volatile memories in an atomic manner.
  • 6. A storage management system as recited in claim 1, wherein the controllers designate one of the memory transaction managers as having primary responsibility of ordering the memory transactions for all of the controllers.
  • 7. A storage management system as recited in claim 1, wherein each memory transaction manager comprises:a mirror entity to order the memory transactions; a local memory interface to execute the memory transactions with respect to a respective non-volatile memory; and a bus interface to facilitate transfer and receipt of the memory transactions over the hot-plug interface.
  • 8. A data storage system, comprising:a disk array having a number of disks that defines a physical storage space; a storage management system to manage memory transactions to the disks of the disk array via one or more buses, the storage management system comprising: at least two controllers interconnected via a hot-plug interface, each controller having a non-volatile memory to store redundant images of the physical storage space provided by the data storage system; and each controller further having a memory transaction manager that orders memory transactions involving its non-volatile memory and facilitates transmission and receipt of the memory transactions over the hot-plug interface, wherein error correction of the memory transactions guarantees valid transfer.
RELATED APPLICATIONS

This is a continuation of U.S. patent application Ser. No.09/103,329, filed Jun. 23, 1998, which is now U.S. Pat. No. 6,230,240.

US Referenced Citations (10)
Number Name Date Kind
5432922 Polyzois et al. Jul 1995 A
5479653 Jones Dec 1995 A
5548712 Larson et al. Aug 1996 A
5651133 Burkes et al. Jul 1997 A
5666512 Nelson et al. Sep 1997 A
5699510 Petersen et al. Dec 1997 A
5802394 Baird et al. Sep 1998 A
5928367 Nelson et al. Jul 1999 A
5960451 Voigt et al. Sep 1999 A
6151659 Solomon et al. Nov 2000 A
Non-Patent Literature Citations (1)
Entry
Wilkes et al, “The HP AutoRAID Hierarchical Storage System,” ACM Trans. on Comp. Systems, vol. 14, No. 1, Feb. 1996, pp. 108-136.
Continuations (1)
Number Date Country
Parent 09/103329 Jun 1998 US
Child 09/767305 US