Storage system and method for automatic restoration upon loop anomaly

Information

  • Patent Application
  • 20080010547
  • Publication Number
    20080010547
  • Date Filed
    September 25, 2006
    18 years ago
  • Date Published
    January 10, 2008
    16 years ago
Abstract
A storage system has a controller and a plurality of storage devices connected to a pair of loops, the controller diagnoses storage devices and performs automatic high-speed restoration of the loops. A storage system has a controller, a plurality of storage devices, and switch circuits having a plurality of switches to connect each of the plurality of storage devices to loops. When a loop anomaly is detected, a restoration instruction is issued to a switch circuit, the plurality of switches are operated in parallel, storage devices are diagnosed, and according to the diagnosis results, a storage device judged to be anomalous are bypassed from the loop by operating the switches. When a loop anomaly is detected, diagnosis of the disk drives connected to the loop can be performed in parallel, so that the time until automatic restoration of the loop can be greatly reduced.
Description

BRIEF DESCRIPTION OF THE DRAWINGS


FIG. 1 shows the configuration of the storage system of an embodiment of the invention;



FIG. 2 shows the configuration of the centralize module of FIG. 1;



FIG. 3 shows the configuration of the FC switches of FIG. 1;



FIG. 4 explains the FC loop table of FIG. 1;



FIG. 5 shows the automatic restoration processing flow of an embodiment of the invention;



FIG. 6 explains the loop anomaly detection processing of FIG. 5;



FIG. 7 explains the error statistical processing of FIG. 5;



FIG. 8 explains the FC loop disconnect processing of FIG. 5;



FIG. 9 explains the LDR request processing of FIG. 5;



FIG. 10 explains the bypass instruction request processing of FIG. 5;



FIG. 11 explains the PTBI diagnosis processing instruction processing of FIG. 5;



FIG. 12 explains the connect processing of FIG. 5;



FIG. 13 explains the LDR execution result configuration processing of FIG. 5;



FIG. 14 explains the anomalous disk detachment processing of FIG. 5;



FIG. 15 explains the FC loop incorporation processing of FIG. 5; and,



FIG. 16 explains automatic restoration control of the prior art.





DESCRIPTION OF THE PREFERRED EMBODIMENTS

Below, embodiments of the invention are explained in the order of a configuration of a data storage system, automatic restoration processing of a data storage system, and other embodiments.


Configuration of a Data Storage System



FIG. 1 shows the configuration of the storage system of an embodiment of the invention, FIG. 2 shows the configuration of the control module of FIG. 1, FIG. 3 shows the configuration of the FC switches of FIG. 1, and FIG. 4 explains the FC loop table of FIG. 2. FIG. 1 shows an example in which one storage controller (control module) is connected to one disk enclosure.


As shown in FIG. 1, the storage controller (centralize module) 4 has a control unit (cache manager) 40, a pair of channel adapters 41, and a pair of device adapters 42. The control unit 40 is connected to the service management controller 44.


The channel adapters 41 are connected to a host computer, not shown, via Fibre Channel or Ethernet (a registered trademark). The device adapters 42 are connected to each of the disk drives 1-1 to 1-n in the device enclosure 2 (described below) by for example Fibre Channel FC loops 2-1, 2-2.


That is, one of the disk adapters 42 of the centralize module 4 accesses each of the disk drives 1-1 to 1-n via the first FC loop 2-1, and the other disk adapter 42 accesses each of the disk drives 1-1 to 1-n via the second FC loop 2-2. By this means, a dually redundant connection path configuration is realized.


The device enclosure 2 has a pair of print boards 20-1 and 20-2, which respectively have port control circuits 22-1, 22-2 and FC switches 24-1, 24-2, as well as disk drives 1-1 to 1-n. Each of the FC switches 24-1 and 24-2 is connected to an FC loop 2-1, 2-2, as shown in FIG. 3, to connect, bypass, or perform loopback of the individual disk drives 1-1 to 1-n.


The port bypass circuits (abbreviated “PBCs”) 22-1, 22-2 are connected to the service management controller 44 of the control module 4 via Ethernet (a registered trademark) or another network, and perform state management of the disk enclosure 2 and execute control of the FC switches 24-1, 24-2.


The service management controller (abbreviated “SMC”) 44 receives state information for the device enclosure 2 from the port bypass circuits 22-1, 22-2, performs state monitoring, and issues necessary commands to the port control circuits 22-1, 22-2.


Hence each of the disk drives 1-1 to 1-n is connected to the control module 4 via two independent FC loops 2-1 and 2-2. Here, a single device enclosure 2 is shown; but a plurality of (for example, four of) such device enclosures 2 may be connected in series in the FC loops 2-1, 2-2.


As shown in FIG. 2, the control module 4 has a cache manager 40, a channel adapter (first interface portion; hereafter “CA”) 41, device adapter (second interface portion; hereafter “DA”) 42, and DMA (Direct Memory Access) engine (communication portion; hereafter “DMA”) 43.


The cache manager 40 performs read/write processing based on processing requests (read requests and write requests) from the host computer, and comprises memory 410, a processing unit (CPU) 400, and a memory controller 420.


The memory 410 has a so-called cache area 412, which serves as cache for a plurality of disks to hold a portion of the data stored on the plurality of disk drives in the disk enclosure 2 of FIG. 1, an FC loop table 414, and another work area 416.


The processing unit (CPU) 400 executes control of the memory 410, channel adapter 41, device adapter 42, DMA 43, and service management controller 44. For this purpose, one or a plurality of CPUs 400 (in the figure, one) and a memory controller 420 are provided. The memory controller 420 controls reading and writing of the memory 410, and also performs path switching.


The memory controller 420 is connected to the memory 410 via the memory bus 432, and is connected to the CPU 400 via the CPU bus 430; the memory controller 420 is also connected to the device adapter 42 via a four-lane high-speed serial bus (for example, PCI-Express) 440.


Similarly, the memory controller 420 is connected via a four-lane serial buses (for example, PCI-Express) 443, 444, 445, 446 to the channel adapter 41 (here, four channel adapters 41a, 41b, 41c, 41d), and via a four-lane serial bus (for example, PCI-Express) 448 to the DMA 43.


This PCI-Express or other high-speed serial bus communicates using packets, and by providing a plurality of serial bus lanes, so-called low-latency communication is possible, with little delay and rapid response, even when the number of signal lines is reduced.


The channel adapters 41a to 41d are an interface with host computers; the channel adapters 41a to 41d are connected to different host computers. It is preferable that the channel adapters 41a to 41d be connected by a bus, such as for example a Fibre Channel or Ethernet (a registered trademark) bus, to the interface portions of the respective corresponding host computers; in this case, optical fiber or coaxial cable is used as the bus.


Further, each of the channel adapters 41a to 41d is configured as a portion of the control module 4. The channel adapters 41a to 41d support a plurality of protocols as interface portions between the corresponding host computers and the cache manager 40.


Because the protocols to be installed are not the same, depending on the corresponding host computers, the channel adapters 41a to 41d are mounted on print boards separate from the cache manager 40, to enable easy replacement as necessary.


For example, as protocols with host computers which are to be supported by the channel adapters 41a to 41d, in addition to those described above, Fibre Channel, and iSCSI (Internet Small Computer System Interface) supporting Ethernet (a registered trademark), and others can be used.


As stated above, each of the channel adapters 41a to 41d is directly connected to the cache manager 40 by a bus 443 to 446, designed to connect LSI (Large Scale Integration) devices to print boards, such as a PCI-Express or other bus. By this means, the high throughput required between the channel adapters 41a to 41d and the controller 40 can be realized.


Next, the device adapter 42 is an interface with each of the disk drives in the device enclosure, and here has four FC (Fibre Channel) ports.


Further, as stated above, the device adapter 42 is directly connected to the cache manager 40 by a bus designed to connect LSI (Large Scale Integration) devices to print boards, such as a PCI-Express or other bus. By this means, the high throughput required between the disk adapter 42 and the cache manager 40 can be realized.


As shown in FIG. 2, the DMA engine 43 communicates with the cache manager 40 of another control module, not shown, and is for example used in mirroring processing.



FIG. 3 is used to explain the FC switches of FIG. 1. The FC switch 24-1 (24-2) has port switches 28-1, 28-2, 28-3, 28-4, . . . , 28-n for connection, bypass and loopback of each of the disk drives (magnetic disk devices) 1-1 to 1-n, and a port switch controller 26 connected to the fibre channel loop 2-1 (2-2) to perform connection control with each of the port switches 28-1 to 28-n and the fibre channel loop 2-1 (2-2).


The FC switch 24-1 (24-2) does not have an FC_AL loop configuration for physical serial connection to each of the disk drives, as shown in FIG. 16; but the port switch controller 26 logically emulates the sending and receiving of data in an FC_AL loop configuration, and controls reading from and writing to the disk drives.


That is, this port switch controller 26 analyzes the FC headers from the fibre channel loop 2-1 (2-2), and distributes the FC header commands to the relevant port switches 28-1 to 28-n. In other words, requests and data from the loop 2-1 (2-2) are directly exchanged with the disk drives 1-1 to 1-n via the relevant port switch 28-1 to 28-n.


The port switch controller 26 also performs diagnoses of each of the port switches 28-1 to 28-n in response to a PTBI (Port Test Before Insertion) instruction, and controls the switched states of each of the port switches 28-1 to 28-n according to the diagnosis results.


Each of the magnetic disk devices 1-1 to 1-n has a pair of FC (Fibre Channel) chips for connection to the two port switches, a control circuit, and a disk drive mechanism. The FC chips have CRC check functions.


As shown in FIG. 4, the fibre channel loop table (FC map) 414 has map tables 414-1 to 414-m for each fibre channel path 2-1, 2-2. Each of the map tables 414-1 to 414-m stores a WWN (World Wide Name) for the magnetic disk device connected to the fibre channel loop, an ID number for the disk enclosure 10 in which the magnetic disk device is housed, a slot number indicating the housing position of the magnetic disk device in the disk enclosure 10, and an ID number for the fibre channel loop.


Automatic Restoration Processing of a Data Storage System


Next, FIG. 5 through FIG. 15 are used to explain automatic restoration processing in an embodiment of the invention. FIG. 5 shows the flow of automatic restoration processing in an embodiment of the invention, and FIG. 6 through FIG. 15 explain the operations therein. The flow of processing in FIG. 5 is explained referring to FIG. 6 through FIG. 15.


First, in FIG. 6, the control program of the centralize module 40 is explained. The host interface portion 41 is the above-described channel adapter. The monitoring control interface management portion 44-1 is a monitoring control program executed by the service management controller 44.


The control unit 40 has programs which execute an I/O control portion 460, error statistical processing portion 462, RAS (Recovery Access System) control portion 464, active maintenance control portion 466, and FC interface management portion 468.


(S10) As shown in FIG. 6, a host request (read/write request) is received by the channel adapter 41, and the I/O control portion 460 of the control unit 40 is notified. The I/O control portion 460 performs read or write access of the magnetic disk drive in which the requested data exists, via the FC interface management portion 468, device adapter 42 which is not shown (see FIG. 1), and FC loop 2-1, 2-2. Here, a RAID-1 or higher configuration is employed, so that the magnetic disk drive 1-1 is accessed from the FC loop 2-1, and the magnetic disk drive 1-2 is accessed from the FC loop 2-2. The I/O control portion 460 monitors response to this access, and when there is response failure, judges that there is a loop anomaly for the disk drive from which there is no response. For example, when there is response failure (in the case of reading, no data is returned) due to an anomaly in the disk drive itself, an anomaly in the FC switch, or a connection fault between the disk drive and the FC switch, or when the FC switch is executing the above-described PTBI processing, then a response failure occurs.


(S12) Upon judging a loop anomaly to have occurred as described above, the I/O control portion 460 reports an anomaly for the disk drive from which there has been no response to the error statistical processing portion 462, as shown in FIG. 7. The error statistical processing portion 462 computes statistics for details of each error (error code).


(S14) The error statistical processing portion 462 counts the frequency of disk access errors, and when the frequency exceeds a threshold value, the RAS control portion 464 logically disconnects the relevant FC loop (here, FC loop 2-2), as shown in FIG. 8. That is, although physically connected, the control module 40 prohibits use of the FC loop 2-2.


(S16) As shown in FIG. 9, the RAS control portion 464 then requests automatic FC loop restoration of the monitoring control interface portion 44-1 of the service management controller 40, and the monitoring control interface portion 44-1 requests LDR (Loop Down Recovery) of the PBC (port bypass circuit) 22-2 of the relevant FC loop 2-2.


(S18) As shown in FIG. 10, the PBC 22-2 issues a bypass instruction to the FC switch 24-2. As a result, the FC switch control circuit 26 of the FC switch 24-2 bypasses all the port switches 28-1 to 28-n of the FC switch 24-2 from the FC loop 2-2. That is, the switches are initialized.


(S20) As shown in FIG. 11, the PBC 22-2 issues a connect instruction to all switches of the FC switch control circuit 26. As a result of this connect instruction, the FC switch control circuit 26 performs a PTBI (Port Test Before Insertion) diagnosis, and performs connection.


(S22) That is, as shown in FIG. 12, the FC switch control circuit 26 performs loopback of all the port switches 28-1 to 28-n, as explained in FIG. 3. Through this switch loopback operation, N number of FC_AL loops that is connected a single disk drive are logically formed. An initialization process is executed for each of the N FC_AL loops, and the disk drive transmits frames to each of N FC_AL loops. That is, each of the disk drives 1-1 to 1-n independently comprises an FC loop, executes an initialization process, and transmits the frame signals. The FC switch control circuit 26 monitors the frame signals for each of the disk drives 1-1 to 1-n, and performs diagnoses of the individual disk drives 1-1 to 1-n. When for example there is an anomaly in a disk drive, a transmitted frame signal does not return, and so the disk drive is judged to be anomalous. The FC switch control circuit 26 connects the port switches of disk drives not judged to be anomalous, and bypasses the port switches of disk drives which are anomalous. That is, diagnosis processing is performed in parallel for all the disk drives.


(S24) As shown in FIG. 13, while waiting the time required for this diagnosis processing, the RAS control portion 464 queries the PBC 22-2 for the LDR execution results via the monitoring control interface management portion 44-1. The PBC 22-2 receives the PTBI diagnosis results from the FC switch 24-2 and sends a response with the diagnosis results to the RAS control portion 464.


(S26) As shown in FIG. 14, the RAS control portion 464 issues a request to the error statistical processing portion 462 for disconnection of anomalous disks based on the diagnosis results. The error statistical processing portion 462 writes information indicating that the anomalous disk drives are faulty to the configuration information table of the disk drive management information (including the above-described error frequencies). By this means, the anomalous disk drives are logically disconnected.


(S28) As shown in FIG. 15, the RAS control portion 464 then issues a request for FC loop incorporation to the active maintenance control portion 466. The active maintenance control portion 466 acquires the WWN of each disk drive from the FC loop 2-2 and FC switch 24-2 via the FC interface management portion 468, and creates the FC loop map 414 shown in FIG. 4. Here, the anomalous disk drive 1-2 is in a bypassed state on the FC loop 2-2, and so the anomalous disk drive 1-2 is not incorporated into the FC map. That is, during disk access the I/O control portion 460 excludes the drive from the referenced FC loop map 414. By this means, unintentional accessing of an anomalous disk drive can be prevented. As a result, automatic restoration is completed, and the I/O control portion 460 can use the FC loop 2-2 to access disks.


In this way, when a loop anomaly is detected, diagnoses of the disk drives connected to the loop are performed in parallel, so that the time until automatic loop restoration is approximately several seconds, and can be shortened to from 1/10 to 1/60 approximately of the time required in the prior art.


Further, an FC switch having port test functions is used to perform diagnoses in parallel of the disk drives connected to the loop, so that the load on the control module is not so great. As a result, declines in the performance of disk access and cache control using the control module with no loop anomalies can be prevented, and in particular, performance in a RAID configuration when responding to hosts is improved.


Further, if there is an anomaly in a disk drive 1-2, then when accessing the disk drive 1-2 from a loop with no anomalies, the loop without anomalies is also judged to be anomalous, so that both loops are anomalous. But as in this aspect, automatic restoration processing is performed with one loop in the anomalous state, and so a case in which both loops are anomalous can be prevented.


Other Embodiments

In the above embodiments, when an anomaly occurs in one loop, automatic restoration processing is executed for that loop; but when there are anomalies in both loops, automatic restoration processing can also be performed for both loops.


Similarly, in cases where, with an anomaly in one loop, restoration is not possible even when automatic restoration processing is performed, when both loops have become unable to operate, by automatically starting functions for on/off diagnosis of the power supply of the device enclosure 2, diagnoses of all circuits of the print boards 20-1 and 20-2 can be performed, to execute restoration processing.


Further, in the above embodiments, the case of a redundant RAID configuration such as shown in FIG. 1was explained; but application to storage systems with other redundant configurations is also possible. Further, application to various storage systems, employing as storage devices magnetic disks, optical disks, magneto-optical disks, and similar is possible.


In the above, embodiments of the invention have been explained; but various modifications can be made within the scope of the invention, and these modifications are not excluded from the scope of the invention.


Using a switch circuit having a plurality of switches to connect each of a plurality of storage devices to a loop, when a loop anomaly is detected, a restoration instruction is issued to the switch circuit, the plurality of switches are operated in parallel, the storage devices are diagnosed, and according to the diagnosis results, storage devices judged to be anomalous are bypassed from the loop by operating the switches, so that when a loop anomaly is detected it is possible to perform diagnoses in parallel of disk drives connected to the loop, and the time until automatic restoration of the loop can be greatly reduced.

Claims
  • 1. A storage system for accessing storage devices according to data access requests from a requesting device, comprising: a plurality of storage devices;at least one controller, which controls access to said plurality of storage devices;a pair of loops, in which information circulates in a single direction, to exchange information between said controller and said plurality of storage devices; andswitch circuits, provided in each of said loops, having a plurality of switches to connect said loops to each of said plurality of storage devices,wherein when said controller detects a loop anomaly while accessing said storage devices, said controller issues an instruction to said switch circuit to restore said loop,and said switch circuit, in response to said restoration instruction, performs operations on said plurality of switches in parallel, diagnoses said storage devices, and according to the diagnosis results, bypasses said storage device judged to be anomalous from said loop by means of operation of said switches.
  • 2. The storage system according to claim 1, wherein when said controller detects an anomaly in one of said loops during access of said storage devices via said one loop, said controller issues an instruction for restoration of said loop to said switch circuit of said one loop.
  • 3. The storage system according to claim 1, wherein said controller, according to the diagnosis results of said switch circuit, disconnects said storage device judged to be anomalous from objects for said access.
  • 4. The storage system according to claim 1, further comprising a port bypass circuit which controls switch circuits in said loop, wherein said port bypass circuit, in response to said restoration instruction, issues an instruction to said switch circuit to bypass all switches, and after all of said switches have been bypassed by said switch circuit, issues an instruction for said diagnosis to said switch circuit.
  • 5. The storage system according to claim 4, wherein said switch circuit, in response to said diagnosis instruction, performs loopback of all of said switches, and executes said diagnoses according to signals from said storage devices.
  • 6. The storage system according to claim 5, wherein said switch circuit, after execution of said diagnoses, connects the switches of said storage devices not judged to be anomalous to said loop.
  • 7. The storage system according to claim 1, wherein said controller, in response to said diagnosis results, executes processing to incorporate said storage devices in said loop.
  • 8. The storage system according to claim 1, wherein said controller comprises: a control unit which controls access to a plurality of storage devices; anda service controller, connected to said switch circuits, which issues instructions for said restoration to said switch circuits.
  • 9. The storage system according to claim 1, wherein said controller comprises: a channel circuit to control an interface with said requesting device;a device circuit which sends and receives information with said storage devices via said loops;a control unit which controls said channel circuit and said device circuit and which accesses said storage devices according to data access requests from said requesting device; anda service controller which is connected to said control unit, and which connects with said switch circuits.
  • 10. The storage system according to claim 1, wherein said plurality of storage devices have a RAID configuration.
  • 11. The storage system according to claim 1, wherein said storage devices comprises disk drives.
  • 12. An automatic restoration method of a storage system for accessing storage devices in response to data access requests from a requesting device, comprising the steps of: detecting a loop anomaly during accessing of said storage devices by a controller connected to said plurality of storage devices by a pair of loops;issuing, in response to detection of said loop anomaly, an instruction from said controller to a switch circuit having a plurality of switches, which connect said loop to each of said plurality of storage devices, to restore said loop; andperforming restoration by, in response to said restoration instruction, using said switch circuit to perform operations in parallel on said plurality of switches, diagnosing said storage devices, and in response to the diagnosis result, bypassing a storage device judged to be anomalous from said loop by means of operation of said switches.
  • 13. The automatic restoration method of a storage system according to claim 12, wherein said restoration instruction step comprises a step, upon detecting an anomaly in one of said loops during accessing of said storage devices via said one loop, issuing an instruction for restoration of said loop to said switch circuit of said one loop.
  • 14. The automatic restoration method of a storage system upon a loop anomaly according to claim 12, further comprising a step, in response to the diagnosis results of said switch circuit, of disconnecting said storage devices judged to be anomalous from said object of access of said controller.
  • 15. The automatic restoration method of a storage system upon a loop anomaly according to claim 12, wherein said restoration step comprises: a step of issuing, in response to said restoration instruction, from a port bypass circuit which controls the switch circuit of said loop, an instruction to said switch circuit to bypass all switches; anda step, after said switch circuit bypasses all of said switches, of issuing said diagnosis instruction from said port bypass circuit to said switch circuit.
  • 16. The automatic restoration method of a storage system according to claim 15, wherein said diagnosis step comprises a step, in response to said diagnosis instruction, of performing loopback of all of said switches, and in response to signals from said storage devices, of executing said diagnosis.
  • 17. The automatic restoration method of a storage system according to claim 16, further comprising a step, after executing said diagnosis by said switch circuit, of connecting said loop to said storage devices not judged to be anomalous.
  • 18. The automatic restoration method of a storage system according to claim 12, further comprising a step, in response to said diagnosis results, of executing processing to incorporate said storage devices in said loop by said controller.
  • 19. The automatic restoration method of a storage system according to claim 12, wherein said restoration instruction step comprises a step of issuing said restoration instruction to said switch circuit from a service controller connected to a control unit which controls access to said plurality of storage devices and to said switch circuit.
  • 20. The automatic restoration method of a storage system according to claim 12, wherein said plurality of storage devices have a RAID configuration.
Priority Claims (1)
Number Date Country Kind
2006-144768 May 2006 JP national