A) Field of the Invention
The present invention relates to a disk controller for controlling a plurality of disk drives, and more particularly to a high reliability disk controller using connection-less type multiplex communication.
B) Description of the Related Art
U.S. Pat. No. 6,601,134 and No. 2003046460 disclose a storage system.
A disk sub-system (hereinafter simply called a “sub-system”) using magnetic disk drives as storage media has an input/output performance lower by three to four digits than that of a main storage of a computer using semiconductor storages as storage media. A lot of effort has been put into reducing this difference, i.e., improving the input/output performance of the sub-system. One method of improving the input/output performance of the sub-system is to use a disk controller which controls a plurality of magnetic disk drives into which data is distributively stored.
For example, a conventionally known disk controller such as shown in
Each channel adapter 2100 has: data link engines (DLEs) 2110 for executing packet transfer in the data system internal network; direct memory access (DMA) controllers (DMACs) 2120 for executing DMA transfer in the data system inner network; a selector 2115 for interconnecting DLEs 2110 and DMACs 2120; protocol engines (PE) 2130 for controlling communication between the host computer and disk drive; ports 2140 for connection to the host computer or disk drive; DLEs 2210 for executing packet transfer in the control system inner network; DMACs 2220 for DMA transfer in the control system inner network; micro-processors (MPs) 2230 for controlling the operation of the disk controller; and a selector 2125 for interconnecting DMACs 2120 and PEs 2130 or MPs 2230.
The cache memory adapter 2300 and control memory adapter 2301 each have: DLEs 2310 for executing DMA transfer in the data system internal network or control system internal network; DMACs 2320 for executing DMA transfer in each inner network; memory controllers (MCs) 2330; memory modules (MMs) 2340; a selector 2315 for interconnecting DLEs 2310 and DMACs 2320; and a selector 2325 for interconnecting DMACs 2320 and MCs 2330.
The switch adapter 2400 has: DLEs 2410 for executing packet transfer in the data system inner network; DMACs 2420 for executing DMA transfer in the data system inner network; and a selector 2430 for interconnecting DMACs 2420.
Data transfer between the adapters is realized by cooperative operations of DMACs in the respective adapters. As an example of this, with reference to
When a WRITE request is issued from the host computer via the connection port 2140, MP 2230 calculates an area of the cache memory adapter for temporarily storing WRITE data, and notifies the calculated result to DMAC 2120 in the channel adapter as a DMA list 2600. DMAC 2120 issues requests 2605 for acquiring paths to the cache memory adapters necessary for DMA transfer. Since the WRITE data is stored in a plurality of cache memory adapters (two cache memory adapters having DMAC 2321 and DMAC 2322) in order to improve the reliability, a plurality of path establishing requests are issued. After necessary paths are established, DMAC 2120 transfers the WRITE data to DMAC 2420 at the relay point switch, in accordance with the contents of the DMA list 2600. In this case, the WRITE data is transferred from the host computer by dividing it into a data amount having a predetermined size.
DMA 2420 of the switch adapter 2400 generates DMA sub-requests 2611 and 2612 for DMACs 2321 and 2322 of the cache memory adapters, in accordance with the transfer requests sent from DMAC 2120 of the channel adapter 2100. In response to the requests 2611 and 2612, DMACs 2321 and 2322 return sub-statuses 2621 and 2622 which are the request completion notices. After DMAC 2120 of the channel adapter confirms the sub-statuses 2621 and 2622, it issues the next DMA sub-request. When the sub-statuses of all the DMA sub-requests are returned, DMAC 2120 issues release requests 2625 for the established paths to the cache memory adapters, and returns a completion status 2630 to MP 2230 to thereby complete the process for the DMA list 2600. During the DMA transfer, MP 2230 accesses the control memory adapter 2301 when necessary. In this case, similar DMA transfer is performed between DMAC 2220 of the channel adapter 2100 and DMAC 2320 of the control memory adapter 2301.
The path establishing request 2605 is issued by using the command packet 2520. A data packet 2530 has: an address field 2531 for indicating a targeting DMAC; an address field 2532 for indicating an initiating DMAC; transfer data 2533; and an error check code 2535. The DMA sub-request is issued by using the data packet 2530.
As described above, DMA transfer in a conventional disk controller described in the above-cited Patent documents is performed by connection type non-multiplex communication because of easy implementation. Namely, DMAC establishes the paths necessary for the execution of DMA transfer, and during DMA transfer the paths are occupied (connection type communication). Moreover, until the sub-status for the DMA sub-transfer immediately before is confirmed, the next DMA sub-request cannot be executed (non-multiplex communication).
A conventional disk controller has therefore a low use efficiency of the inner network paths, which may hinder the performance improvement. In order to satisfy the conditions that the necessary path bandwidth is reserved at the limited path use efficiency, a complicated inner network configuration is required such as implementation of both the data system inner network and control system inner network, resulting in a high cost.
An object of the present invention is to provide a disk controller using connection-less type multiplex communication, capable of addressing issues of the prior art, realizing a high transfer efficiency (performance) while retaining a high reliability equivalent to that of a conventional disk controller, and realizing a low cost.
In order to solve the above-described issues, the present invention adopts the following configuration.
A disk controller includes: a channel adapter having a connection interface to a host computer or a disk drive; a memory adapter for temporarily storing data to be transferred between the host computer and disk drive; a processor adapter for controlling operations of the channel adapter and memory adapter; and a switch adapter for configuring an inner network by interconnecting the channel adapter, memory adapter and processor adapter, wherein: the channel adapter, memory adapter, processor adapter and switch adapter each include a DMA controller for performing a communication protocol control of the inner network; and packet multiplex communication is performed among the DMA controllers provided in the adapters.
According to the invention, by adopting connection-less type multiplex communication, multiplex becomes possible not only during one DMA sub-transfer (as will be later described, transfer state of the sub-DMA and sub-status shown in
Embodiments of a disk controller of this invention will be described in detail with reference to
The channel adapter 100, processor adapter 200, memory adapter 300 and switch adapter 400 have DMA controllers (DMACs) 120, 220, 320 and 420, respectively, the DMA controllers performing a communication protocol control of the inner network. Switch adapters can be connected each other by their expansion ports 440. DMACs execute DMA transfer with involvement data link engines (DLEs) 110, 210, 310 and 410, respectively. Connection-less type packet multiplex communication shown in
In the example shown in
The processor adapter 200 shown in
Since a DMA arbiter 2150 of the selector 215 arbitrates requests from a plurality of DMA controllers 220, DMA transfer from a plurality of DMACs via the same DLE 210 can be executed at the same time (connection-less communication). Reception data from DLE 210 is distributed by a DLE arbiter 2155 to a target DMAC 220.
As shown in
With the example of the structure shown in
The channel adapter shown in
The DMA controller 120 shown in
Similarly, the demulitplexer 1205 selects reception data under the control by the arbiter 1212 and stores it in a proper FIFO buffer among a plurality of reception FIFO buffers 1206. The packet assembly logic 1230 and packet disassembly logic 1240 are logic circuits for assembling and disassembling the packet. The sequence control logic 1213 and sequence management table 1220 manage the DMA sequence of DMA sub-transfers, the description of this operation being later given.
With the example shown in
Arbiters 3250 and 3255 arbitrate the contention conditions between DMAC 320 and MC 330. One MC can therefore be shared by a plurality of DMACs, and the priority order control among DMACs can be realized as the function of the arbiters. For example, if DMACs for the control system inner network and DMACs for the data system inner network are provided and the DMACs for the control system inner network are set to have a priority over the data system inner network, then accesses to the control system inner network can be suppressed from being influenced by interference of the operation of the data system inner network.
With the structure shown in
In the disk controller according to the embodiment of the invention, since DMA transfer is performed by connection-less type multiplex communication, it is necessary to guarantee the transfer sequence of DMA and properly perform an error check process and a failure recovery process. As a means for this, the DMA sequential field is provided so as to reliably identify the packet, and this field is controlled (preferably sequentially incremented) so as to make it unique (distinguishable) in one DMA transfer.
With the example of the packet structure shown in
The packet 500 shown in
With this packet structure, the DMA transfer function including duplicate WRITE can be applied to connection-less multiplex communication so that the disk controller of a high reliability can be realized.
The packet 500 shown in
With this packet structure, the routing control information, DMAC control information and data information can be protected by different error check codes, resulting in a finer DMA transfer control and a finer failure recovery process. Even if the routing control information is required to be rewritten such as when duplicate WRITE is performed via the switching adapter, it is possible to minimize the recalculation range of the error check code and realize the disk controller of a high reliability and a high performance.
The DMA controller 420 sends back completion sub-statuses 621 and 622 corresponding to the DMA transfer sub-request 610 to the DMA controller 120. The completion sub-statuses 621 and 622 contain the information of the DMA sequence field 524 contained in the DMA transfer sub-request 610. The DMA controller 120 confirms the information in this DMA sequence field in order to confirm the transfer sequence of DMA sub-transfers.
In the example of the structure shown in
The DMA controller 420 of the switch adapter generates a DMA sub-request packet 611 and a DMA sub-request packet 612 and transfers the packets to the respective targeting addresses. The former packet 611 has DMAC 321 as the targeting address field and contains the transfer data 531, and the latter packet 612 has DMAC 322 as the targeting address field and contains the transfer data 531. In response to the DMA sub-requests 611 and 612, the DMACs 321 and 322 of the memory adapter return sub-statuses 621 and 622 to the channel adapter DMAC 120 via the switch adapter DMAC 420.
The example of the structure shown in
With the example of the structure shown in
Until the header error check code is confirmed, the reception side DMA controller 420 in the switch adapter will not send the packet to the transmission side DMAC. After the header error check code is confirmed, the header and data field of the packet are sent to the transmission side DMAC in a pipeline processing manner. If an error is found by the header error check code, the packet is discarded and a proper error recovery process is executed.
With the example of the structure shown in
The adapter used by the disk controller according to the embodiment of the invention, such as the channel adapter shown in
With the example of the structure, the arbiter 1150 or 2150 controls to make the same DLE deal with transmission/reception for a series of DMA sub-requests and sub-statuses from the same DMAC. More preferably, a transmission/reception in the normal operation is fixed for the requests and statuses from the same DMAC.
With the example of the structure shown in
According to this embodiment of the invention, the reliability can be improved by enhancing the redundancy of the disc controller system.
As described so far, adopting the disk controller of the embodiments of the invention shown in
According to the embodiments, a plurality of DMACs can be set in one-to-one correspondence with one DLE. For example, the control system inner network and data system inner network can be mixed in one DLE. A plurality of DMACs can be set in one-to-one correspondence with one MC, so that the control system inner network and data system inner network can be mixed. A disk controller of a simpler inner network structure can be realized, satisfying both the performance improvement and low cost.
According to the embodiments, a proper sequence guarantee and its check are possible in the DMA transfer by connection-less type multiplex communication, and a proper failure recovery process can be performed when a failure occurs. With this structure, it becomes possible to realize a disk controller having a high reliability equivalent to the reliability of a conventional disk controller. According to the embodiments, the routing control information, DMAC control information and data information can be protected by different error check codes, resulting in a finer DMA transfer control and a finer failure recovery process. Even if the routing control information is required to be rewritten such as when duplicate WRITE is performed via the switching adapter, it is possible to minimize the recalculation range of the error check code and realize the disk controller of a high reliability and a high performance.
According to the embodiments, it becomes possible to realize cache memory duplicate WRITE by the switch adapter DMAC. Since DMAC of the switch adapter near the memory adapter generates the packets for duplicate WRITE, the bandwidth of the inner network will not be consumed wastefully and the path efficiency can be improved.
According to the embodiments, the switch adapter can start a transmission process from the transmission DLE before the whole data field is fetched from the reception DLE and the data field error check code is confirmed, and the packet having an illegal targeting address field because of an error in the header is discarded to prevent the propagation of the error. According to the embodiments, since the inner network route is fixed for a series of DMA sub-requests and sub-statuses, there is no possibility of a sequence exchange (outrun) due to different routes. The sequence control of DMA sub-requests and sub-statuses can be facilitated greatly.
According to the embodiments, the reliability can be improved by providing the redundance with the disk controller system. According to the embodiments, the scalability of the disk controller can be improved.
This application relates to and claims priority under 35 U.S.C. 119 from Japanese Patent Application No. 2004-038459 filed on Feb. 16, 2004 which is cited to support the present invention.
Number | Date | Country | Kind |
---|---|---|---|
2004-038459 | Feb 2004 | JP | national |
This application is a continuation application of U.S. Ser. No. 10/826,471, filed Apr. 16, 2004 now U.S. Pat. No. 7,231,469 and is related to U.S. Ser. No. 11/249,174, filed Oct. 11, 2005, which is a continuation-in-part of U.S. Pat. Ser. No. 10/826,471.
Number | Name | Date | Kind |
---|---|---|---|
5140592 | Idleman et al. | Aug 1992 | A |
5201053 | Benhase et al. | Apr 1993 | A |
5206943 | Callison et al. | Apr 1993 | A |
5249279 | Schmenk et al. | Sep 1993 | A |
5257391 | DuLac et al. | Oct 1993 | A |
5511227 | Jones | Apr 1996 | A |
5548788 | McGillis et al. | Aug 1996 | A |
5574950 | Hathorn et al. | Nov 1996 | A |
5586127 | Moriuchi | Dec 1996 | A |
5729763 | Leshem | Mar 1998 | A |
5740465 | Matsunami et al. | Apr 1998 | A |
5761534 | Lundberg et al. | Jun 1998 | A |
5949982 | Frankeny et al. | Sep 1999 | A |
5974058 | Burns et al. | Oct 1999 | A |
6108750 | Yamamoto et al. | Aug 2000 | A |
6148349 | Chow et al. | Nov 2000 | A |
6330626 | Dennin et al. | Dec 2001 | B1 |
6385681 | Fujimoto et al. | May 2002 | B1 |
6401149 | Dennin et al. | Jun 2002 | B1 |
6535953 | Kakiuchi | Mar 2003 | B1 |
6542951 | Sangveraphunski et al. | Apr 2003 | B1 |
6542961 | Matsunami et al. | Apr 2003 | B1 |
6581137 | Sandorfi | Jun 2003 | B1 |
6601134 | Yamagami et al. | Jul 2003 | B1 |
6604155 | Chong | Aug 2003 | B1 |
6609164 | Kallat | Aug 2003 | B1 |
6611879 | Dobecki | Aug 2003 | B1 |
6631433 | Paluzzi | Oct 2003 | B1 |
6636933 | MacLellan et al. | Oct 2003 | B1 |
6651130 | Thibault | Nov 2003 | B1 |
6671767 | Furuumi et al. | Dec 2003 | B2 |
6684268 | Paluzzi | Jan 2004 | B1 |
6711632 | Chow et al. | Mar 2004 | B1 |
6779071 | Kallat et al. | Aug 2004 | B1 |
6792506 | Solomon et al. | Sep 2004 | B2 |
6813689 | Baxter, III | Nov 2004 | B2 |
6816916 | Black et al. | Nov 2004 | B1 |
6820171 | Weber et al. | Nov 2004 | B1 |
6834326 | Wang et al. | Dec 2004 | B1 |
6865643 | Baxter, III | Mar 2005 | B2 |
6868479 | Thibault et al. | Mar 2005 | B1 |
6874039 | Ganapathy et al. | Mar 2005 | B2 |
6877059 | Solomon et al. | Apr 2005 | B2 |
6889301 | Wilson et al. | May 2005 | B1 |
6901468 | Paluzzi | May 2005 | B1 |
6910102 | Matsunami et al. | Jun 2005 | B2 |
6952734 | Gunlock et al. | Oct 2005 | B1 |
6961788 | Kanai et al. | Nov 2005 | B2 |
6970972 | Hosoya | Nov 2005 | B2 |
6985994 | Kanai et al. | Jan 2006 | B2 |
20010054120 | Horikawa et al. | Dec 2001 | A1 |
20020087751 | Chong | Jul 2002 | A1 |
20020188786 | Barrow et al. | Dec 2002 | A1 |
20030046460 | Inoue et al. | Mar 2003 | A1 |
20030061297 | Fujimoto | Mar 2003 | A1 |
20030084237 | Yoshida et al. | May 2003 | A1 |
20030131192 | Nakamura et al. | Jul 2003 | A1 |
20030140192 | Thibault et al. | Jul 2003 | A1 |
20030182502 | Kleiman et al. | Sep 2003 | A1 |
20030182516 | Fujimoto | Sep 2003 | A1 |
20030188032 | Solomon et al. | Oct 2003 | A1 |
20030188098 | Baxter, III | Oct 2003 | A1 |
20030188099 | Baxter et al. | Oct 2003 | A1 |
20030188100 | Solomon et al. | Oct 2003 | A1 |
20030191892 | Fujimoto et al. | Oct 2003 | A1 |
20030204649 | Kanai et al. | Oct 2003 | A1 |
20030229757 | Hosoya et al. | Dec 2003 | A1 |
20040024951 | Aruga | Feb 2004 | A1 |
20040098529 | Sangveraphunski et al. | May 2004 | A1 |
20040103244 | Fujimoto et al. | May 2004 | A1 |
20040111485 | Mimatsu et al. | Jun 2004 | A1 |
20040111560 | Takase et al. | Jun 2004 | A1 |
20040123028 | Kanai et al. | Jun 2004 | A1 |
20040139260 | Steinmetz et al. | Jul 2004 | A1 |
20040139365 | Hosoya | Jul 2004 | A1 |
20040177182 | Metevier et al. | Sep 2004 | A1 |
20040186931 | Maine | Sep 2004 | A1 |
20040193760 | Matsunami et al. | Sep 2004 | A1 |
20040193973 | Porat et al. | Sep 2004 | A1 |
20040199719 | Valin et al. | Oct 2004 | A1 |
20040205269 | Stolowitz | Oct 2004 | A1 |
20040243386 | Stolowitz et al. | Dec 2004 | A1 |
20040257857 | Yamamoto et al. | Dec 2004 | A1 |
20050010715 | Davies et al. | Jan 2005 | A1 |
20050021884 | Jeddeloh | Jan 2005 | A1 |
20050021888 | Yatziv et al. | Jan 2005 | A1 |
20050060443 | Rosner | Mar 2005 | A1 |
20050071424 | Baxter, III | Mar 2005 | A1 |
20050071556 | Walton et al. | Mar 2005 | A1 |
20050076177 | Mori | Apr 2005 | A1 |
20050080946 | Hosoya | Apr 2005 | A1 |
Number | Date | Country |
---|---|---|
10 2004 013112 | Jun 2005 | DE |
1353264 | Oct 2003 | EP |
2 366 049 | Feb 2002 | GB |
05-242006 | Sep 1993 | JP |
07-056693 | Mar 1995 | JP |
07-200187 | Aug 1995 | JP |
07-210320 | Aug 1995 | JP |
09-198308 | Jul 1997 | JP |
10-105347 | Apr 1998 | JP |
2000-298641 | Oct 2000 | JP |
2003-263279 | Sep 2003 | JP |
8803679 | May 1988 | WO |
9113404 | Sep 1991 | WO |
0203387 | Jan 2002 | WO |
03043254 | May 2003 | WO |
Number | Date | Country | |
---|---|---|---|
20070130385 A1 | Jun 2007 | US |
Number | Date | Country | |
---|---|---|---|
Parent | 10826471 | Apr 2004 | US |
Child | 11652567 | US |