This application is based upon and claims the benefit of priority of the prior Japanese Patent Application No. 2013-175250, filed on Aug. 27, 2013, the entire contents of which are incorporated herein by reference.
The embodiment discussed herein is related to an information processing system and a failure processing method of the information processing system.
An information processing system including a plurality of nodes has, for example, a building block (BB) structure. For example, in an information processing system in which a plurality of nodes share a memory, the nodes share the memory via a crossbar. An application operating in the information processing system uses the shared memory to improve processing performance of the system. On the other hand, operating systems (hereinafter referred to as OSs) and hypervisors operating in the respective nodes operate on local memories of the nodes. Since the OSs and the hypervisors operate on the local memories, independency of the nodes increases and availability of the system is improved.
In such an information processing system, when a failure occurs in hardware of a part of the nodes, the fail node in which the failure occurs needs to be detected. Further, in a state in which the fail node is separated from the system, the operation of the system needs to be resumed. The detection of the failure of the hardware is disclosed in, for example, Japanese Patent Application Laid-Open No. 2011-248653.
The information processing system sequentially specifies the fail node and analyzes the necessity of separation of the fail node from the system on the basis of log information including events of failures. Therefore, as the number of nodes of the information processing system and the number of types of failure events increase, the analysis time for specifying the fail node and analyzing the necessity of the separation of the fail node from the system also increases. Since the data amount of the log information is enormous, time is also consumed by collection of the log information.
According to a first aspect of the embodiment, an information processing system includes a plurality of nodes, and a shared memory connected to the plurality of nodes, wherein each of the nodes includes, a plurality of functional circuits, a control device configured to control the functional circuits, and a register configured to store a plurality of interrupt factors that occur in the plurality of functional circuits, and wherein the control device in one node among the plurality of nodes receives the interrupt factor in each register of a plurality of other nodes in response to an occurrence of the interrupt factor of one node among the plurality of other nodes, extracts an interrupt factor to be detected as a failure among the received interrupt factors, specifies a fail node according to an extraction result, and, after suppressing access to the shared memory by the fail node, controls to separate the fail node from the information processing system on basis of log information received from the plurality of other nodes.
The object and advantages of the invention will be realized and attained by means of the elements and combinations particularly pointed out in the claims.
It is to be understood that both the foregoing general description and the following detailed description are exemplary and explanatory and are not restrictive of the invention.
The system board 1A includes a plurality of central processing units (CPUs) 12a, memories 3 and 11a, and an input output (I/O) device 13a. A part of regions of the memories 3 and 11a is used as a shared memory 3 shared by all CPUs included in the information processing system 1. The other part of the regions is used as a local region 11a in which the CPUs 12a store kernel data and the like. The other system boards 1B to 1E have the same configuration as the system board 1A. In the following explanation, the system boards are referred to as nodes.
In a firmware layer 14a of the node 1A, for example, control software called hypervisor operates. The hypervisor logically divides resources of the node 1A and generates one or a plurality of logical partitions Da and Db. When the plurality of logical partitions Da and Db are generated, a plurality of operating systems (hereinafter referred to as OSs) can operate on one node. In an example depicted in
Applications pa to ph operating on the logical partitions Da to Dh use, for example, the shared memory 3. That is, in this embodiment, a distributed shared memory is configured in which the nodes include parts of the shared memory 3 and use the shared memory 3 of the other nodes. The applications pa to ph perform predetermined processing on the basis of shared information stored in the shared memory 3. The hypervisors and the OSs operate on the respective local memories 11a to 11e. Consequently, independency of the nodes increases and availability of the system is improved.
In the information processing system 1 including the distributed shared memory 3, for example, when the CPUs 12a of the node 1A, in executing the application pa, access a region of the shared memory 3 that belongs to a node different from the node 1A (e.g., the node 1B), the CPUs 12a transmit a request for access to the region of the shared memory 3 of the node 1B via the crossbar switch 2. When the CPUs 12a access a region of the shared memory 3 of the own node 1A, the CPUs 12a transmit a request for memory access via direct connection.
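The routing rule in the preceding paragraph can be sketched as follows. This is only an illustration: the `AccessRequest` type and the `route` function are hypothetical names, and the sketch models nothing beyond the choice between the direct connection and the crossbar switch.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class AccessRequest:
    requester: str   # node issuing the access, e.g. "1A" (hypothetical field)
    home_node: str   # node holding the target region of the shared memory


def route(request: AccessRequest) -> str:
    """Return the path a shared-memory access request takes."""
    if request.requester == request.home_node:
        # Region belongs to the own node: access via direct connection.
        return "direct"
    # Region belongs to another node: the request goes through the crossbar switch.
    return "crossbar"
```

For example, an access from the node 1A to a region held by the node 1B is routed through the crossbar, while an access from the node 1A to its own region is not.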
[Configuration of the Information Processing System]
In the example depicted in
In the example depicted in
In this example, the first, second, third, and fourth system boards 1A, 1B, 1C, and 1D are connected to the first crossbar switch 2A. The fifth, sixth, seventh, and eighth system boards 1E, 1F, 1G, and 1H are connected to the second crossbar switch 2B. The ninth, tenth, eleventh, and twelfth system boards 1I, 1J, 1K, and 1L are connected to the third crossbar switch 2C. The thirteenth, fourteenth, fifteenth, and sixteenth system boards 1M, 1N, 1O, and 1P are connected to the fourth crossbar switch 2D.
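The grouping described above, four system boards per crossbar switch, can be expressed as a simple index calculation. The sketch below assumes only the labels given in the text (boards 1A to 1P, switches 2A to 2D); the function name `crossbar_for` is hypothetical.

```python
import string

BOARDS_PER_SWITCH = 4
# Board labels 1A..1P and switch labels 2A..2D as used in the text.
BOARD_IDS = ["1" + letter for letter in string.ascii_uppercase[:16]]
SWITCH_IDS = ["2A", "2B", "2C", "2D"]


def crossbar_for(board_id: str) -> str:
    """Return the crossbar switch to which a system board is connected."""
    index = BOARD_IDS.index(board_id)  # raises ValueError for unknown boards
    return SWITCH_IDS[index // BOARDS_PER_SWITCH]
```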
In the example depicted in
The system boards 1A to 1P also include system control devices (depicted in
[Configuration of the System Boards]
The system board unit B1 includes, for example, the plurality of CPUs (CPU chips) 12a, a system controller 15, an I/O controller 16, a peripheral component interconnect (PCI) Express 17, a memory access controller 18, the memories 3 and 11a, and a maintenance bus controller (hereinafter referred to as MBC) 19. The memories 3 and 11a are, for example, dynamic random access memories (DRAMs). The MBC 19 controls a communication path to the service processor board B2.
The CPUs 12a are arithmetic processing devices that execute the applications pa and pb explained with reference to
The system controller 15 performs transfer control between the CPUs 12a and the memory access controller 18. The system controller 15 is connected to the crossbar switch 2A via the connection ports ax and ay and performs transfer control between the crossbar switch 2A and the CPUs 12a and transfer control between the crossbar switch 2A and the memory access controller 18. For example, the system controller 15 plays a role of a bridge circuit.
In
The service processor board B2 includes the system control device 22 and a maintenance bus controller (hereinafter referred to as MBC) 21. The system control device 22 performs control such as access control to hardware in a node, monitoring, power-on, collection of a log, and user interface control (user I/F). The MBC 21 controls a communication path to the system board unit B1. The MBC 21 includes a register rg that stores interrupt factors that occur in hardware such as the CPUs 12a, the memories 3 and 11a, the I/O controller 16, and the system control device 22. As explained above with reference to
In the example depicted in
A specific example of the register rg explained with reference to
[Register]
In
Processing during failure occurrence is explained below. In this embodiment, the register rg is used in failure occurrence processing explained below.
[Failure Occurrence Processing]
When a failure analysis of the entire information processing system 1 is performed, it is efficient that one of the plurality of nodes mainly performs the failure analysis. For the efficiency, the information processing system 1 sets the system control device of one node as a master system control device and sets the system control devices of the other nodes as slave system control devices. Alternatively, as a system control device to which the master system control device is switched, the information processing system 1 may set the system control device of another node as a substitute system control device for the master system control device. In an example depicted in
As explained above, each of the system control devices 22 and V1 to V4 performs monitoring of states of the hardware in the nodes and control of the hardware. The system control devices 22 and V1 to V4 include the registers rg (
When a failure of the hardware occurs, an interrupt signal is generated. An interrupt factor is stored in the registers rg (an arrow x1 depicted in the figure). When the system control devices 22 and V1 to V4 detect the occurrence of the failure of the hardware by monitoring the registers rg, the system control devices 22 and V1 to V4 notify the system control device V1 of the master node 2AB of the occurrence of the interrupt factor (arrows x2 and x3 depicted in the figure). When the system control device V1 of the master node 2AB receives the notification of the occurrence of the interrupt factor, the system control device V1 instructs the system control devices 22 and V2 to V4 of the slave nodes 1A to 1P and 2BB to 2DB to transmit log information including error information of the hardware (an arrow x4 depicted in the figure). The system control devices 22 and V2 to V4 of the slave nodes 1A to 1P and 2BB to 2DB collect log information in the nodes and transmit the log information to the system control device V1 of the master node 2AB according to the instruction received from the system control device V1 of the master node 2AB (an arrow x5 depicted in the figure). The log information acquired in the nodes is collected in the system control device V1 of the master node 2AB.
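The notification and log collection sequence (arrows x2 to x5) can be sketched roughly as follows. The class and method names are hypothetical, and the communication between the system control devices is reduced to plain method calls.

```python
from dataclasses import dataclass, field


@dataclass
class SlaveControlDevice:
    node_id: str
    error_log: list = field(default_factory=list)

    def collect_log(self) -> dict:
        # x5: gather the log information in the node and return it to the master.
        return {"node": self.node_id, "errors": list(self.error_log)}


@dataclass
class MasterControlDevice:
    slaves: list
    collected_logs: list = field(default_factory=list)

    def on_interrupt_notification(self, reporting_node: str) -> list:
        # x2/x3: a notification of an interrupt factor arrived from reporting_node.
        # x4: instruct every slave, not only the reporting one, to transmit its log.
        self.collected_logs = [slave.collect_log() for slave in self.slaves]
        return self.collected_logs
```

Note that the master asks all slave nodes for log information, which is why the amount of data to collect grows with the number of nodes.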
Subsequently, the system control device V1 of the master node 2AB performs analysis processing for the log information. For example, the system control device V1 specifies the fail node 1B and a failed component in the fail node 1B on the basis of the log information of the nodes. The system control device V1 performs reaction to the failure on the basis of the information specified by the analysis processing for the log information (arrows x6 and x7 depicted in the figure). The reaction is, for example, suppression of access to a region of the shared memory 3 of the fail node 1B by the applications operating in the nodes and stop control for the hardware of the fail node 1B.
As explained with reference to
[Analysis of the Log Information]
First, the analysis processing for the log information (S1) is explained. When a failure occurs, even if a continued operation of the nodes is possible, the information processing system 1 needs to specify content of the failure to perform preventive maintenance of the nodes. The information processing system 1 needs to specify an application specific integrated circuit (hereinafter referred to as ASIC) portion in which the failure occurs. For example, to specify the ASIC portion in which the failure occurs and determine possibility of a continued operation of the nodes, the system control device V1 of the master node 2AB performs the analysis processing for the log information. In this embodiment, the ASIC corresponds to, for example, the CPU, the memory access controller, and the I/O controller.
The system control device V1 of the master node 2AB performs a log analysis on the basis of the collected log information (S1). The log information is, for example, error factor information including error information during the interrupt factor occurrence and error log detailed information. The error log detailed information is, for example, history information of the ASIC and dump information. Since a data amount of the error log detailed information is enormous, the system control device V1 of the master node 2AB receives the error log detailed information in parallel with analysis steps S61 to S65 of the analysis processing for the log information (S1).
Analysis steps in step S1 are explained. First, the system control device V1 performs analysis processing for an error code on the basis of the error factor information (S61). Subsequently, the system control device V1 performs determination of presence or absence and specifying processing for a failed portion on the basis of the error factor information targeting each of the nodes and the hardware (S62 to S65). The system control device V1 performs determination of a failed portion in the respective kinds of hardware and determination processing for details of the failed portion on the basis of the error factor information targeting, for example, the CPU 12a, the crossbar switch 2, and the memories 3 and 11a. According to the processing in steps S62 to S65, a fail node and a circuit in which the failure occurs are specified. It is confirmed that the other circuits are normally operating. In an example depicted in
When the fail node and the failed portion are specified, the system control device V1 stands by for completion of the collection of the error log detailed information and performs registration processing for the error log detailed information (S66). Subsequently, the system control device V1 performs registration processing for a representative log indicating the log information corresponding to the failed portion on the basis of the error factor information (S67). The error log detailed information and the representative log are information needed for an analysis of a cause of the failure and restoration from the failure. When the analysis processing based on the log information is completed, the system control device V1 performs separation control for the fail node from the information processing system 1 according to the importance of the failure (S2). The separation control for the fail node indicates, for example, a power supply stop to the hardware of the fail node.
The system control device V1 of the master node 2AB executes the analysis processing for the log information (S1) irrespective of the importance of the failure. In the analysis processing for the log information, the system control device V1 performs the determination processing for the failed portion in detail targeting the respective kinds of hardware in the nodes. Therefore, the analysis processing takes time. Since a data amount of the error log detailed information is enormous, transfer processing for the error log detailed information takes time. Therefore, the analysis processing for the log information takes from several tens of seconds to several minutes (30 seconds to 5 minutes). That is, up to about 5 minutes can be consumed from the occurrence of the failure to the separation control for the fail node.
However, it is desirable that the information processing system 1 resumes the operation in a short time after the failure occurs. In resumption processing for the operation, a normal node takes over processing of the fail node. Therefore, the information processing system 1 needs to quickly specify the fail node. In the information processing system 1 in which the memory is shared among the plurality of nodes, it is likely that a secondary failure such as breakage, nonconformity, or the like of the shared memory 3 occurs because of the occurrence of the failure. In order to suppress the secondary failure of the shared memory 3, suppression of access to the memory of the fail node needs to be quickly performed. It is desirable to complete the suppression of access to the memory of the fail node, for example, in about one second after the occurrence of the failure.
Therefore, in this embodiment, before the analysis processing for the log information (S1), the system control device V1 of the master node 2AB performs fail node list (hereinafter referred to as FNL) analysis processing (S3) to specify a fail node and performs suppression of access to the memory of the fail node (S4).
The system control device V1 of the master node 2AB in this embodiment receives the interrupt factors of the register rg according to occurrence of an interrupt factor in the other nodes and extracts an interrupt factor to be detected as a failure among the interrupt factors. The system control device V1 of the master node 2AB specifies a fail node according to an extraction result and, after suppressing access to the memory of the fail node, performs separation control for the fail node on the basis of log information received from the other nodes.
Specifically, in the FNL analysis processing (S3), first, the system control device V1 of the master node 2AB acquires occurred interrupt factors from the nodes (S51). Subsequently, the system control device V1 extracts interrupt factors to be detected as the failure among the acquired interrupt factors (S52). The system control device V1 excludes, from targets of an FNL analysis, affected interrupt factors among the extracted interrupt factors (S53). That is, the system control device V1 excludes, from the targets of the FNL analysis processing, interrupt factors that occur because of other interrupt factors among the extracted interrupt factors.
Subsequently, when a plurality of interrupt factors are extracted, the system control device V1 determines priority levels of the interrupt factors (S54). The system control device V1 selects the interrupt factors in order from the interrupt factor having the highest priority level and specifies a fail node corresponding to the interrupt factors (S55). The system control device V1 performs processing for suppressing access to the memory of the fail node from the other nodes (S56). That is, the system control device V1 suppresses access to a region of the shared memory 3 of the fail node. Details of the steps are explained below. Subsequently, the system control device V1 executes the analysis processing for the log information (S1) and performs the separation control for the fail node from the information processing system 1 (S2).
As explained with reference to
After suppressing the access to the memory of the fail node, the system control device V1 performs the analysis processing for the log information (S1), specifies an ASIC portion in which the failure occurs, and determines possibility of a continuous operation of the node. The system control device V1 performs the separation control for the fail node from the information processing system 1 (S2) on the basis of a result of the analysis processing for the log information.
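The effect of this ordering on the time until memory access is suppressed can be illustrated with simple arithmetic. The 30-second to 5-minute range for the log analysis comes from the text; the durations used for the FNL analysis and for the suppression itself are assumed values chosen only for illustration, and the function name is hypothetical.

```python
def seconds_until_access_suppressed(log_analysis_s: float,
                                    fnl_analysis_s: float,
                                    suppress_s: float,
                                    fnl_first: bool) -> float:
    """Time from failure occurrence until access to the fail node's memory is suppressed."""
    if fnl_first:
        # S3 -> S4 -> S1 -> S2: suppression does not wait for the log analysis.
        return fnl_analysis_s + suppress_s
    # S1 first: suppression is a reaction that follows the log analysis.
    return log_analysis_s + suppress_s


# Assumed durations: FNL analysis 0.5 s, suppression 0.2 s (not from the text).
with_fnl = seconds_until_access_suppressed(300.0, 0.5, 0.2, fnl_first=True)
without_fnl = seconds_until_access_suppressed(300.0, 0.5, 0.2, fnl_first=False)
```

Under these assumed values the suppression completes in under one second with the FNL analysis performed first, whereas it would wait for the full log analysis otherwise.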
In
On the other hand, in
A software module diagram of the system control device V1 of the master node 2AB and the system control devices 22 and V2 to V4 of the slave nodes 1A to 1P and 2BB to 2DB in this embodiment is explained.
[Software Module Diagram]
In
The intra-hardware control unit 61 includes, for example, a hardware access program (hereinafter referred to as HAP) 65 for performing access processing for hardware such as a power supply, a processor (in
The FNL update request reception control unit 52 acquires the notifications of the occurrence of the interrupt factors from the units and outputs the notifications to the FNL control unit 51. The FNL control unit 51 notifies the FNL control unit 32 of the system control device V1 of the master node 2AB of the occurrence of the interrupt factors via the FNL driver 54. The FNL control unit 51 collects the occurred interrupt factors and transmits the interrupt factors to the system control device V1 of the master node 2AB via the FNL driver 54 in response to a collection request for the interrupt factors from the system control device V1 of the master node 2AB. The FNL updating unit 53 updates a fail node list (FNL; not depicted in
In
When the FNL control unit 32 of the FNL unit 30 receives the notification of the occurrence of the interrupt factor from the system control device 22 of the slave node 1A, the FNL control unit 32 instructs, via the FNL driver 35, the system control devices 22 of the slave nodes to collect the occurred interrupt factors. The FNL analyzing unit 31 specifies a fail node referring to the FNDB 36 on the basis of the interrupt factors collected from the system control devices 22 of the nodes. The FNDB 36 is a file having a definition of an analysis logic in an FNL analysis. The FNL analyzing unit 31 instructs, on the basis of information concerning the specified fail node, the FNL updating units 33 of the nodes to update the FNL.
A flow of processing until the FNL 40 explained with reference to
[Flow of Processing of the Software Module]
In an example depicted in
The intra-hardware control unit 41 in the system control device V1 receives the notification of the occurrence of the failure and instructs the system control devices 22 of the slave nodes to collect interrupt factors (an arrow g3 depicted in the figure). In response to the notification of the system control device V1 of the master node 2AB, the system control devices 22 of the slave nodes transmit occurred interrupt factors to the FNL unit 30 of the system control device V1 (an arrow g5 depicted in the figure) via the FNL unit 50 (an arrow g4 depicted in the figure).
As a result, the interrupt factors of the system control devices 22 of the slave nodes are collected. A data amount of the interrupt factors is small. Therefore, the system control device V1 can acquire the interrupt factors of the system control devices 22 of the slave nodes in a short time. When the system control devices 22 and the system control devices V1 to V4 are connected via high speed communication, the system control device V1 can further acquire interrupt factors of the system control devices 22 and the system control devices V2 to V4 at high speed.
When the FNL unit 30 in the system control device V1 of the master node 2AB collects interrupt factors in all the slave nodes, the FNL unit 30 instructs the FNL analyzing unit 31 to perform analysis processing (an arrow g6 depicted in the figure). The FNL analyzing unit 31 specifies a fail node on the basis of the collected interrupt factors and outputs the fail node to the FNL unit 30 (an arrow g7 depicted in the figure). Subsequently, the FNL unit 30 instructs, on the basis of information concerning the fail node, the FNL units 50 of the system control devices of the slave nodes 1A to update the FNL 40 (
The FNL analysis processing and the update processing for the FNL are performed on the basis of the flow of the processing depicted in the flowchart of
[FNL Analysis Processing and FNL Update Processing]
On the other hand, when a power supply failure does not occur (NO in S21), the FNL analyzing unit 31 acquires interrupt factors (S22). As explained above, the FNL analyzing unit 31 acquires interrupt factors from the nodes in response to the notification from the node in which the interrupt factor occurs. Subsequently, the FNL analyzing unit 31 extracts interrupt factors to be detected as a failure from the collected interrupt factors (S23). For example, the FNL analyzing unit 31 extracts interrupt factors corresponding to failures in which the nodes need to be stopped among the interrupt factors. That is, the FNL analyzing unit 31 does not extract interrupt factors in which the nodes can continue to operate.
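The extraction in step S23 can be sketched as a filter over the bits of the register rg. The bit positions, the `needs_stop` flag, and the factor names other than CK are assumptions made for this illustration; the actual register layout is not reproduced here.

```python
from typing import NamedTuple


class FactorDef(NamedTuple):
    name: str
    bit: int          # hypothetical bit position in the register rg
    needs_stop: bool  # True when the node must be stopped for this factor


# Hypothetical register map; only the name CK is taken from the text.
REGISTER_MAP = [
    FactorDef("CK", 0, True),            # clock control error
    FactorDef("PORT_FAIL", 1, True),     # hypothetical name
    FactorDef("CORRECTABLE", 2, False),  # hypothetical: operation can continue
]


def occurred_factors(register_value: int) -> list:
    """S22: decode the interrupt factors whose bits are set in the register."""
    return [d for d in REGISTER_MAP if register_value & (1 << d.bit)]


def extract_failures(register_value: int) -> list:
    """S23: keep only the factors to be detected as a failure."""
    return [d.name for d in occurred_factors(register_value) if d.needs_stop]
```

With this map, a register value of `0b101` contains CK and the continuable factor, and only CK is extracted.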
In this embodiment, the system control device V1 of the master node 2AB extracts, for example, among the interrupt factors illustrated in
The register map rm depicted in
Subsequently, the FNL analyzing unit 31 suppresses affected interrupt factors (S24). The interrupt factors are classified into affecting interrupt factors and affected interrupt factors induced on the basis of the affecting interrupt factors. The FNL analyzing unit 31 excludes the affected interrupt factors and narrows down the interrupt factors to only the affecting interrupt factors.
Specifically, for example, when a failure occurs in the processor of a certain node, the failure sometimes affects a connecting section of the crossbar switch in the same node and other portions of the processor. In this case, interrupt factors corresponding to a failure that occurs in the processor are equivalent to high-order interrupt factors. Interrupt factors corresponding to failures that occur in the connecting section of the crossbar switch and the other portions of the processor are equivalent to low-order interrupt factors. That is, the high-order interrupt factors correspond to the affecting interrupt factors and the low-order interrupt factors correspond to the affected interrupt factors.
Because the suppression processing for the affected interrupt factors (S24) excludes the affected interrupt factors from the targets of the FNL analysis, the FNL analyzing unit 31 specifies only nodes corresponding to the affecting interrupt factors as fail nodes. That is, the FNL analyzing unit 31 avoids specifying nodes corresponding to the affected interrupt factors as fail nodes and specifies only nodes in which failures actually occur as fail nodes.
Subsequently, the FNL analyzing unit 31 acquires priority levels of the respective extracted interrupt factors (S25). The FNL analyzing unit 31 specifies, in order from the interrupt factor having the highest priority level, a fail node corresponding to the interrupt factor and acquires control content for the fail node (S26). The FNL updating units 33 and 53 update the FNL 40 on the basis of the acquired control content and suppress access to a region of the fail node on the shared memory 3 from the other nodes (S27). Subsequently, the FNL analyzing unit 31 and the FNL updating units 33 and 53 perform processing in steps S26 and S27 targeting the interrupt factor having the next highest priority level. After performing the processing in steps S26 and S27 targeting all the extracted interrupt factors, the FNL analyzing unit 31 and the FNL updating units 33 and 53 end the analysis processing for the FNL and the update processing for the FNL.
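Steps S25 to S27 amount to a loop over the extracted interrupt factors in descending priority. In the sketch below, the FNL is modeled as a set of node identifiers to which access is suppressed, and a larger number is taken to mean a higher priority. Both are assumptions made for illustration, as are the function names, and the control content is reduced to access suppression.

```python
def update_fnl_by_priority(factors: list, fnl: set) -> list:
    """Process factors from highest to lowest priority and return the handling order."""
    handled = []
    # S25: order by priority level (assumption: larger value = higher priority).
    for factor in sorted(factors, key=lambda f: f["priority"], reverse=True):
        # S26: the node tied to the factor is specified as a fail node.
        # S27: the FNL is updated so that other nodes no longer access it.
        fnl.add(factor["node"])
        handled.append(factor["name"])
    return handled


def may_access(fnl: set, target_node: str) -> bool:
    """Other nodes may access shared-memory regions only of nodes not in the FNL."""
    return target_node not in fnl
```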
Details of the suppression processing for the affected interrupt factors (step S24) in the flowchart of
[Suppression of the Affected Interrupt Factors (Step S24 in
On the other hand, when the extracted interrupt factors are the low-order interrupt factors (YES in S12), the FNL analyzing unit 31 refers to the registers rg of the nodes and determines whether the affecting interrupt factors, that is, the high-order interrupt factors corresponding to the low-order interrupt factors, have occurred (S13). When the affecting interrupt factors have occurred (YES in S14), the FNL analyzing unit 31 suppresses the low-order interrupt factors and excludes the low-order interrupt factors from targets of the FNL analysis processing (S15).
On the other hand, when the affecting interrupt factors do not occur (NO in S14), the FNL analyzing unit 31 ends the suppression processing for the affected interrupt factors. That is, for example, even if occurred interrupt factors are the affected interrupt factors, when the affecting interrupt factors do not occur, the FNL analyzing unit 31 does not suppress the affected interrupt factors.
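The decision in steps S12 to S15 can be sketched as follows. The mapping `affecting_of` from a low-order factor to its high-order factor and the factor names are hypothetical, and the sketch ignores which node each factor occurred in.

```python
def suppress_affected(extracted: list, occurred: set, affecting_of: dict) -> list:
    """Return the extracted factors that remain targets of the FNL analysis."""
    kept = []
    for factor in extracted:
        parent = affecting_of.get(factor)  # None: the factor is high-order (NO in S12)
        if parent is not None and parent in occurred:
            # S13/S14: the affecting factor also occurred, so this factor is
            # treated as a side effect and excluded (S15).
            continue
        # Either high-order, or low-order without its affecting factor: keep it.
        kept.append(factor)
    return kept


# Hypothetical names: a processor failure inducing a crossbar connection error.
AFFECTING_OF = {"XB_CONNECT_ERR": "PROC_FAIL"}
```

A low-order factor is therefore dropped only when its affecting factor is actually present; when it occurs alone, it stays in the analysis.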
As explained above with reference to
A specific example of the FNDB 36 referred to by the FNL analyzing unit 31 in the FNL analyzing processing is explained. The FNDB 36 has a definition of an analysis logic in the FNL analysis.
[Specific Example of the FNDB]
According to
The entry suppression condition (ent_dis) indicates an interrupt factor corresponding to the interrupt factor and logically higher in order than the interrupt factor. That is, the entry suppression condition (ent_dis) indicates an affecting interrupt factor corresponding to the interrupt factor. When the entry suppression condition (ent_dis) is blank, this indicates that the affecting interrupt factor corresponding to the interrupt factor is absent.
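One way to picture a definition entry of the FNDB is as a small record. Only the field names `ent_dis` and `act` are taken from the text; the record type, the `factor` field, and the sample values are assumptions for illustration and do not reproduce the actual table.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class FndbEntry:
    factor: str   # interrupt factor this entry defines (hypothetical field name)
    ent_dis: str  # entry suppression condition; "" when no affecting factor exists
    act: int      # action number


def affecting_factor(entry: FndbEntry) -> Optional[str]:
    """Return the affecting interrupt factor, or None when ent_dis is blank."""
    return entry.ent_dis or None


# Hypothetical sample entries.
HIGH_ORDER = FndbEntry(factor="FACTOR_A", ent_dis="", act=0x01)
LOW_ORDER = FndbEntry(factor="FACTOR_B", ent_dis="FACTOR_A", act=0x02)
```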
The FNL analyzing unit 31 refers to a description cd1 of the definition table tb1 depicted in
For example, in an example explained below, an interrupt factor collected on the basis of the register rg is the interrupt factor CK. According to the register map rm depicted in
Subsequently, the FNL analyzing unit 31 refers to an item cd5 corresponding to the entry suppression condition (ent_dis) in the detected definition condition in the second row. In an example depicted in
On the other hand, the FNL analyzing unit 31 detects definition information in the first row corresponding to the interrupt factor FE. In the example depicted in
The information processing system is explained on the basis of a specific example. For example, an information processing system in which a plurality of nodes SB00 to SB03 are connected to a node XB00 including the crossbar switch 2 is illustrated. In the specific example, for example, a clock control error is generated in the node SB02 and a port failure occurs in the node XB00 including the crossbar switch 2.
When the system control device V1 of the master node 2AB detects any interrupt factor, the system control device V1 collects interrupt factors that occur in the nodes (S22). The system control device V1 acquires the interrupt factor CK corresponding to the clock control error that occurs in the node SB02 and an interrupt factor corresponding to the port failure that occurs in the node XB00. In the specific example, the interrupt factor CK and the interrupt factor corresponding to the port failure are extraction target interrupt factors (S23).
Subsequently, the FNL analyzing unit 31 refers to the FNDB 36 depicted in
Subsequently, the action number (act) of the definition table tb1 depicted in
The data definition block of the definition table tb2 depicted in
According to
Further, the entry GCSM_DEGRADE indicates that the node in which the interrupt factor is detected is specified as a fail node and control of memory access is performed with the fail node set as a function degradation target node. The entry GCSM_DEGRADE_DEST indicates that a node connected to the node in which the interrupt factor is detected is specified as a fail node and control of memory access is performed with the fail node set as a function degradation target node. The function degradation of the fail node indicates, for example, control for degrading the double lines of the crossbar switch depicted in
In the specific example, as explained above, the action number (act) of the interrupt factor CK is the value “0x01” and the action number (act) of the interrupt factor corresponding to the port failure is, for example, the value “0x12”. Therefore, the FNL analyzing unit 31 specifies the node SB02, in which the interrupt factor CK occurs, as a fail node. The FNL updating units 33 and 53 perform the control (FNL_UPDATE) of a memory access related to the node SB02. The FNL analyzing unit 31 specifies the nodes SB00 to SB03 connected to the node XB00 in which the interrupt factor corresponding to the port failure occurs, as fail nodes. The FNL updating units 33 and 53 perform the function degradation control (GCSM_DEGRADE_DEST) related to the nodes SB00 to SB03.
However, in the specific example, the interrupt factor CK takes precedence over the interrupt factor corresponding to the port failure. Therefore, first, the FNL updating units 33 and 53 perform the processing for suppressing access to the memory of the fail node corresponding to the interrupt factor CK (S26 and S27). For example, the FNL updating units 33 and 53 of the respective nodes update the FNL 40 and suppress access to the memory of the node SB02 from the other nodes.
Subsequently, the FNL updating units 33 and 53 perform the processing for suppressing access to the memories of the nodes SB00 to SB03 according to the interrupt factor corresponding to the port failure (S26 and S27). For example, the FNL updating units 33 and 53 of the respective nodes update the FNL 40 and degrade the double lines to a line on one side in the access processing from the node XB00 to the nodes SB00 to SB03. Since the lines are degraded, the number of access paths to the shared memory of the nodes SB00 to SB03 decreases.
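The ordering in this specific example can be sketched as follows. This is a minimal Python illustration under stated assumptions, not the implementation of the embodiment: the action numbers 0x01 and 0x12 and the node names come from the example above, while the numeric priority values, the factor name PORT, the `Factor` record, the `CONNECTED` map, and the `plan_suppression` function are hypothetical names introduced only for illustration.

```python
from dataclasses import dataclass

# Action numbers from the specific example; the pairing with action names is
# inferred from that example, and the real encoding of "act" may differ.
ACTIONS = {0x01: "FNL_UPDATE", 0x12: "GCSM_DEGRADE_DEST"}

# Assumed topology: crossbar node XB00 is connected to nodes SB00 to SB03.
CONNECTED = {"XB00": ["SB00", "SB01", "SB02", "SB03"]}


@dataclass(frozen=True)
class Factor:
    name: str      # interrupt factor name
    node: str      # node in which the factor was detected
    act: int       # action number from the definition table
    priority: int  # hypothetical: a smaller value is handled first


def plan_suppression(factors):
    """Return (action, fail_nodes) steps in priority order."""
    steps = []
    for f in sorted(factors, key=lambda f: f.priority):
        action = ACTIONS[f.act]
        # "_DEST" actions target the nodes connected to the detecting node.
        if action.endswith("_DEST"):
            fail_nodes = list(CONNECTED[f.node])
        else:
            fail_nodes = [f.node]
        steps.append((action, fail_nodes))
    return steps


ck = Factor("CK", "SB02", 0x01, priority=0)
port = Factor("PORT", "XB00", 0x12, priority=1)
steps = plan_suppression([port, ck])
```

With both factors present, the plan lists FNL_UPDATE for the node SB02 first and GCSM_DEGRADE_DEST for the nodes SB00 to SB03 second, regardless of the order in which the factors were received.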
As depicted in
As explained above, the information processing system in this embodiment includes a plurality of nodes and a shared memory connected to the plurality of nodes. Each of the nodes includes a plurality of functional circuits, a control device configured to control the functional circuits, and a register configured to store a plurality of interrupt factors that occur in the plurality of functional circuits. The control device in one node among the plurality of nodes receives the interrupt factors stored in the registers of a plurality of other nodes in response to an occurrence of an interrupt factor in one node among the plurality of other nodes, extracts an interrupt factor to be detected as a failure among the received interrupt factors, specifies a fail node according to an extraction result, and, after suppressing access to the shared memory by the fail node, performs control to separate the fail node from the information processing system on the basis of log information received from the plurality of other nodes.
The information processing system in this embodiment can specify a fail node at high speed on the basis of interrupt factors. The information processing system in this embodiment specifies a fail node targeting interrupt factors to be detected as a failure for which a stop of the nodes is needed among a plurality of interrupt factors. Therefore, the information processing system can more efficiently specify a fail node.
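The extraction step can be pictured with a short Python sketch. It assumes, purely for illustration, that the content of each node's register has already been decoded into a set of factor names; the factor names PORT and INFO_ONLY and the function `extract_failure_factors` are hypothetical, and only CK and the node names appear in the description.

```python
def extract_failure_factors(registers, failure_factors):
    """Keep only the factors that are to be detected as a failure.

    registers: node name -> set of factor names read from its register.
    failure_factors: set of factor names that require a node stop.
    Returns (node, factor) pairs in a deterministic order.
    """
    hits = []
    for node in sorted(registers):
        # Set intersection drops factors that are not failure targets.
        for factor in sorted(registers[node] & failure_factors):
            hits.append((node, factor))
    return hits


registers = {
    "SB00": set(),
    "SB02": {"CK", "INFO_ONLY"},
    "XB00": {"PORT"},
}
hits = extract_failure_factors(registers, {"CK", "PORT"})
```

Only the pairs for CK and PORT remain, so the later steps work on a short list of candidates instead of the full set of reported factors.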
Since the information processing system in this embodiment can specify a fail node at high speed, the information processing system can quickly suppress access to the memory of the fail node and can avoid a secondary failure of the shared memory. That is, the information processing system can quickly reduce the influence of the fail node on the other nodes when a failure occurs. For the same reason, the information processing system can reduce the overhead of switching operation from the fail node to a normal node when a failure occurs.
In the information processing system in this embodiment, the control device of the one node determines whether a second interrupt factor (the affecting interrupt factor), which is a spreading source of an interrupt factor to be detected as the failure, occurs. The control device specifies a node corresponding to the interrupt factor as the fail node when the second interrupt factor does not occur. The control device specifies a node corresponding to the second interrupt factor as the fail node, when the second interrupt factor occurs.
The information processing system in this embodiment specifies a node corresponding to the affecting interrupt factor as a fail node. Therefore, when a plurality of interrupt factors occur in conjunction with one another, the information processing system can specify a fail node corresponding to the affecting interrupt factor targeting only the affecting interrupt factor among the plurality of interrupt factors.
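The decision described above can be sketched in a few lines of Python. This is an illustrative sketch only: the factor names F_UPSTREAM and F_DOWNSTREAM, the dictionary layout, and the function `specify_fail_node` are assumptions and do not come from the definition tables of the embodiment.

```python
def specify_fail_node(factor, occurred, spreading_source):
    """Pick the fail node for one interrupt factor.

    occurred: factor name -> node in which that factor occurred.
    spreading_source: factor name -> name of its spreading-source factor.
    """
    source = spreading_source.get(factor)
    # If the spreading source also occurred, blame the node of the source.
    if source is not None and source in occurred:
        return occurred[source]
    # Otherwise the node of the factor itself is the fail node.
    return occurred[factor]


# Hypothetical table: F_DOWNSTREAM can be caused by F_UPSTREAM.
spreading_source = {"F_DOWNSTREAM": "F_UPSTREAM"}
both = {"F_DOWNSTREAM": "SB01", "F_UPSTREAM": "SB02"}
alone = {"F_DOWNSTREAM": "SB01"}

node_when_both = specify_fail_node("F_DOWNSTREAM", both, spreading_source)
node_when_alone = specify_fail_node("F_DOWNSTREAM", alone, spreading_source)
```

When both factors occur, the node SB02 of the spreading source is chosen; when only the downstream factor occurs, its own node SB01 is chosen.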
In the information processing system in this embodiment, when the control device of the one node detects a plurality of the interrupt factors to be detected as the failure, the control device suppresses access to the shared memory by the specified fail node on the basis of a priority level of the interrupt factor.
The information processing system in this embodiment controls the order of the processing for suppressing access to the memory of the fail node on the basis of the priority levels of the interrupt factors. Therefore, the information processing system can adjust, according to the interrupt factors, the order of the processing for suppressing access to the memory of the fail node. The information processing system can maintain its performance longer by assigning a low priority level to an interrupt factor for which the suppression range of access to the memory is wide.
In the information processing system in this embodiment, when the interrupt factor to be detected as the failure is an interrupt factor that occurs in the node that executes the data processing, the control device of the one node specifies the node where the interrupt factor has occurred as the fail node. When the interrupt factor to be detected as the failure is an interrupt factor that occurs in the node including the network connecting device, the control device specifies a node connected to the network connecting device as the fail node. Therefore, the information processing system in this embodiment can specify a fail node corresponding to the interrupt factor on the basis of the interrupt factor.
In the information processing system in this embodiment, one node includes a definition table including a correspondence relation between the interrupt factor and the interrupt factor which is the spreading source of the interrupt factor. The control device of the one node determines, on the basis of the definition table, whether the interrupt factor, which is the spreading source of the interrupt factor to be detected as the failure, occurs.
The information processing system in this embodiment includes the definition table including the correspondence relation between the interrupt factors and the interrupt factors affecting the interrupt factors. Therefore, the information processing system can determine, at high speed, whether an interrupt factor is an affecting interrupt factor. When interrupt factors are added or changed, the information processing system can easily accommodate the addition or the change by updating the definition table. Consequently, the information processing system can reduce maintenance man-hours during enhancement and a design change.
In the information processing system in this embodiment, one node includes a definition table including the priority level corresponding to the interrupt factor. The control device of the one node determines the priority level of the interrupt factor on the basis of the definition table.
The information processing system in this embodiment includes the definition table including the priority levels corresponding to the interrupt factors. Therefore, the information processing system can acquire a priority level of an interrupt factor at high speed. When interrupt factors are added or changed, the information processing system can easily accommodate the addition or the change by updating the definition table. Consequently, the information processing system can reduce maintenance man-hours during enhancement and a design change.
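The maintenance benefit of a table-driven design can be illustrated with a small Python sketch. The table layout, the priority values, the factor names PORT and NEW_FACTOR, and the function `handling_order` are hypothetical and are not taken from the definition tables of the embodiment.

```python
# Hypothetical definition table: one row per interrupt factor.
definition_table = {
    "CK": {"priority": 0},
    "PORT": {"priority": 1},
}


def handling_order(detected, table):
    """Sort detected factor names by the priority level in the table."""
    return sorted(detected, key=lambda name: table[name]["priority"])


before = handling_order(["PORT", "CK"], definition_table)

# Supporting a new factor only needs a new row; the lookup code is unchanged.
definition_table["NEW_FACTOR"] = {"priority": 2}
after = handling_order(["NEW_FACTOR", "PORT", "CK"], definition_table)
```

Because the ordering logic reads everything from the table, adding or changing a factor is a data update rather than a code change.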
The configuration of the distributed shared memory in which the nodes include the shared memory is explained as an example. However, this embodiment can also be applied to a cluster type configuration in which a shared memory is not provided in the nodes and is provided separately from the nodes.
All examples and conditional language provided herein are intended for the pedagogical purposes of aiding the reader in understanding the invention and the concepts contributed by the inventor to further the art, and are not to be construed as limitations to such specifically recited examples and conditions, nor does the organization of such examples in the specification relate to a showing of the superiority and inferiority of the invention. Although one or more embodiments of the present invention have been described in detail, it should be understood that the various changes, substitutions, and alterations could be made hereto without departing from the spirit and scope of the invention.
Number | Date | Country | Kind |
---|---|---|---|
2013-175250 | Aug 2013 | JP | national |