This application is based upon and claims the benefit of priority of the prior Japanese Patent Application No. 2019-075035, filed on Apr. 10, 2019, and the prior Japanese Patent Application No. 2018-129242, filed on Jul. 6, 2018, the entire contents of which are incorporated herein by reference.
The embodiments discussed herein are related to an information processing apparatus and an information processing method.
In some cases, a bezel (also called a “front bezel” or a “dust-proof bezel”) may be provided in front of a casing of a server in order to prevent dust or dirt from entering the casing. The bezel has a filter-shaped surface, and also has a function of discharging heat within the casing by an air flow generated by a fan provided within the casing of the server.
When the bezel is clogged due to dirt, etc., there is a concern that a heat exhaust efficiency in the casing of the server may be lowered, and then the server may be broken.
Therefore, there has been known a technology of comparing an air volume measured by an air volume sensor provided near the bezel in the casing of the server, to a reference value, so that the clogging of the bezel is detected when the air volume falls below the reference value.
Related techniques are disclosed in, for example, Japanese Laid-open Patent Publication No. 2012-066190 and Japanese Laid-open Patent Publication No. 2013-201304.
However, when the air volume is measured, the CPU temperature of the server has to be kept constant. Also, since many factors that cause a change of the air volume may be taken into consideration, it is difficult to determine that the bezel is clogged simply because the measured air volume falls below the reference value.
According to an aspect of the embodiments, an information processing apparatus includes: a fan that cools a first processor; a dust-proof bezel that prevents foreign matter from entering a casing; a memory; and a second processor coupled to the memory and the second processor configured to: measure a temperature of the first processor and an air volume of an air flow which passes through the dust-proof bezel; compare a registered air volume to the measured air volume when the measured temperature matches a registered temperature included in comparison information stored in the memory, the registered air volume being included in the comparison information in association with the matched temperature, the comparison information including a registered temperature of the first processor and a registered air volume of an air flow generated by the fan in association with each other; and determine abnormality in the dust-proof bezel based on a result of the comparison.
The object and advantages of the invention will be realized and attained by means of the elements and combinations particularly pointed out in the claims. It is to be understood that both the foregoing general description and the following detailed description are exemplary and explanatory and are not restrictive of the invention, as claimed.
Hereinafter, embodiments will be described with reference to the accompanying drawings. Meanwhile, the embodiments described below are merely exemplary, and there is no intention to exclude the application of various modifications or techniques not specified in the embodiments. That is, the embodiments may be implemented through various modifications without departing from the scope of the gist thereof.
Each drawing is not intended to include only constitutional elements illustrated in the drawing, but may include other functions, etc.
Hereinafter, in the drawings, the same reference numerals indicate the same portions, and thus, the descriptions thereof will be omitted.
The information processing system 100 includes a monitoring target server 1, a management server/mobile terminal 2, and a management terminal 3.
The management server/mobile terminal 2 may be connected to the monitoring target server 1 via a local area network (LAN) by using a simple network management protocol (SNMP), an e-mail, etc. The management server/mobile terminal 2 may remotely monitor the monitoring target server 1. When a plurality of monitoring target servers 1 is provided in the information processing system 100, the management server/mobile terminal 2 may collectively monitor the plurality of monitoring target servers 1. In the embodiment, the management server/mobile terminal 2 receives a notification of clogging of a dust-proof bezel 16 detected in the monitoring target server 1 (described below with reference to
The management terminal 3 may be connected to the monitoring target server 1 via a LAN by using a hypertext transfer protocol (HTTP). The management terminal 3 is connected to the monitoring target server 1 in a one-to-one relationship, and is capable of monitoring various measurement results in the monitoring target server 1. In the embodiment, the management terminal 3 receives a notification of clogging of the dust-proof bezel 16 detected in the monitoring target server 1 (described below with reference to
The monitoring target server 1 includes a baseboard management controller (BMC) 11. The BMC 11 detects clogging in the dust-proof bezel 16 as described below with reference to
The monitoring target server 1 includes a system board 10, a plurality (six in the illustrated example) of cooling fans (FAN) 14, an air volume sensor 15, and the bezel 16 (also called a “dust-proof bezel” or a “front bezel”). In
The cooling fan 14 generates an air flow in order to cool the heat within the casing 1000.
The air volume sensor 15 measures an air volume of an air flow passing through the dust-proof bezel 16 among the air volumes in the air flows generated by the cooling fans 14.
The dust-proof bezel 16 has a filter-shaped surface, and is provided in front of the casing 1000 of the monitoring target server 1 in such a manner that the dust-proof bezel 16 is detachable from the casing 1000. When an abnormality such as clogging has occurred due to dust, dirt etc. in the dust-proof bezel 16, the dust-proof bezel 16 may be replaced by a new dust-proof bezel 16 by the user.
The system board 10 includes the BMC 11, a central processing unit (CPU) 12, and a chipset 13.
The chipset 13 is a combination of a plurality of integrated circuits that implements various functions in the monitoring target server 1. The chipset 13 may be connected to the BMC 11 by a low pin count (LPC) bus and may be connected to the CPU 12 by a direct media interface (DMI).
The CPU 12 is an example of a first processor. The CPU 12 is, for example, a processing device that performs various controls or calculations in the monitoring target server 1, and implements various functions by executing, for example, an Operating System (OS) or a program stored in a memory (not illustrated) within the chipset 13.
The BMC 11 is an example of a second processor. The BMC 11 monitors an air volume by the air volume sensor 15 provided near the dust-proof bezel 16 of the monitoring target server 1, and monitors the temperature of the CPU 12. The BMC 11 may be connected to the CPU 12, each of the cooling fans 14, and the air volume sensor 15 by an inter-integrated circuit I2C.
The BMC 11 functions as, for example, a control determination unit 111, a measurement unit 112, a storage unit 113, a notification unit 114, and an interface (I/F) processor 115.
The control determination unit 111 performs a comparison and a determination of information such as the temperature of the CPU 12 acquired by an intelligent platform management bus (IPMB) interface or an air volume acquired by the air volume sensor 15, and performs a control of the cooling fans 14.
That is, the control determination unit 111 is an example of a determination unit that compares a calibration value DB 101 (described below with reference to
The measurement unit 112, via the I2C interface, measures an air volume within the casing 1000 by using the air volume sensor 15, and measures the temperature of the CPU 12.
That is, the measurement unit 112 measures a relationship between the temperature of the CPU 12 and the air volume of an air flow which passes through the dust-proof bezel 16, as the measurement value DB 102.
As described below with reference to
That is, the storage unit 113 has a storage area that stores the calibration value DB 101 in which the temperature of the CPU 12 and an air volume of an air flow generated by the cooling fans 14 are registered in association with each other.
When the control determination unit 111 determines that an abnormality such as clogging has occurred in the dust-proof bezel 16, the notification unit 114 notifies the user about the occurrence of the abnormality via the I/F processor 115.
The I/F processor 115 controls communication between the BMC 11 and other devices.
The calibration value DB 101 is an example of comparison information in which the temperature of the CPU 12 and an air volume of an air flow generated by the cooling fans 14 are registered in association with each other. In the calibration value DB 101, a calibration number, a CPU temperature, an air volume, and an approximation processing flag are registered. As illustrated, all values in the calibration value DB 101 may be “0” at the initial state.
The calibration number is a number that identifies each combination of a CPU temperature and an air volume. In the calibration number, values of #1 to #N (N is a natural number of 2 or more) may be registered.
In the CPU temperature, a temperature that the CPU 12 may take is registered, and the unit may be, for example, ° C.
In the air volume, a predicted value of an air volume of the air flow generated by the cooling fans 14 is registered when a corresponding CPU temperature is measured, and the unit is, for example, cubic feet per minute (CFM).
A relationship between a CPU temperature and an air volume may be registered on the basis of an actually measured past value. Meanwhile, when an air volume corresponding to a CPU temperature that is not actually measured in the past is registered in the calibration value DB 101, “1” is set in an approximate value flag.
The measurement value DB 102 holds a relationship between the temperature of the CPU 12 and an air volume of an air flow which passes through the dust-proof bezel 16. In the measurement value DB 102, a measurement number, a CPU temperature, an air volume, and a determination result are registered. As illustrated, all values in the measurement value DB 102 may be “0” at the initial state.
The measurement number is a number that identifies each combination of a CPU temperature and an air volume. In the measurement number, values of #1 to #M (M is a natural number of 2 or more) may be registered.
In the CPU temperature, the temperature of the CPU 12 measured by the measurement unit 112 is registered, and the unit may be, for example, ° C.
In the air volume, an air volume measured using the air volume sensor 15 is registered when a corresponding CPU temperature is measured, and the unit is, for example, cubic feet per minute (CFM).
In the determination result, “OK” is registered when a combination of a CPU temperature and an air volume in the measurement value DB 102 matches a combination of a CPU temperature and an air volume in the calibration value DB 101. Meanwhile, in the determination result, “NG” is registered when a combination of a CPU temperature and an air volume in the measurement value DB 102 does not match a combination of a CPU temperature and an air volume in the calibration value DB 101.
When the temperature of the CPU 12 in the measurement value DB 102 does not match the temperature of the CPU 12 in the calibration value DB 101, the control determination unit 111 may calculate an approximate value of an air volume registered in the calibration value DB 101. Then, the control determination unit 111 may determine an abnormality in the dust-proof bezel 16 by comparing the calculated approximate value to a measured air volume.
The control determination unit 111 may cause the storage unit 113 to store the calculated approximate value as an air volume at the temperature of the CPU 12 in the measurement value DB 102, in addition to the calibration value DB 101.
The control determination unit 111 may determine that the dust-proof bezel 16 is abnormal when the number of times where the calibration value DB 101 and the measurement value DB 102 match each other is equal to or less than the number of times where the calibration value DB 101 and the measurement value DB 102 do not match each other.
The calibration parameter DB 103 holds a CPU temperature additionally registered in the calibration value DB 101. As illustrated, all values in the calibration parameter DB 103 may be “0” at the initial state.
An additional parameter number is a number that identifies an added CPU temperature. In the additional parameter number, values of #1 to #M (M is a natural number of 2 or more) may be registered.
In the CPU temperature, the temperature of the CPU 12 newly registered in the calibration value DB 101 is registered, and the unit may be, for example, ° C.
The user setting parameter DB 104 holds various values set by the user.
A temperature interval value indicates an interval of CPU temperatures registered in the calibration value DB 101.
The weighting of an actual measurement value is an example of a first weighting value, and is a value used when the temperature of the CPU 12 in the measurement value DB 102 matches the temperature of the CPU 12 in the calibration value DB 101.
The weighting of an approximate value is an example of a second weighting value, and is a value used when the temperature of the CPU 12 in the measurement value DB 102 does not match the temperature of the CPU 12 in the calibration value DB 101.
An interval time indicates an interval at which a clogging detection processing is performed at the dust-proof bezel 16.
The storage unit 113 has a storage area that stores the weighting of an actual measurement value for a case where the temperature of the CPU 12 in the measurement value DB 102 matches the temperature of the CPU 12 registered in the calibration value DB 101. The storage unit 113 has a storage area that stores the weighting of an approximate value for a case where the temperature of the CPU 12 in the measurement value DB 102 does not match the temperature of the CPU 12 registered in the calibration value DB 101. Then, the control determination unit 111 may determine that the dust-proof bezel 16 is abnormal when a value obtained by multiplying the number of times where the calibration value DB 101 and the measurement value DB 102 match each other by the weighting of the actual measurement value or the weighting of the approximate value, is equal to or less than a value obtained by multiplying the number of times where the calibration value DB 101 and the measurement value DB 102 do not match each other by the weighting of the actual measurement value or the weighting of the approximate value.
Descriptions will be made on a clogging detection processing of the dust-proof bezel 16 in the monitoring target server 1 illustrated in
As illustrated in
The CE 4 powers ON the system, in the casing 1000 of the monitoring target server 1 (step S2 in
As illustrated in each of
As illustrated in
As illustrated in
The control determination unit 111 sets various parameters in the user setting parameter DB 104 (step S6 in
In the example illustrated in
When the weighting of the actual measurement value is set to “5,” as the weighting of the approximate value, values of “1” to “4” may be set according to the granularity of the temperature interval value.
When the interval time is “7200 sec,” the clogging detection processing of the dust-proof bezel 16 is performed 12 times a day at an interval of once every two hours. For example, when it is determined that clogging has not occurred in the dust-proof bezel 16 in half or more of twelve clogging detection processings in a day, the clogging detection processing may not be executed for a predetermined time. Accordingly, it is possible to reduce a load in the monitoring target server 1.
The control determination unit 111 receives a DB setting completion notification from the storage unit 113 (step S7 in
The control determination unit 111 starts a calibration value setting processing (step S8 in
As illustrated in
In the example illustrated in
The control determination unit 111 inquires of the calibration value DB 101 in the storage unit 113 about a parameter value for each CPU temperature (step S10 in
The control determination unit 111 obtains the parameter value from the calibration value DB 101 in the storage unit 113 (step S11 in
The control determination unit 111 performs a control of the cooling fans 14 (step S12 in
The control determination unit 111 determines whether a variable k (k is a natural number of 0 or more and n or less) has reached the number n of calibration values registered in the calibration value DB 101 (step S13 in
When it is determined that the variable k has not reached the number n of calibration values (see “No” route in step S13 in
In the example illustrated in
Meanwhile, when it is determined that the variable k has reached the number n of calibration values (see “Yes” route in step S13 in
As illustrated in
The control determination unit 111 requests the measurement unit 112 to perform a measurement processing of a CPU temperature and an air volume (step S17 in
The measurement unit 112 measures a CPU temperature and an air volume (step S18 in
The measurement unit 112 determines whether the number of times of measurement j has reached the maximum value m of the number of times of measurement (step S19 in
When it is determined that the number of times of measurement j has reached the maximum value m of the number of times of measurement (see “Yes” route in step S19 in
Meanwhile, when it is determined that the number of times of measurement j has not reached the maximum value m of the number of times of measurement (see “No” route in step S19 in
The measurement unit 112 notifies the control determination unit 111 of measurement execution (step S21 in
When the notification of the measurement execution is received from the measurement unit 112, the control determination unit 111 inquires of the measurement value DB 102 in the storage unit 113 (step S22 in
The control determination unit 111 obtains the measurement value from the measurement value DB 102 in the storage unit 113 (step S23 in
As illustrated in
When it is determined that there is a matching CPU temperature (see “Yes” route in step S24 in
Meanwhile, when it is determined that there is no matching CPU temperature (see “No” route in step S24 in
The control determination unit 111 performs a comparison processing between the air volume in the calibration value DB 101 and the air volume in the measurement value DB 102 (step S26 in
In the example illustrated in
In the example illustrated in
In the example illustrated in
Details of the air volume comparison processing in step S26 in
The control determination unit 111 stores the determination result in the air volume comparison processing, in the measurement value DB 102 in the storage unit 113 (step S27 of
The measurement unit 112 notifies the control determination unit 111 of the end of measurement (step S28 in
When the notification of the end of measurement is received from the measurement unit 112, the control determination unit 111 inquires of the measurement value DB 102 in the storage unit 113 (step S29 in
The control determination unit 111 obtains the determination result from the measurement value DB 102 in the storage unit 113 (step S30 in
As illustrated in
When it is unnecessary to replace the dust-proof bezel 16 (see “No” route in step S31 in
Meanwhile, when it is necessary to replace the dust-proof bezel 16 (See “Yes” route in step S31 in
The notification unit 114 notifies the CE 4 of the deposition of dust or dirt in the dust-proof bezel 16 via the I/F processor 115. Then, the CE 4 receives the notification of the deposition of dust or dirt in the dust-proof bezel 16 (step S33 in
The CE 4 performs a power-OFF operation of the system, in the casing 1000 of the monitoring target server 1 (step S34 in
The CE 4 replaces the dust-proof bezel 16, in the casing 1000 of the monitoring target server 1 (step S35 in
The CE 4 performs a power-ON operation of the system, in the casing 1000 of the monitoring target server 1 (step S36 in
The control determination unit 111 initializes the calibration parameter DB 103 in the storage unit 113, and sets the actually measured value of the air volume corresponding to the CPU temperature for which the approximate value of the air volume is calculated, as a parameter value in the calibration value DB 101 (step S37 in
In the example illustrated in
Next, details of the air volume comparison processing illustrated in step S26 in
As illustrated in
When the air volume (A) in the calibration value DB 101 is equal to the air volume (B) in the measurement value DB 102 (see “(A)=(B)” route in
Meanwhile, when the air volume (A) in the calibration value DB 101 is less than the air volume (B) in the measurement value DB 102 (see “(A)<(B)” route in
As illustrated in
The control determination unit 111 compares the approximate air volume (A) to the air volume (B) in the measurement value DB 102 (step S267 in
When the approximate air volume (A) is equal to the air volume (B) in the measurement value DB 102 (see “(A)=(B)” route in
Meanwhile, when the approximate air volume (A) is less than the air volume (B) in the measurement value DB 102 (see “(A)<(B)” route in FIG. 15B), the processing proceeds to step S269 in
Next, details of the bezel replacement necessity determination illustrated in step S31 in
In the table indicated by the reference numeral C1, a determination number, a determination result, and a weighting score are associated with each other. For example, for the determination number “#1,” the determination result is “OK” and the weighting score is “5,” and for the determination number “#2,” the determination result is “NG” and the weighting score is “1.” As illustrated in the user setting parameter DB 104 in
As indicated by the reference numeral C2, the control determination unit 111 calculates total scores of OK determination and NG determination on the basis of the association between the determination result and the weighting score.
As indicated by the reference numeral C3, the total scores of the OK determination and the NG determination may be obtained by, for example, the following equations.
Total score of OK determination=weighting score α×the number of OKs of score α+weighting score β×the number of OKs of score β
Total score of NG determination=weighting score α×the number of NGs of score α+weighting score β×the number of NGs of score β
For example, the weighting score α may be the weighting of an actual measurement value, and the weighting score β may be the weighting of an approximate value. When there are three or more weighting scores, in the total score of the OK determination, a product of a third or subsequent weighting score and the number of OKs of the corresponding score may be further added. When there are three or more weighting scores, in the total score of the NG determination as well, a product of a third or subsequent weighting score and the number of NGs of the corresponding score may be further added.
When the table indicated by the reference numeral C1 is applied to the numerical formula indicated by the reference numeral C3, the total scores of the OK determination and the NG determination are calculated as in the following equations.
Total score of OK determination=5×2+1×1=11
Total score of NG determination=5×1+1×1=6
Then, as indicated by the reference numeral C4, the control determination unit 111 performs a final determination on the basis of the calculated total scores of the OK determination and the NG determination.
That is, the control determination unit 111 compares a total score (A) of the OK determination to a total score (B) of the NG determination (step S311).
When the total score (A) of the OK determination is larger than the total score (B) of the NG determination (see “(A)>(B)” route in step S311), the control determination unit 111 determines that it is unnecessary to replace the dust-proof bezel 16 (step S312). Then, the bezel replacement necessity determination is ended.
Meanwhile, when the total score (A) of the OK determination is equal to or less than the total score (B) of the NG determination (see “(A)≤(B)” route in step S311), the control determination unit 111 determines that it is necessary to replace the dust-proof bezel 16 (step S313). Then, the bezel replacement necessity determination is ended.
According to the monitoring target server 1 in an example of the first embodiment as described above, for example, the following operation effects may be achieved.
The storage unit 113 stores a predicted relationship between the temperature of the CPU 12 and an air volume of an air flow generated by the cooling fans 14, as the calibration value DB 101. The measurement unit 112 measures a relationship between the temperature of the CPU 12 and an air volume of an air flow which passes through the dust-proof bezel 16, as the measurement value DB 102. The control determination unit 111 determines an abnormality in the dust-proof bezel 16 by comparing the calibration value DB 101 to the measurement value DB 102.
Accordingly, it is possible to accurately determine the abnormality in the dust-proof bezel 16. Specifically, even when the temperature of the CPU 12 of the monitoring target server 1 is not kept constant (i.e., regardless of the state of the monitoring target server 1), it is possible to detect clogging due to deposition of dust or dirt in the dust-proof bezel 16.
The control determination unit 111 calculates an approximate value of the air volume in the calibration value DB 101 when the temperature of the CPU 12 in the measurement value DB 102 does not match the temperature of the CPU 12 in the calibration value DB 101. Then, the control determination unit 111 determines the abnormality in the dust-proof bezel 16 by comparing the calculated corresponding approximate value to the measured air volume.
Accordingly, even when the measured CPU temperature is not registered in the calibration value DB 101, it is possible to determine whether the measured air volume is proper.
The storage unit 113 additionally stores a relationship between the temperature of the CPU 12 in the measurement value DB 102 and the approximate value calculated by the control determination unit 111, in the calibration value DB 101.
Accordingly, it is possible to omit calculation of the approximate value of the air volume corresponding to the corresponding CPU temperature in the next measurement, and thus it is possible to improve the speed of the clogging detection processing of the dust-proof bezel 16.
The control determination unit 111 determines that the dust-proof bezel 16 is abnormal when the number of times where the calibration value DB 101 and the measurement value DB 102 match each other is equal to or less than the number of times where the calibration value DB 101 and the measurement value DB 102 do not match each other.
Accordingly, since the occurrence of clogging in the dust-proof bezel 16 is detected by comparing the air volumes to each other a plurality of times, it is possible to improve the detection accuracy.
The storage unit 113 stores the weighting of an actual measurement value for a case where the temperature of the CPU 12 in the measurement value DB 102 matches the temperature of the CPU 12 in the calibration value DB 101. The storage unit 113 stores the weighting of an approximate value for a case where the temperature of the CPU 12 in the measurement value DB 102 does not match the temperature of the CPU 12 in the calibration value DB 101. The control determination unit 111 determines that the dust-proof bezel 16 is abnormal when a value obtained by multiplying the number of times where the calibration value DB 101 and the measurement value DB 102 match each other by the weighting of the actual measurement value or the weighting of the approximate value, is equal to or less than a value obtained by multiplying the number of times where the calibration value DB 101 and the measurement value DB 102 do not match each other by the weighting of the actual measurement value or the weighting of the approximate value.
Accordingly, by applying different weighting values, to the determination result on the state of the dust-proof bezel 16 based on the actually measured value of the air volume and the determination result on the state of the dust-proof bezel 16 based on the approximate value of the air volume, respectively, it is possible to comprehensively determine the replacement necessity of the dust-proof bezel 16. Accordingly, it is possible to improve the detection accuracy of the occurrence of clogging in the dust-proof bezel 16.
In some cases, it is determined that dirt is not detected by a comprehensive determination through one cycle of scoring, and scoring of dirt detection and scoring of dirt non-detection are close to each other (i.e., a timing when a determination of dirt detection will be made soon). In an example of the above-described first embodiment, in such a case, even when there is a timing when determination of dirt detection may be made halfway, measurement processings have to be performed a predetermined number of times for one cycle, in the next cycle. Thus, there is a possibility that determination of dirt detection may not be timely made.
In an example of the above-described first embodiment, it is assumed that after a calibration processing is performed when the monitoring target server 1 is powered ON, the surrounding environment of the corresponding monitoring target server 1 is not changed. Therefore, after the calibration processing, when the surrounding environment of the monitoring target server 1 is changed against the user's intention, the reference for measuring dirt also changes. Thus, there is a possibility that dirt may not be properly detected.
When the technology in an example of the above-described embodiment is applied to an environment in which a plurality of monitoring target servers 1 is provided such as a rack configuration, in some cases, a system FAN provided in a server room may be controlled or the monitoring target servers 1 in a rack may be inserted and removed. In such a case, since the surrounding environment of the monitoring target servers 1 is changed, there is a possibility that dirt may not be properly detected.
At the time of dirt measurement, when another maintenance operation is performed in parallel to the dirt measurement, and when the timing of replacement of the dust-proof bezel 16 is different from the timing of another maintenance operation, there is a possibility that an efficient operation may not be performed throughout the maintenance operation.
In the information processing system 200, a plurality of monitoring target servers 1, each of which has the same configuration as that in an example of the above described first embodiment, is mounted in a server rack 2000. The BMC 11 of the monitoring target server 1 monitors an air volume by the air volume sensor 15 (described above with reference to
The BMC 11 functions as, for example, the control determination unit 111, the measurement unit 112, the storage unit 113, the notification unit 114, and the I/F processor 115, as described above with reference to
In an example of the second embodiment, the control determination unit 111 determines that the dust-proof bezel 16 is abnormal when a value obtained by multiplying the number of times of matching by comparisons by a first or second weighting value is equal to or less than a value obtained by multiplying the number of times of non-matching by comparisons by a first or second weighting value and a third weighting value. The first weighting value may be the weighting of an actual measurement value, the second weighting value may be the weighting of an approximate value, and the third weighting value may be the weighting of a maintenance operation factor. Details of necessity determination of replacement of the dust-proof bezel 16 using the first to third weighting values will be described below with reference to
The control determination unit 111 determines that the dust-proof bezel 16 is abnormal when in the comparison recently performed a predetermined number of times, the number of times of matching by the comparisons is equal to or less than the number of times of non-matching by the comparisons.
The control determination unit 111 detects a change of the surrounding environment when a change ratio of a difference between an air volume stored in the calibration value DB 101 (described below with reference to
The storage unit 113 stores the weighting (a third weighting value) of a maintenance operation factor for a case where a maintenance operation is performed on the monitoring target server 1, in a storage area. The storage unit 113 updates the calibration value DB 101 stored in the storage area when a change in the surrounding environment of the monitoring target server 1 is detected.
In the information processing system 200, for a management server/mobile terminal 2a, in addition to the configuration of the management server/mobile terminal 2 in an example of the above-described first embodiment, operation management software (SW) 21 is installed to monitor the monitoring target server 1. Hereinafter, the management server/mobile terminal 2a will be simply referred to as a management server 2a.
The monitoring target server 1 and the management server 2a may be connected to the management terminal 3 via a LAN switch 5.
An operation management SW 21 is downloaded to the management server 2a. The operation management SW 21 functions as a device information management unit 211, a monitoring unit 212, and an I/F processor 213.
The device information management unit 211 registers and manages management target device information (e.g., a model name, a serial number, or an Internet Protocol (IP) address) in the monitoring target server 1, etc.
The monitoring unit 212 monitors various events. The various events are, for example, SNMP traps notified from each device, notification of a status change of each device, and checking of an intake air temperature, a CPU usage rate, a power consumption, etc. which are acquired from each device.
The I/F processor 213 communicates with the monitoring target server 1 or the management terminal 3 by using a LAN.
The calibration value DB 101 is an example of comparison information in which the temperature of the CPU 12 and an air volume of an air flow generated by the cooling fans 14 are registered in association with each other. In the calibration value DB 101, as in an example of the above-described first embodiment, a calibration number, a CPU temperature D1(a), an air volume D2(a), and an approximation processing flag D3(a) are registered. As illustrated, all values in the calibration value DB 101 may be “0” at the initial state.
The measurement value DB 102a holds a relationship between the temperature of the CPU 12 and an air volume of an air flow which passes through the dust-proof bezel 16. In the measurement value DB 102a, in addition to a measurement number, a CPU temperature D1(b), an air volume D2(b), and a determination result D4(b) in the measurement value DB 102 in an example of the above-described first embodiment, a maintenance operation factor D3(b) is registered. As illustrated, all values in the measurement value DB 102a may be “0” at the initial state.
The maintenance operation factor D3(b) is set to “1” when an event requiring a maintenance operation, such as hardware failure of the monitoring target server 1, has occurred.
The calibration parameter DB 103 maintains a CPU temperature D1(c) which is additionally registered in the calibration value DB 101 as in an example of the above-described first embodiment. As illustrated, all values in the calibration parameter DB 103 may be “0” at the initial state.
The user setting parameter DB 104a holds various values set by the user. In the user setting parameter DB 104a, a temperature interval value D1(d), the weighting of an actual measurement value D2(d), the weighting of an approximate value D3(d), and an interval time D5(d) are registered as in the user setting parameter DB 104 in an example of the above-described first embodiment. In the user setting parameter DB 104a, in addition to the values of the user setting parameter DB 104 in an example of the above-described first embodiment, the weighting of a maintenance operation factor D4(d), a difference allowable range D6(d), and an allowable range D7(d) of the difference change ratio are registered. As illustrated, all values in the user setting parameter DB 104a may be “0” at the initial state.
The weight of the maintenance operation factor D4(d) is an example of a third weighting value, and is a value used when an event requiring a maintenance operation occurs, and a value of the maintenance operation factor D3(b) in the measurement value DB 102a is set to “1.”
The difference allowable range D6(d) indicates an allowable range of a difference between the air volume D2(a) in the calibration value DB 101 and the air volume D2(b) in the measurement value DB 102a.
The allowable range D7(d) of the difference change ratio indicates an allowable range of a change ratio of the difference between the air volume D2(a) in the calibration value DB 101 and the air volume D2(b) in the measurement value DB 102a, between a previous measurement time and a current measurement time.
The difference DB 105 indicates a difference between a calibration value and a measurement value. In the difference DB 105, a measurement number, a CPU temperature D1(e), a difference D2(e), and a difference change ratio D3(e) are registered. As illustrated, all values in the difference DB 105 may be “0” at the initial state.
The measurement number is a number that identifies each combination of a calibration value and a measurement value. In the measurement number, values of #1 to #M (M is a natural number of 2 or more) may be registered.
In the CPU temperature D1(e), the temperature of the CPU 12 registered in the measurement value DB 102a is registered, and the unit may be, for example, ° C.
In the difference D2(e), a ratio of a difference between the air volume D2(a) of the calibration value DB 101 and the air volume D2(b) of the measurement value DB 102a is registered, and the unit may be, for example, %.
In the difference change ratio D3(e), a change ratio of the difference between the air volume D2(a) of the calibration value DB 101 and the air volume D2(b) of the measurement value DB 102a, from a previous measurement time, is registered, and the unit may be, for example, %.
Descriptions will be made on a clogging detection processing of the dust-proof bezel 16 (see
As illustrated in
The CE 4 powers ON the system, in the casing 1000 of the monitoring target server 1 (step S42 in
Here, the CE 4 changes the surrounding environment of the monitoring target server 1 (step S43 in
Accordingly, the device information management unit 211 in the operation management SW 21 of the management server 2a updates server installation information within the server rack 2000 (step S44 in
The device information management unit 211 requests the monitoring target server 1 to perform a re-calibration (step S45 in
As illustrated in each of
For example, in the measurement value DB 102a, a setting may be made in such a manner that 12 measurement processings a day are possible.
As illustrated in
For example, in the calibration parameter DB 103, the CPU temperatures D1(c) of each monitoring target server 1 based on the temperature interval value D1(d) of the calibration may be set to, for example, 40.0° C., 40.5° C., . . . , 89.5° C., 90.0° C. via an IPMB interface function. The temperature interval width D1(d) may be set to an arbitrary value in measurable units of the BMC 11, and upper/lower limit values (e.g., room temperature to 90.0° C.) of the temperature may be uniquely determined in accordance with the specification of the mounted CPU 12 (described above with reference to
As illustrated in
For example, in the user setting parameter DB 104a, the temperature interval value D1(d) may be set to “0.5,” the weighting of the actual measurement value D2(d) may be set to “5,” the weighting of the approximate value D3(d) may set to “1,” the weighting of the maintenance operation factor D4(d) may be set to “4,” and the interval time may be set to “two hours.”
The control determination unit 111 calculates a difference between the air volume D2(a) in the calibration value DB 101 and the air volume D2(b) in the measurement value DB 102a, and stores the difference in the storage unit 113 (step S49 in
As illustrated in
The control determination unit 111 receives a DB setting completion notification from the storage unit 113 (step S51 in
The control determination unit 111 starts a calibration value setting processing (step S52 in
The control determination unit 111 sets a plurality of CPU temperatures D1(c) as parameters for FAN control, in the calibration parameter DB 103, in the storage unit 113 (step S53 in
As illustrated in
The control determination unit 111 obtains the parameter value from the calibration parameter DB 103 in the storage unit 113 (step S55 in
The control determination unit 111 performs a control of the cooling fans 14 (FAN control) (step S56 in
The control determination unit 111 determines whether a variable k (k is a natural number of 0 or more and n or less) has reached the number n of parameter values registered in the calibration parameter DB 103 (step S57 in
When it is determined that the variable k has not reached the number n of calibration values (see “No” route in step S57 in
On the basis of the CPU temperatures (e.g., 40.0° C., 40.5° C., . . . , 89.5° C., 90.0° C.) in the calibration parameter DB 103, a calibration at each CPU temperature D1(a) is performed via an IPBM interface of the BMC 11. Then, the air volume D2(a) at each CPU temperature D1(a) (e.g., at the CPU temperature of 60.0° C., an air volume value of 0.11 (m3/min)) is set in the calibration value DB 101.
Meanwhile, when it is determined that the variable k has reached the number n of calibration values (see “Yes” route in step S57 in
Here, as illustrated in
The monitoring unit 212 of the operation management SW 21 notifies the CE 4 about the occurrence of the event that requires a maintenance operation (step S61 in
Meanwhile, the storage unit 113 of the BMC 11 sets the maintenance operation factor D3(b) in the measurement value DB 102a on the basis of the notification by the notification unit 114 (step S62 in
The control determination unit 111 requests the measurement unit 112 to perform a measurement processing of the CPU temperature D1(b) and the air volume D2(b) (step S63 in
The measurement unit 112 measures a CPU temperature and an air volume (step S64 in
The measurement unit 112 stores the CPU temperature D1(b) in the measurement value DB 102a (step S65 in
The measurement unit 112 notifies the control determination unit 111 of the measurement execution (step S66 in
As illustrated in
The control determination unit 111 obtains the measurement value from the storage unit 113 (step S68 in
The control determination unit 111 determines whether a CPU temperature D1(a) which matches the CPU temperature D1(b) in the measurement value DB 102a exists in the calibration value DB 101 (step S69 in
When it is determined that there is a matching CPU temperature (see “Yes” route in step S69 in
Meanwhile, when it is determined that there is no matching CPU temperature (see “No” route in step S69 in
As illustrated in
When it is determined that the air volume D2(b) as the measurement value is larger than the air volume D2(a) as the calibration value (see “Yes” route in step S71 in
For example, an examination is made on a case where the measured CPU temperature is 60.0° C., and the measured air volume D2(b) is 0.17 m3/min, while the air volume D2(a) as the calibration value is 0.15 m3/min. In this case, the air volume D2(b) and the air volume D2(a) are compared to each other through a BMC-CPU function. Then, since the air volume D2(b) is larger than the air volume D2(a), it is determined that the surrounding environment has changed. Then, the dirt detection processing of the monitoring target server 1 is ended, and the re-calibration is performed.
Meanwhile, when it is determined that the air volume D2(b) as the measurement value is equal to or less than the air volume D2(a) as the calibration value (see “No” route in step S71 in
For example, an examination is made on a case where the measured CPU temperature is 60.0° C. and the measured air volume D2(b) is 0.15 m3/min, while the air volume D2(a) as the calibration value is 0.15 m3/min. In this case, the air volume D2(b) and the air volume D2(a) are compared to each other through the BMC-CPU function. Then, since the air volume D2(b) is equal to the air volume D2(a), it is determined that the surrounding environment has not changed. Then, the dirt detection processing of the monitoring target server 1 is continued.
For example, an examination is made on a case where the measured CPU temperature is 60.0° C. and the measured air volume D2(b) is 0.13 m3/min, while the air volume D2(a) as the calibration value is 0.15 m3/min. In this case, the air volume D2(b) and the air volume D2(a) are compared to each other through the BMC-CPU function. Then, since the air volume D2(b) is smaller than the air volume D2(a), it is determined that the surrounding environment has not changed. Then, the dirt detection processing of the monitoring target server 1 is continued and the difference change ratio D3(e) (e.g., 10%) between the previous measurement time and the current measurement time, in a difference between the measurement value and the calibration value of the air volume, is stored in the difference DB 105.
When the monitoring target server 1 within the server rack 2000 is inserted or removed, as illustrated in steps S41 to S43 in
The storage unit 113 stores the difference change ratio D3(e) in the difference DB 105 (step S73 in
The control determination unit 111 determines whether the difference change ratio D3(e) in the difference DB 105 is within the allowable range D7(d) of the difference change ratio in the user setting parameter DB 104a (step S74 in
When it is determined that the difference change ratio D3(e) is not within the allowable range D7(d) of the difference change ratio (see “No” route in step S74 in
For example, an examination is made on a case where a difference between the measurement value of a previously measured air volume and the calibration value is 10%, a difference between the measurement value of a currently measured air volume and the calibration value is 20%, and the allowable range D7(d) of the difference change ratio is 5%. In this case, since the change ratio (20−10=10%) between the previous measurement time and the current measurement time in the difference exceeds the allowable range D7(d) of the difference change ratio, that is, 5%, the dirt detection processing of the monitoring target server 1 is ended, and the re-calibration is performed.
Meanwhile, when the difference change ratio D3(e) is within the allowable range D7(d) of the difference change ratio (see “Yes” route in step S74 in
For example, an examination is made on a case where a difference between the measurement value of a previously measured air volume and the calibration value is 10%, a difference between the measurement value of a currently measured air volume and the calibration value is 20%, and the allowable range D7(d) of the difference change ratio is 20%. In this case, since the change ratio (20−10=10%) between the previous measurement time and the current measurement time in the difference is equal to or less than the allowable range D7(d) of the difference change ratio, that is, 20%, the dirt detection processing of the monitoring target server 1 is continued.
For example, an examination is made on a case where a difference between the measurement value of a previously measured air volume and the calibration value is 10%, a difference between the measurement value of a currently measured air volume and the calibration value is 20%, and the allowable range D7(d) of the difference change ratio is 10%. In this case, since the change ratio (20−10=10%) between the previous measurement time and the current measurement time in the difference is equal to or less than the allowable range D7(d) of the difference change ratio, that is, 10%, the dirt detection processing of the monitoring target server 1 is continued.
As illustrated in
The measurement unit 112 stores the determination result D4(b) in the measurement value DB 102a (step S76 in
The control determination unit 111 determines whether the current number of times of measurement j is less than a predetermined number of times of measurement m (step S77 in
When it is determined that the current number of times of measurement j is less than the predetermined number of times of measurement m (see “Yes” route in step S77 in
Meanwhile, when it is determined that the current number of times of measurement j is equal to or larger than the predetermined number of times of measurement m (see “No” route in step S77 in
The control determination unit 111 obtains the latest m determination results D4(b), in the measurement value DB 102a (step S79 in
The control determination unit 111 integrates the determination results D4(b), and comprehensively determines whether it is necessary to replace the dust-proof bezel 16 (described above with reference to
When it is determined that it is unnecessary to replace the dust-proof bezel 16 (see “No” route in step S80 in
Meanwhile, when it is determined that it is necessary to replace the dust-proof bezel 16 (see “Yes” route in step S80 in
The monitoring unit 212 of the operation management SW 21 receives the dirt deposition notification (step S82 in
As illustrated in
The CE 4 replaces the dust-proof bezel 16, in the casing 1000 of the monitoring target server 1 (step S84 in
The CE 4 performs a maintenance operation of the monitoring target server 1 besides the replacement of the dust-proof bezel 16 (step S85 in
The CE 4 performs a power-ON operation of the system, in the casing 1000 of the monitoring target server 1 (step S86 in
The control determination unit 111 initializes the calibration parameter DB 103 in the storage unit 113, and sets the actually measured value of the air volume D2(a) corresponding to the CPU temperature D1(a) for which the approximate value of the air volume is calculated, as a parameter value in the calibration value DB 101 (step S87 in
Next, details of the air volume comparison processing illustrated in step S75 in
The control determination unit 111 determines whether the difference D2(e) between the air volume of the calibration and the air volume as the measurement value is within the allowable range D6(d) (step S751).
When it is determined that the difference D2(e) is within the allowable range D6(d) (see “Yes” route in step S751), the control determination unit 111 registers “OK” determination in the determination result of the measurement value DB 102a, in the storage unit 113 (step S752). Then, the air volume comparison processing is ended.
For example, an examination is made on a case where the measured air volume D2(b) is 0.14 m3/min, while the air volume D2(a) as the calibration value is 0.15 m3/min and the difference allowable range D6(d) is 10%. In this case, since the difference D2(e) (=(0.15-0.14)/0.15=6.7%) between the actually measured value and the calibration value is equal to or less than the difference allowable range D6(d), that is, 10%, it is determined that dirt is not deposited on the dust-proof bezel 16.
Meanwhile, when it is determined that the difference D2(e) is not within the allowable range D6(d) (see “No” route in step S751), the control determination unit 111 registers “NG” determination in the determination result of the measurement value DB 102a, in the storage unit 113 (step S753). Then, the air volume comparison processing is ended.
For example, an examination is made on a case where the measured air volume D2(b) is 0.14 m3/min, while the air volume D2(a) as the calibration value is 0.15 m3/min and the difference allowable range D6(d) is 5%. In this case, since the difference D2(e) (=(0.15-0.14)/0.15=6.7%) between the actually measured value and the calibration value exceeds the difference allowable range D6(d), that is, 5%, it is determined that dirt is deposited on the dust-proof bezel 16.
Next, details of the bezel replacement necessity determination illustrated in step S80 in
In the table indicated by the reference numeral D1, a determination number, a determination result, the weighting of a maintenance operation factor, and a weighting score are associated with each other. For example, for the determination number “#1,” the determination result is “OK” and the weighting score is “5,” for the determination number “#2,” the determination result is “NG” and the weighting score is “1,” and for the determination number “#6,” the determination result is “NG” and the weighting of the maintenance operation factor is “2.” The weighting score “5” may be the weighting of an actual measurement value, and the weighting score “1” may be the weighting of an approximate value.
As indicated by the reference numeral D2, the control determination unit 111 calculates total scores of OK determination and NG determination on the basis of the association among the determination result, the weighting of the maintenance operation factor, and the weighting score.
As indicated by the reference numeral D3, the total scores of the OK determination and the NG determination may be obtained by, for example, the following equations.
Total score of OK determination=weighting score α×the number of OKs of score α+weighting score β×the number of OKs of score β+ . . .
Total score of NG determination=weighting score α×the number of NGs of score α+weighting score β×the number of NGs of score β+ . . . +weighting score γ×the number of NGs of score γ×maintenance operation factor weighting
For example, the weighting scores α and γ may be the weightings of actual measurement value, and the weighting score β may be the weighting of an approximate value.
When the table indicated by the reference numeral D1 is applied to the numerical formula indicated by the reference numeral D3, the total scores of the OK determination and the NG determination are calculated as in the following equations.
Total score of OK determination=5×2+1×1=11
Total score of NG determination=5×1+1×1+2×5×1=16
Then, as indicated by the reference numeral D4, the control determination unit 111 performs a final determination on the basis of the calculated total scores of the OK determination and the NG determination.
That is, the control determination unit 111 compares a total score (A) of the OK determination to a total score (B) of the NG determination (step S801).
When the total score (A) of the OK determination is larger than the total score (B) of the NG determination (see “(A)>(B)” route in step S801), the control determination unit 111 determines that it is unnecessary to replace the dust-proof bezel 16 (step S802). Then, the bezel replacement necessity determination is ended.
Meanwhile, when the total score (A) of the OK determination is equal to or less than the total score (B) of the NG determination (see “(A)≤(B)” route in step S801), the control determination unit 111 determines that it is necessary to replace the dust-proof bezel 16 (step S803). Then, the bezel replacement necessity determination is ended.
When half or more of determination results in a predetermined number of measurements (e.g., 12 times a day) continue with the same determinations, the bezel replacement necessity may be determined on the basis of the continuous determination results.
According to the monitoring target server 1 in an example of the second embodiment as described above, for example, the following operation effects may be achieved.
The storage unit 113 stores the weighting of a maintenance operation factor (i.e., a third weighting value) for a case where a maintenance operation is performed on the monitoring target server 1, in a storage area. The control determination unit 111 determines that the dust-proof bezel 16 is abnormal when a value obtained by multiplying the number of times of matching by comparisons by a first or second weighting value is equal to or less than a value obtained by multiplying the number of times of non-matching by comparisons by a first or second weighting value and a third weighting value. The first weighting value may be the weighting of an actual measurement value, and the second weighting value may be the weighting of an approximate value.
Accordingly, when the replacement of the dust-proof bezel 16 is predicted in a short period of time due to occurrence of slight clogging, etc. in the dust-proof bezel 16, the replacement of the dust-proof bezel 16 may be performed concurrently with the maintenance operation at the timing of the maintenance operation of the monitoring target server 1. Thus, it is possible to improve the work efficiency.
The control determination unit 111 determines that the dust-proof bezel 16 is abnormal when in the comparison recently performed a predetermined number of times, the number of times of matching by the comparisons is equal to or less than the number of times of non-matching by the comparisons.
Accordingly, it is possible to determine the replacement necessity of the dust-proof bezel 16 at every timing when measurement and comparison of air volumes are performed. Thus, it is possible to detect the abnormality of the dust-proof bezel 16 early.
The storage unit 113 updates the calibration value DB 101 stored in the storage area when a change in the surrounding environment of the monitoring target server 1 is detected.
Accordingly, it is possible to automatically perform a re-calibration when the surrounding environment of the monitoring target server 1 is changed. Thus, it is possible to prevent an erroneous determination in the replacement necessity determination of the dust-proof bezel 16.
The control determination unit 111 detects a change in the surrounding environment when a change ratio of a difference between the air volume stored in the calibration value DB 101 and the air volume measured by the measurement unit 112 exceeds an allowable range.
Accordingly, even when the management server 2a is not capable of detecting a change in the surrounding environment of the monitoring target server 1, the monitoring target server 1 is capable of autonomously detecting a change in the surrounding environment.
The disclosed technology is not limited to the above-described embodiments, and may be implemented through various modifications without departing from the scope of the gist of the embodiments. Configurations and processings in the embodiments may be selected as needed, or may be properly combined.
In examples of the above-described embodiments, although a clogging in the dust-proof bezel 16 provided in the monitoring target server 1 is detected, the present disclosure is not limited thereto. The clogging detection processing of the dust-proof bezel 16 as described above may be applied to various devices to which the dust-proof bezel 16 is provided.
In examples of the above-described embodiments, although the clogging detection processing of the dust-proof bezel 16 is performed in the BMC 11, the present disclosure is not limited thereto. For example, each processing other than the temperature measurement of the CPU 12 may be implemented by functions in the CPU 12.
All examples and conditional language recited herein are intended for pedagogical purposes to aid the reader in understanding the invention and the concepts contributed by the inventor to furthering the art, and are to be construed as being without limitation to such specifically recited examples and conditions, nor does the organization of such examples in the specification relate to an illustrating of the superiority and inferiority of the invention. Although the embodiments of the present invention have been described in detail, it should be understood that the various changes, substitutions, and alterations could be made hereto without departing from the spirit and scope of the invention.
Number | Date | Country | Kind |
---|---|---|---|
JP2018-129242 | Jul 2018 | JP | national |
JP2019-075035 | Apr 2019 | JP | national |
Number | Name | Date | Kind |
---|---|---|---|
5513344 | Nakamura | Apr 1996 | A |
5673029 | Behl | Sep 1997 | A |
6337630 | Hass | Jan 2002 | B1 |
7643290 | Narasimhan | Jan 2010 | B1 |
8395898 | Chamseddine | Mar 2013 | B1 |
9128672 | Loo | Sep 2015 | B1 |
20030159802 | Steneby | Aug 2003 | A1 |
20040075981 | Kim | Apr 2004 | A1 |
20050244263 | Hardt | Nov 2005 | A1 |
20050273349 | Abedi | Dec 2005 | A1 |
20060070527 | Chapman | Apr 2006 | A1 |
20060172188 | Okuda | Aug 2006 | A1 |
20060253633 | Brundridge | Nov 2006 | A1 |
20090287456 | Tran | Nov 2009 | A1 |
20100287880 | Yasunaga | Nov 2010 | A1 |
20100296945 | Nitta | Nov 2010 | A1 |
20110163545 | Hirai | Jul 2011 | A1 |
20110245981 | Refai-Ahmed | Oct 2011 | A1 |
20120224976 | Nagamatsu | Sep 2012 | A1 |
20130085001 | Anderson | Apr 2013 | A1 |
20130128455 | Koblenz | May 2013 | A1 |
20130139402 | Hong | Jun 2013 | A1 |
20140016268 | Tsujimura | Jan 2014 | A1 |
20140064321 | Yokoyama | Mar 2014 | A1 |
20140273792 | Kondo | Sep 2014 | A1 |
20150327406 | Gallefoss | Nov 2015 | A1 |
20150351288 | Fukuda | Dec 2015 | A1 |
20160085789 | Fuchs | Mar 2016 | A1 |
20160094898 | Primm | Mar 2016 | A1 |
20160253851 | Pandey | Sep 2016 | A1 |
Number | Date | Country |
---|---|---|
2007-311458 | Nov 2007 | JP |
2012-66190 | Apr 2012 | JP |
2013-201304 | Oct 2013 | JP |
Entry |
---|
Japanese Office Action dated Oct. 25, 2022 for corresponding Japanese Patent Application No. 2019-075035, with English Translation, 8 pages. ***Please note JP-2013-201304-A cited herewith, was previously cited in an IDS filed on Jul. 2, 2019.***. |
Number | Date | Country | |
---|---|---|---|
20200012325 A1 | Jan 2020 | US |