The present invention relates to an anomaly detection/diagnosis method, an anomaly detection/diagnosis system, an anomaly detection/diagnosis program and enterprise asset management infrastructure asset management system which are used for sensing and diagnosing an anomaly of a plant or a facility at an early time and relates to an enterprise/facility-asset management system.
Among other operations, a power company makes use of typically waste heat of a gas turbine in order to provide a region with hot water for heating the region and provide a plant with high-pressure or low-pressure vapor. A petroleum chemistry plant operates a gas turbine or the like to serve as a power-supply facility. In this way, a variety of plants and facilities each making use of a gas turbine or the like detect an anomaly thereof at an early time, diagnose a cause of the anomaly and take a countermeasure against the anomaly in order to suppress damage inflicted on the society to a minimum, which is of very much importance to the society.
The facilities used as described above are not limited to the gas turbine and a vapor turbine. That is to say, the facilities used as described above may also be a water wheel employed in a hydraulic power plant, a nuclear rector employed in a nuclear power plant, a wind mill employed in a wind power plant, an engine employed in an airplane or heavy equipment, a railway vehicle, railway tracks, an escalator, an elevator, medical equipment such as an MRI, a manufacturing and inspection apparatus for semiconductors and flat panel display units as well as other facilities of the equipment and part levels. There are many more facilities required for detecting an anomaly such as a deterioration of an embedded battery or the life of such a battery at an early time and diagnosing a cause of the anomaly. Recently, the detection of anomalies (that is, a variety of disease states) of a human body for the purpose of health preservation is also becoming more and more important. Such anomalies are detected by typically measuring and diagnosing brain waves.
Thus, documents such as U.S. Pat. No. 6,952,662 (patent document 1), U.S. Pat. No. 6,975,962 (patent document 2) and Stephan W. Wegerich, Nonparametric modeling of vibration signal features for equipment health monitoring, Aerospace Conference, 2003. Proceedings. 2003 IEEE, Volume 7, Issue, 2003 Pages: 3113-3121 (Non-patent Document 1) describe sensing of an anomaly generated mainly in an engine. In accordance with the documents, past data is stored in a database (DB). First of all, the degree of similarity between observed data and the past learned data is measured by adoption of an original method. Then, linear combination of data having high degrees of similarity is used to compute inferred values. Finally, the degree of discrepancy between the inferred values and the observed data is output. The U.S. Pat. No. 6,216,066 (Patent document 3) describes typical detection proposed by General Electric as detection based on k-means clustering to sense an anomaly.
In addition, non-patent document 2 and JP-2009-110066-A (patent document 4) describe a process of acquiring useful knowledge on maintenance. In accordance with the documents, a failure history and a work history are stored in a database which can be searched for such histories in order to acquire the knowledge.
In general, there is widely used a system for monitoring observed data and comparing the data with a threshold value set in advance in order to sense an anomaly. In this case, since the threshold value is set by paying attention to, among others, the measurement-object physical quantity of the observed data, the system can be said to be dependent on the design basis.
With this method, it is difficult to sense an anomaly not intended by the design so that such an anomaly may be overlooked. For example, the set threshold value can no longer be said to be proper due to, among others, the operating environment of the facility, a condition change caused by the lapse of operating years, an operating condition and an effect of a part replacement.
In accordance with the techniques based on anomaly knowledge as disclosed in patent documents 1 and 2, on the other hand, learned data is used as an object and linear combination of data having high degrees of similarity between observed data and the learned data is used to compute inferred values before the degree of discrepancy between the inferred values and the observed data is output. Thus, depending on the preparation of the learned data, it is possible to consider, among others, the operating environment of the facility, a condition change caused by the lapse of operating years, an operating condition and an effect of a part replacement.
In accordance with the techniques disclosed in patent documents 1 and 2, however, the data is handled as a snapshot and data changes with the lapse of time are not taken into consideration. In addition, it is necessary to separately explain why an anomaly is included in the observed data. In the detection of an anomaly in a feature space having a little physical meaning as is the case with the k-means clustering described in patent document 3, the explanation of an anomaly becomes even more difficult. If the explanation of an anomaly is difficult, the detection of the anomaly is treated as incorrect detection.
In addition, in accordance with the method described in patent document 4, there is constructed a system in which a failure history and a work history are stored in a database which can be searched for such histories in order to acquire useful knowledge on maintenance. In accordance with patent document 4, there is constructed a system for displaying maintenance medical charts. In this system, information on a failure history and a work history can be bonded to each other through a search operation so that the information can be presented in a visible form.
However, the bonding of the anomaly detection and the maintenance history information is not clear so that it is hard to say that the maintenance information stored in the system can be used effectively. With only a simple search function, even the bonding of the failure history and the work history themselves is not always successful. In such maintenance information, various kinds of information are generally dispersed and, in addition, there are many enumerations of ambiguous words so that the bonding is impossible unless a keyword serving as a keystone of the search operation is devised carefully. That is to say, in a method depending on only a search operation, from the detected anomaly including a predicted anomaly, it is impossible to clarify, among others, a portion of the past information to be inspected in order to determine the cause of the anomaly, the handling carried out in the past for the cause of the anomaly and what should be done this time for the cause of the anomaly. Thus, even if the cause of the anomaly is diagnosed immediately at the anomaly detection stage, the phenomenon, the cause of the anomaly, the part to be replaced and the like remain unclear so that it is impossible to determine what action should be done. As a result, in the reality of the condition, inspection carried out in the field by a skilled maintenance person is relied on.
It is thus an object of the present invention to present an anomaly detection/diagnosis method and an anomaly detection/diagnosis system which are capable of accurately diagnosing a newly generated anomaly (including a predicted anomaly) by making use of maintenance history information comprising past examples such as anomaly detection information and work-history/replaced-part information which take sensing data as an object.
In addition, it is another object of the present invention to present a diagnosis program which can be presented by a beginner.
On the top of that, it is a further object of the present invention to present an enterprise/facility-asset management system making use of the anomaly detection/diagnosis method and the anomaly detection/diagnosis system.
In order to achieve the objects described above, in accordance with the present invention, pieces of maintenance-history information comprising past examples as is the case with anomaly detection information and work-history/replaced-part information are associated with each other in advance by frequencies of appearances of keywords. Then, on the basis of anomaly detection taking signals output by a multi-dimensional sensor added to a facility as an object, the detected anomaly and the associated maintenance history information are combined with each other so that, at a point of time the predicted anomaly is detected, it is possible to provide relationships with countermeasures such as part replacements, adjustments and resumption. In this way, the diagnosis and the handling which are to be carried out for the generated anomaly can be clarified. In addition, work commands can be implemented.
In particular, to express a condition (referred to hereafter as a context) in which maintenance-history information is used, the frequency of appearance of a keyword is handled by being regarded as a context pattern. That is to say, including anomaly detection, from main keywords representing typically works related to maintenance, a context taking the actually used condition into consideration is acquired as a frequency pattern to be described later and a context-oriented anomaly diagnosis activating the context is expressed.
To put it concretely, in the anomaly detection, the following operations are carried out:
(1): (All but) normal learned data is generated.
(2): The anomaly measure of observed data is computed by adoption of a subspace method or the like.
(3): The anomaly is determined.
(4): The type of the anomaly is identified
(5): The time of generation of the anomaly is estimated and pieces of maintenance-history information are associated with each other
(6): A keyword of a set of documents describing maintenance-history information and the like is extracted
(7): Images are classified
(8): The keywords are associated
(9): A diagnosis model is generated to serve as a model expressing the association of the anomaly with the keyword as a frequency pattern
(10): The diagnosis model is used for classifying the anomaly detected in the plant or the facility or classifying a predicted anomaly so as to clarify a diagnosis and/or handling which are to be carried out
In addition, in order to achieve the objects described above, in accordance with an anomaly detection/diagnosis method provided by the present invention to serve as a method for detecting an anomaly generated or predicted at a plant or a facility at an early time and diagnosing the plant and the facility, an anomaly generated in the plant or the facility is detected by handling data acquired from a plurality of sensors as an object, a keyword is extracted from maintenance-history information of the plant or the facility, a diagnosis model of the plant or the facility is generated by making use of the extracted keyword and the anomaly detected or predicted at the plant or the facility is diagnosed by making use of the generated diagnosis model.
In addition, the maintenance-history information includes ones of on-call data, work reports, the codes of adjusted/replaced parts, image information and audio information. The frequency of appearance of a keyword determined from the maintenance-history information is computed in order to obtain a pattern of the appearance frequency. The obtained appearance frequency pattern is used as a diagnosis model. The similarity between the appearance frequency pattern and a keyword for an anomaly newly detected in a plant or a facility is used in order to carry out a diagnosis on the anomaly detected or predicted in the plant or the facility.
In addition, in order to achieve the objects described above, an anomaly detection/diagnosis system provided by the present invention to serve as a system for detecting an anomaly generated or predicted at a plant or a facility at an early time and diagnosing the plant and the facility is configured to comprise:
an anomaly detection section for detecting an anomaly of the plant or the facility by handling data obtained from a plurality of sensors as an object;
a database section used for storing maintenance-history information of the plant or the facility;
a diagnosis-model generation section for generating a diagnosis model of the plant or the facility by making use of a keyword extracted from the maintenance-history information stored in the database section as the maintenance-history information of the plant or the facility; and
a diagnosis section for carrying out a diagnosis on an anomaly newly detected or predicted in the plant or the facility by collating the detected or predicted anomaly with the diagnosis model.
In addition, the maintenance-history information stored in the database section includes ones of on-call data, work reports, the codes of adjusted/replaced parts, image information and audio information. The diagnosis-model generation section computes the frequency of appearance of a keyword determined from the maintenance-history information in order to obtain a pattern of the appearance frequency. The diagnosis-model generation section makes use of the appearance frequency pattern as a diagnosis model. The diagnosis section makes use of similarity of the appearance frequency pattern for a newly detected anomaly in order to carry out a diagnosis on the facility.
On the top of that, in order to achieve the objects described above, an anomaly detection/diagnosis program provided by the present invention to serve as a program for detecting an anomaly generated or predicted at a plant or a facility at an early time and diagnosing the anomaly is configured to comprise:
a processing step of detecting the anomaly by handling data obtained from a plurality of sensors as an object;
a processing step of generating a diagnosis model by making use of the frequency of appearance of a keyword acquired from maintenance-history information; and
a diagnosis processing step of carrying out a diagnosis on an anomaly detected or predicted in the plant or the facility by making use of the diagnosis model generated at the processing step of generating a diagnosis model.
As described above, at the processing step of detecting an anomaly, the anomaly is detected by handling data obtained from a plurality of sensors as an object. At the processing step of generating a diagnosis model, a diagnosis model is generated by making use of the frequency of appearance of a keyword acquired from maintenance-history information. At the diagnosis processing step, in a diagnosis carried out on the facility by making use of the generated diagnosis model, a pattern or a keyword is extracted through detection of anomaly and/or a diagnosis of a phenomenon. The extracted pattern or the extracted keyword is used in a diagnosis.
In addition, in order to achieve the objects described above, an enterprise/facility-asset management system according to the present invention is configured to comprise:
a database used for storing maintenance-history information including work reports and information on replaced parts;
detection means for detecting a generated anomaly or a predicted anomaly by making use of signal information obtained from a multi-dimensional sensor added to a facility and making use of identification means such as a subspace technique; and
diagnosis means for carrying out a diagnosis on the basis of a frequency pattern of a keyword paying attention to replacement parts, adjustments and the like.
In addition, the enterprise/facility-asset management system is configured to also implement detection of a predicted anomaly and a diagnosis taking the detection of a predicted anomaly as a trigger.
In accordance with the present invention, it is possible to arrange a lot of maintenance-history information existing in the field by making use of relations with anomalies. For a generated anomaly or a predicted anomaly, it is also possible to speedily determine handling of the anomaly at a standpoint of a necessary countermeasure, a necessary adjustment or the like. In addition, a proper instruction can be given to a person in charge of maintenance works. Since a condition in which the maintenance-history information is used can be accurately expressed as a context pattern and since it can be collated with, the stored maintenance-history information can be reused.
In accordance with them, early and accurate detection of an anomaly as well as a diagnosis and handling which have to be carried out become clear not only for facilities such as a gas turbine and a vapor turbine, but also for a water wheel employed in a hydraulic power plant, a nuclear rector employed in a nuclear power plant, a wind mill employed in a wind power plant, an engine employed in an airplane or heavy equipment, a railway vehicle, railway tracks, an escalator, an elevator and those at the facility and part levels. Anomalies detected at the facility and part levels include anomalies of a variety of facilities and parts. Examples of such anomalies are a deterioration of an embedded battery or the life of such a battery. It is needless to say that the present invention can also be applied to measurements and diagnoses of human bodies.
The present invention relates to an anomaly detection/diagnosis system for detecting an anomaly generated or predicted in a plant or a facility an early time. In a process of detecting an anomaly, all but normal learned data is generated and the anomaly measure of observed data is computed by adoption of the subspace method or the like. Then, an anomaly is determined and the type of the anomaly is identified. Subsequently, the time at which the anomaly has been generated is estimated.
In addition, in a process of associating pieces of maintenance-history information with each other, a keyword of a set of documents describing the maintenance-history information and the like is extracted and the keyword is associated with the anomaly through image classification or the like.
Then, a diagnosis model expressing the association of the keyword with the anomaly as a frequency pattern is generated. The diagnosis model is used for clarifying a diagnosis and handling which are to be carried out for the detected or predicted anomaly.
The following description explains an exemplary embodiment of the present invention by referring to diagrams.
The object handled by the anomaly prediction/diagnosis system 100 is the multi-dimensional time-series sensor signals 104 acquired by the multi-dimensional time-series signal acquisition section 103. The sensor signals 104 include signals representing a generator voltage, an exhausted-gas temperature, a cooling-water temperature, a cooling-water pressure and an operating-time length. The installation environment or the like is also monitored. The interval of timings to sample the sensors is a time period in a range of about several tens of ms to about several tens of seconds. That is to say, there is a variety of such intervals. The sensor signals 104 and the event data 105 include the operating states of the facilities 101 and 102, information on a failure and information on maintenance.
Arrows shown in
In this exemplary embodiment, the concept of a bag of words is adopted. The concept of a bag of words is a technique which should also be referred to as a bag of characteristics. In accordance with this concept, information (characteristics) is handled by ignoring the generation order of the information and its positional relations. In this technique, from alarm activation information, work reports, the codes of replacement parts and the like, the frequencies of generations of keywords, codes and words as well as a histogram are created. The distribution form of this histogram is regarded as a characteristic for classification into categories. This method is characterized in that, unlike the one-to-one search like the one described in non-patent document 2, a plurality of pieces of information can be handled at the same time. In addition, this method can also be used to handle free descriptions so that this method can also be used with ease to handle changes such as additions and deletions of information. On the top of that, this method is also effective for changing the format of a work report or the like. Even if a plurality of dispositions are carried out or even if an incorrect disposition is included, since attention is paid to the distribution form of the histogram, the robustness is high. In the same way, sensor signals are also classified into a plurality of categories. These categories are keywords.
Such an expression represents a condition in which maintenance has been carried out and is also referred to as a context. A context gives responses to questions including those described as follows:
In what condition was its information effective?
For what purpose was it used?
Why was it used?
What is attention paid to?
What are relations with other information?
The context is represented by the keyword appearance frequency pattern described above.
Concrete explanation referring to
The name of an alarm is information generated in remote monitoring of a facility. In
As shown in
In
It is to be noted that a keyword and a code book are given by the designer and a person in charge of maintenance, being stored in the maintenance-history information 401. However, weights may also be added to them by the importance. By making use of a mutual time relation between keywords as a relation showing a short or long period of time, a weight may be added or used as a selection reference.
Next, the following description explains a case in which an anomaly has been newly generated. In the phenomenon diagnosis 411, the type of an anomaly is determined by the sensor-signal point of view. For example, the name of the anomaly is determined to be a pressure decrease. In this case, in accordance with the diagnosis model described above, the probability of the replacement of a valve is 10%. Since this probability is known to be higher than other cases, in order to confirm that this valve is to be replaced, first of all, the diagnosis model is used in the field. It is needless to say that the sensor signals may also be analyzed in more detail in order to identify the failing member.
In this exemplary embodiment, the table 420 is further utilized. Normally, the phenomenon is complicated so that, even if the name of the anomaly is determined to be a pressure decrease, there are also conceivably many cases in which a part other than a valve is replaced. Thus, attention is paid to a frequency pattern representing a failure phenomenon 427. In the table 420 shown in
Thus, it is necessary to pay attention to the fact that, with regard to data to be observed and diagnosed, the diagnosis start time is a kind of pattern instead of a frequency. It is needless to say that, at the diagnosis start time, information can be used to serve as not only the contribution degree, but also the frequency of the contribution degree which is a time-axis summary.
Attention is paid to time-series variations of a residual vector shown in
As described above, if a diagnosis model is adopted, the diagnosis work can be carried out smoothly in the field so that the time it takes to carry out the diagnosis work can be shortened substantially. In addition, a candidate for a part to be replaced can be prepared in advance so that the recovery time of the facility can also be shortened considerably as well.
In the example described above, a frequency pattern is taken as the type of a failure phenomenon. However, any information other than a frequency pattern can be used as long as the information is usable. Examples of the usable information are a confirmed member, an adjusted member, information acquired from an on-call, a replacement part and an explained takeout anomaly cause. It is also a reason for which the bag-of-words method paying attention to the frequency can be used. In addition, when there are many items of the horizontal axis, the number of dimensions can also be said to be large. Thus, reducing the number of dimensions in advance is effective. The ordinary pattern recognition technique can also be said to be usable. Examples of the ordinary pattern recognition technique are an analysis of principal components, an analysis of independent components and selection of a feature quantity. It is also possible to adopt a normalization technique such as the whitening technique.
In the anomaly detection/analysis system shown in
This diagnosis model can also be adopted as educational information for young scholars. In addition, by adopting the diagnosis model as a base, it can be reflected in a maintenance work procedure.
In
The maintenance-history information 401 shown in
In addition,
It is to be noted that
In addition, in
The frequency pattern 730 comprising a variety of keyword types as described above can also be said to be a context representing, among others, the facility installation condition, the anomaly generation condition, the maintenance condition, the part replacement condition and past examples. A context, a placement condition and others are added to a keyword serving as a sole base for the conventional search operation. In a manner, such a search operation can be conceivably carried out. In other words, so far, it is written in the ‘if then’ form so that, in the search operation, the usage condition is not capable of achieving the target. As a result, there are many cases in which the diagnosis of the ‘then’ portion and its countermeasure are wasted in the end. However, such an ineffective keyword expression/usage condition can be expressed more flexibly by making use of a frequency pattern to provide a form in which the target can be conceivably achieved. Thus, in comparison with the diagnosis/countermeasure based on ‘if then’, it is possible to implement a diagnosis with a much higher degree of reliability.
A procedure of creating a diagnosis fault tree is explained as follows.
A phenomenon leading to the anomaly handling such as replacement of a part is taken as an object. Things to be clarified include anomaly phenomena and candidates for handling works required to recover the phenomena, descriptions of diagnosis works required to narrow down the candidates, information necessary for diagnoses, diagnosis criteria and information on work items to be carried out next in accordance with determination results.
Unexhausted diagnosis works, handling works and points to be corrected are listed up and used as supplementary information by making use of maintenance-history information and setting a hearing meeting with the service department.
A hearing meeting with the service department is set in order to classify information necessary for diagnoses into information that can be acquired automatically or information that can be acquired manually through manual operations.
A hearing meeting with the service department is set in order to record information on standard work times it takes to carry out anomaly diagnosis works and anomaly handling works.
Likewise,
In these diagnosis fault trees, if a signal to be checked at a branch point can be acquired automatically, the signal can be added to sensor data.
An important viewpoint in a diagnosis fault tree is to set up an optimum route. An optimum route is a route set up by a variety of cost viewpoints such as part costs and a work time. The optimum route does not necessarily show a first route only. Comparison with a second route may also be conceivably displayed. In addition, the work-end times of the first and second routes may also be presented. On the top of that, a virtual cost incurred in the case of an incorrect branch and a do-over route may also be presented. A virtual cost is a work cost caused by an end-time difference and a work cost incurred as a part spending for replacement of a part which does not naturally need to be replaced. They are carried out by, for example, referring to high-frequency work items shown in
In addition, a display screen may show all diagnosis fault trees or only portions surrounding a work of interest in a diagnosis fault tree.
For this diagnosis fault tree,
In a variety of phenomena, by classifying sensor data on the basis of past examples, the sensor data is viewed from the phenomenon point of view or the countermeasure point of view. Thus, in the diagnosis flows shown in
The clustering process 1116 is carried out to classify the sensor data into some categories by mode in accordance with an operating state and the like. In addition to the sensor data, event data (ON/OFF control of the facility, a variety of alarms, periodic inspection and adjustment of the facility and other data) 105 may be used. In addition, on the basis of their analysis results, learned data is selected and an analysis of the anomaly is carried out. As an input to the clustering process 1116, the event data 105 can also be classified into some categories for modes on the basis of the event data 105. It is an analyzer 1117 that analyzes and interprets the event data 105.
In addition, the identification section 1113 carries out identification making use of a plurality of identification means. The results of the identification are integrated by the integration section 1114 in order to implement the detection of the anomaly with higher robustness. The integration section 1114 outputs a message explaining the anomaly.
The characteristic extraction/selection/transformation section 912 reduces the number of dimensions of the multi-dimensional time-series signal received from the multi-dimensional time-series signal acquisition section 911. Then the multi-dimensional time-series signal is identified by a plurality of identification means 913-1, 913-2, - - - and 913-n which are employed in the identifier 913. The integration processing section 914 (global anomaly measure) determines the global anomaly measure. The learned data stored in the learned-data storage section 915 as data composed of mainly normal examples is also identified by the identification means 913-1, 913-2, - - - and 913-n and used in the determination of the global anomaly measure. In addition, the learned data stored in the learned-data storage section 915 as data composed of mainly normal examples itself is subjected to a selection process of taking or discarding the data. In this way, the learned data is stored in the learned-data storage section 915 and updated in order to improve the precision.
The observed data select 1232 is an instruction indicating which sensor signals are to be used. The anomaly determination threshold value 1233 is a threshold value for binary conversion of a value representing the degree of anomaly, which is computed and expressed as a variance/deviance from a model, a deviation value, an estrangement degree and an anomaly measure.
The identifier 913 shown in
In accordance with the projection distance method, first of all, an average mi of the learned data {xi} for each cluster and a variation matrix Σi are found by making use of the following equation:
In the above equation, symbol ni denotes the number of learned patterns pertaining to a cluster ωi.
Then, an eigenvalue problem of the variation matrix Σi is solved and, on the basis of a cumulative contribution ratio, a matrix Ui arranging eigenvectors corresponding to the r eigenvalues starting with the largest one is taken as a normal orthogonal base of an affine subspace of the cluster ωi. The minimum value of the projection distance to the affine subspace is defined as an anomaly measure of an unknown pattern x. In spite of 1 cluster classification making use of only normal learned data, the learned data itself includes different conditions such as the ON/OFF operating conditions. Thus, for the learned data, a subspace is generated with k-vicinity data close to observed data taken as one cluster. At that time, learned data whose distance from the observed data falls in a range determined in advance is selected (an RS method or a Range Search method). In addition, L (times t−t1 to t+t2, t1 and t2 are sampling consideration) pieces of learned data are also used to generate a subspace (time extension RS method). The L pieces of learned data are data which should correspond to variations of the transient time and leads ahead of or lags behind the selected data in the direction of the time axis. On the top of that, the projection distance is selected so that its value is smallest among those in a range from a smallest count to a selection count.
For 1 point of observed data, minimum learned data is selected. With only 1 point of observed data, however, whether or not the sensitivity is highest is not clear. Thus, also for the observed data, a subspace is generated. In the learned data, a subspace is generated from L×k sets (or smaller) of data selected by adoption of the time extension range search method. For the observed data, however, the length of the window segment is a degree of freedom and the selection is key to it. If the length of the window segment is increased, the variations of the data are caught. Due to handling independent of times, however, the amount of fear that a variation cannot be detected increases so that, furthermore, handling of the learned data can no longer be carried out.
On the basis of the dimension count n of the subspace in which learned data is stretched, a minimum window segment of the observed data is determined. The dimension count n is computed from the cumulative contribution ratio. Under a condition that the number of pieces of observed data is equal to the maximum (n+1), on the basis of the dimension count n, the window segment length M of the observed data is determined in an exploratory manner and the subspace is generated. Then, cos θ or its square is computed where θ denotes an angle formed by subspaces. A planning method is characterized in that, in accordance with this method, for time-series data, first of all, a minimum learning subspace is generated, then, from the similarity standpoint and the time-window standpoint, observed data is selected properly and, finally, similar subspaces are generated successively.
It is to be noted that, in the projection distance method, the center of gravity of classes is taken as an origin. An eigenvector obtained by applying the KL expansion to a covariance matrix of classes is used as a base. A variety of subspace methods have been proposed. If the method is a method having a distance scale, the degree of deviation can be computed. It is to be noted that, also in the case of the density, by making use of its quantity, the degree of deviation can be determined. In the projection distance method, the length of the orthogonal projection is found. Thus, the projection distance method makes use of a similarity measure.
As described above, in a subspace, a distance and a similarity degree are computed whereas the degree of deviation is evaluated. In the subspace method such as the projection distance method, due to identification means based on a distance, as a learning method for a case in which anomaly data can be used, it is possible to make use of metric learning for learning a distance function and vector quantization for updating a dictionary pattern.
In this method, for example, an orthographic point projected from the unknown pattern q (a most recent observed pattern) onto a subspace created by making use of the k multi-dimensional time-series signals can also be computed as an inferred value.
In addition, the k multi-dimensional time-series signals can also be rearranged into an order starting with the signal closest to the unknown pattern q (a most recent observed pattern) and multiplied by weights inversely proportional to the distances in order to compute inferred values of the signals. By adoption of the projection distance method, the inferred values of the signals can also be computed as well.
The parameter k is normally set at 1 value. If the processing is carried out by setting the parameter k at a value which can be changed to one of several other values, however, object data is selected in accordance with the degree of similarity. In this case, since comprehensive determination is made from their results, the method becomes more effective.
In addition, as shown in
What is described above can be applied to the projection distance method. To put it concretely, the procedure is described as follows.
1: Compute distances from the observed data to the learned data and rearrange the distances in an increasing order.
2: If the distance d<a threshold value th and the distance d is not greater than the parameter k, select the learned data.
3: Compute the projection distance for the range j=1 to k and output the minimum value.
The threshold value th used in the procedure described above is determined experimentally from the frequency distribution of the distance.
This notion is a concept referred to as a range search (RS) concept. This notion is thought to be applied to selection of learned data. The range search concept of learned-data selection can be applied also to the methods disclosed in patent documents 1 and 2. It is to be noted that, in the local subspace method, even if abnormal values are mixed a little bit, by setting the local-subspace, the effects are reduced substantially.
It is to be noted that, as shown in none of the figures, in identification referred to as an LAC (Local Average Classifier) method, the center of gravity for k neighborhood data is defined as a local subspace. Then, the distance from the unknown parameter q (a most recent observed pattern) to the center of gravity is computed and used as a variance (or a residual error).
A=1/N(ΣφφT) (2)
In
The example shown in
In the one-class support vector machine, the side close to the origin is a deflected value, that is, an anomaly. However, the support vector machine is capable of keeping up with even a high dimension of the feature quantity. Nevertheless, there is a demerit that if the learned-data count increases, the huge amount of computation is required.
In order to deal with the demerit, it is possible to apply typically a technique announced in the MIRU 2007 (which is a Meeting on Image Recognition and Understanding 2007). The document describing the technique is IS-2-10, “One-class Identification Means Based on Pattern Adjacency” authored by Takekazu Kato, Mami Noguchi, Toshikazu Wada (Wakayama University), Kaoru Sakai and Shunji Maeda (Hitachi). This announced technique offers a merit that, even if the learned-data count increases, the huge amount of computation is not required.
By expressing a multi-dimensional time-series signal by a low-dimensional model as described above, a complicated state can be decomposed and expressed by a simple model. Thus, there is provided a merit that the phenomenon is easy to understand. In addition, in order to set a model, it is not necessary to prepare data completely as is the case with the methods disclosed in patent documents 1 and 2.
The principal component analysis 1201 is referred to as a PCA for linearly transforming a multi-dimensional time-series signal having a dimension count M into an r-dimensional time-series signal having a dimension count r. The principal component analysis 1201 is also used for generating an axis with a maximum number of variations. KL transformation can also be carried out. The dimension count r is determined on the basis of a value serving as a cumulative contribution ratio obtained by dividing an eigenvalue by the sum of all eigenvalues. The divided eigenvalue is a value obtained by arranging eigenvalues computed by a principal component analysis in a descending order and summing up them by starting with a large one.
The independent component analysis 1202 has an effect of a technique referred to as an ICA (Independent Component Analysis) and used for actualizing a non-Gaussian distribution. The non-negative matrix factor decomposition is referred to as NMF (Non-negative Matrix Factorization). In NMF, sensor signals given in the form of a matrix are decomposed into non-negative elements.
The characteristic conversion method which is indicated on the column of the function 1220 as without a teacher is an effective transformation method in a case that an item is provided is an item with few anomaly examples and not possible to activate it. In this case, an example of the linear transformation is shown. Non-linear transformation can also be applied.
The characteristic transformation described above includes normalization for normalizing by making use of standard deviations and is implemented at the same time by arranging learned data and observed data. By doing so, learned data and observed data can be handled on the same level.
In
To put it concretely,
In order to predict an anomaly example, locus data of a deviation (residual error) time series up to the generation of the anomaly example is stored in a database in advance. Then, the degree of similarity between the deviation (residual error) time-series pattern of the observed data and the deviation (residual error) time-series pattern stored in the locus database as a pattern for locus data can be computed in order to detect predicted generation of an anomaly.
If such a locus is displayed to the user through a GUI (Graphical User Interface), the state of generation of an anomaly can be visually expressed and reflected with ease in a countermeasure or the like.
If only comprehensive residual errors are traced and development with the lapse of time is ignored, an anomaly phenomenon is difficult to understand. If the development of a residual error vector with the lapse of time is followed, however, the phenomenon can be picked up and understood. Theoretically, by carrying out processing to sum up vectors of each of several events forming a compound event, it is possible to detect prediction of generation of an anomaly for the compound event and the fact that a residual error vector accurately expresses an anomaly can be understood. If the loci of past anomaly examples such as the past anomaly examples A and B have been stored in a database as known information, an observed locus of an anomaly can be collated with the stored loci in order to identify (diagnose) the type of the anomaly.
In addition, if
Results of a diagnosis include a diagnosis model shown in
Separately from the hardware described above, a program to be installed in the hardware can be provided to the customer through a program recording medium or an online service.
A skilled engineer or the like is capable of making use of the DB 121. In particular, anomaly examples and countermeasure examples can be stored in the DB 121 as past experiences. To be more specific, the DB 121 can be used for storing (1) learned data (normal data), (2) anomaly data, (3) countermeasure descriptions and (4) fault-tree information. The DB 121 is structured so that a skilled engineer or the like is capable of manually modifying the data stored in the DB 121. Thus, a sophisticated and useful database can be provided. In addition, a data operation is carried out by automatically moving learned data (pieces of data and the position of the center of gravity) in accordance with generation of an alarm and/or replacement of a part. In addition, acquired data can be added automatically. If the data of an anomaly exists, a technique such as the generalization vector quantization can be applied to movements of the data.
In addition, the loci of the past anomaly examples A and B and the like explained earlier by referring to
In
As shown in
The anomaly analysis section 1540 is easy to understand if the reader thinks that the anomaly analysis section 1540 comprises a phenomenon analysis section 1541 and a cause analysis section 1542. The phenomenon analysis section 1541 is a section for carrying out a phenomenon analysis to identify a sensor including a predicted anomaly and for classifying anomalies from the countermeasure point of view and the adjustment point of view. On the other hand, the cause analysis section 1542 is a section for identifying a part which most likely causes a failure. The prediction detection section 1530 provides the anomaly analysis section 1540 with a signal indicating whether or not an anomaly exists and information on feature quantities. On the basis of the signal indicating whether or not an anomaly exists and the information on feature quantities, the phenomenon analysis section 1541 employed in the anomaly analysis section 1540 carries out a phenomenon analysis by making use of information stored in the DB 121. The phenomenon analysis section 1541 also classifies phenomena. In addition, the phenomenon analysis section 1541 also classifies sensor data from, among others, the adjustment point of view and the countermeasure point of view. That is to say, on the basis of the methods explained earlier by referring to
If such a relevant network is available, the signal connection, the signal co-occurrence and the signal correlation, which are not intended by the designer, could be clearly shown. Thus, such a relevant network is useful for an analysis of an anomaly. Such a network is generated at scales such as correlation, similarity, distance, cause-effect relationship and phase-lead/phase-lag in addition to the quantity of an effect on anomalies of sensor signals. Object-Facility Models and Network of Selected Sensor Signals
The design-information database 1708 is also used for storing information other than the design information. In the case of an engine, for example, the information stored in the design-information database 1708 includes a model year, a model, a table of parts (BOM), past maintenance information, information on operating conditions and inspection data obtained at the transport/installation time. The past maintenance information includes an on-call description, sensor-signal data obtained in the event of a generated anomaly, an adjustment date/time, taken-image data, abnormal-noise information and information on replaced parts to mention a few.
Description of Reference Numerals
Number | Date | Country | Kind |
---|---|---|---|
2010-289851 | Dec 2010 | JP | national |
Filing Document | Filing Date | Country | Kind | 371c Date |
---|---|---|---|---|
PCT/JP2011/076963 | 11/22/2011 | WO | 00 | 6/26/2013 |