This U.S. patent application claims priority under 35 U.S.C. § 119 to: India Application No. 201721029392, filed on 18 Aug. 2017. The entire contents of the aforementioned application are incorporated herein by reference.
This disclosure relates generally to health monitoring of systems, and more particularly to monitor health of a system and to perform fault signature identification.
Complex systems deployed in an industry environment need to be monitored to ensure proper working of the system. Such systems would include multiple sub-units of sensors and other components which perform data collection, data processing and so on, and the sub-systems may be communicating each other for data exchange.
It is possible that due to technical issues a sub-system may malfunction, and due to the malfunctioning of the sub-system, readings of corresponding sensors change. In such a connected system, as the throughput of each component/sub-system affects final output of the system, any such malfunction would adversely affect overall throughput of the system.
The inventors here have recognized several technical problems with such conventional systems, as explained below. One way of analyzing issues associated with such complex systems is by performing a manual analysis for verifying working of the system components. However, for systems with the large number of components and complex architecture/design, manual analysis would be a tedious task. Manual analysis further demands complex domain knowledge, and based on amount of knowledge a person has, accuracy of results of verification can also vary.
There are certain methods and systems being used for fault analysis. However, one disadvantage of these systems is that they have limited or no capability of performing a runtime analysis. Furthermore, most of these systems require manual intervention at different stages of the analysis.
Embodiments of the present disclosure present technological improvements as solutions to one or more of the above-mentioned technical problems recognized by the inventors in conventional systems. For example, in one embodiment, a processor-implemented method for health monitoring and fault signature identification is provided. In this method, for a system being monitored, a Health Index (HI) is generated via one or more hardware processors, by the health monitoring and fault signature identification. By monitoring, the health monitoring and fault signature identification system identifies abnormal behavior of the system, if any, via the one or more hardware processors, wherein an estimated low HI is identified as indicative of the abnormal behavior. The health monitoring and fault signature identification system further detects at least one component of the system as responsible for the abnormal behavior, based on a local Bayesian Network generated for the system.
In another embodiment, a health monitoring and fault signature identification system is provided. The system comprising a processor; and a memory module comprising a plurality of instructions. The plurality of instructions are configured to cause the processor to estimate Health Index (HI) of a system being monitored, via one or more hardware processors, by a HI estimation module of the health monitoring and fault detection system. Further, a low HI data selection module of the health monitoring and fault detection system identifies abnormal behavior of the system, if any, via the one or more hardware processors, wherein an estimated low HI is identified as indicative of the abnormal behavior. Upon identifying the abnormal behavior, a HI descriptor module of the health monitoring and fault detection system detects at least one component of the system as responsible for the abnormal behavior, based on a local Bayesian Network generated for the system, via the one or more hardware processors.
It is to be understood that both the foregoing general description and the following detailed description are exemplary and explanatory only and are not restrictive of the invention, as claimed.
The accompanying drawings, which are incorporated in and constitute a part of this disclosure, illustrate exemplary embodiments and, together with the description, serve to explain the disclosed principles.
Exemplary embodiments are described with reference to the accompanying drawings. In the figures, the left-most digit(s) of a reference number identifies the figure in which the reference number first appears. Wherever convenient, the same reference numbers are used throughout the drawings to refer to the same or like parts. While examples and features of disclosed principles are described herein, modifications, adaptations, and other implementations are possible without departing from the spirit and scope of the disclosed embodiments. It is intended that the following detailed description be considered as exemplary only, with the true scope and spirit being indicated by the following claims.
The HI estimation module 101 is configured to collect, using an appropriate input interface, data from one or more components of the system being monitored, as inputs. Here, the term ‘component’ is used to refer to any hardware module of the system being analyzed which contributes to HI of the system, and which can be monitored for fault signature identification purpose. For example, the component can be a mechanical component, an electrical component, and/or an electronic component, that performs one or more functionalities of the system. Using appropriate sensor(s), one or more data associated with one or more of these components are collected and analyzed by the HI estimation module 101 of the health monitoring and fault signature identification system 100, for the purpose of health monitoring and fault signature identification. The HI estimation module 101 further processes the collected inputs, and estimates HI of the system, wherein the HI of the system represents health status of the system. In an embodiment, the HI is estimated as a time-series data that represents health index of a system for different time intervals. In an embodiment, the HI estimation module 101 estimates the HI of the system, using a Recurring Neural Network (RNN). In this method, the HI estimation module 101 considers a multi-sensor data from the system being monitored, to be multivariate time series xi={xi(1), xi(2), . . . xi(l)} corresponding to ith instance of a machine, where ‘I’ is length of time series, and each point xi(t)ϵRm in the time series is an m-dimension vector with each dimension corresponding to a sensor. A model is trained, based on data from a healthy system, to predict or reconstruct the time series. The HI estimation module 101 assumes that error vectors corresponding to healthy behavior are to follow a normal distribution (μ, Σ), wherein the parameters μ and Σ can be obtained using Maximum Likelihood Estimation method over time series in a training set used. Based on μ and Σ, the HI is computed as:
where
and ‘d’ is dimension of error vector. The HI estimation module 101 can be further configured to classify machine instance ‘i’ as healthy or unhealthy class at time ‘t’ if hi(t)>r′, where r′ can be a value as configured with the HI estimation module 101.
In an embodiment, the HI estimation module 101 estimates the HI values for specific time intervals i.e. as a time-series data. The HI estimation module 101 further provides the estimated HI data (i.e. data corresponding to the estimated HI) for each time window as an input to the low HI data selection module 102. The low HI data selection module 102 processes the HI data that is in the form of time-series, and identifies data corresponds to low HI of the system if present. In an embodiment, the low HI data selection module 102 identifies low HI of the system in terms of presence of one or more low HI windows in the time-series data being analyzed, where the term ‘low HI window’ represents a time window in which majority of HI values are less than a threshold value of HI. In an embodiment, the low HI data selection module 102 identifies the threshold value based on HI values obtained in previous time windows. For instance, by analyzing the HI values obtained in all or certain number of previous time windows, can identify range of HI values in the previous time windows, and accordingly determine the threshold value. In another embodiment, the threshold value is configured by a user, as per requirements/implementation standards. In an embodiment, the presence of low HI window(s) is identified as corresponding to or representing an abnormal behavior of one or more components of the system, and of the system as a whole. The detailed working of the low HI selection module 102 is as follows:
By processing the obtained time series data, the low HI selection module 102 classifies time windows in the time series data as low HI window(s) and high HI window(s). One time window can have multiple HI values, wherein the number of HI data in a time window depends on the length of the time window. For example, a time window of length 20 seconds can have 20 HI values (one at each second). Based on HI values in a time window, the low HI data selection module 102 classifies the time window as a low HI window or a high HI window. A low HI window is the time window in which majority of HI values are below the threshold, and high HI window is the time window in which majority of HI values are above the threshold. Upon identifying that one or more of the windows in the obtained time series data are low HI windows, the low HI data selection module 102 invokes the HI descriptor module 103 so as to identify component(s) that is responsible for the abnormal behavior of the system, and provides required inputs to the HI descriptor module 103. In an embodiment, the HI descriptor module 103 is not to be invoked if the HI values indicate a normal functioning of the system.
The HI descriptor module 103, when invoked, identifies all component(s) responsible for the low HI of the system (which is represented by the low HI values in the estimated time series, and in turn by presence of low HI windows). Input to the HI descriptor module 103 is HI values estimated for multiple time windows. In an embodiment, at least two time windows are required for proper functioning of the HI descriptor module 103, wherein one of the two time windows; one time window ωA with a majority of low HI values (hi(t)≤τ) and the other time window ωN with a majority of high HI values (hi(t)≥τ). The data (HI values) from ωA and ωN are used to learn parameters that would constitute the local BN, and generate the local BN for the system being monitored and analyzed. The HI descriptor 103 uses Information in the BN for the purpose of mapping an estimated low HI window with one or more components (sensors) of the system. The HI descriptor module 103 then obtains an Explainability Index (EI) for each sensor of the system, based on the local BN and the data collected from the mapped system components, wherein the EI quantifies contribution of each sensor to the estimated low HI value. EI is also represented as E(Si) in the description. Based on the EI measured for each sensor, the HI descriptor module 103 identifies one or more sensors as carrying faulty signatures that result in the low HI window, and in turn, one or more associated components.
The processing module 104 can be configured to interact with all other components of the health monitoring and fault signature identification system 100, collect instruction and execute one or more steps with respect to function(s) being handled by each module using one or more associated hardware processors.
By processing the collected inputs, the health monitoring and fault signature identification system 100 estimates (202) a Health Index (HI) of the system, as a time-series data. The health monitoring and fault signature identification system 100 further identifies, by processing the time-series data, one or more low HI windows (if present) in which majority of the HI values are below a threshold value of HI, which in turn indicates low HI of the system being monitored. The health monitoring and fault signature identification system 100 identifies (204) the low HI as an indicative of abnormal behavior of the system. If an abnormal behavior of the system is detected, then the health monitoring and fault signature identification system 100, based on a local Bayesian Network (BN) generated for the system, detects (208) one or more components of the system as carrying the (faulty) signature(s) for the abnormal behavior of the system. Various actions in
The health monitoring and fault signature identification system 100, based on the data present in the BN, identifies (304) one or more system sensors as associated with the estimated low HI. Now, in order to identify specific component out of the one or more components identified based on the BN, as responsible for the low HI (and in turn the abnormal behavior of the system), the health monitoring and fault signature identification system 100 generates (306) an Explainability Index (EI), wherein the EI quantifies the effect of each sensor on the HI through the change in distribution of the readings a sensor takes over time between predicted high HI and low HI ranges, and in turn identifies one or more corresponding components of the system (i.e. the component with which the sensor that has been identified as contributing to the low HI is associated with) that contribute to the low HI and the abnormal behavior of the system. For example, if EI indicates that a particular sensor is responsible for carrying the faulty signature, then the corresponding component(s) is identified as contributing to the abnormal behavior of the system. Various actions in
In order to generate the EI for a sensor, the HI descriptor module 103 of the health monitoring and fault signature identification system 100 collects (402) information pertaining to low HI data and high HI data leamt for the time-series data being analyzed, as inputs. The HI descriptor module 103, based on the local BN, calculates (404) sensor distribution under the identified low HI condition, and for the identified high HI condition (406). Further, based on the sensor distribution, the HI descriptor module 103 computes (408) EI for each of the corresponding sensors. The process of computing the EI by the HI descriptor module 103 is explained below:
Consider a discrete random variable H corresponding to HI, and a set of m discrete random variables {S1, S2, . . . Sm} corresponding to ‘m’ sensors. A BN with m+1 nodes is used to model dependence between the sensors and HI. In an embodiment, for the purpose of modelling dependence between sensors of the system and HI, a joint distribution P(S1, S2, Sm, H) of a set of random variables X=(S1, S2, Sm, H). For a practical scenario practice in which dependence between each sensor and the health index HI is to be modelled, a naive Bayes model with H being the parent node and each Si being a child node can be assumed.
A random variable XiϵX is considered to have k possible outcomes [bi1, bi2, . . . bik] corresponding too k discretized bins for the range of values the variable can take. An m-dimensional vector of sensor readings X(t) and health index h(t) for every time instant ‘t’ in windows ωA and ωN yield one observation for the set of random variables X={S1, S2, . . . Sm, H}. A marginal probability distribution for Si is given as P (Si)=[{circumflex over (p)}i1, {circumflex over (p)}i2, . . . {circumflex over (p)}ik], where {circumflex over (p)}ij is probability of jth outcome of Si. For a given range of values of HI, conditional probability distribution for Si is given by P(Si|H)=[{circumflex over (p)}i1, {circumflex over (p)}i2, . . . {circumflex over (p)}ik]. A change in distribution of random variable Si conditioned on outcomes of H corresponding to high HI (P(Si|H>r)) and low HI (P(Si|H≤r)) is used to quantify the effect of ith component on HI. Considering P(Si|H>r) and P(Si|H≤r) as vectors in Rk, change is quantified in terms of EI as:
E(Si)=∥P(Si|H>r)−P(Si|H≤r)∥ (2)
where, higher the Explainability index of a sensor, higher is the effect of the sensor on the HI.
Consider a turbomachinery dataset containing readings from 58 sensors such as temperature, pressure, and vibration, recorded for 6 months of operation. These sensors capture behavior of different components such as bearing and coolant of the turbomachinery. The turbomachinery is controlled via an automated control system having multiple controls making the sensor readings change frequently, and hence, unpredictable. A Long Short Term Memory-Encoder Decoder (LSTM-ED) is used for HI estimation. A LSTM ED is trained to reconstruct all 58 sensors. Performance details of the HI estimation module 101 and the HI descriptor module 103 are provided on three types of faults, related to: i) abnormal temperature fluctuations in component C1 (Temp-C1), ii) abnormal temperature fluctuations in component C2 (Temp-C2), and iii) abnormal vibration readings.
Stage 1: HI Estimation by HI Estimation Module 101:
Table 1 shows the performance of HI estimation module 101 for classifying normal and faulty behavior. Most relevant sensor for Temp-C1 and Vibration faults are denoted as T1 and V1, respectively. A plot depicting sample time series for normal and faulty behavior for sensors T1, V1, Load, and HI (as in
Once HI values are available from LSTM-ED temporal model, a BN is built for the purpose of identifying sensors that carry faulty signature, and in turn the associated component(s) of the system. Examples of BNs built are given in
Stage 2: HI Descriptor Module Operation:
The HI descriptor module 103 uses the BN structure as in
This dataset contains readings from 12 sensors, recorded for 3 years of engine operation. The sensor readings in this dataset are quasi-predictable and depend on an external manual control, namely, Accelerator Pedal Position (APP). LSTM Anomaly Detection (LSTM-AD) based HI Estimation is used for this dataset. All sensors data are input to LSTM-AD such that m=12. Analysis is done for two of the sensors: APP and Coolant Temperature (CT) to get insights into the reasons for estimated low HI. The low HI regions found correspond to three instances of abnormal CT.
Stage 1: HI Estimation by HI Estimation Module 101:
Table 1 shows the performance of the HI estimation module 101.
Stage 2: Working of HI Descriptor Module 103 for Identifying Reason for Low HI:
Dependency between HI and sensors is modelled as in
The illustrated steps are set out to explain the exemplary embodiments shown, and it should be anticipated that ongoing technological development will change the manner in which particular functions are performed. These examples are presented herein for purposes of illustration, and not limitation. Further, the boundaries of the functional building blocks have been arbitrarily defined herein for the convenience of the description. Alternative boundaries can be defined so long as the specified functions and relationships thereof are appropriately performed. Alternatives (including equivalents, extensions, variations, deviations, etc., of those described herein) will be apparent to persons skilled in the relevant art(s) based on the teachings contained herein. Such alternatives fall within the scope and spirit of the disclosed embodiments. Also, the words “comprising,” “having,” “containing,” and “including,” and other similar forms are intended to be equivalent in meaning and be open ended in that an item or items following any one of these words is not meant to be an exhaustive listing of such item or items, or meant to be limited to only the listed item or items. It must also be noted that as used herein and in the appended claims, the singular forms “a,” “an,” and “the” include plural references unless the context clearly dictates otherwise.
Furthermore, one or more computer-readable storage media may be utilized in implementing embodiments consistent with the present disclosure. A computer-readable storage medium refers to any type of physical memory on which information or data readable by a processor may be stored. Thus, a computer-readable storage medium may store instructions for execution by one or more processors, including instructions for causing the processor(s) to perform steps or stages consistent with the embodiments described herein. The term “computer-readable medium” should be understood to include tangible items and exclude carrier waves and transient signals, i.e., be non-transitory. Examples include random access memory (RAM), read-only memory (ROM), volatile memory, nonvolatile memory, hard drives, CD ROMs, DVDs, flash drives, disks, and any other known physical storage media.
It is Intended that the disclosure and examples be considered as exemplary only, with a true scope and spirit of disclosed embodiments being indicated by the following claims.
Number | Date | Country | Kind |
---|---|---|---|
201721029392 | Aug 2017 | IN | national |
Number | Name | Date | Kind |
---|---|---|---|
5680409 | Qin et al. | Oct 1997 | A |
20030005486 | Ridolfo | Jan 2003 | A1 |
20120101653 | Tran | Apr 2012 | A1 |
20120215450 | Ashok et al. | Aug 2012 | A1 |
20160143596 | Bhattacharya | May 2016 | A1 |
Number | Date | Country |
---|---|---|
002997489 | Feb 2019 | CA |
Entry |
---|
Gregory W. Bartram (System Health Diagnosis and Prognosis Using Dynamic Bayesian Networks, Aug. 2013, 149 pages). (Year: 2013). |
Number | Date | Country | |
---|---|---|---|
20190057317 A1 | Feb 2019 | US |