The subject matter disclosed herein generally relates to analyzing a fault log of a machine. More specifically, the subject matter relate to method and system for a diagnosis, and repair of the machine based on data associated with the operation of the machine.
Case Based Reasoning (CBR), is a technique of problem solving based on rules and behaviors learnt from experiential knowledge (memory of past experiences or cases). CBR focuses on indexing, retrieval, reuse, and archival of cases. CBR is used generally for diagnosis and repair of systems related to healthcare, transportation, and other infrastructure related systems.
CBR has been employed in equipment monitoring and remote diagnostics, call center automation, and in productivity tools. Quality management initiatives involving obtaining measurement data, analyzing the data, making improvements based on the data, and maintaining the improvement by continuously collecting data suits adoption of CBR techniques.
In accordance with one aspect of the present technique, a method is disclosed. The method includes obtaining sensory data from a machine and obtaining a plurality of measured structural features based on the sensory data. The method also includes obtaining a plurality of reference cases corresponding to the sensory data, from a database. The plurality of reference cases include a plurality of reference structural features and a plurality of fault identifiers. The method further includes computing a statistical parameter based on the plurality of reference cases and obtaining a first subset of reference structural features from the plurality of reference structural features based on the computed statistical parameter. The method also includes computing a plurality of similarity values based on the obtained first subset of reference structural features and the plurality of measured structural features. The method further includes identifying at least one fault identifier among the plurality of fault identifiers, based on the computed plurality of similarity values.
In accordance with another aspect of the present technique, a system is disclosed. The system includes a data acquisition module communicatively coupled to a sensing unit of a machine. The data acquisition module is configured to obtain a sensory data including a plurality of measured structural features from the sensing unit. The system further includes a training module communicatively coupled to the data acquisition module, the training module including a database having a plurality of reference cases corresponding to the sensory data. The plurality of reference cases include a plurality of reference structural features and a plurality of fault identifiers. The system also includes an optimizer module communicatively coupled to the database. The optimizer module is configured to obtain a first subset of reference structural features from the plurality of reference structural features. The system further includes an execution module communicatively coupled to the data acquisition module and the optimizer module. The execution module is configured to identify at least one fault identifier among the plurality of fault identifiers, based on the plurality of measured structural features and the first subset of reference structural features.
In accordance with another aspect of the present technique, a non-transitory computer readable medium encoded with a program to instruct at least one processor based device to diagnose machine faults is disclosed. The program instructs the at least one processor based device to obtain sensory data from a machine and obtain a plurality of measured structural features based on the sensory data. The program also instructs the at least one processor based device to obtain a plurality of reference cases corresponding to the sensory data, from a database. The plurality of reference cases include a plurality of reference structural features and a plurality of fault identifiers. The program also instructs the at least one processor based device to compute a statistical parameter based on the plurality of reference cases and obtain a first subset of reference structural features from the plurality of reference structural features based on the computed statistical parameter. The program further instructs the at least one processor based device to compute a plurality of similarity values based on the obtained first subset of reference structural features and the plurality of measured structural features and identify at least one fault identifier among the plurality of fault identifiers, based on the computed plurality of similarity values.
These and other features and aspects of embodiments of the present invention will become better understood when the following detailed description is read with reference to the accompanying drawings in which like characters represent like parts throughout the drawings, wherein:
Embodiments of the present disclosure relate to a system and a method for performing at least one of a diagnosis of a condition of operation and a repair of a diagnosed condition of a malfunctioning machine based on measured data associated with the operation of the malfunctioning machine. Specifically, in certain embodiments, a plurality of measured structural features are obtained from sensory data of a machine. A plurality of reference cases corresponding to the sensory data are obtained from a database. The plurality of reference cases includes a plurality of reference structural features and a plurality of fault identifiers. A statistical parameter is computed based on the plurality of reference cases. A first subset of reference structural features from the plurality of reference structural features are obtained based on the computed statistical parameter A plurality of similarity values are computed based on the obtained first subset of reference structural features and the plurality of measured structural features. At least one fault identifier among the plurality of fault identifiers is identified based on the computed plurality of similarity values.
A data acquisition module 106 is communicatively coupled to the sensing unit 104. The data acquisition module 106 is configured to receive the sensory data from the sensing unit 104. The data acquisition module 106 may receive sensory data from the sensing unit 104 through a communication link such as a wired, a wireless, or an internet network. In one embodiment, the data acquisition module 106 may be a standalone customized hardware component. In another embodiment, the data acquisition module 106 may be stored in a memory and executable by a processor. The system 100 further includes a training module 108 communicatively coupled to the data acquisition module 106. In the illustrated embodiment, the training module 108 includes a database 112 and an optimizer module 114. The database 112 may be used to store a plurality of reference cases corresponding to the sensory data. The plurality of reference cases includes a plurality of reference structural features and a plurality of fault identifiers. In one embodiment, the database 112 may be an off-the-shelf database module integrated with the optimizer module 114. The term “reference case” refers to a previously labeled processed case stored in the database 112. The term ‘fault identifier’ refers to an operating condition of the machine 102 of the machine 102 associated with the reference case. In one embodiment, the training module 108 may be a standalone customized hardware component. In another embodiment, the training module 108 may be stored in a memory and executable by a processor. In an embodiment where the data acquisition module 106 is disposed on the machine 102, the training module 108 receives the sensory data through a communication link from the data acquisition module 106.
The optimizer module 114 is communicatively coupled to the database 112 and configured to obtain a first subset of reference structural features from the plurality of reference structural features. The details of obtaining the first subset of reference structural features are explained in greater detail with reference to subsequent figures. In one embodiment, the optimizer module 114 may be a customized hardware component. In another embodiment, the optimizer module 114 may be stored in a memory and executable by a processor. In an alternate embodiment, the optimizer module 114 may be a sub-module implemented either as hardware component or software component within the training module 108. In certain other embodiments, the optimizer module 114 may be integrated with the training module 108.
The system 100 also includes an execution module 110 communicatively coupled to the data acquisition module 106 and the optimizer module 114. The execution module 110 is configured to identify at least one fault identifier among the plurality of fault identifiers, based on the plurality of measured structural features and the first subset of reference structural features. In one embodiment, the execution module 110 may be a customized hardware component. In another embodiment, the execution module 110 may be stored in a memory and executable by a processor.
In one embodiment, at least one module of the data acquisition module 106, the training module 108, and the execution module 110 may be a customized hardware component designed to perform respective specified functionality. In an alternate embodiment, at least one module of the data acquisition module 106, the training module 108, and the execution module 110 may be a software component stored in at least one memory and executed by at least one processor-based unit. In an exemplary embodiment, some modules of the training module 108, the optimizer module 114, and the execution module 110 are executed by a first processor-based unit. In such an embodiment, the remaining modules of the training module 108, the optimizer module 114, and the execution module 110 are executed by a second processor-based unit communicatively coupled with the first processor-based unit. Data may be exchanged between the first processor-based unit and the second processor-based unit depending on the configuration of the system.
At least one processor based unit may include at least one arithmetic logic unit, microprocessor, general purpose controller or other processor arrays to perform computations, and a memory module. The processing capability of at least one processor-based unit, in one embodiment, may be limited to retrieval of data and transmission of data. The processing capability of at least one processor-based unit, in another embodiment, may include performing more complex tasks such as obtaining the measured structural features from the sensory data, obtaining reference structural features from the reference cases, and the like. In other embodiments, other type of processors, operating systems, and physical configurations are also envisioned. The processor-based unit may also include or be communicatively coupled to at least one memory module. The memory module may be a non-transitory storage medium. For example, the memory module may be a dynamic random access memory (DRAM) device, a static random access memory (SRAM) device, flash memory or other memory devices. In one embodiment, the memory module also includes a non-volatile memory or a storage device such as a hard disk drive, a floppy disk drive, a compact disc read only memory (CD-ROM) device, a digital versatile disc read only memory (DVD-ROM) device, a digital versatile disc random access memories (DVD-RAM) device, a digital versatile disc rewritable (DVD-RW) device, a flash memory device, or other non-volatile storage devices. In one embodiment, the non-transitory computer readable medium is encoded with a program to instruct at least one processor-based device to identify fault of the machine 102.
In the illustrated embodiment, the case 200 represented by the CIN 206, includes a plurality of structural features 214, 216, 218, 220 generated within the fixed duration 212. A variable duration 208 between the time instance 210 (representative of the start of the case 200) and the CIN 206, includes a plurality of data units 224. It should be noted herein that the duration 208 does not include any of the plurality of structural features. In the illustrated embodiment, two data units 224 spans over the variable duration 208 extending over two days. The data units 222 spanning over the fixed duration 212 are stored in a data base. The term “structural feature” referred herein refers to a fault condition of the machine. For example, the plurality of structural features 214, 216, 218, 220 may be representative of fault conditions of the machine. In an exemplary embodiment, the structural feature may refer to a sequence of faults. As an example, the faults 216, 218 as a sequence may be treated as one structural feature. In alternative embodiments, the term “structural feature” may include other structures such as an n-tuple or a graph, derived from a plurality of fault conditions.
It should be noted herein that the machine data 204 may also be referred to as “sensory data”. A case including the sensory data may be referred to as a “measured case”. A plurality of structural features in the measured case may be referred to herein as “measured structural features”. The machine data processed, labeled, and stored in a database may be referred to herein as “reference data”. A case including the reference data may be referred to herein as a “reference case”. A plurality of structural features in the reference case may be referred to herein as “reference structural features”. The measured case and the reference case have a same data format as represented by the schematic diagram of
The plurality of nuisance structural features 416 are obtained based on the second subset 508 of reference structural features corresponding to the plurality of obtained nuisance cases. A statistical parameter is computed 514 based on the plurality of reference cases. In an exemplary embodiment, the statistical parameter is a frequency parameter used to determine the plurality of nuisance structural features. In such an embodiment, the frequency parameter is assigned to each of the reference structural feature of the second subset. In one embodiment, the frequency parameter is determined based on a number of cases among the plurality of nuisance cases 504, having a reference structural feature. In an alternate embodiment, the number of repetitions of reference structural feature is considered as the frequency parameter. Similarly, a plurality of frequency parameters corresponding to each reference structural feature of the second subset is determined.
A subset of the plurality of frequency parameters greater than a second threshold value is determined. In one embodiment, the second threshold value is defined by a user. In an alternate embodiment, the second threshold value is retrieved from a database. The reference structural features from the second subset of reference structural features, corresponding to the subset of plurality of frequency parameters, are determined as the plurality of nuisance structural features 416.
A statistical significance parameter is determined based on the contingency table of
where, A, B, C, and D are entries of the contingency table 700 and the exclamation mark (!) is representative of factorial mathematical operation. If the statistical significance parameter is less than a pre-defined constant value, the reference structural feature SFNNX is determined as “instructive” with reference to the considered fault identifier FIMMY. In a specific example, the value of A is thirty seven, the value of B is twenty one, the value of C is four, the value of D is six hundred and thirty five and the pre-defined constant value is 0.05. In such an example, the statistical significance parameter p is equal to 6.8×10−42. Since the value of p is smaller than 0.05, the reference structural feature SFNNX is instructive with reference to the fault identifier FIMMY.
The plurality of similarity values includes a first numerical value 812 of each reference structural feature from the first subset of reference structural features based on a second frequency of occurrence of each reference structural feature with reference to the plurality of measured structural features. In an exemplary embodiment, a structural feature which occurs commonly in the measured structural features and the first subset 414 of reference structural features corresponding to a reference case is considered. The second frequency of occurrence corresponding to the common structural feature is referred to as a ratio of repetition of the common structural feature in the reference case to the repetition of the common structural feature in the measured case. As an example, if CSFID1 is a common structural feature and if CSFID1 is repeated twice in the reference case and four times in the measured case, then the second frequency of occurrence is equal to 0.5. As another example, if CSFID1 occurs once in the reference case and the measured case, then the second frequency of occurrence is equal to one. The second frequency of occurrence may be suitably weighted to determine the first numerical value 812. The first numerical value 812 is represented by:
first_numerical_value=(1−α)+α×second_frequency (2)
where, α is a weighting factor of the second frequency of occurrence. In one example, the value of α is selected as 0.3. In another example, the value of α may be equal to 0.4. It should be noted herein that the equation (2) should not to be construed as a limitation of the invention and the first numerical value 812 may be determined using other similar mathematical formulae indicative of the relative similarity between the measured case and the reference case with reference to the common structural feature.
Further, the plurality of similarity values includes a second numerical value 814 of each reference case determined based on the first numerical value 812 of each reference structural feature. In one embodiment, the plurality of similarity values corresponding to the instructive structural features of the reference case are added together to determine the second numerical value 814 corresponding to the reference case. It should be noted herein that the second numerical value 814 indicative of a similarity value of each reference case with reference to the measured case. The technique of determining a plurality of similarity values corresponding to each reference case is explained in greater detail below.
Further, the plurality of similarity values includes a third numerical value 816 of each fault identifier determined based on the second numerical value 814 of each reference case. In an exemplary embodiment, the third numerical value 816 for a fault identifier is determined by adding a plurality of second numerical values corresponding to a plurality of reference cases having the fault identifier. Further, a plurality of third numerical values corresponding to each of the plurality of fault identifiers are determined. A maximum value among the plurality of third numerical values is then determined and a fault identifier corresponding to the maximum value is identified. The fault identifier 810 is representative of the operating condition of the machine. In an alternate embodiment, a subset of values among the plurality of third values is identified. A plurality of fault identifiers corresponding to the subset of identified values are determined.
A statistical parameter is computed 908 based on the plurality of reference cases and the plurality of reference structural features. In one embodiment, a plurality of statistical parameters are computed. In one such embodiment, a first parameter from the plurality of statistical parameters is used to determine an instructive structural feature. In one specific embodiment, the first parameter is a statistical significance of each reference structural feature with reference to each corresponding fault identifier. In such a manner, a plurality of instructive structural features are determined 910 from the plurality of reference structural features. In another embodiment, a second parameter from the plurality of statistical parameters is used to determine a nuisance structural feature. In one such embodiment, the second parameter is a first frequency of occurrence of each reference structural feature with reference to a plurality of reference structural features of the plurality of nuisance cases. In such a manner, a plurality of nuisance structural features are determined 912 from the reference structural features.
A first subset of reference structural features is obtained 914 based on the instructive structural features and the nuisance structural features identified from the plurality of reference structural features. The first subset includes the instructive structural features and excludes the nuisance structural features. A plurality of similarity values for reference structural features of the first subset is determined 916. The plurality of similarity values are determined based on the reference structural features of each reference case and the plurality of measured structural features. Specifically, the plurality similarity values are determined based on a frequency of occurrence of each reference structural feature of the reference case, within the plurality of measured structural features.
A plurality of similarity values for each reference case are determined 918 based on the plurality of similarity values for the reference structural features corresponding to the each of the plurality of reference cases. A plurality of similarity values for each of the fault identifier is obtained 920 based on the similarity values for the plurality of reference cases corresponding to each fault identifier. At least one fault identifier is determined 922 based on the plurality of similarity values corresponding to the plurality of fault identifiers.
Exemplary embodiments of the case-based reasoning technique disclosed herein enables determination of at least one fault identifier among a plurality of fault identifiers associated with a plurality of reference cases representative of an operating condition of the machine. Determination of instructive structural features from the plurality of reference structural features for computing the plurality of similarity values facilitates reduction of false alarms while diagnosing an operating condition of the machine. It is to be understood that not necessarily all such objects or advantages described above may be achieved in accordance with any particular embodiment. Thus, for example, those skilled in the art will recognize that the systems and techniques described herein may be embodied or carried out in a manner that achieves or improves one advantage or group of advantages as taught herein without necessarily achieving other objects or advantages as may be taught or suggested herein.
While the technology has been described in detail in connection with only a limited number of embodiments, it should be readily understood that the invention are not limited to such disclosed embodiments. Rather, the technology can be modified to incorporate any number of variations, alterations, substitutions or equivalent arrangements not heretofore described, but which are commensurate with the spirit and scope of the claims. Additionally, while various embodiments of the technology have been described, it is to be understood that aspects of the inventions may include only some of the described embodiments. Accordingly, the inventions are not to be seen as limited by the foregoing description, but are only limited by the scope of the appended claims. What is claimed as new and desired to be protected by Letters Patent of the United States is: