The subject matter disclosed herein relates generally to the field of condition based maintenance of machines and to a system and a method of extracting features from signal data to enable better prognostics and health monitoring of machinery.
Vibration monitoring is widely used to monitor a condition of moving machinery, e.g., a gearbox, for condition based maintenance (CBM). CBM comprises a set of maintenance actions based on real-time or near real-time assessments of the condition of, e.g., moving machinery and other systems through vibration signals that can be obtained from embedded sensors, and external tests and measurements, based on current condition indicators. Vibration monitoring techniques can utilize vibration signals from the gearbox to detect, isolate, identify, and predict degraded or faulty performance of the gearbox and its associated machinery. Typical vibration monitoring techniques rely on the domain knowledge of an expert to design appropriate features to characterize vibration data. These features are low dimensional encodings of information carried by the vibration signals. Existing data driven approaches require a predefined transformation of data, for example, Fourier transform, Hilbert-Huang transform, or the like. However, raw vibration signals can have complex statistical distributions and such low dimensional encodings may lose relevant information through characterization. A method of vibration monitoring that does not depend on physics based models and domain expertise would be well received in the art.
According to an aspect of the invention, a method for providing health indication of a mechanical system, includes receiving, with a processor, signals indicative of vibration data of the mechanical system; pre-training, with the processor, features in the signals with a model; determining, with the processor, information related to vibration signatures in the signals; associating, with the processor, the vibration signatures with historical vibration data of the mechanical system; and building, with the processor, a multi- layer Deep Neural Network (DNN) from the vibration signatures and the historical vibration data.
In addition to one or more of the features described above, or as an alternative, further embodiments could include associating the vibration signatures with known fault types from the historical data.
In addition to one or more of the features described above, or as an alternative, further embodiments could include building an initial two-layer Deep Belief Net (DBN) from the signals.
In addition to one or more of the features described above, or as an alternative, further embodiments could include building a DBN from a stack of Restricted Boltzmann Machines (RBM) comprising hidden variables and observed variables.
In addition to one or more of the features described above, or as an alternative, further embodiments could include determining a non-linearity in the hidden variables using stepped sigmoid units, sigmoid units, or rectified linear units.
In addition to one or more of the features described above, or as an alternative, further embodiments could include building an additional two-layer DBN from the initial two-layer DBN.
In addition to one or more of the features described above, or as an alternative, further embodiments could include associating the vibration signatures with ground truth labels representing known fault types from the historical vibration data.
In addition to one or more of the features described above, or as an alternative, further embodiments could include building the DNN with identical data from the model.
According to another aspect of the invention, a system to provide health indication of a mechanical system, includes a moving machinery associated with the mechanical system; a sensor associated with the moving machinery; a processor; and memory having instructions stored thereon that, when executed by the processor, cause the system to: receive signals indicative of vibration data of the mechanical system; pre-train features in the signals with a model; determine information related to vibration signatures in the signals; associate the vibration signatures with historical vibration data of the mechanical system; and build a multi-layer Deep Neural Network (DNN) from the vibration signatures and the historical vibration data.
In addition to one or more of the features described above, or as an alternative, further embodiments could include a processor that is configured to associate the vibration signatures with known fault types from the historical data.
In addition to one or more of the features described above, or as an alternative, further embodiments could include a processor that is configured to build an initial two-layer Deep Belief Net (DBN) from the signals.
In addition to one or more of the features described above, or as an alternative, further embodiments could include a processor that is configured to build a DBN from a stack of Restricted Boltzmann Machines (RBM) comprising hidden variables and observed variables.
In addition to one or more of the features described above, or as an alternative, further embodiments could include a processor that is configured to determine a non-linearity in the hidden variables using stepped sigmoid units, sigmoid units, or rectified linear units.
In addition to one or more of the features described above, or as an alternative, further embodiments could include a processor is configured to build an additional two-layer DBNs from the initial two-layer DBN.
In addition to one or more of the features described above, or as an alternative, further embodiments could include a processor that is configured to associate the vibration signatures with ground truth labels representing known fault types from the historical vibration data.
Technical function of the embodiments of the invention include prognostics and health management of machinery through extraction of health features in vibration data without utilizing physics based models and domain expertise. The invention uses vibration data to pre-train a model to characterize signatures of features, which are used to backpropagate the features to known condition fault types of machinery in order to predict degraded or faulty performance of machinery.
Other aspects, features, and techniques of the invention will become more apparent from the following description taken in conjunction with the drawings.
The subject matter, which is regarded as the invention, is particularly pointed out and distinctly claimed in the claims at the conclusion of the specification. The foregoing and other features, and advantages of the invention are apparent from the following detailed description taken in conjunction with the accompanying drawings in which like elements are numbered alike in the several FIGURES:
Referring now to the drawings,
Computer system 200 includes a communications interface 220 connected to the bus 202. Communications interface 220 allows software and data to be transferred between computer system 200 and external devices. Examples of communications interface 220 may include a modem, a network interface (such as an Ethernet card), a communications port, a PCMCIA slot and card, etc. Software and data transferred via communications interface 220 are in the form of signals which may be electronic, electromagnetic, optical, or other signals capable of being received by communications interface 220. These signals are provided to communications interface 218 in secondary memory 210 via a communications path (i.e., channel) and may be implemented using wire or cable, fiber optics, wired, wireless and other communications channels. Also, computer system 200 may receive sensed signals from a plurality of sensors 224 such as, for example accelerometers, for systems and machinery on aircraft 100 (
The computer system 200 may also include an I/O interface 222, which provides the computer system 200 with access to a display/monitor and the like. In an embodiment, the results and/or pictures of health monitoring based upon the PHM algorithm are reported to the user via the I/O interface 222. Also, a model containing the PHM algorithm for health monitoring is stored as executable instructions in module 206 in main memory 208 and/or hard disk storage unit 216 of secondary memory 210. The PHM algorithm, when executed by processor 204, enables the computer system 200 to perform the features of the invention as discussed herein. The main memory 208 may be loaded with one or more application modules 206 that can be executed by one or more processors 204 with or without a user input through the I/O interface 222 to achieve desired tasks.
As shown, the exemplary process is initiated in 302 where computer system 200 receives vibration signals from one or more sensors associated with machinery, e.g. a gearbox, in a mechanical system in aircraft 100. For example, the vibration signals can include energy or other data that is received from movement of gears in the mechanical system.
In 304, the vibration signals data are used to pre-train features in the vibration signals using an energy based model. Initially, a multi-layer Deep Belief Net (DBN) is built two-layers at a time using the vibration signals. The multi-layer DBN is built without presenting any labels to the vibration data. The DBN consists of a stack of Restricted Boltzmann Machines (RBM) that forms a single multilayer generative model. An example DBN is illustrated in
In one example, a Gaussian-Bernoulli RBM is used where linear variables are visible and hidden variables are binary; but, in other embodiments, other variants can be used based on the specific application. Hidden units or variables are followed by a non-linearity and can include stepped sigmoid units (SSU) for the hidden variables h. In an embodiment, the SSU can be applied according to the method disclosed in a non-patent literature publication authored by N. Jaitly and G. Hinton entitled “Learning a better representation of speech sound waves using restricted Boltzmann machines,” ICASSP, 2011, which is herein incorporated by reference. In embodiments, sigmoid units, rectified linear units, or the like may be used for the hidden variables h. Parameters are learned using contrastive divergence. Outputs of the RBM are the first-level vibremes or vibration signatures. These vibremes, the inferred states of the hidden units of the first RBM, can be used as training data to train another RBM to capture their dependencies. RBM training can be repeated as many times as desired or required to produce many layers of non-linear feature detectors (i.e., higher level vibremes). The activations or outputs of the hidden units at each RBM encode characteristic features present in the vibration signals to create vibration signatures or vibremes. In 306, the learned parameters are fine-tuned. The parameters are tuned by associating the vibremes (classify or associate the signatures) with ground truth labels (backpropagation) through a DNN. In other embodiments, other classification techniques can be used to associate the signatures to labels.
In an example, fine-tuning is performed by training a DNN on historical vibration data that contains ground truth labels. A ground truth label can include a known fault type that is identified from historical data such as, for example, a health condition indicator (CI). These ground truth labels are not limited to fault types and can correspond to descriptors of other physical conditions of interest that can identify fault types of machinery for PHM.
In this step, a multi-layer neural network is instantiated with the number of layers and number of nodes in each layer being identical to the DBN learned in 304. All the weights in the network are initialized to the parameters learned in the DBN of 304. A DNN can include many hidden layers for prognostics and health monitoring (PHM) using vibration signals. A DNN is a feedforward artificial neural network that has more than one layer of hidden units between its inputs and outputs. Each hidden unit, j, uses the logistic function to map its total input from the layer below, xj, to the scalar state, yj, that it sends to the layer above, according to equations (2) and (3).
where bj is the bias of unit j, i is an index over units in the layer below, and wij is the weight to unit j from unit i in the layer below.
In 308, for a final layer 508 (
where k is an index over all classes.
Benefits of the invention include a PHM algorithm to learn a DNN method for PHM of machinery without using domain expertise of conventional methods. The PHM algorithm utilizes a deep learning approach including a generative pre-training step and backpropagation in order to predict degraded or faulty performance of the gearbox that accurately determines faults for PHM over prior methods. Additional benefits can include building models where predictive ground truth labels are orders of magnitude less than the large amount of data collected and used in PHM of machinery.
The terminology used herein is for the purpose of describing particular embodiments only and is not intended to be limiting of the invention. While the description of the present invention has been presented for purposes of illustration and description, it is not intended to be exhaustive or limited to the invention in the form disclosed. Many modifications, variations, alterations, substitutions or equivalent arrangement not hereto described will be apparent to those of ordinary skill in the art without departing from the scope and spirit of the invention. Additionally, while the various embodiments of the invention have been described, it is to be understood that aspects of the invention may include only some of the described embodiments. Accordingly, the invention is not to be seen as limited by the foregoing description, but is only limited by the scope of the appended claims.
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/US15/51936 | 9/24/2015 | WO | 00 |
Number | Date | Country | |
---|---|---|---|
62056781 | Sep 2014 | US |