The field of the present disclosure generally relates to industrial assets, and more particularly, to aspects of systems and methods to provide anamoly detection for the industrial assets and an identification of root causes corresponding to the detected anamoly.
To control normal operation of an industrial asset such as a wind turbine, traditional process control methods have been used to monitor the time series of sensor measurements and generate alerts when outliers are detected. However, different root causes may exist that can lead to abnormal sensor measurements. As an example, high tower acceleration measurements may be caused by one or more of blade misalignment, blade imbalance, incorrect control parameter, and sensor hardware issue, etc. To identify the specific root cause in conventional methods, it requires a manual diagnostic process to distinguish outlier patterns. The manual process is limited to relatively simple outlier patterns and is thus plagued by results with relatively high uncertainty.
Accordingly, in some respects, a need exists for methods and systems that provide an efficient and accurate deep learning model to automatically detect anomalies and identify the corresponding root causes thereof with high model accuracy.
Reference will now be made in detail to present embodiments of the present disclosure, one or more examples of which are illustrated in the accompanying drawings. The detailed description uses numerical and letter designations to refer to features in the drawings. Like or similar designations in the drawings and description may be used to refer to like or similar parts of the present disclosure.
Each example is provided by way of explanation of the invention, not limitation of the invention. In fact, it will be apparent to those skilled in the art that modifications and variations can be made in the present disclosure without departing from the scope or spirit thereof. For instance, features illustrated or described as part of one embodiment may be used on another embodiment to yield a still further embodiment. Thus, it is intended that the present disclosure covers such modifications and variations as come within the scope of the appended claims and their equivalents.
As an overview, embodying systems and methods provide an AI (Artificial Intelligence) anomaly pattern recognition model that leverages a diagnostic expert domain knowledge base and deep learning technique to automatically detect an industrial asset (e.g., wind turbine) operational anomaly and identify root cause(s) corresponding to the detected anomaly. In some embodiments, a large set of training cases can be established based on historical diagnostic records that include multiple root causes. For each training case, several pairs of time series of sensor measurements may be configured and represented as scatter plots, where a combination of data patterns in or derived from the scatter plots indicates a specific root cause of an anomaly reflected in the sensor measurements (i.e., data).
In some embodiments, a convolutional neural network model is developed and used to recognize patterns in images of the scatter plots and to classify the training cases with particular root causes. Further, cross validation is performed to ensure robustness of the generated model. In some embodiments, the model may be used for real-time anomaly prediction of operational assets. Some embodiments might include a feedback loop to, for example, track model accuracy, facilitate the continuous updating of the training data, model improvement, and combinations thereof.
In some aspects, one or more embodiments described herein may be applicable to many different types of industrial assets. By way of example,
In an exemplary embodiment, a wind turbine site 205 includes the plurality of wind turbines 210, 215, and 220 that may each include a processor-enabled wind turbine controller 225. Wind turbine controller 225 of each wind turbine may be coupled in signal communication with site monitor 235 via network 245.
In some embodiments, site monitor 235 might be located at wind turbine site 205 or, alternatively, it might be located remotely from wind turbine site 205. For example, site monitor 235 might be communicatively coupled to and may interact with wind turbine controllers 225 at a plurality (not shown) of wind turbine sites 205.
In some aspects, each of site monitor 235 and wind turbine controller 225 includes a processor (e.g., a computing device or machine). A processor herein may include a processing unit, such as, without limitation, an integrated circuit (IC), an application specific integrated circuit (ASIC), a microcomputer, a programmable logic controller (PLC), and/or any other programmable circuit. A processor herein may include multiple processing units (e.g., in a multi-core configuration). In some embodiments, each of site monitor 235 and wind turbine controllers 225 may be configurable to perform the operations described herein by programming the corresponding processor. For example, a processor may be programmed by encoding an operation as one or more executable instructions and providing the executable instructions to the processor as a data structure stored in a memory device coupled to the processor. A memory device may include, without limitation, one or more random access memory (RAM) devices, one or more storage devices, and/or one or more computer-readable media.
As depicted in the example of
In the example of
In some embodiments, data processing & data filtering component 315 might process, condition, pre-process, or “clean” the operational data 305 so that it is configured in an expected manner and format for efficient processing by deep learning model system 310. In some scenarios, operational data 305 might include historical operational data associated with one or more wind turbines. Operational data 305 might be received directly or indirectly from the wind turbines, such as a database storing the data and/or a service provider that might aggregate or otherwise collect the operational data. For example, data processing & data filtering component 315 might operate to exclude turbine downtime data received in operational data 305 since such data may not be needed in some embodiments herein. In some aspects, data processing may be performed to ensure data quality and data validity, such as, for example, to process the operational data to execute an air density correction for wind speed measurements included in the operational data 305.
The training data establishment component 320 or functionality of deep learning model system 310 may operate to establish a set of training cases based on the historical diagnostic records of the wind turbine operational data 305 that includes multiple root causes embedded within the data. The set of training cases may be used in training the deep learning model generated by component 325. In some embodiments, multiple pairs of time series of sensor measurements are selected for each training case and configured as scatter plots (or other graphical representations of data), wherein a combination of data patterns in the scatter plots is specific to one root cause. It is noted that normal turbine operation cases are also included in the training data set, and might be used to, for example, provide a relative operational baseline for the wind turbines represented in the operational data 305. In some embodiments, the diagnostic data records 305 and the corresponding scatter plots may be reviewed by domain experts and/or automated processing systems that can, for example, reference digitized or other machine readable data structures and systems, devices, and services that embody a domain expert knowledge base to ensure correct labeling of training cases.
The deep learning model building and validation component 325 or functionality of deep learning model system 310 may operate to convert or transform the scatter plots (or other representations of wind turbine operational data 305) into visual representation images of the scatter plots (or other representations of the operational data). For example, deep learning model building and validation component 325 may operate to develop (i.e., generate) a deep learning classification model that builds connections (e.g., transfer functions, algorithms, etc.) between the scatter plots based on the operational data and root causes for anomalies in the operational data by processing an input of high-dimensional images including data pixels corresponding to the scatter plots to generate an output including root cause labels associated with one or more anomalies derived from data patterns in the images. The deep learning model herein is a deep learning classification model developed to build a connection between scatter plots including data representations of wind turbine anomalies and the corresponding root causes thereof. In some aspects, a convolutional neural network (CNN) model is developed to capture and process pixel data to recognize the complex data patterns in images of the scatter plots and to further classify anomaly cases in the training set as being associated with a particular root cause for the determined anomaly classification. Deep learning model building and validation component325 may include functionality to perform one or more types of cross validation on the developed and trained model to ensure robustness of the model.
The output of deep learning model system 310 including an indication of the detected one or more anomalies derived from data patterns in the images and the corresponding root cause labels associated therewith, or at least a portion thereof, might be used for updating training data and model improvement. For example, when the model is used for real-time anomaly detection and root cause identification (i.e., the wind turbine operational data 305 is real-time operational data from one or more wind turbines), a feedback loop 335 may be configured to track an accuracy of the model. For example, newly identified anomaly cases can be added into the original training set (e.g., a subset of the historical operational data used to develop the model), and an updated deep learning model can be re-tuned to capture the new expanded distribution of training cases. In this manner, a functionality or process can be provided that facilitates a continuous updating of training data for the model, as well as model improvement.
Referring to
In one non-limiting example, a client 425 may execute one or more of the applications 430 to invoke performance of an anomaly detection and root cause identification process via a user interface displayed on the client 425 to view analytical information such as visualizations (e.g., charts, graphs, tables, and the like), based on the underlying data (e.g., wind turbine operational data) stored in the data store 405. The applications 430 may pass analytic information to one of services 420 (e.g., a deep learning model development and implementation service such as, for example, system 310 in
According to various embodiments, one or more of the applications 430 and the cloud services 420 may be configured to perform anomaly detection and root cause identification based on image processing performed by a deep learning model developed in accordance with some embodiments herein.
In some embodiments, the data of data store 405 may include files having one or more of conventional tabular data, row-based data, column-based data, object-based data, and the like. According to various aspects, the files may be database tables storing data sets. Moreover, the data may be indexed and/or selectively replicated in an index to allow fast searching and retrieval thereof. Data store 405 may support multi-tenancy to separately support multiple unrelated clients by providing multiple logical database systems which are programmatically isolated from one another. Furthermore, data store 405 may support multiple users that are associated with the same client and that share access to common database files stored in the data store 405.
Referring to the scatter plots
In some aspects, there might generally be a large variation in wind turbine operation data due to a plurality or combination of sensor, turbine control, and environment factors. The combination and complexity of factors presents a challenge to accurately distinguishing between normal wind turbine operation and abnormal wind turbine operation. In some aspects, the present disclosure's deep learning model to recognize data patterns embedded in images of the scatter plots of sensor measurements provides improvements by, for example, increasing and enhancing the anomaly detection accuracy and root cause identification.
In some embodiments and aspects, automatic processes and systems implementing such processes as disclosed herein to detect wind turbine (or other assets) operation anomalies and identify the corresponding root causes that can be scaled to multiple turbines at a wind farm and/or fleet level include a “physical +digital” integration that leverages accumulated domain knowledge (i.e., wind turbine operating characteristics, anomalies, and root causes of those anomalies) and advanced analytical techniques such as, for example a deep neural network (e.g., a CNN).
At operation 605, a deep learning model development platform, system, service, or device may receive historical time series sensor data associated with operation of an industrial asset, where the sensor data includes values for a plurality of sensors over a period of time. In some embodiments, a time series data collection component may collect and store wind turbine operational time series data including a set (e.g., pairs) of sensor measurements.
In some instances, at least a portion of the raw historical time series sensor data may be filtered to exclude data and/or artifacts that will not be included or needed in further operations of process 600. Such filtered data may include wind turbine (or other asset) downtime measurements.
At operation 610, visual representation images of scatter plots based on the received historical time series sensor data may be generated, wherein each scatter plot includes a specific pair of time series sensor data for the plurality of sensors interfaced with the wind turbine. In some embodiments, one or more of the generated images may comprise a plurality of the scatter plots.
In some embodiments, at least a portion of the received historical time series sensor data may be transformed to a format, configuration, level, resolution, etc. from its raw configuration as obtained by the wind turbine (or other asset) sensors. In some instances, the transformation will depend on the sensor measurements to be processed (e.g., an air density correction for wind speed measurements, etc.).
In some embodiments, a sub-process of process 600 or separate process might include aspects of an image specification and within-image plot arrangement method. Such a method may include, in part and/or in combination, selecting a specific set of pairs of sensor measurements according to known diagnostics (e.g., a digitized or machine-readable knowledge base built on engineering experience) for inclusion in an image; designing an image layout, including specifying a size for the image; and assigning each pair of sensor measurements to a specific location in the image. In this manner, an image comprising visual representations for a plurality of scatter plots might be configured in a single image in an efficient and defined manner so that such constructed images may be reliably generated based on scatter plots of operational data and further accurately analyzed for the detection of patterns indicative of operational anomalies. The pairwise sensor scatter plots and image generation therefrom may include, in part and/or in combination, drawing a scatter plot for each pair of time series sensor measurements; using, for each scatter plot, a binary scale for each pixel value, or using a continuous scale that incorporates additional information (e.g., data density and/or other normalized sensor measurements) in the scatter plot; adjusting the vertical and horizontal axis scale across scatter plots in the image to, for example, present/magnify certain image features; and adding, to each scatter plot, a comparative scatterplot as a reference/baseline plot, thereby generating a multi-layer image.
At operation 615, a root cause label is assigned to each visual image including the scatter plots representing an operational anomaly based on a reference to and leveraging of, at least in part, a digitized knowledge domain data structure or system associated with the industrial asset(s) in combination with the data patterns in each image. In some aspects, a standardized ground truth label is assigned to each generated image. In some regards, abnormal sensor measurements (i.e., anomalies) may be caused by different root causes. In particular, each root cause requires a specific type of maintenance and repair practice. As such, identification of the correct root cause can provide actionable insights with respect to on-going operations, preventative maintenance, and corrective maintenance aspects of a wind turbine (and/or other assets).
Continuing to operation 620, a deep learning model and more particularly a convolutional neural network (CNN) model is trained using a first subset of the labeled images and tested based on a second subset of the labeled images applied to the trained model to evaluate the performance of the trained model. In some aspects, the first subset of the labeled images is referred to a training set of data and the second subset of the labeled images that is applied to the trained model is referred to as test data, where the first and second subsets of images are distinct from each other.
In some embodiments, the CNN adheres to a specific model structure defined by, for example, the number of layers in the neural network, the number of nodes for each layer, the inter-connection between layers, transfer functions between layers, etc. and is trained using the training data with model parameters estimated accordingly. In some instances, cross validation technique(s) may be used to avoid model over fitting on the training data.
Moreover, iterations of the model training/test cycles may be executed to identify an optimal and robust CNN, where an “optimal” model may vary depending on one or more features of an application.
At operation 625 of
Referring to
At operation 635, a representation of the record including the at least one detected anomaly and the one or more root causes associated therewith may be sent to or transmitted to a device (e.g., a client device) that invokes an action (e.g., generate alarm(s) when a specific root cause is identified) in response to the one or more root causes indicated in the record. In some instances, the action might be automatically (i.e., without further user action(s)) invoked, executed, or at least initiated by the receiving device in response to the reception of the representation of the record.
In some embodiments, process 600 or a process executed to compliment process 600 might include providing at least a portion of the record of the at least one detected anomaly and the one or more root causes associated therewith back to the model to assist in at least one of tracking an accuracy of the model, continuous updating of the first set of the labeled images to train the model, re-tuning the model, and combinations thereof.
In some aspects, labeling of training data for a deep learning model herein is a significant concern. Consistency, accuracy, and sufficiency of training data are key aspects to ensure training data that is reliable to establish an accurate model. As used herein, consistency refers to using the same labeling nomenclature to describe a particular feature, event, or entity. For example, a first measurement “X” should always be referenced as measurement “X”, not “X” in one instance and “Y” in other instances. Accuracy in the data refers to a preciseness in the labeling of the data such that each label clearly references one particular feature, event, or entity.
In some embodiments,
It is noted that the various features, systems, and processes disclosed herein are not limited to the specific example applications and embodiments explicitly discussed. For example, the present disclosure is not limited to the specific examples discussed in the context and application of wind turbines disclosed in the detailed discussion above and/or the accompanying drawings. FIG. 20 illustrates concepts and features of an anomaly detection system in accordance with the present disclosure such as, for example, a land-based wind farm system 2005 that may be extended to and applied to an offshore wind farm site 2010. In some regards, offshore data 2015 may be used to at least supplement existing training data of the deep learning model trained on the land-based system to capture operational differences and/or idiosyncrasies of the offshore wind farm site 2010.
The embodiments described herein may be implemented using any number of different hardware configurations. For example,
Processor 2110 also communicates with a storage device 2130. Storage device 2130 may comprise any appropriate information storage device, including combinations of magnetic storage devices (e.g., a hard disk drive), optical storage devices, mobile telephones, and/or semiconductor memory devices. The storage device 2130 stores a program 2112 and/or a deep learning engine 2114 (e.g., associated with a model development and tuning process) for controlling the processor 2110. The processor 2110 performs instructions of the programs 2112, 2114, and thereby operates in accordance with any of the embodiments described herein. For example, the processor 2110 might receive sensor data associated with operation of an industrial asset, the sensor data including values for a plurality of sensors over a period of time. The processor 2110 may transform scatter plot representations of the operational data to image data comprising a plurality and combination of visual representations of the scatter plots capturing anomaly patterns for an industrial asset for which a model is developed based on training data and tested/evaluated by test data of the images. An output of the model may include an indication of the anomaly and the corresponding root cause for the anomaly. The generated deep learning (classification) model may then be executed to automatically identify an anomaly and its corresponding root cause for an operating industrial asset.
The programs 2112, 2114 may be stored in a compressed, uncompiled and/or encrypted format. The programs 2112, 2114 may furthermore include other program elements, such as an operating system, a database management system, and/or device drivers used by the processor 2110 to interface with peripheral devices.
As shown in
Some embodiments herein provide an automatic approach to detect turbine operation anomaly and identify the corresponding root causes, and therefore avoid tedious manual diagnostic process(es). The deep learning model herein can be applied to the real-time turbine operational data for all the turbines at the farm and/or fleet level, which facilitates the asset performance management strategy and largely increases business productivity. Also, the ability to identify root causes enables more efficient maintenance planning and solution deployment at the wind farm.
An embodying deep learning model can automatically detect anomaly and identify root causes with high model accuracy.
An embodying deep learning model might detect, for example, tower acceleration anomaly and identify the corresponding root causes based on thousands of historical diagnostic cases. Applicant(s) have realized a prove-of-concept model tested on real-time turbine operational data from six wind farms with wind turbines, and root causes of tower acceleration anomaly have been successfully identified.
While only certain features of the invention have been illustrated and described herein, many modifications and changes will occur to those skilled in the art. It is, therefore, to be understood that the appended claims are intended to cover all such modifications and changes as fall within the true spirit of the invention.
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/US2020/048223 | 8/27/2020 | WO |
Number | Date | Country | |
---|---|---|---|
62897774 | Sep 2019 | US |