The present disclosure relates to devices, methods, and systems for turbine fault analysis.
Two challenges noted in a US Department of Energy issued report titled 20% Wind Energy by 2030 are the improvement of wind turbine performance and reduction in operating and maintenance costs. For example, after the capital costs of commissioning wind turbine generators, one large cost for owners is maintenance.
A reduction in maintenance and operating costs can reduce payback period considerably and can provide the impetus for investment, in some implementations, and potentially provide for widespread acceptance of this clean energy source. Wind turbines often operate in severe, remote environments and can require frequent scheduled maintenance.
Additionally, the cost of unscheduled maintenance, for example, due to undetected failures can be high in maintenance support and/or lost production time. In response to such potential costs, insurers, for example in the European Union, have begun requiring costly scheduled overhauls to address potential reliability concerns. Condition-based, rather than hours based maintenance, enables high reliability and low maintenance costs by eliminating unnecessary scheduled maintenance, in some implementations.
The present disclosure includes methods, devices, and systems for turbine failure analysis. For example, embodiments of the present disclosure can provide accurate prediction of bearing and gearbox mechanical failures.
In one example embodiment, a computing device implemented method for turbine failure analysis includes collecting a first set of data associated with wind turbine performance, via a computing device, analyzing the first set of data to determine whether the first set of data can indicate a fault condition, and reviewing a second set of data associated with wind turbine performance to determine if a fault condition exists.
In various embodiments, the method can further include providing a notification to a user to take an action based upon the determination that a fault condition exists. Embodiments can also include determining a type of fault condition that exists from a number of possible fault conditions.
In some embodiments, providing the notification can include information regarding a type of action to take based upon the determined type of fault condition. Collecting a first set of data associated with wind turbine performance, via a computing device, as discussed above can include collecting supervisor control and data acquisition (SCADA) type data from at least one of: a set of wind turbine sensors, a computing device associated with a wind turbine, and a file in memory.
Analyzing the first set of data to determine whether the first set of data can indicate a fault condition includes analyzing one or more data items for a wind turbine operating under non-fault conditions. In some embodiments, analyzing the first set of data to determine whether the first set of data can indicate a fault condition includes analyzing one or more data items for a wind turbine operating under non-fault conditions and one or more data items for a wind turbine operating under fault conditions.
Analyzing the first set of data to determine whether the first set of data can indicate a fault condition can be accomplished, for example, by analyzing the first set of data using a clustering algorithm methodology, such as a self organizing feature map (SOFM) methodology. In some embodiments, analyzing the first set of data to determine whether the first set of data can indicate a fault condition includes analyzing the first set of data using a principal component analysis methodology.
One benefit of embodiments of the present disclosure can be that turbine operation and maintenance costs can be reduced through continuous monitoring of turbine health, resulting in improved reliability. More specifically, component failures (e.g., those that cause the highest failures) in wind turbines can be detected early such that they can be addressed, for example, before more costly damage occurs.
Although the present embodiments are not limited to detecting gearbox and/or generator component issues, currently, operational failures associated with the gearbox and/or generator components appear to be significant and appear to account for the largest downtime and/or expense for repair, so some discussion below is provided with respect to such issues.
For example, in one method embodiment a gearbox failure is predicted using supervisory control and data acquisition (SCADA) data. Embodiments of the present disclosure include interpreting the SCADA data to detect the gearbox failure and processes to utilize the SCADA data. Generally, wind turbine SCADA data is used for operational performance monitoring and control applications.
The approaches discussed herein include, for example, where SCADA data is segmented according to the wind turbine operation region. In such embodiments, the operation region can, for example, be defined by a control law as the control scheme is different at each region.
In such embodiments, features used to detect a failure are obtained from the same operational region. In various embodiments, if the features at the different operational region are observed, a same or similar pattern of features may be exhibited because of the region changes rather than the fault.
In this manner, the patterns indicating a failure in region A could be also producible in region B even though there is no fault. This may be true for various types of failure, not just a gearbox failure. The features can, for example, be viewed separately at each region and can be differentiated when the information is extracted from the data.
Another approach discussed herein provides that for a gearbox failure, the measurements which are relevant to gearbox operation from the whole set of data can be initially identified (e.g., power, High Speed Shaft (HSS) torque, speed, Low Speed Shaft (LSS) torque, speed, oil temperature, and/or oil pressure). In various embodiments, each individual measurement can be reviewed separately, and they can be compared to the baseline and fault conditions to determine differences.
In some embodiments, some combination of individual measurements can be reviewed. For example, a ratio of individual measurement (e.g., ratio of HSS torque and HSS speed, ratio of HSS speed and LSS speed, and/or normalized oil temperature with respect to ambient temperature) can be reviewed.
The values of these measurements (e.g., obtained via sensor data from the wind turbine or associated instrumentation) can be reviewed with respect to each other, rather than the values reviewed individually. For example, HSS torque vs. power, ratio of HSS torque or HSS speed vs. power, and/or etc., can be reviewed against each other.
From this process, the most indicative patterns for the gearbox fault can be identified. For example, patterns can be observed from the ratio of HSS torque or HSS speed vs. Power Electronics Power (PE power), the ratio of HSS speed or LSS speed vs. power, the normalized oil temperature vs. Power, and/or the oil pressure vs. power.
Anomaly detection for individual wind turbines may be developed with a data set collected. Analysis may be done, but is not limited to, using principal component analysis (PCA), and non-linear methods. For example, SOFM (Self Organizing Feature Maps) is a non-linear method, by which unsupervised learning can be used in a set of data with unknown features, for categorizing them into groups with similar features. A set of algorithms that look for anomalies in a sensor within a sensor group may be used in detecting anomalies with the SCADA data.
The automation of the fault detection and/or prediction can, for example, be done using the PCA and/or SOFM. The PCA can, for instance, be used to detect the anomalies and the SOFM can, for example, be used to isolate a fault. Alarms generated from a fault detection and isolation system can, for example, be composed of multiple (e.g., three indicators) indicators, for example, based on Qstat, T2stat from PCA, and/or a clustering number from SOFM. In the prior art, process parameters are not routinely used for fault detection and/or prediction and only used in a primitive form for safeguarding against large deviations in individual signals.
Provided below is one example configuration of an embodiment of the present disclosure. This example should not be viewed as limiting the cope of the embodiments of the present disclosure.
The Controls Advanced Research Turbine 2 (CART2) is a modified Westinghouse WWG-0600 machine rated at 600 kW. It is located at the National Wind Technology Center (NWTC) in Boulder, Colo. and was installed to test new control schemes for power and load regulation. In its original configuration, the WING-0600 uses a synchronous generator, fluid coupling, and hydraulic collective pitch actuation.
However, the CART2 is fitted with an induction generator, rigid coupling, and individual electromechanical pitch actuators. The rotor runs upwind of the tower and includes two blades and a teetering hub.
As a research turbine, it is outfitted with many more sensors than would normally be installed on a wind farm turbine, including pressure transducers, torque transducers, strain gauges, thermometers, position encoders, accelerometers, anemometers, wind vanes, and power, current, and voltage meters. The output of each of these sensors is recorded at the control rate of 100 Hz. Total of 88 measurements are stored in 10 minute blocks including; Pitch Angles, Shaft Torque, Oil Temperature and Pressure, Yaw Positions, Wind Speeds and Directions, Ambient Temperature, Vibrations, Tower Loads, Generator Power/Current/Voltage, and other Control Signals. An example of the parameters includes:
‘Teeter brake pressure’
‘Gearbox oil pressure’
‘Nacelle ambient temperature’
‘PE kVA’
‘LSS RPM rate of change’
‘HSS RPM rate of change’
‘Digital outputs 1’
‘Digital outputs 2’
‘Normal stop bits 1’
‘Emergency stop bits 1’
‘Warning bits 1’
‘Warning bits 2’
Clustering algorithms are methods to divide a set of n observations into g groups so that members of the same group are more alike than members of different groups. If this is successful, the groups are called clusters.
Depending on the availability of class labels, there are two types of methods in learning the patterns in observations. When class labels of observations are available, supervised learning is used, otherwise unsupervised learning is appropriate. In unsupervised learning, there are no target outputs available. So the free parameters, such as weights and biases in neural networks, can be modified in response to network inputs only. In some embodiments, the input patterns are categorized into a number of classes.
A self organizing feature map (SOFM) is a specific type of clustering algorithm, forming neurons located on a regular, usually 1 or 2 dimensional grid. SOFM can detect regularities and correlations in their input and adapt their future responses to that input accordingly by learning to classify input vectors according to how they are grouped in the input space.
In other words, SOFM learns to recognize groups of similar input vectors in such a way that neurons physically near each other in the neuron layer respond to similar input vectors. In a classical k-means method, an observation can be mapped to the cluster whose representative is nearest to the observation.
This is what happens in SOFM, but additionally the training algorithm in SOFM attempts to map some structure to the representatives. In SOFM, a large number of clusters can be chosen, and arranged on a regular grid. The idea is that the representatives are spatially correlated, so that representatives at nearby points on the grid are more similar than those which are widely separated.
In SOFM, the cluster representatives, which are the neurons in the layer of a SOFM, are initially assigned at random in some suitable distribution according to a topology function which dictates the structure of the map. Whenever an observation x is presented, the closest representative mj is found, which is identified as the winner. Then for all representatives i which are neighbors of j on the grid;
m
i(t+1)=mi(t)+αhij[x(t)−mi(t)], for oil neighbors i,
where t denotes time, α is learning rate, and hij is the neighborhood kernel around the winner neuron j. The learning rate α is allowed to change with time.
The neighborhood function determines how strongly the neurons are connected to each other. An example of a neighborhood function is a constant over the whole neighborhood of the winner neuron. Another function of time-varying type is the Gaussian neighborhood function
where rj and ri are the locations of winner neuron j and its neighborhood neurons i on the map grid, respectively, and σ(t) is the neighborhood radius at time t.
The neighborhood function and the number of neurons determine the granularity of the resulting mapping. The larger the area where neighborhood function has high values, the more rigid is the map.
The larger the map, the more flexible it can become. This interplay determines the accuracy and generalization capability of SOFM.
In this work, the CART2 data is segmented according to the wind turbine operation region. The operation region is defined by a control law as the control scheme is different at each region.
Region 3 operation occurs above rated wind speed (i.e., the wind speed above which maximum peak power is produced). The turbine must limit the fraction of the wind power captured so as not to exceed electrical and mechanical design loads.
In embodiments of the present disclosure, the features used to detect the failure can be obtained from the same operational region. It is desired to decouple the real fault signatures from any signatures which are not caused by the fault but are similar to the ones when the fault exits.
The differences in the operating condition, the changes on the operating condition, and the differences in the ambient condition are the typical examples resulting in signatures similar to the ones caused by a real fault. Thus in this work, the CART2 data is segmented along with the operation regions, and the focus to observe the fault signatures is concentrated in region 2.
Generally, the region of interest will be selected depending on the type of failure mode. For example, if a target failure mode is expected to exhibit the most distinguishable fault signatures during the turbine startup period, then the observation will need to be focused on the region 1.
Or, if a target failure mode is expected to exhibit the signatures at the earliest time during the maximum power operation, the region of interest will be region 3 for the purpose of incipient fault detection. In this work, a target failure mode can be the gearbox failure which needs to be observed from the wider range of speed variations, thus our region of interest is the region 2 as it represents the wider range of operations.
Another part of data preprocessing can include the modification of the sampling rate. In one such example embodiment, the original CART2 data was collected, for example, at the sampling rate of 100 Hz (100 samples per second), as the main purpose of CART2 utilization is the test of control schemes.
Without considering the cost required for the data acquisition, it is preferred to have such high frequency speed data not only from the perspective of the control but also from the standpoint of the fault detection and/or diagnosis.
This is, for example, because the high frequency data contains the richer information or information that will not be available from the lower frequency data (i.e., more data points). The high frequency data will, in some implementations allow the system to address the faults that are not detectable due to the data quality limitations. In some embodiments, original data set (e.g., the high frequency data) is down-sampled to 1 Hz (1 samples per second). The benefits can, for example, be; 1) that the fault signatures are still observed well from the lower sampled data, and 2) as mentioned earlier, that SCADA data for the Condition Based Maintenance (CBM) purposes can be utilized, in some embodiments, and the sampling frequency of SCADA data is much slower, typically 1 sample per minute.
In order to build a base line of data for forecasting the condition of a wind turbine data can be obtained from test turbines. For example, if it was reported that there was a gearbox related failure from CART2 data, the data measured at the time of the failure can be obtained and extensive data sets can be collected before and after the time of the failure.
The measurements which are relevant to gearbox operation can be extracted from the whole set of data. For example, this information can include the power, the torque at both high speed shaft (HSS) and/or low speed shaft (LSS), the speed at high speed shaft (HSS) and/or low speed shaft (LSS), the oil temperature, and/or the oil pressure.
In some embodiments, each individual measurement can be analyzed separately to see how they are different compared to a baseline condition and/or fault condition. Also the combination of individual measurements, for example, the ratio of individual measurement can be analyzed to observe any indications of the fault, in some embodiments.
The changes against each other, rather than the changes of one item, for example, changes in HSS torque vs. power or the ratio of HSS torque to HSS speed vs. power, can be analyzed as well. From this process, the most indicative patterns for the gearbox fault can be identified, and they can be observed from the ratio of HSS torque or HSS speed vs. PE power, the ratio of HSS speed or LSS speed vs. power, the normalized oil temperature vs. power, as shown in
Such data collection and analysis can help identify the regions of a data set that may be most helpful in identifying faults. This can be beneficial, for example in focus data analysis efforts in these data regions and/or in selecting data to be extracted for data analysis or in selecting data to be discarded from a larger data set.
In some embodiments, based on these observations, the fault detection and isolation can be done using the PCA and SOFM. For example, the PCA can be used to detect anomalies in the data set and the SOFM can be used to isolate a fault.
In some embodiments, alarms can be generated based on one or more indicators. Examples of suitable indicators include, but are not limited to, indicators from PCA based on Q-stat and/or T2stat, and/or an indicator from SOFM based on one or more clusters of data.
In some embodiments, a training stage can be utilized wherein the system is calibrated based upon some initial data analysis. In some such embodiments, during the training stage, one or more sets of training can be prepared.
For example, one set of data can, for instance, be composed of Power, HSS torque, and HSS speed, and another set composed of Power, HSS torque, HSS speed, and LSS speed. The first set can, for example, be intended to capture the fault indication observed from
In some embodiments, the one or more data sets can be used to train Auto-Associative Neural Networks (AANN). This data can be prepared to ensure that it is representing wide possible operation range within the region 2 defined earlier.
In one example, the training (e.g., testing data) set includes data collected from operation of the device in which the data was collected during a number of time periods (e.g., October, January, December, November, February, March, April, and June over a time period of two years). In some such embodiments, the training data set for AANN is composed of baseline data only. In this manner, the baseline of the device analysis system can be defined and/or refined.
In some embodiments, training can be conducted using multiple data sets to determine which set may produce the best fault detection results. For example, having trained a system based on two different sets of data (e.g., one set composed of Power, HSS torque, and HSS speed, and the other set composed of Power, HSS torque, HSS speed, and LSS speed), it is found that the AANN trained with the first set produces better performances resulting in less false alarms and more successful detection than the second set. In such an embodiment, the training can then be focused on using one set of data, if desired.
In some embodiments, this testing can provide for inferences to be made regarding the testing and reasoning for one data sets success versus another data set. For example, the success of the first data set over the second above could be due to the wider spread of the speed ratio from the fault data shown in
In some embodiments, the SOFM can also be trained. For example, when trained with the first and second data sets above, the better performance with the system trained with the first data set can also be observed with the training of SOFM.
The data sets used for the training of SOFM can be the same or similar to those used for AANN training. For example, in some embodiments, a difference between the training set of SOFM and the training set of AANN can be that the SOFM training set includes the fault data. Thus, the SOFM training data set is the augmentation of the AANN training set with the addition of the fault data.
In this example, the data does not indicate the fault yet. The analysis on the consistency of the alarms as the time approaches closer to the failure time, or regarding how early the fault is detected can be achieved by analyzing the data collected in a time period around the day when the failure occurred. This time period for a particular type of event can be identified and/or refined based upon data collected from the occurrence or such events either on the device subject to this analysis or other devices that may provide similar results.
In an example, in some instances, information can be provided from resources outside the system. For instance, if the log book were reviewed, information regarding the failure, may be identified which can be used to further precision the analysis of the system.
For example, a log book may indicate that there was clunking noise heard while running the controller 14 days before the failure. This can be analyzed as to whether it is indicative of the start of the failure.
The first part of the data, up to the samples at around 3500, is the baseline data and they are marked with x's. The second part of the data, corresponding to the samples after 3500, is the fault data and they are marked with o's.
The y-axis represents the cluster number for the data. In such an embodiment, a clustering system having a number of clusters (e.g., 28 clusters) can be obtained after the training, and y-axis shows the cluster numbers from 1 to 28. As shown in the figure, there are no baseline samples which correspond to clusters number 17, 18, and 21, as only the samples among the fault data correspond to these clusters.
This indicates that the clusters number 17, 18, and 21 are formed based on the fault data only, thus they represent the fault condition. Whereas the other clusters correspond with the baseline data, thus they represent the normal condition. Based on this information, the mapping of the cluster numbers into the conditions can be accomplished and the result is shown in
Regarding the fault isolation, the SOFM can, for example, be used for the isolation purpose because the different clusters are formed representing different faults. In the example shown, though the SOFM is able to isolate the faults, it is trained with only one failure mode as that is the only failure history that is available. However, as more different failure modes are identified, and they are observable from the same set of parameters, a SOFM can demonstrate the isolation capability of those faults as well, in some embodiments.
As shown in
For example, user interface 1322 can include a screen that can provide information to a user of computing device 1320 and/or receive information entered into a display on the screen by the user. However, embodiments of the present disclosure are not limited to a particular type of user interface.
In some embodiments, the data, which is, for example, input to the PCA and SOFM, can be provided to the computing device 1320, for example, through a cable in an automated way. Data can be provided directly via a connection between the computing device 1320 to the wind turbine or indirectly from a database in which the wind turbine data is located.
This could be accomplished, for instance in an on-line and/or off-line manner.
For example, for an on-line operation, computing device 1320 can have a trigger condition for the data acquisition, and when the operation meets the trigger condition, then the PCA and/or SOFM can be performed and the results would be displayed. For instance, for and off-line operation, the data can be downloaded periodically and then processed accordingly (PCA and/or SOFM can be performed and the results displayed with the data collected during the operation of interest, like region 2, for example).
As shown in
Further, although memory 1326 is illustrated as being located in computing device 1320, embodiments of the present disclosure are not so limited. For example, memory 1326 can also be located internal to another computing resource (e.g., enabling computer readable instructions to be downloaded over the Internet or another wired or wireless connection).
Memory 1326 can also store executable instructions, such as, for example, computer readable instructions (e.g., software), for acquiring wind turbine data from a wind turbine or from a database or other memory store of wind turbine data in accordance with one or more embodiments of the present disclosure. Processor 1324 can execute the executable instructions stored in memory 1326 to acquire wind turbine data in accordance with one or more embodiments of the present disclosure.
In some embodiments, a computer-readable non-transitory medium can be used for storing instructions for turbine fault analysis. These instructions can be executable by a processor of a computing device to cause the computing device to accomplish various tasks.
For example, in some embodiments, the instructions can be executed to collect a first set of data associated with wind turbine performance, via a computing device. The first set of data can be analyzed to determine whether the first set of data can indicate a fault condition based upon a data value that satisfies a threshold value. A second set of data associated with wind turbine performance can be reviewed to determine if a fault condition exists based on a comparison of the second set of data and the threshold value.
In various embodiments, the instructions can include instructions executable by the computer to analyze the first set of data to determine a baseline condition with respect to one or more performance characteristics. In some embodiments, the instructions include instructions executable by the computer to determine when a fault condition exists with respect to one or more performance characteristics.
The one or more performance characteristics can, for example, include at least one of power, torque at a high speed shaft, torque at a low speed shaft, speed at the high speed shaft, speed at the low speed shaft, oil temperature, and/or oil pressure may be used, among other wind turbine performance characteristics that may be utilized.
In some embodiments, such parameters can be beneficial in identifying different fault types or may be particularly advantageous in predicting a fault of a certain type. For example, when multivariate analysis is performed (e.g., the observation from some ratio vs. power), the results can be indicative of gearbox failure and can be particularly useful in predicting gearbox failure detection and isolation.
In some embodiments, the instructions include instructions executable by the computer to analyze the first set of data to determine whether the first set of data can indicate a fault condition based upon data that satisfies a threshold value with respect to one or more performance characteristics. As with the above embodiments, the one or more performance characteristics include at least one of power, torque at a high speed shaft, torque at a low speed shaft, speed at the high speed shaft, speed at the low speed shaft, oil temperature, and/or oil pressure may be used, among other wind turbine performance characteristics that may be utilized.
Some embodiments provide a computing device for turbine fault analysis. In various embodiments the computing device can, for example, include: a processor and memory, wherein the memory stores instructions, executable by the processor to cause the computing device to perform a number of functions.
For example, some computing device embodiments can collect a first set of data associated with wind turbine performance. The first set of data can be analyzed using at least one of a self organizing feature map methodology and a principle component analysis methodology to determine whether the first set of data can indicate a fault condition. A second set of data associated with wind turbine performance can be reviewed to determine if the fault condition exists.
In some embodiments, the analysis of the data using a self organizing feature map methodology, for example, utilized to detect an anomaly in the first set of data. The analysis of the first set of data using the principle component analysis methodology can be utilized, for example, to locate a fault based on the detection of the anomaly. The computing device can include instructions to train the computing device to analyze the data using at least one of a self organizing feature map methodology and a principle component analysis methodology to determine whether the data can indicate a fault condition, via an auto-associative neural network.
Although specific embodiments have been illustrated and described herein, those of ordinary skill in the art will appreciate that any arrangement calculated to achieve the same techniques can be substituted for the specific embodiments shown. This disclosure is intended to cover any and all adaptations or variations of various embodiments of the disclosure.
It is to be understood that the above description has been made in an illustrative fashion, and not a restrictive one. Combination of the above embodiments, and other embodiments not specifically described herein will be apparent to those of skill in the art upon reviewing the above description.
The scope of the various embodiments of the disclosure includes any other applications in which the above structures and methods are used. Therefore, the scope of various embodiments of the disclosure should be determined with reference to the appended claims, along with the full range of equivalents to which such claims are entitled.
In the foregoing Detailed Description, various features are grouped together in example embodiments illustrated in the figures for the purpose of streamlining the disclosure. This method of disclosure is not to be interpreted as reflecting an intention that the embodiments of the disclosure require more features than are expressly recited in each claim.
Rather, as the following claims reflect, inventive subject matter lies in less than all features of a single disclosed embodiment. Thus, the following claims are hereby incorporated into the Detailed Description, with each claim standing on its own as a separate embodiment.
The subject matter of this disclosure was made with government support under Government Contract Number DE-EE0001368 awarded by the US Department of Energy under program Condition-Based Maintenance for Wind Farms. Accordingly, the US Government has certain rights to subject matter disclosed herein.
Number | Date | Country | |
---|---|---|---|
61441518 | Feb 2011 | US |