This U.S. patent application claims priority under 35 U.S.C. § 119 to: India Application No. 202421000058, filed on Jan. 1, 2024. The entire contents of the aforementioned application are incorporated herein by reference.
The disclosure herein generally relates to fault processing, and, more particularly, to a method and system for a Bayesian regression-based fault detection and diagnosis in wind turbine sensors and actuators.
Renewable energy sources have more importance in the context of ever increasing energy demand and adverse impact of fossil fuel on environment. Wind energy is such a renewable energy, which is a clean, non-polluting type of energy. The wind energy is harnessed using wind turbines, which has components such as rotor, blades, generator, and so on. Wind turbines, with all the mechanical components, are prone to failure or faulty operations, and hence require regular checks and maintenance. Faults in sensors and actuators are one of the major contributors to overall faults in wind turbine systems.
As with all mechanical systems, faults in wind turbines have characteristic signatures, which can be detected through sensor data acquisition. The effectiveness of a Fault Detection and Diagnosis (FDD) approach depends on how effectively these signatures are detected from the acquired signals. Conventionally, signal processing techniques have been widely employed for FDD. With the emergence of machine learning (ML), various classification-based models and anomaly detection techniques have gained popularity. In general, FDD methods can be classified into model-based or data-driven approaches. Both approaches map sensed operational data to specific faults. Model-based methods for FDD explicitly build an intermediate dynamics model for turbine operations using dynamical equations or using function approximations like neural networks. Given the sensed system state and inputs, faults are identified as some function of the residuals between model-expected values and observed values during current operation. On the other hand, the data-driven approaches typically classify the observed data (as either faulty or normal) using a machine learning or signal processing pipeline. Specifically, these data-driven methods do not use an intermediate dynamics model for turbine operations.
One disadvantage of the existing approaches is that they focus on fault detection alone and fail to diagnose even if a fault is detected, which adversely affects effectiveness with which maintenance can be performed. They provide little to no further information about the type of fault and its magnitude. Pure data-driven methods do not incorporate knowledge of the underlying system dynamics and may struggle with issues related to labeled data. Another disadvantage of the existing approaches is that they fail to take into consideration uncertainties that arise due to various factors such as, but not limited to, sensor noise and model inaccuracies. Providing a measure of uncertainty associated with a detected fault is important in critical systems where false alarms or missed faults can have significant consequences.
Embodiments of the present disclosure present technological improvements as solutions to one or more of the above-mentioned technical problems recognized by the inventors in conventional systems. For example, in one embodiment, a processor implemented method is provided. The method includes: collecting, via one or more hardware processors, data from a wind turbine as input data; generating, via the one or more hardware processors, a plurality of prime values associated with one or more parameters of the input data, by processing the input data using a dynamics model, wherein the plurality of prime values are an estimate of actual values of the one or more parameters of the input data; determining, using a fault model via the one or more hardware processors, correlation of a measured value of the one or more parameters of the input data with the associated prime values in presence of a plurality of fault signatures; determining a likelihood function, via the one or more hardware processors, wherein the likelihood function indicates probability of the measured value of the one or more parameters given the associated prime values and fault signatures; generating, using the Bayesian inference model via the one or more hardware processors, a posterior information by combining the likelihood function with a prior information, wherein the posterior information is an updation of the prior information in light of one or more new observations indicated in the likelihood function; and generating, via the one or more hardware processors, one or more fault signature distributions and associated magnitude and uncertainty, by processing the posterior information, wherein the one or more fault signature distributions is indicative of one or more probable faults in a sensor or actuator.
In an embodiment of the method, the input data comprises a plurality of state parameters, wind data, and measured values from the sensor or the actuator.
In another embodiment of the method, the one or more fault signature distributions capture a plurality of different types of faults associated with at least one of a sensor or actuator of the wind turbine.
In another embodiment of the method, the one or more probable faults are in one or more sensors or actuators of the wind turbine.
In yet another aspect, a system is provided. The system includes one or more hardware processors, a communication interface, and a memory storing a plurality of instructions. The plurality of instructions cause the one or more hardware processors to: collect data from a wind turbine as input data; generate, via the one or more hardware processors, a plurality of prime values associated with one or more parameters of the input data, by processing the input data using a dynamics model, wherein the plurality of prime values are an estimate of actual values of the one or more parameters of the input data; determine, using a fault model via the one or more hardware processors, correlation of a measured value of the one or more parameters of the input data with the associated prime values in presence of a plurality of fault signatures; determine a likelihood function, via the one or more hardware processors, wherein the likelihood function indicates probability of the measured value of the one or more parameters given the associated prime values and fault signatures; generate, using the Bayesian inference model via the one or more hardware processors, a posterior information by combining the likelihood function with a prior information, wherein the posterior information is an updation of the prior information in light of one or more new observations indicated in the likelihood function; and generate, via the one or more hardware processors, one or more fault signature distributions and associated magnitude and uncertainty, by processing the posterior information, wherein the one or more fault signature distributions is indicative of one or more probable faults in a sensor or actuator.
In an embodiment of the system, the input data comprises a plurality of state parameters, wind data, and measured values from the sensor or the actuator.
In another embodiment of the system, the one or more fault signature distributions capture a plurality of different types of faults associated with at least one of a sensor or actuator of the wind turbine.
In another embodiment of the system, the one or more probable faults are in one or more sensors or actuators of the wind turbine.
In yet another aspect, a non-transitory computer readable medium is provided. The non-transitory computer readable medium includes a plurality of instructions, which cause one or more hardware processors to: collect data from a wind turbine as input data; generate, via the one or more hardware processors, a plurality of prime values associated with one or more parameters of the input data, by processing the input data using a dynamics model, wherein the plurality of prime values are an estimate of actual values of the one or more parameters of the input data; determine, using a fault model via the one or more hardware processors, correlation of a measured value of the one or more parameters of the input data with the associated prime values in presence of a plurality of fault signatures; determine a likelihood function, via the one or more hardware processors, wherein the likelihood function indicates probability of the measured value of the one or more parameters given the associated prime values and fault signatures; generate, using the Bayesian inference model via the one or more hardware processors, a posterior information by combining the likelihood function with a prior information, wherein the posterior information is an updation of the prior information in light of one or more new observations indicated in the likelihood function; and generate, via the one or more hardware processors, one or more fault signature distributions and associated magnitude and uncertainty, by processing the posterior information, wherein the one or more fault signature distributions is indicative of one or more probable faults in a sensor or actuator.
In an embodiment of the non-transitory computer readable medium, the input data comprises a plurality of state parameters, wind data, and measured values from the sensor or the actuator.
In another embodiment of the non-transitory computer readable medium, the one or more fault signature distributions capture a plurality of different types of faults associated with at least one of a sensor or actuator of the wind turbine.
In another embodiment of the non-transitory computer readable medium, the one or more probable faults are in one or more sensors or actuators of the wind turbine. It is to be understood that both the foregoing general description and the following detailed description are exemplary and explanatory only and are not restrictive of the invention, as claimed.
The accompanying drawings, which are incorporated in and constitute a part of this disclosure, illustrate exemplary embodiments and, together with the description, serve to explain the disclosed principles:
Exemplary embodiments are described with reference to the accompanying drawings. In the figures, the left-most digit(s) of a reference number identifies the figure in which the reference number first appears. Wherever convenient, the same reference numbers are used throughout the drawings to refer to the same or like parts. While examples and features of disclosed principles are described herein, modifications, adaptations, and other implementations are possible without departing from the scope of the disclosed embodiments.
One disadvantage of the existing approaches is that they focus on fault detection alone, and fail to diagnose even if a fault is detected, which adversely affects effectiveness with which maintenance can be performed. They provide little to no further information about the type of fault and its magnitude. Pure data-driven methods do not incorporate knowledge of the underlying system dynamics and may struggle with issues related to labeled data. Another disadvantage of the existing approaches is that they fail to take into consideration uncertainties that arise due to various factors such as, but not limited to, sensor noise and model inaccuracies. Providing a measure of uncertainty associated with a detected fault is important in critical systems where false alarms or missed faults can have significant consequences.
In order to solve the aforementioned challenges, embodiments disclosed herein provide a method and system for a Bayesian regression-based fault detection and diagnosis in wind turbine sensors and actuators. The system performs the following steps for the Bayesian regression-based fault detection and diagnosis in wind turbine sensors and actuators:—collect data from a wind turbine as input data; generate, via the one or more hardware processors, a plurality of prime values associated with one or more parameters of the input data, by processing the input data using a dynamics model, wherein the plurality of prime values are an estimate of actual values of the one or more parameters of the input data; determine, using a fault model via the one or more hardware processors, correlation of a measured value of the one or more parameters of the input data with the associated prime values in presence of a plurality of fault signatures; determine a likelihood function, via the one or more hardware processors, wherein the likelihood function indicates probability of the measured value of the one or more parameters given the associated prime values and fault signatures; generate, using the Bayesian inference model via the one or more hardware processors, a posterior information by combining the likelihood function with a prior information, wherein the posterior information is an updation of the prior information in light of one or more new observations indicated in the likelihood function; and generate, via the one or more hardware processors, one or more fault signature distributions and associated magnitude and uncertainty, by processing the posterior information, wherein the one or more fault signature distributions is indicative of one or more probable faults in a sensor or actuator. The system addresses the cited challenge with the existing approaches by facilitating diagnosis of detected faults which gives their types and magnitudes. The system also takes into consideration various factors/parameters that contribute to uncertainties, thereby addressing another disadvantage of the existing approaches.
Referring now to the drawings, and more particularly to
max(min(x, b), a)
(μ, v2)
The I/O interface 112 may include a variety of software and hardware interfaces, for example, a web interface, a graphical user interface, and the like. The I/O interface 112 may include a variety of software and hardware interfaces, for example, interfaces for peripheral device(s), such as a keyboard, a mouse, an external memory, a printer and the like. Further, the I/O interface 112 may enable the system 100 to communicate with other devices, such as web servers, and external databases.
The I/O interface 112 can facilitate multiple communications within a wide variety of networks and protocol types, including wired networks, for example, local area network (LAN), cable, etc., and wireless networks, such as Wireless LAN (WLAN), cellular, or satellite. For the purpose, the I/O interface 112 may include one or more ports for connecting several computing systems with one another or to another server computer. The I/O interface 112 may include one or more ports for connecting several devices to one another or to another server.
The one or more hardware processors 102 may be implemented as one or more microprocessors, microcomputers, microcontrollers, digital signal processors, central processing units, node machines, logic circuitries, and/or any devices that manipulate signals based on operational instructions. Among other capabilities, the one or more hardware processors 102 is configured to fetch and execute computer-readable instructions stored in the memory 104.
The memory 104 may include any computer-readable medium known in the art including, for example, volatile memory, such as static random-access memory (SRAM) and dynamic random-access memory (DRAM), and/or non-volatile memory, such as read only memory (ROM), erasable programmable ROM, flash memories, hard disks, optical disks, and magnetic tapes. In an embodiment, the memory 104 includes a plurality of modules 106.
The plurality of modules 106 include programs or coded instructions that supplement applications or functions performed by the system 100 for executing different steps involved in the process of the Bayesian regression-based fault detection and diagnosis in wind turbine sensors and actuators. The plurality of modules 106, amongst other things, can include routines, programs, objects, components, and data structures, which performs particular tasks or implements particular abstract data types. The plurality of modules 106 may also be used as, signal processor(s), node machine(s), logic circuitries, and/or any other device or component that manipulates signals based on operational instructions. Further, the plurality of modules 106 can be used by hardware, by computer-readable instructions executed by the one or more hardware processors 102, or by a combination thereof. The plurality of modules 106 can include various sub-modules (not shown). The plurality of modules 106 may include computer-readable instructions that supplement applications or functions performed by the system 100 for the Bayesian regression-based fault detection and diagnosis in wind turbine sensors and actuators. In an embodiment, terms ‘wind turbine’ and ‘wind-turbine’ are used interchangeably.
The data repository (or repository) 110 may include a plurality of abstracted piece of code for refinement and data that is processed, received, or generated as a result of the execution of the plurality of modules in the module(s) 106.
Although the data repository 110 is shown internal to the system 100, it will be noted that, in alternate embodiments, the data repository 110 can also be implemented external to the system 100, where the data repository 110 may be stored within a database (repository 110) communicatively coupled to the system 100. The data contained within such external database may be periodically updated. For example, new data may be added into the database (not shown in
At step 202 of method 200 in
Further, at step 204 of the method 200, the system 100 generates, via the one or more hardware processors 102, a plurality of prime values associated with one or more parameters of the input data, by processing the input data using a dynamics model. The plurality of prime values are an estimate of actual values of the one or more parameters of the input data which are relevant to the faulty sensor or actuator under consideration. The actual values are also referred to as ‘model-expected values’. Details of the dynamic model are given below.
For the example of a faulty rotor speed sensor and pitch actuator under consideration, a model-based fault inference approach requires a model =(
,
) to estimate wind turbine dynamics. The model
estimates the true value of the rotor speed ω′r that is erroneously reported as ωr by the faulty rotor speed sensor. Likewise, the model
estimates the true value of the rotor pitch θ″ commanded by a pitch PI controller, which is wrongly set as θ by the faulty pitch actuator. Thus, given the wind turbine state vector st and the input wind speed ut, model
, gives the probability distribution p(ω′r
) and model
gives the probability distribution p(θ′t|st, ut;
).
The Dynamics model used by the system 100 is such a model =(
,
). The Dynamics model can be developed using wind turbine dynamics equations, giving rise to a mathematical (analytical) dynamics model
=(
,
). If such dynamics equations are unavailable, a completely data-driven model
=(
,
) is developed using neural networks as function approximators to the wind turbine dynamics. In this approach, data from the fault-free operation of the rotor speed sensor or the pitch actuator are used to train
and
, respectively.
Further, at step 206 of the method 200, the system 100 determines, using a fault model via the one or more hardware processors 102, correlation of a measured value of the one or more parameters of the input data with the associated prime values in presence of a plurality of fault signatures. The one or more fault signature distributions capture a plurality of different types of faults associated with at least one of a sensor or actuator of the wind turbine. Details of the fault model are given below.
A sensor is faulty when a measured output of the sensor deviates from a true value of the variable it is measuring, beyond some tolerance. Similarly, an actuator is faulty if the actuated output deviates from its commanded input. Refer to the table below.
Faulty signal of a general parameter ψ at time t is modeled as
Here, ψ′t is the true input to the sensor or actuator and ψmin t and ψmax t are the lower and upper bounds on ψt respectively. αt(1), αt(2), αt(3) are the gain, offset, and relative noise fault signatures respectively. Depending on the values of the fault signatures α(i) (for i=1, 2, 3), the general fault formulation in Equation (1) can incorporate various types of faults in sensors and actuators as given in Table 2.
In the method 200, general fault formulation model in (1) is contextualized to faults in the (rotor speed) sensor and the (rotor pitch) actuator. It is assumed that the faults in the sensor and the actuator are independent of each other. For simplification, the fault signatures ([α(1), α(2), α(3)]) are assumed to not vary with time i.e., there is no drift. Accordingly, two independent fault formulation models and
relate the faulty sensed/actuated quantities of ωrt and θt to the respective true (i.e., prime) inputs ω′r
Further, at step 208 of the method 200, the system 100 determines a likelihood function, via the one or more hardware processors 102. The likelihood function indicates probability of the measured value of the one or more faulty parameters under consideration given the associated prime values and fault signatures. Under the dynamics and fault formulation models described above, mixtures of censored Gaussian distributions are used to determine these likelihood functions.
Further, at step 210 of the method 200, the system 100 generates, using the Bayesian inference model used in the Bayesian regression approach used by the one or more hardware processors 102, a posterior information by combining the likelihood function with a prior information. The posterior information is an updation of the prior information in light of one or more new observations indicated in the likelihood function. Further, at step 212 of the method 200, the system 100 generates, via the one or more hardware processors 102, one or more fault signature distributions and associated magnitude and uncertainty, by processing the posterior information. The one or more fault signature distributions is indicative of one or more probable faults. In an embodiment, the one or more probable faults are in one or more sensors or actuators of the wind turbine.
The Bayesian inference approach used in steps 210 and 212 is elaborated below:
The Bayesian inference model is a statistical method for updating (prior) beliefs about a random parameter based on new evidence or observed data. Bayesian inference begins with a known/assumed prior probability distribution which represents the initial estimate of the parameter prior to receiving any actual data. As new observations/measurements come, the likelihood function indicates how the new data are distributed given different values of the parameter, i.e., the likelihood function quantifies the probability of observing the new data under suitable assumptions on the parameter. The posterior information includes probability distribution of each of these parameters, and is obtained by combining the prior information with the likelihood function. The posterior information represents the updated belief/estimate of a parameter being considered, after incorporating the new data. At this step, the system 100 obtains posterior distributions of the fault signatures α(i) (for i=1, 2, 3).
The system 100 may use Bayes' theorem to obtain the posterior information distribution. Specifically, the system 100 obtains the probability distributions of the fault signatures, posterior to observing the sensed/actuated value of the signals of interest ωrt or θt along with the input wind speed ut and the turbine state vector st under dynamics and fault formulation
models. The system 100 approximates these distributions at a given time t as:
Prior distributions: Bayesian methods require assigning appropriate prior distributions to the six fault signatures being inferred, which are represented by p([αω(1), αω(2), αω(3)]) and p([αθ(1), αθ(2), αθ(3)]) in Equations (4) and (5). These priors could be knowledge-driven or historical data-driven.
In Equations (4) and (5), (ωr
(θt|θ′t, αθ(1), αθ(2), αθ(3)), are the respective likelihood functions of the sensed rotor speed and the actuated rotor pitch. Mixtures (as estimates of ω′r
and the fault formulation model
.
Apart from the prior distributions and the likelihood functions, computation of the posterior distributions also requires determination of the terms p(ωrt|st, ut) and p(θt|st, ut) in Equations (4) and (5). These terms are difficult to compute analytically and hence, the system 100 may use numerical methods to approximate the posterior distributions of the fault signatures. The Bayesian inference uses a Markov Chain Monte Carlo (MCMC) sampling mechanism to approximate the posterior distributions of the fault signatures. In an embodiment, MCMC is used as an example, and the system 100 may use any other suitable approach for the purpose of approximating the posterior distributions.
Using MCMC, the system 100 obtains samples from the posterior distributions of the fault signatures. In MCMC, samples are generated as a chain with each subsequent sample depending on its previous sample. The system 100, using the sampling algorithm, drives the stationary distribution of the chain to converge to the desired posterior distribution so that after convergence, the chain samples are samples from the desired posterior distribution.
This section presents the dataset generation and the implementation details of the Bayesian approach used by the system 100. A data-driven binary classifier baseline model is used for comparison, and certain evaluation metrics are discussed.
a. Datasets Used:
For the experiments, simulated data was used to develop and evaluate the different inference methods. Two types of simulated datasets were generated: (1) an ideal dataset, and (2) a realistic dataset.
Two inference methods I and I
were developed, which differ in the type of dynamics model used (
and
, respectively). Both the methods use the same fault formulation model
=(
,
).
i. Method I
This inference method uses the analytical dynamics model =(
,
) based on the wind turbine dynamics equations. Use of mathematical equations in
M leads to deterministic estimates of the true rotor speed ω′r
ii. Method I
The dynamics model used in this inference method was the data-driven model =(
,
). Models
, and
were developed using neural networks to estimate ω′r
using sequential neural networks (NN) with dropout on the ideal
,
,
The models , and
were trained on the 200 special datasets generated, where the rotor speed sensor and the pitch actuator are fault-free, respectively. It should be noted that in the ideal dataset, fault-free operation of the speed sensor or the pitch actuator results in, ω′r
) and p(θ′t| st, ut;
) which helped in incorporating prediction uncertainty in the dynamics models. This prediction uncertainty could be used to estimate uncertainty in the inferred fault signatures downstream more accurately.
c. Prior Distributions and Likelihood Functions
Initialization of priors before starting any inference was done through domain knowledge of the typical values of fault signatures. Without any information about the faults, the following priors were used at the outset (at t=1 s):
These three prior distributions give a mode of 1, 0, and 0 for the gain, offset, and relative noise fault signatures respectively, which are the most likely values for healthy operation. The likelihood functions were given by the probability distribution function of a mixture of censored Gaussian distributions. Since M gives deterministic estimates of ω′r
is 1. Model
gives probabilistic estimates of ω′r
) and p(0′|st, ut;
) predicted by
and
respectively, resulting in 5 mixture components for method I
.
d. Posterior Distributions of Fault Signatures
Using the prior distributions, the likelihood functions, and the observed values of ωrt and θt, two independent inference models were run—one to infer the posterior distribution for the sensor fault signatures and the other to infer the actuator fault signatures αθ(i), 1, 2, 3. For each of the 100 faulty datasets (ideal or realistic), inference was implemented using the available data samples over a total period of 1-120s, split into two batches (1-60 s and 61-120 s). For the first batch, the prior distribution was the knowledge-driven one at t=1 s. Marginal posterior distributions for the fault signatures were estimated for each batch by Markov Chain Monte Carlo (MCMC) sampling using the Metropolis algorithm. Each run of the MCMC sampler had 3 chains with 20,000 burn-in (to allow the chain to converge to its stationary distribution) and 30,000 chosen samples (after the chain has converged) each. Thus, the sampled trace from all the chains has 90,000 samples for each fault signature that gives the estimated marginal posterior distribution for that signature. A Gaussian Kernel Density Estimator fit to these samples served as the prior distribution for the next batch that followed. This process continues till both the batches are used, eventually giving samples from estimates of p(αω(i)|ωr,
) and p(αθ(i)|θt, st, ut;
,
). This complete inference approach was implemented using Python library for probabilistic programming PyMC.
e. Classification Baseline,
The classifiers that are data-driven in nature have been predominantly used in literature for fault detection. Hence, a neural network-based binary (fault/no fault) classifier was developed as a baseline model. As wind turbine operational data were time-series, recurrent neural networks are the most suitable designs for building the classifiers as they use context from previous time indices while making predictions for a given time instant. Specifically, Gated Recurrent Unit (GRU)-based neural networks were used for the baseline model .
The same 100 simulated faulty datasets (ideal or realistic) that have been used to develop the Bayesian inference approaches were first split into training, validation and test sets, in the proportion of 40:10:50. Each training sample input to the GRU-neural network was a sequence spanning 120 s, which is the same as the total time period over which inference has been implemented by the Bayesian approaches. Two independent classifiers, one each for the rotor speed sensor (ω) and the rotor pitch actuator (
θ) were built. For these classifiers, sensed values of 6 parameters, namely ut, ωrt, τrt, ωgt, τgt, and θt are used as the input features. The output target was a class indicator from the set {0, 1}, in which Class 1 corresponds to the sensor/actuator being faulty while Class 0 represents the non-faulty case. The sensor/actuator is deemed faulty when any one of its fault signatures is beyond a user-defined acceptable fault tolerance. For the sake of demonstration, example tolerances were chosen such that:
To train the GRU-neural network classifiers, samples of input feature time sequences in the training set were first fed to a GRU. For the realistic dataset, the GRU was preceded by a masking layer that masks the time instants where data samples were missing. The output of the GRU was a fixed-dimensional vector for each sample that embedded the information from the input time sequence for that sample. Next, the output of the GRU was fed to a downstream sequential neural network with one hidden layer followed by the final output layer. Table 5 shows the hyperparameters chosen for the GRU-neural network classifiers for the rotor speed sensor (ω) and the rotor pitch actuator (
θ). These hyperparameters are tuned on the held-out validation set.
using GRU-neural
, ωg
, θt
f. Evaluation Metrics
This section discusses three metrics that were used to evaluate the inference approach used by the system 100. First two metrics have statistical rigor while the third one is a business metric.
The ideal energy output EH occurs when the faults are completely rectified in hardware (i.e., a full repair of the faulty sensor/actuator). RFTC defined above is the absolute percentage error between the ideal energy EH and ES which was the energy output through fault tolerant control-based rectification in software. The lower the RFTC is, the better is the inference method in its ability to improve the turbine output relative to the ideal energy output.
In this section, some key results from the evaluation of the different inference methods for both the ideal and realistic datasets, using different metrics, are discussed.
a. Posterior Log Likelihood (PLL)
To understand which method I or I
gives statistically significantly better inference for each fault signature, 100 values of PLL (one for each of the 100 simulated faulty datasets) given by method I
on the ideal dataset need to be statistically compared with those given by method I
on the same dataset. Hypothesis testing can be used for such a pairwise comparison between the two inference methods. One-sided Wilcoxon signed-rank test was used for such a hypothesis testing. The null hypothesis is that method I
has a PLL lower than (or equal to) that from method I
. The alternative hypothesis is that method I
has a PLL greater than that from method I
. The output from the Wilcoxon test is a p-value. If the p-value is less than a predetermined significance level, then the null hypothesis is rejected and the alternative hypothesis is accepted. Otherwise, the null hypothesis cannot be rejected.
Table 6 shows the p-value resulting from the hypothesis testing explained above for each fault signature. From the p-values, it can be observed that for all six fault signatures on the ideal dataset, at a 5% significance level, method I that uses analytical dynamics equations is statistically significantly better than method I
that uses neural networks to approximate dynamics. Thus, the inference follows the expected behavior of I
being better than I
, thereby increasing confidence in the approach in method 200.
and I
in terms of PLL.
<= PL
;
> PL
.
b. Classification Performance
The approach in method 200 gives the marginal posterior distribution of each fault signature. To use these methods in a binary classification setting where the fault signature can be classified as faulty (1) or not (0), its posterior distribution first needs to be compared with some fault tolerances.
In the presence of predefined fault tolerances: When predefined fault tolerances were available (say from sensor/actuator manufacturers), they can be used to label the inferred fault signatures as faulty (1) or not (0). A signature for which more samples from its inferred posterior distribution are within (than outside) the limits defined by the tolerances is labeled as non-faulty. The sensor/actuator with even one signature faulty is marked as faulty overall. for faulty/not faulty binary classification of the rotor speed sensor and the pitch actuator overall, computed on the 50 simulated faulty experiments comprising the test set that is neither used for training nor for validation of the classifier
. The same tolerances that have been used to train
were used here to mark the different fault signatures as faulty/non-faulty. It can be seen that on the ideal dataset, for the rotor speed sensor, the different inference methods have similar F1 scores that are marginally lower (up to 1%) than the score for the baseline classifier
. However, for the rotor pitch actuator, both the inference methods improve on the baseline classifier by at least 25%. On the realistic dataset, the different inference methods improve on the baseline classifier by at least 6% for the rotor speed sensor, and at least 15% for the rotor pitch actuator. Thus, the proposed model-based inference methods were observed to be giving a better classification performance than a pure data-driven classifier showing that incorporating dynamics helps in inference in a classification setting too.
In the absence of predefined fault tolerances, the 95% confidence intervals of the posterior distributions can be used as inference-driven tolerances for fault detection. If the confidence interval includes the ideal value of a signature, then the inferred signature corresponds to a fault-free (predicted label 0) condition, else it represents faulty (predicted label 1) condition. As mentioned in Table 2, these ideal values are 1 for the gain, 0 for the offset, and 0 for the relative noise fault signatures. The true label of the fault signature is 0 if its true value equals the ideal value, else it is 1. F1 score can then be used to compare the true and predicted labels.
Lastly, the inference methods give a lower 1 score for the actuator fault signatures than the corresponding sensor fault signatures in most cases, which could be a result of both upper and lower censoring for the pitch as compared to just lower censoring for the speed in the respective likelihood functions. Censoring can cause parameter non-identifiability leading to worse inference than that with no censoring.
c. Fault Tolerant Control Percentage Error (RFTC)
) are used for fault rectification in software. It also shows the power output when the faults are not rectified at all and when the faults are completely rectified in hardware. The faults were injected at t=1 s and fault rectification (either in software or in hardware) began at t=900 s. It can be seen from
Hypothesis testing was used next to do a pairwise comparison of the different inference methods in terms of the RFTC business metric from t=900 s to t=1200 s for all the 100 faulty experiments in the ideal and realistic datasets. For the ideal dataset, the null hypothesis is that method I has an RFTC greater than (or equal to) that from method I
. The alternative hypothesis was that method I
has an RFTC less than that from method I
. For the realistic dataset, the null hypothesis is that method I
has an RFTC equal to that from method I
. The alternative hypothesis is that method I
has an RFTC not equal to that from method I
. The Wilcoxon signed-rank test (one-sided for the ideal dataset and two-sided for the realistic dataset) is used for the hypothesis testing. Table 7 shows the resultant p-values. The table also shows the mean (±one standard deviation of the) RFTC across the 100 experiments for the different methods. From the table, it can be seen that the fault tolerant pitch control scheme in software gets, on an average, within at most 57% of the ideal energy improvement possible through a complete fault repair in hardware on the ideal dataset and within at most 55% on the realistic dataset. Thus, the method 200 combined with fault tolerant control in software could give at least a partial improvement in energy output without needing a costly and time-consuming repair of the faulty sensor or actuator that can cause turbine downtime.
The results indicated that the method I underperformed as compared to I
on the ideal dataset as I
gives a statistically significantly greater RFTC (by 36% on average) than I
. The two inference methods were statistically indistinguishable in terms of the RFTC on the realistic dataset.
and I
in terms of RFTC.
>=
; and
<
.
=
; and
#
.
d. Summary of Results
All the metrics indicated that, for the ideal dataset, using neural network approximation for dynamics worsens inference than using analytical dynamics equations due to an increased uncertainty in the estimates of the true rotor speed and commanded pitch. However, for the realistic dataset, the difference between the two inference methods is reduced due to the presence of real-world data issues. All the model-based inference methods were found to be better than a pure data-driven method of fault classification, thereby showing that incorporating dynamics helps in fault inference. The power of the Bayesian approach adopted by the system 100 is also highlighted by the usefulness of the 95% confidence intervals of the inferred posterior distributions as reasonable fault tolerances. Inference for the actuator fault signatures was found to be worse than that for the sensor fault signatures due to both lower and upper censoring leading to parameter non-identifiability. Lastly, the approach used in method 200 was found to be providing partial improvement in energy output via fault tolerant pitch control in software, without needing a complete repair in hardware.
The written description describes the subject matter herein to enable any person skilled in the art to make and use the embodiments. The scope of the subject matter embodiments is defined by the claims and may include other modifications that occur to those skilled in the art. Such other modifications are intended to be within the scope of the claims if they have similar elements that do not differ from the literal language of the claims or if they include equivalent elements with insubstantial differences from the literal language of the claims.
The embodiments of present disclosure herein address unresolved problem of fault analysis and diagnosis in wind turbine sensors and actuators. The embodiment, thus provides a Bayesian regression-based fault analysis in wind turbine sensors and actuators. Moreover, the embodiments herein further provide a mechanism for fault diagnosis using the Bayesian regression-based approach.
It is to be understood that the scope of the protection is extended to such a program and in addition to a computer-readable means having a message therein; such computer-readable storage means contain program-code means for implementation of one or more steps of the method, when the program runs on a server or mobile device or any suitable programmable device. The hardware device can be any kind of device which can be programmed including e.g., any kind of computer like a server or a personal computer, or the like, or any combination thereof. The device may also include means which could be e.g., hardware means like e.g., an application-specific integrated circuit (ASIC), a field-programmable gate array (FPGA), or a combination of hardware and software means, e.g., an ASIC and an FPGA, or at least one microprocessor and at least one memory with software processing components located therein. Thus, the means can include both hardware means and software means. The method embodiments described herein could be implemented in hardware and software. The device may also include software means. Alternatively, the embodiments may be implemented on different hardware devices, e.g., using a plurality of CPUs.
The embodiments herein can comprise hardware and software elements. The embodiments that are implemented in software include but are not limited to, firmware, resident software, microcode, etc. The functions performed by various components described herein may be implemented in other components or combinations of other components. For the purposes of this description, a computer-usable or computer readable medium can be any apparatus that can comprise, store, communicate, propagate, or transport the program for use by or in connection with the instruction execution system, apparatus, or device.
The illustrated steps are set out to explain the exemplary embodiments shown, and it should be anticipated that ongoing technological development will change the manner in which particular functions are performed. These examples are presented herein for purposes of illustration, and not limitation. Further, the boundaries of the functional building blocks have been arbitrarily defined herein for the convenience of the description. Alternative boundaries can be defined so long as the specified functions and relationships thereof are appropriately performed. Alternatives (including equivalents, extensions, variations, deviations, etc., of those described herein) will be apparent to persons skilled in the relevant art(s) based on the teachings contained herein. Such alternatives fall within the scope of the disclosed embodiments. Also, the words “comprising,” “having,” “containing,” and “including,” and other similar forms are intended to be equivalent in meaning and be open ended in that an item or items following any one of these words is not meant to be an exhaustive listing of such item or items, or meant to be limited to only the listed item or items. It must also be noted that as used herein and in the appended claims, the singular forms “a,” “an,” and “the” include plural references unless the context clearly dictates otherwise.
Furthermore, one or more computer-readable storage media may be utilized in implementing embodiments consistent with the present disclosure. A computer-readable storage medium refers to any type of physical memory on which information or data readable by a processor may be stored. Thus, a computer-readable storage medium may store instructions for execution by one or more processors, including instructions for causing the processor(s) to perform steps or stages consistent with the embodiments described herein. The term “computer-readable medium” should be understood to include tangible items and exclude carrier waves and transient signals, i.e., be non-transitory. Examples include random access memory (RAM), read-only memory (ROM), volatile memory, nonvolatile memory, hard drives, CD ROMs, DVDs, flash drives, disks, and any other known physical storage media.
It is intended that the disclosure and examples be considered as exemplary only, with a true scope of disclosed embodiments being indicated by the following claims.
Number | Date | Country | Kind |
---|---|---|---|
202421000058 | Jan 2024 | IN | national |