The following refers to a method and a system for computer-implemented monitoring of a wind turbine, and lies in the field of structural health monitoring in order to determine the condition of a structure.
With respect to wind turbines, it is known that one method of diagnosing the state of the turbine is by the analysis of the vibrations of the rotating blades.
Document US 2010/0209243 A1 describes a method for monitoring blade frequencies of a wind turbine where the blade frequencies are extracted by analyzing the signals of an acceleration sensor positioned within the nacelle of the wind turbine. In this monitoring method, frequency differences between the blade frequencies of different blades are calculated and, in the case that one or more of the relative frequencies exceed a preset threshold, an alarm is generated.
It is an aspect of the present invention to provide a method and a system for computer-implemented monitoring of a wind turbine resulting in an enhanced detection of an abnormal behavior of the turbine.
The present invention provides a method for computer-implemented monitoring of a wind turbine, where the wind turbine comprises a tower, a nacelle, a hub and a plurality of blades, e.g. three blades. In this method, an activity signal is detected at subsequent time points for each blade. In an exemplary embodiment, the activity signals are blade frequencies resulting from vibrations of the respective blades. However, the activity signals may also refer to other kinds of signals detected for the respective blades, such as strain signals or moment signals. Strain signals may e.g. be detected by fibre optic strain gauges and moment signals may e.g. be calculated from detected strain signals. Furthermore, an activity signal may refer to a displacement or an acceleration of the corresponding blade, in the edgewise direction of the blade. The term detection is to be interpreted broadly. Particularly, the activity signal needs not be detected directly by a measurement but can also be derived from one or more measurements, e.g. from the measured signals of a G-sensor as described in the above-mentioned document US 2010/0209243 A1. In another embodiment, the activity signals in the form of blade frequencies may be detected by blade load sensors, where the blade frequencies are e.g. obtained by a phase lock loop (PLL) frequency estimation of the signals from the blade load sensors.
In the method of the present invention, the following steps are performed for each time point of at least some of the subsequent time points.
In a step a), an activity signal of each blade at the respective time point is predicted by a separate data-driven model, i.e. there is a separate data-driven model for each blade. The predicted activity signal is an output value of the respective data-driven model. One or more detected activity signals of blades other than the blade whose activity signal is the output value are input values of the respective data-driven model. The respective data-driven model has been learned by machine learning using training data comprising known input and output activity signals occurred in the past during an undamaged state of the wind turbine. In an exemplary embodiment, the learning of the data-driven model may be continued during monitoring of the wind turbine. In this case, the data-driven models are updated by machine learning at given time points using training data newly acquired during the operation of the wind turbine.
In a step b) of the method of the present invention, a residual between the predicted activity signal and the detected activity signal is determined for each data-driven model; e.g., the residual may be the difference or the absolute value of the difference between the predicted and the detected activity signals. Moreover, a residual may also refer to the square of this difference.
In a step c) of the method according to the present invention, a threshold criterion for one or more variables is checked, where the values of the one or more variables depend on the residuals for all data-driven models. This means that the values of the one or more variables as a whole depend on the residuals for all data-driven models. In other words, the value of a single variable may only depend on some residuals and needs not depend on the residuals for all data-driven models.
Based on the threshold criterion, a threshold is defined for each variable, where the threshold criterion is fulfilled if the value of any variable (i.e. at least one variable) exceeds the defined threshold. An appropriate choice of the thresholds lies within the expertise of a skilled person or can be determined by experiments. In an exemplary embodiment, the thresholds depend on the standard deviation of a frequency distribution of a time series of variable values as described below. In case that the value of any variable is equal to the threshold, this can be regarded as a case where the threshold criterion is fulfilled or, alternatively, as a case where the threshold criterion is not fulfilled.
In a step d) of the method according to the present invention, an abnormal operation state of the turbine is determined if the threshold criterion is fulfilled.
The present invention is based on the finding that a prediction of several learned data-driven models outputting the activity signal of a blade by inputs of activity signals of one or more other blades provides a reliable detection of an abnormal turbine state which is an indication of a damage which may result in a failure of the wind turbine in the near future.
In an exemplary embodiment, the detected activity signals are blade frequencies and blade edge frequencies resulting from the vibrations in the edgewise direction of the respective blades. The respective blade edge frequency is the fundamental frequency of the vibrations in the edgewise direction of the respective blade.
In another exemplary embodiment, the threshold for a respective variable depends on the standard deviation of a frequency distribution of a time series of values of the respective variable in the past, where the time interval of the time series does not cover the time interval of the training period of the data-driven models. The threshold lies between the standard deviation multiplied by two and the standard deviation multiplied by three. In a particularly exemplary embodiment, the threshold is the standard deviation multiplied by three. The above interval for choosing the respective threshold provides a particularly high reliability in detecting abnormal operation states.
In another exemplary embodiment, the one or more input values of the respective data-driven model comprise, additionally to the one or more detected activity signals of blades other than the blade which activity signal is the output value, one or more further environmental and/or operational parameters of the wind turbine, particularly an ambient temperature around the wind turbine and/or a power output of the wind turbine and/or a rotating speed of the wind turbine rotor and/or an azimuth angle of the respective blade and/or a pitch angle of the respective blade. The terms azimuth angle and pitch angle are well-known for a skilled person. The azimuth angle refers to the angle of the blade in a plane spanned by the blades of the turbine and the pitch angle refers to the angle the blade is facing towards the wind. The use of further inputs improves the fidelity of the predictions made by the data-driven models.
Any known data-driven models may be used in the method of the present invention; e.g., the respective data-driven models may be based on Gaussian processes and/or on a (artificial) neural network structure.
In another embodiment of the present invention, an alarm is recorded (i.e. stored in a corresponding storage) and/or an alarm is output via a user interface if an abnormal operation state of the turbine is determined, i.e. in case that the threshold criterion is fulfilled.
In a straightforward implementation of the present invention, the one or more variables processed in step c) of the method correspond to the residuals; i.e., the variables are identical with the residuals.
In another particular exemplary embodiment, a trend removal for removing a common trend in the time series of the residuals is applied to the residuals determined in step b), the result of the trend removal being the one or more variables processed in step c). Methods for trend removal are well-known for a skilled person; e.g., the trend removal may be based on cointegration, Auto-Associative Neural Networks or Principal Component Analysis.
In an exemplary variant of the embodiment, the trend removal is based on one or more cointegrated vectors, where the multiplication of the one or more cointegrated vectors with the residuals determined in step b) results in the one or more variables. Each cointegrating vector has been derived by cointegration applied to a time series of at least some residuals for different data-driven models occurred in the past during an undamaged state of the wind turbine. The time interval considered for cointegration is another time interval than the time interval comprising the data-driven model training period.
In another variant of the present invention, the trend removal is based on one or more transformations applied to the residuals determined in step b) and resulting in the one or more variables, each transformation having been derived by a Principal Component Analysis applied to a time series of at least two residuals for different data-driven models occurred in the past during an undamaged state of the wind turbine. The time interval of this time series is another time interval than the time interval comprising the data-driven model training period.
In another particularly exemplary embodiment, all possible pairs of residuals for different data-driven models are each processed by a separate trend removal procedure. This embodiment provides further information about the cause of a damage. Particularly, the blade which is damaged may be identified.
Besides the above method, the present invention refers to a system for computer-implemented monitoring of a wind turbine, the wind turbine comprising a tower, a nacelle, a hub and a plurality of blades, where for each blade an activity signal of the blade is detected at subsequent time points, where the system is configured to perform a method according to the present invention or a method according to one or more exemplary embodiments of the present invention.
The present invention also refers to a computer program product (non-transitory computer readable storage medium having instructions, which when executed by a processor, perform actions) with program code, which is stored on a non-transitory machine-readable carrier, for carrying out a method according to the present invention or a method according to one or more exemplary embodiments of the present invention, when the program code is executed on a computer.
The present invention also refers to a computer program with program code for carrying out a method according to the present invention or a method according to one or more exemplary embodiments of the present invention, when the program code is executed on a computer.
Some of the embodiments will be described in detail, with reference to the following figures, wherein like designations denote like members, wherein:
In the following, the present invention will be described based on a monitoring system for a wind turbine shown in
In
In the wind turbine of
For each of the detected blade edge frequencies together with the environmental and operational variables EOV, there exists a data-driven model. Those models are designated as DM1, DM2 and DM3. The data-driven model DM1 receives as inputs the blade edge frequency fA together with the variables EOV. The data-driven model DM2 receives as inputs the blade edge frequency fB together with the variables EOV. The data-driven model DM3 receives as inputs the blade edge frequency fC together with the variables EOV.
The data-driven model DM1 is assigned to the blade 5B and predicts its blade edge frequency, i.e. the data-driven model DM1 outputs the predicted blade edge frequency fBP of the blade 5B. The data-driven model DM2 is assigned to the blade 5C and outputs the predicted blade edge frequency fCP of the blade 5C. The data-driven model DM3 is assigned to the blade 5A and outputs the predicted blade edge frequency fAP of the blade 5A. The data-driven models DM1 to DM3 refer to well-known models which are learned by training data. In the embodiment described herein, the data-driven models are based on Gaussian processes.
The data-driven models have been learned before by machine learning based on training data of past operational values which have actually occurred (i.e. which have actually been detected) during operation of the wind turbine. Those training data form a time series of training data sets, where each training data set comprises the blade edge frequency and the environmental and operational variables input to the corresponding data-driven model as well as the blade edge frequency forming the output of the respective data-driven model. In order to distinguish between a healthy state and a damaged state of the turbine, the training data were only collected during a healthy state of the turbine. The time interval of the collected training data may e.g. be one year.
Based on the learned data-driven models DM1, DM2 and DM3 shown in
re
B
=f
B
−f
BP
re
C
=f
C
−f
CP
re
A
=f
A
−f
AP
Residual reB is with respect to the prediction of the blade edge frequency of blade 5B provided by data-driven model DM1. Residual rec is with respect to the prediction of the blade edge frequency of blade 5C provided by the data-driven model DM2. Residual reA is with respect to the prediction of the blade edge frequency of blade 5A provided by data-driven model DM3.
An essential feature of the embodiment of
In a straightforward implementation of the present invention, the residual errors may be used directly in order to identify an abnormal state of the wind turbine, i.e. a state which indicates a damage which may result in a failure of the turbine in the near future. To do so, a threshold may be defined directly for each residual and, in case that any of the residuals exceeds the threshold, an abnormal state will be detected. However, in order to enhance the diagnosis of the turbine, a normalization technique is implemented in the embodiment of
In the embodiment of
In each of the trend removal methods TR1, TR2 and TR3, a vector having two entries of different residuals is processed. In other words, in the trend removal method TR1, a vector comprising the residuals reB and reC is processed, as indicated by arrows P1. In the trend removal method TR2, a vector comprising the residuals rec and reA is processed, as indicated by arrows P2. In the trend removal method TR3, a vector comprising the residuals reB and reA is processed, as indicated by arrows P3. In the corresponding trend removal method, those vectors of residuals are multiplied with the cointegrating vector of the respective method.
For determining the respective cointegrating vectors, a time series of past pairs of residuals occurred during a normal operation of the turbine (i.e. in an undamaged state) are processed by cointegration. This cointegration will be described in the following where vectors are denoted in lowercase bold fonts and matrices are denoted in uppercase bold fonts. Cointegration is a well-known method. A description of cointegration is given in document Cross, Elizabeth J., Keith Worden, and Qian Chen. “Cointegration: a novel approach for the removal of environmental trends in structural health monitoring data.” Proceedings of the Royal Society of London A: Mathematical, Physical and Engineering Sciences. The Royal Society, 2011.
Let xt be a multivariate set of non-stationary measurements over time (the time is indicated by index t). In the embodiment of
The measurements xt are cointegrated if there exists a vector b that would yield a stationary signal zt via,
z
t
=b
T
x
t (1)
If such a solution exists, b is known as the cointegrating vector. This vector is determined beforehand in order to be used in the respective trend removal methods TR1, TR2 and TR3. Depending on the nature of a given set of non-stationary time series, there may exist more than one cointegrating vector, unique to uncertainty.
Prior to identifying the cointegrating vector, there exist some prerequisites. In addition to the fact that the signals in xt should share common trends, they should also be integrated to the same order. Here, the term “integrated to the same order” means that a non-stationary signal should be differenced n times before it becomes stationary and is denoted I(n); n is known as the order of integration.
In order to determine what the order of integration for a set of non-stationary signals is, a unit root test must be carried out. A unit root implies that the signal is intrinsically non-stationary. On the other hand, if the root is less than one, the signal is stationary, and if it is greater than one, the signal is a divergent process. The Augmented Dickey Fuller (ADF) test is commonly used to test for a unit root, where the null hypothesis is that the characteristic equation of a signal under consideration has at least one unit root. In the ADF test, the signal is fitted to a model of the form,
Δx
t
=αx
t−1+Σj=1p−1βjΔxt−t−j+εt (2)
where Δxt=xt−xt-1, and Δxt−1=xt−1xt−j−1. The number of lags, p allows for high order autoregressive processes, and needs to be chosen such that εt becomes a white noise process. The Akaike, Bayesian, or Hannan-Quinn information criteria (AIC, BIC, or HQIC, respectively) can aid in finding the optimum number of lags. In the embodiment described herein, the AIC was used. The coefficient a is the focal point of the ADF test. In principle, if α is 0, the signal is intrinsically non-stationary and hence contains a unit root. In reality, it is impossible for α to be precisely 0, and hence the problem is to determine whether α is statistically close to 0. This is performed by calculating a test statistic given by equation (3), and comparing it to critical ADF test statistic values. The test statistic is calculated using,
where {circumflex over (α)} is the estimate of a found using least squares, and se({circumflex over (α)}) is its standard error. The null hypothesis of the ADF test is accepted if {circumflex over (t)}>tc, where tc is the critical value found in the ADF test statistics tables. In this case, the set of solutions of the characteristic equation of the signal contains a unit root. The process is repeated once more for the differenced signal Δxt. In the second iteration, if the null hypothesis is rejected, then the original signal is non-stationary, and is integrated to order one, i.e. I(1). In the event that the null hypothesis of the ADF test is accepted, the above process is repeated for further differenced signals until the order of integration of the signal is found.
Having evaluated the orders of integration for all the signals, the next task is to identify b. In this process, only signals that are integrated of the same order can be used. There are several methods that can be employed to identify b. These include the Engle-Granger approach and the Johansen procedure. In the embodiment described herein, the Johansen procedure is used. The Johansen procedure is concerned with only I(1) signals, but there are methods to identify b for higher integrated orders. The following briefly details the Johansen procedure.
The Johansen procedure begins by fitting the signals into a vector autoregressive model, which can be stated as,
x
t
=M
1
x
t−1
+M
2
X
t−2
+ . . . +M
p
x
t−p+εt
where M1, M2, . . . , Mp are weight matrices xt−1, xt−2, . . . xt−p are lag terms, and εt is a Gaussian noise process. This step is preformed so that the model order, p can be determined by a suitable information criterion, such as AIC, which will be used in the subsequent evaluations. The next step of the Johansen procedure is to fit the signals to a vector error correction model (VECM), which can be stated as,
Δx
t
=ΔB
T
x
t−1+Σj=1p−1θjΔxt−j+εt (4)
where Δxt=xt−xt−1, θj is a weight for the corresponding difference term, and A and B are matrices containing relevant weights. Of these two matrices, B is the most crucial, since it contains the potential cointegrating vectors b. Equation (4) needs to be manipulated further to extract B; this is broken down into two simpler regression,
Δx
t=Σj=1p−1cjΔxt−j+r0t (5)
and
x
t−1
=Σ
j=1
p−1
d
j
Δx
t−j
+r
1t (6)
The residuals, r0t and r1t are related as follows,
r
0t
=ΔB
T
r
1t+εt (7)
Next, some product-moment matrices are defined,
S
ij=1/NΣt=1Nritrjt for i,j=0,1 (8)
These product-moment matrices are then used to solve,
(λiS11−S01TS00−1S01)v1=0 (9)
Equation (9) is an eigenvalue problem, with eigenvectors v1 and eigenvalues λi. The eigenvector corresponding to the largest eigenvalue in magnitude is the cointegrating vector b from the matrix B that provides the most stationary signal zt in equation (1). By multiplying this cointegrating vector in the respective trend removal methods TR1, TR2 and TR3 with the pair of residuals forming the input of the trend removal method, values of the variables va1, va2 and va3 as shown in
The process of identifying the cointegrating vectors is carried out in a period when the data available originate from a wind turbine that is considered normal (or healthy). Once the cointegrating vectors are obtained, they should be kept unchanged, and new data should be projected onto these cointegrating vectors. If the wind turbine continues to be healthy, the signal zt will continue to be stationary. If, however, the wind turbine becomes damaged, the signal zt (i.e. the values of the variables va1, va2 and va3) would become non-stationary.
As already mentioned above, the trend removal methods TR1, TR2 and TR3 provide values of variables va1, va2 and va3. Those variables are subjected to a threshold criterion TC, where a threshold TH1 is defined for variable va1, a threshold TH2 is defined for variable va2 and a threshold TH3 is defined for variable va3. In an exemplary embodiment, the thresholds are determined based on a time series of past values of the respective variables va1, va2 and va3 in a predetermined time interval. The time interval lies outside the time interval used for training the data models and covers an operation of the wind turbine where the turbine is in a healthy state. The thresholds TH1, TH2 and TH3 are determined based on the standard deviation a of the frequency distribution of the values of the respective variables va1, va2 and va3 in the considered time interval. The values for TH1, TH2 and TH3 lie between 2σ and 3σ of the respective frequency distribution. This choice ensures a reliable detection of an abnormal state of the wind turbine.
The threshold criterion TC is applied to the respective variables va1, va2 and va3. The threshold criterion is fulfilled if a value of any of the variables va1, va2 and va3 exceeds the corresponding threshold TH1 (for variable va1), TH2 (for variable va2) and TH3 (for variable va3). When exceeding the corresponding threshold, this is an indication that the predictions learned from a healthy state of the turbine are no longer correct, i.e. this is an indication of an abnormal or damage state of the turbine which may result in a failure of the turbine in the near future. This abnormal state is designated as AS in
In case that the method of
In the embodiment of
The present invention as described in the foregoing has several advantages. Particularly, the behavior of each blade in a wind turbine can be predicted, giving the performance of another blade, which forms a good baseline to compare with. Furthermore, non-stationary trends that can mask changes in blade properties are removed, resulting in an early and reliable damage detection of a turbine. Since fatigue damage/crack initiation and propagation occurs over a relatively long period of time, an abnormality can be determined very early with the method of the present invention based on the training of several data-driven models over healthy state data. Furthermore, environmental and operational variables which tend to obfuscate damages can be taken into account in the corresponding data-driven models as input, resulting in a more reliable detection of abnormal states.
Although the present invention has been disclosed in the form of preferred embodiments and variations thereon, it will be understood that numerous additional modifications and variations could be made thereto without departing from the scope of the invention.
For the sake of clarity, it is to be understood that the use of “a” or “an” throughout this application does not exclude a plurality, and “comprising” does not exclude other steps or elements.
Number | Date | Country | Kind |
---|---|---|---|
19156230.5 | Feb 2019 | EP | regional |
This application is a national stage entry of PCT Application No. PCT/EP2020/053145 having a filing date of Feb. 7, 2020, which claims priority to European Patent Application No. 19156230.5, having a filing date of Feb. 8, 2019, the entire contents of which are hereby incorporated by reference.
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/EP2020/053145 | 2/7/2020 | WO | 00 |