The present application relates to a chance constrained extreme learning machine method for nonparametric interval forecasting of wind power, and belongs to the field of renewable energy power generation prediction.
At present, wind energy has become one of the main sources of renewable energy power generation due to its advantages such as wide distribution of resources, mature development technology and low investment cost. However, the chaotic nature of the atmospheric system leads to significant intermittency and uncertainty in wind power, posing a great challenge to the secure operation of the power system with a large share of wind power integration. High-precision wind power prediction provides key information support for power system planning and construction, operational control, market transactions, etc., and is one of the important means to help power system effectively cope with wind power uncertainty.
Traditional wind power prediction focuses on deterministic point prediction with a single point expected value as an output, so it is difficult to avoid prediction errors. Probabilistic forecasting effectively quantifies the uncertainty of wind power prediction by prediction interval, predictive quantile and predictive probability distribution, and provides more abundant and comprehensive information for decision makers, and thus has become research frontier in the field of wind power prediction. The prediction interval contains the future wind power with a certain confidence level, with clear probability interpretation and concise mathematical form, and thus is widely used in economic dispatch, optimal power flow, risk assessment and stability analysis of the power system. However, the existing methods need to prescribe the quantile proportions corresponding to the interval bounds in advance when deriving the wind power prediction interval. The common practice is to limit the interval bounds to be symmetrical about the median of the wind power to be predicted in the sense of probability. The limitation of asymmetric wind power probability distribution will lead to a conservative interval width and increase the potential operation cost for the power system to cope with the uncertainty of wind power. Therefore, it is necessary to invent a wind power interval prediction method with better flexibility, which can adaptively determine the quantile proportions of the interval bounds, and minimize the prediction interval width on the premise of meeting the nominal confidence level.
In view of the limitations of existing wind power interval prediction methods, the present application provides a wind power nonparametric interval prediction method based on a chance constrained extreme learning machine. This method does not depend on the parametric hypothesis of wind power probability distribution, does not need to specify the quantile proportions corresponding to the interval bounds in advance, but directly generates the wind power prediction interval meeting the confidence level with the goal of minimizing the interval width, and thus can adapt to symmetric or asymmetric wind power probability distribution under time-varying conditions. It is also suitable for interval prediction of other renewable energy generation power and load, and has good flexibility and adaptability
In order to achieve the above purpose, the present application adopts the following technical solution:
A wind power nonparametric interval prediction method based on a chance constrained extreme learning machine, comprising the following steps of:
(1) constructing a chance constrained extreme learning machine model
using the extreme learning machine as a regression function of upper and lower bounds of the wind power prediction interval, comprehensively considering joint probability distribution of a wind power and an input feature thereof, limiting the wind power to fall into the prediction intervals with a probability not lower than a nominal confidence level by using chance constraint, and taking an expectation of minimizing an interval width as a training objective, and constructing the chance constrained extreme learning machine model:
which is subject to:
where x is a random variable corresponding to the input feature, y is a random variable corresponding to normalized wind power, a joint probability distribution of the two is denoted as ΞΌ(x, y); f(x, Οl) and f(x, Οu) are output equations of the extreme learning machine, which represent lower and upper boundaries of the prediction interval, respectively, Οl and Οu are weight vectors from a hidden layer of the extreme learning machine to output neurons; 100(1βΞ²)% is the nominal confidence level of the prediction interval; and denote expectation and probability operators respectively;
(2) constructing a sample average approximate model of the chance constrained extreme learning machine
replacing the joint probability distribution of the input feature and wind power with an empirical probability distribution of training set samples thereof, approximating an actual expectation in an objective function by empirical expectation, and approximating an actual probability in the chance constraint by empirical probability to obtain the sample average approximate model of the chance constrained extreme learning machine;
which is subject to:
where v* is an optimal value of the optimization model, which donates the shortest overall width of prediction intervals satisfying the chance constraint; xt and yt are an input feature and the wind power; is a subscript set of various samples of the training set {(xt, yt), || is a number of the samples of the training set, Ξ²|| is a maximum number of wind power samples outside the prediction interval at the nominal confidence level; Ξ³t is an auxiliary variable indicating whether the wind power falls into the prediction interval, with a non-negative value indicating that the wind power falls into the corresponding prediction interval, or a positive value indicating that the wind power does not fall into the corresponding prediction interval; max{β } is a maximum function, taking a maximum value of each variable thereof; (β ) is an indicator function of a logical discriminant, with a value being 1 when the logical discriminant is true and 0 when the logical discriminant is false;
(3) constructing a parametric 0-1 loss minimization model;
introducing a virtual parametric variable to represent an overall width budget of the prediction interval, minimizing the probability that the wind power does not fall into the prediction interval on a premise of meeting the width budget to obtain the parametric 0-1 loss minimization model;
which is subject to:
where v is an introduced parameter representing the overall width budget of the prediction interval; Ο(v) is an optimal value function of the parametric 0-1 loss minimization model with the parameter v, and the smallest parameter that satisfies the condition Ο(v)β€Ξ²|| is the shortest overall width v* of the prediction intervals that satisfy the chance constraint;
(4) constructing a parametric difference of convex functions optimization model
approximating the indicator function in the objective function of the parametric 0-1 loss minimization model by a difference of convex function to obtain the parametric difference of convex functions optimization model:
which is subject to:
where
(5) adopting the difference of convex functions optimization based bisection search algorithm to train the extreme learning machine;
using the difference of convex functions optimization based bisection search algorithm to search for the shortest overall width v* of prediction intervals that satisfy the chance constraint, so as to realize the training of the extreme learning machine; and specifically comprising the following steps:
step (1), giving a bisection search algorithm precision β1 and a bisection search interval [], wherein the given bisection search interval should contain the shortest overall width v* of the prediction intervals;
Step (2): for the parametric difference convex functions optimization, setting the parameter v thereof as a midpoint ()/2 of the bisection search interval, and solving the difference of convex functions optimization model by using a convex-concave procedure algorithm:
step (2.1): giving an algorithm convergence accuracy β2, the slope parameter m of the difference of convex functions, and the parameter v representing the overall width budget of the prediction intervals;
step (2.2): setting an iteration counter kβ0; solving the following linear programming problem to obtain an initial solution ΞΈ(0) of the model:
which is subject to:
where 1 is a vector with all elements being 1, whose dimension is the same as the number of the samples in the training set;
step (2.3): updating the solution of the parametric difference of convex functions optimization model in a (k+1)th iteration by using the following formula:
which subject to:
where Lvex+(Ξ³) and Lvexβ(Ξ³) are both convex functions, which constitute a minuend and a subtrahend of the difference of convex function LDC(Ξ³t; m); Ξ΄(k) is a subdifferential of the convex function Lvexβ(Ξ³) at Ξ³(k), satisfying
where g β indicates that g is a real column vector with dimensionality equal to the number of samples || in the training set;
step (2.4): the iteration counter self-increasing kβk+1; calculating a convergence error eβΞΈ(k)βΞΈ(kβ1); and
step (2.5), checking whether a Euclidean norm β₯eβ₯2 of the convergence error meets the convergence accuracy β2, and if not, returning to the step (3), otherwise outputting the converged solution
step (3): calculating the number miss of samples of which the wind power in the training set falls outside the prediction interval
step (4), if missβ€Ξ²||, updating the upper boundary vuβ(n+vu)/2 of the bisection search interval and recording weight vectors β, Οuβ
step (5): if vuββ€β1, outputting the weight vectors and Οu of the output layer of the extreme learning machine, otherwise, returning to step (2).
The method has the following beneficial effects:
Aiming at the problem of nonparametric interval forecasts of wind power, a chance constrained extreme learning machine model is proposed. The model uses chance constraint to ensure that the confidence level of the prediction intervals meet the reliability requirements, and minimizes the interval width, avoiding the parametric hypothesis of wind power probability distribution and fixed quantile proportions of traditional prediction intervals, thereby realizing the self-adaptive direct construction of wind power prediction intervals; based on the training data, a sample average approximation model of the chance constrained extreme learning machine is established, and the sample average approximation model is transformed into a 0-1 loss minimization model and a parametric difference of convex functions optimization model, and the training of the extreme learning machine is transformed into searching the shortest overall interval width meeting the chance constraint; a bisection search algorithm based on difference of convex functions optimization is proposed to realize efficient training of the chance constrained extreme learning machine. The wind power prediction intervals obtained by the method have shorter interval width on the premise of ensuring a reliable confidence level, and provides more accurate uncertainty quantitative information for power system decision-making. In addition to wind power, the method of the present application is also applicable to interval forecasts of other renewable energy sources and loads, and thus has wide applicability.
The present application will be further explained with reference to the drawings and examples.
(1) a training data set ={(xt, yt) and a test data set ={(xt, yt)}tβv are constructed, wherein xt is an input feature, yt is a wind power value to be predicted, and and V subscript sets of the samples in the training set and test set respectively; the number of the hidden-layer neurons of the extreme learning machine is determined; the weight vector of the input layer and the bias of the hidden layer of extreme learning machine are randomly initialized; the nominal confidence level of the prediction interval 100(1βΞ²)% is determined.
(2) A sample average approximation model of the chance constrained extreme learning machine is constructed
which is subject to:
where f(x,) and f(x, Οu) are output equations of the extreme learning machine, which represent upper and lower boundaries of the prediction interval respectively, and Οu are weight vectors from the hidden layer of the extreme learning machine to an output neuron; Ξ³t is the auxiliary variable indicating whether the wind power falls into the prediction interval, a non-negative value indicating that the wind power falls into the corresponding prediction interval, and a positive value indicating that the wind power does not fall into the corresponding prediction interval; max{β } is the maximum function, taking the maximum value of each variable thereof; (β ) is the indicator function of a logical discriminant, taking 1 when the logical discriminant is true and 0 when the logical discriminant is false.
(3) The following parametric difference of convex functions optimization model is constructed:
which is subject to:
where v is the introduced parameter representing the overall width budget of the prediction intervals;
(4) For the parameter v in the difference convex optimization model, the bisection search algorithm is used to search for the shortest overall width v* of the prediction intervals that satisfy the chance constraint, so as to realize the training of the extreme learning machine; the algorithm specifically includes the following steps:
step (1), giving a bisection search algorithm precision β1 and a bisection search interval [], wherein the given bisection search interval should contain the shortest overall width v* of the prediction intervals;
Step (2): for the parametric difference of convex functions optimization, setting the parameter v thereof as the midpoint (+vu)/2 of the bisection search interval, and solving the difference of convex functions optimization model by using a convex-concave procedure algorithm as described in steps (2.1)-(2.5):
step (2.1): giving the algorithm convergence accuracy β2, the slope parameter m of the difference of convex functions, and the parameter v representing the overall width budget of the prediction intervals;
step (2.2): setting the iteration counter kβ0; solving the following linear programming problem to obtain an initial solution ΞΈ(0) of the model:
which is subject to:
where 1 is a vector with all elements being 1, whose dimension is the same as the number of the samples in the training set;
step (2.3): updating the solution of the parametric difference of convex functions optimization model in the (k+1)th iteration by the following formula:
which is subject to:
where Lvex+(Ξ³) and Lvexβ(Ξ³) are both convex functions, which constitute a minuend and a subtrahend of the difference of convex functions LDC(Ξ³t; m) respectively; Ξ΄(k) is a subdifferential of the convex function Lvexβ(Ξ³) at Ξ³(k), satisfying
where g β is a real column vector with dimensionality equal to the number of samples || in the training set;
step (2.4): the iteration counter self-increasing kβk+1; calculating a convergence error eβΞΈ(k)βΞΈ(kβ1); and
step (2.5), checking whether a Euclidean norm β₯eβ₯2 of the convergence error meets the convergence accuracy β2, and if not, returning to the step (3), otherwise outputting the converged solution
step (3): calculating the number miss of samples of which the wind power in the training set falls outside the prediction interval
step (4), if missβ€Ξ²||, updating the upper boundary vuβ(+vu)/2 of the bisection search interval and recording weight vectors β, Οuβ
step (5): if vuββ€β1, outputting the weight vectors and Οu of the output layer of the extreme learning machine, otherwise, returning to step (2).
(5) The trained extreme learning machine is used to construct the prediction intervals {[f (xt, ), f(xt, Οu])}tβv of the test set ={(xt, yt)}tβv, and the average coverage deviation (ACD) is used to evaluate the reliability of the prediction intervals, which is defined as the deviation between the empirical coverage probability (ECP) and the nominal confidence level 100(1βΞ²)%:
where, |V| is the number of the samples in the test set, the smaller the absolute value of the average coverage deviation, the better the reliability of the prediction interval;
The average width (AW) of the interval is used to evaluate the sharpness of the prediction interval, which is defined as
On the premise of well reliability of the prediction intervals, the smaller the average width of the prediction intervals, the higher the sharpness performance of the prediction intervals.
The above process is shown in
The effectiveness of the proposed method is verified by the actual wind power data from the Glens of Foudland Wind Farm in Scotland in 2017, and the time resolution of the data is 30 minutes. Considering the differences of seasonal characteristics, the wind power prediction model for each season is independently trained and verified, in which the training samples account for 60% of the data set in each season, and the remaining 40% samples are used as the test set. The leading time for prediction is 1 hour, and the nominal confidence of the prediction intervals is 95%.
Table 1 shows the performance indices of the prediction interval obtained by using sparse Bayesian learning, Gaussian kernel density estimation and the method of the present application. It can be seen that the absolute value of the average coverage deviation of the present application is less than 1.4%, and the empirical coverage probability is close to the nominal confidence level of 95%, which has excellent reliability; the average coverage deviation of sparse Bayesian learning in winter, summer and autumn data sets exceeds β2.6%, which is difficult to ensure the reliability of prediction; although Gaussian kernel density estimation has well reliability in data sets except winter, the prediction intervals obtained in winter, spring, summer and autumn are respectively 16.5%, 28.4%, 34.3% and 16.3% wider than those obtained by the method of the present application. To sum up, the method of the present application can effectively shorten the interval width on the premise of satisfactory reliability of the prediction interval.
The above description of the specific embodiments of the present application is not intended to limit the scope of protection of the present application. All equivalent models or equivalent algorithm flowcharts made according to the content of the description and drawings of the present application, which are directly or indirectly applied to other related technical fields, all fall within the scope of patent protection of the present application.
Number | Date | Country | Kind |
---|---|---|---|
202010255104.2 | Apr 2020 | CN | national |
The present application is a continuation of International Application No. PCT/CN2021/080895, filed on Mar. 16, 2021, which claims priority to Chinese Application No. 202010255104.2, filed on Apr. 2, 2020, the contents of both of which are incorporated herein by reference in their entireties.
Number | Date | Country | |
---|---|---|---|
Parent | PCT/CN2021/080895 | Mar 2021 | US |
Child | 17694689 | US |