This application claims the priority of China patent application No. 201810606900.9 titled “Method for Estimating Meteorology Sensitive Load Power Based on Stacked Auto-Encoder” filed with the State Intellectual Property Office of the People's Republic of China on Jun. 13, 2018, and the priority of China patent application No. 201910430705.X titled “Meteorology Sensitive Load Power Estimation Method and Apparatus” filed with the State Intellectual Property Office of the People's Republic of China on May 22, 2019, disclosures of all of which are incorporated herein by reference in their entireties.
The present disclosure relates to the field of power system load forecasting and load power model, and more particularly relates to a method and apparatus for estimating a meteorology sensitive load power.
As global warming continues to intensify and living standards of citizens continue to improve, power consumption of meteorology sensitive loads mainly represented by air conditioners is increasing year by year. In 2017, summer air conditioning power consumption in some areas such as Suzhou has caused abnormal growth of the load. Studying the estimation of meteorology sensitive load power can not only improve the accuracy of the load power model and provide regulatory basis for safe and stable operation of the summer power grid, but also provide a basis for response capability assessment on a demand side, which has important research significance.
Patent application number 201810607600.2 provides a method for estimating a meteorology sensitive load power based on a load-meteorology nonlinear association model; but the model has high requirements on the integrity of load power and meteorology sample data. In practical, meteorology data, especially for a meteorology factor changing curve at a 10-minute sampling interval, can be easily lost. If more meteorology data of the current date is lost, the load-meteorology nonlinear association model cannot be used for estimating the meteorology sensitive load of that date.
In view of the above problems, the present disclosure provides a method and apparatus for estimating a meteorology sensitive load power, which is able to directly obtain a meteorology sensitive load power curve by a daily load curve, and is especially applicable to cases where meteorology data is frequently lost in practical applications.
The present disclosure adopts the following solutions.
In a first aspect, embodiments of the present disclosure provide a method for estimating a meteorology sensitive load power, including:
obtaining a meteorology sensitive load power estimation model;
inputting a daily load curve of a date to be estimated to the meteorology sensitive load power estimation model and extracting a daily load curve dimension reduction feature of the date to be estimated; and
outputting the meteorology sensitive load power based on the daily load curve dimension reduction feature of the date to be estimated and mapping relationships from daily load curve dimension reduction features onto meteorology sensitive load powers.
Optionally, obtaining the meteorology sensitive load power estimation model includes:
obtaining the meteorology sensitive load power estimation model by training, and testing the meteorology sensitive load power estimation model.
Optionally, the meteorology sensitive load power estimation model includes a stacked auto-encoder (SAE) model and a fully-connected layer;
where obtaining the meteorology sensitive load power estimation model by training includes:
training the SAE model and the fully-connected layer;
Inputting the daily load curve of the date to be estimated to the meteorology sensitive load power estimation model and extracting the daily load curve dimension reduction feature of the date to be estimated includes:
inputting the daily load curve of the date to be estimated to the SAE model and extracting the daily load curve dimension reduction feature of the date to be estimated.
outputting the meteorology sensitive load power based on the daily load curve dimension reduction feature of the date to be estimated and the mapping relationships from the daily load curve dimension reduction features onto the meteorology sensitive load powers includes:
outputting, by the fully-connected layer, the meteorology sensitive load power based on the daily load curve dimension reduction feature of the date to be estimated extracted by the SAE model and the mapping relationships from the daily load curve dimension reduction features onto the meteorology sensitive load powers.
Optionally, training the SAE model includes:
taking a historical data sample as input and output labels of the SAE model to train a first AE of the SAE model;
taking output of an encoding layer of the first AE as an input label to train a next AE of the SAE model until all AEs of the SAE model have been trained;
wherein a target function for the training is that a mean absolute percentage error (MAPE) of the output of the SAE model with respect to a daily load curve of a corresponding historical data sample is the minimum,
where xi is an actual daily load power, xi′ is output of the SAE model, and n is a total number of sample points.
Optionally, training the first AE of the SAE model satisfies the following formula:
h(1)i=sf(W1xi+b1);
where xi is output of the first AE of the SAE model, h(1)i output of the encoding layer of the first AE, W1 and b1 are respectively a weight matrix and a bias matrix, and sf is an activate function;
{circumflex over (x)}
i
=s
g(W1′h(1)i+b1′),
where {circumflex over (x)}i is the output of the first AE of the SAE model, W1′ and b1′ are respectively a weight matrix and a bias matrix in reconstruction, and sg is an activate function, in reconstruction;
where {circumflex over (x)}i and xi have a minimum mean squared error, θ* is an optimal fully-connected-layer parameter of the encoding layer and a decoding layer of the first AE, and N is a number of historical data samples.
Optionally, training the fully-connected layer includes:
taking the daily load curve dimension reduction feature of a historical data sample as an input label of the fully connected layer and a meteorology sensitive load power curve as an output label of the fully connected layer, to train the fully connected layer and obtain the optimal fully-connected-layer parameter θ′*, where a corresponding date of the daily load curve dimension reduction feature of the historical data sample is same as that of the meteorology sensitive load power curve;
where Oi is output of a last fully connected layer of an ith sample, PWi is a meteorology sensitive load power of the ith sample, and N′ is a number of dates of fully connected layer training samples.
Optionally, a computation formula of the fully connected layer satisfies
O=R(WI+b);
where I and O are respectively an input vector and an output vector of the fully connected layer, W and b are respectively a weight matrix and a bias matrix of the fully connected layer, and R is an activate function of the fully connected layer.
Optionally, the method further includes:
performing a normalization process on a historical data sample before training the SAE model; and
restoring a normalization calculation result of each sample output by the fully-connected layer after training the fully-connected layer.
In a second aspect, embodiments of the present disclosure provide a apparatus for estimating a meteorology sensitive load power, which includes: a stacked auto-encoder (SAE) model and a fully-connected layer.
The SAE model is configured for inputting a daily load curve of a date to be estimated, extracting a daily load curve dimension reduction feature of the date to be estimated, and inputting the daily load curve dimension reduction feature of the date to be estimated to a fully-connected layer; and
The fully-connected layer is connected to an output end of the SAE model and configured for outputting a meteorology sensitive load power based on the daily load curve dimension reduction feature of the date to be estimated and mapping relationships from daily load curve dimension reduction features onto meteorology sensitive load powers.
Optionally, a number of dimensions of the daily load curve of the date to be estimated is a number of sample points of the daily load curve of the date to be estimated; and a number of dimensions of the meteorology sensitive load power to be estimated is the number of sample points of the daily load curve of the date to be estimated.
Optionally, the SAE model is stacked by multiple AEs, and each of the plurality of AEs includes an encoding layer and a decoding layer.
The fully-connected layer comprises at least one layer.
In a third aspect, an embodiment of the present disclosure provides a method for estimating a meteorology sensitive load power based on a stacked auto-encoder, the method including: adding a multilayer fully-connected layer in an output end of a SAE model, and establishing a meteorology sensitive load power estimation model based on the SAE;
extracting a daily load curve dimension reduction feature by using an unsupervised training method of the SAE, using a meteorology sensitive load power curve as a labelled sample to train the fully-connected layer, to form mapping relationships from daily load curve dimension reduction features onto meteorology sensitive load powers at the fully-connected layer.
The estimation model is composed of two parts: a first part is a transitional SAE and a second part is multiple fully-connected layers stacked at an output end of the SAE.
Input of the estimation model is a daily load curve, a number of input dimensions is a number of sample points of the daily load curve; output of the estimation model is the meteorology sensitive load power and a number of output dimensions is the number of sample points of the daily load curve.
A forward propagation computation formula of the SAE is as follows:
input of a first layer of the SAE being xi, calculating output of an encoding layer of a first AE:
h(1)i=sf(W1xi+b1);
where W1 and b1 are respectively a weight matrix and a bias matrix, and sf is an activate function;
outputting by the encoding layer of the SE, and reconstructing an input vector through a decoding layer according to the following formula:
{circumflex over (x)}
i=(W1′h(1)i+b1′);
where W1′ and b1′ are respectively a weight matrix and a bias matrix in reconstruction, and sg is an activate function in reconstruction;
An unsupervised training method of the SAE is as follows:
training the SAE by using a historical daily daily load curve data sample as input and output labels of the SAE, and calculating an optimal fully-connected-layer parameter θ* of the encoding layer and decoding layer of the AE with a mean squared error of {circumflex over (x)}i calculated by the SAE with respect to the output label xi of the SAE being the minimum;
reserving h(1)i, using h(1)i as input and output labels of a next AE, continuing to train the next AE in the above manner, with input of the next AE being h(1)i, and so on, where the final SAE is stacked by multiple AEs.
A computation formula of the optimal fully-connected-layer parameter θ* of the encoding layer and the decoding layer of the AE is as follows:
where N is a number of training samples.
A forward propagation computation formula of the fully-connected layer is as follows:
O=R(WI+b);
in the formula, I and O are an input vector and an output vector of the fully connected layer respectively, W and b are a weight matrix and a bias matrix of the fully connected layer respectively, and R is an activate function of the fully connected layer.
A supervised training method of the fully-connected layer is as follows:
training by taking a deep layer feature of the daily load curve of a certain date after SAE dimension reduction as input of the fully connected layer, and taking the meteorology sensitive load power curve as the output label of the fully connected layer in a corresponding date, and calculating an optimal fully-connected-layer parameter θ′*:
in the formula, Oi is output of a last layer of the fully connected layer of an ith sample, PWi is a meteorology sensitive load power of an ith sample, and N′ is a number of dates of the meteorology sensitive load power.
The meteorology sensitive load power curve for a supervised training of the fully-connected layer is computed by the steps described below.
In step 1: data processing on a total load power and meteorology data of a certain region or a certain transformer station is performed and reordered, and a vertical data sample composed of a total load power and meteorology data of at the same time on the same date in different months is obtained. In step 2: a load-meteorology nonlinear association model among the total load power, the meteorology sensitive load power and various meteorological information is established, and model parameters is identified by using a gradient method.
In step 3, the identified model parameters, longitudinal historical meteorology data and total load power data is substituted into the association model, a longitudinal meteorology sensitive load power curve is calculated, and according to a normal time sequence is arranged to obtain a historical daily meteorology sensitive load power curve.
In step 1, the data processing of the total load power and the meteorology data includes data cleaning, removal of long-term increase of basic load power, correction calculation of various meteorology factors such as temperature accumulation, hysteresis effect and body-sensing temperature and humidity.
The step of correcting the meteorology factors is:
correcting the original temperature based on the temperature accumulation, hysteresis effect, the correction formula is:
T
DayMod=(Tday1λday1+Tday2λday2)/(λday1+λday2);
λday1=1−exp[−exp(Tday1−26/6)]
λday2=1−exp[−exp(Tday2−26/6)].
where TDayMod is a correction temperature after considering temperature accumulation effect, Tday1 is the original temperature of the current date; Tday2 is a correction temperature of a previous date; λday1 is a correction coefficient of the current date and Tday2 is a correction coefficient of the previous date.
The step of correcting the meteorology factor of the body-sensing temperature and humidity is:
H
T
=T
DayMod
H
TDayMod is a correction temperature after considering temperature accumulation effect, H is relative humidity, HT is a correction value of a humidity factor.
In step 2, the load-meteorology nonlinear association model is:
rXY is a correlation coefficient between the total load power and the meteorology sensitive load, X is the total load power processed and normalized in the step 1, Y is the meteorology sensitive load at the same time of the corresponding X,
In step 2, when parameters in the load-meteorology nonlinear association model are identified by using a gradient method, an objective function is established as the maximum value of the correlation coefficient, i.e.,
in the formula, m is a total number of samples, i is the ith sample, rXYi is the correlation coefficient between the total load power of the ith sample and the meteorology sensitive load.
In step 3, the identified model parameters, longitudinal corrected meteorology data and power data are substituted into the association model, the meteorology sensitive load power curve Y at the corresponding time of this longitudinal sample is calculated, maximum and minimum values of the meteorology sensitive load power curve Y are normalized and a ratio of the meteorology sensitive load power in the total load power is obtained.
in the formula, ρweather(j) is a meteorology sensitive load power ratio of a data in a certain sample, Y(j) is a jth meteorology sensitive load power estimation value Y of a certain sample, and Ymax, Ymin are the maximum value and the minimum value of the meteorology sensitive load respectively.
The actual meteorology sensitive load power estimation value is:
P
weather=ρweather·(Pmax−Pmin)
Pmax and Pmin are the maximum value and the minimum value of the total load power of this sample, ρweather is the meteorology sensitive load power ratio of this sample at a certain sampling time.
The longitudinal meteorology sensitive load power calculated by the association model is arranged according to a normal time sequence, and a daily meteorology sensitive load power curve arranged horizontally is obtained.
The present disclosure has the following beneficial effects:
the estimation model proposed can directly obtain the meteorology sensitive load power curve from the daily load curve, and is especially applicable to the case where meteorology data is frequently lost in practical applications. The SAE can extract the daily load curve dimension reduction feature without supervision, greatly reducing the number of input neurons of the fully connected layer, thereby greatly reducing network parameters of the fully connected layer and significantly reducing the model training difficulty.
Solutions according to the present disclosure will be described in further detail with reference to the drawings and specific embodiments, so that those skilled in the art can better understand and implement the present disclosure, but the embodiments are not intend to limit the present disclosure.
A stacked auto-encoder (SAE) estimation model uses the SAE unsupervised learning to extract a daily load curve dimension reduction feature, adds a multilayer fully-connected layer in an output end of the SAE, takes the dimension reduction feature as input of the SAE model, and takes an association model or traditional method for calculating the results as an output label of a fully connected layer, and trains the fully connected layer. In practical applications, the estimation model can directly obtain the meteorology sensitive load power curve from a daily load curve, thereby significantly improving the practicality of the method.
A structure of a meteorology sensitive load estimation model based on the SAE is illustrated in
The model training and the estimation step of the meteorology sensitive load are described below.
1. Unsupervised training is performed on the SAE by taking all daily load curves of April to October in a certain year as samples, thereby performing dimension reduction on the daily load curves and extracting deep features of the daily load curves.
Input of a first layer of the SAE is xi, output of an encoding layer of a first AE is calculated:
h(1)i=sf(W1xi+b1);
where W1 and b1 are respectively a weight matrix and a bias matrix, and sf is an activate function.
The above is output by the encoding layer of the SE, and an input vector is reconstructed through a decoding layer according to the following formula:
{circumflex over (x)}
i
=s
g(W1′h(1)i+b1′);
where W1′ and b1′ are respectively a weight matrix and a bias matrix in reconstruction, and sg is an activate function in reconstruction.
Then a historical daily daily load curve data sample is used for training, an optimal fully-connected-layer parameter θ* of the encoding layer and a decoding layer of the AE is sought according to the minimum mean squared error of {circumflex over (x)}i and xi, and the computation formula is as follows:
where N is a number of training samples, h(1)i is reserved, the training of a next AE is continued in the above manner, input of the next AE is h(1)i, and so on, the final SAE is stacked by multiple AEs.
2. The fully connected layer is trained by taking a calculation result of a calculation method for the meteorology sensitive load power curve as a labelled sample.
A computation formula of the fully connected layer is:
O=R(WI+b);
I and O are an input vector and an output vector of this layer respectively, W and b are a weight matrix and a bias matrix of the fully connected layer respectively, and R is an activate function of the fully connected layer.
An optimal fully-connected-layer parameter θ′* is calculated by training by taking a deep layer feature of the daily load curve of a certain date after SAE dimension reduction as input of the fully connected layer and the meteorology sensitive load power curve as the output label of the fully connected layer in a corresponding date,
in the formula, Oi is output of a fully connected layer of an ith sample, PWi is a meteorology sensitive load power of an ith sample, and N′ is a number of dates for which the meteorology sensitive load power may be calculated.
3. After the estimation model is trained, the daily load curve of the date to be estimated is taken as input, and the output of the model is the meteorology sensitive load power curve to be estimated.
A certain 220 kV transformer substation in a certain local city is taken as a research object for description of the implementation. The transformer substation includes industrial, commercial, residential and traction loads. The load type is comprehensive. The collected data is 2015 year-round load power of this substation (a sampling interval is 5 minutes), temperature and humidity data (a sampling interval is 10 minutes).
In step 1, sample data is prepared.
Due to incompleteness of meteorology data, a total of 70 pieces of daily meteorology sensitive load power curve data arranged in normal time order (4-10 months, and 10 dates per each month) is calculated, 65 pieces of data are used as labelled samples for training the fully connected layer of the SAE model and another 5 pieces are test samples.
Then 140 pieces of daily load curve data of all working dates of April to October in 2015 (214 dates in total, and includes 69 dates of holidates) are taken as unlabelled samples to train each SAE layer, and another 5 pieces are taken as test samples. The daily meteorology sensitive load power curve arranged in normal time order is calculated from the load-meteorology nonlinear association model.
In step 2, sample data is normalized.
Range normalization is performed on each sample of 70 pieces of meteorology sensitive load power data and 145 meteorology sensitive load data, i.e.,
in the formula, xi is ith data of a certain sample, xmin and xmax are the minimum value and the maximum value of the sample.
In step 3, Unsupervised training is performed on the SAE by taking all daily load curve data of April to October as samples, thereby performing dimension reduction on the daily load curves and extracting deep layer features of the daily load curves. A relative mean absolute percentage error (MAPE) minimum of decoder output and corresponding daily load curve data is taken as a training objective function. A computation formula of MAPE is:
xi is an actual daily load power, xi′ is a decoder output value, and n is a total number of sampling points.
After the actual testing, SAE super parameters are finally selected as: four SAE encoding layers and four SAE decoding layers, i.e., performing 4 times of auto-encoding process, and finally daily load data of 144 points is reduced to five deep feature parameters. Through dimension reduction, the number of input dimensions (5 dimensions) and the number of neurons of the fully connected layer are greatly reduced, i.e., the weight and bias parameter to be determined are greatly reduced, which effectively reduces training difficulty of the fully connected layer.
In step 4: the fully connected layer is trained by taking association model calculation results as labelled samples. A daily meteorology sensitive load power curve is calculated by taking the association model as the output label of the fully connected layer. The input sample is a deep layer feature of the daily load curve after SAE dimension reduction, and the trained objective function is the MAPE minimum.
Through the actual testing, two layers of fully connected layer are finally arranged, which respectively including 25 and 144 neurons. The activation function of the first layer is a ReLU function, and the second layer is a tanh function. Therefore, in practice, only two layers of the fully connected layers needs to be trained in the sample labelled with the meteorology sensitive load power curve.
In step 5, a normalization calculation result of each sample output by the complete model is restored:
y
i
=y
i′·(xmax−xmin)+xmin;
in the formula, yi′ is normalized meteorology sensitive load power value output by the model, xmax and xmin are the maximum actual value and the minimum actual value of the daily load curve sample input by the model.
In step 6, the SAE training result is tested.
The actual total load power value from July 27-30th is compared with the output curve after SAE encoding and decoding in the testing set, as illustrated in
It can be seen from the above table and
In step 7, a fully connected layer test result is tested.
A meteorology sensitive load power carve is compared with an association model result of test samples from July 27-30, as illustrated in
It can be seen from
In step 8, after the estimation model is trained and tested, the daily load curve of a date of the meteorology sensitive load power curve to be estimated is taken as the total input of the model, and the final output of the model is the meteorology sensitive load power curve to be estimated.
The above merely depicts some exemplary embodiments according to the present disclosure, and it should be noted that for those skilled in the art, numerous improvements and modifications can be made without departing from the principle of the present disclosure, where these improvements and modifications shall all fall in the scope of the present disclosure.
Number | Date | Country | Kind |
---|---|---|---|
201810606900.9 | Jun 2018 | CN | national |
201910430705.X | May 2019 | CN | national |