The present disclosure relates to a probability estimation method for photovoltaic power based on an optimized copula function and also relates to a photovoltaic power system using the probability estimation method for photovoltaic power, and belongs to the technical field of photovoltaic power generation.
With the advantages of abundant exploitable resources, low development and construction difficulty, and significant energy-saving and environmental benefits, distributed photovoltaic power stations are an important means of photovoltaic development and utilization. However, because weather conditions are unstable, photovoltaic power generation is highly intermittent and random, which poses a challenge to the planning and operation of existing power systems. Therefore, accurate prediction of photovoltaic power output is an important factor in connecting photovoltaic generation to the power system.
A copula function is a statistical tool for modeling the correlation between random variables. Modeling with a copula function can better represent the spatial correlation of a power generation system of distributed photovoltaic power stations, and the amount of data required is relatively small. However, a traditional single copula function has significant limitations and cannot fit the power data well, resulting in low accuracy when predicting the photovoltaic power.
To address the shortcomings of the prior art, the primary technical problem to be solved by the present disclosure is to provide a probability estimation method for photovoltaic power based on an optimized copula function.
A further technical problem to be solved by the present disclosure is to provide a photovoltaic power system using the above probability estimation method for photovoltaic power.
To achieve the above objective of the present disclosure, the present disclosure uses the following technical solution.
According to the first aspect of an embodiment of the present disclosure, a probability estimation method for photovoltaic power based on an optimized copula function is provided, including the following steps:
Preferably, step (1) includes the following sub-steps:
Preferably, the correlation analysis is used for determining meteorological factors affecting the photovoltaic output as the clustering elements; and
Preferably, the meteorological factors include atmospheric pressure, relative humidity and radiancy.
Preferably, in step (2), the plurality of copula function models include a Frank Copula function model and a hybrid Copula function model, the hybrid Copula function model being a weighted sum of the Frank Copula function model and other models in an Archimedean Copula function cluster model.
Preferably, the hybrid Copula function model is obtained by the following sub-steps:
Preferably, in step (3), an optimal Copula model corresponding to each weather type is selected from a Frank Copula model and an optimized hybrid Copula function model by comparing correlation coefficients and an error evaluation index under different weather.
Preferably, the correlation coefficients include: a Pearson correlation coefficient and a determination coefficient R2, and the error evaluation index is a root mean square error.
According to the second aspect of an embodiment of the present disclosure, a probability estimation apparatus for photovoltaic power based on an optimized copula function is provided, including a processor and a memory, the processor reading a computer program in the memory for executing the above probability estimation method for photovoltaic power based on an optimized copula function.
According to the third aspect of an embodiment of the present disclosure, a photovoltaic power system is provided, including a plurality of power generation units; a centralized photovoltaic power station or a distributed photovoltaic power station in each power generation unit being each connected to a power grid through a corresponding photovoltaic inverter and transformer. The photovoltaic power system uses the above probability estimation method for photovoltaic power based on an optimized copula function to estimate the photovoltaic power.
Compared with the prior art, the probability estimation method for photovoltaic power provided by the present disclosure creatively applies the combination of a clustering algorithm and the optimized copula model under weather classification to the field of predicting the power of the distributed photovoltaic power station, carries out weather clustering based on historical meteorological data, and constructs the copula function model based on the clustering results. Historical operation data and weather classification results are considered at the same time to make the obtained hybrid Copula function model have higher prediction accuracy. The probability estimation method for photovoltaic power predicts the power of the distributed photovoltaic power station through the centralized photovoltaic power station, solving the problem that operational data of the distributed photovoltaic power station is difficult to collect and providing strong support for the safe and stable operation of the photovoltaic power system.
The technical content of the present disclosure is described in detail below with reference to the accompanying drawings and specific embodiments.
An embodiment of the present disclosure provides a probability estimation method for photovoltaic power based on an optimized copula function. In the method, according to historical photovoltaic power data obtained from a centralized photovoltaic power station and a distributed photovoltaic power station, weather types are classified by a clustering method to obtain a plurality of weather types. Then, according to the cumulative distribution of the photovoltaic output obtained from photovoltaic data under different weather types, a plurality of copula function models for quantitatively representing the spatial correlation of the power of the centralized photovoltaic power station and the distributed photovoltaic power station are constructed, respectively, and for the characteristics of each weather attribute, the optimal representative weight of a Copula function is constructed, so as to select a corresponding optimal model for different weather, which is used for quantitatively expressing the spatial correlation of the power of the centralized photovoltaic power station and the distributed photovoltaic power station. According to the obtained data of the centralized photovoltaic power station, point prediction of the distributed photovoltaic power station is achieved through the optimal model of the corresponding weather. Based on the relationship between an actual value and a value of point prediction of the distributed photovoltaic power station, a conditional probability model is constructed, and through the conditional probability model, the probability distribution of the power of the distributed photovoltaic power station and the conditional probability corresponding to the value of point prediction are obtained. 
Finally, based on a real value of the power of the centralized photovoltaic power station at the future moment and in combination with the above conditional probability model, a predicted value of the power generated by the distributed photovoltaic power station at the future moment is obtained.
The specific steps of the above estimation method are described in detail below in combination with various embodiments.
As shown in
It is to be noted that the Copula function in the embodiment of the present disclosure (for a detailed description of the Copula function, see http://www.columbia.edu/~mh2078/QRM/Copulas.pdf) describes correlations between different variables. It is a class of functions that connect joint distribution functions to their respective marginal distribution functions, and is also known as a connecting function. In step 2, the plurality of copula function models include a Frank Copula function model and a hybrid Copula function model. The hybrid Copula function model is a weighted sum of the Frank Copula function model and other models in an Archimedean Copula function cluster model.
In step 1, performing weather clustering based on historical meteorological data in preparation for modeling of the copula function model, specifically includes the following sub-steps.
Step 1.1: historical photovoltaic power data is obtained and data cleaning is performed.
Specifically, the historical photovoltaic power data includes historical power data of the centralized photovoltaic power station and the distributed photovoltaic power station. On this basis, data cleaning is performed on the historical photovoltaic output values to exclude abnormal values as well as zero and negative values.
Optionally, the photovoltaic data collected in the embodiment of the present disclosure is the power data of the photovoltaic power station in January, February and December, with a sampling interval of 10 min.
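The cleaning rule of step 1.1 can be illustrated with a minimal sketch. The text does not define what counts as an abnormal value, so the sketch assumes, purely for illustration, that a reading above the rated capacity of the station is abnormal. The function name `clean_power_series`, the capacities, and the sample readings are hypothetical.

```python
def clean_power_series(samples, rated_capacity):
    """Keep only (centralized, distributed) pairs whose values are both plausible.

    A value is dropped when it is missing, zero, negative, or above the
    rated capacity. The capacity check is an assumed stand-in for the
    "abnormal value" test, which the text does not define.
    """
    def is_valid(value, capacity):
        return value is not None and 0.0 < value <= capacity

    return [
        (c, d)
        for c, d in samples
        if is_valid(c, rated_capacity[0]) and is_valid(d, rated_capacity[1])
    ]


# Hypothetical 10-minute samples in MW: (centralized, distributed).
raw = [(12.0, 1.1), (0.0, 0.0), (-0.3, 0.9), (55.0, 1.4), (20.5, None), (18.2, 1.6)]
cleaned = clean_power_series(raw, rated_capacity=(50.0, 2.0))
print(cleaned)
```

Pairs are removed together so that the centralized and distributed series stay aligned in time, which the later copula modeling relies on.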
Step 1.2: Meteorological data in the corresponding period of the historical photovoltaic power data is obtained, and based on correlation analysis, clustering elements are determined to cluster the weather to obtain the plurality of weather types.
Specifically, a k-means algorithm is used in the embodiment of the present disclosure, including the following sub-steps.
Step 1.2-1: the correlation analysis is used for determining meteorological factors affecting the photovoltaic output as the clustering elements.
The meteorological data for the dates corresponding to the historical photovoltaic output values is selected and subjected to correlation analysis, with the results as shown in Table 1 below. The finally determined meteorological factors, used as the clustering elements, are three characteristics: atmospheric pressure, relative humidity and radiancy. In the correlation analysis, the correlation of each factor with the power generated by photovoltaic power generation is comprehensively measured by computing the Pearson, Spearman, and Kendall correlation coefficients.
Pearson represents a Pearson correlation coefficient, Spearman represents a Spearman rank correlation coefficient, and Kendall represents a Kendall rank correlation coefficient.
In an embodiment of the present disclosure, the three factors with the highest correlations among the five meteorological factors, i.e., atmospheric pressure, relative humidity, and radiancy, are selected as classification data.
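The three correlation coefficients named above can be computed as in the following sketch, which uses only the Python standard library. The sample readings are hypothetical and are not the values of Table 1. The Kendall coefficient shown is the tau-a variant, chosen here as a simplifying assumption.

```python
import math


def pearson(x, y):
    """Pearson linear correlation coefficient."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = math.sqrt(sum((a - mx) ** 2 for a in x))
    sy = math.sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)


def ranks(values):
    """Average ranks (1-based), so tied values share the same rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    result = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1
        for k in range(i, j + 1):
            result[order[k]] = avg
        i = j + 1
    return result


def spearman(x, y):
    """Spearman rank correlation: Pearson correlation of the ranks."""
    return pearson(ranks(x), ranks(y))


def kendall(x, y):
    """Kendall tau-a: (concordant - discordant) / number of pairs."""
    n = len(x)
    score = 0
    for i in range(n):
        for j in range(i + 1, n):
            s = (x[i] - x[j]) * (y[i] - y[j])
            score += (s > 0) - (s < 0)
    return score / (n * (n - 1) / 2)


# Hypothetical readings: one meteorological factor against normalized PV output.
factor = [100.0, 300.0, 500.0, 700.0, 900.0]
power = [0.10, 0.35, 0.50, 0.80, 0.85]
print(pearson(factor, power), spearman(factor, power), kendall(factor, power))
```

Repeating this calculation for each candidate factor and ranking the factors by the magnitude of the coefficients is one way to arrive at a selection such as the one described above.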
Step 1.2-2: according to the determined clustering elements, the weather is clustered using a k-means algorithm. The results are as shown in
Here, using the k-means algorithm can better reflect the temporal and spatial correlation of the output of the distributed photovoltaic power station and improve the prediction accuracy to a certain extent.
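A minimal k-means sketch for the clustering of step 1.2-2 is given below. It assumes that the three clustering elements are standardized before clustering and that the initial centers are chosen by a deterministic farthest-point rule. Neither choice is stated in the text. The daily readings are hypothetical.

```python
import math


def standardize(rows):
    """Scale each column to zero mean and unit variance so no feature dominates."""
    cols = list(zip(*rows))
    means = [sum(c) / len(c) for c in cols]
    stds = [math.sqrt(sum((v - m) ** 2 for v in c) / len(c)) or 1.0
            for c, m in zip(cols, means)]
    return [[(v - m) / s for v, m, s in zip(row, means, stds)] for row in rows]


def dist2(p, q):
    """Squared Euclidean distance."""
    return sum((a - b) ** 2 for a, b in zip(p, q))


def farthest_point_init(points, k):
    """Deterministic seeding: start at the first point, then add the farthest ones."""
    centers = [points[0]]
    while len(centers) < k:
        centers.append(max(points, key=lambda p: min(dist2(p, c) for c in centers)))
    return centers


def kmeans(points, k, max_iter=100):
    centers = farthest_point_init(points, k)
    labels = None
    for _ in range(max_iter):
        # Assignment step: each point joins its nearest center.
        new_labels = [min(range(k), key=lambda c: dist2(p, centers[c])) for p in points]
        if new_labels == labels:
            break
        labels = new_labels
        # Update step: each center moves to the mean of its members.
        for c in range(k):
            members = [p for p, l in zip(points, labels) if l == c]
            if members:
                centers[c] = [sum(col) / len(members) for col in zip(*members)]
    return labels, centers


# Hypothetical daily means: (pressure in hPa, relative humidity in %, radiancy in W/m^2).
days = [
    (1022, 35, 820), (1020, 40, 790), (1021, 38, 805),
    (1012, 60, 450), (1011, 63, 430), (1013, 58, 470),
    (1003, 88, 120), (1002, 90, 100), (1004, 85, 140),
]
labels, centers = kmeans(standardize(days), k=3)
print(labels)
```

The cluster indices carry no meaning by themselves. Mapping them to names such as sunny, cloudy, or overcast requires inspecting the cluster centers.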
In step 2, for the obtained weather types, the cumulative distribution of the photovoltaic output under each weather type is calculated in order to establish the Frank Copula function model and the hybrid Copula function model, respectively.
Step 2-1: according to the cumulative distribution of the photovoltaic output, a correlation coefficient λ value under each weather type is obtained, and the Frank Copula function model is established.
The frequency distributions are observed, as shown in
In Equation (1), u and v are the two marginal distribution variables, and λ is the correlation coefficient. According to the cumulative distribution of the photovoltaic output, a λ value under each weather type can be obtained, which gives the Frank Copula function model under each weather type.
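Equation (1) is not reproduced in this text. The sketch below therefore assumes the standard textbook form of the Frank copula, C_F(u, v; λ) = -(1/λ) ln(1 + (e^(-λu) - 1)(e^(-λv) - 1) / (e^(-λ) - 1)), and shows one common way of obtaining u and v as empirical cumulative distribution values. The rank / (n + 1) convention and the sample values are assumptions.

```python
import math


def pseudo_observations(values):
    """Empirical CDF values rank / (n + 1), which stay strictly inside (0, 1)."""
    n = len(values)
    order = sorted(range(n), key=lambda i: values[i])
    u = [0.0] * n
    for rank, idx in enumerate(order, start=1):
        u[idx] = rank / (n + 1)
    return u


def frank_cdf(u, v, lam):
    """Frank copula C_F(u, v; lam) in its standard form, for lam != 0."""
    num = math.expm1(-lam * u) * math.expm1(-lam * v)
    return -math.log1p(num / math.expm1(-lam)) / lam


# Hypothetical normalized outputs of the two stations at the same instants.
central = [0.62, 0.15, 0.48, 0.91, 0.33]
distributed = [0.58, 0.20, 0.41, 0.87, 0.30]
u = pseudo_observations(central)
v = pseudo_observations(distributed)
print([round(frank_cdf(a, b, 5.0), 4) for a, b in zip(u, v)])
```

The value 5.0 for λ is arbitrary. How λ is estimated for each weather type is not specified here beyond the statement above.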
Step 2-2: based on other functions in an Archimedean Copula function cluster other than a Frank Copula function, the Copula function cluster model corresponding to each weather is constructed, and the Copula function cluster model and the Frank Copula function model are weighted and summed according to weights to obtain an optimized hybrid Copula function model.
In an embodiment of the present disclosure, on the basis of the Archimedean Copula function cluster, parameters are solved and optimized by an optimization algorithm, which can further improve the accuracy of prediction. Specifically, in addition to the Frank Copula function, two other functions in the Archimedean Copula function cluster are also used in the embodiment of the present disclosure: a Gumbel Copula function and a Clayton Copula function, as shown in Equation (2) and Equation (3), respectively.
The optimized hybrid Copula function model is obtained after weighting, as shown in Equation (4):
$$C_H(u, v; \lambda_1, \lambda_2, \lambda_3) = A\,C_F(u, v; \lambda_1) + B\,C_C(u, v; \lambda_2) + C\,C_G(u, v; \lambda_3) \tag{4}$$
In Equation (4), A, B and C are the weight coefficients of the Frank Copula, Clayton Copula and Gumbel Copula, respectively; and λ1, λ2, λ3 are the corresponding correlation coefficients.
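The weighted sum of Equation (4) can be sketched as follows. Because Equations (1) to (3) are not reproduced in this text, the sketch assumes the standard textbook forms of the Frank, Clayton, and Gumbel copulas. It also assumes that the weights are non-negative and sum to 1, which keeps the weighted sum a valid copula. This constraint is not stated explicitly in the text.

```python
import math


def frank_cdf(u, v, lam):
    """Frank copula, standard form, lam != 0."""
    num = math.expm1(-lam * u) * math.expm1(-lam * v)
    return -math.log1p(num / math.expm1(-lam)) / lam


def clayton_cdf(u, v, lam):
    """Clayton copula, standard form, lam > 0."""
    if u <= 0.0 or v <= 0.0:
        return 0.0
    return (u ** -lam + v ** -lam - 1.0) ** (-1.0 / lam)


def gumbel_cdf(u, v, lam):
    """Gumbel copula, standard form, lam >= 1."""
    if u <= 0.0 or v <= 0.0:
        return 0.0
    # abs(log(u)) equals -log(u) on (0, 1] and avoids a negative zero at u = 1.
    s = abs(math.log(u)) ** lam + abs(math.log(v)) ** lam
    return math.exp(-s ** (1.0 / lam))


def hybrid_cdf(u, v, weights, lams):
    """Equation (4): A * C_F + B * C_C + C * C_G."""
    a, b, c = weights
    l1, l2, l3 = lams
    return (a * frank_cdf(u, v, l1)
            + b * clayton_cdf(u, v, l2)
            + c * gumbel_cdf(u, v, l3))


# Arbitrary illustrative weights and parameters, not fitted values.
weights = (0.5, 0.3, 0.2)
lams = (4.0, 2.0, 1.5)
print(hybrid_cdf(0.3, 0.6, weights, lams))
```

With weights that sum to 1, the hybrid value at v = 1 reduces to u, as required of any copula, which gives a quick sanity check on an implementation.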
The weight coefficients A, B, C and the correlation coefficients λ1, λ2, λ3 are solved as follows. Candidate values of the weight coefficients and the correlation coefficients are substituted into Equation (4), based on Equations (1) to (3), to obtain the model copula value. The empirical copula value computed from the sample data is used as the reference value, and the error between the model copula value and the empirical copula value is taken as the objective function. A genetic algorithm is then used to minimize this objective function, which yields all the parameter values to be solved, i.e., the weight coefficients and the correlation coefficients λ1, λ2, λ3.
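The parameter search can be illustrated with a small genetic algorithm. The sketch below is only one possible arrangement and is not taken from the disclosure: it assumes the reference copula value is the empirical copula of the samples, uses the root mean square error as the objective, and applies truncation selection with elitism, arithmetic crossover, and Gaussian mutation. The search ranges in `BOUNDS`, the population settings, and the synthetic data are all hypothetical.

```python
import math
import random


def frank(u, v, lam):
    num = math.expm1(-lam * u) * math.expm1(-lam * v)
    return -math.log1p(num / math.expm1(-lam)) / lam


def clayton(u, v, lam):
    # Inputs are pseudo-observations strictly inside (0, 1), so no zero guard is needed.
    return (u ** -lam + v ** -lam - 1.0) ** (-1.0 / lam)


def gumbel(u, v, lam):
    s = abs(math.log(u)) ** lam + abs(math.log(v)) ** lam
    return math.exp(-s ** (1.0 / lam))


def hybrid(u, v, g):
    a, b, c, l1, l2, l3 = g
    return a * frank(u, v, l1) + b * clayton(u, v, l2) + c * gumbel(u, v, l3)


def pseudo_observations(values):
    order = sorted(range(len(values)), key=lambda i: values[i])
    u = [0.0] * len(values)
    for rank, idx in enumerate(order, start=1):
        u[idx] = rank / (len(values) + 1)
    return u


def empirical_copula(us, vs):
    """Share of samples with u_j <= u_i and v_j <= v_i, for every sample i."""
    n = len(us)
    return [sum(1 for a, b in zip(us, vs) if a <= u and b <= v) / n
            for u, v in zip(us, vs)]


# Assumed search ranges: three weights, then lambda for Frank, Clayton, Gumbel.
BOUNDS = [(0.0, 1.0)] * 3 + [(0.1, 30.0), (0.1, 30.0), (1.0, 30.0)]


def repair(g):
    """Clip genes to their ranges and rescale the weights to sum to 1."""
    g = [min(max(x, lo), hi) for x, (lo, hi) in zip(g, BOUNDS)]
    total = sum(g[:3])
    weights = [w / total for w in g[:3]] if total > 0 else [1 / 3] * 3
    return weights + g[3:]


def fit_hybrid(us, vs, pop_size=40, generations=60, seed=1):
    rng = random.Random(seed)
    target = empirical_copula(us, vs)

    def loss(g):
        sq = sum((hybrid(u, v, g) - t) ** 2 for u, v, t in zip(us, vs, target))
        return math.sqrt(sq / len(us))

    pop = [repair([rng.uniform(lo, hi) for lo, hi in BOUNDS]) for _ in range(pop_size)]
    history = []
    for _ in range(generations):
        pop.sort(key=loss)
        history.append(loss(pop[0]))
        elite = pop[: pop_size // 4]  # the best quarter survives unchanged
        children = []
        while len(elite) + len(children) < pop_size:
            p1, p2 = rng.sample(elite, 2)
            t = rng.random()
            child = [t * x + (1 - t) * y for x, y in zip(p1, p2)]  # arithmetic crossover
            child = [x + rng.gauss(0.0, 0.1 * (hi - lo)) if rng.random() < 0.2 else x
                     for x, (lo, hi) in zip(child, BOUNDS)]  # Gaussian mutation
            children.append(repair(child))
        pop = elite + children
    best = min(pop, key=loss)
    history.append(loss(best))
    return best, history


# Synthetic, positively dependent data standing in for real station outputs.
data_rng = random.Random(0)
central = [data_rng.random() for _ in range(60)]
distributed = [0.8 * c + 0.2 * data_rng.random() for c in central]
us, vs = pseudo_observations(central), pseudo_observations(distributed)
best, history = fit_hybrid(us, vs)
print("weights:", best[:3], "lambdas:", best[3:], "error:", history[-1])
```

Because the best quarter of each generation is carried over unchanged, the best error never increases from one generation to the next. A production implementation would likely use a dedicated optimization library and a convergence criterion instead of a fixed generation count.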
In the above embodiment, the optimized hybrid Copula function model is obtained based on the Archimedean Copula function cluster, which improves the prediction accuracy to a certain extent and can better fit the power data. The parameter values and weight values in the function cluster are determined according to the optimization algorithm, which overcomes the limitations of the traditional single copula function and is particularly suitable for fitting photovoltaic power data.
In step 3, the optimal Copula model corresponding to each weather type is selected from the Frank Copula model and the optimized hybrid Copula function model by comparing the correlation coefficients and an error evaluation index under different weather.
Specifically, the correlation coefficients and the error index are as shown in Table 2 below, and the optimal Copula model under each weather type is selected. The selected coefficients include: a Pearson correlation coefficient and a determination coefficient R2, and the error evaluation index is a root mean square error (RMSE). Due to the error relationship and similarity between the real value and the model predicted value, a model with the best index under each weather type is selected as the optimal Copula model according to the above evaluation index.
It is to be noted that: in Table 2, the single model refers to the Frank Copula model, and the hybrid Copula model refers to the hybrid Copula function model in Equation (4).
The Pearson correlation coefficient (PCC) indicates the trend and the degree of change between two variables, taking values between −1 and +1, with 0 representing no correlation, a positive value representing positive correlation, and a negative value representing negative correlation; and the larger the absolute value is, the stronger the correlation is.
The determination coefficient R2, also known as the goodness of fit, is the square of the correlation coefficient r and indicates the proportion of the variation in the dependent variable that can be explained by the variation in the independent variable. The magnitude of R2 reflects the closeness of the correlation: the greater the goodness of fit is, the higher the degree of explanation of the dependent variable by the independent variable is, the higher the percentage of the variation caused by the independent variable in the total variation is, and the more densely the observation points cluster around the regression line.
For both a cloudy day and an overcast day, the error index of the hybrid Copula model is superior to that of the Frank Copula model, so the optimized hybrid Copula model is applied preferentially under these weather conditions. For a sunny day, the determination coefficient R2 of the optimized hybrid Copula model is improved, but its RMSE is worse than that of the Frank Copula model. The two models have equal Pearson correlation coefficients, so their effects are similar under this condition, and the hybrid Copula model can also be selected for a sunny day.
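The three evaluation indices and a simple selection rule can be sketched as follows. Two points are assumptions made for illustration. First, R2 is computed as 1 − SS_res / SS_tot between actual and predicted values, which coincides with the squared correlation coefficient only for a least-squares linear fit. Second, the rule "lowest RMSE, ties broken by higher R2" is a hypothetical stand-in, since the text only says that the model with the best index is chosen. The numbers are not those of Table 2.

```python
import math


def rmse(actual, predicted):
    """Root mean square error."""
    return math.sqrt(sum((a - p) ** 2 for a, p in zip(actual, predicted)) / len(actual))


def r_squared(actual, predicted):
    """Coefficient of determination, 1 - SS_res / SS_tot."""
    mean = sum(actual) / len(actual)
    ss_res = sum((a - p) ** 2 for a, p in zip(actual, predicted))
    ss_tot = sum((a - mean) ** 2 for a in actual)
    return 1.0 - ss_res / ss_tot


def pearson(x, y):
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = math.sqrt(sum((a - mx) ** 2 for a in x))
    sy = math.sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)


def select_model(actual, predictions):
    """Score every candidate model and return the name of the preferred one."""
    scores = {
        name: {
            "pearson": pearson(actual, pred),
            "r2": r_squared(actual, pred),
            "rmse": rmse(actual, pred),
        }
        for name, pred in predictions.items()
    }
    best = min(scores, key=lambda name: (scores[name]["rmse"], -scores[name]["r2"]))
    return best, scores


# Hypothetical normalized outputs for one weather type.
actual = [0.20, 0.40, 0.60, 0.80]
predictions = {
    "frank": [0.30, 0.45, 0.55, 0.70],
    "hybrid": [0.22, 0.41, 0.58, 0.79],
}
best, scores = select_model(actual, predictions)
print(best, scores[best])
```

Running the selection once per weather type yields one preferred model per type, which is the structure described for step 3.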
In an embodiment of the present disclosure, clustering is performed with three weather types: a cloudy day, an overcast day and a sunny day, and the same way can be used for other weather types to select the optimal model. The density maps of a cloudy day, an overcast day and a sunny day are as shown in
In step 4, the copula function model selected for each weather type is used, with the power prediction results of the centralized photovoltaic power station as input, to obtain the point prediction results of the power of the distributed photovoltaic power station.
Specifically, some point prediction results under each weather are selected, and the effect views of point prediction under multiple weather types are obtained by the hybrid Copula model established in the embodiment of the present disclosure, as shown in
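The text does not state how a single predicted value is derived from the copula model. One common choice, assumed here purely for illustration, is the conditional median: the centralized output is mapped to its cumulative probability u, the value v solving P(V ≤ v | U = u) = 0.5 is found by bisection, and v is mapped back to a power value through the empirical quantiles of the distributed station. The sketch uses the standard Frank copula only, and all data, names, and the value of λ are hypothetical.

```python
import math


def frank_conditional_cdf(v, u, lam):
    """P(V <= v | U = u), the partial derivative of the Frank copula in u."""
    eu = math.exp(-lam * u)
    ev = math.expm1(-lam * v)
    return eu * ev / (math.expm1(-lam) + math.expm1(-lam * u) * ev)


def conditional_median(u, lam, tol=1e-10):
    """Solve frank_conditional_cdf(v, u, lam) = 0.5 for v by bisection."""
    lo, hi = 0.0, 1.0
    while hi - lo > tol:
        mid = (lo + hi) / 2
        if frank_conditional_cdf(mid, u, lam) < 0.5:
            lo = mid
        else:
            hi = mid
    return (lo + hi) / 2


def ecdf(sample, x):
    """Empirical CDF with an (n + 1) denominator so the result stays below 1."""
    return sum(1 for s in sample if s <= x) / (len(sample) + 1)


def quantile(sample, q):
    """Empirical quantile: the order statistic closest to rank q * (n + 1)."""
    ordered = sorted(sample)
    idx = int(round(q * (len(ordered) + 1))) - 1
    return ordered[min(max(idx, 0), len(ordered) - 1)]


def predict_distributed(central_now, central_hist, distributed_hist, lam):
    u = ecdf(central_hist, central_now)
    v = conditional_median(u, lam)
    return quantile(distributed_hist, v)


# Hypothetical histories in MW for one weather type.
central_hist = [5.0, 10.0, 15.0, 20.0, 25.0, 30.0, 35.0, 40.0, 45.0]
distributed_hist = [0.2, 0.4, 0.6, 0.8, 1.0, 1.2, 1.4, 1.6, 1.8]
low = predict_distributed(10.0, central_hist, distributed_hist, lam=8.0)
high = predict_distributed(40.0, central_hist, distributed_hist, lam=8.0)
print(low, high)
```

With a positive λ the predicted distributed output rises with the centralized output, which reflects the positive spatial correlation that the copula is meant to capture.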
In step 5, based on the relationship between the actual value and the value of point prediction of the distributed photovoltaic power station, a conditional probability model is constructed, and through the conditional probability model, the probability distribution of the power generated by the distributed photovoltaic power station and the conditional probability corresponding to the value of point prediction are obtained.
In an embodiment of the present disclosure, the actual value is denoted by x and the value of point prediction is denoted by y; the joint probability density function of x and y is then as shown in Equation (5).
In Equation (5), fX(x) and fY(y) are the probability density functions of the marginal distributions of x and y, respectively; and C(FX(x), FY(y)) is the density of the Copula function.
The density of the Copula function is obtained according to the hybrid Copula function model established by Equation (4). The probability density of the marginal distribution is obtained from the data of the output of the centralized photovoltaic power station and the distributed photovoltaic power station, respectively.
Given the point prediction y = P, the conditional probability density function of the actual value is as shown in Equation (6).
Equation (6) shows that the conditional probability density includes two parts: the density of the Copula function with variable multipliers and the density of the probability function of the actual value. The conditional probability distribution of the photovoltaic output under different weather types when the final point prediction is 0.7 is as shown in
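Equations (5) and (6) are not reproduced in this text. Based on the description above, the sketch assumes the usual copula factorization, in which the conditional density of the actual value x given the point prediction y = P is the copula density evaluated at (FX(x), FY(P)) multiplied by the marginal density fX(x). For brevity it uses the standard Frank copula density instead of the hybrid model, and uniform marginals on normalized power. Both are simplifying assumptions, as is the value of λ.

```python
import math


def frank_density(u, v, lam):
    """Density of the Frank copula (mixed second derivative of its CDF)."""
    e1 = -math.expm1(-lam)  # 1 - exp(-lam)
    d = e1 - math.expm1(-lam * u) * math.expm1(-lam * v)
    return lam * e1 * math.exp(-lam * (u + v)) / (d * d)


def conditional_density(x, p, lam, cdf_x, pdf_x, cdf_y):
    """Assumed form of Equation (6): c(F_X(x), F_Y(p)) * f_X(x)."""
    return frank_density(cdf_x(x), cdf_y(p), lam) * pdf_x(x)


def uniform_cdf(x):
    """Placeholder marginal CDF on normalized power in [0, 1]."""
    return min(max(x, 0.0), 1.0)


def uniform_pdf(x):
    """Placeholder marginal density on normalized power in [0, 1]."""
    return 1.0 if 0.0 <= x <= 1.0 else 0.0


# Conditional density of the actual value when the point prediction is 0.7.
steps = 1000
xs = [i / steps for i in range(steps + 1)]
dens = [conditional_density(x, 0.7, 6.0, uniform_cdf, uniform_pdf, uniform_cdf)
        for x in xs]

# Trapezoidal rule: a valid conditional density integrates to about 1.
area = sum((dens[i] + dens[i + 1]) / 2 / steps for i in range(steps))
mode = xs[max(range(len(xs)), key=dens.__getitem__)]
print(area, mode)
```

In practice the uniform placeholders would be replaced by marginal distributions estimated from the station data, and the Frank density by the density of whichever model was selected for the weather type in question.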
To illustrate the practical effect of the probability estimation method for photovoltaic power provided by the embodiment of the present disclosure, the inventors conducted simulation experiments to evaluate and test the predicted conditional probabilities. The prediction results are as shown in
As can be seen from
On the basis of the above probability estimation method for photovoltaic power, Embodiment II of the present disclosure further provides a probability estimation apparatus for photovoltaic power. As shown in
The processor 11 is used for controlling the overall operation of the probability estimation apparatus for photovoltaic power to accomplish all or some of the steps of the above probability estimation method for photovoltaic power. The processor 11 may be a central processing unit (CPU), a graphics processing unit (GPU), a field programmable gate array (FPGA), an application specific integrated circuit (ASIC), a digital signal processing (DSP) chip, etc. The memory 12 is used for storing various types of data to support the operation of the probability estimation apparatus for photovoltaic power. The data may include, for example, instructions for any application program or method operating on the probability estimation apparatus for photovoltaic power, as well as application program-related data.
The memory 12 may be implemented by any type of volatile or non-volatile storage devices or combinations thereof, such as a static random access memory (SRAM), an electrically erasable programmable read-only memory (EEPROM), an erasable programmable read-only memory (EPROM), a programmable read-only memory (PROM), a read-only memory (ROM), a magnetic memory, a flash memory, etc.
In an exemplary embodiment, the probability estimation apparatus for photovoltaic power may be implemented specifically by a computer chip or entity, or by a product having some functionality for executing the above probability estimation method for photovoltaic power, and achieving the technical effects consistent with those achieved in the above method. An exemplary embodiment relates to a computer. Specifically, the computer may be, for example, a personal computer, a laptop computer, an in-vehicle human-computer interaction device, a cellular phone, a camera phone, a smart phone, a personal digital assistant, a media player, a navigation device, an E-mail device, a game control panel, a tablet personal computer, a wearable device, or a combination of any of these devices.
In another exemplary embodiment, the present disclosure further provides a computer-readable storage medium including program instructions, the program instructions, when executed by a processor, implementing the steps of the probability estimation method for photovoltaic power according to any one of the above embodiments. For example, the computer-readable storage medium may be a memory including program instructions, and the above program instructions may be executed by the processor of the probability estimation apparatus for photovoltaic power to accomplish the above probability estimation method for photovoltaic power and achieve the technical effects consistent with those achieved in the above method.
Embodiment III of the present disclosure provides a photovoltaic power system, using the above probability estimation method for photovoltaic power to estimate the photovoltaic power. As shown in
Compared with the prior art, the probability estimation method for photovoltaic power provided by the present disclosure creatively applies the combination of a clustering algorithm and the optimized copula model under weather classification to the field of predicting the power of the distributed photovoltaic power station, carries out weather clustering based on historical meteorological data, and constructs the copula function model based on the clustering results. Historical operation data and weather classification results are considered at the same time to make the obtained hybrid Copula function model have higher prediction accuracy. The probability estimation method for photovoltaic power predicts the power of the distributed photovoltaic power station through the centralized photovoltaic power station, solving the problem that operational data of the distributed photovoltaic power station is difficult to collect and providing strong support for the safe and stable operation of the photovoltaic power system.
The probability estimation method for photovoltaic power based on an optimized copula function and the photovoltaic power system provided by the present disclosure are described in detail above. To a person of ordinary skill in the art, any obvious changes made to the present disclosure without departing from the substance of the present disclosure will constitute an infringement of the patent rights of the present disclosure and will be subject to the corresponding legal liability.
Number | Date | Country | Kind |
---|---|---|---|
2022107947399 | Jul 2022 | CN | national |