This application is based upon and claims priority to Chinese Patent Application No. 202210172447.1, filed on Feb. 24, 2022, the entire contents of which are incorporated herein by reference.
The present disclosure relates to the technical field of power quality of a power system and, specifically, to a method for constructing a general probability model of a harmonic emission level for an industrial load.
With the continuous development of the industrial level, more large-capacity, nonlinear, and impulsive loads are applied in a power system, which results in an increasingly serious harmonic problem in the power system. To improve the power quality of the power system, grid companies have carried out a series of operations, such as harmonic power flow calculation, harmonic responsibility division, and harmonic resonance analysis based on accurate modeling of a harmonic load. It can be seen that harmonic modeling of a large industrial load is of great significance for harmonic hazard assessment and governance.
At present, there are mainly two types of harmonic load modeling methods. One type of harmonic load modeling method is based on a circuit principle and mechanism analysis. Based on a volt-ampere characteristic between a harmonic voltage and a harmonic current, this method mainly obtains an equivalent current source model that can represent a coupling relationship between each harmonic voltage and current of a harmonic source or a harmonic coupling admittance matrix model based on an RLC circuit. This method generally needs to provide measured voltage and current waveform data to analyze a harmonic generation mechanism. In this method, it is difficult to obtain waveform data, the mechanism analysis process is complex, and there are single modeling objects and the like. The other type of harmonic load modeling method is based on monitoring data or simulation data and data analysis and mainly includes a machine learning method and a probabilistic method. There are two types of probabilistic methods based on whether a distribution form of a required variable needs to be preset, namely, a nonparametric estimation method and a parametric estimation method. The parametric estimation method selects a different probability density function (such as a normal distribution function, a lognormal distribution function, or the like) based on the actual probability distribution form of the variable and uses a maximum likelihood method or other parametric estimation method to solve a numerical characteristic of the probability density function. The parametric estimation method is generally only applicable to data with a known distribution form and a single peak value. Without assuming a probability density function that the nonparametric estimation method may meet, the nonparametric estimation method directly analyzes a probability density function of actual data through histogram estimation, kernel density estimation, or Monte Carlo simulation. However, the nonparametric estimation method lacks theoretical guidance, and a calculation result is easily affected by bandwidth. Therefore, it is difficult to directly apply the nonparametric estimation method to practical projects.
With the popularization of a power quality monitoring system, a bus outlet of a large industrial load is usually equipped with a power quality monitoring device to monitor the power quality of the industrial load in real-time, such as a total harmonic voltage/current distortion rate, a harmonic voltage ratio, an effective harmonic value, and the like, which provides a solid data basis for accurate modeling of a harmonic emission level of the large industrial load. Under such technical background, this patent proposes a general probability model of a harmonic emission level for an industrial load based on power quality monitoring data, which further improves model accuracy and scalability.
1) The modeling method based on the mechanism analysis needs to measure the waveform data of actual voltage and current of a load and obtains, based on the volt-ampere characteristic between the harmonic voltage and the harmonic current, the equivalent current source model that represents the coupling relationship between each harmonic voltage and current of the harmonic source or the harmonic coupling admittance matrix model based on the RLC circuit. The modeling method performs the analysis based on monitoring of the waveform data, and monitoring data in an actual power system is mainly steady-state power quality data monitored by the power quality monitoring system. Therefore, it is difficult to perform analysis directly. In addition, a mechanism analysis process is also very complex, and a specific load needs to be analyzed in a specific manner, which results in poor universality or scalability.
2) The single probability model analysis method needs to determine the distribution form of the variable in advance and select an appropriate probability distribution function based on the distribution form to perform analysis. Harmonic data of an actual load is distributed in a multi-peak and asymmetric form, so it is difficult to find a suitable probability density distribution function for analysis.
To resolve the above problems, the present disclosure provides a method for constructing a general probability model of a harmonic emission level for an industrial load to combine a parametric estimation method based on a normal distribution function and a lognormal distribution function with a nonparametric estimation method represented by a kernel density estimation method. This not only overcomes the disadvantage that a single probability distribution model depends on pilot experience and cannot be applied to a multi-peak and asymmetric distribution but also avoids an insufficient theoretical basis of a kernel density estimation method, thereby effectively improving the accuracy and adaptability of modeling. The technical solutions are as follows:
A method for constructing a general probability model of a harmonic emission level for an industrial load includes the following steps:
Step 1: extracting harmonic monitoring data of an industrial load to obtain a harmonic characteristic dataset X of a user:
where N represents the total quantity of sampling points, each column vector in the X represents a harmonic current monitoring sequence, a subscript of I represents a harmonic order, and a superscript of the I represents the quantity of sampling sequences.
Step 2: constructing a general probability model for an h-th harmonic Ih in the harmonic characteristic dataset:
where fi(.) represents a probability density subfunction, λi represents a weight coefficient of the probability density subfunction; the general harmonic probability model is a linear combination of three probability density subfunctions, where f1(.) represents a part that is of the Ih and obeys a normal distribution, f2(.) represents a part that is of the Ih and obeys a lognormal distribution, and f3(.) represents a part that is of the Ih and obeys another distribution. fi(.) is expressed as follows:
where μ1 and μ2 represent mathematical expectations of the probability density subfunction; σ1 and σ2 represent standard deviations of the probability density subfunction; K(.) represents a kernel function b>0, where b represents a smoothing parameter, which is referred to as a bandwidth or a window; Ihj represents a jth sample of the Ih in each window; n represents the total quantity of samples in each window.
The probability density function is non-negative and normative, so the weight coefficient of the probability density subfunction should meet the following formula, where λ1=1 or λ2=1, indicating that the Ih obeys a single normal distribution or the lognormal distribution:
Step 3: discretizing the general probability model of the Ih, where
the Ih is discretized to discretize the f1(.) and the f2(.) to obtain the following discretized general harmonic probability model:
where max(Ih) represents the maximum current value of the h-th harmonic.
Step 4: constructing a parameter optimization model of the general probability model, which specifically includes the following substeps:
Step 4.1: constructing an objective function, where
a degree of approximation between the general probability model and an actual probability distribution of the Ih is reflected by the difference between a mathematical expectation calculated by a general optimization model and an actual value, as well as a difference between a standard deviation calculated by the general optimization model and the actual value, and therefore the objective function is as follows:
where y1 and y2 respectively represent mean square errors of a mathematical expectation and a standard deviation of the general probability model; Ei(Īh) and E0(Īh) respectively represent a mathematical expectation that is of the Ih and calculated by the probability density subfunction and an actual mathematical expectation of the Ih; Vari(Īh) and Var0(Īh) respectively represent a standard deviation that is of the Ih and calculated by the probability density subfunction and an actual standard deviation of the Ih.
Two minimum objective functions are combined into a minimum objective function, and the combined minimum objective function is as follows:
Step 4.2: determining constraints, where
the constraints include an equality constraint and an inequality constraint.
1) The equality constraint for optimizing the variable λi is determined according to the following formula and is expressed by l:
2) The inequality constraint includes a value range of the weight coefficient λi and a value range that is of the random variable Ih and determined by numerical characteristics (μ1, σ1) and (μ2, σ2) when the single probability density subfunction takes effect.
Step 5: solving parameters {λ1, λ2, λ3, μ1, μ2, σ1, σ2} of the general probability model.
A constrained problem is converted into an unconstrained problem. A multiplier method is used for solving, that is, an optimization variable set is defined as γ={λ1, λ2, λ3, μ1, μ2, σ1, σ2}. An augmented Lagrange function is defined as J and is expressed is as follows:
where y(γ) represents the objective function, l(γ) represents the equality constraint, gq(γ) represents the inequality constraint, ωq represents a Lagrange multiplier of the inequality constraint, and ν represents a Lagrange multiplier of the equality constraint.
For the J(γ, ω, ν, ρ), only the sufficiently large parameter ρ needs to be taken, and the multipliers ω and ν are continuously corrected to obtain a local optimal solution by minimizing the J(γ, ω, ν, φ, where correction formulas of the multipliers ω and ν are as follows:
where k is a superscript that represents a quantity of corrections.
Step 6: obtaining the general probability model of the Ih.
Further, in the objective function in Step 4.1,
Further, in step 4.2, an inequality constraint of the λi is as follows:
where assuming that 95% confidence intervals of {μ1, μ2, σ1, σ2} are [{circumflex over (θ)}1, {circumflex over (θ)}2], [{circumflex over (θ)}3, {circumflex over (θ)}4], [{circumflex over (θ)}5, {circumflex over (θ)}6], and [{circumflex over (θ)}7, {circumflex over (θ)}8] respectively, inequality constraints of the optimization variables {μ1, μ2, σ1, σ2} are as follows:
where gq represents the inequality constraint and q=1, 2, L, 11.
Further, the multiplier method in step 5 specifically includes the following substeps:
step a: setting an initial point γ(0), initial multiplier vector estimates ω(1) and ν(1), a parameter ρ, an allowable error ε (>0), a constant c (>1), β (∈(0,1)), and k (=1);
step b: taking γ(k−1) as an initial point and solving an unconstrained problem shown in the following formula to obtain a solution γ(k):
min J(γ,ω(k),ν(k),ρ);
step c: if ∥l(γ(k))∥<ε, stopping the calculation and obtaining a point γ(k)); otherwise, performing step d;
step d: ∥l(γ(k))∥/∥l(γ(k−1))∥≥β, setting ρ=cρ and performing step e; otherwise, performing step e directly; and
step e: correcting multipliers ωq(k+1) and ν(k+1) by using the second formula in step 5, setting k=k+1, and performing step b.
The present disclosure achieves the following beneficial effects: The present disclosure establishes, based on harmonic data monitored by a power quality monitoring system, a general probability model by combining a parametric estimation method based on a normal distribution function and a lognormal distribution function with a nonparametric estimation method represented by a kernel density estimation method, taking a degree of approximation between the general probability model and an actual probability distribution of each harmonic current as an objective function based on parameters required by the model, and optimizing and solving the parameters of the proposed general probability model by using a multiplier method to determine parameters of the general probability model to finally obtain a general probability model applicable to different industrial loads. The present disclosure not only overcomes the disadvantage that a single probability distribution model depends on pilot experience and cannot be applied to a multi-peak and asymmetric distribution but also avoids an insufficient theoretical basis of a kernel density estimation method, thereby effectively improving the accuracy and adaptability of modeling.
The present disclosure will be further described below in conjunction with the accompanying drawings and specific embodiments.
An industrial load has a large capacity and accounts for a large proportion, which imposes a great impact on the power quality of a power system. To accurately depict a harmonic problem caused by the industrial load to a grid, the present disclosure provides a general probability model of a harmonic emission level for an industrial load. A basic flowchart is shown in
S1: Harmonic monitoring data of an industrial load is extracted to obtain harmonic characteristic dataset X of a user.
A bus inlet of the industrial load bus is usually equipped with a power quality monitoring device whose sampling interval is generally 3 to 15 min. Monitoring data mainly includes maximum values, minimum values, average values, and the 95% of the maximum value of a fundamental voltage, a fundamental current, active power, reactive power, apparent power, a total harmonic voltage/current distortion rate, a 2 to 25-th harmonic voltage ratio/effective current values, and the like. The present disclosure is intended to use an average value of a 2 to 25-th harmonic current measured at a 3-min sampling interval on Dec. 20, 2021, in a 110 kV steelmaking plant in Taiyuan City, Shanxi Province to perform the analysis.
In the above formula, N represents the total quantity of sampling points and is equal to 480 herein. Each column vector in the X represents a harmonic current monitoring sequence, a subscript of I represents a harmonic order, and a superscript of the I represents the quantity of sampling sequences. For example, Ihm represents an mth sampling point of an h-th harmonic; j=1, 2, . . . , 480; h represents a harmonic order; and h=2, 3, . . . , 25.
Step 2: A general probability model is constructed for the h-th harmonic (Ih) in the harmonic characteristic dataset, as shown in the following formula (2):
In the above formula, fi(.) represents a probability density subfunction, and λi represents the weight coefficient of the probability density subfunction. The general harmonic probability model is a linear combination of three probability density subfunctions. f1(.) represents a part that is of the Ih and obeys a normal distribution, f2(.) represents a part that is of the Ih and obeys a lognormal distribution, and f3(.) represents a part that is of the Ih and obeys another distribution. Formulas (3) to (5) are mathematical expressions of the fi(.):
In the above formulas, μ1 and μ2 represent mathematical expectations of the probability density subfunction, and σ1 and σ2 represent standard deviations of the probability density subfunction. K(.) represents a kernel function, and b>0, where b represents a smoothing parameter, which is referred to as a bandwidth or a window. Ihj represents a jth sample of the Ih in each window, and n represents the total quantity of samples in each window.
A probability density function is non-negative and normative, so the weight coefficient of the probability density subfunction should meet the following formula (6).
In the formula, λ1=1 or λ2=1, indicating that the Ih obeys a single normal distribution or the lognormal distribution:
Step 3: The general probability model of the Ih is discretized.
Since f1(.) and f2(.) are continuous functions and f3(.) is a discrete function, these two types of functions cannot be added directly by using the following formula (7). Instead, it is necessary to discretize f1(.) and f2(.). f1(.) and f2(.) can be discretized by discretizing the Ih. Therefore, the following discretized general harmonic probability model is obtained:
In the above formula, max(Ih) represents the maximum current value of the h-th harmonic.
Step 4: A parameter optimization model of the general probability model is constructed.
It can be seen from the formulas (3) to (7) that by adjusting a value of a parameter set {λ1, λ2, λ3, μ1, μ2, σ1, σ2, b}, the above discretized general harmonic probability model can fit and approximate a probability distribution function of any random variable. The smoothing parameter b can be set based on experience, and other parameters need to be solved by constructing the parameter optimization model. The parameter optimization model is mainly divided into the following three parts.
1) Determining an Objective Function
The degree of approximation between the general probability model and the actual probability distribution of the Ih may be visually reflected by the difference between the mathematical expectation calculated by a general optimization model and the actual value, as well as the difference between the standard deviation calculated by the general optimization model and the actual value. Higher accuracy of the parameter optimization model leads to a smaller difference between the mathematical expectations and a smaller difference between the standard deviations. Therefore, the following two objective functions are constructed in this specification:
In the above formulas,
A solution to this optimization problem is to combine two minimum objective subfunctions into a minimum objective function, and then an optimization method of a single objective optimization problem is used for solving. The following formula (16) is the combined objective function:
2) Determining Constraints
The constraints include an equality constraint and an inequality constraint based on the forms of the constraints.
Equality constraint: An equality constraint for optimizing the variable λi may be determined according to a formula (6) and is expressed by l:
Inequality constraint: Based on the optimization parameter set {λ1, λ2, λ3, μ1, μ2, σ1, σ2, b}, it can be seen that there are mainly two types of inequality constraints. One type of inequality constraint is a value range of the weight coefficient λi. The other type of inequality constraint is a value range that is of the random variable Ih and determined by numerical characteristics (μ1, σ1) and (μ2, σ2) when a single probability density subfunction takes effect.
An inequality constraint of the λi is as follows:
Inequality constraints of {μ1, μ2, σ1, σ2}: This specification uses a maximum likelihood estimation method to evaluate the numerical characteristics of the Ih obeying the single normal distribution or the lognormal distribution and takes the upper and lower confidence limits with a confidence level of 0.95 as value ranges of [μ1, μ2, σ1, σ2]. Assuming that 95% confidence intervals of {μ1, —2, σ1, σ2} are [θ1, θ2], [θ3, θ4], [θ5, θ6], and [θ7, θ8] respectively, the inequality constraints of the optimization variables {μ1, μ2, σ1, σ2} can be obtained, namely:
The parameter optimization model of the general probability model in the present disclosure is composed of the objective function (16) and the constraint conditions (17) to (22).
Step 5: Parameters {λ1, λ2, λ3, μ1, μ2, σ1, σ2} of the general probability model are solved.
The parameter optimization model in the present disclosure is a constrained optimization problem in an optimization theory. A solution of the parameter optimization model is to convert a constrained problem into an unconstrained problem. Common solving methods include a Lagrange multiplier method, Karush-Kuhn-Tucker (KKT) conditions, a penalty function method, and the like. The parameter optimization model in the present disclosure contains both the equality constraint and the inequality constraint and can be solved directly by using a multiplier method.
It is assumed that the optimization variable set is expressed by γ, namely, γ={λ1, λ2, λ3, μ1, μ2, σ1, σ2}, and an augmented Lagrange function is defined as J and is expressed is as follows:
For the J(γ, ω, ν, ρ), only the sufficiently large parameter ρ needs to be taken, and the multipliers ω and ν are continuously corrected to obtain a local optimal solution in the formula (23) by minimizing the J(γ, ω, ν, ρ), where correction formulas of the multipliers ω and ν are as follows:
Step 1: Initial point γ(0), initial multiplier vector estimates ω(1) and ν(1), parameter ρ, allowable error ε (>0), constant c (>1), β (ε(0,1)), and k (=1) are set.
Step 2: γ(k−1) is taken as an initial point, and an unconstrained problem shown in the following formula (25) is solved to obtain solution γ(k):
min J(γ,ω(k),ν(k),ρ) (25).
Step 3: If ∥l(γ(k))∥<ε, the calculation is stopped, and point γ(k)) is obtained. Otherwise, step 4 is performed.
Step 4: If ∥l(γ(k))∥/∥l(γ(k−1))∥≥β, ρ=cρ is set, and step 5 is performed. Otherwise, step 5 is performed directly.
Step 5: Multipliers ωq(k+1) and ν(k+1) are corrected by using the formula (24); k=k+1 is set, and step 2 is performed.
Step 6: The general probability model of the Ih is obtained.
Number | Date | Country | Kind |
---|---|---|---|
202210172447.1 | Feb 2022 | CN | national |