This invention relates generally to industry solutions for the Energy & Utility (E&U) industry and more particularly to E&U operation and planning studies involving optimal power flow.
This section is intended to provide a background or context to the invention disclosed below. The description herein may include concepts that could be pursued, but are not necessarily ones that have been previously conceived, implemented or described. Therefore, unless otherwise explicitly indicated herein, what is described in this section is not prior art to the description in this application and is not admitted to be prior art by inclusion in this section.
Central to an E&U's daily operation and planning studies is to solve an optimal power flow (OPF) problem under various constraints (e.g., primary electrical power flow constraints). For instance, one question commonly posed is, what is the most economical generation dispatch schedule under those electrical power flow constraints?
Because of its complexity, OPF is typically solved approximately. As an example, alternating current (AC) power flow constraints are approximated by their direct current (DC) power flow constraints. Thus, the nonlinear optimization problem becomes a linear programming problem, which is relatively easy to solve.
Direct consideration of AC power flow constraints has high computation complexity. Particularly, constraints, Jacobian functions and Hessian functions need to be computed explicitly either through closed formulas (e.g., MatPower, which is a MATLAB power system simulation package) or automatic differentiation. The latter is described, e.g., by Jiang et al. “An Efficient Implementation of Automatic Differentiation in Interior Point Optimal Power Flow”, IEEE Transactions on Power Systems, Vol. 25, No. 1, February 2010. See also Prasad Raju, et al., “An efficient technique of calculating first and second order derivatives in newton based optimal power flow problem,” International Journal of Research & Reviews in Computer Science, June 2012, Vol. 3 Issue 3, p 1645.
Problems with these approaches include that either the solution quality is low as an approach uses approximated electrical models, or the performance is low with high computation cost.
The following summary is merely intended to be exemplary. The summary is not intended to limit the scope of the claims.
An exemplary method includes solving on a computing system an optimal power flow formulation for a plurality of generators in a power system. The solving includes computing using multi-threaded parallelism a plurality of constraints for the formulation, computing using multi-threaded parallelism a plurality of Jacobian functions of the constraints, and computing using multi-threaded parallelism a Hessian of Lagrangian functions. The method further includes outputting results of the solving, wherein the results comprise values of generation levels for the plurality of generators. Apparatus and program products are also disclosed.
It is not an exaggeration to say that modern electric power systems are built upon optimization models and solution techniques.
As a simple description, the long-term system planning studies the question of how much new generation and transmission capacity is needed and where in the power system they should he built in order to satisfy a projected future demand in a time horizon of 5 to 15 years or longer, the maintenance scheduling problem aims to optimally schedule the maintenance of generators, transformers, and other equipment so that the system operation will be disturbed as little as possible at a cost as low as possible, where the lime horizon of decision is usually one year, the short-term operation problems determine daily or intraday generation schedules to match demand and satisfy various constraints.
All the major decision problems described above are built on two fundamental problems, the so-called unit commitment (UC) and economic dispatch based on optimal power flow (OPF). The instant disclosure concerns the optimal power flow.
A typical power system involves three major subsystems a generation system, a transmission network, and the distribution system.
In an example, the utilities may want to know how much power output their generators should be producing at some time. For instance, how much power should the coal plant, the nuclear plant, or any of the other power producers produce based on loads being used by customers and power produced by other generators? Utilities are trying to minimize the total cost of generation while satisfying the demand and electrical constraints. This is especially true since fuel prices, operating costs, power losses, and the like are different for each of the various utilities. This type of problem may be solved via an optimal flow problem.
The optimum flow-problem considered herein mainly concerns the high voltage transmission network, although with the increase of distributed generators, the distribution system is changing from a largely consumer network to a hybrid of power consumers and suppliers. The exemplary embodiments presented herein can be extended to the distribution system as well.
When real-time operation starts, the utility operator dispatches committed generators 390 to meet realized demand at a minimum total generation cost. This procedure is called economic dispatch. The underlying optimization problem is called optimal power flow, which involves continuous decision variables, i.e., generation output and voltages, and very often only has one decision period. In typical usage, the optimal power flow problem is solved every 5 to 15 minutes.
Due to the nonlinear and nonconvex nature of the optimal power flow, it is still a challenge to obtain fast, robust solutions to this problem in large-scale power systems. New challenges are emerging too. For example, unconventional generators such as wind and solar power are intrinsically stochastic, how to deal with such uncertainties poses challenges for large-scale integration of variable generation resources into the power grid.
Thus, solving the optimum power flow is important currently and will remain important as these unconventional generators become more prevalent. As described in more detail below, the instant exemplary embodiments provide improved techniques for solving the optimum power flow.
Turning to
The processor(s) 310 may be any processing units, such as digital signal processors, graphics processing units, and/or single-core or multi-core general purpose processors. Because of the complexity of the processing performed herein, typically, the processor(s) 310 would be multi-threaded, generally with a thread per core, although some processors allow fewer cores than threads. Also, graphics processing units (GPUs) are increasingly being used for mathematical operations, such as matrix manipulation or multiplication, e.g., since the graphics processing units provide a high degree of parallelism. A high-end. expensive general purpose processor may have several or tens of cores, each capable of running one or more threads. By contrast, a GPU may be able to perform calculations in parallel with thousands of cores and therefor thousands of threads. There are special frameworks that allow science applications to run on GPUs. One such framework is NVIDIA's Compute Unified Device Architecture (CLTDA) framework. While algorithms must be rewritten to make them data parallel (that is, data are distributed across different threads), such parallel threads may use floating point units, and therefore the speed of these parallel operations can approach or exceed multiple teraflops (where one teraflop is one trillion floating point operations per second).
Thus, in the example of
The circuitry 315 may be any electronic circuit such as an application specific integrated circuit or programmable logic. The memories 345 may comprise non-volatile and/or volatile RAM (random access memory), cache memory, NAND-based flash memory, long term storage (e.g., hard drive), and/or read only memory. The one or more I/O interfaces 320 may include interfaces through which a user may interact with the computing system 300. The display(s) 376 may be a touchscreen, flatscreen, monitor, television, projector, as examples
In one embodiment, users interact with the optimal power flow tool 350 through the UI 180 on the display 376 in an exemplary embodiment or through the network interface(s) 330 in another non-limiting embodiment. The external device(s) 390 enable a user to interact in one exemplary embodiment with the computing system 300 and may include a mouse, trackball, keyboard, and the like. The network interfaces 330 may be wired and/or wireless and may implement a number of protocols, such as cellular or local area network protocols. The elements in computing system 300 may be interconnected through any technology, such as buses, traces on a board, interconnects on semiconductors, and the like.
The optimal power flow tool 350 generates, using at least the power system information 395 and constraints 397, optimal power flow results 355. The power system information 395 includes such items as voltage, current, and power at certain locations (e.g., at the outputs of the generators 390). There are a number of constraints 397 that have to be considered. There are constraints on the maximum and minimum generation output levels that each generator can achieve, and transmission network constraints that involve power flow equations and power flow limits on transmission lines. There are also constraints on the voltage limits on buses. The generation output levels and voltage levels are modeled as continuous variables. The power flow equations and network constraints involve nonlinear nonconvex functions of the continuous variables. Indications 385 of the optimal power flow results 355 could be shown on UI 380 and displayed to a user or sent via the network interfaces) 330 to a user.
The exemplary embodiments herein provide methods and systems for computing function constraints, Jacobians, and Hessians for solving optimal power flow with AC power flow constraints. The exemplary embodiments provide computational speed up using multi-threading parallelism. For instance, embodiments may use complex numbers everywhere instead of computationally expensive-cost functions sin(x) (sine) and cos(x) (cosine). Exemplary embodiments propose a scheme to compute function constraints such that the main computation burden for a Jacobian evaluation is reused from the function constraints evaluation. As a further example, all Jacobian entries are used by the Hessian evaluation, and these are all performed using multi-threaded parallelism to efficiently compute sub-matrices.
The exemplary embodiments support real-time operation with AC OPF and provide a large cost savings. Most of the conventional AC OPF solvers required to efficiently evaluate these functions, such as gradient-type algorithms, use function constraints and Jacobian evaluation, while interior-point algorithms need to use function constraints, Jacobian evaluation. and Hessian evaluation. The examples herein use inter-point algorithms, and use function constraints, Jacobian evaluation, and Hessian evaluation, but the exemplary embodiments are not constrained to the use of these algorithms.
Many existing power grid related planning and operation studies can benefit from the instant techniques. For instance, economic dispatch, wind integration or photovoltaic installation studies, security analysis, and the like may benefit.
An optimal power flow formulation is as follows:
The function F (Pig) is a function to be minimized such that (“s.t.”) the constraints shown above are met. The following are definitions of the terms in the formulation above: ng is the number of generators committed in the power system (e.g., power system 301 of
The OPF formulation can be rewritten in a compact form as follows:
minf(x)
s.t.g≤g(x)≤
xL≤x≤xU
where f(x) corresponds to F(Pig), the function g(x) combines the upper three constraints from above (that is, Pi(δ,v)−Pig+Pid=0, Qi(δ,v)−Qig+Qid=0, and |Sij(δ,v)|≤Sijmax), g and
Now that the optimum power flow formulation has been presented, an exemplary algorithm for solving the formulation is presented. Turning to
In particular, the computing system 300 is to solve the optimal power flow, e.g., as formulated above, in block 400. Block 400 includes blocks 405-445 in this example. In block 405, the computing system 300 use complex numbers (e.g., instead of trigonometric functions such as sine, cosine) in calculations. That is, the computing system 300 uses eix and e−jx in appropriate equations instead of sin(x) and cos(x). Block 405 is applied to the other blocks in the flow of
The interior-point method is a powerful tool used to solve the nonlinear problem presented by the optimal power flow formulation. While oilier methods may be used to solve the OPF problem, the interior-point method is typically the most efficient method. Some references describing the interior-point method include the following: 1) Capitanescu, F., et al.: Interior-point based algorithms for the solution of optimal power flow problems, Electric Power Systems Research 77(5-6), 508-517 (2007); 2) Jabr, R. A.: A primal-dual interior-point method to solve the optimal power flow dispatching problem. Optimization and Engineering 4(4), 309-336 (2003); 3) Tones, G., Quintana, V.: An interior-point method for nonlinear optimal power flow using voltage rectangular coordinates, IEEE Transactions on Power Systems 13(4), 1211-1218 (1998); and 4) Wang, H., et al.: On computational issues of market-based optimal power flow, IEEE Transactions on Power Systems 22(3), 1185-1193 (2007). There are four main computational efforts in each iteration of the interior-point method. These efforts are represented as blocks 410, 420, 425, and 430 in
In block 410, the constraint function variables, gi(x), are evaluated. As described below in more detail, one portion of block 410 is to compute current as a function of admittance and voltage (block 415). As described below, the computation of current may be performed in a multi-threaded manner (that is, using multiple threads 325), which speeds up the process dramatically relative to conventional techniques.
In block 420, the Jacobian of the constraints, ∇gi(x), is evaluated. As described below, this evaluation may be multi-threaded using multiple threads 325. Again, use of multi-threading allows a tremendous increase in speed over conventional techniques. In block 425, the Hessian of the Lagrangian function, ∇2f(x)+Σi−1 mλi∇2gi(x), is evaluated. As described below, this evaluation may be multi-threaded using multiple threads 325. Use of multi-threading allows a tremendous increase in speed over conventional techniques. In block 430, a system of linear equations, Ax=b, is solved. There are several ways to write down a system of linear equations. It depends on what interior-point method is used (as there are several interior-point methods), but these methods have a common fact that constraint functions gi(x), the Jacobian of the constraints, Vgi(x), and the Hessian of the Lagrangian function, ∇2f(x)+Σi=1 mλi∇2gi(x), are entries of a matrix A and a vector b. For example, see Victor M. Zavala, “Implementation and Solution of IPOPT Primal-Dual System” for a form Ax=b, the linear equation is described in Equation (10) of Zavala, shown below:
where W+Dx is a Hessian function, at least JcT and JdT are Jacobian functions of the matrix A, Δx is an entry in the vector x. and c t x) is an entry in the vector b.
In block 445, it is determined if the OPF (optimum power flow) is solved. If the OPF is not (block 455=No), the flow proceeds to block 410. If the OPF is solved (block 445=Yes), in block 450 the optimum power flow results 355 are output, such as to a file (block 455, e.g., in memory(ies) 345) or to a display (block 460) or through a network interface 330 (block 461, e.g., to one or more operators). It is noted that there may be multiple on-site operators 398 and some portion (that is, less than all) of the results 355 may be output to each on-site operator 398. In an example, the optimum power flow results 355 are the values 463 of generation levels for generators. In block 465, one or more on-site operators 398 use the results 355 to modify the output of one or more generators 390. The one or more on-site operators 398 modify the active and reactive power at corresponding ones of the generators 390.
Turning to
Computing the constraint function values g(x)=(U1, Sij) (see blocks 410 and 415 of
U(v(t),i*(t))=vi*−(pg−pd) (1)
where t=(|v|,δ) is a state variable and δ is an instantaneous phase, and
i is the current injection for every node, Y is an admittance matrix, v is the voltage vector, and “*” denotes the conjugate of a complex number. It is noted that each of the Y*v* may be computed in parallel via multi-threaded parallelism (as indicated by reference 510 in
For Sij, we have
S
ij
=v
i
i
ij*=|vi|2yij−vivjyij (3)
where vi is a voltage at node i, vj* is a conjugate of a voltage at node j, and yij* is a conjugate of an admittance from node i to a node j. Thus (where “diag” means the matrix is diagonalized):
Hence, both the current vector i (from (3)) and the complex power flow S (from (4)) can be computed in parallel, e.g., in k processors. This corresponds to block 410 and 415 of
It should be noted that the values for the conjugate current vector i* (computed from (2)), Sij and vivj*yij (computed from (3)) are stored (block 515), since these values will be reused in computing the Jacobian ∇g(x).
Concerning computing the Jacobian ∇g(x)=(∇Ui, ∇Sij) (see block 420 of
For ∇U=(∂U/∂|v|, ∂U/∂δ), we have:
and j is the imaginary unit.
Thus,
We note that entries vivj*yij* and ik* for the Jacobian matrix ∂U/∂|v| have been computed when computing the function values g(x). This is indicated by reference 520 of
The parallelism (see reference 530 of
Similarly, we have
The parallelism (see reference 530 of
For ∇S, we have:
Thus
Similarly, we have
Thus,
Again, entries Sij and vivi*yij* for computing
were computed when computing g(x), e.g., see Equation (3) and reference 520 of
Furthermore, we can compute
in parallel (as illustrated by reference 530 of
Regarding computing the Hessian ∇2g(x)=(∇2Pi, ∇2Qi, ∇2Pij, ∇2Qij) (see block 425 of
A Hessian matrix is a square matrix of second order partial derivatives of a function (in this case, a Lagrangian function). The Hessian matrix describes the local curvature of a function of many variables. An exemplary Hessian matrix is the following:
Each of the following equations may be performed in parallel (reference 550 of
In the above equations, λ is a dual variable from the interior point method, and the gi(x) correspond to a combination of P, 0, and S. Note also that the Hessian would also be performed for the indexes i and ij. Using the multi-threading operations 550 illustrated by equations below, there is a four times speed improvement in Hessian forming (also called “stamping”) relative to a serial technique that does not use multi-threading. Furthermore, the Hessian can be computed from the Jacobian and the multi-threading operations 550 can take advantage of Hessian symmetry. For instance,
(where T means transpose), which means that only one of these needs to be determined.
The following are the Hessian matrices that are required for the index i in order to solve the above equations, each of which is a 2×2 matrix of second-order partial derivatives, each of which may be computed in parallel as part of multi-threaded operations 550 (although it should be noted that for each row, only three of these need to be computed):
∇δδ2Pi,∇δv2Pi,∇vδ2Pi,∇vv2Pi
∇δδ2Qi,∇δv2Qi,∇vδ2Qi,∇vv2Qi
The same would also be performed for the indexes ij. That is (although it should be noted that for each row, only three of these need to be computed):
∇δδ2Pi,∇δv2Pi,∇vδ2Pi,∇vv2Pi
∇δδ2Qi,∇δv2Qi,∇vδ2Qi,∇vv2Qi
For ∇2Pi, four Hessian matrices corresponding to Pi are as follows for each node i:
From (5), we have that
Thus, we yield
G and B are components of die admittance matrix Y, Y=G+jB.
From (6), we have that
Thus, we obtain
Thus, the second-order partial derivatives (as part of the Hessian function)
are computed using the Jacobian functions
respectively) (or results thereof), as indicated by reference 540 in
For ∇2Qi, four Hessian matrices corresponding to Qi are as follows for each node i.
From (5), we have that
Then, we have
From (6), we have that
Thus, we obtain
Thus, the second-order partial derivatives (as part of the Hessian function)
are computed using the Jacobian functions
(or results thereof), as indicated by reference 540 in
For ∇2Pij, four Hessian matrices corresponding to Pi are as follows for each node i:
From (7) and (9), we have
Thus, we yield
From (8) and (10), we have
Thus, we obtain
Thus, the second-order partial derivatives (as part of the Hessian function)
are computed using the Jacobian functions
(or results thereof), as indicated by reference 540 in
For ∇2Qij, from (7) and (9), we have
Then, we have
From (8) and (10), we have
Thus, we obtain
Thus, the second-order partial derivatives (as part of the Hessian function)
are computed using the Jacobian functions
(or results thereof), as indicated by reference 540 in
It should be noted that the above representations are merely exemplary. Although references 510, 530, and 550 indicate that multi-threading operations are performed at corresponding blocks 410/415, 420, and 425, it is not necessary to perform every one of these multi-threaded operations. For instance, the multi-threaded operations indicated by references 530 and 550 could be performed but the multi-threaded operations indicated by reference 510 might not be. As an example, only one of the multi-threaded operations indicated by references 510, 530, and 550 may be performed and the other two would not be performed (via multi-threading).
As will be appreciated by one skilled in the art, aspects of the present invention may be embodied as a system, method or computer program product. Accordingly, aspects of the present invention may take the form of an entirely hardware embodiment, an entirely software embodiment (including firmware, resident software, micro-code, etc) or an embodiment combining software and hardware aspects that may all generally be referred to herein as a “circuit,” “module” or “system.” Furthermore, aspects of the present invention may take the form of a computer program product embodied in one or more computer readable medium(s) having computer readable program code embodied thereon.
Any combination of one or more computer readable medium(s) may be utilized. The computer readable medium may be a computer readable signal medium or a computer readable storage medium. A computer readable storage medium does not include propagating signals and may be, for example, but is not limited to, an electronic, magnetic, optical, electromagnetic, infrared, or semiconductor system, apparatus, or device, or any suitable combination of the foregoing. More specific examples (a non-exhaustive list) of the computer readable storage medium would include the following: an electrical connection having one or more wires, a portable computer diskette, a hard disk, a random access memory (RAM), a read-only memory (ROM), an erasable programmable read-only memory (EPROM or Flash memory), an optical fiber, a portable compact disc read-only memory (CD-ROM), an optical storage device, a magnetic storage device, or any suitable combination of the foregoing. In the context of this document, a computer readable storage medium may be any tangible medium that can contain, or store a program for use by or in connection with an instruction execution system, apparatus, or device. A computer readable storage medium does not include a propagating wave.
A computer readable signal medium may include a propagated data signal with computer readable program code embodied therein, for example, in baseband or as part of a carrier wave. Such a propagated signal may take any of a variety of forms, including, but not limited to. electro-magnetic, optical, or any suitable combination thereof. A computer readable signal medium may be any computer readable medium that is not a computer readable storage medium and that can communicate, propagate, or transport a program for use by or in connection with an instruction execution system, apparatus, or device.
Program code embodied on a computer readable medium may be transmitted using any appropriate medium, including but not limited to wireless, wireline, optical fiber cable, RF, etc., or any suitable combination of the foregoing.
Computer program code for carrying out operations for aspects of the present invention may be written in any combination of one or more programming languages, including an object oriented programming language such as Java. Smalltalk. C++ or the like and conventional procedural programming languages, such as the “C” programming language or similar programming languages. The program code may execute entirely on the user's computer, partly on the user's computer, as a stand-alone software package, partly on the user's computer and partly on a remote computer or entirely on the remote computer or server. In the latter scenario, the remote computer may be connected to the user's computer through any type of network, including a local area network (LAN) or a wide area network (WAN), or the connection may be made to an external computer (for example, through the Internet using an Internet Service Provider).
Aspects of the present invention are described above with reference to flowchart illustrations and/or block diagrams of methods, apparatus (systems) and computer program products according to embodiments of the invention. It will be understood that each block of the flowchart illustrations and/or block diagrams, and combinations of blocks in the flowchart illustrations and/or block diagrams, can be implemented by computer program instructions. These computer program instructions may be provided to a processor of a general purpose computer, special purpose computer, or other programmable data processing apparatus to produce a machine, such that the instructions, which execute via the processor of the computer or other programmable data processing apparatus, create means for implementing the functions/acts specified in the flowchart and/or block diagram block or blocks.
These computer program instructions may also be stored in a computer readable medium that can direct a computer, other programmable data processing apparatus, or other devices to function in a particular manner, such that the instructions stored in the computer readable medium produce an article of manufacture including instructions which implement the function/act specified in the flowchart and/or block diagram block or blocks.
The computer program instructions may also be loaded onto a computer, other programmable data processing apparatus, or other devices to cause a series of operational steps to be performed on the computer, other programmable apparatus or other devices to produce a computer implemented process such that the instructions which execute on the computer or other programmable apparatus provide processes for implementing the functions/acts specified in the flowchart and/or block diagram block or blocks.
The terminology used herein is for the purpose of describing particular embodiments only and is not intended to be limiting of the invention As used herein, the singular forms “a”, “an” and “the” are intended to include the plural forms as well, unless the context clearly indicates otherwise. It will be further understood that the terms “comprises” and/or “comprising,” when used in this specification, specify the presence of stated features, integers, steps, operations, elements, and/or components, but do not preclude the presence or addition of one or more other features, integers, steps, operations, elements, components, and/or groups thereof.
The corresponding structures, materials, acts, and equivalents of all means or step plus function elements in the claims below are intended to include any structure, material, or act for performing the function in combination with other claimed elements as specifically claimed. The description of the present invention has been presented for purposes of illustration and description, but is not intended to be exhaustive or limited to the invention in the form disclosed. Many modifications and variations will be apparent to those of ordinary skill in the art without departing from the scope and spirit of the invention. The embodiment was chosen and described in order to best explain the principles of the invention and die practical application, and to enable others of ordinary skill in the art to understand the invention for various embodiments with various modifications as are suited to the particular use contemplated.
The present application is a continuation of U.S. patent application Ser. No. 15/269,057 filed Sep. 19, 2016 and entitled “PARALLEL TECHNIQUE FOR COMPUTING PROBLEM FUNCTIONS IN SOLVING OPTIMAL POWER FLOW”, now U.S. Pat. No. 10,591,945. which is a continuation of U.S. patent application Ser. No. 14/134,351 filed Dec. 19, 2013 and entitled “PARALLEL TECHNIQUE FOR COMPUTING PROBLEM FUNCTIONS IN SOLVING OPTIMAL POWER FLOW”, now U.S. Pat. No. 9,552,007 all of which are incorporated herein by reference.
Number | Date | Country | |
---|---|---|---|
Parent | 15269057 | Sep 2016 | US |
Child | 16821995 | US | |
Parent | 14134351 | Dec 2013 | US |
Child | 15269057 | US |