METHOD FOR REAL-TIME SCHEDULING OF MULTI-ENERGY COMPLEMENTARY MICRO-GRIDS BASED ON ROLLOUT ALGORITHM

Abstract
The invention relates to a method for real-time scheduling of multi-energy complementary micro-grids based on a Rollout algorithm, which is technically characterized by comprising the following steps of: Step 1, setting up a moving-horizon Markov decision process model for the real-time scheduling of the multi-energy complementary micro-grids with random new-energy outputs, and establishing constraint conditions for the real-time scheduling; Step 2, establishing a target function of the real-time scheduling; Step 3, dividing a single complete scheduling cycle into a plurality of scheduling intervals, and finding one basic feasible solution meeting the constraint conditions for the real-time scheduling based on a greedy algorithm; and Step 4, finding a solution to the moving-horizon Markov decision process model for the real-time scheduling of the multi-energy complementary micro-grids by using the Rollout algorithm based on the basic feasible solution from Step 3. With the consideration of the fluctuations in the new-energy outputs, the present invention solves the problems of low speed and low efficiency of a traditional algorithm at the same time, enabling high-speed efficient multi-energy complementary micro-grid real-time scheduling.
Description
TECHNICAL FIELD

The invention generally belongs to the technical field of multi-energy complementary micro-grids, and relates to a method for real-time scheduling of multi-energy complementary micro-grids, and in particular to a method for real-time scheduling of multi-energy complementary micro-grids based on a Rollout algorithm


BACKGROUND

As the smart grid technology evolves, a multi-energy complementary micro-grid system incorporating new energies with an energy storage feature has aroused widespread concerns from researchers. As an autonomous system capable of self-control, protection and management, the multi-energy complementary micro-grids can facilitate the utilization of distributed energy on the spot and enable highly reliable supply of various forms of energy loaded in a more economic and friendly way, transitioning from the traditional grid to the smart grid.


The fluctuation and intermittence in the new-energy outputs pose great challenges to the real-time scheduling of the multi-energy complementary micro-grids, and since the real-time scheduling is a moving-horizon process, the control behavior in a current scheduling interval not only affects the current cycle but also affects the state of a next scheduling interval. The Markov decision model provides a good idea to solve this moving scheduling problem with uncertain variables, but the large number of variables brings a disaster on dimension, leading to a difficulty in finding a solution to the model, and how to find an effective method to solve the difficulty above has become the key to real-time scheduling.


SUMMARY OF THE INVENTION

An objective of the present invention is to overcome the deficiencies of the prior art, and to provide a method for real-time scheduling of multi-energy complementary micro-grids based on a Rollout algorithm, which is simple, feasible, efficient and rapid with reasonable design and high practicability.


The invention solves the technical problem with the following technical solution:

  • a method for real-time scheduling of multi-energy complementary micro-grids based on a Rollout algorithm, characterized by comprising the following steps of:
  • Step 1, setting up a moving-horizon Markov decision process model for the real-time scheduling of the multi-energy complementary micro-grids with random new-energy outputs, and establishing constraint conditions for the real-time scheduling;
  • Step 2, establishing a target function of the real-time scheduling for the moving-horizon Markov decision process model for the real-time scheduling of the multi-energy complementary micro-grids with random new-energy outputs, with the goal of minimum operating cost of a micro-grid system in a moving-horizon Markov decision cycle;
  • Step 3, dividing a single complete scheduling cycle into a plurality of scheduling intervals, and finding one basic feasible solution meeting the constraint conditions for the real-time scheduling based on a greedy algorithm; and


Step 4, finding a solution to the moving-horizon Markov decision process model for the real-time scheduling of the multi-energy complementary micro-grids by using the Rollout algorithm based on the basic feasible solution from Step 3.


Furthermore, the constraint conditions established for the real-time scheduling in Step 1 comprise: micro-grid electric equilibrium constraints, storage battery operating constraints, exchange electric power constraints for the micro-grids and a main grid, and electric power output constraints for combined heat and power equipment;


the micro-grid electric equilibrium constraints are as follows:









p
G



(
t
)


+




i
=
1

N




p
i
c



(
t
)



+


p
B



(
t
)


+


p
w



(
t
)



=


p
D



(
t
)






in the formula, t is a time parameter; pG (t) is exchange electric power for the micro-grids and the main grid at a time t, which is positive during the purchasing of electricity from the main grid and negative during selling of electricity to the main grid; N is the quantity of the combined heat and power equipment; pic(t) is the output electric power of the ith combined heat and power equipment at the time t; pB(t) is charging/discharging power of the storage battery at the time t, which is negative during charging and positive during discharging; pw(t) is generated output of wind power at the time t; and pD(t) is an electric load demand at the time t;


the storage battery operating constraints are as follows:








{





E


(

t
+
1

)


=


E


(
t
)


=




p
B



(
t
)


·
Δ







T
·

α
c











E


(

t
+
1

)


=


E


(
t
)


-




p
B



(
t
)


·
Δ






T


/



α
d










E
_



E


(
t
)




E
_











p
B



(
t
)







p
B

_










in the formulae, E(t) and E(t+1) are energy storage levels of the storage battery at the time t and a time t+1 respectively; E and Ē are upper and lower boundaries of the capacity of the storage battery respectively; ΔT is a time interval from the time t to the time t+1; ac and ad are charging and discharging efficiencies of the storage battery respectively; pB (t) is charging/discharging power of the storage battery at the time t, which is negative during charging and positive during discharging; and pB is an upper limit value of the charging/discharging power of the storage battery;


the exchange electric power constraints for the micro-grids and the main grid are as follows:








{








p
G



(
t
)







p
G

_












p
G



(
t
)


-


p
G



(

t
-
1

)






δ









in the formulae, pG(t) and pG(t−1) are exchange electric power between the micro-grids and the main grid at the times t and t−1 respectively, which is positive during purchasing of electricity from the main grid and negative during selling of electricity to the main grid; pG is an upper limit value of the exchange electric power between the micro-grids and the main grid, and δ is an upper fluctuation limit of the exchange electric power between the micro-grids and the main grid;


the electric power output constraints for the combined heat and power equipment are as follows:








{





0



p
i
c



(
t
)





p
i
c

_


,

i
=
1

,
2
,





,
N









v
i

_

·


H
i



(
t
)






p
i
c



(
t
)






v
i

_

·


H
i



(
t
)












in the formulae, pic(t) is output electric power of the ith combined heat and power equipment at the time t; pic is an upper limit of the output electric power of the ith combined heat and power equipment; Hi(t) is thermal power required to be supplied by the ith combined heat and power equipment at the time t; vi and vi are lower and upper limits of electric-thermal power conversion efficiency of the ith combined heat and power equipment.


Furthermore, establishing a target function of the real-time scheduling for the moving-horizon Markov decision process model for the real-time scheduling of the multi-energy complementary micro-grids with random new-energy outputs in Step 2 specifically comprises the following sub-steps of: first setting up an operating cost function of the micro-grid system at a single scheduling interval with the goal of minimum operating cost of the micro-grid system at the single scheduling interval, and then establishing a target function of the real-time scheduling with the goal of the minimum operating cost of the micro-grid system in the moving-horizon Markov decision cycle;


the operating cost function of the micro-grid system at the single scheduling interval is as follows:











c
t



(


X


(
t
)


,

A


(
t
)



)


=




λ


(
t
)


·


p
G



(
t
)


·
Δ






T

+

c
·




i
=
1

N




F
i
c



(
t
)
















wherein















{






F
i
c



(
t
)


=



a
i

·


p
i
c



(
t
)



+

b
i









X


(
t
)


=

[


E


(
t
)


,


p
G



(

t
-
1

)


,


p
w



(
t
)



]








A


(
t
)


=

[



p
i
c



(
t
)


,


p
G



(
t
)


,


p
B



(
t
)



]

















in the formulae, X(t) is a state variable of the micro-grid system at the time t; A(t) is a control variable of the micro-grid system at the time t; ct(X(t),A(t)) is a function of the system operating cost at the single scheduling interval; λ(t) is a grid electricity price at the time t; c is a price of natural gas; Fic(t) is a linear function between a gas consumption and an electric output of the ith combined heat and power equipment; and ai and bi are coefficients of the linear function between the gas consumption and the electric output of the ith combined heat and power equipment;


the target function of the real-time scheduling is as follows:







min







J
t



(


X


(
t
)


,

A


(
t
)



)



=



c
t



(


X


(
t
)


,

A


(
t
)



)


+

E




[





t
1

=

t
+
1



t
+
T
-
1





c

t
1




(


X


(

t
1

)


,

A


(

t
1

)



)



]






in the formula, Jt(X(t),A(t)) is a function of the operating cost of the micro-grid system in the moving-horizon Markov decision cycle;


Furthermore, Step 3 specifically comprises the following sub-steps of: dividing a complete scheduling cycle into a plurality of scheduling intervals, finding a solution specific to a scheduling optimization problem in each of the scheduling intervals based on the greedy algorithm respectively, and finally synthesizing locally optimal solutions to respective scheduling intervals into one basic feasible solution across the complete scheduling interval.


Furthermore, the finding a solution specific to a scheduling optimization problem in each of the scheduling intervals based on the greedy algorithm respectively in Step 3 specifically comprises the following sub-steps of:

  • (1) according to the operating cost function of the micro-grid system in a single scheduling interval in Step 2, listing the target function and the constraint conditions as follows:







min







c
t



(


X


(
t
)


,

A


(
t
)



)



=




λ


(
t
)


·


p
G



(
t
)


·
Δ






T

+




i
=
1

N



c
·

a
i

·


p
i
c



(
t
)




+




i
=
1

N



b
i







the constraint conditions are as follows:









{







p
G



(
t
)


+




i
=
1

N




p
i
c



(
t
)



+


p
B



(
t
)


+


p
w



(
t
)



=


p
D



(
t
)










pb
_



(
t
)





p
B



(
t
)





pb
_



(
t
)










pg
_



(
t
)





p
G



(
t
)





pg
_



(
t
)













pc
i

_



(
t
)





p
i
c



(
t
)







pc
i

_



(
t
)







i


=
1

,
2
,





,
N















Wherein















{






pb
_



(
t
)


=

max






{



(


E


(
t
)


-

E
_


)



/



(

Δ






T
·

α
c



)


,

-


p
B

_



}










pb
_



(
t
)


=

min






{




(


E


(
t
)


-

E
_


)

·

α
d




/


Δ





T

,


p
B

_


}










pg
_



(
t
)


=

max






{



p
G

_

,



p
G



(

t
-
1

)


-
δ


}










pg
_



(
t
)


=

min






{



p
G

_

,



p
G



(

t
-
1

)


+
δ


}











pc
i

_



(
t
)


=

max






{

0
,



v
i

_

·


H
i



(
t
)




}











pc
i

_



(
t
)


=

min






{



p
i
c

_

,



v
i

_

·


H
i



(
t
)




}


















in the formulae, pb(t) and pb(t) are new lower and upper limits of the charging/discharging power of the storage battery during building of the basic feasible solution respectively; pg(t) and pg(t) are new lower and upper limits of the exchange power between the micro-grid and main grid during the building of the basic feasible solution respectively; pci (t) and pci (t) are new lower and upper limits of the electric output of the ith combined heat and power equipment during the building of the basic feasible solution respectively;

  • (2) to obtain an optimized result in one of the scheduling intervals, ranking N+2 coefficients of the target function in an ascending order to obtain a scheduling sequence for respective decision variables, wherein the N+2 coefficients are λ(t)·ΔT, 0, c·ai(i=1, 2, . . . N), and the presence of the coefficient 0 is because the target function does not comprise pB(t); then forcing a load difference to be d(t)=pD(t)−pw(t); and finding a final optimized result according to the value of d(t) and the scheduling sequence of respective decision variables; and
  • (3) finding solutions for other scheduling intervals in turn to obtain one basic feasible solution meeting the micro-grid operating demand finally.


Furthermore, Step 4 specifically comprises the following sub-steps of:

  • (1) supposing the basic feasible solution obtained from Step 3 to be πb=(Ab,t+1, Ab,t+2, . . . , Ab,t+T−1) to obtain an approximate value of the target function for the minimum operating cost from the time t to the time t+T −1 during one real-time scheduling:









J
t

_



(


X


(
t
)


,

A


(
t
)



)


=



c
t



(


X


(
t
)


,

A


(
t
)



)


+

E




[





t
1

=

t
+
1



t
+
T
-
1





c

t
1




(


X


(

t
1

)


,

A

b
,

t
1




)



]






in the formula, Jt (X(t), A(t)) is a function of the operating cost of the micro-grid system in the moving-horizon Marcov decision cycle as solved with the Rollout algorithm

  • (2) forcing the approximate value to approach the minimum with the Rollout algorithm:








a
t

_

=

arg







min

A


(
t
)











J
t

_



(


X


(
t
)


,

A


(
t
)



)








  • (3) according to X(t) in one of the current scheduling intervals and in combination with a generated output value of wind power at the time t, finding X(t+1) in a next scheduling interval, finding the generated output value of the wind power at the time t+1 at the same time, then calculating X(t+2) in the next scheduling interval, and repeating the process until the whole scheduling cycle ends.



The present invention has the following advantages and positive effects:

  • 1. The present invention provides the method for real-time scheduling of multi-energy complementary micro-grids based on the Rollout algorithm, where at first, the moving-horizon Markov decision process model for multi-energy complementary micro-grid real-time scheduling is set up, and the constraint conditions for the real-time scheduling are established; then, a complete scheduling cycle is divided into a plurality of scheduling intervals, and one basic feasible solution meeting the constraint conditions for the real-time scheduling is found based on the greedy algorithm; and finally, a problem on the moving-horizon Markov decision for the multi-energy complementary micro-grids is solved by using the Rollout algorithm based on the basic feasible solution above. With the setup of the moving-horizon model for the multi-energy complementary micro-grids, the present invention finds the solution by using the Rollout algorithm, which is simple and effective with high practicability.
  • 2. The present invention finds the solution to the problem on the moving-horizon Markov decision model by using the Rollout algorithm, where a Markov decision model incorporating random new-energy outputs is set up at first, one basic feasible solution is found with the greedy algorithm, and the goal is approached based on this with the consideration of the fluctuations in the new-energy output, and the problems of low speed and efficiency of the traditional algorithm are solved at the same time, enabling high-speed efficient real-time scheduling for the multi-energy complementary micro-grids.





BRIEF DESCRIPTION OF THE DRAWINGS


FIG. 1 shows a process flow diagram of the present invention.





DETAILED DESCRIPTION

The embodiments of the present invention are further described in detail below with reference to the accompanying drawings:


The present invention provides a method for real-time scheduling of multi-energy complementary micro-grids based on the Rollout algorithm, which not only takes the fluctuations in the new energy outputs into consideration, but also more effectively solves the problem on the moving-horizon scheduling of the multi-energy complementary micro-grids, solving the problem on the moving-horizon Marcov decision model with the Rollout algorithm. According to the method, at first, the moving-horizon Markov decision process model for multi-energy complementary micro-grid real-time scheduling with random new-energy output is set up, and the constraint conditions and the target function for the real-time scheduling are established; then, a complete scheduling cycle is divided into a plurality of scheduling intervals, and one basic feasible solution meeting the constraint conditions for the real-time scheduling is found based on the greedy algorithm; and finally, a solution to the moving-horizon Markov decision model for multi-energy complementary micro-grids is found by using the Rollout algorithm based on the basic feasible solution above.


A method for real-time scheduling of multi-energy complementary micro-grids based on a Rollout algorithm, as shown in FIG. 1, comprises the following steps of:


Step 1, setting up a moving-horizon Markov decision process model for the real-time scheduling of the multi-energy complementary micro-grids with random new-energy outputs, and establishing constraint conditions for the real-time scheduling;


where the constraint conditions established for the real-time scheduling in Step 1 comprises: micro-grid electric equilibrium constraints, storage battery operating constraints, exchange electric power constraints for the micro-grids and a main grid, and electric power output constraints for combined heat and power equipment;


the micro-grid electric equilibrium constraints are as follows:









p
G



(
t
)


+




i
=
1

N




p
i
c



(
t
)



+


p
B



(
t
)


+


p
w



(
t
)



=


p
D



(
t
)






in the formula, t is a time parameter; pG(t) is exchange electric power for the micro-grids and the main grid at a time t, which is positive during purchasing of electricity from the main grid and negative during selling of electricity to the main grid; N is the quantity of the combined heat and power equipment; pic(t) is output electric power of the ith combined heat and power equipment at the time t; pB(1) is charging/discharging power of the storage battery at the time t, which is negative during charging and positive during discharging; pw(t) is generated output of wind power at the time t; and pD(t) is an electric load demand at the time t;


the storage battery operating constraints are as follows:








{





E


(

t
+
1

)


=


E


(
t
)


-




p
B



(
t
)


·
Δ







T
·

α
c











E


(

t
+
1

)


=


E


(
t
)


-




p
B



(
t
)


·
Δ






T


/



α
d










E
_



E


(
t
)




E
_











p
B



(
t
)







p
B

_










in the formulae, E(t) and E(t+1) are energy storage levels of the storage battery at the time t and a time t+1 respectively; E and Ē are upper and lower boundaries of the capacity of the storage battery respectively; ΔT is a time interval from the time t to the time t+1; ac and ad are charging and discharging efficiencies of the storage battery respectively; pB(t) is charging/discharging power of the storage battery at the time t, which is negative during charging and positive during discharging; and pB is an upper limit value of the charging/discharging power of the storage battery;


the exchange electric power constraints for the micro-grids and the main grid are as follows:








{








p
G



(
t
)







p
G

_












p
G



(
t
)


-


p
G



(

t
-
1

)






δ









in the formulae, pG(t) and pG(t−1) are exchange electric power between the micro-grids and the main grid at the times t and t−1 respectively, which is positive during purchasing of electricity from the main grid and negative during selling of electricity to the main grid; pG is an upper limit value of the exchange electric power between the micro-grids and the main grid, and δ is an upper fluctuation limit of the exchange electric power between the micro-grids and the main grid;


the electric power output constraints for the combined heat and power equipment are as follows:








{





0



p
i
c



(
t
)





p
i
c

_


,

i
=
1

,
2
,





,
N









v
i

_

·


H
i



(
t
)






p
i
c



(
t
)






v
i

_

·


H
i



(
t
)












in the formulae, pic(t) is output electric power of the ith combined heat and power equipment at the time t; pic is an upper limit of the output electric power of the ith combined heat and power equipment; Hi(t) is thermal power required to be supplied by the ith combined heat and power equipment at the time t; vi and vi are lower and upper limits of electric-thermal power conversion efficiency of the ith combined heat and power equipment.


Step 2, establishing a target function of the real-time scheduling for the moving-horizon Markov decision process model for the real-time scheduling of the multi-energy complementary micro-grids with random new-energy outputs, with the goal of minimum operating cost of a micro-grid system in a moving-horizon Markov decision cycle;


wherein when in a grid-connected state with the main grid, the multi-energy complementary micro-grids can exchange electricity with the main grid, with energy supply equipment comprising wind driven generators, combined heat and power (CHP) equipment, and storage batteries; and the target function for real-time scheduling is to achieve the minimum operating cost, including system electricity purchasing cost and fuel cost of the CHP equipment, for the micro-grid system.


the establishing a target function of the real-time scheduling for the moving-horizon Markov decision process model for the real-time scheduling of the multi-energy complementary micro-grids with random new-energy outputs specifically comprises the following sub-steps of: first setting up an operating cost function of the micro-grid system at a single scheduling interval with the goal of minimum operating cost of the micro-grid system at the single scheduling interval, and then establishing a target function of the real-time scheduling with the goal of the minimum operating cost of the micro-grid system in the moving-horizon Markov decision cycle;


the operating cost function of the micro-grid system at the single scheduling interval is as follows:











c
t



(


X


(
t
)


,

A


(
t
)



)


=




λ


(
t
)


·


p
G



(
t
)


·
Δ






T

+

c
·




i
=
1

N




F
i
c



(
t
)
















Wherein















{






F
i
c



(
t
)


=



a
i

·


p
i
c



(
t
)



+

b
i









X


(
t
)


=

[


E


(
t
)


,


p
G



(

t
-
1

)


,


p
w



(
t
)



]








A


(
t
)


=

[



p
i
c



(
t
)


,


p
G



(
t
)


,


p
B



(
t
)



]

















in the formulae, X(t) is a state variable of the micro-grid system at the time t; A(t) is a control variable of the micro-grid system at the time t; ct(X(t),A(t)) is a function of system operating cost at the single scheduling interval; λ(t) is a grid electricity price at the time t; c is a price of natural gas; Fic(t) is a linear function between a gas consumption and an electric output of the ith combined heat and power equipment; and ai and bi are coefficients of the linear function between the gas consumption and the electric output of the ith combined heat and power equipment;


the target function of the real-time scheduling is as follows:







min







J
t



(


X


(
t
)


,

A


(
t
)



)



=



c
t



(


X


(
t
)


,

A


(
t
)



)


+

E




[





t
1

=

t
+
1



t
+
T
-
1





c

t
1




(


X


(

t
1

)


,

A


(

t
1

)



)



]






in the formula, Jt(X(t),A(t)) is a function of the operating cost of the micro-grid system in the moving-horizon Markov decision cycle;


Step 3, dividing a single complete scheduling cycle into a plurality of scheduling intervals, and finding one basic feasible solution meeting the constraint conditions for the real-time scheduling based on a greedy algorithm;


where Step 3 specifically comprises the following sub-steps of: dividing a complete scheduling cycle into a plurality of scheduling intervals, finding a solution specific to a scheduling optimization problem in each of the scheduling intervals based on the greedy algorithm respectively, and finally synthesizing locally optimal solutions to respective scheduling intervals into one basic feasible solution across the complete scheduling interval.


the finding a solution specific to a scheduling optimization problem in each of the scheduling intervals based on the greedy algorithm respectively in Step 3 specifically comprises the following sub-steps of:

  • (1) according to the operating cost function of the micro-grid system in a single scheduling interval in Step 2, listing the target function and the constraint conditions as follows:







min







c
t



(


X


(
t
)


,

A


(
t
)



)



=




λ


(
t
)


·


p
G



(
t
)


·
Δ






T

+




i
=
1

N



c
·

a
i

·


p
i
c



(
t
)




+




i
=
1

N



b
i







the constraint conditions are as follows:









{







p
G



(
t
)


+




i
=
1

N




p
i
c



(
t
)



+


p
B



(
t
)


+


p
w



(
t
)



=


p
D



(
t
)










pb
_



(
t
)





p
B



(
t
)





pb
_



(
t
)










pg
_



(
t
)





p
G



(
t
)





pg
_



(
t
)













pc
i

_



(
t
)





p
i
c



(
t
)






pc
i



(
t
)







i


=
1

,
2
,





,
N















wherein















{






pb
_



(
t
)


=

max






{



(


E


(
t
)


-

E
_


)



/



(

Δ






T
·

α
c



)


,

-


p
B

_



}










pb
_



(
t
)


=

min






{




(


E


(
t
)


-

E
_


)

·

α
d




/


Δ





T

,


p
B

_


}










pg
_



(
t
)


=

max






{



p
G

_

,



p
T



(

t
-
1

)


-
δ


}










pg
_



(
t
)


=

min






{



p
G

_

,



p
G



(

t
-
1

)


+
δ


}











pc
i

_



(
t
)


=

max






{

0
,



v
i

_

·


H
i



(
t
)




}











pc
i

_



(
t
)


=

min






{



p
i
c

_

,



v
i

_

·


H
i



(
t
)




}


















in the formulae, pb (t)and pb(t) are new lower and upper limits of the charging/discharging power of the storage battery during building of the basic feasible solution respectively; pg(t) and pg(t) are new lower and upper limits of the exchange power between the micro-grid and main grid during the building of the basic feasible solution respectively; pci (t) and pci (t) are new lower and upper limits of the electric output of the ith combined heat and power equipment during the building of the basic feasible solution respectively;

  • (2) to obtain an optimized result in one of the scheduling intervals, ranking N+2 coefficients of the target function in an ascending order to obtain a scheduling sequence for respective decision variables, wherein the N+2 coefficients are λ(t)·ΔT, 0, c·ai (i=1, 2, . . . , N), the presence of the coefficient 0 is because the target function does not comprise pB (t); then forcing a load difference to be d(t)=pD(t)−pw(t); and finding a final optimized result according to the value of d(t) and the scheduling sequence of respective decision variables; and
  • (3) finding solutions for other scheduling intervals in turn to obtain one basic feasible solution meeting the micro-grid operating demand finally.


Step 4, finding a solution to the moving-horizon Markov decision process model for the real-time scheduling of the multi-energy complementary micro-grids by using the Rollout algorithm based on the basic feasible solution from Step 3.


Step 4 specifically comprises the following sub-steps of:

  • (1) supposing the basic feasible solution obtained from Step 3 to be πb=(Ab,t+1, Ab,t+2, . . . , Ab,t+T−1) to obtain an approximate value of the target function for the minimum operating cost from the time t to the time t+T−1 during one real-time scheduling:









J
t

_



(


X


(
t
)


,

A


(
t
)



)


=



c
t



(


X


(
t
)


,

A


(
t
)



)


+

E




[





t
1

=

t
+
1



t
+
T
-
1





c

t
1




(


X


(

t
1

)


,

A

b
,

t
1




)



]






in the formula, Jt (X(t),A(0) is a function of the operating cost of the micro-grid system in the moving-horizon Marcov decision cycle as solved with the Rollout algorithm

  • (2) forcing the approximate value to approach the minimum with the Rollout algorithm:








a
t

_

=

arg







min

A


(
t
)







J
t

_



(


X


(
t
)


,

A


(
t
)



)








  • (3) according to X(t) in one of the current scheduling intervals and in combination with a generated output value of wind power at the time t, finding X (t+1) in a next scheduling interval, finding the generated output value of the wind power at the time t+1 at the same time, then calculating X(t+2) in the next scheduling interval, and repeating the process until the whole scheduling cycle ends.



It should be noted that the described embodiments of the present invention are for an illustrative purpose rather than a limiting purpose, and the present invention thus includes but not limited to the embodiments described in the Description of Preferred Embodiments. Any other embodiments obtained by those skilled in the art according to the technical solution of the present invention likewise fall within the protection scope of the present invention.

Claims
  • 1. A method for real-time scheduling of multi-energy complementary micro-grids based on a Rollout algorithm, characterized by comprising the following steps of: Step 1, setting up a moving-horizon Markov decision process model for the real-time scheduling of the multi-energy complementary micro-grids with random new-energy outputs, and establishing constraint conditions for the real-time scheduling;Step 2, establishing a target function of the real-time scheduling for the moving-horizon Markov decision process model for the real-time scheduling of the multi-energy complementary micro-grids with the random new-energy outputs, with the goal of minimum operating cost of a micro-grid system in a moving-horizon Markov decision cycle;Step 3, dividing a single complete scheduling cycle into a plurality of scheduling intervals, and finding one basic feasible solution meeting the constraint conditions for the real-time scheduling based on a greedy algorithm; andStep 4, finding a solution to the moving-horizon Markov decision process model for the real-time scheduling of the multi-energy complementary micro-grids by using the Rollout algorithm based on the basic feasible solution from Step 3.
  • 2. The method for real-time scheduling of multi-energy complementary micro-grids based on the Rollout algorithm according to claim 1, characterized in that the constraint conditions established for the real-time scheduling in Step 1 comprises: micro-grid electric equilibrium constraints, storage battery operating constraints, exchange electric power constraints for the micro-grids and a main grid, and electric power output constraints for combined heat and power equipment; the micro-grid electric equilibrium constraints are as follows:
  • 3. The method for real-time scheduling of multi-energy complementary micro-grids based on the Rollout algorithm according to claim 1, wherein the establishing a target function of the real-time scheduling for the moving-horizon Markov decision process model for the real-time scheduling of the multi-energy complementary micro-grids with random new-energy outputs in Step 2 specifically comprises the following sub-steps of: first setting up an operating cost function of the micro-grid system at a single scheduling interval with the goal of minimum operating cost of the micro-grid system at the single scheduling interval, and then establishing a target function of the real-time scheduling with the goal of the minimum operating cost of the micro-grid system in the moving-horizon Markov decision cycle; the operating cost function of the micro-grid system at the single scheduling interval is as follows:
  • 4. The method for real-time scheduling of multi-energy complementary micro-grids based on the Rollout algorithm according to claim 1, wherein Step 3 specifically comprises the following sub-steps of: dividing a complete scheduling cycle into a plurality of scheduling intervals, finding a solution specific to a scheduling optimization problem in each of the scheduling intervals based on the greedy algorithm respectively, and finally synthesizing locally optimal solutions to respective scheduling intervals into one basic feasible solution across the complete scheduling interval.
  • 5. The method for real-time scheduling of multi-energy complementary micro-grids based on the Rollout algorithm according to claim 4, characterized in that the finding a solution specific to a scheduling optimization problem in each of the scheduling intervals based on the greedy algorithm respectively in Step 3 specifically comprises the following sub-steps of: (1) according to the operating cost function of the micro-grid system in a single scheduling interval in Step 2, listing the target function and the constraint conditions as follows:
  • 6. The method for real-time scheduling of multi-energy complementary micro-grids based on the Rollout algorithm according to claim 1, wherein Step 4 specifically comprises the following sub-steps of: (1) supposing the basic feasible solution obtained from Step 3 to be πb=(Ab,t+1, Ab,t+2, . . . , Ab,t+T−1) to obtain an approximate value of the target function for the minimum operating cost from the time t to the time t+T−1 during one real-time scheduling:
Priority Claims (1)
Number Date Country Kind
201710168834.7 Mar 2017 CN national
PCT Information
Filing Document Filing Date Country Kind
PCT/CN2017/109862 11/8/2017 WO 00