The present invention relates generally to systems and methods for analyzing and modifying utility customer energy consumption patterns.
The drive towards more sustainable power supply systems has enabled significant growth of renewable generation. This in turn has pushed the rollout of demand response (DR) programs to address a larger population of consumers. Utilities are interested in enrolling small and medium sized customers that can provide demand curtailment during periods of shortfall in renewable production.
It then becomes important to be able to target the right customers among the large population, since each enrollment has a cost. Moreover, the power curtailment potential across customers varies significantly. Currently, however, most demand response targeting relies on segmentation of customers based on their monthly billing data or surveys.
The goal of a DR program is to elicit flexibility from loads by reducing or shifting consumption in response to external signals such as prices or curtailment indicators. Typically a program is designed to extract a targeted level of energy or power from all the participating loads. The program operation yield is the ratio between the actually extracted energy from the participants and the target level. Current program yields are low, in the range of 10% to 30%. Thus, there exists a need for improved methods for targeting utility customers for program enrollment.
Existing approaches to targeting rely on demographic variables to segment consumers and target them. They also do not account for system constraints. The approach of embodiments of the present invention avoids this problem by incorporating data-driven response models for this purpose and an optimization formulation that can account for costs and network system constraints.
In one aspect, the invention provides a methodology that utilizes utility resource consumption (electricity, gas or water) data from individual consumers to optimize the targeting and operation of demand management programs and improve the behavioral response of consumers. The methodology may include, as appropriate, (1) data-driven functional and behavioral response models and/or (2) targeting algorithm that can account for target goals, recruitment costs, physical system constraints and various uncertainties in these quantities. In the present context, targeting means selecting customers for enrollment in a program in offline manner, or including customers in real-time in program operations. In the latter, the response models below are estimated for each round. In the present context, programs are utility resource consumption management programs, such as demand management programs.
A significant feature of embodiments of the invention is the full data-driven approach, including an optimization algorithm that is aware of uncertainties, all of these applied to energy consumption data. Embodiments of the current invention significantly provide targeting recruitment in utility programs (demand response, energy efficiency); increasing consumer behavior change response to energy data. Additional aspects of embodiments of this invention may be refined by approaching different programs and performance goals.
There are variants of the approach: the targeting can rely on measured response models, or targeting can occur progressively, where after a certain number of consumers is targeted, their performance is measured and the response model is updated.
In one aspect, the invention provides a method implemented by a computer for enrolling utility customers in utility demand response or energy efficiency programs based on time-series consumption data. The method includes collecting from smart meter sensors time-series utility consumption data from individual utility customers, extracting by the computer features from the consumption data, generating a probabilistic response model for each customer representing a relationship between the extracted features and customer program performance by estimating parameters of a response distribution, solving an optimization problem for each customer using the estimated response distribution to achieve a targeting objective, and enrolling selected customers in programs based on the solution to the optimization problem.
In some embodiments, solving the optimization problem is performed periodically. The optimization problem may include a reliability constraint that captures behavioral compliance to a demand response signal, wherein the behavioral compliance is represented by a compliance response model dependent upon consumer characteristics, local environmental characteristics, and time of day. The method may additionally include selecting a probabilistic response (targeting) model from among multiple probabilistic response (targeting) models. It may also include communicating to the selected customers the selected programs.
As shown in
A targeting optimization program according to an embodiment of the invention is able to tradeoff between uncertainty and selection cost, with a goal to achieve a set demand response targeting goal with high probability during a DR event.
Note that, if ck is same for all customers, the budget constraint is the same as the constraint of limiting the number of participating customers by N, a limit of customer enrollment (i.e., Σckxk≦CΣxk≦N if ck same, where the sums are over k from 1 to K). K is the total number of potential customers that can provide DR service. The problem can be stated as shown in 402, i.e.,
maxxP(Σrkxk≧T) s.t. Σxk≦N
where the sums are over k from 1 to K. The goal of the program is to maximize the likelihood of saving at least T kWh, given that we are limited to selecting at most N customers among K candidates. Maximizing the probability of achieving T is equivalent to reducing the risk of failing to meet the target during a DR event. Solving this stochastic knapsack problem (SKP) requires the probability distribution of the customer's response. This formulation generalizes to any linear program constraint.
The program maximizes the reliability of the DR program by recruiting customers within the program budget. The optimal reliability for budget C and target T is given by objective function value p*(C,T). The function captures the tradeoff between DR availability and DR reliability for a budget C. The function has some important properties that conform to our intuition about the tradeoff The objective function is monotonic decreasing in T so p*(C,T1)≧p*(C,T2) if T1≧T2. The budget determines the constraints so p*(C1,T)≦p*(C2,T) if C1≧C2.
The proposed optimization problem is a stochastic knapsack problem (SKP). SKPs are stochastic integer programs known to be NP-hard. Here we provide an efficient approximation algorithm that scales to K in millions of customers. The efficient algorithm is used then to compute the function p*(C,T).
An important additional assumption is that K is large and C is sufficiently large, so a significant number of customers are included. In that case, given a set of random variables for the response rk, the total group response is approximately Gaussian distributed.
The linear constraint in the optimization problem 402 does not need to be confined to the number of customer enrollment. Any linear constraint can be added. For example, we can add a cost constraint if the enrollment cost is different for each customer. Also, certain physical system constraints can be implemented as linear constraints, e.g., the number of customer enrollment in a sub-system is limited.
maxμ
Alternatively, the SKP reduces to multiple knapsack problems (KP) on the assumption that all the σk2s are integers and all users are independent (i.e., the matrix Σ is diagonal). With these strong assumptions, this method is quite computationally efficient. An even more straightforward approach is to extend the concept of a greedy approximation algorithm utilized to solve specific instances of the knapsack problem. It utilizes the per customer risk-reward ratio μk/σk to sort and rank customers who are offering sufficient benefit. In the figure, x is the recruitment vector, N indicates a normal distribution, μ is a vector of individual response means μk and a covariance matrix with covariances Σjk between responses. In practice, if the number of customers selected is in the order of 50, the response distribution is very close to normal. The figure also shows how the objective function changes on the assumption that the responses are following a joint Gaussian distribution. We note it is very important that the optimal solution is one of the extreme points in the transferred domain. Finding the optimal combination of customers in original domain by constant times of sorting (or linear programming if there are other linear constraints beside the number of customer enrollment constraint), which makes the optimal extreme point in transferred domain a key point of our heuristic algorithm.
The method may also include minimizing the expected penalty when a DR event cannot achieve the targeted energy saving T, i.e.,
minxqE(T−Σrkxk)+s.t. Σckxk<C)
where q is a penalty parameter with measure ($/kWh), and where the sums are over k from 1 to K. The interpretation is that we minimize the cost from utility side for buying additional energy when the energy saving cannot meet the target. This problem can be solved by exactly the same algorithm (used to solve the SKP problem in 402) after proving the objective is also quasi-concave in the concerned domain.
Additionally, the method may also include minimizing the cost for guaranteeing a certain level of probability, p, of achieving targeted energy saving for every DR event hour, i.e.,
minxΣckxk<C s.t. P(Σrk,txk≦Tt)≧p, t∈hdr
where hdr means DR event hours, and we add one more subscript t to the targeted energy savings Tt, and where the sums are over k from 1 to K. The customers are selected who can save much for every hour in DR event duration. This problem can be solved as a SOCP (second order cone programming) after relaxing x vector as 0<xk<1.
The sub-problem, basically quadratic programming (QP), shown in
Depending on the targeting program, the response model can vary. Or, even in the same program targeting, there can be multiple response models. Thus, the key point of our targeting methodology is not how to set up the response model, but how efficiently we can select the small number of proper customers from a large population using the given information on the customers' response distribution.
Suppose the response modeling is done and the customers' estimated response distribution information is provided. Then, there can be several problem settings to select the proper customers depending on the needs of practitioners. For example, we can select the customers who can maximize the probability of achieving the targeted energy saving. Or, we can try to minimize the expected penalty when the targeted energy saving is not accomplished. Also, we can minimize the cost of achieving the targeted energy saving. The details of these three types of problem settings are described in one of the attached papers. All the problems started from the first problem setting, which is a stochastic knapsack problem (SKP). SKP is a combinatorial optimization problem which is well-known as a NP-hard problem. Thus, we developed an efficient novel heuristic with guaranteeing the approximate optimality.
Our targeting methodology is not confined to a specific DR program targeting. It can be utilized in recruiting the customers for any DR or EE (energy efficiency) program because our three-stage (filtering, response modeling and selection) targeting approach is flexible enough to cover other types of applications. For example, for any energy policy adoption or any targeting a certain type of people from a large population, if the objective function can be represented as one of our problem formats, basically our approach can work effectively. This makes sense because the response, reward or any type of reaction against a certain energy policy adoption or other program should be a probabilistic variable rather than a deterministic variable. Moreover, when we aggregate many variables (each variable represent a reaction of each entity in the program or policy) which are not totally dependent, it is natural and reasonable to assume following a Gaussian distribution. Additionally, the filtering process and response modeling can be changed properly depending on the given application.
The targeting methodology provided here has several important benefits:
1. the customer selection problem settings (which are based on the stochastic knapsack problem) are general.
2. the heuristic algorithm to solve the SKP is very efficient and scalable on a larger population or larger data sets.
3. by the filtering process (if possible), it makes the customer selection problem solving faster and enables analyzing even more in detail on small number of eligible customers.
Preferred embodiments of the invention make use of a variety of machine learning algorithms. For example, we fit one of the consumption models using EM (expectation maximization) style process. Moreover, we developed an efficient heuristic to solve a stochastic knapsack problem (SKP) with approximate optimality proof, which requires only constant times of solving a linear or quadratic programming problem. In our approach, if we can assume the customers are independent, the SKP becomes only constant times of sorting certain scores among all customers. For a modified problem (the problem minimizing the penalty in the attached paper), we proved its quasi-concavity and it can be also solved in the same heuristic we developed. As a byproduct of quasi-concavity proof, we obtained a tighter lower bound for the complementary cumulative distribution function for some range in a standard normal random variable.
In step 902 features are extracted from the data, and in step 904, depending on a selected DR program, probabilistic response model(s) are generated for each customer based on the extracted features. The probabilistic response model reflects the relationship between feature and program performance. The response model gives a predicted output for a customer conditional on external factors, customer dependent factors, etc. For example, feature can be temperature sensitivity, and program performance is the number of kWh a customer generates in a program. The probabilistic response model is then utilized for targeting. The response model gives a predicted output for a customer conditional on external factors, customer dependent factors, etc. The customer response model specification depends on the design of the demand response program. Consider a Global Temperature Adjustment (GTA) program for HVAC (air conditioning) systems. Such a program increases the temperature set point of the air conditioner for each customer by a fixed amount to reduce cooling power consumption. Selecting customers with high energy saving potential during a DR event day and hour requires an accurate model to estimate the total energy consumed at each set point level. For example, The power consumption of a customer k at time t on day d, is modeled as a function of outside temperature, HVAC temperature break-point, cooling sensitivity, heating sensitivity, and base load. If HVAC consumption is independently metered, a simple model can be built utilizing the observed consumption, external temperature and the utilized set point. In general, though, only total home consumption and external temperature are observed.
Model learning is performed in two steps. Minimization of residual sum of squares (RSS) is utilized to learn the parameters of the model and the distribution of the error from the observed data. An F-test is utilized to prevent over-fitting. The overall computation needed to fit the consumption model is to solve (at most) 20 linear regression models: one for each potential value of the breakpoint (at most 19 for the integer breakpoints between 68° F. and 86° F., which is typical) and one for the basic model (the case when cooling sensitivity is the same with heating sensitivity).
The targeting model selection procedure can include an optional step of filtering target choices according to their load profile characteristics.
In some embodiments, the targeting incorporates a discrete choice model or other choice model options that predict consumer propensity to enroll. This can be directly treated in the same manner as with reliability.
Once the model is generated and selected, it is used to estimate the parameters of a response distribution.
In step 906, based on the estimated response distribution, targeting is performed by solving an optimization problem (or several) depending on the targeting objective. For example, the targeting maximizes the probability of achieving the target, constrained on a budget cost (number of users, dollars, etc.), and other additional constraints such as variance of response, conditional value at risk and value of risk of response.
The targeting method includes for each customer a reliability constraint to capture the behavioral compliance to a demand response signal. The compliance is given by a compliance response model and it is a function of consumer characteristics, local environmental characteristics, time of day and other factors. The reliability constraint can be directly included in the objective function of the optimization. If reliability for customer i is γi and response is ri, a new compliance adjusted response {tilde over (r)}i=γiri, is defined and can be used in the regular targeting algorithm.
In some embodiments, the targeting method is separately performed periodically (e.g., each hour), or performed for multiple hours jointly (providing a best enrollment for a target shape).
Some embodiments may include computing a target response curve of desired response level (corresponding to DR availability) vs. required number of consumers (corresponding to DR cost) with guaranteeing a certain DR reliability. If a practitioner has this curve, he or she can decide a sweet spot in the curve as the DR operation point considering the constraints (e.g., budget, target energy saving).
In step 908, enroll the customers in the programs, perhaps involving communicating first with the selected customers to encourage them to enroll in a certain DR program, and enrolling them based on their response.
In some embodiments, compliance factors may be integrated into the customer selection optimization problems on the assumption that each customer's compliance variable is independent from his response variable and is provided in eligible types:
1. A fixed constant, a kind of discount rate.
2. A Bernoulli random variable with a success probability.
A customer's compliance may be integrated into any customer selection problem by changing the response variable from rk to akrk. Moreover, we can solve the problems on the assumption that the sum of responses is following a Gaussian distribution, with different Gaussian distribution parameters.
This application claims priority from U.S. Provisional Patent Application 61/914,681 filed Dec. 11, 2013 and from U.S. Provisional Patent Application 61/914,703 filed Dec. 11, 2013, both of which are incorporated herein by reference.
This invention was made with Government support under grant (or contract) no. DE-AR0000018 awarded by the Department of Energy. The Government has certain rights in the invention.
Number | Date | Country | |
---|---|---|---|
61914703 | Dec 2013 | US | |
61914681 | Dec 2013 | US |