The number of sensors and controllers connected to the electric power system is expected to grow by several orders of magnitude over the next several years. However, the information networks that are currently used to transmit and analyze data on the system are ill-equipped to handle the volume of communications resulting from the increased number of sensors. For example, the current information networks are incapable of delivering large amounts of data collected by the sensors in a predictable, time-effective, and reliable manner.
Without the ability to effectively manage and use data from the sensors, the deployment of sensors into the power grid (e.g., phasor measurement sensors into the transmission system and smart meters into the distribution system) will not result in the desired improvements. For example, existing bidding strategies for wholesale market are not readily adaptable to thermostatically controlled loads implemented at the consumer side. Further, requirements for multiple bidding iterations, large amounts of bid data, and an inability to encode private information into consumer bids hampers deployment of successful strategies for control schemes for demand response. Accordingly, there is ample opportunity for improved systems, methods, and apparatus for managing and using data in a power grid or other electric power distribution system.
Apparatus and methods are disclosed for the design, analysis, testing, and manufacture of devices used to coordinate groups of thermostatically controlled loads (TCLs) to achieve system-level objects with price incentives. In some examples, the framework is based on improving social welfare of the system subject to one or more feeder power constraints. In the framework, each individual load submits a bid to the market based on its current state. The market collects all the bids and determines the cleared price accordingly. After receiving the cleared price, each individual load makes the local control decision to maximize its utility. This framework can provide a solid mathematical foundation to the disclosed transactive control techniques.
In some examples of the disclosed technology, a joint state and parameter estimation framework using expectation maximization is applied to determine consumer bidding price/quantity pairs. A market coordinator evaluates received bids and transmits a clearing price to bidding consumers, which in turn operate (or turn off) controlled loads based on the clearing price. The disclosed examples of such a coordination framework can effectively improve the efficiency of power grid operation and reduce power grid congestion at key times, as will be further illustrated in the figures and detailed description below.
In some examples of the disclosed technology, a method of operating a load coupled to a thermal environment with power received from a power grid by submitting bids to a coordinator includes estimating a set of values for one or more unmeasured parameters of the thermal environment based at least in part on a plurality of output measurements of the thermal environment, determining an energy response based on the estimated set of values for the unmeasured parameters, and transmitting a bid for power for a finite time period based on the determined energy response to the coordinator. In some examples, the method further includes receiving a clearing price from the coordinator responsive to the transmitted bid based at least in part on the transmitted bid and on bids received from a plurality of additional loads, and responsive to the received clearing price, determining to send power received from the power grid to the load.
In some examples, the estimating includes determining a distribution for a system state vector conditioned on the output measurements, the output measurements including a time-ordered sequence of air temperatures, the system state vector being based in part on values determined for one or more model parameters. In some examples, the estimating includes updating a set of values for one or more model parameters to substantially maximize a likelihood function. In some examples, the energy response is determined based at least in part on the following: a measured air temperature, a control deadband value, and/or the set of unmeasured parameters. In some examples, the set of values for the one or more unmeasured parameters is determined using an equivalent thermal parameter model to estimate an inner mass temperature of the thermal environment. In some examples the method includes estimating a state trajectory and determining the energy response based at least in part on the state trajectory.
In some examples, the estimating the parameters includes performing an initial estimation of values for the one or more unmeasured parameters and determining a conditional distribution of a state of the thermal environment using a recursive Bayes filter. In some examples, the recursive Bayes filter is a Kalman filter.
In some examples, estimating set of values for the unmeasured parameters includes performing an initial estimation of values for a set of model parameters, determining a conditional distribution of a set of state values for the thermal environment, determining new values for the set of model parameters selected to maximize a likelihood function based on the conditional distribution, repeating the determining the conditional distribution and the determining new values for the set of model parameters until a set of convergence criteria is met, and, after the convergence criteria has been met, providing an estimation of state trajectory and an updated set of parameters for determining the energy response.
In some examples of the disclosed technology, a controller for operating a thermostatically-controlled load includes one or more sensors configured to generate temperature data used to determine the energy response, a network adapter configured to transmit the bid to the coordinator, one or more processors, one or more actuators configured to activate and/or deactivate the thermostatically-controlled load responsive to one or more signals received from the processors, and one or more computer-readable storage media storing computer-executable instructions that when executed by the processors, cause the controller to perform any one or more of the methods disclosed herein.
In some examples of the disclosed technology, a coordinator is configured to send clearing prices for a retail power market to a plurality of controllers configured to operate thermostatically-controlled loads, the coordinator comprising one or more processors configured to: receive one respective bid for each of the loads, each of the plurality of received bids being generated using one of the disclosed methods, determining a clearing price for the plurality of received bids, and transmitting the clearing price to each of the loads.
In some examples of the disclosed technology, a method of allocating power to a plurality of loads coupled to a power grid includes receiving one respective bid for each of the loads, each of the plurality of received bids being generated based on an energy response determined by estimating a respective set of values for one or more unmeasured parameters of a thermal environment of each of the loads, determining a clearing price for the plurality of received bids; and transmitting the clearing price to each of the loads. In some examples, a method of allocating power further comprises sending power to a selected one or more of the loads, the loads being selected based on the received bids and the clearing price.
In some examples of the disclosed technology, a market-based control system is configured to coordinate a group of thermostatically controlled loads to achieve system-level objectives with pricing incentives, the system including a market coordinator configured to generate clearing price data based on a plurality of bids specifying a quantity and a price for consuming power, a plurality of thermostatically controlled loads (TCLs). Each of the TCLs is configured to transmit bid data to the market coordinator specifying a bid quantity and a bid price for power received via a power grid for a predetermined time period. The transmitted bid data is based at least in part on estimating unmeasured parameters for a thermal environment to which each of the respective TCLs is coupled to supply heating or cooling to, each of the TCLs being further configured to consume or not consume power from the power grid based at least in part on the clearing price data and the TCLs' respective bid for the predetermined time period. The system further includes a computer network configured to transmit the bid data and the clearing price data between the market coordinator and each of the TCLs.
In some examples, the system includes a power grid configured to distribute power to the TCLs based at least in part on a market cleared by the market coordinator. In some examples, the system includes a power generation market administrator configured to send wholesale energy price data to the market coordinator, the wholesale energy price data being used at least in part to determine the clearing price data.
In some examples of the disclosed technology, the consumer bid includes one price and one corresponding quantity. In some examples, one or more computer-readable storage media storing computer-executable instructions that when executed by a computer, cause the computer to perform any of the disclosed methods. In some examples, a power grid includes an electric power distribution system configured to transmit electric power from one or more power sources to a plurality of thermostatically controlled loads, and a market coordinator configured to perform any one or more of the disclosed methods.
This Summary is provided to introduce a selection of concepts in a simplified form that are further described below in the Detailed Description. This Summary is not intended to identify key features or essential features of the claimed subject matter, nor is it intended to be used to limit the scope of the claimed subject matter. The foregoing and other objects, features, and advantages of the disclosed technology will become more apparent from the following detailed description, which proceeds with reference to the accompanying figures.
This disclosure is set forth in the context of representative embodiments that are not intended to be limiting in any way.
As used in this application the singular forms “a,” “an,” and “the” include the plural forms unless the context clearly dictates otherwise. Additionally, the term “includes” means “comprises.” Further, the term “coupled” encompasses mechanical, electrical, magnetic, optical, as well as other practical ways of coupling or linking items together, and does not exclude the presence of intermediate elements between the coupled items. Furthermore, as used herein, the term “and/or” means any one item or combination of items in the phrase.
The systems, methods, and apparatus described herein should not be construed as being limiting in any way. Instead, this disclosure is directed toward all novel and non-obvious features and aspects of the various disclosed embodiments, alone and in various combinations and subcombinations with one another. The disclosed systems, methods, and apparatus are not limited to any specific aspect or feature or combinations thereof, nor do the disclosed things and methods require that any one or more specific advantages be present or problems be solved. Furthermore, any features or aspects of the disclosed embodiments can be used in various combinations and subcombinations with one another.
Although the operations of some of the disclosed methods are described in a particular, sequential order for convenient presentation, it should be understood that this manner of description encompasses rearrangement, unless a particular ordering is required by specific language set forth below. For example, operations described sequentially may in some cases be rearranged or performed concurrently. Moreover, for the sake of simplicity, the attached figures may not show the various ways in which the disclosed things and methods can be used in conjunction with other things and methods. Additionally, the description sometimes uses terms like “produce,” “generate,” “display,” “receive,” “evaluate,” “determine,” “send,” “transmit,” and “perform” to describe the disclosed methods. These terms are high-level descriptions of the actual operations that are performed. The actual operations that correspond to these terms will vary depending on the particular implementation and are readily discernible by one of ordinary skill in the art.
Theories of operation, scientific principles, or other theoretical descriptions presented herein in reference to the apparatus or methods of this disclosure have been provided for the purposes of better understanding and are not intended to be limiting in scope. The apparatus and methods in the appended claims are not limited to those apparatus and methods that function in the manner described by such theories of operation.
Any of the disclosed methods can be implemented as computer-executable instructions stored on one or more computer-readable media (e.g., non-transitory computer-readable storage media, such as one or more optical media discs, volatile memory components (such as DRAM or SRAM), or nonvolatile memory components (such as hard drives and solid state drives (SSDs))) and executed on a computer (e.g., any commercially available computer, including smart phones or other mobile devices that include computing hardware). Any of the computer-executable instructions for implementing the disclosed techniques, as well as any data created and used during implementation of the disclosed embodiments, can be stored on one or more computer-readable media (e.g., non-transitory computer-readable storage media). The computer-executable instructions can be part of, for example, a dedicated software application, or a software application that is accessed or downloaded via a web browser or other software application (such as a remote computing application). Such software can be executed, for example, on a single local computer (e.g., as a process executing on any suitable commercially available computer) or in a network environment (e.g., via the Internet, a wide-area network, a local-area network, a client-server network (such as a cloud computing network), or other such network) using one or more network computers.
For clarity, only certain selected aspects of the software-based implementations are described. Other details that are well known in the art are omitted. For example, it should be understood that the disclosed technology is not limited to any specific computer language or program. For instance, the disclosed technology can be implemented by software written in C, C++, Java, or any other suitable programming language. Likewise, the disclosed technology is not limited to any particular computer or type of hardware. Certain details of suitable computers and hardware are well-known and need not be set forth in detail in this disclosure.
Furthermore, any of the software-based embodiments (comprising, for example, computer-executable instructions for causing a computer to perform any of the disclosed methods) can be uploaded, downloaded, or remotely accessed through a suitable communication means. Such suitable communication means include, for example, the Internet, the World Wide Web, an intranet, software applications, cable (including fiber optic cable), magnetic communications, electromagnetic communications (including RF, microwave, and infrared communications), electronic communications, or other such communication means.
The disclosed methods can also be implemented by specialized computing hardware that is configured to perform any of the disclosed methods. For example, the disclosed methods can be implemented by an integrated circuit (e.g., an application specific integrated circuit (“ASIC”) or programmable logic device (“PLD”), such as a field programmable gate array (“FPGA”)). The integrated circuit or specialized computing hardware can be embedded in or directly coupled to an electrical device (or element) that is configured to interact with controllers and coordinators. For example, the integrated circuit can be embedded in or otherwise coupled to a generator (e.g., a wind-based generator, solar-based generator, coal-based generator, or nuclear generator), an air-conditioning unit; heating unit; heating, ventilation, and air conditioning (“HVAC”) system; hot water heater; refrigerator; dish washer; washing machine; dryer; oven; microwave oven; pump; home lighting system; electrical charger, electric vehicle charger; or home electrical system.
Methods and apparatus are disclosed for implementing market based control frameworks to coordinate a group of autonomous Thermostatically Controlled Loads (TCL) to achieve system-level objectives with pricing incentives.
Examples of TCLs that can be coordinated according to the disclosed technology include air conditioners, heat pumps, hot water heaters, refrigerators, plug-in hybrid electric vehicles, and commercial and industrial loads. The electricity consumption of TCL can be modulated while still meeting desired end-user temperature requirements due to the inherent thermal storage properties of TCLs.
A population of TCLs can be coordinated to achieve group objectives. In such examples, each TCL controller privately observes user preferences and determines control actions based on market energy prices to maximize the individual utility, while the coordinator designs the bidding and market clearing strategy to efficiently allocate energy to users, subject to a feeder power constraint. Mechanisms are disclosed to elicit private information from individual users for collective decision making. In some examples, the proposed mechanism corresponds to a dominant strategy equilibrium, which enables the controllers to determine the bidding price without knowing other controllers' actions. A realistic bidding strategy is disclosed to simplify the bidding structure and reduce communication overhead. Disclosed technologies enable coordination of groups of TCLs for demand response with a systematic consideration of various practical factors, such as heterogeneous load dynamics, private information, communication resources, and other factors.
Full knowledge of system state and ETP model parameters are difficult to obtain in practice (e.g., because the mass temperature cannot practically be measured, and only some rough statistical information about the model parameters is available). In some examples, the output measurements of TCL devices can be used to estimate system state and/or to identify model parameters. For instance, if the model parameter is available, then the system state can be estimated using recursive Bayes filter (e.g., a Kalman filter); when the system state is known or measurable, the model parameters can be recovered using the system identification methods.
In some examples of the disclosed technology, an expectation maximization approach allows the bidding strategy of each TCL to depend only on its own online measurements (such as air temperature and “on/off” state) enabling implementation of suitable bidding strategies.
Demand response is a technique that can be applied to improve the efficiency and reliability of future smart power grids. Demand response can be implemented using various pricing schemes, such as Real Time Pricing (RTP), Time of Use (TOU) and Critical Peak Pricing (CPP). A time-varying price structure incentivizes users to shift demand from high price periods to low price periods with reduced electricity expenditures. Price-based demand response can achieve results in terms of payment reduction, load shifting, and/or power saving. However, price-based demand response approaches typically directly pass the wholesale energy price to end-users or modifies the wholesale price in a heuristic way. In some examples of the disclosed technology predictable and reliable aggregated response can be used to implement demand response techniques including power capping, load following, frequency regulation, among others.
In certain examples of the disclosed technology, a population of TCLs is coordinated to achieve certain group objectives. In the considered example scenarios, self-interested users with private information determine control actions based on the energy prices to maximize individual utility, while the coordinator designs the bidding and clearing rules to efficiently allocate energy to individual users subject to a peak power constraint. In some examples, a systematic consideration of various practical factors, such as heterogeneous load dynamics, private information of individual users, unknown parameters of the load model, communication resources for the information exchange are employed.
There are several challenges that can be addressed in certain examples of the disclosed technology. For example, user utilities are private information, making it challenging for the market coordinator to achieve group objectives while respecting individual user load control authorities. Further, it is often desirable to avoid multiple communication iterations between load control agents and the market coordinator. Furthermore, accurate load models do not always have known model parameters.
In certain examples of the disclosed technology, a market-based coordination framework is used to control residential air conditioning loads with a systematic consideration of one or more of the aforementioned factors. In certain examples of the disclosed framework, each TCL device is equipped with a control module that collects user energy use preferences, computes a bid based on the temperature measurement, and determines a control setpoint according to the cleared market price, while the market coordinator determines an energy price selected to achieve group objectives. A mechanism is disclosed to elicit private information for collective decision making. In some examples, this results in a dominant strategy equilibrium that enables each user to determine the optimal bid without knowing information about other loads within the control framework. Thus, certain examples of the disclosed technology allow for coordination of a group of TCLs for demand response with limited communication resources.
In some examples of the disclosed technology, the framework is configured to maximize social welfare subject to a feeder power constraint. The framework allows a market coordinator to affect the aggregated power of a group of dynamical systems, and creates an interactive market where the users and the coordinator cooperatively determine the optimal energy allocation and energy price. An optimal pricing strategy is derived, which maximizes social welfare while respecting the feeder power constraint. The bidding strategy is also designed to compute the optimal price in real time based on local device information. Numerical simulations based on realistic price and model data are performed. The simulation results demonstrate that the proposed approach can effectively maximize the social welfare and reduce power congestion at key times.
In one embodiment, an optimal pricing strategy is proposed, which maximizes the social welfare while respecting the feeder constraint. A bidding strategy is also proposed to enable the numerical computation of the optimal price. In some examples of the disclosed technology, certain advantages can be realized. First, the proposed pricing strategy has been proven optimality, where social welfare can be maximized while the feeder power constraint is respected. Second, certain disclosed proposed bidding strategies provide the market with a minimum amount of information that is sufficient for the market to implement an optimal pricing strategy. This enables implementation of the disclosed framework in real time (e.g., by clearing a market using one bid from each load in a 5-minute period).
The disclosed technology provides a foundation for a fully dynamic version of market-based control of Thermostatically Controlled Loads to maximize the social welfare over multiple control periods. In such cases, all loads bid a price vector for the entire planning horizon, and the market is cleared with all prices for the subsequent periods within the horizon. Thus, social welfare can be maximized for multiple periods, and shape the power consumption, thereby flattening the power curve.
Disclosed bidding strategies enables a market coordinator to estimate aggregate power demand in response to market prices more accurately. Therefore, given a power trajectory, the market coordinator can determine the cleared price to coordinate loads to match the power trajectory reference in real time.
In certain examples of the disclosed technology, a group of TCLs are coordinated by a market coordinator with price incentives to limit aggregated power demand and improve system efficiency. Each device adjusts its temperature setpoint control in response to the energy price to maximize individual utility. The change on the setpoint control will then modify the system dynamics and affect the system state, on which the generated bid is based. According to the received load bids, the coordinator clears the market with a price for the next cycle to maximize social welfare subject to a feeder power constraint. A systematic mathematical framework is provided for the analysis and design of this kind of market-based coordination of responsive loads with nontrivial dynamics.
In some examples of the disclosed technology, a market-based coordination framework includes a coordinator that coordinates a group of autonomous TCLs to achieve system-level objectives with price incentives. In some examples, adapting the technology to TCLs allows incorporation of more realistic load dynamics into a market-based coordination framework. In some examples, the framework allows for the users to indicate their preferences regarding how TCL temperature setpoints respond to market clearing price(s). In this way, an interactive market is created for the coordinator and the users cooperatively determine energy allocation in a decentralized manner. In some examples, an optimal price is found to align individual optimality and social optimality. This property does not hold in general when the feeder power constraint is imposed on the system. In some examples, the devices can only bid once during each market clearing cycle. Thus, multiple iterations between the load controllers and market coordinator for each market clearing cycle, which demands considerable communication and computational resources, can be avoided.
Optimal pricing strategies are disclosed, which maximize the social welfare of the system, subject to a feeder power constraint. Load device bidding strategies are also presented to compute the optimal price numerically in real time while respecting the computational and communication constraints of the system. The effectiveness of the disclosed technology is demonstrated via a number of simulations based on realistic models of residential air conditioning loads. Disclosed frameworks can effectively cap the aggregated power below the feeder capacity and thus maximize the social welfare.
A diagram 100 illustrating an example of a possible network topology for an environment implementing coordination of thermostatically controlled loads (TCLs) according to the disclosed technology is depicted in
The power grid 120 includes transmission lines 125 that carry power from the energy sources 110 to a number of loads, including thermostatically-controlled loads. Energy consumers with suitable TCLs for deploying in the illustrated environment include residential consumers, including residential consumers 130 and 140, industrial consumers, such as industrial consumer 150, and commercial consumers, such as commercial consumer 160. Each of the associated consumers 130, 140, 150, and 160 is associated with one or more thermostatically-controlled loads. For example, the residential consumer 130 has three thermostatically-controlled loads (TCL) 131-133. Further, as shown, each of the TCLs 131-133 is coupled to a controller 136-138, respectively. Each of the controllers 136-138 can submit bids and receive clearing prices via a bid aggregator 135, and actuate their respective coupled TCLs 131-133 (e.g., by turning the associated load on or off by activating/deactivating the load). Additionally, residential consumer 140 has a number of TCLs 141-143 that are coupled to a single controller 146. The controller 146 can submit bids, receiving clearing prices, and actuate any of the coupled TCLs 141-143. Industrial consumer 150 has a number of TCLs (e.g., TCL 151) (controller(s) and any bid aggregator(s) being omitted from
Each of the TCLs is coupled to a controller that is operable to submit data to and receive data from other components via a computer network 170. In some examples, a number of TCLs associated with a single consumer can have data aggregated and bids submitted together using a bid aggregator (e.g., bid aggregator 135). In some examples, one or more of the TCL controllers are implemented using a microcontroller, memory, and suitable input/output resources for receiving signals carrying sensor data local to the TCL and controlling the coupled TCL (e.g., by actuating motors and other components of a respective TCL). In other examples, TCL controllers can be implemented using programmable logic or a general-purpose computer configured to receiving signals carrying signal data and generate signals for controlling the coupled TCL.
Each of the TCLs can be coupled to, for example, computing devices having computer hardware that run software or is otherwise configured to communicate with other computing devices accessible by the network 170. In other examples, the TCLs send data to other computing devices associated with one or more of the energy consumers. Each of the controllers coupled to and/or associated with the TCLs, a market coordinator 180, and a power generation market administrator 190 can have computer architecture(s) similar to those illustrated in
As shown in
In some examples of the disclosed technology, TCL controllers are configured to estimate a set of values for one or more unmeasured parameters of the thermal environment based at least in part on a plurality of output measurements of the thermal environment and determine an energy response based on the estimated set of values for the unmeasured parameters. The TCLs are further configured to determine an energy response relating price data for one or more energy prices to quantity data for power to be consumed by the associated TCL and to send bids for power for a finite time period based on the energy response to the market coordinator 180.
In some examples, each of the TCLs submits a single bid to the mark coordinator 180 for each finite time period. In other examples, additional bids are submitted in an iterative process. The market coordinator 180 in turn aggregates bids from a number of energy consumers participating in the market for the finite time period, and calculates a clearing price. The clearing price is transmitted from the market coordinator 180 to each of the energy consumers. The energy consumers respond to the clearing price by, for example, actuating their associated loads to activate or de-activate, thereby consuming, or not consuming, respectively, energy from the power grid according to the clearing price. For example, if an energy consumer did not bid a sufficient price to be allocated energy by the market coordinator, that consumer, a controller associated with the TCL, will not activate the device. Conversely, if a bid submitted for an associated TCL was sufficient to receive power, the controller can activate the associated thermostatically-controlled load. While the examples disclosed herein respond to the clearing price by either activating or de-activating the load, in other examples a finer-grained response of the loads can be performed (e.g., by consuming a portion of the loads maximum energy consumption). In some examples, the market coordinator 180 itself sends signals to activate or de-activate the loads, accordingly.
It should be noted that in some examples, individual TCLs associated with an energy consumer can submit different price and/or quantity values in their bid to the market coordinator 180 and thus, in certain instances, only a subset of TCLs associated with a particular TCL will be activated or de-activated according to the clearing price. As will be more fully explained below, this process can be repeated at fixed intervals (e.g., intervals of one hour or less, intervals of ten minutes or less, or intervals of five minutes or less).
In the illustrated example of
The various possible roles and functionalities of the TCLs, market coordinator 180, and power generation market administrator 190 will be described in more detail in the following sections.
The computing environment 200 is not intended to suggest any limitation as to scope of use or functionality of the technology, as the technology may be implemented in diverse general-purpose or special-purpose computing environments. For example, the disclosed technology may be implemented with other computer system configurations, including hand held devices, multiprocessor systems, microprocessor-based or programmable consumer electronics, network PCs, minicomputers, mainframe computers, and the like. The disclosed technology may also be practiced in distributed computing environments where tasks are performed by remote processing devices that are linked through a communications network. In a distributed computing environment, program modules may be located in both local and remote memory storage devices.
With reference to
The storage 240 may be removable or non-removable, and includes magnetic disks, magnetic tapes or cassettes, CD-ROMs, CD-RWs, DVDs, or any other medium which can be used to store information and that can be accessed within the computing environment 200. The storage 240 stores instructions for the software 280, plugin data, and messages, which can be used to implement technologies described herein.
The input device(s) 250 may be a touch input device, such as a keyboard, keypad, mouse, touch screen display, pen, or trackball, a voice input device, a scanning device, or another device, that provides input to the computing environment 200. For audio, the input device(s) 250 may be a sound card or similar device that accepts audio input in analog or digital form, or a CD-ROM reader that provides audio samples to the computing environment 200. The input device(s) 250 can also include sensors and other suitable transducers for generating data about the environment such as room temperature, humidity, and status information for one or more TCLs (e.g., TCL 265). The output device(s) 260 may be a display, printer, speaker, CD-writer, or another device that provides output from the computing environment 200. The output device(s) 260 can also include interface circuitry for sending actuating commands to the TCLs, (e.g., TCL 265), for example, to activiate or deactivate actuators (e.g., motors, solenoids, hydraulic actuators, pneumatic actuators, etc.) the TCL, or to request sensor or other data from the TCL.
The communication connection(s) 270 enable communication over a communication medium (e.g., a connecting network) to another computing entity. The communication medium conveys information such as computer-executable instructions, compressed graphics information, video, or other data in a modulated data signal. The communication connection(s) 270 are not limited to wired connections (e.g., megabit or gigabit Ethernet, Infiniband, Fibre Channel over electrical or fiber optic connections) but also include wireless technologies (e.g., RF connections via Bluetooth, WiFi (IEEE 802.11a/b/n), WiMax, cellular, satellite, laser, infrared) and other suitable communication connections for providing a network connection for the disclosed controllers and coordinators. Both wired and wireless connections can be implemented using a network adapter. In a virtual host environment, the communication(s) connections can be a virtualized network connection provided by the virtual host. In some examples, the communication connection(s) 270 are used to supplement, or in lieu of, the input device(s) 250 and/or output device(s) 260 in order to communicate with the TCLs and/or sensors.
Some embodiments of the disclosed methods can be performed using computer-executable instructions implementing all or a portion of the disclosed technology in a computing cloud 290. For example, data acquisition and TCL actuation can be performed in the computing environment while computing energy response functions or bid generation can be performed on servers located in the computing cloud 290.
Computer-readable media are any available media that can be accessed within a computing environment 200. By way of example, and not limitation, with the computing environment 200, computer-readable media include memory 220 and/or storage 240. As should be readily understood, the term computer-readable storage media includes the media for data storage such as memory 220 and storage 240, and not transmission media such as modulated data signals.
As shown in
Apparatus and methods are disclosed for implementing market-based coordination frameworks for TCLs. In an exemplary embodiment, a market coordinator procures energy from the wholesale market and manages N users to maximize the social welfare subject to a feeder power constraint. C(α) represents the cost for the coordinator to procure α unit of energy from the wholesale market. The unit price can then be the Locational Marginal Price plus some additional charge for using the distribution network. For ease of explanation for this example, it is assumed that C is convex and continuously differentiable.
The market is cleared (e.g., by the market coordinator) every T units of time. At the beginning of each time cycle, each of the local devices receives the energy price and make control decisions to maximize its individual utility. This control decision affects the load dynamics and state, and in turn influences user bidding for the next market clearing cycle. After collecting all the device bids, the coordinator determines the price such that the social welfare is maximized, and the aggregated power does not exceed the feeder power constraint. As used herein, the aggregated power is the average power consumed during each market cycle.
At process block 410, a number of bids are received for a plurality of energy loads that were generated based, at least in part on, estimating a set of values for one or more unmeasured parameters of the thermal environment of the respective energy loads and determining an energy response relating price and quantity for a load based at least in part on the estimated unmeasured parameters. For example, each of a number of thermostatically-controlled loads (TCLs) can send bids expressed as a single price quantity pair, or as a plurality of two or more price quantity pairs. In other examples, more complex expressions of bids are received. Each of the bids can be associated with a single TCL, or represent an aggregated bid for two or more TCLs. Each of the bids for those TCLs participating in the market are for a finite time duration (e.g., five minutes). Once bids have been received for the associated TCLs, the method proceeds to process block 420. In addition to collecting the bids, the coordinator can also calculate the uncontrolled power load Quc and the power feeder constraint Olim for the finite time period.
At process block 420, a demand curve is produced by ordering the bids received at process block 410 according to their respective prices. For example, a market coordinator can order the received bids in a decreasing sequence from the highest bid to the lowest bid. Thus, energy can be allocated to TCLs associated with higher bids at a higher priority than TCLs associated with lower bids.
Before each market clearing cycle, the market coordinator collects all the bids from the devices, and orders the bidding price in a decreasing sequence Pbid1, . . . , Qbid1, where N denotes the number of users. With the price sequence and the associated bidding power sequence Qbid1, . . . , QbidN, the coordinator can construct the demand curve that maps the market energy price to the aggregated power. Using the demand curve 510, the coordinator can clear the market and determine the energy price to ensure that the aggregated power does not exceed the feeder capacity: if the total power demand (the total quantity of power bid) is less than the feeder power constraint, then the clearing price is equal to the base price Pbase (see
Once the demand curve has been produced, the method proceeds to process block 430.
At process block 430, the total power demanded according to all of the bids received is compared to a feeder power constraint. The feeder power constraint indicates the maximum amount of energy that can be provided by energy producers for the upcoming time period that the bids were based on. For example, based on power generation availability, transmission line conditions, and/or regulatory constraints, the feeder power constraint can be determined. If the total power demand is less than the amount of energy according to the feeder power constraint, the method proceeds to process block 440. Conversely, if the total power demand is greater than the feeder power constraint, the method proceeds to process block 450. Comparison of the total power demand to the feeder power constraint can be formed using the market coordinator 180 and the power generation market administrator 190 described above regarding
At process block 440, because the total power demand was less than the feeder power constraint, the clearing price is set to the wholesale market price. In the depicted embodiment, the clearing price is set to the same value for all bidders in the market for the current finite time period.
On the other hand, if it was determined that the total power demand is greater than the feeder power constraint, then method proceeds to process block 450 and the clearing price is set based on the intersection of the total power demand and the feeder power constraint. In the depicted embodiment, the clearing price is set to the same value for all bidders in the market for the current finite time period. In other examples, the clearing price could vary depending on the individual bid received from each of the TCLs.
An example of demand curve associated with this situation is illustrated in a chart 700 shown in
At process block 460, the clearing price is sent to controllers associated with TCLs that bid in the market. The clearing price can be sent, for example, using a computer network such as computer network 170 from the market coordinator 180 to each of the TCLs. After sending the clearing price to the TCLs, the method proceeds to process block 470.
At process block 470, the controllers associated with each of the bidding TCLs operate their loads according to the clearing price. In other words, if the associated TCL sent a bid greater than or equal to the clearing price, then the associated TCL is allowed to consume the amount of energy that was bid during the market session. For associated TCLs that did not submit a sufficient bid, and did not receive power because the total power demand was greater than the feeder power constraint, the associated controllers will de-activate the associated TCL.
The operations described above in the flowchart 400 can be performed repeatedly for each finite time period. For example, bidding for a next one (or more than one) time period can begin during the time period in which the TCL is operating according to a previously-cleared bid. In some examples, the sequence of bidding and price clearing is performed only once for each finite time period. Thus, communication and processing overhead associated with repeated bidding can be avoided.
In some examples of the method outlined in
The example method of market coordination elaborated in
At process block 810, a set of values for one or more unmeasured parameters of the thermal environment coupled to a TCL are estimated based at least in part on a plurality of output measurements of the thermal environment. For example, certain parameters that affect the demand in the load environment (e.g., inner mass temperature, heat gain/loss from exterior factors, etc.) are not typically measurable. An example method of estimating unmeasured parameters for the TCL are discussed in further detail below regarding
At process block 820, a controller coupled to a TCL determines an energy response relating price data to quantity data for energy or power to be consumed by the associated load. In some examples, the energy response is modeled using a function based on at least one or more of the following: a consumption state of the load (e.g., whether the load is currently operating or not), an air temperature (e.g., the current room temperature), and/or an inner mass temperature (e.g., the current temperature of solid objects within a region being heated or cooled by the associated load). In some examples, one or more of the variables on which the energy response function is based are estimated. For example, the inner mass temperature may not be available by data from a sensor. Other parameters of the energy response function can be provided by a sensor, for example the consumption state of the load, or the current room temperature. In other examples, other variables can be included in the energy response, including time of day, humidity, weather, outside air temperature, solar gain, or other suitable parameters.
In some examples, the energy response is a function of a user response parameter where the user response parameter relates an energy price and a user-selected comfort level. For example, the user can indicate a desired comfort level in relation to price using a controller, such as the controller 370 discussed above regarding
In some examples, the energy response is based, at least in part, on an equivalent thermal parameter model and a control policy indicating one or more power states for the load. These parameters are discussed further below. In some examples, the bid includes exactly one price and one corresponding quantity. In other examples, the bid includes two or more prices and two or more respectively corresponding quantities. In some examples, the bid is based on a point between two vectors that relate a state of the load, a model parameter, and a user input parameter. In some examples, the bid is based, at least in part, on an equivalent thermal parameter model. In some examples, the finite time period is less than one hour. In some examples, the finite time period is approximately five minutes or ten minutes.
A further detailed explanation of determination of exemplary energy consumption functions and energy response curves are discussed below and illustrated in
After determining an energy response, data representing the energy response is encoded and the method proceeds to process block 820.
At process block 830, the energy bid for a finite time period T based on the energy response data generated at process block 820 is sent to a market coordinator. For example, the energy bid data can be sent using a wired computer network, a wireless computer network, satellite or radio communication, or other suitable technologies for sending data to the market coordinator. After sending the energy bid to the market coordinator, the method proceeds to process block 840.
At process block 840, a clearing price is received from the market coordinator. The clearing price is based, at least in part, on the bid submitted by the controller at process block 830 in combination with energy bids that were submitted by other energy consumers associated with the same market coordinator. After receiving the clearing price for the associated time period T, the method compares the clearing price to the bid price. If the clearing price is less than the bid price, then the method proceeds to process block 850. If, on the other hand, the clearing price is greater than the bidding price sent at process block 830, then the method proceeds to process block 860.
At process block 850, one or more load devices associated with the controller and the successful energy bid (e.g., a clearing price that was less than or equal to the bidding price) is used to activate the associated load device for the finite time period T. In some examples, the load device is activated at a finer granularity than on/off. The load device is then permitted to consume its bid amount of energy for the corresponding time period.
At process block 860, the controller associated with the load uses an actuator to de-activate the load device for the finite time period T associated with the bid sent at process block 830. Thus, because the bid sent at process block 830 was insufficient, the load device remains idle for the finite time period. After de-activating the load device, the method proceeds to process block 870.
At process block 870, additional input data is gathered (e.g., room temperature, user preferences, inner mass temperatures, and other suitable data). The input data gathered is to be used for determining an energy response for the next time period. Once sufficient input data is gathered, the method proceeds back to process block 810 in order to determine a second energy response for a second finite time period.
Further details regarding suitable techniques and apparatus for determining pricing and bidding strategies are discussed below in Part A.
A. Introduction to Disclosed Consumer Load Pricing and Bidding Strategies
In some examples of the disclosed technology, an example pricing bidding strategy is present as a coordination problem for a group of TCLs, where the coordinator allocates energy to users to maximize social welfare subject to a feeder power constraint. The details of the examples in this Section can be implemented in modified versions of the method outlined and discussed above regarding
The remainder of this section provides formal mathematical descriptions of the components of a framework that can be used to implement the disclosed technologies.
B. Example Formulation of an Optimization Problem for Application of User Preferences and Utility
In the example formulation, assume that there are N self-interested users. Each user is to determine the temperature setpoint to obtain an energy allocation that maximizes each respective individual's utility (e.g., the user's comfort minus the electricity cost). In other words, each user is confronted with the tradeoff between comfort and electricity cost: when the electricity price is high, the load controller will adjust the temperature setpoint to save electricity cost at the sacrifice of some user comfort. Formally, a function Vi(ai): → can be used to represent the comfort level for each user with allocation ai. Assume that Vi(ai): → is concave, continuously differentiable, Vi(0)=0, and Vi′(0)>0. Let θi(tk) represent the private information of user i. Denote Eim as the energy consumption for the ith load if it is “on” during the entire period, i.e., ai<Eim. The individual utility maximization problem can be formulated as follows:
0≦ai≦Eim (1)
where Pc is the energy price. Let hi: → be the optimal solution to the optimization problem (1), produces:
It can be verified that with the conditions imposed on V that hi is continuous and non-increasing with respect to Pc for each i=1, . . . , N. It will be readily apparent to one of ordinary skill in the art that in the example formulation, the user cannot directly choose his optimal energy allocation. Instead, in this example, the user can only determine the temperature setpoint, which affects the energy consumption through the load dynamics.
C. Example Determination of Individual Load Dynamics
Let ηi(t)εn be the continuous state of the ith load. Denote qi(t) as the “on/off” state: qi(t)=0 when the load (e.g., a TCL) is off, and qi(t)=1 when it is on. The system dynamics can be represented by ƒoni and ƒoffi: n→n as follows:
where θim is the model parameter, and the state vector ηi(t) includes the air temperature (Tci(t)) and any other suitable measurable parameters.
The power state of the TCL can be regulated by a hysteretic controller based on the control deadband [ui(t)−δ/2, ui(t)+δ/2], where ui(t) is the temperature setpoint of the ith TCL and δ is the deadband. When the TCL controller is operating in the cooling mode, the load is turned off when Tci(t)≦ui(t)−δ/2, is turned on when Tci(t)≧ui(t)−δ/2, and remains in the same power state otherwise. This example hysteretic control policy can be described as:
For notational convenience, a hybrid state zi(t)=[ηi(t), qi(t)] is defined that includes of both the temperature and the “on/off” state of the load. Let [tk, tk+T] be the kth market clearing period, then the energy consumption of each load during the kth period depends on the system state and setpoint control ui(t). In this example, the private information consists of system state and model parameters. Therefore, the energy consumption of each load can be represented as ei(ui(tk), zi(tk), θim). This energy consumption function can be derived by calculating the portion of time that the system is on over the entire market period.
As shown, the associated TCL is an air conditioning unit that will consume a maximum amount of energy up to a temperature of about 71 degrees Fahrenheit and will then consume a lower amount of energy according to a slope up to about 73 degrees Fahrenheit. Thus, the temperature represents the thermostat set point associated with the air conditioner TCL.
Determinations of example energy consumption functions, as can be implemented in some examples of the disclosed technology, are discussed in further detail below.
For notational convenience, let θi(tk)=(zi(tk), θim) represent user's private information, then the energy function can be written as ei(ui(tk), θ(tk)). Notice that the private information for users is time varying, as it contains the system state.
After the market is cleared, each user attempts to determine the control action ui(tk) such that the resulting energy consumption equals the optimal solution to (1). Since the optimal control depends on the energy price, a user response function can be defined, φi: →with ui(tk)=φi(Pc). Therefore the optimal energy allocation function hi as defined in (2) should satisfy the following:
h
i(·;θi(tk)=ei(φi(•),θi(tk)) (5)
The left side of equation (5) represents the optimal energy allocation for a given price, while the right side arises from the physical property of the individual loads, and indicates that the user can specify the control action ui to match the actual energy consumption to the optimal allocation.
D. Example Determination of Clearing Price
The market coordinator purchases energy from the wholesale market at a cost denoted as C(Σi=1Nai):→. We assume that C(•) is differentiable and convex. The energy is then allocated to users to maximize the social welfare, which can be defined as:
Σi=1NVi(ai;θi(tk))−C(Σi=1Nai).
After the market price is determined, individual user will respond by adjusting the control setpoint, and there always exists a resulting energy allocation, a=(a1, . . . , aN). However, the inverse is not necessarily true: for a given energy allocation, the coordinator cannot always find a price to realize it.
As an example, consider two users with Vi(a1; θi(tk))=a1, V2(a2;θ2(tk))=3a2. The energy cost for the coordinator is C(a1+a2)=2ai+2a2. The group objective is to maximize the social welfare subject to a feeder power constraint, for example:
The optimal solution to Determination (6) is a1=0, a2=1. However, according to (1), given any energy price, ai is either 0 or 2. Therefore, the optimal energy allocation cannot be achieved.
To address this concern, the concept of realizable energy allocation can be applied as follows.
An energy allocation vector, a=(a1, . . . , aN), can be realized if there exists a price Pc, such that ai=hi (Pc; θi, (tk)) for all i=1, . . . , N. In this case, Pc realizes the energy allocation a in the kth market period.
With the above energy allocation vector, a set k can be defined to denote all the realizable energy allocation:
k
={a|∃P
c
,s.t.a
i
=h
i(Pc;θi(tk)),∀i=1, . . . ,N} (7)
Thus, an optimal energy allocation is in this set, and therefore there exists a price to realize the allocation. Analysis of such an allocation can be formally described as follows:
Design a bidding and clearing strategy, such that the cleared price realizes the allocation a* that maximizes the social welfare subject to a feeder power constraint, e.g., a* is the solution to the following optimization problem:
subject to:
It should be noted that Example Problem 1 is a convex optimization problem. However, due to limited communication resources, privacy issues, and unknown load parameters in real-world environments, it is infeasible for the market coordinator to have all the global information available in real time. Thus, the market coordinator can motivate self-interested individual users to submit useful information for the collective decision-making by applying targeted market rules.
The example feeder power constraint couples individual allocations through an inequality. In such cases, a price does not always exist to realize the optimal energy allocation taking into account the feeder power constraint. The set k can be used to ensure that the solution to the optimization problem (8), as stated above, is realized.
The exemplary proposed framework is different from the wholesale energy market, at least in that it incorporates the internal dynamics of TCLs into the decision-making. The clearing energy price triggers changes on setpoint control, which in turn modifies the system dynamics and affects the power consumption. In this manner, the load dynamics become important for the overall price response.
E. Example Aggregation of User Preferences
An example proper bidding and pricing strategy is presented, such that when each user (e.g., TCL consumers) selfishly attempts to maximize individual utility, the resulting outcome can also achieve a desired group objectives (e.g., by attempting to maximize social welfare).
Let xεX be the outcome of the mechanism that consists of the energy allocation and the energy price, e.g., x=(a1, . . . , aN, Pc). The utility of each user (comfort minus electricity cost) depends on the outcome. Moreover, it is assumed that at time tk, each user can privately observe his utility, Ui over different outcomes. In other words, the example model assumes that user i privately observes a parameter θi that determines its local utility. It should be noted that for ease of explanation, the dependence of θi on tk is omitted throughout the rest of this disclosure for notational convenience. In the exemplary mechanism design, θiεΘi, is referred to as the user i's type, where Θi denotes the set of all the possible types. The user type contains the system state, zi(tk), and the model parameter, θim, in particular:
U
i(x;θi)=Vi(ai;θi)−Pcai (9)
where θi=[zi(tk), θim].
As the user preferences are private, to determine the optimal energy price, the market coordinator requests that each user submit a bid to reveal at least a portion of the private information. Formally, this can be formulated as a message space M=M1× . . . ×MN, where Mi denotes the space of messages (bids) the ith user can communicate to the coordinator. The structure of Mi depends on particular applications. In some examples each load device controller submits a price and a quantity, such that (Pbidi, (Abidi)εMi. In other examples, each load device controller submits the slope of a demand curve, βi, in which case βiεMi. After collecting the user bids, the market is cleared with an energy price and a corresponding energy allocation. The clearing strategy can be represented by an outcome function, g:M→X, that maps the bids to an outcome, x. The message space and the outcome function together can fully characterize the rules governing the procedure for making the collective choice, and this is typically referred to as a mechanism:
A mechanism Γ=(M1, . . . , MN, g(•)) is a collection of N message spaces (M1, . . . , MN) and an outcome function, g:M→X.
Each controller observes θi privately and determines what to bid to maximize local load utility. This process can be represented by a bidding strategy mi: Θi→Mi that maps the type to a message. A number of different solution concepts for the mechanism can be applied, The present explanation applies a dominant strategy equilibrium as follows, although other suitable solution concepts can be applied, such as Nash equilibrium, Bayesian Nash equilibrium, etc. Denote m−i, as a collection of strategies of all the users other than i, then an example dominant strategy equilibrium can be defined as follows:
Example Dominant Strategy Equilibrium
Strategy profile (mi*(•), . . . , mN*(•)) is a dominant strategy equilibrium of mechanism Γ=(M1, . . . , MN,g(•) if for all i and all θiεΘi, Ui(g(m*i (θi),m−i), θi)≧Ui(g(m′i(θi),m−i),θi) for all m′i(θi)εMi and all m−iεM−i.
For Nash equilibrium, each agent plays the equilibrium strategy only when the agent has a correct forecast of the actions of other agents. When such knowledge is unavailable, it typically takes multiple iterations of information exchanges between the agents and coordinator to elicit the equilibrium strategy of the game. In contrast, the dominant strategy equilibrium discussed above is typically robust, as a rational agent always follows the equilibrium strategy regardless of other agent's actions. In other words, even when the load controller does not have knowledge regarding the actions of other loads, the controller still plays the equilibrium strategy. This enables each controller to only bid once at each market period, which can significantly reduce the communication overhead in certain examples of the disclosed technology.
The equilibrium strategy characterizes the individual′ TCL's self-interested behavior: each TCL is controlled to maximize its associated consumer's individual welfare. However, for the market coordinator, it may be more desirable to find the best choice for the overall social welfare. For this reason, a social choice function ƒ: Θ→X can be defined to represent the desired social outcome of the coordinator. More specifically, ƒ(•) determines what outcome will be chosen by the coordinator if had access to all the consumer private information. In our problem, ƒ consists of the optimal energy allocation to optimization problem (8) and the price that realizes this allocation. If we define θ=(θ1, . . . , θN), the conflict between the personal interest and social interest can be captured by the concept of the implementation described by Example Definition 4:
A mechanism Γ=(M1, . . . , MN, g(•)) implements the social choice function ƒ(•) in dominant strategies if there exists a dominant strategy equilibrium m*(•) of Γ, such that g(m1*(θ1), . . . , mN*(θN))=ƒ(θ) for all θεΘ.
In the above definition, g(m1*(θ1), . . . , mN*(θN)) represents the resulting outcome of individual maximization, while ƒ(θ) denotes the desired social outcome. The concept of implementation characterizes the social choice that can be realized when all the users take actions to selfishly maximize the individual utility. To this end, Problem 1 can be equivalently stated as the following Problem 2:
Problem 2:
Design a mechanism to implement the social choice function ƒ(•) that maximizes the social welfare subject to a feeder power constraint, ƒ(θ), is the solution to the optimization problem (8).
The design of a mechanism includes specifying the message space and the outcome function for each user. In the mechanism design problem, the coordinator needs to design the message space and the market clearing rule such that the optimal social welfare can be implemented when each user selfishly maximizes the individual utility, and very importantly, the feeder power constraint needs to be respected. Notice that although a power capping problem is considered in this paper, the proposed framework can be easily adapted to other demand response application, such as load following and frequency regulation.
F. Constructing the Example Mechanism Γ
Let ƒ*(θ)=(a1*, . . . , aN*, Pc*) be the social choice function that maximizes the social welfare subject to the feeder power constraint. Specifically, (a1*, . . . , aN*) is the optimal solution to (8), and ƒ*(θ) satisfies the following condition:
a*
i
=h
i(Pc*;θi(tk)),∀i=1, . . . ,N (10)
This subsection constructs a mechanism to implement ƒ*(•).
Consider a mechanism Γ*, where each device is asked to submit function hi(•;θi). Due to the assumption on the convexity of Vi, it can be verified from (1) that the curve hi(Pc;θi) is non-increasing with respect to Pc. In this case, the message space is the function space of all possible hi (non-increasing functions). It should be noted that the user's actual bids may deviate from function hi, unless they are motivated to bid hi. Let bi(•;θi) be a non-increasing function that represents the user's actual bid. The aggregated demand curve b(•;θ) can be obtained by adding individual bidding functions, i.e., b(•;θ)=Σi=1Nbi(•;θi). In this example mechanism, each user is required to submit a function, which requires considerable communication resources. This bidding strategy will be simplified in the next subsection to reduce the communication overhead.
The following outcome function g(b1, . . . , bN)=(a1*, . . . , aN*, Pc) can then be used to clear the market:
where C′ represents the derivative of the cost function C(•), and the market price Pc is the smaller of P* and
The following discussing illustrates some properties of the mechanism discussed above.
Proposition 1:
When each user is a price taker, the strategy profile (h1(•;θ1), . . . , hN(•;θN)) is a dominant strategy equilibrium of the proposed mechanism Γ*.
Proof:
It suffices to prove that submitting hi maximizes the individual utility, which is defined as Vi(ai;θi)−Pcai. It should be noted that Pc is not affected by the bids under the price taker assumption. Therefore, if suffices to prove that bidding hi results in an energy allocation, ai*, that satisfies ai*=arg max0≦ai≦Eim [Vi(ai;θi)−Pcai]. In other words, solve for ai*=hi(Pc;θi). According to (11), if we bi=hi, then the proof is complete.
Remark 4:
The result of Proposition 1 only holds when there are a large population of users such that the influence of an individual user on the market price is negligible. In other cases (such as an oligopolistic market), the mechanism needs to be designed differently. In addition, the price taker assumption does not indicate that individual can not affect his utility via his bid. Instead, it only suggests that the bid from an individual consumer does not affect the market price Pc. Notice that the utility function not only depends on Pc but also depends on ai. Under the price taker assumption, the consumer can still affect ai via its bid, which further determines the consumer utility.
In the proposed mechanism, the optimal bid of each user does not depend on the bidding decisions of others. Therefore, if the bidding decision of one user has to depend on the action of another, then the equilibrium strategy cannot be achieved unless all the users have accurate predictions on other user's action, which may not be a reasonable assumption. As a result, we can establish the following property of the example mechanism:
Proposition 2:
The proposed mechanism Γ* implements the social choice function ƒ*(•).
The proposed pricing strategy maximizes the social welfare subject to the feeder power constraint. Roughly speaking, one can view the result as clearing the market at the intersection point of the market demand curve and the market supply curve (the feeder power constraint can be regarded as the case of limited supply).
G. Example Bidding Strategy
The proposed mechanism provides a general solution to the coordination problem formulated in this paper. In real-world applications, directly submitting function hi requires considerable communication resources, and might impinge on customer privacy. To simplify the message space and reduce the communication overhead, we consider the thermal dynamics of TCLs, which can be captured by the second-order Equivalent Thermal Parameter (ETP) model:
where n/(t) consists of the air temperature and the inner mass temperature, q/(t) is the “on/off” state of the TCL, and the model parameters include Ai, Boni and Boffi, i.e., θim=[Ai, Boni, Boffi]T. The optimal energy allocation function, hi, corresponding to the ETP model is shown in
Remark 5:
The ETP model uses ordinary differential equations to describe the thermal behaviors of the buildings, and is widely used in the literature to characterize the energy performance of the TCLs. It can be categorized in two forms: first-order ETP model and second-order ETP model. The first-order model only considers the air temperature, and can be regarded as a special case of the second-order model, where the inner mass temperature is also taken into account. While the first-order ETP model is appropriate for small TCLs, such as refrigerators, it is not appropriate for HVAC systems, which have a large heat capacity due to furnishing and building material.
Due to the complicated nature of the hybrid system dynamics, directly submitting the function hi may require considerable communication resources in the real time implementation. To reduce the message space, hi is approximated with a step function as illustrated in
where L=[1,0], where Tci(tk) is the current air temperature and δ is the control deadband, and the power state of the ith TCL is “on” at tk.
The step function (in
In some examples, the bidding strategy assumes complete knowledge of ETP model parameters. In practice these parameters can be difficult to derive, and the users can only bid based on local measurements (air temperature, “on/off” state). In addition, the ETP model used in the framework may be inaccurate in terms of characterizing the real energy consumption of the TCLs. To address these challenges, we present a joint state and parameter estimation framework is discussed below that enables users to compute bidding prices according to the disclosed technology, even when there is incomplete knowledge of all ETP model parameters.
It can be observed that the bidding strategy discussed above assumes full knowledge of the system state and model parameters. However, in many examples the TCL controller can only measure the air temperature, while the inner mass temperature and the model parameters (Ai, Boni, and Boffi) are not available. To address this issue, methods are disclosed to estimate the unknown system state and model parameters based on the output measurements.
At process block 1210, an initial selection for the model parameters, void is made. In some examples, the initial selection is based on default values stored in a computer-readable storage medium coupled to a TCL controller. The default values can be programmed when the controller is manufactured, as part of a firmware or software update, or updating from previous iterations of performing the illustrated methods.
Assume that we have some rough statistical information about the model parameters. Specifically, the system dynamics can be captured with an uncertain discrete dynamic model with Gaussian noise:
where L=[1,0], and yi(tk) is the output measurement (air temperature). Let ηj(t1)=m0i+μi be the initial state (μ1 is the noise). It can be assumed that all the noise terms follow the Gaussian distributions:
Let σi=[Āi,
After performing an initial estimation of values for a set of model parameters, the method proceeds to process block 1220.
A. Example Determining of a Distribution for System State Vector
At process block 1220, in a first iterative stage (also dubbed “E-step”), a posterior distribution of the state p(Zi|Yi, σold) is evaluated assuming that all the parameters are known, where Zi=(ηi(t1), . . . , ηi(tM). In the second stage (dubbed “M-step”), discussed in further detail below regarding process block 1230, the derived posterior distribution is used to find updates of the estimated parameter σi selected to substantially maximize the expectation of the logarithm of the complete-data likelihood function, which is:
After the update for the parameter estimation is derived in the M step, the updated estimation is applied to σold and the method returns to E-step for another iteration. This technique is iterated until the estimations of the state and parameters converges. An example estimation algorithm for an ETP model can be summarized as follows:
An example technique for determining a distribution for a system state vector (E step) finds a distribution for the system state ηi(tk) conditioned on a full observation sequence, Yi=(yi(t1), . . . , yi(tM)), assuming that the model parameters are known as σold. This inference problem can be solved efficiently using the sum-product algorithm in two steps: first, the distribution of state ηi(tk) conditioned on a partial observation sequence (yi(t1), . . . , yi(tM)) can be derived with a Kalman filter; second, the conditional distribution p(ηi(tk)|Yi) can be found with a Kalman smoother.
Denote {circumflex over (γ)}(ηi(tk)) as the conditional distribution p(ηi(tk)|yi(t1), which satisfies:
{circumflex over (γ)}(ηi(tk))=(ηi(tk)|μk,Φk) (19)
In the context of linear-Gaussian systems, the sum-product algorithm gives the following recursion equations:
where
The initial conditions for the recursion equation is given by:
With the above recursion equations, the distribution for ηi(tk) can be derived conditioned on the observations from yi(t1) to yi(tk).
A probability distribution for ηi(tk) given all observations yi(t1) to yi(tM) can then be determined. Denote the conditional distribution p(ηi(tk)|Yi) as γ(ηi(tk)), which satisfies:
γ(ηi(tk))=(ηi(tk)|{circumflex over (μ)}k,Φk) (22)
The sum-product algorithm gives the following recursion equations:
With the recursion equation presented above, the conditional distribution p(ηi(tk)|Yi) can be computed using backward induction. After a distribution for a system state vector conditioned on the output measurements is determined, the method proceeds to process block 1230.
B. Updating Values for Model Parameters to Maximize Likelihood Function
At process block 1230, values for model parameters to maximize a likelihood function can be determined (M step) by attempting to select a parameter update that maximizes the logarithm of a complete-data likelihood function (19). Equation (19) indicates that aside from the conditional distribution p(ηi(tk)|Yi, θold) (which can be obtained by determining a distribution for a system state vector conditioned on the output measurements), the likelihood function also depends on a joint distribution p(Zi, Yi|σi), as given by:
where the dependence of the joint distribution on the unknown model parameters is made explicit. The complete-data likelihood function (σi, σold) can be then obtained by taking the expectation of (24) over Zi using the posterior distribution p(ηi(tk)|Yi, θold) derived in at process block ______.
Let Āi′,
(i) Maximizing Equation (19) over m0i and Φ0, the updates can be derived as:
(ii) Maximizing the likelihood function (19) over Āi, the update of Āi is given by:
(iii) Maximizing the likelihood function (19) over
where M1⊂{1, 2, . . . , M} denotes the time instants when the system is on, and on represents the size of M1. M2 and θon are defined similarly.
(iv) The update function for Ωi, can also be derived by maximizing the likelihood function with respect to Ωi, which gives:
(v) The update for Σi can also be obtained similarly:
In the above update equations, E[ηi(tk)(ηi(tk)T] can be computed based on the conditional distribution p(ηi(tk)|Yi) obtained at process block 1220, while the pairwise expectation E[ηi(tk)(ηi(t—1)T] can be derived using Bayesian Theorem. The expressions for these expectations are as follows:
After the estimate of model parameter values has been determined and updated (e.g., after the second iteration, the initial estimate of values is updated with σi′. Thus, after the update σi′ of the estimated parameter is derived, the updated estimation parameter is assigned to σold. The method proceeds to process block 1240.
At process block 1240, convergence criteria for the method are evaluated to determine whether another iteration should be performed. For example, the convergence criteria can evaluate whether the model parameters are converging to a stable value by comparing update model parameters to previous values. In some examples, a predefined amount of change in criteria is used to determine that the model values are sufficiently converged.
If the evaluated convergence criteria are not satisfied, the method proceeds back to process block 1220 to perform another iteration of determining a conditional distribution of a set of state values for the thermal environment and determining new values for the set of model parameters selected to maximize a likelihood function based on the conditional distribution. repeating the determining the conditional distribution and the determining new values for the set of model parameters until the set of convergence criteria is met. Once it is determined that the convergence criteria have been satisfied, the method proceeds to process block 1250.
At process block 1250, once the estimation for σi is obtained, the estimated state trajectory and/or updated model parameters are provided, which can be used to determine an energy response to perform bid generation. For example, the Example Bidding Strategy discussed above at part VII.G can be used to determine an energy response (or approximated energy response), and a bid quantity and price are determined. After determining a bid based on the estimated state trajectory and/or updated model parameters, the method proceeds to process block 1260.
At process block 1260, the bid determined at process block 1250 is submitted to a market coordinator. For example, the example method outlined above with respect to
This section applies an example disclosed market mechanism and the learning scheme to a TCL coordination problem, and presents simulation results to demonstrate the effectiveness of the proposed market mechanism. The simulation results were performed on a model similar to that illustrated in the diagram of
A. Simulation Setup
A realistic scenario is considered where each consumer is equipped with a smart thermostat that can measure the room temperature and communicate to the market coordinator. At each period, the TCL controller device measures the current room temperature and submits a bidding price based on a bidding curve. The coordinator collects all the user bids and clears the energy market with a price. Each device will then determine the control setpoint in response to this energy price, which modifies the load dynamics and affects the bids for the next period. This framework is validated in MATLAB using parameters generated in GridLab-D.
A second-order ETP model is used to capture the load dynamics of the TCLs. The ETP model parameters depend on various building parameters, such as glass type, floor area, area per floor, glazing layers and material, etc. In the simulation, 1000 sets of building parameters are generated. A few parameters are randomly, while the rest take their default values from GridLAB-D. Throughout the example simulation, the aggregated power of the unresponsive loads is assumed to be 12 MW, and the feeder power capacity is 15 MW.
Weather data and the Typical Meteorological Year (TMY2) data for Columbus, Ohio, which includes air temperature and solar radiation were. The wholesale energy prices are from the PJM market. The prices are modified to a retail rate in $1 per kWh plus a retail modifier; this retail price is used as the base price.
First the proposed mechanism is evaluated in the deterministic case, where each user can accurately estimate the unknown parameters. Each user submits a bid (e.g., as described above regarding
The trajectories of the market clearing price and the base price are shown in
Furthermore, to evaluate the proposed mechanism in terms of social welfare maximization, we compare it with another scenario, where Real Time Pricing (RTP) is adopted to cap the power in a heuristic way. More specifically, when there is no congestion, the market clearing price is equal to the base price. When the power congestion occurs, the clearing price is the base price multiplied by a fixed ratio γ, which is greater than 1 and can cap the aggregated power below the limit effectively. A number of simulations can be performed to find such ratios, and among all the possible ratios, the minimum one is chosen that can cap the aggregated power below the feeder capacity. Since the social welfare of the two scenarios will be the same during the uncongestion period (γ=1), the weather data on Aug. 16, 2009, is used where more power congestion can be observed due to the elevated temperature. In this case γ=2.6, and the social welfare of the two pricing strategies is shown in
B. Example Application of an Output-Based Bidding
This subsection shows how the proposed output-based bidding algorithm can be used to accurately estimate the bidding prices. In the simulation, it is assumed that each device can locally measure its room temperature every 30 seconds, and store all the measurements for the past 3 hours, in which case M=360. The algorithm is started with an initial guess σold with 10% error. In other words, each element of the initial guess σold is generated by randomly selecting a value between 90% and 110% of its true value. With the estimated parameters derived in the output-based bidding algorithm, each consumer controller can compute the bidding prices. Here a random coordinator in a random market period is chosen and its estimation result is present in
When all the users apply the output-based bidding technique to compute the biding prices, an error (of less than 1%) will be introduced. Next we evaluate how this estimation error can affect the aggregated power response. To implement the estimation framework, each device will locally perform the output-based bidding algorithm during each market period, which just takes 5.5 seconds on a laptop with 2.5 GHz Intel i5 processor and 8G memory. However, it is computationally intensive to do the centralized simulations for all the users over 24 hours to show how the estimation error affects the aggregated power response. For this reason, instead of directly incorporating the output-based bidding algorithm in individual simulations, we add a simulated error of 2% (this simulated error is larger than the actual error of the output-based bidding method) to each controller's bidding price. The simulation results with this bidding error are presented in
C. Comparison with Other Strategies
In this subsection we first compare the proposed mechanism with Real Time Pricing (RTP). RTP can incentivize users to shift demand from high price periods to low price periods to reduce electricity expenditures. However, as such an approach directly passes the base price to the retail market, it does not typically achieve predictable and reliable aggregated power response, which is essential in many demand response programs. To illustrate these limitations, we compare the example simulation approach with RTP by applying RTP in the considered problem. In this simulation the coordinator clears the market by directly passing the base price to individual users, and the devices respond to the energy price according to the response curve described in the companion paper. Except for the pricing strategy, all the parameters are the same as in the simulation in subsection IX.B, and the result is presented in
In addition, the proposed mechanism is also compared with another base scenario simulation. In that simulation, the market clearing strategies of the two cases are the same, while the bidding strategies are different. In the base scenario, each device submits a bid based on the current temperature, while the device in the proposed mechanism computes the bid according to
D. Effect of Adjusting Certain Parameters
This subsection discusses how adjusting certain parameters can affect the performance of disclosed methods for bidding and clearing prices.
Number of Households:
In this subsection we use numerical simulations to investigate to what extent the assumption that every consumer is a price taker can be justified. In particular, the influence of an individual bid on the market price is simulated to show how this influence changes with the growing number of participating households. This can be done by perturbing the bidding price of a user i, and observing how the market price changes with this perturbation while the bids of all the other users remain the same. It can be verified that under an example clearing strategy (e.g., as discussed above regarding
The simulation can be done in the following steps. First, randomly choose a group of users (e.g., 100 users) and a market period, simulate the market bidding and clearing process, and derive the corresponding market clearing price. Second, choose one user from this group, perturb his bid, and rerun the market clearing process to obtain another market price. Third, compute the influence index based on the market clearing prices derived from the first two steps. Fourth, enlarge the group and repeat all the procedures described above. Notice that when there is no congestion, the clearing price is the base price, and the influence index is zero (0). Therefore, the simulations are done in a market period during which power congestion occurs. In addition, to enforce a fair comparison, we assume that the feeder power constraint changes according to the number of participating household. For example, if the maximum power of each air conditioning load is 5 kW, and there are N loads in the project, then the maximum aggregated power is 5N kW, and the feeder power capacity is 60% of the maximum power, 3N kW.
The simulation result is shown in
Weather Information:
Aside from the number of participating households, the outside temperature data is also an important parameter that affects the performance of the proposed mechanism. The high temperature period can significantly increase the aggregated power demand of the air conditioning loads, and therefore cause more power congestion. For this reason, we evaluate the proposed method with a different temperature record. The data is obtained for Aug. 16 (hot day), 2009 in Columbus, Ohio, as shown in
The power trajectory and the market clearing prices are presented in
Initial Guess of the Output-Based Bidding Algorithm:
The initial guess of the output-based bidding algorithm also affects performance of the estimation result. In the previous simulations discussed above, the initial guess is generated by randomly selecting a value between 90% and 110% of its true value. Therefore, to implement the output-based bidding algorithm, we assume that users have some prior knowledge of the unknown parameters to guarantee that initial guess is within this range (from 90% to 110%) of the true value). In particular, we use the same model parameters, and gradually increase the error of the initial guess to evaluate the corresponding estimation performance. An estimation result is shown in
In view of the many possible embodiments to which the principles of the disclosed invention may be applied, it should be recognized that the illustrated embodiments are only preferred examples of the invention and should not be taken as limiting the scope of the invention. Rather, the scope of the invention is defined by the following claims. We therefore claim as our invention all that comes within the scope and spirit of these claims.
This application claims the benefit of U.S. Provisional Patent Application No. 62/056,221, entitled “A MARKET MECHANISM DESIGN APPROACH FOR COORDINATION OF THERMOSTATICALLY CONTROLLED LOADS WITH UNKNOWN PARAMETERS,” filed Sep. 26, 2014, and U.S. Provisional Patent Application No. 62/056,212, entitled “ON MARKET-BASED COORDINATION OF THERMOSTATICALLY CONTROLLED LOADS WITH USER PREFERENCE,” filed Sep. 26, 2014, and which applications are incorporated herein by reference in their entireties.
This invention was made with Government support under Contract DE-ACO576RL01830 awarded by the U.S. Department of Energy. The Government has certain rights in the invention.
Number | Date | Country | |
---|---|---|---|
62056221 | Sep 2014 | US | |
62056212 | Sep 2014 | US |