The present disclosure relates generally to control systems for an HVAC system. More particularly, the present disclosure relates to extremum-seeking control for an HVAC system.
Extremum-seeking control (ESC) is a class of self-optimizing control strategies that can dynamically search for the unknown and/or time-varying inputs of a system for optimizing a certain performance index. It can be considered a dynamic realization of gradient searching through the use of dither signals. The gradient of the system output with respect to the system input is typically obtained by slightly perturbing the system operation and applying a demodulation measure. Optimization of system performance can be obtained by driving the gradient towards zero by using an integrator in the closed-loop system. ESC is a non-model based control strategy, meaning that a model for the controlled system is not necessary for ESC to optimize the system.
ESC has been used in many different engineering applications (e.g., combustion, circuitry, mining, aerospace and land-based vehicles, building HVAC, wind and solar energy, etc.) and has been shown to be able to improve the operational efficiency and performance for these engineering applications. A variety of different types of ESC have been developed including dither ESC, switching ESC, sliding-mode ESC, adaptive ESC, among others. Each type of ESC has its respective advantages and disadvantages. Each of these approaches to ESC is implemented to drive a gradient of a control system to zero.
Typical ESC controllers operate by finding a gradient of a performance variable with respect to a manipulated variable associated with different building equipment configurations and constantly updating the manipulated variable until the gradient is driven to zero. The controllers generally update the configurations of pieces of building equipment before determining the gradient of associated performance variables.
One implementation of the present disclosure is a self-perturbing extremum-seeking controller for a plant having at least a static portion, according to some embodiments. In some embodiments, the controller includes a processing circuit configured to obtain a value of a performance variable characterizing a performance of the plant. In some embodiments, the processing circuit is configured to determine a sign of a gradient of the performance variable with respect to an input to the static portion of the plant. In some embodiments, the processing circuit is configured to adjust a control input for the plant using the sign of the gradient, and provide the control input to the plant to affect the performance variable.
In some embodiments, the plant is a dynamic plant including the static portion and a dynamic portion. In some embodiments, the input to the static portion of the plant is an output of the dynamic portion of the plant.
In some embodiments, the processing circuit is configured to determine a gradient of the performance variable with respect to time using one or more values of the performance variable. In some embodiments, the processing circuit is configured to estimate a gradient of the output of the dynamic portion of the plant with respect to time using a model of the dynamic portion of the plant. In some embodiments, the processing circuit is configured to determine the gradient of the performance variable with respect to the input to the static portion based on the gradient of the performance variable with respect to time and the gradient of the output of the dynamic portion with respect to time.
In some embodiments, the processing circuit is configured to mitigate an effect of the dynamic portion of the plant on the performance variable by holding the sign of the gradient at a positive state or a negative state until an amount of time has elapsed since the sign of the gradient last switched between the positive state and the negative state.
In some embodiments, the plant is a static plant and the input to the static portion is the control input provided by the controller.
In some embodiments, the processing circuit is configured to determine a gradient of the performance variable with respect to time using one or more values of the performance variable. In some embodiments, the processing circuit is configured to determine a gradient of the control input with respect to time, and determine the gradient of the performance variable with respect to the input to the static portion based on the gradient of the performance variable with respect to time and the gradient of the control input with respect to time.
In some embodiments, the processing circuit is configured to adjust a rate at which the performance variable moves toward a minimum or maximum of the performance variable by applying an adjustable gain parameter to the gradient.
In some embodiments, the processing circuit is configured to adjust at least one of a magnitude or frequency of oscillations of the performance by applying an adjustable gain parameter to the gradient.
Another implementation of the present disclosure is a self-perturbing extremum-seeking controller for a plant, according to some embodiments. In some embodiments, the controller includes a processing circuit configured to determine a gradient of a performance variable characterizing a performance of the plant using one or more values of the performance variable. In some embodiments, the processing circuit is configured to apply the gradient of the performance variable as an input to a relay that switches an output of the relay between a positive state and a negative state based on a sign of the gradient. In some embodiments, the processing circuit is configured to adjust a control input for the plant using the output of the relay, and provide the control input to the plant to affect the performance variable.
In some embodiments, the output of the relay is a discontinuous signal with respect to time that switches between the positive state and the negative state.
In some embodiments, the gradient of the performance variable is a time gradient indicating a rate of change of the performance variable with respect to time.
In some embodiments, the gradient of the performance variable is a gradient of the performance variable with respect to the control input to the plant.
In some embodiments, the plant includes a static portion and a dynamic portion. In some embodiments. In some embodiments, the processing circuit is configured to mitigate an effect of the dynamic portion of the plant on the performance variable by holding the output of the relay at the positive state or the negative state until a predetermined amount of time has elapsed since the output of the relay last switched between the positive state and the negative state.
In some embodiments, the processing circuit is configured to adjust a rate at which the performance variable moves toward a minimum or maximum of the performance variable by applying an adjustable gain parameter to the output of the relay.
In some embodiments, the processing circuit is configured to adjust at least one of a magnitude or frequency of oscillations of the performance variable by applying an adjustable gain parameter to the output of the relay.
Another implementation of the present disclosure is a method for adaptively perturbing a control input for a plant in an extremum-seeking control system, according to some embodiments. In some embodiments, the method includes determining a rate of change of a performance variable that characterizes a performance of the plant using one or more values of the performance variable. In some embodiments, the method also includes estimating a gradient of the performance variable with respect to a control input to the plant using the rate of change of the performance variable and an adaptive gain parameter indicating a rate of change of the control input to the plant. In some embodiments, the method includes adjusting the adaptive gain parameter using the gradient of the performance variable with respect to the control input to the plant. In some embodiments, the method includes perturbing the control input to the plant using the adjusted adaptive gain parameter and a sign of the gradient of the performance variable with respect to the control input to the plant.
In some embodiments, the method further includes normalizing the gradient relative to a range of the performance variable. The normalized gradient may be used to adjust the adaptive gain parameter.
In some embodiments, adjusting the adaptive gain parameter includes reducing a magnitude of the adaptive gain parameter as the performance variable approaches a minima or maxima of the performance variable.
In some embodiments, adjusting the adaptive gain parameter includes reducing a rate of convergence of the performance variable toward a minima or maxima of the performance variable as the performance variable approaches the minima or maxima and decreases a magnitude of oscillations of the performance variable about the minima or maxima.
In some embodiments, adjusting the adaptive gain parameter includes calculating a new value of the adaptive gain parameter using a current value of the adaptive gain parameter, the rate of change of the performance variable, and a non-adaptive gain parameter.
Overview
Referring generally to the FIGURES, a self-perturbing extremum-seeking control (ESC) system includes a plant, and an extremum-seeking controller. The extremum-seeking controller is configured to generate an input for the plant so that equipment of the plant operates to affect an environmental condition of a conditioned space (e.g., a room, a zone, an area, etc.). The plant may be a static plant (e.g., represented by a static linearity) or may be a dynamic plant including a static portion and a dynamic portion. The extremum-seeking controller may be configured to obtain values of a performance variable (e.g., a cost or output variable) that characterizes a performance of the plant and may obtain a gradient (e.g., a time derivative) of the performance variable or output of the plant.
In some embodiments, the extremum-seeking controller uses a model of the dynamic portion of the plant to estimate the effect of the dynamic portion of the plant. The effect of the dynamic portion of the plant may be represented as a predicted output of the dynamic portion of the plant, which is provided as an input to the static portion of the plant. The extremum-seeking controller can determine a gradient (e.g., a time derivative) of the predicted output of the dynamic portion of the plant and can use this gradient in combination with the gradient of the performance variable to determine a gradient of the performance variable with respect to the predicted output of the dynamic portion of the plant. For example, the time-gradient of the performance variable can be divided by the time-gradient of the predicted output of the dynamic portion of the plant to calculate the gradient of the performance variable with respect to the output of the dynamic portion of the plant.
The extremum-seeking controller can use a filter to identify a sign (e.g., positive or negative) of the gradient of the performance variable with respect to the output of the dynamic portion of the plant. The sign of the gradient may be represented by ε∈{−1, 1}, which means that the sign of the gradient ε is equal to 1 of the gradient is positive or −1 of the gradient is negative. Advantageously, the sign of the gradient can be used to perturb the control input for the plant without requiring a separate dither signal. The extremum-seeking controller may use the sign of the gradient to calculate an amount by which the control input for the plant should be adjusted. For example, the extremum-seeking controller may apply the sign of the gradient to a gain parameter k, which will result in a value of k if the gradient is positive or a value of −k if the gradient is negative. The gain parameter k may be a fixed parameter or can by adaptively adjusted as described in greater detail below. The calculated value k or −k represents an amount by which the control input to the plant should be adjusted. The extremum-seeking controller may adjust the control input by adding the calculated value k or −k to the previous value of the control input (e.g., using an integrator block) to generate an adjusted control input for the plant.
Building HVAC Systems and Building Management Systems
Referring now to
Building and HVAC System
Referring particularly to
The BMS that serves building 10 includes a HVAC system 100. HVAC system 100 can include a plurality of HVAC devices (e.g., heaters, chillers, air handling units, pumps, fans, thermal energy storage, etc.) configured to provide heating, cooling, ventilation, or other services for building 10. For example, HVAC system 100 is shown to include a waterside system 120 and an airside system 130. Waterside system 120 may provide a heated or chilled fluid to an air handling unit of airside system 130. Airside system 130 may use the heated or chilled fluid to heat or cool an airflow provided to building 10. An exemplary waterside system and airside system which can be used in HVAC system 100 are described in greater detail with reference to
HVAC system 100 is shown to include a chiller 102, a boiler 104, and a rooftop air handling unit (AHU) 106. Waterside system 120 may use boiler 104 and chiller 102 to heat or cool a working fluid (e.g., water, glycol, etc.) and may circulate the working fluid to AHU 106. In various embodiments, the HVAC devices of waterside system 120 can be located in or around building 10 (as shown in
AHU 106 may place the working fluid in a heat exchange relationship with an airflow passing through AHU 106 (e.g., via one or more stages of cooling coils and/or heating coils). The airflow can be, for example, outside air, return air from within building 10, or a combination of both. AHU 106 may transfer heat between the airflow and the working fluid to provide heating or cooling for the airflow. For example, AHU 106 can include one or more fans or blowers configured to pass the airflow over or through a heat exchanger containing the working fluid. The working fluid may then return to chiller 102 or boiler 104 via piping 110.
Airside system 130 may deliver the airflow supplied by AHU 106 (i.e., the supply airflow) to building 10 via air supply ducts 112 and may provide return air from building 10 to AHU 106 via air return ducts 114. In some embodiments, airside system 130 includes multiple variable air volume (VAV) units 116. For example, airside system 130 is shown to include a separate VAV unit 116 on each floor or zone of building 10. VAV units 116 can include dampers or other flow control elements that can be operated to control an amount of the supply airflow provided to individual zones of building 10. In other embodiments, airside system 130 delivers the supply airflow into one or more zones of building 10 (e.g., via supply ducts 112) without using intermediate VAV units 116 or other flow control elements. AHU 106 can include various sensors (e.g., temperature sensors, pressure sensors, etc.) configured to measure attributes of the supply airflow. AHU 106 may receive input from sensors located within AHU 106 and/or within the building zone and may adjust the flow rate, temperature, or other attributes of the supply airflow through AHU 106 to achieve setpoint conditions for the building zone.
Waterside System
Referring now to
In
Hot water loop 214 and cold water loop 216 may deliver the heated and/or chilled water to air handlers located on the rooftop of building 10 (e.g., AHU 106) or to individual floors or zones of building 10 (e.g., VAV units 116). The air handlers push air past heat exchangers (e.g., heating coils or cooling coils) through which the water flows to provide heating or cooling for the air. The heated or cooled air can be delivered to individual zones of building 10 to serve thermal energy loads of building 10. The water then returns to subplants 202-212 to receive further heating or cooling.
Although subplants 202-212 are shown and described as heating and cooling water for circulation to a building, it is understood that any other type of working fluid (e.g., glycol, CO2, etc.) can be used in place of or in addition to water to serve thermal energy loads. In other embodiments, subplants 202-212 may provide heating and/or cooling directly to the building or campus without requiring an intermediate heat transfer fluid. These and other variations to waterside system 200 are within the teachings of the present disclosure.
Each of subplants 202-212 can include a variety of equipment configured to facilitate the functions of the subplant. For example, heater subplant 202 is shown to include a plurality of heating elements 220 (e.g., boilers, electric heaters, etc.) configured to add heat to the hot water in hot water loop 214. Heater subplant 202 is also shown to include several pumps 222 and 224 configured to circulate the hot water in hot water loop 214 and to control the flow rate of the hot water through individual heating elements 220. Chiller subplant 206 is shown to include a plurality of chillers 232 configured to remove heat from the cold water in cold water loop 216. Chiller subplant 206 is also shown to include several pumps 234 and 236 configured to circulate the cold water in cold water loop 216 and to control the flow rate of the cold water through individual chillers 232.
Heat recovery chiller subplant 204 is shown to include a plurality of heat recovery heat exchangers 226 (e.g., refrigeration circuits) configured to transfer heat from cold water loop 216 to hot water loop 214. Heat recovery chiller subplant 204 is also shown to include several pumps 228 and 230 configured to circulate the hot water and/or cold water through heat recovery heat exchangers 226 and to control the flow rate of the water through individual heat recovery heat exchangers 226. Cooling tower subplant 208 is shown to include a plurality of cooling towers 238 configured to remove heat from the condenser water in condenser water loop 218. Cooling tower subplant 208 is also shown to include several pumps 240 configured to circulate the condenser water in condenser water loop 218 and to control the flow rate of the condenser water through individual cooling towers 238.
Hot TES subplant 210 is shown to include a hot TES tank 242 configured to store the hot water for later use. Hot TES subplant 210 may also include one or more pumps or valves configured to control the flow rate of the hot water into or out of hot TES tank 242. Cold TES subplant 212 is shown to include cold TES tanks 244 configured to store the cold water for later use. Cold TES subplant 212 may also include one or more pumps or valves configured to control the flow rate of the cold water into or out of cold TES tanks 244.
In some embodiments, one or more of the pumps in waterside system 200 (e.g., pumps 222, 224, 228, 230, 234, 236, and/or 240) or pipelines in waterside system 200 include an isolation valve associated therewith. Isolation valves can be integrated with the pumps or positioned upstream or downstream of the pumps to control the fluid flows in waterside system 200. In various embodiments, waterside system 200 can include more, fewer, or different types of devices and/or subplants based on the particular configuration of waterside system 200 and the types of loads served by waterside system 200.
Airside System
Referring now to
In
Each of dampers 316-320 can be operated by an actuator. For example, exhaust air damper 316 can be operated by actuator 324, mixing damper 318 can be operated by actuator 326, and outside air damper 320 can be operated by actuator 328. Actuators 324-328 may communicate with an AHU controller 330 via a communications link 332. Actuators 324-328 may receive control signals from AHU controller 330 and may provide feedback signals to AHU controller 330. Feedback signals can include, for example, an indication of a current actuator or damper position, an amount of torque or force exerted by the actuator, diagnostic information (e.g., results of diagnostic tests performed by actuators 324-328), status information, commissioning information, configuration settings, calibration data, and/or other types of information or data that can be collected, stored, or used by actuators 324-328. AHU controller 330 can be an economizer controller configured to use one or more control algorithms (e.g., state-based algorithms, ESC algorithms, proportional-integral (PI) control algorithms, proportional-integral-derivative (PID) control algorithms, model predictive control (MPC) algorithms, feedback control algorithms, etc.) to control actuators 324-328.
Still referring to
Cooling coil 334 may receive a chilled fluid from waterside system 200 (e.g., from cold water loop 216) via piping 342 and may return the chilled fluid to waterside system 200 via piping 344. Valve 346 can be positioned along piping 342 or piping 344 to control a flow rate of the chilled fluid through cooling coil 334. In some embodiments, cooling coil 334 includes multiple stages of cooling coils that can be independently activated and deactivated (e.g., by AHU controller 330, by BMS controller 366, etc.) to modulate an amount of cooling applied to supply air 310.
Heating coil 336 may receive a heated fluid from waterside system 200 (e.g., from hot water loop 214) via piping 348 and may return the heated fluid to waterside system 200 via piping 350. Valve 352 can be positioned along piping 348 or piping 350 to control a flow rate of the heated fluid through heating coil 336. In some embodiments, heating coil 336 includes multiple stages of heating coils that can be independently activated and deactivated (e.g., by AHU controller 330, by BMS controller 366, etc.) to modulate an amount of heating applied to supply air 310.
Each of valves 346 and 352 can be controlled by an actuator. For example, valve 346 can be controlled by actuator 354 and valve 352 can be controlled by actuator 356. Actuators 354-356 may communicate with AHU controller 330 via communications links 358-360. Actuators 354-356 may receive control signals from AHU controller 330 and may provide feedback signals to controller 330. In some embodiments, AHU controller 330 receives a measurement of the supply air temperature from a temperature sensor 362 positioned in supply air duct 312 (e.g., downstream of cooling coil 334 and/or heating coil 336). AHU controller 330 may also receive a measurement of the temperature of building zone 306 from a temperature sensor 364 located in building zone 306.
In some embodiments, AHU controller 330 operates valves 346 and 352 via actuators 354-356 to modulate an amount of heating or cooling provided to supply air 310 (e.g., to achieve a setpoint temperature for supply air 310 or to maintain the temperature of supply air 310 within a setpoint temperature range). The positions of valves 346 and 352 affect the amount of heating or cooling provided to supply air 310 by cooling coil 334 or heating coil 336 and may correlate with the amount of energy consumed to achieve a desired supply air temperature. AHU 330 may control the temperature of supply air 310 and/or building zone 306 by activating or deactivating coils 334-336, adjusting a speed of fan 338, or a combination of both.
Still referring to
In some embodiments, AHU controller 330 receives information from BMS controller 366 (e.g., commands, setpoints, operating boundaries, etc.) and provides information to BMS controller 366 (e.g., temperature measurements, valve or actuator positions, operating statuses, diagnostics, etc.). For example, AHU controller 330 may provide BMS controller 366 with temperature measurements from temperature sensors 362-364, equipment on/off states, equipment operating capacities, and/or any other information that can be used by BMS controller 366 to monitor or control a variable state or condition within building zone 306.
Client device 368 can include one or more human-machine interfaces or client interfaces (e.g., graphical user interfaces, reporting interfaces, text-based computer interfaces, client-facing web services, web servers that provide pages to web clients, etc.) for controlling, viewing, or otherwise interacting with HVAC system 100, its subsystems, and/or devices. Client device 368 can be a computer workstation, a client terminal, a remote or local interface, or any other type of user interface device. Client device 368 can be a stationary terminal or a mobile device. For example, client device 368 can be a desktop computer, a computer server with a user interface, a laptop computer, a tablet, a smartphone, a PDA, or any other type of mobile or non-mobile device. Client device 368 may communicate with BMS controller 366 and/or AHU controller 330 via communications link 372.
Building Management Systems
Referring now to
Each of building subsystems 428 can include any number of devices, controllers, and connections for completing its individual functions and control activities. HVAC subsystem 440 can include many of the same components as HVAC system 100, as described with reference to
Still referring to
Interfaces 407, 409 can be or include wired or wireless communications interfaces (e.g., jacks, antennas, transmitters, receivers, transceivers, wire terminals, etc.) for conducting data communications with building subsystems 428 or other external systems or devices. In various embodiments, communications via interfaces 407, 409 can be direct (e.g., local wired or wireless communications) or via a communications network 446 (e.g., a WAN, the Internet, a cellular network, etc.). For example, interfaces 407, 409 can include an Ethernet card and port for sending and receiving data via an Ethernet-based communications link or network. In another example, interfaces 407, 409 can include a Wi-Fi transceiver for communicating via a wireless communications network. In another example, one or both of interfaces 407, 409 can include cellular or mobile phone communications transceivers. In one embodiment, communications interface 407 is a power line communications interface and BMS interface 409 is an Ethernet interface. In other embodiments, both communications interface 407 and BMS interface 409 are Ethernet interfaces or are the same Ethernet interface.
Still referring to
Memory 408 (e.g., memory, memory unit, storage device, etc.) can include one or more devices (e.g., RAM, ROM, Flash memory, hard disk storage, etc.) for storing data and/or computer code for completing or facilitating the various processes, layers and modules described in the present application. Memory 408 can be or include volatile memory or non-volatile memory. Memory 408 can include database components, object code components, script components, or any other type of information structure for supporting the various activities and information structures described in the present application. According to some embodiments, memory 408 is communicably connected to processor 406 via processing circuit 404 and includes computer code for executing (e.g., by processing circuit 404 and/or processor 406) one or more processes described herein.
In some embodiments, BMS controller 366 is implemented within a single computer (e.g., one server, one housing, etc.). In various other embodiments BMS controller 366 can be distributed across multiple servers or computers (e.g., that can exist in distributed locations). Further, while
Still referring to
Enterprise integration layer 410 can be configured to serve clients or local applications with information and services to support a variety of enterprise-level applications. For example, enterprise control applications 426 can be configured to provide subsystem-spanning control to a graphical user interface (GUI) or to any number of enterprise-level business applications (e.g., accounting systems, user identification systems, etc.). Enterprise control applications 426 may also or alternatively be configured to provide configuration GUIs for configuring BMS controller 366. In yet other embodiments, enterprise control applications 426 can work with layers 410-420 to optimize building performance (e.g., efficiency, energy use, comfort, or safety) based on inputs received at interface 407 and/or BMS interface 409.
Building subsystem integration layer 420 can be configured to manage communications between BMS controller 366 and building subsystems 428. For example, building subsystem integration layer 420 may receive sensor data and input signals from building subsystems 428 and provide output data and control signals to building subsystems 428. Building subsystem integration layer 420 may also be configured to manage communications between building subsystems 428. Building subsystem integration layer 420 translate communications (e.g., sensor data, input signals, output signals, etc.) across a plurality of multi-vendor/multi-protocol systems.
Demand response layer 414 can be configured to optimize resource usage (e.g., electricity use, natural gas use, water use, etc.) and/or the monetary cost of such resource usage in response to satisfy the demand of building 10. The optimization can be based on time-of-use prices, curtailment signals, energy availability, or other data received from utility providers, distributed energy generation systems 424, from energy storage 427 (e.g., hot TES 242, cold TES 244, etc.), or from other sources. Demand response layer 414 may receive inputs from other layers of BMS controller 366 (e.g., building subsystem integration layer 420, integrated control layer 418, etc.). The inputs received from other layers can include environmental or sensor inputs such as temperature, carbon dioxide levels, relative humidity levels, air quality sensor outputs, occupancy sensor outputs, room schedules, and the like. The inputs may also include inputs such as electrical use (e.g., expressed in kWh), thermal load measurements, pricing information, projected pricing, smoothed pricing, curtailment signals from utilities, and the like.
According to some embodiments, demand response layer 414 includes control logic for responding to the data and signals it receives. These responses can include communicating with the control algorithms in integrated control layer 418, changing control strategies, changing setpoints, or activating/deactivating building equipment or subsystems in a controlled manner. Demand response layer 414 may also include control logic configured to determine when to utilize stored energy. For example, demand response layer 414 may determine to begin using energy from energy storage 427 just prior to the beginning of a peak use hour.
In some embodiments, demand response layer 414 includes a control module configured to actively initiate control actions (e.g., automatically changing setpoints) which minimize energy costs based on one or more inputs representative of or based on demand (e.g., price, a curtailment signal, a demand level, etc.). In some embodiments, demand response layer 414 uses equipment models to determine an optimal set of control actions. The equipment models can include, for example, thermodynamic models describing the inputs, outputs, and/or functions performed by various sets of building equipment. Equipment models may represent collections of building equipment (e.g., subplants, chiller arrays, etc.) or individual devices (e.g., individual chillers, heaters, pumps, etc.).
Demand response layer 414 may further include or draw upon one or more demand response policy definitions (e.g., databases, XML files, etc.). The policy definitions can be edited or adjusted by a user (e.g., via a graphical user interface) so that the control actions initiated in response to demand inputs can be tailored for the user's application, desired comfort level, particular building equipment, or based on other concerns. For example, the demand response policy definitions can specify which equipment can be turned on or off in response to particular demand inputs, how long a system or piece of equipment should be turned off, what setpoints can be changed, what the allowable set point adjustment range is, how long to hold a high demand setpoint before returning to a normally scheduled setpoint, how close to approach capacity limits, which equipment modes to utilize, the energy transfer rates (e.g., the maximum rate, an alarm rate, other rate boundary information, etc.) into and out of energy storage devices (e.g., thermal storage tanks, battery banks, etc.), and when to dispatch on-site generation of energy (e.g., via fuel cells, a motor generator set, etc.).
Integrated control layer 418 can be configured to use the data input or output of building subsystem integration layer 420 and/or demand response later 414 to make control decisions. Due to the subsystem integration provided by building subsystem integration layer 420, integrated control layer 418 can integrate control activities of the subsystems 428 such that the subsystems 428 behave as a single integrated supersystem. In some embodiments, integrated control layer 418 includes control logic that uses inputs and outputs from a plurality of building subsystems to provide greater comfort and energy savings relative to the comfort and energy savings that separate subsystems could provide alone. For example, integrated control layer 418 can be configured to use an input from a first subsystem to make an energy-saving control decision for a second subsystem. Results of these decisions can be communicated back to building subsystem integration layer 420.
Integrated control layer 418 is shown to be logically below demand response layer 414. Integrated control layer 418 can be configured to enhance the effectiveness of demand response layer 414 by enabling building subsystems 428 and their respective control loops to be controlled in coordination with demand response layer 414. This configuration may advantageously reduce disruptive demand response behavior relative to conventional systems. For example, integrated control layer 418 can be configured to assure that a demand response-driven upward adjustment to the setpoint for chilled water temperature (or another component that directly or indirectly affects temperature) does not result in an increase in fan energy (or other energy used to cool a space) that would result in greater total building energy use than was saved at the chiller.
Integrated control layer 418 can be configured to provide feedback to demand response layer 414 so that demand response layer 414 checks that constraints (e.g., temperature, lighting levels, etc.) are properly maintained even while demanded load shedding is in progress. The constraints may also include setpoint or sensed boundaries relating to safety, equipment operating limits and performance, comfort, fire codes, electrical codes, energy codes, and the like. Integrated control layer 418 is also logically below fault detection and diagnostics layer 416 and automated measurement and validation layer 412. Integrated control layer 418 can be configured to provide calculated inputs (e.g., aggregations) to these higher levels based on outputs from more than one building subsystem.
Automated measurement and validation (AM&V) layer 412 can be configured to verify that control strategies commanded by integrated control layer 418 or demand response layer 414 are working properly (e.g., using data aggregated by AM&V layer 412, integrated control layer 418, building subsystem integration layer 420, FDD layer 416, or otherwise). The calculations made by AM&V layer 412 can be based on building system energy models and/or equipment models for individual BMS devices or subsystems. For example, AM&V layer 412 may compare a model-predicted output with an actual output from building subsystems 428 to determine an accuracy of the model.
Fault detection and diagnostics (FDD) layer 416 can be configured to provide on-going fault detection for building subsystems 428, building subsystem devices (i.e., building equipment), and control algorithms used by demand response layer 414 and integrated control layer 418. FDD layer 416 may receive data inputs from integrated control layer 418, directly from one or more building subsystems or devices, or from another data source. FDD layer 416 may automatically diagnose and respond to detected faults. The responses to detected or diagnosed faults can include providing an alert message to a user, a maintenance scheduling system, or a control algorithm configured to attempt to repair the fault or to work-around the fault.
FDD layer 416 can be configured to output a specific identification of the faulty component or cause of the fault (e.g., loose damper linkage) using detailed subsystem inputs available at building subsystem integration layer 420. In other exemplary embodiments, FDD layer 416 is configured to provide “fault” events to integrated control layer 418 which executes control strategies and policies in response to the received fault events. According to some embodiments, FDD layer 416 (or a policy executed by an integrated control engine or business rules engine) may shut-down systems or direct control activities around faulty devices or systems to reduce energy waste, extend equipment life, or assure proper control response.
FDD layer 416 can be configured to store or access a variety of different system data stores (or data points for live data). FDD layer 416 may use some content of the data stores to identify faults at the equipment level (e.g., specific chiller, specific AHU, specific terminal unit, etc.) and other content to identify faults at component or subsystem levels. For example, building subsystems 428 may generate temporal (i.e., time-series) data indicating the performance of BMS 400 and the various components thereof. The data generated by building subsystems 428 can include measured or calculated values that exhibit statistical characteristics and provide information about how the corresponding system or process (e.g., a temperature control process, a flow control process, etc.) is performing in terms of error from its setpoint. These processes can be examined by FDD layer 416 to expose when the system begins to degrade in performance and alert a user to repair the fault before it becomes more severe.
Referring now to
BMS 500 provides a system architecture that facilitates automatic equipment discovery and equipment model distribution. Equipment discovery can occur on multiple levels of BMS 500 across multiple different communications busses (e.g., a system bus 554, zone buses 556-560 and 564, sensor/actuator bus 566, etc.) and across multiple different communications protocols. In some embodiments, equipment discovery is accomplished using active node tables, which provide status information for devices connected to each communications bus. For example, each communications bus can be monitored for new devices by monitoring the corresponding active node table for new nodes. When a new device is detected, BMS 500 can begin interacting with the new device (e.g., sending control signals, using data from the device) without user interaction.
Some devices in BMS 500 present themselves to the network using equipment models. An equipment model defines equipment object attributes, view definitions, schedules, trends, and the associated BACnet value objects (e.g., analog value, binary value, multistate value, etc.) that are used for integration with other systems. Some devices in BMS 500 store their own equipment models. Other devices in BMS 500 have equipment models stored externally (e.g., within other devices). For example, a zone coordinator 508 can store the equipment model for a bypass damper 528. In some embodiments, zone coordinator 508 automatically creates the equipment model for bypass damper 528 or other devices on zone bus 558. Other zone coordinators can also create equipment models for devices connected to their zone busses. The equipment model for a device can be created automatically based on the types of data points exposed by the device on the zone bus, device type, and/or other device attributes. Several examples of automatic equipment discovery and equipment model distribution are discussed in greater detail below.
Still referring to
In some embodiments, system manager 502 is connected with zone coordinators 506-510 and 518 via a system bus 554. System manager 502 can be configured to communicate with zone coordinators 506-510 and 518 via system bus 554 using a master-slave token passing (MSTP) protocol or any other communications protocol. System bus 554 can also connect system manager 502 with other devices such as a constant volume (CV) rooftop unit (RTU) 512, an input/output module (IOM) 514, a thermostat controller 516 (e.g., a TEC5000 series thermostat controller), and a network automation engine (NAE) or third-party controller 520. RTU 512 can be configured to communicate directly with system manager 502 and can be connected directly to system bus 554. Other RTUs can communicate with system manager 502 via an intermediate device. For example, a wired input 562 can connect a third-party RTU 542 to thermostat controller 516, which connects to system bus 554.
System manager 502 can provide a user interface for any device containing an equipment model. Devices such as zone coordinators 506-510 and 518 and thermostat controller 516 can provide their equipment models to system manager 502 via system bus 554. In some embodiments, system manager 502 automatically creates equipment models for connected devices that do not contain an equipment model (e.g., IOM 514, third party controller 520, etc.). For example, system manager 502 can create an equipment model for any device that responds to a device tree request. The equipment models created by system manager 502 can be stored within system manager 502. System manager 502 can then provide a user interface for devices that do not contain their own equipment models using the equipment models created by system manager 502. In some embodiments, system manager 502 stores a view definition for each type of equipment connected via system bus 554 and uses the stored view definition to generate a user interface for the equipment.
Each zone coordinator 506-510 and 518 can be connected with one or more of zone controllers 524, 530-532, 536, and 548-550 via zone buses 556, 558, 560, and 564. Zone coordinators 506-510 and 518 can communicate with zone controllers 524, 530-532, 536, and 548-550 via zone busses 556-560 and 564 using a MSTP protocol or any other communications protocol. Zone busses 556-560 and 564 can also connect zone coordinators 506-510 and 518 with other types of devices such as variable air volume (VAV) RTUs 522 and 540, changeover bypass (COBP) RTUs 526 and 552, bypass dampers 528 and 546, and PEAK controllers 534 and 544.
Zone coordinators 506-510 and 518 can be configured to monitor and command various zoning systems. In some embodiments, each zone coordinator 506-510 and 518 monitors and commands a separate zoning system and is connected to the zoning system via a separate zone bus. For example, zone coordinator 506 can be connected to VAV RTU 522 and zone controller 524 via zone bus 556. Zone coordinator 508 can be connected to COBP RTU 526, bypass damper 528, COBP zone controller 530, and VAV zone controller 532 via zone bus 558. Zone coordinator 510 can be connected to PEAK controller 534 and VAV zone controller 536 via zone bus 560. Zone coordinator 518 can be connected to PEAK controller 544, bypass damper 546, COBP zone controller 548, and VAV zone controller 550 via zone bus 564.
A single model of zone coordinator 506-510 and 518 can be configured to handle multiple different types of zoning systems (e.g., a VAV zoning system, a COBP zoning system, etc.). Each zoning system can include a RTU, one or more zone controllers, and/or a bypass damper. For example, zone coordinators 506 and 510 are shown as Verasys VAV engines (VVEs) connected to VAV RTUs 522 and 540, respectively. Zone coordinator 506 is connected directly to VAV RTU 522 via zone bus 556, whereas zone coordinator 510 is connected to a third-party VAV RTU 540 via a wired input 568 provided to PEAK controller 534. Zone coordinators 508 and 518 are shown as Verasys COBP engines (VCEs) connected to COBP RTUs 526 and 552, respectively. Zone coordinator 508 is connected directly to COBP RTU 526 via zone bus 558, whereas zone coordinator 518 is connected to a third-party COBP RTU 552 via a wired input 570 provided to PEAK controller 544.
Zone controllers 524, 530-532, 536, and 548-550 can communicate with individual BMS devices (e.g., sensors, actuators, etc.) via sensor/actuator (SA) busses. For example, VAV zone controller 536 is shown connected to networked sensors 538 via SA bus 566. Zone controller 536 can communicate with networked sensors 538 using a MSTP protocol or any other communications protocol. Although only one SA bus 566 is shown in
Each zone controller 524, 530-532, 536, and 548-550 can be configured to monitor and control a different building zone. Zone controllers 524, 530-532, 536, and 548-550 can use the inputs and outputs provided via their SA busses to monitor and control various building zones. For example, a zone controller 536 can use a temperature input received from networked sensors 538 via SA bus 566 (e.g., a measured temperature of a building zone) as feedback in a temperature control algorithm. Zone controllers 524, 530-532, 536, and 548-550 can use various types of control algorithms (e.g., state-based algorithms, extremum-seeking control (ESC) algorithms, proportional-integral (PI) control algorithms, proportional-integral-derivative (PID) control algorithms, model predictive control (MPC) algorithms, feedback control algorithms, etc.) to control a variable state or condition (e.g., temperature, humidity, airflow, lighting, etc.) in or around building 10.
Self-Perturbing Extremum-Seeking Control System
Plant Input/Output
Referring particularly to
Graph 600 illustrates a single-control input (u), single-output (J) system that has a steady-state characteristic with a concave shape (or a convex shape). The objective of extremum-seeking control is to modulate the control input u to drive the output J to an extremum, shown as extremum point 604. Extremum point 604 is at an intersection between an optimal control input u* and an optimal output J*, according to some embodiments. The output J of the plant may be represented by the function:
J=g(u) (1)
where g is a cost function (e.g., a transfer function) of the plant that relates the control input u to the output/cost/performance variable J.
Extremum-seeking control seeks to drive the control input/output (u,J) to extremum point 604 (e.g., such that u=u* and J=J*), according to some embodiments. In some embodiments, extremum-seeking control includes estimating a gradient 606 of the cost function g
and moving or adjusting the control input u in a direction that is proportional to a negative value of gradient 606 (e.g., if the plant exhibits a convex shape as shown in
where u(t+Δt) is a value of the control input u at a time t+Δt.
Referring now to
The gradient
can be calculated by determining time derivatives of each of the control input u and the performance variable J. For example, applying the chain rule to
yields:
which can be rewritten as:
according to some embodiments. In this way, if the time derivatives of the performance variable J and the control input u can be calculated, the gradient
can also be calculated. The gradient
may be integrated with a gain k (at integrator 706) and fed back to plant 702 as the control input u. In some embodiments, integrator 706 functions the same as or similar to integrator 812 as described in greater detail with reference to
Typical ESC systems use a dither signal to perturb the control input u (e.g., using a sinusoidal dither signal). The dither signal is typically generated by a controller and added to the control input u to provide artificial or controlled oscillations in the control input. However, the systems and methods described herein do not require artificial generation of a dither signal, thereby simplifying the ESC system. Instead of using a separate dither signal, the systems and methods described herein utilize a sign of the gradient (e.g., the gradient of the output of the plant with respect to the control input of the plant, or the gradient of the output of the plant with respect to an effect that a dynamic portion of the plant has on an input). Additionally, the gradient of the output with respect to the control input for traditional ESC may be largely dependent upon a scale of the plant (e.g., a magnitude of the response of the plant) which may require normalization to account for variation in scale across different types of plants (thereby allowing the ESC to be applied across different plants). Advantageously, the systems and methods herein utilize the sign of the gradient, thereby providing an inherently normalized control input for the plant. This may reduce the need to adjust the ESC for different scale plants, or for normalization. In this way, the self-perturbing ESC systems and methods as described herein can be applied to any plant, regardless of a scale of the plant, without requiring artificial normalization and without requiring a dither signal.
In order to simplify the ESC, a sign of the gradient
(i.e., whether the gradient is positive or negative) can be used as the feedback. Using the sign of the gradient
eliminates scale from the ESC such that the ESC algorithm can be used without requiring knowledge of the range of the performance variable J. For example, using the sign of the gradient
allows the ESC algorithm to be used with a plant or with equipment, regardless of scale or range of the performance variable J. Advantageously, this facilitates a more versatile extremum-seeking control scheme that can be used for various types of plants, different systems, different equipment, etc. Another advantage of using the sign of the gradient
in the feedback is that the extremum-seeking control loop does not reach steady-state and persistent excitation is provided to the plant (e.g., plant 702). In this way, the switching strategy is self-perturbing and does not require an external perturbation signal. Advantageously, this reduces (or may completely remove) the need for analysis, computation, etc., associated with generating a perturbation signal.
Self-Perturbing Extremum-Seeking Control System for a Static Plant
Referring now to
or the time derivative {dot over (J)} of the performance variable is calculated), an input gradient block 806 (where the gradient
or the time derivative {dot over (u)} of the control input is calculated), a division block 808, a filter 810, and an integrator 812.
Static plant 802 receives a control input u(t) and outputs the performance variable J(t), according to some embodiments. The performance variable J(t) is derived with respect to time at performance variable gradient 804 to determine
according to some embodiments. The control input u(t) is also derived with respect to time at input gradient block 806 to determine
according to some embodiments. The control input u(t) may be both provided to static plant 802 as the input and also derived with respect to time as shown in
The gradient of the performance variable
is divided by the gradient of the control input
at division block 808. Specifically, the gradient of the performance variable
is divided by the gradient of the control input
so that the time portions of the gradients cancel out, thereby determining the gradient of the performance variable J with respect to the control input u
The gradient of the performance variable J with respect to the control input u is provided to filter 810 (e.g., a sign filter) to determine a sign ε(t) of the gradient of the performance variable J with respect to the control input u
The sign ε(t) may be a binary value (e.g., −1 or +1) indicating a sign (e.g., positive or negative) of the gradient of the performance variable J with respect to the control input u
The sign ε(t) is multiplied by a value −k and integrated at integrator 812 to determine the control input u(t), according to some embodiments. In some embodiments, integrator 812 has the effect of adding the value −k to the previous value of the control input u(t−1) if the sign ε(t)) is positive (e.g., u(t)=u(t−1)+(−k)=u(t−1)−k) or subtracting the value −k from the previous value of the control input u(t−1) if the sign of ε(t)) is negative (e.g., u(t)=u(t−1)−(−k)=u(t−1)+k).
The value of k can be adjusted to increase a rate of convergence of the self-perturbing extremum-seeking control system 800. However, while increasing the value of k may lead to faster convergence, increasing the value of k may also lead to greater oscillation about an optimal solution (e.g., an optimal control input u). Additionally, the self-perturbing extremum-seeking control system 800 is configured for use with static plant 802. However, in practice, plants are often dynamic and include dynamic responses/portions.
Self-Perturbing Extremum-Seeking Control System for a Dynamic Plant
Referring now to
System 900 includes dynamic plant 902, performance variable gradient 804, division block 808, filter 810, a static nonlinearity gradient block 908, integrator 812, and dynamic estimation block 906, according to some embodiments. Dynamic plant 902 includes a dynamic portion 904 and static plant 802. Dynamic plant 902 receives the control input u(t) at dynamic portion 904, according to some embodiments. Dynamic portion 904 may be the dynamic portions of the dynamic plant 902. Static plant 802 represents the static portions of the dynamic plant 902, according to some embodiments. Dynamic portion 904 may be a dynamic portion of dynamic plant 902 which accounts for various internal parameters of dynamic plant 902. Dynamic portion 904 may receive the control input u(t) and output x(t). The output x(t) is provided to static plant 802 which outputs the performance variable J(t). However, in order to provide a self-perturbing extremum-seeking control system using similar methodology as described in greater detail above with reference to
It should be understood that a “static portion” and a “dynamic portion” of a plant as used herein refer to different behaviors of the plant or different portions of a model of the plant. For example, the dynamic portion 904 of plant 902 and the static portion 802 may characterize different behaviors or responses of the plant 902 given a specific control input or time series of control inputs. Static portion 802 and dynamic portion 904 of plant 902 mathematically define or characterize how plant 902 responds to a given control input, and do not necessarily refer to different physical portions of plant 902.
The dynamic portion 904 of the dynamic plant 902 can be estimated or approximated using a low-pass filter with a single time constant as shown in Equation (5) below:
where {circumflex over (D)}(s) is a low-pass filter with a single time constant τ that can be used to estimate the dynamic portion 904 of the dynamic plant 902. For example, the output x(t) can be estimated as {circumflex over (x)}(t) by: {circumflex over (x)}(t)={circumflex over (D)}(s)u(t).
Time derivatives of the performance variable J(t) (e.g., the output of the static portion 802) and the output x(t) (e.g., the output of the dynamic portion 904 of the dynamic plant 902) can be used to determine a gradient of the performance variable J(t) with respect to the output x(t). In some embodiments, the gradient of the performance variable J(t) with respect to the approximation or estimation of the output x(t), namely
is used to increase or decrease or determine the control input u(t) for dynamic plant 902.
The performance variable/output J from the dynamic plant 902 is derived with respect to time at performance variable gradient 804. Specifically, the gradient
of the performance variable/output J is determined by deriving J with respect to time. Likewise, the output {circumflex over (x)}(t) is derived with respect to time at gradient block 908 to determine a gradient
of the output {circumflex over (x)}(t) with respect to time t. In some embodiments, the output {circumflex over (x)}(t) is an estimate of the output x(t) of the dynamic portion 904 of dynamic plant 902. The gradient
and the gradient
are divided by each other at division block 808, according to some embodiments. Specifically, the gradient
is divided by the gradient
such that the time differentials dt cancel each other out, thereby achieving a gradient
of the performance variable/output J of the dynamic plant 902 with respect to the output {circumflex over (x)} of the dynamic estimation block 906.
The gradient
is then provided to filter 810, according to some embodiments. Filter 810 is configured to determine a sign ε(t) of the gradient
according to some embodiments. The sign ε(t) can be a binary value (e.g., −1 or +1) indicating whether the gradient
is positive or negative. The sign ε(t) is then integrated at integrator 812. Specifically, ε(t) is multiplied by the term
at integrator 812 to determine the control input u(t) for the dynamic plant 902 and the dynamic estimation block 906, according to some embodiments. In some embodiments, integrator 812 has the effect of adding the value −k to the previous value of the control input u(t−1) if the sign ε(t)) is positive (e.g., u(t)=u(t−1)+(−k)=u(t−1)−k) or subtracting the value −k from the previous value of the control input u(t−1) if the sign of ε(t)) is negative (e.g., u(t)=u(t−1)−(−k)=u(t−1)+k).
In this way, the self-perturbing extremum-seeking control system 900 can use the sign ε(t) of the gradient
to determine the control input u(t) for the dynamic plant 902. Advantageously, the self-perturbing extremum-seeking control system 900 is viable for low-cost applications and can be implemented on processing circuitry without requiring large amounts of memory or computational resources as other extremum-seeking control systems do. Additionally, the control input u(t) can be determined without tuning the extremum-seeking control system to account for a size or a scale of the response of the dynamic plant 902. The self-perturbing extremum-seeking control system 900 does not require an external dither signal to excite the dynamic plant 902. Furthermore, the convergence of the system represented by system 900 can be controlled or adjusted by increasing or decreasing the value of the gain k at the integrator block 812. The system 900 may be globally stable. Additionally, the extremum-seeking control system 900 only requires a single configuration parameter, that is, an estimate of a time constant of the dynamic plant 902. Finally, the extremum-seeking control system 900 is a simple approach to extremum-seeking control that uses easily computable time derivatives.
Referring particularly to
In some embodiments, inverse dynamic estimation block 1602 slows a speed of the control input u(t) so that the speed of the signal associated with the cost/output J(t) matches or is synchronized with the speed of the estimation {circumflex over (x)}(t), thereby facilitating proper determination of the gradient
In some embodiments, dynamic estimation block 906 may increase a speed of the signal associated with the estimation {circumflex over (x)}(t) so that the cost/output J(t) and the estimation {circumflex over (x)}(t) are synchronized with each other, thereby facilitating proper determination of the gradient
In some embodiments, both dynamic estimation block 906 and inverse dynamic estimation block 1602 are used. For example, the self-perturbing extremum-seeking control system may adjust the speed of the signals associated with both the estimation {circumflex over (x)}(t) and the control input u(t) so that the estimation {circumflex over (x)}(t) and the cost/output J(t) are synchronized, thereby facilitating proper determination of the gradient
Referring particularly to
For example, the sign ∈ may be −1 or +1 and can be calculated using:
according to some embodiments.
With assumptions regarding open loop stability, monotonic step response, etc., of D(s) (dynamic portion 904), and also assumptions regarding a sufficient hold time to enforce a sufficiently slow cyclical perturbation, the effect of the dynamics of dynamic plant 902 can be ignored, and it can be assumed that:
according to some embodiments.
Applying the chain rule to Equation (7) shown above yields:
which can be expressed as:
according to some embodiments. In some embodiments, hold block 1708 holds or slows a frequency of the perturbation signal thereby creating time scale separation to reduce the effect of the dynamics of dynamic plant 902.
HVAC System with Self-Perturbing Extremum-Seeking Control
Referring now to
Sensors 1008 can be configured to measure the environmental condition (e.g., the temperature) of conditioned space 1102. For example, sensors 1008 can be temperature sensors, humidity sensors, etc., or any other type of sensor configured to measure the environmental condition in conditioned space 1102. Sensors 1008 can be positioned locally at thermostat 1106 or may be positioned about conditioned space 1102. Sensors 1008 can be operatively or communicably coupled with ESC controller 1002 such that ESC controller 1002 receives measurements of the environmental condition of conditioned space 1102.
User interface 1108 can be configured to receive one or more user inputs and may provide the user inputs to controller 1002. In some embodiments, the user inputs indicate a desired value of the environmental condition (e.g., a temperature setpoint) of conditioned space 1102. ESC controller 1002 can use the user inputs received from user interface 1108 and the measurements obtained by sensors 1008 in a self-perturbing extremum-seeking control system (e.g., as shown in system 900) to generate control signals or inputs for HVAC equipment 1104. ESC controller 1002 provides the control inputs or the control signals to HVAC equipment 1104 such that HVAC equipment 1104 operates to affect the environmental condition (e.g., a variable state or a condition, a performance variable, etc.) of conditioned space 1102. In some embodiments, HVAC equipment 1104 is on-off equipment that is configured to transition between an on-state or an operational state, and an off-state or an inoperational state. The control inputs or the control signals provided to HVAC equipment 1104 by ESC controller 1002 may transition HVAC equipment 1006/1104 between the on-state and the off-state. The HVAC equipment 1104 can operate between the on-state and the off-state for various modes. For example, the HVAC equipment 1104 can be configured to operate in a heating mode and/or a cooling mode. The HVAC equipment 1104 can transition between the on-state and the off-state in the heating mode (e.g., to selectably provide heating to conditioned space 1102) or between the on-state and the off-state in the cooling mode (e.g., to selectably provide cooling to conditioned space 1102).
Referring particularly to
Control system 1000 includes one or more sensors 1008 that are configured to measure a value of a measured variable y of dynamic plant 1004, according to some embodiments. Sensors 1008 can provide the value(s) of the measured variable y of dynamic plant 1004 (e.g., temperature, humidity, etc.) to ESC controller 1002. ESC controller 1002 can use the values of the measured variable y in a self-perturbing extremum-seeking control system to determine optimal operational parameters of equipment 1006 (e.g., HVAC equipment 1104).
ESC Controller
Referring particularly to
ESC controller 1002 can be configured to implement the self-perturbing extremum-seeking control system 900 described in greater detail above with reference to
Communications interface 1210 can be or include wired or wireless communications interfaces (e.g., jacks, antennas, transmitters, receivers, transceivers, wire terminals, etc.) for conducting data communications with dynamic plant 1004 or other external systems or devices. In various embodiments, communications via communications interface 1210 can be direct (e.g., local wired or wireless communications) or via a communications network (e.g., a WAN, the Internet, a cellular network, etc.). For example, communications interface 1210 can include an Ethernet card and port for sending and receiving data via an Ethernet-based communications link or network. In another example, communications interface 1210 can include a Wi-Fi transceiver for communicating via a wireless communications network. In another example, communications interface 1210 can include cellular or mobile phone communications transceivers. In one embodiment, communications interface 1210 is a power line communications interface. In other embodiments, communications interface 1210 is an Ethernet interface.
Still referring to
Memory 1208 (e.g., memory, memory unit, storage device, etc.) can include one or more devices (e.g., RAM, ROM, Flash memory, hard disk storage, etc.) for storing data and/or computer code for completing or facilitating the various processes, layers and modules described in the present application. Memory 1208 can be or include volatile memory or non-volatile memory. Memory 1208 can include database components, object code components, script components, or any other type of information structure for supporting the various activities and information structures described in the present application. According to some embodiments, memory 1208 is communicably connected to processor 1206 via processing circuit 1204 and includes computer code for executing (e.g., by processing circuit 1204 and/or processor 1206) one or more processes described herein.
In some embodiments, ESC controller 1002 is implemented within a single computer (e.g., one server, one housing, etc.). In various other embodiments ESC controller 1002 can be distributed across multiple servers or computers (e.g., that can exist in distributed locations).
Referring still to
Performance variable gradient estimator 1224 is configured to receive the performance variable J from the dynamic plant 1004, according to some embodiments. Specifically, performance variable gradient estimator 1224 is configured to receive sensor signals from sensors 1008 that indicate the performance variable J (e.g., power consumption), according to some embodiments. In some embodiments, performance variable gradient estimator 1224 is configured to determine the gradient or the time derivative of the performance variable J. Performance variable gradient estimator 1224 can receive the performance variable J at consecutive time steps and estimate, calculate, determine, etc., the gradient
Performance variable gradient estimator 1224 can use a Savitzky-Golay filter (as shown in Equation (11) above) to determine the gradient
Performance variable gradient estimator 1224 provides the gradient
to gradient manager 1222, according to some embodiments.
Gradient manager 1222 is configured to receive the gradient
(e.g., the time derivative of the performance variable J) from performance variable gradient estimator 1224 and a gradient
(e.g., the time derivative of {circumflex over (x)}) from dynamic gradient estimator 1218. Gradient manager 1222 can determine a gradient
based on the gradient
and the gradient
Gradient manager 1222 may divide the gradient
by the gradient
to determine a gradient
of the performance variable J with respect to the output {circumflex over (x)} of plant model 1220. Gradient manager 1222 can then provide the gradient
to filter 1216.
Filter 1216 receives the gradient
and determines a sign ε of the gradient
according to some embodiments. In some embodiments, filter 1216 determines or identifies if the sign of the gradient
is positive or negative (e.g., +1 or −1) and outputs the sign ε to integrator 1214. The sign ε can be positive (e.g., ε=+1) if the performance variable J is increasing with respect to the output {circumflex over (x)} or negative (e.g., ε=−1) if the performance variable J is decreasing with respect to the output {circumflex over (x)} of plant model 1220. Filter 1216 provides the sign ε to integrator 1214, according to some embodiments.
Integrator 1214 is configured to receive the sign ε from filter 1216 and use the sign ε to generate the control input u for dynamic plant 1004. Integrator 1214 may integrate the sign ε by multiplying the sign ε by
In some embodiments, the parameter k is adjustable or adaptable to adjust (e.g., increase or decrease) a rate of convergence. Integrator 1214 provides the control input u to control signal generator 1212 and plant model 1220, according to some embodiments.
Plant model 1220 may be a model of the dynamic portion of dynamic plant 1004. For example, plant model 1220 can approximate the dynamic response of dynamic plant 1004. In this way, the output {circumflex over (x)} can indicate or approximate the effect of the plant dynamics on the control input variable u. In some embodiments, plant model 1220 is an approximation of a Wiener system, or is a Wiener system. For example, plant model 1220 can be a transfer function such as the transfer function shown in Equation (13). Plant model 1220 can receive the control input u and determine or generate or output {circumflex over (x)} which is used to determine the gradient
The output {circumflex over (x)} is provided to dynamic gradient estimator 1218.
Dynamic gradient estimator 1218 is configured to receive the output {circumflex over (x)} from plant model 1220 and determine a time derivative, a gradient, a derivative, etc., of the output {circumflex over (x)}. Dynamic gradient estimator 1218 can be configured to use a Savitzky-Golay filter (as shown in Equation (11) below) to numerically calculate the gradient
Control signal generator 1212 is configured to receive the control input u from integrator 1214 and generate control signals for equipment 1006 to implement or perform the control input u at dynamic plant 1004, according to some embodiments. In some embodiments, control signal generator 1212 is configured to provide the control input u to dynamic plant 1004. The control signals or the control input u can be setpoint decisions and may be provided to equipment 1006 to affect a variable condition, an environmental condition, etc., of dynamic plant 1004, or to affect an operation of equipment 1006. In some embodiments, the control signals or the control input u affect the performance variable J. The control input u or the control signals can be provided to equipment 1006 to adjust or affect a rate of heat transfer to/from conditioned space 1102 as driven by equipment 1006. For example, the control input u or the control signals may increase a rate of heat transfer that is provided into conditioned space 1102 by equipment 1006.
Equipment 1006 operates to affect an environmental condition or a performance variable (e.g., a variable state or condition) of conditioned space 1102. Sensors 1008 can be configured to measure the performance variable J or the performance variable or the environmental condition of conditioned space 1102 and provide sensor signals to performance variable gradient estimator 1224. In some embodiments, sensors 1008 are temperature sensors that are configured to measure a temperature within or at conditioned space 1102. Sensors 1008 may provide the measured temperature to performance variable gradient estimator 1224 as the performance variable J (e.g., the performance variable). Performance variable gradient estimator 1224 can then use the measurements or the performance variable J to determine the gradient
Referring particularly to
Referring still to
ESC controller 1002 receives values of the performance variable J from static plant 1005 at performance variable gradient estimator 1224 (e.g., from sensors 1008 of static plant 1005) and determines the gradient
of the performance variable J. In some embodiments, performance variable gradient estimator 1224 is configured to use values of the performance variable J to determine the gradient
and provide the gradient
to gradient manager 1222.
Input gradient manager 1226 is configured to receive the control input u for static plant 100 and output a gradient
of the control input u. In some embodiments, input gradient manager is configured to estimate, obtain, determine, calculate, etc., the gradient
using any of the techniques of performance variable gradient estimator 1224. Input gradient manager 1226 provides the gradient
to gradient manager 1222.
Gradient manager 1222 receives the gradient
from input gradient manager 1226 and the gradient
from performance variable gradient estimator 1224 and determines a gradient
The gradient
is then provided to filter 1216 for determining the sign ε. Integrator 1214 and control signal generator 1212 can then function as described in greater detail above with reference to
Test Results
Referring now to
J(t)=(x(t)−p(t))2 (10)
where p(t) is a step function such that p(t)=0 for t<tstep and p(t)=2 (or any other value) for t≥tstep. This means that, for the case when p(t)=2 for t≥tstep and p(t)=0 for t<tstep, an optimum point (e.g., extremum point 604) starts at (J=0, u=0) and then steps to (J=0, u=2) (test results shown in
The time derivatives (e.g., the gradients
can be calculated numerically using a Savitzky-Golay filter as shown in Equation (11) below:
where {dot over (ŷ)}(t) is the estimated derivative of a variable y (e.g., J and/or {circumflex over (x)}) at time t, y(t−1) is the value of the variable y at time t−1, and y(t+1) is the value of the variable y at time t+1.
The dynamic plant 902 can be represented as a Wiener process. Specifically, the dynamic portion 904 may be represented as:
where τ1 is a first time constant and τ2 is a second time constant.
The dynamic plant 902 can be estimated as shown in Equation (13) below:
where τ1 is the first time constant and τ2 is the second time constant.
Referring now to
In some embodiments, the test results shown in the graphs of
For the test that results in graphs 1400 and 1402 of
Referring particularly to
For a first-order system, phase-lag is given by:
ϕ=tan−1(−ωτ) (14)
according to some embodiments. The lag (delay) added by the input dynamics (e.g., by dynamic portion 904) is therefore a function of the frequency of the oscillation and the time constant of the plant (e.g., plant 902). Longer time constants in the plant will therefore increase the lag and thus also lead to greater amplitude of oscillation about the optimum as shown in
Self-Perturbing Process
Referring now to
Process 1300 includes receiving one or more values of a performance variable (e.g., a performance variable) from a plant (step 1302), according to some embodiments. In some embodiments, step 1302 is performed by ESC controller 1002, or more specifically, performance variable gradient estimator 1224. In some embodiments, the values of the performance variable or the performance variable are obtained by sensors 1008. For example, the values of the performance variable or the performance variable may be temperature values of an indoor temperature, an efficiency of an HVAC unit, a power consumption of the HVAC unit, etc. In some embodiments, the performance variable is referred to as J(t).
Process 1300 includes determining a time derivative of the performance variable or the performance variable (step 1304), according to some embodiments. In some embodiments, the time derivative of the performance variable is a gradient of the performance variable with respect to time,
In some embodiments, step 1304 is performed by performance variable gradient estimator 1224 using the values of the performance variable J. In some embodiments, the time derivative of the performance variable
is determined or obtained by performance variable gradient estimator 1224 using a Savitzky-Golay filter or any other numerical technique to determine a derivative based on sampled values.
Process 1300 includes estimating an effect of a dynamic portion of the plant for an input using a model of the dynamic portion of the plant (step 1306), according to some embodiments. In some embodiments, the effect of the dynamic portion of the plant is estimated using dynamic estimation block 906. In some embodiments, the effect of the dynamic portion of the plant is estimated using inverse dynamic estimation block 1602. In some embodiments, step 1306 is performed by or using plant model 1220 of ESC controller 1002. In some embodiments, the estimation of the effect of the dynamic portion of the plant is referred to as {circumflex over (x)}(t).
Process 1300 includes determining a time derivative of the effect of the dynamic portion of the plant (step 1308), according to some embodiments. In some embodiments, the time derivative of {circumflex over (x)}(t) is a gradient of {circumflex over (x)}(t) with respect to time. In some embodiments, step 1308 is performed by dynamic gradient estimator 1218. In some embodiments, the time derivative of the effect {circumflex over (x)}(t) of the dynamic portion of the plant is referred to as
In some embodiments, the time derivative of the effect {circumflex over (x)}(t) is calculated or determined using a Savitzky-Golay filter or any other numerical technique.
Process 1300 includes determining a gradient of the performance variable with respect to the effect of the dynamic portion of the plant (step 1310), according to some embodiments. In some embodiments, the gradient of the performance variable with respect to the effect of the dynamic portion of the plant is referred to as
In some embodiments, step 1310 is performed by gradient manager 1222. In some embodiments, gradient manager 1222 receives the gradient
and the gradient
and divides the gradient
by the gradient
to obtain the gradient
Process 1300 includes determining a sign of the gradient of the performance variable with respect to the effect of the dynamic portion of the plant (step 1312), according to some embodiments. In some embodiments, the sign of the gradient
expresses a relationship between the performance variable/output of the plant and the effect of the dynamic portion of the plant. In some embodiments, the sign of the gradient
is referred to as ε(t). In some embodiments, step 1312 is performed by filter 1216.
Process 1300 includes integrating and multiplying the sign ε by a gain factor k to determine the control input u (step 1314), according to some embodiments. In some embodiments, step 1314 is performed by integrator 1214. Integrator 1214 may integrate and multiple the sign ε by the gain factor k to determine the control input u for the plant.
Process 1300 includes providing the control input to the plant to affect the performance variable (step 1316), according to some embodiments. In some embodiments, step 1316 includes generating control signals for equipment of the plant that operate to effect the performance variable. In some embodiments, step 1316 is performed by control signal generator 1212 and equipment 1006. In some embodiments, equipment 1006 of dynamic plant 1004 operates to effect conditioned space 1102 or to effect the performance variable J of dynamic plant 1004, which can be detected or measured by sensors 1008.
Self-Perturbing Extremum-Seeking Control with Holding Relay
Referring now to
Referring particularly to
For example, plant 1806 may be a non-linear plant that is defined or described as:
{dot over (x)}=ƒ(x,u) (15)
y=h(x) (16)
where x is an n-dimensional state variable, u is the control input of plant 1806, y is a performance variable of plant 1806, ƒ is a function that relates x and u to {dot over (x)}, and h is a function that relates x to y.
It can be assumed that there is a differentiable function l:R→Rn such that:
ƒ(x,u)=0, if ƒx=l(u) (17)
according to some embodiments. In some embodiments, a performance variable function J is defined as:
J(u)=h(l(u)) (18)
where it is assumed that y≈J(u) if x≈l(u). Assuming that the equilibrium x=l(u) for each u or each value of u in the system of Equation (15) is globally stable, and that J(u) is the static map, it can further be assumed that there exists a unique u* that minimizes J where:
J′(u*)=0 (19)
J″(u*)>0 (20)
J′(u*+δ)(δ>0,∀δ≠0 (21)
according to some embodiments. The assumptions of Equations (18)-(21) shown above ensure that there is a locally unique minimum (or maximum, if considering a concave function) and that this is attainable as a stable point in the input space. It is also assumed that u, and correspondingly J(u), is bounded and locally Lipschitz continuous.
Referring generally to
The parameter τp can be considered analogous to a dominant open loop time constant of the plant 1806 and could be estimated from an open loop step test or from prior knowledge. In some embodiments, various other parameters of the self-perturbing extremum-seeking control system can be configured from this parameter.
The second parameter is the range of the performance variable, Jr, which is an estimate of the difference between the maximum and minimum expected performance variable values:
Jr−Jmax−Jmin (22)
according to some embodiments. In some embodiments, the second parameter Jr is used for normalization in the adaptive loop that adjusts a relay gain.
Referring again to
Plant 1806 outputs values of the performance variable y which may be obtained in discrete-time by a sensor, a collection of sensors, etc. In some embodiments, the obtaining of the values of the performance variable y in discrete-time is represented by discrete sampler 1808. In some embodiments, the performance variable y is sampled at a regular time interval Δt. In some embodiments, it is assumed that plant 1806 is perturbed at a slow enough frequency so that y≈J(u). In some embodiments, the output of discrete sampler 1808 is a discrete value Jk at a timestep k of the performance variable J. The discrete value Jk is then provided to a derivative estimator 1810, according to some embodiments. Derivative estimator 1810 can be configured to calculate a time derivative of the discrete value Jk of the performance variable J, shown as In some embodiments, derivative estimator 1810 is the same as or similar to performance variable gradient estimator 1224 and may be configured to use any of the techniques described herein to determine, estimate, calculate, obtain, etc., a time derivative of a signal (e.g., a Savitzky-Golay filter).
In some embodiments, the time derivative of the discrete value is multiplied by a previous value of an output of the relay 1818 multiplied by the gain M0 (e.g., an output of the relay 1818 at a timestep k−1, multiplied by the gain M0 at gain block 1816) at multiplier 1812, and this signal is used to switch the relay 1818.
Relay 1818 includes a sign block 1822 that is configured to convert the input signal (e.g., the output of multiplier 1812) to a binary value of either +1 or −1 and a hold block 1820 that prevents relay 1818 from switching to a new state (e.g., switching from +1 to −1 or switching from −1 to +1) until a predetermined amount of time Δh has elapsed that exceeds a threshold τp (e.g., the first parameter). In some embodiments, the threshold τp is a dominant time constant of the plant. By holding the output of sign block 1822 (i.e., the sign of the gradient) at a positive state or negative state for at least the threshold τp, hold block 1820 may hold or slow a frequency of the perturbation signal thereby creating time scale separation to reduce the effect of the dynamics of dynamic plant 1806. This mitigates or reduces an effect of a dynamic portion of plant 1806 on the performance variable J and consequently any gradient based on the performance variable J.
Referring particularly to
Referring again to
Other ESC control systems use demodulation or perform regressions on J and u (e.g., the performance variable with respect to the input of the plant). However, using relay 1818 facilitates a simple technique to estimate the gradient based on the time derivative of the performance variable function, according to some embodiments. The gradient, G, is obtained via application of the chain rule:
according to some embodiments. Since the self-perturbing extremum-seeking control system shown in
which reduces to:
sgn[G]=sgn[j]sgn[{dot over (u)}] (25)
according to some embodiments. The gradient G is determined at block 1812 using any of Equations (23)-(25) shown above. For example, block 1812 may be a multiplication block, a division block, or any other operation block that outputs a first type of value (e.g., a positive value) if {dot over (j)} and {dot over (u)} have a same sign (e.g., both are positive or both are negative), and outputs a second type of value (e.g., a negative value) if {dot over (j)} and {dot over (u)} have different signs (e.g., one is positive and the other is negative or vice versa).
As shown in
sgn[G]k=sgnsgn[Δuk-1] (26)
where sgn[G]k is the sign of the performance variable function at timestep k, sgn is the sign of the time derivative of the performance variable J at timestep k, and sgn[Δuk-1] is the sign of the control input u at a previous timestep k−1, according to some embodiments. In some embodiments, the gradient G can be estimated from the sign of the time derivative of the performance variable J and the previous value of Δu.
In some embodiments, the derivative estimator 1810 is configured to use a Savitzky-Golay method to estimate the derivative, which is more resistant to noise and is based on a moving window of samples:
where ci are fixed coefficients and m is a window size. In some embodiments, the window size m determines the estimation time span which is shorter than the time constant τp of the plant 1806. For example, the window size m may be: m=τp/n where 2≤n≤10 (as a suggested or possible range for n). Since the self-perturbing extremum-seeking control system shown in
The estimated gradient G represents the gradient of the static non-linear map of plant 1806, and thus a rate of change in u(t) must be sufficiently slow so that the dynamics of plant 1806 are negligible, according to some embodiments. In some embodiments, the excitation of plant 1806 is limited to low frequency excitation to ensure that the dynamics of plant 1806 are negligible.
In some embodiments, a hold time is imposed on the relay 1818 (e.g., the hold block 1820) so that hold block 1820 of relay 1818 holds its current state for a minimum time period. For example, if the hold time in hold block 1820 is set to τp, then the maximum frequency may be limited to
Using relay 1818 may result in persistent excitation of plant 1806, thereby providing a self-perturbing extremum-seeking control system and reducing or removing the need for an artificially produced dither or excitation signal.
The output ∈ of the relay 1818 represents the sign of the gradient G, according to some embodiments. In some embodiments, the sign ∈ of the gradient G is scaled by a gain M0 (at gain block 1816) before being integrated at integrator 1802 and transformed into a continuous-time signal at ZOH block 1804. The continuous-time signal u is then provided to plant 1806 as the control input. The output of gain block 1816
is then provided to multiplier 1812 before being delayed at delay block 1814.
In some embodiments, the operational range of the control input u is bounded 0≤u≤1. The gain parameter M0 may be based on the first parameter τp to limit a rate of change of the plant control input u and maintain time-scale separation. For example, the value of the gain parameter M0 is:
according to some embodiments. Using the value of the gain parameter M0 as shown in Equation (28) above would limit the rate of change of the control input u for plant 1806 so that it would take 10 time constants τp to move the control input u across its entire range. For the value of the gain parameter M0 shown in Equation (28) above, the amplitude of the control input signal u perturbation would be 10% of its range. In some embodiments, if the plant 1806 reaches a natural limit cycle that exceeds the period enforced by hold time of relay 1818 or hold block 1820, the amplitude of the perturbation may increase and this may indicate that the value of τp should be increased.
Advantageously, the self-perturbing extremum-seeking control system shown in
Adaptive Gain Loop
Referring particularly to
The value of the gain parameter M0 may be adjusted based on a state variable x and an incremental or small value δ. For example, system 1900 may output an updated value Mk of the gain parameter M0:
Mk=M0(|x|+δ) (29)
according to some embodiments. The state variable x may be an estimation Ĝ of the gradient G. An absolute value of the estimation of the gradient G can be obtained by substituting ∈Mk (the sign of the gradient G multiplied by the updated gain parameter Mk), and applying the chain rule:
according to some embodiments. It should be noted that the sign ∈ drops out of Equation (31) since |∈|=1, according to some embodiments. A normalized version {dot over ({tilde over (J)})} of the estimated time derivative {dot over (Ĵ)} is calculated using the range Jr:
according to some embodiments. Substituting {dot over ({tilde over (J)})} for {dot over (Ĵ)} in Equation (32) yields a normalized gradient {tilde over (G)} (since the bounded control input u is also normalized):
according to some embodiments.
The updated gain parameter Mk can be formulated at a sample k using the following Equation:
where Jr is the range (e.g., expressed in Equation (22) above), |{dot over (Ĵ)}| is an absolute value of the estimate of the time derivative of the performance variable, and |Mk-1| is an absolute value of a previous value (e.g., at timestep k−1) of the updated gain parameter, according to some embodiments.
Adjusting the gain parameter Mk is performed by multiplying the sign ∈ of the gradient G by the magnitude of the normalized gradient {tilde over (G)}. For example, the control input u may be given by:
uk=uk-1+ΔtM0sgn[Gk][{tilde over (G)}k+δ] (37)
uk=uk-1+ΔtM0{tilde over (G)}k, for δ→0 (38)
where uk-1 is the control input to the plant at a previous timestep (e.g., at k−1), Δt is a time duration or a time interval (e.g., between subsequent time steps), and {tilde over (G)}k is the normalized gradient at timestep k. Equation (38) shown above is therefore a gradient-descent procedure that has a fixed step size of ΔtM0. Convergence may occur if the step size satisfies
where L is a Lipschitz constant. In some embodiments, this can be verified to be true using a general class of static non-linear functions H(u):
H(u)=dc+b (39)
d=|u−u*| (40)
where 0≤u*≤1 is the optimal point in the input space, c∈ and b is a constant. The range of the performance variable defined in Equation (39) can be deduced to Hr=Hmax−Hmin=dmaxc since Hmax=dmaxc+b and Hmin=b.
The Lipschitz constant that corresponds to the maximum (magnitude) of the derivative of H(u) normalized by the performance variable function range Hr within the bounded control input range is:
where 0.5≤dmax≤1.
Therefore, from a gradient-descent convergence standpoint, it should be ensured that:
according to some embodiments.
For an archetypal system where dmax=0.5 (the optimal point is in the middle of the bounded input range) and c=2 (e.g., locally quadratic), it follows that
For the value of the gain parameter M0 shown in Equation (28) above:
where the sample period Δt<<τp to ensure sufficient data are available to accurately estimate the time derivative, according to some embodiments.
In some embodiments, an optimum step size may depend on a location of the optimum control input u* in the bounded input range and also on the convexity of the static non-linearity of plant 1806.
Referring still to
(the output of division block 1910) is then divided by the range Jr (e.g., by multiplying
by 1/Jr at range gain block 1908) to obtain
The estimated range of J and the adjustable gain parameter Mk are bounded by saturation block 1906 such that:
according to some embodiments. In some embodiments, saturation block 1906 outputs a value of 1 if
is equal to or greater than 1, and outputs a value of 0 if
is less than or equal to 0.
The output of the saturation block 1906 is then provided to summation block 1904, where the incremental amount δ is added, according to some embodiments. Saturation block 1906 may cause the condition δ|M0|≤|Mk|≤(1+δ)|M0| to be true. In some embodiments, adding the incremental amount δ to the output of saturation block 1906 facilitates enforcing a minimum gain so that persistent excitation of the plant is maintained (e.g., so that the adjustable gain parameter Mk does not become zero). The incremental amount or value δ also provides a minimum amplitude of variation in the plant control input u of δM0τp and can thus be set according to tolerable minimum variations.
In some embodiments, the output of the saturation block 1906 is
and the incremental value or amount δ is added to
at summation block 1904 such that summation block 1904 outputs
In some embodiments, the output of summation block 1904 is then provided to gain block 1816, where it is multiplied by the gain parameter M0 to obtain
which is equal to the adjustable gain parameter Mk (shown in Equation (36) above). In some embodiments, the adjustable gain parameter Mk at timestep k is then provided to delay block 1914, and used for subsequent calculations of the adjustable gain parameter (e.g., to calculate the adjustable gain parameter at a next timestep k+1). In some embodiments, the adjustable gain parameter Mk is provided to an optional absolute value block 1912 before being provided to division block 1910 for use in determining or calculating the next value of the adjustable gain parameter Mk.
Self-Perturbing Extremum-Seeking Control with Holding Relay and Adaptive Gain Loop
Referring now to
Advantageously, the self-perturbing extremum-seeking control system shown in
In some embodiments, in the calculation of {dot over (Ĵ)}, the window size m (as shown in Equation (27) above) can be set to a fraction of the plant time constant τp such as:
and 2≤n≤10. It should be understood that the incremental amount δ may also be selected (e.g., δ=0.01, δ=0.1, etc.) as any value that is greater than 0 (e.g., by a very small amount) to facilitate constant or persistent excitation of plant 1806.
Advantageously, the self-perturbing extremum-seeking control system shown in
Test Results
Referring particularly to
J(t)=P0(u(t)−p(t))2 (48)
where a default value of the parameter P0 is 1. In some embodiments, the static non-linearity of the plant is preceded by a dynamic part with a linear transfer function as shown in Equation (49) below:
where τ1 and τ2 are time constants of the plant (e.g., such that the plant in the simulation is second-order). For the simulation test, the first time constant has a value τ1=100 s and the second time constant has a value τ2=30 s. Second order dynamics of this sort may be typical in real-world HVAC systems. The parameter τp used to characterize the dynamics is τp=τ1+τ2=130 s, according to some embodiments.
In a first test, p(t) is a step function where p(t)=0 for t<tstep and p(t)=−3 for t≥tstep. The optimum point starts at (J=0, u=0) and then steps to (J=0, u=−3), according to some embodiments. For the first test, the range of the plant control input u is −5≤u≤5 and a unity output from the extremum-seeking control system was mapped onto this range. The range of J is Jr=25 and the M0 parameter was set to −1/(10τp) for the first test.
For a baseline comparison, a standard demodulation extremum-seeking control system with an artificially dither signal sin(ωt) was used. A dither amplitude of 5% of the input range with a period of 5τp was used.
Referring particularly to
As shown in
Referring particularly to
Graph 2200 includes series 2204 which illustrates the optimum control input u* with respect to time (e.g., changing over time according to steps, ramps, etc.). Graph 2200 also includes series 2208 which illustrates the control input u of the self-perturbing extremum-seeking control system 2000 of
Referring particularly to
Graph 2300 includes a series 2302 that illustrates the optimum control input u* over time, a series 2304 that illustrates the control input u for the dither-based ESC system, and a series 2306 that illustrates the control input u for the self-perturbing extremum-seeking control system 2000 of
Referring particularly to
The coolant loop 2504 includes evaporator 2518, a compressor 2522, a condenser 2520, and an expansion valve 2516, fluidly coupled with each other through conduit. Compressor 2522 is configured to drive a coolant through the coolant loop 2504 (e.g., through the condenser 2520, the expansion valve 2516, and then the evaporator 2518).
In some embodiments, air-conditioning system 2500 includes an extremum-seeking controller 2526 that is configured to implement the self-perturbing extremum-seeking control system 2000 of
The evaporator 2518 is one component of a basic vapor compression cycle air-conditioning system. Air flowing over fins of the evaporator 2518 transfer heat to the refrigerant flowing through tubes of the evaporator 2518, causing the liquid refrigerant of the coolant loop 2504 to evaporate. Refrigerant in a vapor state is then passes through compressor 2522, where it is compressed to a higher pressure and temperature before passing through condenser 2520. In the condenser 2520, the refrigerant rejects heat to the ambient air as it is condensed back to a liquid state. The refrigerant is then passed through expansion valve 2516 where the pressure decreases according to an isenthalpic process.
In some embodiments, air-conditioning system 2500 includes three PI controllers. One PI controller is used to modulate the expansion valve 2516 to maintain the superheat temperature (defined as a temperature different between the refrigerant and its saturation temperature) on a suction side of compressor 2522 above zero to prevent liquid refrigerant from entering compressor 2522. A second PI controller modulates the speed of compressor 2522 to maintain the supply air temperature Tsa at the setpoint rsa. A third PI controller controls a speed of the fan 2514 to satisfy the setpoint temperature in the zone 2512. The airflow rate over the condenser 2520 is typically controlled to prevent excessive compressor discharge pressure. In the test performed herein, the airflow rate is set to a constant value.
A dynamic model of air-conditioning system 2500 was developed and tests are performed to generate graphs 2600 and 2700 shown in
Referring particularly to
Referring particularly to
For the test results shown in graph 2700, the time constant τp of the system 2000 is set to 800 seconds and the minimum and maximum values of the manipulated variable u (the supply air temperature setpoint) are 7 and 15 degrees Celsius. The self-perturbing extremum-seeking control system 2000 of
The simulation is initialized with a supply air temperature setpoint rsa=8° C. For the first hour of the simulation, the self-perturbing extremum-seeking control system 2000 of
Dither-Based ESC System
Referring particularly to
ESC controller 2920 obtains values of the performance variable J from the plant 2902, according to some embodiments. In some embodiments, ESC controller 2920 provides a disturbance d (t) to the values of the performance variable J obtained from the plant 2902 at disturbance block 2904. In some embodiments, the performance variable J(t) that is obtained by ESC controller 2920 includes a disturbance d (t). The performance variable J(t) is then provided to transformation block 2906, according to some embodiments. The performance variable multiplied by the transfer function of transformation block 2906 is then multiplied or excited by the dither signal sin(ωt) at multiplication block 2920. The signal is then provided to transformation block 2908 and integrator 2910, according to some embodiments. The signal is then added to the dither signal a sin(ωt) at summation block 2912 to generate the input u(t), according to some embodiments. The test results shown in
Adaptive Gain ESC Controller
Referring particularly to
Adaptive gain ESC controller 3004 is shown to include a processing circuit 3006 including a processor 3008 and memory 3010. Processing circuit 3006 can be communicably connected to a communications interface such that processing circuit 3006 and the various components thereof can send and receive data via the communications interface. Processor 3008 can be implemented as a general purpose processor, an application specific integrated circuit (ASIC), one or more field programmable gate arrays (FPGAs), a group of processing components, or other suitable electronic processing components.
Memory 3010 (e.g., memory, memory unit, storage device, etc.) can include one or more devices (e.g., RAM, ROM, Flash memory, hard disk storage, etc.) for storing data and/or computer code for completing or facilitating the various processes, layers and modules described in the present application. Memory 3010 can be or include volatile memory or non-volatile memory. Memory 3010 can include database components, object code components, script components, or any other type of information structure for supporting the various activities and information structures described in the present application. According to some embodiments, memory 3010 is communicably connected to processor 3008 via processing circuit 3006 and includes computer code for executing (e.g., by processing circuit 3006 and/or processor 3008) one or more processes described herein.
In some embodiments, adaptive gain ESC controller 3004 is implemented within a single computer (e.g., one server, one housing, etc.). In various other embodiments adaptive gain ESC controller 3004 can be distributed across multiple servers or computers (e.g., that can exist in distributed locations). Although several controllers are shown and described as separate devices throughout the present disclosure, it should be understood that any or all of these controllers can be combined into a single device or consolidated into a smaller number of devices configured to perform any or all of the functions described with respect to the individual controllers. Similarly, it should be understood that the functions described with respect to these controllers can be distributed across multiple controllers, servers, cloud computing devices, or any other type of computing devices and are not necessarily performed by physical controllers (e.g., field controllers) in a building management system. All of these possibilities are within the scope of the present disclosure.
Referring still to
Referring still to
Derivative estimator 3014 is configured to obtain one or more values of the performance variable J from plant 3002. Derivative estimator 3014 may determine, estimate, calculate, etc., a time derivative {dot over (J)} of the performance variable J using any of the techniques described herein. For example, derivative estimator 3014 may use a Savitzky-Golay technique, or any other derivative estimation technique to obtain the time derivative {dot over (J)} of the performance variable J.
Derivative estimator 3014 provides the time derivative {dot over (J)} of the performance variable J to both gradient estimator 3016 and ESC controller 3012, according to some embodiments. In some embodiments, derivative estimator 3014 is configured to perform the functionality of derivative estimator 1810 as described in greater detail above with reference to
Gradient estimator 3016 can be configured to perform the functionality of division block 1910 to determine, estimate, calculate, obtain, the gradient G. For example, the gradient G can be obtained by dividing the derivative {dot over (J)} of the performance variable J by the previous value of the adaptive gain parameter Mk-1 or by dividing the gradient G by the previous value of the control input uk-1. It should be understood that since adaptive gain ESC controller 3004 may use only the sign of the gradient G, the previous value of the adaptive gain parameter Mk-1 and the derivative {dot over (J)} of the performance variable J may be multiplied instead of divided to obtain the gradient G. Likewise, gradient estimator 3016 may be configured to perform any other operation so that the gradient G has a first sign (e.g., a positive sign) if the previous value of the adaptive gain parameter Mk-1 and the derivative {dot over (J)} of the performance variable J have the same sign (e.g., both positive or both negative) and a second sign (e.g., a negative sign) if the previous value of the adaptive gain parameter Mk-1 and the derivative {dot over (J)} of the performance variable J have different signs (e.g., one has a positive sign and the other has a negative sign). It should be understood that any of the description of gradient estimator 3016 described herein may use the previous control input uk-1 in place of the previous value of the adaptive gain parameter Mk-1.
Gradient estimator 3016 provides the gradient G to gradient normalizer 3018, according to some embodiments. In some embodiments, gradient normalizer 3018 is configured to use a range Jr of the performance variable J (e.g., shown in
Gradient normalizer 3018 may output the normalized gradient G to gain generator 3020. In some embodiments, gain generator 3020 is configured to perform any of the functionality of saturation block 1906, summation block 1904, and gain block 1816 as described in greater detail above with reference to
In some embodiments, the adaptive gain parameter Mk is a magnitude or an absolute value of a rate of change of the control input uk. For example, as shown in
is equal to the adaptive gain parameter Mk multiplied by the sign ∈k. In this way, the adaptive gain parameter Mk defines an amount to increase or decrease (e.g., adjust) the control input u between consecutive steps.
In some embodiments, the adaptive gain parameter Mk changes with respect to changes in the performance variable J or in response to changes in the derivative {dot over (J)} of the performance variable J. For example, the adaptive gain parameter Mk is proportional to the normalized gradient {tilde over (G)} multiplied by the gain parameter M0, according to some embodiments. As the performance variable J or the control input u approach an optimal point (e.g., a local minima or a local maxima), the adaptive gain parameter Mk may decrease, thereby decreasing a rate of convergence of ESC controller 3012, but reducing oscillations about the optimal point. Likewise, the adaptive gain parameter Mk may increase with respect to increased distance between the performance variable J or the control input u and the optimal point (e.g., the local minima or the local maxima), thereby resulting in faster convergence (e.g., when the optimization is first initiated).
Relay-Based Self-Perturbing ESC Process
Referring particularly to
Process 2800 includes obtaining a value of a performance variable J from a plant (step 2802), according to some embodiments. In some embodiments, the performance variable J is a value of an output of the plant such as power consumption, efficiency, coefficient of performance, etc. Step 2802 can include using a sensor measurement to obtain the value of the performance variable J. For example, the value of the performance variable J may be obtained from a sensor at building equipment, from a building management system, etc.
Process 2800 includes estimating a time derivative {dot over (J)} of the performance variable J (step 2804), according to some embodiments. In some embodiments, the time derivative {dot over (J)} of the performance variable J is estimated by ESC controller 1002 and/or ESC controller 1850. In some embodiments, the time derivative {dot over (J)} of the performance variable J is estimated using a Savitzky-Golay method or filter. In some embodiments, the time derivative {dot over (J)} of the performance variable J is obtained or estimated by derivative estimator 1810.
Process 2800 includes obtaining a sign ∈ of a gradient G using the time derivative {dot over (J)} of the performance variable J and a time derivative {dot over (u)} of a previous control input u of the plant (step 2806), according to some embodiments. In some embodiments, step 2806 includes estimating the gradient G by dividing the derivative {dot over (J)} performance variable J by the derivative {dot over (u)} of the control input u. The gradient G may be a rate of change of the performance variable J with respect to changes in the control input u. In some embodiments, the sign ∈ of the gradient G is obtained by sign block 1822. In some embodiments, the sign ∈ is a binary value (e.g., +1 or −1) indicating whether the gradient G is increasing or decreasing. Step 2806 may be performed by ESC controller 1850 or by ESC controller 1002.
Process 2800 includes using a relay to hold the sign ∈ of the gradient G for a predetermined time period (step 2808), according to some embodiments. In some embodiments, the relay is relay 1818 or hold block 1820. Step 2808 can be performed by ESC controller 1850 and/or ESC controller 1002. The sign ∈ may be held by the relay for a time period Δt until a next or subsequent time step.
Process 2800 includes obtaining a time derivative {dot over (u)} of the control input u for the plant using an adaptive gain parameter Mk and the sign ∈ of the gradient G (step 2810), according to some embodiments. In some embodiments, the time derivative {dot over (u)} of the control input u is obtained by multiplying the sign ∈ by the adaptive gain parameter Mk. Step 2810 can be performed by ESC controller 1002 and/or by ESC controller 1850.
Process 2800 includes obtaining the control input u for the plant using the time derivative {dot over (u)} of the control input u (step 2812), according to some embodiments. In some embodiments, step 2812 includes integrating the time derivative {dot over (u)} to obtain the control input u. For example, step 2812 may be performed at integrator block 1802 and ZOH block 1804. In some embodiments, step 2812 is performed by ESC controller 1002 and/or ESC controller 1850.
Process 2800 includes providing the control input u to the plant to affect a variable state or condition of the plant and/or the performance variable J (step 2814), according to some embodiments. In some embodiments, step 2814 includes providing the control input u to plant 1806 or includes generating control signals for equipment of the plant.
Advantageously, process 2800 can be performed to provide self-perturbing extremum-seeking control for a plant without requiring an estimation of plant dynamics, and without requiring a model of the plant. Process 2800 utilizes a relay to hold a sign for a predetermined amount of time so that the value(s) of the performance variable J is/are obtained without requiring an artificial dither signal.
In some embodiments, the adaptive gain parameter Mk is obtained by performing an adaptive gain parameter process. The adaptive gain parameter Mk can be obtained by performing the functionality of system 1900. For example, the adaptive gain parameter Mk can be obtained using a range Jr of a performance variable J of the plant. The adaptive gain parameter Mk can be obtained using a current timestep value of an estimate of a time derivative of the performance variable, a previous value of the adaptive gain parameter (e.g., Mk-1), the range Jr, an incremental amount δ, and a non-adaptive gain parameter M0. In some embodiments, the range Jr is a difference between a maximum expected value Jmax of the performance variable J and a minimum expected value Jmin of the performance variable J (e.g., Jr=Jmax−Jmin).
Adaptive Gain Parameter Process
Referring particularly to
Process 3100 includes obtaining one or more values of a performance variable J from a plant (step 3102), according to some embodiments. In some embodiments, step 3102 is performed by derivative estimator 3014 of memory 3010 of adaptive gain ESC controller 3004. Step 3102 can include obtaining the values of the performance variable J from sensors, equipment, etc., of plant 3002.
Process 3100 includes estimating a time rate of change of the performance variable J (step 3104), according to some embodiments. In some embodiments, the time rate of change of the performance variable J is referred to as {dot over (J)}. The time rate of change of the performance variable J can be obtained by derivative estimator 3014 using any of the analytical techniques described herein.
Process 3100 includes determining a gradient G of the performance variable J with respect to a previous value of an adaptive gain parameter Mk-1 of the plant (step 3106), according to some embodiments. In some embodiments, step 3106 is performed by gradient estimator 3016 using the previous value of the adaptive gain parameter Mk-1 and the time rate of change of the performance variable J.
Process 3100 includes normalizing the gradient G with respect to an expected or estimated range Jr of the performance variable J (step 3108), according to some embodiments. In some embodiments, the gradient G is normalized so that the normalized gradient G is a value between 0 and 1. The normalized gradient {tilde over (G)} may be obtained by dividing the gradient G by the estimated range Jr of the performance variable J. In some embodiments, step 3108 is performed by gradient normalizer 3018.
Process 3100 (optionally) includes incrementally increasing the normalized gradient G (step 3110), according to some embodiments. In some embodiments, step 3110 is performed by gain generator 3020. The normalized gradient {tilde over (G)} can be incrementally increased to ensure that the ESC step of process 3100 (e.g., step 3114) does not stagnate and maintains some degree of excitation.
Process 3100 includes determining a value (e.g., a current value) of the adaptive gain parameter Mk using a gain parameter M0 and the normalized gradient (step 3112), according to some embodiments. In some embodiments, step 3112 is performed by gain generator 3020 using any of the techniques or functionality as described in greater detail above with reference to
Process 3100 includes using the adaptive gain parameter Mk and a sign of the gradient G to adjust or generate a control input for the plant (step 3114), according to some embodiments. In some embodiments, step 3114 is performed by ESC controller 3012 using the adaptive gain parameter Mk. For example step 3114 can be performed by using the adaptive gain parameter Mk in a relay-based self-perturbing extremum-seeking control system (e.g., as shown in
Process 3100 includes operating the equipment of the plant using the control input to affect the performance variable J of the plant (step 3116), according to some embodiments. In some embodiments, step 3116 is performed by providing the control input to equipment of the plant that operate to affect the performance variable J or a measured variable y of the plant.
The construction and arrangement of the systems and methods as shown in the various exemplary embodiments are illustrative only. Although only a few embodiments have been described in detail in this disclosure, many modifications are possible (e.g., variations in sizes, dimensions, structures, shapes and proportions of the various elements, values of parameters, mounting arrangements, use of materials, colors, orientations, etc.). For example, the position of elements may be reversed or otherwise varied and the nature or number of discrete elements or positions may be altered or varied. Accordingly, all such modifications are intended to be included within the scope of the present disclosure. The order or sequence of any process or method steps may be varied or re-sequenced according to alternative embodiments. Other substitutions, modifications, changes, and omissions may be made in the design, operating conditions and arrangement of the exemplary embodiments without departing from the scope of the present disclosure.
The present disclosure contemplates methods, systems and program products on any machine-readable media for accomplishing various operations. The embodiments of the present disclosure may be implemented using existing computer processors, or by a special purpose computer processor for an appropriate system, incorporated for this or another purpose, or by a hardwired system. Embodiments within the scope of the present disclosure include program products comprising machine-readable media for carrying or having machine-executable instructions or data structures stored thereon. Such machine-readable media can be any available media that can be accessed by a general purpose or special purpose computer or other machine with a processor. By way of example, such machine-readable media can include RAM, ROM, EPROM, EEPROM, CD-ROM or other optical disk storage, magnetic disk storage or other magnetic storage devices, or any other medium which can be used to carry or store desired program code in the form of machine-executable instructions or data structures and which can be accessed by a general purpose or special purpose computer or other machine with a processor. Combinations of the above are also included within the scope of machine-readable media. Machine-executable instructions include, for example, instructions and data which cause a general purpose computer, special purpose computer, or special purpose processing machines to perform a certain function or group of functions.
Although the figures show a specific order of method steps, the order of the steps may differ from what is depicted. Also two or more steps may be performed concurrently or with partial concurrence. Such variation will depend on the software and hardware systems chosen and on designer choice. All such variations are within the scope of the disclosure. Likewise, software implementations could be accomplished with standard programming techniques with rule based logic and other logic to accomplish the various connection steps, processing steps, comparison steps and decision steps.
Number | Name | Date | Kind |
---|---|---|---|
20150277444 | Burns | Oct 2015 | A1 |
20180299839 | Salsbury | Oct 2018 | A1 |
Number | Date | Country |
---|---|---|
106447136 | Feb 2017 | CN |
Entry |
---|
Olalla et al. Analysis and Comparison of Extremum Seeking Control Techniques, Jun. 2007, pp. 1-6 (Year: 2007). |
Kumar et al. Extremum Seeking Control for Multi-Objective Optimization Problems, Dec. 2016, pp. 1-7 (Year: 2016). |
Fu et al. Variable Structure Extremum Seeking Control Based on Sliding Mode Gradient Estimation for a Class of Nonlinear Systems, Jun. 2009, pp. 1-6 (Year: 2009). |
Number | Date | Country | |
---|---|---|---|
20210181703 A1 | Jun 2021 | US |