The present disclosure relates generally to an extremum-seeking control (ESC) system. ESC is a class of self-optimizing control strategies that can dynamically search for the unknown and/or time-varying inputs of a system for optimizing a certain performance index. ESC can be considered a dynamic realization of gradient searching through the use of dither signals. The gradient of the system output y with respect to the system input u can be obtained by slightly perturbing the system operation and applying a demodulation measure. Optimization of system performance can be obtained by driving the gradient towards zero by using a negative feedback loop in the closed-loop system. ESC is a non-model based control strategy, meaning that a model for the controlled system is not necessary for ESC to optimize the system.
One of the drawbacks of using a gradient descent is that it requires tuning of the ESC based on the plant. What is needed is an ESC system which can operate without requiring tuning, and can operate despite noise which may be present in performance variable signals.
One implementation of the present disclosure is a method for performing extremum-seeking control of a plant, according to some embodiments. In some embodiments, the method includes determining multiple values of a correlation coefficient that relates a control input provided as an input to the plant to a performance variable that characterizes a performance of the plant in response to the control input. In some embodiments, the performance variable includes a noise-free portion and an amount of noise. In some embodiments, the method includes determining an adjusted correlation coefficient by scaling a first value of the correlation coefficient selected from the multiple values relative to a second value of the correlation coefficient selected from the multiple values. In some embodiments, the adjusted correlation coefficient relates the noise-free portion of the performance variable to the control input. In some embodiments, the method includes using the adjusted correlation coefficient to modulate the control input provided as an input to the plant.
In some embodiments, the first value of the correlation coefficient is determined based on a covariance of the control input and the performance variable, a variance of the control input, and a variance of the performance variable.
In some embodiments, the second value of the correlation coefficient is determined based on a first filtered value of multiple estimated values of the correlation coefficient and a forgetting factor.
In some embodiments, the method further includes calculating the forgetting factor based on a dominant time constant of the plant.
In some embodiments, the method further includes scaling the adjusted correlation coefficient to dampen a convergence rate of the control input toward an optimum value as the correlation coefficient approaches zero.
In some embodiments, the amount of noise in the performance variable causes the first value of the correlation coefficient to underrepresent a true correlation between the performance variable and the control input. In some embodiments, the adjusted correlation coefficient reduces an impact of the amount of noise in the performance variable and more accurately represents the true correlation relative to the first value of the correlation coefficient.
In some embodiments, using the adjusted correlation coefficient to modulate the control input includes determining a new value of the control input based on a previous value of the control input, the adjusted correlation coefficient, and a controller gain value.
In some embodiments, the method further includes selecting an extremum of the multiple values of the correlation coefficient as the second value of the correlation coefficient.
In some embodiments, the method further includes adjusting the second value of the correlation coefficient by causing the second value of the correlation coefficient to decay toward zero over time.
In some embodiments, the correlation coefficient is at least one of a gradient of the performance variable with respect to the control input or a normalized indication of the gradient.
Another implementation of the present disclosure is an extremum-seeking controller including a processing circuit. In some embodiments, the processing circuit is configured to determine multiple values of a gradient that relates a control input provided as an input to a plant to a performance variable that characterizes a performance of the plant in response to the control input. In some embodiments, the performance variable includes a noise-free portion and an amount of noise. In some embodiments, the processing circuit is further configured to determine an adjusted gradient by scaling a first value of the gradient selected from the multiple values relative to a second value of the gradient selected from the multiple values. In some embodiments, the adjusted gradient relates the noise-free portion of the performance variable to the control input. In some embodiments, the processing circuit is configured to use the adjusted gradient to modulate the control input provided as an input to the plant.
In some embodiments, the processing circuit is configured to determine the first value of the gradient based on a covariance of the control input and the performance variable, a variable of the control input, and a variance of the performance variable.
In some embodiments, the second value of the gradient is determined based on a first filtered value of the multiple values of the gradient and a forgetting factor.
In some embodiments, the processing circuit is further configured to calculate the forgetting factor based on a dominant time constant of the plant.
In some embodiments, the processing circuit is further configured to scale the adjusted gradient to dampen a convergence rate of the control input toward an optimum value as the gradient approaches zero.
In some embodiments, the processing circuit is further configured to adjust the second value of the gradient by causing the second value of the gradient to decay toward zero over time.
In some embodiments, the processing circuit is configured to select an extremum of the multiple values of the gradient as the second value of the gradient.
Another implementation of the present disclosure is a controller including a processing circuit. In some embodiments, the processing circuit is configured to determine multiple values of a correlation coefficient that relates a control input provided as an input to a plant to a performance variable that characterizes a performance of the plant in response to the control input. In some embodiments, the performance variable includes a noise-free portion and an amount of noise. In some embodiments, the processing circuit is configured to determine an adjusted correlation coefficient by scaling a first value of the correlation coefficient selected from the multiple values relative to a second value of the correlation coefficient selected from the multiple values. In some embodiments, the adjusted correlation coefficient relates the noise-free portion of the performance variable to the control input. In some embodiments, the processing circuit is configured to use the adjusted correlation coefficient to modulate the control input provided as an input to the plant.
In some embodiments, the first value of the correlation coefficient is a maximum value of the multiple values of the correlation coefficient that decays over time.
In some embodiments, the multiple values of the correlation coefficient are filtered values of the correlation coefficient.
Referring generally to the FIGURES, various extremum-seeking control (ESC) systems and methods are shown, according to some embodiments. In general, ESC is a class of self-optimizing control strategies that can dynamically search for the unknown and/or time-varying inputs of a system for optimizing a certain performance index. ESC can be considered a dynamic realization of gradient searching through the use of dither signals. The gradient of the system output y with respect to the system input u can be obtained by slightly perturbing the system operation and applying a demodulation measure.
Optimization of system performance can be obtained by driving the gradient towards zero by using a feedback loop in the closed-loop system. ESC is a non-model based control strategy, meaning that a model for the controlled system is not necessary for ESC to optimize the system. Various implementations of ESC are described in detail in U.S. Pat. Nos. 8,473,080, 7,827,813, 8,027,742, 8,200,345, 8,200,344, U.S. patent application Ser. No. 14/495,773, U.S. patent application Ser. No. 14/538,700, U.S. patent application Ser. No. 14/975,527, and U.S. patent application Ser. No. 14/961,747. Each of these patents and patent applications is incorporated by reference herein.
In some embodiments, an extremum-seeking controller uses a stochastic excitation signal q to perturb a control input u provided to a plant. The controller can include a stochastic signal generator configured to generate a stochastic signal. The stochastic signal can be a random signal (e.g., a random walk signal, a white noise signal, etc.), a non-periodic signal, an unpredictable signal, a disturbance signal, or any other type of non-deterministic or non-repeating signal. In some embodiments, the stochastic signal has a non-zero mean. The stochastic signal can be integrated to generate the excitation signal q.
The stochastic excitation signal q can provide variation in the control input u sufficient to estimate the gradient of the plant output (i.e., a performance variable y) with respect to the control input u. The stochastic excitation signal q has several advantages over a traditional periodic dither signal v. For example, the stochastic excitation signal q is less perceptible than the traditional periodic dither signal v. As such, the effects of the stochastic excitation signal q on the control input u are less noticeable than the periodic oscillations caused by the traditional periodic dither signal v. Another advantage of the stochastic excitation signal q is that tuning the controller is simpler because the dither frequency ωv is no longer a required parameter. Accordingly, the controller does not need to know or estimate the natural frequency of the plant when generating the stochastic excitation signal q.
In some embodiments, the extremum-seeking controller uses a recursive estimation technique to estimate the gradient of the performance variable y with respect to the control input u. For example, the controller can use a recursive least-squares (RLS) estimation technique to generate an estimate of the gradient
In some embodiments, the controller uses exponential forgetting as part of the RLS estimation technique. For example, the controller can be configured to calculate exponentially-weighted moving averages (EWMAs) of the performance variable y, the control input u, and/or other variables used in the recursive estimation technique. Exponential forgetting reduces the required amount of data storage (relative to batch processing) and allows the controller to remain more sensitive to recent data and thus more responsive to a shifting optimal point.
In some embodiments, the extremum-seeking controller estimates a normalized correlation coefficient ρ relating the performance variable y to the control input u. The correlation coefficient ρ can be related to the performance gradient
but scaled based on the range of the performance variable y. For example, the correlation coefficient ρ can be a normalized measure of the performance gradient
scaled to the range −1≤ρ≤1. The normalized correlation coefficient ρ can be estimated based on the covariance between the performance variable y and the control input u, the variance of the performance variable y, and the variance of the control input u. In some embodiments, the normalized correlation coefficient ρ can be estimated using a recursive estimation process.
The correlation coefficient ρ can be used by the feedback controller instead of the performance gradient
For example, the feedback controller can adjust the DC value w of the control input u to drive the correlation coefficient ρ to zero. One advantage of using the correlation coefficient ρ in place of the performance gradient
is that the tuning parameters used by the feedback controller can be a general set of tuning parameters which do not need to be customized or adjusted based on the scale of the performance variable y. This advantage eliminates the need to perform control-loop-specific tuning for the feedback controller and allows the feedback controller to use a general set of tuning parameters that are applicable across many different control loops and/or plants. Additional features and advantages of the extremum-seeking controller are described in greater detail below.
The correlation coefficient ρ can be adjusted to account for noise in a signal associated with the performance variable ρ. For example, the correlation coefficient ρ can be adjusted, scaled, filtered, etc., to determine an adjusted correlation coefficient ρ which accounts for signal noise. An extremum-seeking controller can be configured to determine the adjusted correlation coefficient ρ which accounts for signal noise. The extremum-seeking controller is also configured to dampen a convergence rate of ESC by adjusting the correlation coefficient ρ. Advantageously, the extremum-seeking controller can account for noise in various signals such as signal associated with the performance variable y, and does not require tuning.
Referring now to
In various implementations, ESC can be used in any type of controller that functions to achieve a setpoint for a variable of interest (e.g., by minimizing a difference between a measured or calculated input and a setpoint) and/or optimize a variable of interest (e.g., maximize or minimize an output variable). It is contemplated that ESC can be readily implemented in various types of controllers (e.g., motor controllers, power controllers, fluid controllers, HVAC controllers, lighting controllers, chemical controllers, process controllers, etc.) and various types of control systems (e.g., closed-loop control systems, open-loop control systems, feedback control systems, feed-forward control systems, etc.). All such implementations should be considered within the scope of the present disclosure.
Referring particularly to
The circulated fluid from chiller 22 or boiler 24 can be transported to AHUs 36 via piping 32. AHUs 36 may place the circulated fluid in a heat exchange relationship with an airflow passing through AHUs 36. For example, the airflow can be passed over piping in fan coil units or other air conditioning terminal units through which the circulated fluid flows. AHUs 36 may transfer heat between the airflow and the circulated fluid to provide heating or cooling for the airflow. The heated or cooled air can be delivered to building 10 via an air distribution system including air supply ducts 38 and may return to AHUs 36 via air return ducts 40. In
In some embodiments, the refrigerant in chiller 22 is vaporized upon absorbing heat from the circulated fluid. The vapor refrigerant can be provided to a compressor within chiller 22 where the temperature and pressure of the refrigerant are increased (e.g., using a rotating impeller, a screw compressor, a scroll compressor, a reciprocating compressor, a centrifugal compressor, etc.). The compressed refrigerant can be discharged into a condenser within chiller 22. In some embodiments, water (or another chilled fluid) flows through tubes in the condenser of chiller 22 to absorb heat from the refrigerant vapor, thereby causing the refrigerant to condense. The water flowing through tubes in the condenser can be pumped from chiller 22 to a rooftop cooling unit 26 via piping 28. Cooling unit 26 may use fan driven cooling or fan driven evaporation to remove heat from the water. The cooled water in rooftop unit 26 can be delivered back to chiller 22 via piping 30 and the cycle repeats.
Referring now to
Each of dampers 60-64 can be operated by an actuator. As shown in
Actuators 54-58 may receive control signals from AHU controller 44 and may provide feedback signals to AHU controller 44. Feedback signals may include, for example, an indication of a current actuator or damper position, an amount of torque or force exerted by the actuator, diagnostic information (e.g., results of diagnostic tests performed by actuators 54-58), status information, commissioning information, configuration settings, calibration data, and/or other types of information or data that can be collected, stored, or used by actuators 54-58.
Still referring to
Each of valves 92-94 can be controlled by an actuator. As shown in
AHU controller 44 may operate valves 92-94 via actuators 88-90 to modulate an amount of heating or cooling provided to supply air 86 (e.g., to achieve a setpoint temperature for supply air 86 or to maintain the temperature of supply air 86 within a setpoint temperature range). The positions of valves 92-94 affect the amount of cooling or heating provided to supply air 86 by cooling coil 68 or heating coil 70 and may correlate with the amount of energy consumed to achieve a desired supply air temperature. In various embodiments, valves 92-94 can be operated by AHU controller 44 or a separate controller for HVAC system 20.
AHU controller 44 may monitor the positions of valves 92-94 via communications links 96-98. AHU controller 44 may use the positions of valves 92-94 as the variable to be optimized using an ESC control technique. AHU controller 44 may determine and/or set the positions of dampers 60-64 to achieve an optimal or target position for valves 92-94. The optimal or target position for valves 92-94 can be the position that corresponds to the minimum amount of mechanical heating or cooling used by HVAC system 20 to achieve a setpoint supply air temperature (e.g., minimum fluid flow through valves 92-94).
Still referring to
In some embodiments, AHU controller 44 receives information (e.g., commands, setpoints, operating boundaries, etc.) from supervisory controller 42. For example, supervisory controller 42 may provide AHU controller 44 with a high fan speed limit and a low fan speed limit. A low limit may avoid frequent component and power taxing fan start-ups while a high limit may avoid operation near the mechanical or thermal limits of the fan system. In various embodiments, AHU controller 44 and supervisory controller 42 can be separate (as shown in
Client device 46 may include one or more human-machine interfaces or client interfaces (e.g., graphical user interfaces, reporting interfaces, text-based computer interfaces, client-facing web services, web servers that provide pages to web clients, etc.) for controlling, viewing, or otherwise interacting with HVAC system 20, its subsystems, and/or devices. Client device 46 can be a computer workstation, a client terminal, a remote or local interface, or any other type of user interface device. Client device 46 can be a stationary terminal or a mobile device. For example, client device 46 can be a desktop computer, a computer server with a user interface, a laptop computer, a tablet, a smartphone, a PDA, or any other type of mobile or non-mobile device.
Extremum-Seeking Control Systems with Periodic Dither Signals
Referring now to
Extremum-seeking controller 302 uses extremum-seeking control logic to modulate the manipulated variable u. For example, controller 302 may use a periodic (e.g., sinusoidal) perturbation signal or dither signal to perturb the value of manipulated variable u in order to extract a performance gradient p. The manipulated variable u can be perturbed by adding periodic oscillations to a DC value of the performance variable u, which may be determined by a feedback control loop. The performance gradient p represents the gradient or slope of the performance variable y with respect to the manipulated variable u. Controller 302 uses extremum-seeking control logic to determine a value for the manipulated variable u that drives the performance gradient p to zero.
Controller 302 may determine the DC value of manipulated variable u based on a measurement or other indication of the performance variable y received as feedback from plant 304 via input interface 310. Measurements from plant 304 can include, but are not limited to, information received from sensors about the state of plant 304 or control signals sent to other devices in the system. In some embodiments, the performance variable y is a measured or observed position of one of valves 92-94. In other embodiments, the performance variable y is a measured or calculated amount of power consumption, a fan speed, a damper position, a temperature, or any other variable that can be measured or calculated by plant 304. Performance variable y can be the variable that extremum-seeking controller 302 seeks to optimize via an extremum-seeking control technique. Performance variable y can be output by plant 304 or observed at plant 304 (e.g., via a sensor) and provided to extremum-seeking controller at input interface 310.
Input interface 310 provides the performance variable y to performance gradient probe 312 to detect the performance gradient 314. Performance gradient 314 may indicate a slope of the function y=ƒ(u), where y represents the performance variable received from plant 304 and u represents the manipulated variable provided to plant 304. When performance gradient 314 is zero, the performance variable y has an extremum value (e.g., a maximum or minimum). Therefore, extremum-seeking controller 302 can optimize the value of the performance variable y by driving performance gradient 314 to zero.
Manipulated variable updater 316 produces an updated manipulated variable u based upon performance gradient 314. In some embodiments, manipulated variable updater 316 includes an integrator to drive performance gradient 314 to zero. Manipulated variable updater 316 then provides an updated manipulated variable u to plant 304 via output interface 318. In some embodiments, manipulated variable u is provided to one of dampers 60-64 (
Referring now to
Plant 404 can be the same as plant 304 or similar to plant 304, as described with reference to
Plant 404 can be represented mathematically as a combination of input dynamics 422, a performance map 424, output dynamics 426, and disturbances d. In some embodiments, input dynamics 422 are linear time-invariant (LTI) input dynamics and output dynamics 426 are LTI output dynamics. Performance map 424 can be a static nonlinear performance map. Disturbances d can include process noise, measurement noise, or a combination of both. Although the components of plant 404 are shown in
Plant 404 receives a control input u (e.g., a control signal, a manipulated variable, etc.) from extremum-seeking controller 402 via output interface 430. Input dynamics 422 may use the control input u to generate a function signal x based on the control input (e.g., x=ƒ(u)). Function signal x may be passed to performance map 424 which generates an output signal z as a function of the function signal (i.e., z=ƒ(x)). The output signal z may be passed through output dynamics 426 to produce signal z′, which is modified by disturbances d to produce performance variable y (e.g., y=z′+d). Performance variable y is provided as an output from plant 404 and received at extremum-seeking controller 402. Extremum-seeking controller 402 may seek to find values for x and/or u that optimize the output z of performance map 424 and/or the performance variable y.
Still referring to
The first step of the dither-demodulation technique is performed by dither signal generator 416 and dither signal element 414. Dither signal generator 416 generates a periodic dither signal v, which is typically a sinusoidal signal. Dither signal element 414 receives the dither signal v from dither signal generator 416 and the DC value of the plant input w from controller 412. Dither signal element 414 combines dither signal v with the DC value of the plant input w to generate the perturbed control input u provided to plant 404 (e.g., u=w+v). The perturbed control input u is provided to plant 404 and used by plant 404 to generate performance variable y as previously described.
The second step of the dither-demodulation technique is performed by high-pass filter 406, demodulation element 408, and low-pass filter 410. High-pass filter 406 filters the performance variable y and provides the filtered output to demodulation element 408. Demodulation element 408 demodulates the output of high-pass filter 406 by multiplying the filtered output by the dither signal v with a phase shift 418 applied. The DC value of this multiplication is proportional to the performance gradient p of performance variable y with respect to the control input u. The output of demodulation element 408 is provided to low-pass filter 410, which extracts the performance gradient p (i.e., the DC value of the demodulated output). The estimate of the performance gradient p is then provided to integrator feedback controller 412, which drives the performance gradient estimate p to zero by adjusting the DC value w of the plant input u.
Still referring to
Additionally, it may be desirable to carefully select the frequency of the dither signal v to ensure that the ESC strategy is effective. For example, it may be desirable to select a dither signal frequency ωv based on the natural frequency ωn of plant 304 to enhance the effect of the dither signal v on the performance variable y. It can be difficult and challenging to properly select the dither frequency ωv without knowledge of the dynamics of plant 404. For these reasons, the use of a periodic dither signal v is one of the drawbacks of traditional ESC.
In ESC system 400, the output of high-pass filter 406 can be represented as the difference between the value of the performance variable y and the expected value of the performance variable y, as shown in the following equation:
Output of High-Pass Filter: y−E[y]
where the variable E[y] is the expected value of the performance variable y. The result of the cross-correlation performed by demodulation element 408 (i.e., the output of demodulation element 408) can be represented as the product of the high-pass filter output and the phase-shifted dither signal, as shown in the following equation:
Result of Cross-Correlation: (y−E[y])(v−E[v])
where the variable E[v] is the expected value of the dither signal v. The output of low-pass filter 410 can be represented as the covariance of the dither signal v and the performance variable y, as shown in the following equation:
Output of Low-Pass Filter: E[(y−E[y])(v−E[U])]≡Cov(v,y)
where the variable E[u] is the expected value of the control input u.
The preceding equations show that ESC system 400 generates an estimate for the covariance Cov(v, y) between the dither signal v and the plant output (i.e., the performance variable y). The covariance Cov(v, y) can be used in ESC system 400 as a proxy for the performance gradient p. For example, the covariance Cov(v, y) can be calculated by high-pass filter 406, demodulation element 408, and low-pass filter 410 and provided as a feedback input to integrator feedback controller 412. Integrator feedback controller 412 can adjust the DC value w of the plant input u in order to minimize the covariance Cov(v, y) as part of the feedback control loop.
Extremum-Seeking Control System with Stochastic Excitation Signal
Referring now to
In some embodiments, the ESC logic implemented by controller 502a generates values for control input u based on a received control signal (e.g., a setpoint, an operating mode signal, etc.). The control signal may be received from a user control (e.g., a thermostat, a local user interface, etc.), client devices 536 (e.g., computer terminals, mobile user devices, cellular phones, laptops, tablets, desktop computers, etc.), a supervisory controller 532, or any other external system or device. In various embodiments, controller 502a can communicate with external systems and devices directly (e.g., using NFC, Bluetooth, WiFi direct, cables, etc.) or via a communications network 534 (e.g., a BACnet network, a LonWorks network, a LAN, a WAN, the Internet, a cellular network, etc.) using wired or wireless electronic data communications
Plant 504 can be similar to plant 404, as described with reference to
Plant 504 can be represented mathematically as a static nonlinearity in series with a dynamic component. For example, plant 504 is shown to include a static nonlinear function block 516 in series with a constant gain block 518 and a transfer function block 520. Although the components of plant 504 are shown in
Still referring to
In some embodiments interfaces 530 and 524-526 can be joined as one or two interfaces rather than three separate interfaces. For example, communications interface 530 and input interface 526 can be combined as one Ethernet interface configured to receive network communications from supervisory controller 532. In some embodiments, supervisory controller 532 provides both a setpoint and feedback via an Ethernet network (e.g., network 534). In such an embodiment, output interface 524 may be specialized for a controlled component of plant 504. In other embodiments, output interface 524 can be another standardized communications interface for communicating data or control signals. Interfaces 530 and 524-526 can include communications electronics (e.g., receivers, transmitters, transceivers, modulators, demodulators, filters, communications processors, communication logic modules, buffers, decoders, encoders, encryptors, amplifiers, etc.) configured to provide or facilitate the communication of the signals described herein.
Still referring to
Memory 542 can include one or more devices (e.g., memory units, memory devices, storage devices, etc.) for storing data and/or computer code for completing and/or facilitating the various processes described in the present disclosure. Memory 542 can include random access memory (RAM), read-only memory (ROM), hard drive storage, temporary storage, non-volatile memory, flash memory, optical memory, or any other suitable memory for storing software objects and/or computer instructions. Memory 542 can include database components, object code components, script components, or any other type of information structure for supporting the various activities and information structures described in the present disclosure. Memory 542 can be communicably connected to processor 540 via processing circuit 538 and can include computer code for executing (e.g., by processor 540) one or more processes described herein.
Still referring to
of the performance variable y with respect to the control input u and to adjust the DC value of the control input u (i.e., the variable w) to drive the gradient
to zero.
Recursive gradient estimator 506 can be configured to estimate the gradient
of the performance variable y with respect to the control input u. The gradient
may be similar to the performance gradient p determined in ESC system 400. However, the fundamental difference between ESC system 500 and ESC system 400 is the way that the gradient
is obtained. In ESC system 400, the performance gradient p is obtained via the dither-demodulation technique described with reference to
in ESC system 500 is obtained by performing a recursive regression technique to estimate the slope of the performance variable y with respect to the control input u. The recursive estimation technique may be performed by recursive gradient estimator 506.
Recursive gradient estimator 506 can use any of a variety of recursive estimation techniques to estimate the gradient
For example, recursive gradient estimator 506 can use a recursive least-squares (RLS) estimation technique to generate an estimate of the gradient
In some embodiments, recursive gradient estimator 506 uses exponential forgetting as part of the RLS estimation technique. Exponential forgetting reduces the required amount of data storage relative to batch processing. Exponential forgetting also allows the RLS estimation technique to remain more sensitive to recent data and thus more responsive to a shifting optimal point. An example a RLS estimation technique which can be performed recursive gradient estimator 506 is described in detail below.
Recursive gradient estimator 506 is shown receiving the performance variable y from plant 504 and the control input u from excitation signal element 510. In some embodiments, recursive gradient estimator 506 receives multiple samples or measurements of the performance variable y and the control input u over a period of time. Recursive gradient estimator 506 can use a sample of the control input u at time k to construct an input vector xk as shown in the following equation:
where uk is the value of the control input u at time k. Similarly, recursive gradient estimator 506 can construct a parameter vector {circumflex over (θ)}k as shown in the following equation:
where the parameter {circumflex over (θ)}2 is the estimate of the gradient
at time k.
Recursive gradient estimator 506 can estimate the performance variable ŷk at time k using the following linear model:
ŷ
k
=x
k
T{circumflex over (θ)}k-1
The prediction error of this model is the difference between the actual value of the performance variable yk at time k and the estimated value of the performance variable ŷk at time k as shown in the following equation:
e
k
=y
k
−ŷ
k
=y
k
−x
k
T{circumflex over (θ)}k-1
Recursive gradient estimator 506 can use the estimation error ek in the RLS technique to determine the parameter values {circumflex over (θ)}k. Any of a variety of RLS techniques can be used in various implementations. An example of a RLS technique which can be performed by recursive gradient estimator 506 is as follows:
g
k
=P
k-1
x
k(λ+xkTPk-1xk)−1
P
k=λ−1Pk-1−gkxkTλ−1Pk-1
{circumflex over (θ)}k={circumflex over (θ)}k-1+ekgk
where gk is a gain vector, Pk is a covariance matrix, and λ is a forgetting factor (λ<1). In some embodiments, the forgetting factor λ is defined as follows:
where Δt is the sampling period and τ is the forgetting time constant.
Recursive gradient estimator 506 can use the equation for gk to calculate the gain vector gk at time k based on a previous value of the covariance matrix Pk-1 at time k−1, the value of the input vector xkT at time k, and the forgetting factor. Recursive gradient estimator 506 can use the equation for Pk to calculate the covariance matrix Pk at time k based on the forgetting factor λ, the value of the gain vector gk at time k, and the value of the input vector xkT at time k. Recursive gradient estimator 506 can use the equation for {circumflex over (θ)}k to calculate the parameter vector {circumflex over (θ)}k at time k based on the error ek at time k and the gain vector gk at time k. Once the parameter vector {circumflex over (θ)}k is calculated, recursive gradient estimator 506 can determine the value of the gradient
by extracting the value of the {circumflex over (θ)}2 parameter from {circumflex over (θ)}k, as shown in the following equations:
In various embodiments, recursive gradient estimator 506 can use any of a variety of other recursive estimation techniques to estimate
For example, recursive gradient estimator 506 can use a Kalman filter, a normalized gradient technique, an unnormalized gradient adaption technique, a recursive Bayesian estimation technique, or any of a variety of linear or nonlinear filters to estimate
In other embodiments, recursive gradient estimator 506 can use a batch estimation technique rather than a recursive estimation technique. As such, gradient estimator 506 can be a batch gradient estimator rather than a recursive gradient estimator. In a batch estimation technique, gradient estimator 506 can use a batch of previous values for the control input u and the performance variable y (e.g., a vector or set of previous or historical values) as inputs to a batch regression algorithm. Suitable regression algorithms may include, for example, ordinary least squares regression, polynomial regression, partial least squares regression, ridge regression, principal component regression, or any of a variety of linear or nonlinear regression techniques.
In some embodiments, it is desirable for recursive gradient estimator 506 to use a recursive estimation technique rather than a batch estimation technique due to several advantages provided by the recursive estimation technique. For example, the recursive estimation technique described above (i.e., RLS with exponential forgetting) has been shown to greatly improve the performance of the gradient estimation technique relative to batch least-squares. In addition to requiring less data storage than batch processing, the RLS estimation technique with exponential forgetting can remain more sensitive to recent data and thus more responsive to a shifting optimal point.
In some embodiments, recursive gradient estimator 506 estimates the gradient
using the covariance between the control input u and the performance variable y. For example, the estimate of the slope {circumflex over (β)} in a least-squares approach can be defined as:
where Cov(u, y) is the covariance between the control input u and the performance variable y, and Var(u) is the variance of the control input u. Recursive gradient estimator 506 can calculate the estimated slope {circumflex over (β)} using the previous equation and use the estimated slope {circumflex over (β)} as a proxy for the gradient
Notably, the estimated slope {circumflex over (β)} is a function of only the control input u and the performance variable y. This is different from the covariance derivation technique described with reference to
In some embodiments, recursive gradient estimator 506 uses a higher-order model (e.g., a quadratic model, a cubic model, etc.) rather than a linear model to estimate the performance variable ŷk. For example, recursive gradient estimator 506 can estimate the performance variable ŷk at time k using the following quadratic model:
ŷ
k={circumflex over (θ)}1+{circumflex over (θ)}2uk+{circumflex over (θ)}3uk2+ϵk
which can be written in the form ŷk=xkT{circumflex over (θ)}k-1 by updating the input vector xk and the parameter vector {circumflex over (θ)}k as follows:
Recursive gradient estimator 506 can use the quadratic model to fit a quadratic curve (rather than a straight line) to the data points defined by combinations of the control input u and the performance variable y at various times k. The quadratic model provides second-order information not provided by the linear model and can be used to improve the convergence of feedback controller 508. For example, with a linear model, recursive gradient estimator 506 can calculate the gradient
at a particular location along the curve (i.e., for a particular value of the control input u) and can provide the gradient
as a feedback signal. For embodiments that use a linear model to estimate ŷk, the gradient
(i.e., me derivative of me linear model with respect to u) is a scalar value. When controller 508 receives a scalar value for the gradient
as a feedback signal, controller 508 can incrementally adjust the value of the control input u in a direction that drives the gradient
toward zero until the optimal value of the control input u is reached (i.e., the value of the control input u that results in the gradient
With a quadratic model, recursive gradient estimator 506 can provide feedback controller 508 with a function for the gradient
rather than a simple scalar value. For embodiments that use a quadratic model to estimate ŷk, the gradient
(i.e., the derivative of the quadratic model with respect to u) is a linear function of the control input u
When controller 508 receives a linear function for the gradient
as a feedback signal, controller 508 can analytically calculate the optimal value of the control input u that will result in the gradient
Accordingly, controller 508 can adjust the control input u using smart steps that rapidly approach the optimal value without relying on incremental adjustment and experimentation to determine whether the gradient
is moving toward zero.
Still referring to
reliably, it may be desirable to provide sufficient variation in the control input u that carries through to the performance variable y. Controller 502a can use stochastic signal generator 512 and integrator 514 to generate a persistent excitation signal q. The excitation signal q can be added to the DC value w of the control input u at excitation signal element 510 to form the control input u (e.g., u=w+q).
Stochastic signal generator 512 can be configured to generate a stochastic signal. In various embodiments, the stochastic signal can be a random signal (e.g., a random walk signal, a white noise signal, etc.), a non-periodic signal, an unpredictable signal, a disturbance signal, or any other type of non-deterministic or non-repeating signal. In some embodiments, the stochastic signal has a non-zero mean. The stochastic signal can be integrated by integrator 514 to generate the excitation signal q.
Excitation signal q can provide variation in the control input u sufficient for the gradient estimation technique performed by recursive gradient estimator 506. In some instances, the addition of excitation signal q causes the control input u to drift away from its optimum value. However, feedback controller 508 can compensate for such drift by adjusting the DC value w such that the control input u is continuously pulled back toward its optimum value. As with traditional ESC, the magnitude of the excitation signal q can be selected (e.g., manually by a user or automatically by controller 502a) to overcome any additive noise found in the performance variable y (e.g., process noise, measurement noise, etc.).
The stochastic excitation signal q generated by extremum-seeking controller 502a has several advantages over the periodic dither signal v generated by controller 402. For example, the stochastic excitation signal q is less perceptible than a traditional periodic dither signal v. As such, the effects of the stochastic excitation signal q on the control input u are less noticeable than the periodic oscillations caused by the traditional periodic dither signal v. Graphs illustrating a control input u excited by the stochastic excitation signal q and the resulting performance variable y are shown in
Another advantage of the stochastic excitation signal q is that tuning controller 502a is simpler because the dither frequency co, is no longer a required parameter. Accordingly, controller 502a does not need to know or estimate the natural frequency of plant 504 when generating the stochastic excitation signal q. In some embodiments, extremum-seeking controller 502a provides multiple control inputs u to plant 504. Each of the control inputs can be excited by a separate stochastic excitation signal q. Since each of the stochastic excitation signals q is random, there is no need to ensure that the stochastic excitation signals q are not correlated with each other. Controller 502a can calculate the gradient
of the performance variable y with respect to each of the control inputs u without performing a frequency-specific dither-demodulation technique.
One of the problems with traditional ESC is that the performance gradient
is a function of the range or scale of the performance variable y. The range or scale of the performance variable y can depend on the static and dynamic components of plant 504. For example, plant 504 is shown to include a nonlinear function ƒ(u) (i.e., function block 516) in series with a constant gain K (i.e., constant gain block 518). It is apparent from this representation that the range or scale of the performance variable y is a function of the constant gain K.
The value of the performance gradient
may vary based on the value of the control input u due to the nonlinearity provided by the nonlinear function ƒ(u). However, the scale of the performance gradient
is also dependent upon the value of the constant gain K. For example, the performance gradient
can be determined using the following equation:
where K is the constant gain and ƒ′(u) is the derivative of the function ƒ(u). It can be desirable to scale or normalize the performance gradient
(e.g., by multiplying by a scaling parameter K) in order to provide consistent feedback control loop performance. However, without knowledge of the scale of the performance variable y (e.g., without knowing the constant gain K applied by plant 504), it can be challenging to determine an appropriate value for the scaling parameter K.
Still referring to
but scaled based on the range of the performance variable y. For example, the correlation coefficient ρ can be a normalized measure of the performance gradient
(e.g., scaled to the range 0≤ρ≤1).
Correlation coefficient estimator 528 is shown receiving the control input u and the performance variable y as inputs. Correlation coefficient estimator 528 can generate the correlation coefficient ρ based on the variance and covariance of the control input u and the performance variable y, as shown in the following equation:
where Cov(u, y) is the covariance between the control input u and the performance variable y, Var(u) is the variance of the control input u, and Var(y) is the variance of the performance variable y. The previous equation can be rewritten in terms of the standard deviation σu of the control input u and the standard deviation σy of the performance variable y as follows:
where Var(u)=σu2 and Var(y)=σy2.
In some embodiments, correlation coefficient estimator 528 estimates the correlation coefficient ρ using a recursive estimation technique. For example, correlation coefficient estimator 528 can calculate exponentially-weighted moving averages (EWMAs) of the control input u and the performance variable y using the following equations:
where ūk and
Similarly, correlation coefficient estimator 528 can calculate EWMAs of the control input variance Var(u), the performance variable variance Var(y), and the covariance Cov(u, y) using the following equations:
where Vu,k, Vy,k, and ck are the EWMAs of the control input variance Var(u), the performance variable variance Var(y), and the covariance Cov(u, y), respectively, at time k. Vu,k-1, Vy,k-1, and ck-1 are the EWMAs of the control input variance Var(u), the performance variable variance Var(y), and the covariance Cov(u, y), respectively, at time k−1. Correlation coefficient estimator 528 can generate an estimate of the correlation coefficient ρ based on these recursive estimates using the following equation:
In some embodiments, correlation coefficient estimator 528 generates the correlation coefficient ρ based on the estimated slope {circumflex over (β)}. As previously described, the estimated slope {circumflex over (β)} can be calculated using the following equation:
where Cov(u, y) is the covariance between the control input u and the performance variable y, and Var(u) is the variance of the control input u (i.e., σu2). Correlation coefficient estimator 528 can calculate the correlation coefficient ρ from the slope {circumflex over (β)} using the following equation:
From the previous equation, it can be seen that the correlation coefficient ρ and the estimated slope {circumflex over (β)} are equal when the standard deviations σu and σy are equal (i.e., when σu=σy).
Correlation coefficient estimator 528 can receive the estimated slope {circumflex over (β)} from recursive gradient estimator 506 or calculate the estimated slope {circumflex over (β)} using a set of values for the control input u and the performance variable y. For example, with the assumption of finite variance in u and y, correlation coefficient estimator 528 can estimate the slope {circumflex over (β)} using the following least squares estimation:
For a small range of the control input u, the estimated slope {circumflex over (β)} can be used as a proxy for the performance gradient, as shown in the following equation:
As shown in the previous equation, the estimated slope {circumflex over (β)} contains the constant gain K, which may be unknown. However, normalization provided by the standard deviations σu and σy cancels the effect of the constant gain K. For example, the standard deviation σy of the performance variable y is related to the standard deviation σu of the control input u as shown in the following equations:
Multiplying the estimated slope {circumflex over (β)} by the ratio
to calculate the correlation coefficient ρ is equivalent to dividing by the constant gain K. Both the correlation coefficient ρ and the estimated slope {circumflex over (β)} indicate the strength of the relationship between the control input u and the performance variable y. However, the correlation coefficient ρ has the advantage of being normalized which makes tuning the feedback control loop much simpler.
In some embodiments, the correlation coefficient ρ is used by feedback controller 508 instead of the performance gradient
For example, feedback controller 508 can adjust the DC value w of the control input u to drive the correlation coefficient ρ to zero. One advantage of using the correlation coefficient ρ in place of the performance gradient
is that the tuning parameters used by feedback controller 508 can be a general set of tuning parameters which do not need to be customized or adjusted based on the scale of the performance variable y. This advantage eliminates the need to perform control-loop-specific tuning for feedback controller 508 and allows feedback controller 508 to use a general set of tuning parameters that are applicable across many different control loops and/or plants.
Referring now to
Referring particularly to
Referring particularly to
Referring now to
Flow diagram 800 is shown to include providing a control input u to a plant (block 802) and receiving a performance variable y as a feedback from a plant (block 804). The control input u can be provided by an extremum-seeking controller and/or a feedback controller for the plant. The controller can be any of the controllers previously described (e.g., controller 302, controller 402, controller 502a, etc.) or any other type of controller that provides a control input u to a plant. In some embodiments, the controller is an extremum-seeking controller configured to achieve an optimal value for the performance variable y by adjusting the control input u. The optimal value can be an extremum (e.g., a maximum or a minimum) of the performance variable y.
A plant in control theory is the combination of a process and one or more mechanically-controlled outputs. The plant can be any of the plants previously described (e.g., plant 304, plant 404, plant 504, etc.) or any other controllable system or process. For example, the plant can be an air handling unit configured to control temperature within a building space via one or more mechanically-controlled actuators and/or dampers. In various embodiments, the plant can include a chiller operation process, a damper adjustment process, a mechanical cooling process, a ventilation process, a refrigeration process, or any other process in which a control input u to the plant is adjusted to affect the performance variable y. The performance variable y can be a measured variable observed by one or more sensors of the plant (e.g., a measured temperature, a measured power consumption, a measured flow rate, etc.), a calculated variable based on measured or observed values (e.g., a calculated efficiency, a calculated power consumption, a calculated cost, etc.) or any other type of variable that indicates the performance of the plant in response to the control input u.
Flow diagram 800 is shown to include estimating a gradient of the performance variable y with respect to the control input u (block 806). In some embodiments, the gradient is the performance gradient p described with reference to
or the estimated slope {circumflex over (β)} described with reference to
In some embodiments, the gradient is estimated by performing a recursive gradient estimation technique. The recursive gradient estimation technique may include obtaining a model for the performance variable y as a function of the control input u. For example, the gradient can be estimated using the following linear model:
ŷ
k
=x
k
T{circumflex over (θ)}k-1
where xk is an input vector and {circumflex over (θ)}k is a parameter vector. The input vector xk and the parameter vector {circumflex over (θ)}k can be defined as follows:
where uk is the value of the control input u at time k and the parameter {circumflex over (θ)}2 is the estimate of the gradient
at time k.
The prediction error of this model is the difference between the actual value of the performance variable yk at time k and the estimated value of the performance variable ŷk at time k as shown in the following equation:
e
k
=y
k
−ŷ
k
=y
k
−x
k
T{circumflex over (θ)}k-1
The estimation error ek can be used in the recursive gradient estimation technique to determine the parameter values {circumflex over (θ)}k. Any of a variety of regression techniques can be used to estimate values for the parameter vector {circumflex over (θ)}k.
In some embodiments, a higher-order model (e.g., a quadratic model, a cubic model, etc.) rather than a linear model can be used to estimate the gradient. For example, the following quadratic model can be used to estimate the gradient
at a particular location along the curve defined by the model:
ŷ
k={circumflex over (θ)}1+{circumflex over (θ)}2uk+{circumflex over (θ)}3uk2+ϵk
In some embodiments, the gradient is estimated using a recursive least squares (RLS) estimation technique with exponential forgetting. Any of a variety of RLS techniques can be used in various implementations. An example of a RLS technique which can be performed to estimate the gradient is shown in the following equations, which can be solved to determine the value for the parameter vector {circumflex over (θ)}k.
g
k
=P
k-1
x
k(λ+xkTPk-1xk)−1
P
k=λ−1Pk-1−gkxkTλ−1Pk-1
{circumflex over (θ)}k={circumflex over (θ)}k-1+ekgk
where gk is a gain vector, Pk is a covariance matrix, and λ is a forgetting factor (λ<1). In some embodiments, the forgetting factor λ is defined as follows:
where Δt is the sampling period and τ is the forgetting time constant. Once the parameter vector {circumflex over (θ)}k is calculated, the gradient can be estimated by extracting the value of the {circumflex over (θ)}2 parameter from {circumflex over (θ)}k
In various embodiments, the gradient can be estimated using any of a variety of other recursive estimation techniques. For example, the gradient can be estimated using a Kalman filter, a normalized gradient technique, an unnormalized gradient adaption technique, a recursive Bayesian estimation technique, or any of a variety of linear or nonlinear filters. In some embodiments, the gradient can be estimated using a batch estimation technique rather than a recursive estimation technique. In the batch estimation technique, a batch of previous values for the control input u and the performance variable y (e.g., a vector or set of previous or historical values) can be used as inputs to a batch regression algorithm. Suitable regression algorithms may include, for example, ordinary least squares regression, polynomial regression, partial least squares regression, ridge regression, principal component regression, or any of a variety of linear or nonlinear regression techniques.
In some embodiments, the gradient can be estimated using the covariance between the control input u and the performance variable y. For example, the estimate of the slope {circumflex over (β)} in a least-squares approach can be defined as:
where Cov(u, y) is the covariance between the control input u and the performance variable y, and Var(u) is the variance of the control input u. The estimated slope {circumflex over (β)} can be calculated using the previous equation and used as a proxy for the gradient
Still referring to
Flow diagram 800 is shown to include generating a stochastic excitation signal q (block 810) and generating a new control input u by perturbing the output w of the feedback controller with the stochastic excitation signal q (block 812). The stochastic excitation signal q can be generated by stochastic signal generator 512 and/or integrator 514, as described with reference to
The stochastic excitation signal q can be added to the DC value w generated by the feedback controller to form the new control input u (e.g., u=w+q). After the new control input u is generated, the new control input u can be provided to the plant (block 802) and the ESC control technique can be repeated. The stochastic excitation signal q can provide variation in the control input u sufficient to estimate the performance gradient in block 806. In some instances, the addition of excitation signal q causes the control input u to drift away from its optimum value. However, the feedback controller can compensate for such drift by adjusting the DC value w such that the control input u is continuously pulled back toward its optimum value. As with traditional ESC, the magnitude of the excitation signal q can be selected (e.g., manually by a user or automatically by the controller) to overcome any additive noise found in the performance variable y (e.g., process noise, measurement noise, etc.).
The stochastic excitation signal q has several advantages over a periodic dither signal v. For example, the stochastic excitation signal q is less perceptible than a traditional periodic dither signal v. As such, the effects of the stochastic excitation signal q on the control input u are less noticeable than the periodic oscillations caused by the traditional periodic dither signal v. Another advantage of the stochastic excitation signal q is that tuning the controller is simpler because the dither frequency θv is no longer a required parameter. Accordingly, the controller does not need to know or estimate the natural frequency of the plant when generating the stochastic excitation signal q.
Referring now to
Flow diagram 900 is shown to include providing a control input u to a plant (block 902) and receiving a performance variable y as a feedback from a plant (block 904). The control input u can be provided by an extremum-seeking controller and/or a feedback controller for the plant. The controller can be any of the controllers previously described (e.g., controller 302, controller 402, controller 502a, etc.) or any other type of controller that provides a control input u to a plant. In some embodiments, the controller is an extremum-seeking controller configured to achieve an optimal value for the performance variable y by adjusting the control input u. The optimal value can be an extremum (e.g., a maximum or a minimum) of the performance variable y.
A plant in control theory is the combination of a process and one or more mechanically-controlled outputs. The plant can be any of the plants previously described (e.g., plant 304, plant 404, plant 504, etc.) or any other controllable system or process. For example, the plant can be an air handling unit configured to control temperature within a building space via one or more mechanically-controlled actuators and/or dampers. In various embodiments, the plant can include a chiller operation process, a damper adjustment process, a mechanical cooling process, a ventilation process, a refrigeration process, or any other process in which a control input u to the plant is adjusted to affect the performance variable y. The performance variable y can be a measured variable observed by one or more sensors of the plant (e.g., a measured temperature, a measured power consumption, a measured flow rate, etc.), a calculated variable based on measured or observed values (e.g., a calculated efficiency, a calculated power consumption, a calculated cost, etc.) or any other type of variable that indicates the performance of the plant in response to the control input u.
Flow diagram 900 is shown to include estimating a normalized correlation coefficient ρ relating the performance variable y to the control input u. The correlation coefficient ρ can be related to the performance gradient
but scaled based on the range of the performance variable y. For example, the correlation coefficient ρ can be a normalized measure of the performance gradient
(e.g., scaled to the range 0≤ρ≤1).
In some embodiments, the correlation coefficient ρ can be estimated based on the variance and covariance of the control input u and the performance variable y, as shown in the following equation:
where Cov(u, y) is the covariance between the control input u and the performance variable y, Var(u) is the variance of the control input u, and Var(y) is the variance of the performance variable y. The previous equation can be rewritten in terms of the standard deviation σu of the control input u and the standard deviation σy of the performance variable y as follows:
where Var(u)=σu2 and Var(y)=σy2
In some embodiments, the correlation coefficient ρ is estimated using a recursive estimation technique. The recursive estimation technique may include calculating exponentially-weighted moving averages (EWMAs) of the control input u and the performance variable y. For example, EWMAs of the control input u and the performance variable y can be calculated using the following equations:
where ūk and
EWMAs can also be calculated for the control input variance Var(u), the performance variable variance Var(y), and the covariance Cov(u, y) using the following equations:
where Vu,k, Vy,k, and ck are the EWMAs of the control input variance Var(u), the performance variable variance Var(y), and the covariance Cov(u, y), respectively, at time k. Vu,k-1, Vy,k-1, and ck-1 are the EWMAs of the control input variance Var (u), the performance variable variance Var(y), and the covariance Cov(u, y), respectively, at time k−1. The correlation coefficient ρ can be estimated based on these recursive estimates using the following equation:
In some embodiments, the correlation coefficient ρ is estimated based on the estimated slope {circumflex over (β)}. As previously described, the estimated slope {circumflex over (β)} can be calculated using the following equation:
where Cov(u, y) is the covariance between the control input u and the performance variable y, and Var(u) is the variance of the control input u (i.e., σu2). The correlation coefficient ρ can be calculated from the slope {circumflex over (β)} using the following equation:
From the previous equation, it can be seen that the correlation coefficient ρ and the estimated slope {circumflex over (β)} are equal when the standard deviations σu and σy are equal (i.e., when σu=σy).
In some embodiments, the estimated slope {circumflex over (β)} can be calculated using a set of values for the control input u and the performance variable y. For example, with the assumption of finite variance in u and y, the slope {circumflex over (β)} can be estimated using the following least squares estimation:
For a small range of the control input u, the estimated slope {circumflex over (β)} can be used as a proxy for the performance gradient, as shown in the following equation:
As shown in the previous equation, the estimated slope {circumflex over (β)} contains the constant gain K, which may be unknown. However, normalization provided by the standard deviations σu and σy cancels the effect of the constant gain K. For example, the standard deviation σy of the performance variable y is related to the standard deviation σu of the control input u as shown in the following equations:
Multiplying the estimated slope {circumflex over (β)} by the ratio
to calculate the correlation coefficient ρ is equivalent to dividing by the constant gain K. Both the correlation coefficient ρ and the estimated slope {circumflex over (β)} indicate the strength of the relationship between the control input u and the performance variable y. However, the correlation coefficient ρ has the advantage of being normalized which makes tuning the feedback control loop much simpler.
Still referring to
Flow diagram 900 is shown to include generating an excitation signal (block 910) and generating a new control input u by perturbing the output w of the feedback controller with the excitation signal (block 912). In various embodiments, the excitation signal can be a periodic dither signal v as described with reference to
The excitation signal can provide variation in the control input u sufficient to estimate the correlation coefficient ρ in block 906. In some instances, the addition of the excitation signal causes the control input u to drift away from its optimum value. However, the feedback controller can compensate for such drift by adjusting the DC value w such that the control input u is continuously pulled back toward its optimum value. The magnitude of the excitation signal can be selected (e.g., manually by a user or automatically by the controller) to overcome any additive noise found in the performance variable y (e.g., process noise, measurement noise, etc.).
Referring now to
In some embodiments, noise in performance variable y affects the determination of the correlation coefficient ρ. In some embodiments, the correlation coefficient ρ as determined by correlation coefficient estimator 528 decreases as an amount of noise (e.g., a signal to noise ratio) of the performance variable y increases (e.g., signal to noise ratio decreases). In this way, excessive amounts of noise that are present in the signal associated with the performance variable y may adversely affect the determination of the correlation coefficient ρ. In some embodiments, feedback controller 508 uses the correlation coefficient ρ as a feedback variable (
Referring now to
Referring to
Referring now to
Referring now to
Referring now to
Extremum-Seeking Controller with Correlation Coefficient Adjuster
Referring now to
In some embodiments, correlation coefficient estimator 528 provides ρ to filter 548. In some embodiments, filter 548 filters ρ to determine a filtered version of the correlation coefficient ρf2 and provides ρf2 to convergence dampener 546. In some embodiments, convergence dampener 546 determines a scaled version of ρadj, shown as ρadj′, and provides the scaled and adjusted correlation coefficient ρadj′ to feedback controller 508.
In some embodiments, the correlation coefficient ρ between two signals x (e.g., the signal associated with manipulated variable u) and y0 (e.g., a noise-less signal associated with performance variable y) can be determined as shown below:
where ρx,y
In some embodiments, the signal associated with the performance variable y has noise associated with it. When noise is present, the signal y0 (e.g., the noiseless signal associated with the performance variable y) is combined with a noise signal e such that σy2=σy
where σe2 is a variance of signal y due to noise, and y is a signal associated with the performance variable y which includes variation due to noise (e.g., signal e), according to some embodiments.
In some embodiments, correlation coefficient estimator 528 uses
to determine ρ2 without taking into account σe2, where σxy is covariance between the signal x associated with the control variable u, σx2 is the variance of the signal associated with the manipulated variable u, and σy2 is the variance of the signal associated with the performance variable y.
Since an error or noise signal, e, is an independent signal uncorrelated with signals x or y, σxy2=σxy
according to some embodiments. From the equation above, it can be seen that the correlation coefficient ρx,y
Various approaches can be adopted to counteract the effect of σe2, according to some embodiments. One approach is to use an estimate of the noise-free correlation coefficient ρ by solving the above equation for ρx,y
Another approach to counteract the effect of σe2 is described hereinbelow, according to some embodiments. In some embodiments, correlation coefficient adjuster 544 is configured to receive ρ as determined by correlation coefficient estimator 528 and determine ρadj.
Since the correlation coefficient ρxy
max(ρx,y02)=1
according to some embodiments. The maximum value of ρxy2 can be observed and the ratio of this maximum value can be used to estimate σe2:
since the only unknown value in the equation above is σe2.
In some embodiments, due to variance in sample estimation of ρx,y2, tracking the maximum value of ρx,y2 would lead to an inflated value of ρx,y2 which may underestimate the effect of σe2. In some embodiments, mitigating measures may be taken to account for the variance in the estimation of ρx,y2. In some embodiments, ρx,y2 (i.e., φ is received by correlation coefficient adjuster 544 from correlation coefficient estimator 528 and is filtered with filter 550. In some embodiments, filter 550 uses a first order filter shown below to determine a filtered correlation coefficient:
ρf1,k2=ρf1,k-12+λ(ρx,y2−ρf1,k-12)
where λ is a forgetting factor, ρf1,k is a filtered correlation coefficient at time step k, and ρf1,k is the filtered correlation coefficient at a previous time step k−1. In some embodiments, the maximum value of ρf1,k2 over a time period can be recorded (e.g., selected) by maximum correlation coefficient manager 552 and used to rescale ρx,y2 (the correlation coefficient determined based on the noisy performance variable y signal). However, remaining sample variance in ρf1,k may cause the maximum value of ρf1,k to increase over time, according to some embodiments. In some embodiments, filter 550 is optional. For example, maximum correlation coefficient manager 552 may receive the correlation coefficient ρ from correlation coefficient estimator 528 and perform its respective functions using values of the correlation coefficient ρ.
In some embodiments, maximum correlation coefficient manager 552 is configured to allow the maximum value of ρf1,k2 to decay over time using the equation:
ρmax,k2=max(ρmax,k-12(1−λ),ρf1,k2)
where ρmax,k2 is a filtered maximum value at time step k, ρf1,k2 is the filtered squared correlation coefficient at time step k, ρmax,k-12 is a filtered maximum value at a previous time step k−1, and λ is a forgetting factor. In this way, ρmax,k2 is prevented from becoming stuck or stagnating at a maximum value, according to some embodiments. In some embodiments, ρmax,k2 decays over time, and if a present value of ρf1,k2 does not exceed the previous maximum value multiplied by some amount (e.g., (1−λ)), the new ρmax,k2 value is ρmax,k-12(1−λ). Allowing ρmax,k2 to diminish or decay over time ensures that the noise adjustment does not become saturated and inactive, according to some embodiments. In some embodiments, this is advantageous for applications when the noise changes over time. In some embodiments, if the noise increases, the expected maximum value of ρ2 would decrease. In this way, ρmax2 is allowed to be reset to lower values, according to some embodiments.
In some embodiments, adjusted correlation coefficient generator 554 is configured to receive the correlation coefficient ρx,y from correlation coefficient estimator 528, and the maximum correlation coefficient at time step k (i.e., ρmax,k) from maximum correlation coefficient manager 552 to determine an adjusted ρ value (e.g., an adjusted ρxy) using the equation shown below:
where ρadj is the adjusted correlation coefficient, ρmax,k is the maximum correlation coefficient value at time step k, and ρx,y is the correlation coefficient at time step k. In some embodiments, ρx,y is ρ as determined by correlation coefficient estimator 528.
It should be noted that ρadj is not constrained within the range −1 to 1, according to some embodiments. In some embodiments, since ρadj is not constrained within the range −1 to 1 (e.g., is not normalized), ρadj does not indicate a degree of correlation between two signals, but rather is a feedback variable to be used by feedback controller 508.
In some embodiments, feedback controller 508 is an integrator which uses ρadj as a feedback variable. In some embodiments, feedback controller 508 is configured to receive the adjusted correlation coefficient from correlation coefficient adjuster 544 and use ρadj to determine a current value of the manipulated variable u using the equation:
u
k
=u
k-1−κρadj
where uk is the manipulated variable u at a present time k, uk-1 is a value of the manipulated variable u at a time k−1, K is the feedback/controller gain, and ρadj is the adjusted correlation coefficient.
Substituting the ρadj equation into the equation above yields:
according to some embodiments. In some embodiments, feedback controller 508 is configured to directly receive ρx,y from correlation coefficient estimator 528, and directly receive ρmax,k from maximum correlation coefficient manager 552 and uses the above equation to determine uk.
In some embodiments, feedback controller 508 uses a scaled adjusted correlation coefficient ρadj′. In some embodiments, feedback controller 508 receives ρadj′ from convergence dampener 546.
Filter 548 is configured to receive ρx,y from correlation coefficient estimator 528, according to some embodiments. In some embodiments, filter 548 is configured to determine a filtered version of ρx,y using the equation:
ρf2,k=ρf2,k-1+λ(ρx,y−ρf2,k-1)
where ρf2,k is a filtered value of ρx,y at time k, ρf2,k-1 is the filtered value of ρx,y at a previous time k−1, and λ is a forgetting factor.
In some embodiments, filter 548 determines ρf2,k and provides ρf2,k to convergence dampener 546. In some embodiments, convergence dampener 546 is configured to determine a scaled value of ρadj (i.e., ρadj) to dampen sensitivity of the integration when feedback controller 508 converges to an optimum u value and ρ approaches zero. In some embodiments, convergence dampener 546 determines ρadj′ using the equation:
where ρf2,k is the filtered value of ρx,y as determined by filter 548, ρmax,k is the maximum correlation coefficient determined by maximum correlation coefficient manager 552 and ρadj is the adjusted correlation coefficient determined by adjusted correlation coefficient generator 554 and/or correlation coefficient adjuster 544.
It should also be noted that the ρadj′ is also normalized by ρmax,k in the equation above in order to keep the scaling consistent for different noise levels, according to some embodiments. Feedback controller 508 uses ρadj′ to determine a current value at time k of the manipulated variable u using the equation:
u
k
=u
k-1−κρadj′
according to some embodiments. In some embodiments, convergence dampener 546 and/or filter 548 are optional. For example, feedback controller 508 may use the adjusted correlation coefficient ρadj and perform its functionality using the adjusted correlation coefficient ρadj. In some embodiments, filter 548 is optional and convergence dampener 546 uses values of the correlation coefficient ρ to perform its functionality instead of ρf2,k.
In some embodiments, feedback controller 508 receives ρx,y directly from correlation coefficient estimator 528, ρf2,k directly from filter 548, and ρmax,k directly from maximum correlation coefficient manager 552 and uses the equation:
to determine uk to the ESC. When feedback controller 508 converges, the expected value of ρ (e.g., ρx,y) as determined by correlation coefficient estimator 528 approaches zero, causing the integration to be less sensitive to changes in ρx,y, according to some embodiments.
The controller/feedback K is determined by feedback controller 508 using a dominant time constant of plant 504 that may be calculated using the equation:
where Δt is a time step, and τ is the dominant (e.g., largest) time constant of plant 504, according to some embodiments.
The forgetting factor λ is determined (e.g., by feedback controller 508) based on the dominant time constant of plant 504 as well, using the equation:
according to some embodiments.
Referring now to
Referring now to
Process 2000 includes receiving a signal associated with a performance variable and a signal associated with a control variable (step 2002), according to some embodiments. In some embodiments, the signal associated with the performance variable and the signal associated with the control variable are received by extremum-seeking controller 502b. In some embodiments, the signal associated with the performance variable and the signal associated with the control variable are received by correlation coefficient estimator 528. In some embodiments, the performance variable is a performance variable of a plant. In some embodiments, the control variable is a control variable of the plant.
Process 2000 includes determining a correlation coefficient based on the signal associated with the performance variable and the signal associated with the control variable (step 2004), according to some embodiments. In some embodiments, step 2004 includes determining the correlation coefficient ρ. In some embodiments, step 2004 is performed by correlation coefficient estimator 528.
Process 2000 includes determining a first filtered correlation coefficient based on the determined correlation coefficient (step 2006), according to some embodiments. In some embodiments, the first filtered correlation coefficient is ρf1,k. In some embodiments, the first filtered correlation coefficient is determined by correlation coefficient adjuster 544. In some embodiments, the first filtered correlation coefficient is determined by filter 550. In some embodiments, step 2006 includes using a first order filter.
Process 2000 includes determining a maximum correlation coefficient based on the first filtered correlation coefficient, a forgetting factor, and a previously determined maximum correlation coefficient (step 2008), according to some embodiments. In some embodiment, step 2008 is performed by maximum correlation coefficient manager 552. In some embodiments, the forgetting factor is λ. In some embodiments, the forgetting factor λ is determined based on a time step and a maximum time constant τ of the plant. In some embodiments, the maximum correlation coefficient is ρmax,k. In some embodiments, step 2008 is performed by maximum correlation coefficient manager 552 using outputs of filter 550 (i.e., ρf1,k). In some embodiments, step 2008 is performed by maximum correlation coefficient manager 552 using the outputs of correlation coefficient estimator 528 (i.e., φ.
Process 2000 includes determining an adjusted correlation coefficient based on the maximum correlation coefficient, and the correlation coefficient determined based on the signals associated with the control variable and the performance variable (step 2010), according to some embodiments. In some embodiments, step 2010 is performed by correlation coefficient adjuster 544. In some embodiments, step 2010 is performed by adjusted correlation coefficient generator 554.
Process 2000 includes determining a second filtered correlation coefficient based on a forgetting factor, a previously determined second filtered correlation coefficient value, and the correlation coefficient determined based on the signals associated with the control variable and the performance variable (step 2012), according to some embodiments. In some embodiments, step 2012 is performed by filter 548. In some embodiments, the forgetting factor of step 2012 is the forgetting factor of step 2008.
Process 2000 includes scaling (e.g., normalizing) the adjusted correlation coefficient based on the adjusted correlation coefficient, the second filtered correlation coefficient, and the correlation coefficient determined based on the signals associated with the control variable and the performance variable (step 2014), according to some embodiments. In some embodiments, step 2014 is performed by convergence dampener 546. In some embodiments, step 2014 results in a decreased sensitivity to changes in the correlation coefficient determined based on the signals associated with the control variable and the performance variable when the ESC is converging. In some embodiments, step 2014 determines ρadj′. In some embodiments, step 2014 is performed by feedback controller 508.
Process 2000 includes performing ESC using either the adjusted correlation coefficient (determined in step 2010) or using the scaled adjusted correlation coefficient (step 2016), according to some embodiments. In some embodiments, step 2016 includes using either the adjusted correlation coefficient or the scaled adjusted correlation coefficient to perform ESC in addition to using a controller gain value. In some embodiments, the controller gain value is K. In some embodiments, K is determined based on a time step and a maximum/dominant time constant of the plant.
Advantageously, process 2000 can be performed for implementing ESC even when a signal associated with the performance variable is noisy. Using process 2000 facilitates a rapid convergence even when the signal associated with the performance variable increases. Other ESC systems do not necessarily account for or mitigate the noise in the signal associated with the performance variable, and the convergence rate of such ESC systems may be adversely affected by the noisy signal.
The construction and arrangement of the systems and methods as shown in the various exemplary embodiments are illustrative only. Although only a few embodiments have been described in detail in this disclosure, many modifications are possible (e.g., variations in sizes, dimensions, structures, shapes and proportions of the various elements, values of parameters, mounting arrangements, use of materials, colors, orientations, etc.). For example, the position of elements may be reversed or otherwise varied and the nature or number of discrete elements or positions may be altered or varied. Accordingly, all such modifications are intended to be included within the scope of the present disclosure. The order or sequence of any process or method steps may be varied or re-sequenced according to alternative embodiments. Other substitutions, modifications, changes, and omissions may be made in the design, operating conditions and arrangement of the exemplary embodiments without departing from the scope of the present disclosure.
The present disclosure contemplates methods, systems and program products on any machine-readable media for accomplishing various operations. The embodiments of the present disclosure may be implemented using existing computer processors, or by a special purpose computer processor for an appropriate system, incorporated for this or another purpose, or by a hardwired system. Embodiments within the scope of the present disclosure include program products comprising machine-readable media for carrying or having machine-executable instructions or data structures stored thereon. Such machine-readable media can be any available media that can be accessed by a general purpose or special purpose computer or other machine with a processor. By way of example, such machine-readable media can include RAM, ROM, EPROM, EEPROM, CD-ROM or other optical disk storage, magnetic disk storage or other magnetic storage devices, or any other medium which can be used to carry or store desired program code in the form of machine-executable instructions or data structures and which can be accessed by a general purpose or special purpose computer or other machine with a processor. Combinations of the above are also included within the scope of machine-readable media. Machine-executable instructions include, for example, instructions and data which cause a general purpose computer, special purpose computer, or special purpose processing machines to perform a certain function or group of functions.
Although the figures show a specific order of method steps, the order of the steps may differ from what is depicted. Also two or more steps may be performed concurrently or with partial concurrence. Such variation will depend on the software and hardware systems chosen and on designer choice. All such variations are within the scope of the disclosure. Likewise, software implementations could be accomplished with standard programming techniques with rule based logic and other logic to accomplish the various connection steps, processing steps, comparison steps and decision steps.
This application is a continuation-in-part of U.S. patent application Ser. No. 16/438,854 filed Jun. 12, 2019, which is a continuation of U.S. patent application Ser. No. 15/080,435 filed Mar. 24, 2016, now U.S. Pat. No. 10,352,576 granted Jul. 16, 2019, which claims the benefit of and priority to U.S. Provisional Patent Application No. 62/296,713 filed Feb. 18, 2016, the entire disclosures of which are incorporated by reference herein. This application also claims the benefit of and priority to U.S. Provisional Patent Application No. 62/798,914 filed Jan. 30, 2019, incorporated herein by reference in its entirety.
Number | Date | Country | |
---|---|---|---|
62798914 | Jan 2019 | US | |
62296713 | Feb 2016 | US |
Number | Date | Country | |
---|---|---|---|
Parent | 15080435 | Mar 2016 | US |
Child | 16438854 | US |
Number | Date | Country | |
---|---|---|---|
Parent | 16438854 | Jun 2019 | US |
Child | 16740020 | US |