The present disclosure relates to an apparatus control device and an apparatus control method.
There is an apparatus control device that calculates a control value of a control target apparatus. For example, Patent Literature 1 discloses an air conditioning control device that acquires a control value of an air conditioner serving as a control target apparatus.
The air conditioning control device includes an acquisition unit that acquires, from a sensor that observes an environment in which the air conditioning control device is installed, an observed value of the environment and a control-content determination unit that gives the observed value acquired by the acquisition unit to a learning model and acquires a control value of the air conditioner from the learning model.
As information to be considered in order to more appropriately control a control target apparatus, in addition to an observable value that can be observed by a sensor, an unobservable value that cannot be directly observed by the sensor may be required.
For example, in the air conditioning control device disclosed in Patent Literature 1, an observable value such as a temperature that can be observed by a sensor is considered, but an unobservable value such as a thermal load is not considered. In a case where the thermal load or the like is not considered, it is difficult to correctly estimate how the environment changes depending on the output of an air conditioner, so that appropriate air conditioning control cannot be executed.
A conventional apparatus control device like the air conditioning control device disclosed in Patent Literature 1 has a problem that a control value is not calculated in consideration of an unobservable value.
The present disclosure has been made to solve the above problems, and an object thereof is to obtain an apparatus control device and an apparatus control method capable of acquiring a control value that changes depending on an unobservable value that is a value not directly observed by a sensor.
An apparatus control device according to the present disclosure includes a processor; and a memory storing a program, upon executed by the processor, to perform a process: to acquire, from a sensor to observe an environment in which a control target apparatus is installed, an observed value of the environment; to give an observed value acquired to a first learning model and acquire an observation predicted value that is a future observed value from the sensor from the first learning model; to give an observed value acquired to a second learning model and acquire an unobservable value that is a value not directly observed by the sensor from the second learning model; and to calculate a control value of the control target apparatus using an observed value acquired, an observation predicted value acquired, and an unobservable value acquired, wherein the process includes to, by substituting an observed value acquired, an observation predicted value acquired, and an unobservable value acquired into each of equations of state in each of the plurality of control methods, calculate a control value for each of a plurality of control methods as a control value of the control target apparatus, and to select a control value for any one of the plurality of control methods among control values for the plurality of control methods calculated.
According to the present disclosure, it is possible to acquire a control value that changes depending on the unobservable value that is a value not directly observed by the sensor.
Hereinafter, in order to describe the present disclosure in more detail, embodiments for carrying out the present disclosure will be described with reference to the accompanying drawings.
The apparatus control system illustrated in
The apparatus control system illustrated in
The sensor 2-n (n=1, . . . , N) observes an environment in which the air conditioner 1 is installed, and outputs an observed value of the environment to the apparatus control device 3.
Examples of the sensor 2-n include a room temperature sensor that observes the indoor temperature of a room in which the air conditioner 1 is installed, an outside air temperature sensor that observes the outside air temperature of the room, a humidity sensor that observes the humidity of the room, a solar radiation sensor that observes the amount of solar radiation to the room, and a human sensor that observes the number of people present in the room.
The apparatus control device 3 includes an observed value acquiring unit 11, an observation predicted value acquiring unit 12, an unobservable value acquiring unit 13, a control value calculating unit 14, and a display data generating unit 17.
The display device 4 includes a display.
The display device 4 displays the control value and the like of the air conditioner 1 on the display based on display data output from the apparatus control device 3.
The observed value acquiring unit 11 is implemented by, for example, an observed value acquiring circuit 21 illustrated in
The observed value acquiring unit 11 acquires an observed value of an environment from the sensor 2-n (n=1, . . . , N).
The observed value acquiring unit 11 outputs the observed value of the environment to each of the observation predicted value acquiring unit 12, the unobservable value acquiring unit 13, the control value calculating unit 14, and the display data generating unit 17.
The observation predicted value acquiring unit 12 is implemented by, for example, an observation predicted value acquiring circuit 22 illustrated in
An internal memory of the observation predicted value acquiring unit 12 stores a first learning model 12a.
The first learning model 12a is implemented by, for example, a neural network model or a deep learning model.
In the first learning model 12a, at the time of learning, the observed value of the environment from the sensor 2-n (n=1, . . . , N) is given as input data, and an observation predicted value that is a future observed value from the sensor 2-n is given as training data. The first learning model 12a outputs an observation predicted value corresponding to the input data at the time of inference by learning the observation predicted value at the time of learning.
In the apparatus control device 3 illustrated in
The observation predicted value acquiring unit 12 gives the observed value acquired by the observed value acquiring unit 11 to the first learning model 12a, and acquires the observation predicted value from the first learning model 12a.
The observation predicted value acquiring unit 12 outputs the observation predicted value acquired from the first learning model 12a to each of the control value calculating unit 14 and the display data generating unit 17.
The unobservable value acquiring unit 13 is implemented by, for example, an unobservable value acquiring circuit 23 illustrated in
An internal memory of the unobservable value acquiring unit 13 stores a second learning model 13a.
The second learning model 13a is implemented by, for example, a neural network model or a deep learning model.
In the second learning model 13a, at the time of learning, the observed value of the environment from the sensor 2-n (n=1, . . . , N) is given as input data, and an unobservable value that is a value not directly observed by the sensor 2-n is given as training data. The second learning model 13a outputs an unobservable value corresponding to the input data at the time of inference by learning the unobservable value at the time of learning.
In the apparatus control device 3 illustrated in
The unobservable value acquiring unit 13 gives the observed value acquired by the observed value acquiring unit 11 to the second learning model 13a, and acquires the unobservable value that is a value not directly observed by the sensor 2-n from the second learning model 13a.
The unobservable value acquiring unit 13 outputs the unobservable value acquired from the second learning model 13a to each of the control value calculating unit 14 and the display data generating unit 17.
The control value calculating unit 14 is implemented by, for example, a control value calculating circuit 24 illustrated in
The control value calculating unit 14 includes a state prediction unit 15 and a control value selecting unit 16.
The control value calculating unit 14 acquires an observed value from the observed value acquiring unit 11, acquires an observation predicted value from the observation predicted value acquiring unit 12, and acquires an unobservable value from the unobservable value acquiring unit 13.
The control value calculating unit 14 calculates the control value of the air conditioner 1 using the observed value acquired by the observed value acquiring unit 11, the observation predicted value acquired by the observation predicted value acquiring unit 12, and the unobservable value acquired by the unobservable value acquiring unit 13.
The control value calculating unit 14 outputs the control value to each of the air conditioner 1 and the display data generating unit 17.
The state prediction unit 15 substitutes the observed value acquired by the observed value acquiring unit 11, the observation predicted value acquired by the observation predicted value acquiring unit 12, and the unobservable value acquired by the unobservable value acquiring unit 13 into an equation of state, and obtains the solution of the equation of state as the control value of the air conditioner 1.
In addition, when each equation of state in each of a plurality of control methods is prepared as the equation of state, the state prediction unit 15 substitutes the observed value, the observation predicted value, and the unobservable value into each equation of state, thereby calculating a control value for each control method.
The control value selecting unit 16 selects a control value for any one control method among control values for a plurality of control methods.
The control value selecting unit 16 outputs the selected control value to each of the air conditioner 1 and the display data generating unit 17.
The display data generating unit 17 is implemented by, for example, a display data generating circuit 25 illustrated in
The display data generating unit 17 generates display data for displaying one or more of the unobservable value acquired by the unobservable value acquiring unit 13 and the control value calculated by the control value calculating unit 14, and outputs the display data to the display device 4.
In addition, the display data generating unit 17 generates display data for displaying one or more of the observed value acquired by the observed value acquiring unit 11 and the observation predicted value acquired by the observation predicted value acquiring unit 12, and outputs the display data to the display device 4.
In
Each of the observed value acquiring circuit 21, the observation predicted value acquiring circuit 22, the unobservable value acquiring circuit 23, the control value calculating circuit 24, and the display data generating circuit 25 corresponds to, for example, a single circuit, a composite circuit, a programmed processor, a parallel programmed processor, an application specific integrated circuit (ASIC), a field-programmable gate array (FPGA), or a combination thereof.
The components of the apparatus control device 3 are not limited to those implemented by dedicated hardware, and the apparatus control device 3 may be implemented by software, firmware, or a combination of software and firmware.
The software or firmware is stored in a memory of a computer as a program. The computer means hardware that executes the program, and corresponds to, for example, a central processing unit (CPU), a central processing device, a processing device, an arithmetic device, a microprocessor, a microcomputer, a processor, or a digital signal processor (DSP).
In a case where the apparatus control device 3 is implemented by software, firmware, or the like, a program for causing a computer to execute each processing procedure in each of the observed value acquiring unit 11, the observation predicted value acquiring unit 12, the unobservable value acquiring unit 13, the control value calculating unit 14, and the display data generating unit 17 is stored in a memory 31. Then, a processor 32 of the computer executes the program stored in the memory 31.
In addition,
Next, an operation of the apparatus control system illustrated in
In the apparatus control system illustrated in
J room temperature sensors are individually distinguished by 2′-1 to 2′-J, and K human sensors are individually distinguished by 2″-1 to 2″-K.
The observed value of room temperature observed by the room temperature sensor 2′-j (j=1, . . . , J) is Xj(t), and the observed value of the number of people observed by the human sensor 2″-k (k=1, . . . , K) is Yk(t). t is a current time, and t+q is a future time. q=1, . . . , Q, and Q is an integer equal to or more than one.
The room temperature sensor 2′-j (j=1, . . . , J) outputs the observed value Xj(t) of the room temperature to the apparatus control device 3.
The human sensor 2″-k (k=1, . . . , K) outputs the observed value Yk(t) of the number of people to the apparatus control device 3.
In the apparatus control system illustrated in
The observed value acquiring unit 11 of the apparatus control device 3 acquires the observed value Xj(t) of the room temperature as the observed value of the environment from the room temperature sensor 2′-j (j=1, . . . , J) (step ST1 in
In addition, the observed value acquiring unit 11 acquires the observed value Yk(t) of the number of people as the observed value of the environment from the human sensor 2″-k (k=1, . . . , K) (step ST1 in
The observed value acquiring unit 11 outputs the observed value Xj(t) of the room temperature and the observed value Yk(t) of the number of people to each of the observation predicted value acquiring unit 12, the unobservable value acquiring unit 13, the control value calculating unit 14, and the display data generating unit 17.
The observation predicted value acquiring unit 12 acquires each of the observed value Xj(t) (j=1, . . . , J) of the room temperature and the observed value Yk(t) (k=1, . . . , K) of the number of people from the observed value acquiring unit 11.
At the time of learning, the first learning model 12a is assumed to be given, as the input data, a set temperature TSET of the air conditioner 1 in addition to the observed value Xj(t) of the room temperature and the observed value Yk(t) of the number of people, for example. Furthermore, it is assumed that a future observed value Xj(t+q) (q=1, . . . , Q) of the room temperature is given to the first learning model 12a as the training data. In this case, the first learning model 12a learns the future observed value Xj(t+q) of the room temperature.
The future observed value Xj(t+q) of the room temperature, which is the training data, is simulated by a simulator (not illustrated) on the basis of the observed value Xj(t) of the room temperature, the observed value Yk(t) of the number of people, and the set temperature TSET, for example. If the observed value Yk(t) of the number of people is known, it is possible to estimate the total approximate value of the amount of heat output from one or more persons present in the room. In addition, if the observed value Xj(t) of the room temperature and the set temperature TSET are known, it is possible to estimate the operation rate that is the workload of the air conditioner 1. Therefore, if the observed value Xj(t) of the room temperature, the observed value Yk(t) of the number of people, and the set temperature TSET are given, the simulator can simulate the future observed value Xj(t+q) of the room temperature. Here, if the set temperature TSET is a fixed temperature fixed to, for example, 23 degrees, the simulator can simulate the future observed value Xj(t+q) of the room temperature using the fixed temperature instead of the set temperature TSET. In this case, even if the set temperature TSET is not given to the first learning model 12a at the time of learning, the first learning model 12a can learn the future observed value Xj(t+q) of the room temperature.
Here, the simulator simulates the future observed value Xj(t+q) of the room temperature using the observed value Xj(t) of the room temperature, the observed value Yk(t) of the number of people, and the set temperature TSET. However, this is merely an example, and even if the simulator uses the observed value of the daily illuminance from the daily illuminance sensor instead of the observed value Yk(t) of the number of people, it is possible to simulate the future observed value Xj(t+q) of the room temperature. In this case, the first learning model 12a learns the future observed value Xj(t+q) of the room temperature if, at the time of learning, the observed value Xj(t) of the room temperature, the observed value of the daily illuminance, and the set temperature TSET are given as the input data and the future observed value Xj(t+q) of the room temperature is given as the training data.
In addition, the simulator can simulate the future observed value Xj(t+q) of the room temperature using, for example, the observed value of the daily illuminance in addition to the observed value Xj(t) of the room temperature, the observed value Yk(t) of the number of people, and the set temperature TSET. In general, the simulation accuracy of the simulator improves as the number of types of input data increases. In this case, the first learning model 12a learns the future observed value Xj(t+q) of the room temperature if, at the time of learning, the observed value Xj(t) of the room temperature, the observed value Yk(t) of the number of people, the observed value of the daily illuminance, and the set temperature TSET are given as the input data and the future observed value Xj(t+q) of the room temperature is given as the training data.
The simulator may simulate the future observed value of humidity using, for example, the observed value of humidity from the humidity sensor and the observed value Yk(t) of the number of people, instead of the observed value Xj(t) of the room temperature. In this case, the first learning model 12a learns the future observed value of humidity if, at the time of learning, the observed value of humidity from the humidity sensor and the observed value Yk(t) of the number of people are given as the input data and the future observed value of humidity is given as the training data. If the first learning model 12a learns the future observed value of humidity, the future observed value of humidity can be output at the time of inference.
If the input data at the time of learning includes, for example, the observed value Xj(t) of the room temperature and the observed value Yk(t) of the number of people, the observation predicted value acquiring unit 12 gives the observed value Xj(t) of the room temperature and the observed value Yk(t) of the number of people to the first learning model 12a at the time of inference.
If the input data at the time of learning includes, for example, the observed value Xj(t) of the room temperature, the observed value Yk(t) of the number of people, and the set temperature TSET, the observation predicted value acquiring unit 12 gives the observed value Xj(t) of the room temperature, the observed value Yk(t) of the number of people, and the set temperature TSET to the first learning model 12a at the time of inference.
The observation predicted value acquiring unit 12 acquires, from the first learning model 12a, an observation predicted value Xj(t+q) that is a future observed value Xj(t+q) (q=1, . . . , Q) of the room temperature as an inference result (step ST2 in
The observation predicted value acquiring unit 12 outputs the observation predicted value Xj(t+q) to each of the control value calculating unit 14 and the display data generating unit 17.
The unobservable value acquiring unit 13 acquires the observed value Yk(t) (k=1, . . . , K) of the number of people from the observed value acquiring unit 11.
It is assumed that, at the time of learning, each of the observed value Yk(t) of the number of people and history data is given to the second learning model 13a as the input data. In addition, it is assumed that the current unobservable value Zm(t) and the future unobservable value Zm(t+q) (q=1, . . . , Q) are given to the second learning model 13a as the training data. m=1, . . . , M, and M is an integer equal to or more than one. In this case, the second learning model 13a learns each of the current unobservable value Zm(t) and the future unobservable value Zm(t+q). The history data is data indicating a change in the number of people in the past with the lapse of time. Furthermore, the history data is obtained by recording the observed value of the number of people in the past output from the observed value acquiring unit 11.
Each of the current unobservable value Zm(t) and the future unobservable value Zm(t+q), which are the training data, is simulated by a simulator (not illustrated) on the basis of the observed value Yk(t) of the number of people and the history data. If the observed value Yk(t) of the number of people is known, it is possible to estimate the total approximate value of the amount of heat output from one or more persons present in the room. In addition, if the history data can be obtained, the tendency of the change in the number of people with the lapse of time can be known, thus it is possible to estimate the tendency of the change in the thermal load. Therefore, in a case where the unobservable value is a thermal load, the simulator can simulate each of the current unobservable value Zm(t) and the future unobservable value Zm(t+q) (q=1, . . . , Q) if the observed value Yk(t) of the number of people and the history data are given.
Here, the simulator simulates each of the current unobservable value Zm(t) and the future unobservable value Zm(t+q) using the observed value Yk(t) of the number of people and the history data. However, this is merely an example, and the simulator may simulate each of the current unobservable value Zm(t) and the future unobservable value Zm(t+q) using the observed value Yk(t) of the number of people and the past observed value Yk(t−1) of the number of people.
In this case, the second learning model 13a learns each of the current unobservable value Zm(t) and the future unobservable value Zm(t+q) if, at the time of learning, the observed value Yk(t) of the number of people and the past observed value Yk(t−1) of the number of people are given as the input data and the current unobservable value Zm(t) and the future unobservable value Zm(t+q) are given as the training data.
In addition, the simulator can simulate each of the current unobservable value Zm(t) and the future unobservable value Zm(t+q) using, for example, the observed value of the daily illuminance in addition to the observed value Yk(t) of the number of people and the past observed value Yk(t−1) of the number of people. In general, the simulation accuracy of the simulator improves as the number of types of input data increases. In this case, the second learning model 13a learns each of the current unobservable value Zm(t) and the future unobservable value Zm(t+q) if, at the time of learning, the observed value Yk(t) of the number of people, the past observed value Yk(t−1) of the number of people, and the observed value of the daily illuminance are given as the input data and the current unobservable value Zm(t) and the future unobservable value Zm(t+q) are given as the training data.
In the apparatus control system illustrated in
Examples of the unobservable value other than the thermal load include convection in a room that is an environment in which the air conditioner 1 is installed.
For example, if the observed value Xj(t) of the room temperature, the blowing direction of the air conditioner 1, and layout data are used, the simulator can simulate the convection in the room as the unobservable value. The layout data is data indicating the arrangement of furniture or the like installed in a room. In this case, if, at the time of learning, the observed value Xj(t) of the room temperature, the blowing direction of the air conditioner 1, and the layout data are given as the input data, and the current convection and the future convection are given as the training data, the second learning model 13a determines that the unobservable value is the convection, and learns each of the current unobservable value Zm(t) and the future unobservable value Zm(t+q).
For example, if the blowing direction of the air conditioner 1 and the layout data are constant, the second learning model 13a can learn each of the current unobservable value Zm(t) and the future unobservable value Zm(t+q) if only the observed value Xj(t) of the room temperature is given as the input data and the current convection and the future convection are given as the training data.
In addition, examples of the unobservable value other than the thermal load include the opening and closing degree of a valve under a situation in which an opening degree sensor is not installed, the amount of microorganisms under a situation in which a microorganism detection sensor is not installed, and the number of people under a situation in which a human sensor is not installed.
If the input data at the time of learning includes, for example, the observed value Yk(t) of the number of people and the past observed value Yk(t−1) of the number of people, the unobservable value acquiring unit 13 gives the observed value Yk(t) of the number of people and the past observed value Yk(t−1) of the number of people to the second learning model 13a at the time of inference.
If the input data at the time of learning includes, for example, the observed value Yk(t) of the number of people and the history data, the unobservable value acquiring unit 13 gives the observed value Yk(t) of the number of people and the history data to the second learning model 13a at the time of inference.
The unobservable value acquiring unit 13 acquires, from the second learning model 13a, each of the current unobservable value Zm(t) and the future unobservable value Zm(t+q) as an inference result (step ST3 in
The unobservable value acquiring unit 13 outputs the current unobservable value Zm(t) and the future unobservable value Zm(t+q) to each of the control value calculating unit 14 and the display data generating unit 17.
The state prediction unit 15 of the control value calculating unit 14 acquires the observed value Xj(t) of the room temperature and the observed value Yk(t) of the number of people from the observed value acquiring unit 11.
In addition, the state prediction unit 15 acquires the observation predicted value Xj(t+q) (q=1, . . . , Q) from the observation predicted value acquiring unit 12.
The state prediction unit 15 further acquires, from the unobservable value acquiring unit 13, each of the current unobservable value Zm(t) and the future unobservable value Zm(t+q).
The state prediction unit 15 acquires equations of state in C control methods from the internal memory. C is an integer equal to or larger than two. Examples of the control method include proportional integral differential (PID) control, proportional control, open sound control (OSC), model predictive control (MPC), and control in deep learning.
The following equation (1) shows an example of the equation of state.
In Equation (1), Wg(t) is a control value of the air conditioner 1. g=1, . . . , G, and G is an integer equal to or more than one.
Each of P1j, P2k, and P3g is a parameter of the apparatus control system and is a default value.
When calculating G current control values Wg(t) (g=1, . . . , G) in the air conditioner 1, the state prediction unit 15 substitutes the observed value Xj(t) of the room temperature, the observed value Yk(t) of the number of people, the observation predicted value Xj(t+1), and the unobservable value Zm(t) into each equation of state.
The state prediction unit 15 obtains solutions of the equations of state as G control values Wg(t) (g=1, . . . , G) (step ST4 in
Here, it is assumed that the observed value Xj(t) and the observed value Yk(t) used by the state prediction unit 15 to calculate the control value Wg(t) are the same as the observed value Xj(t) and the observed value Yk(t), respectively, which are used when the unobservable value acquiring unit 13 acquires the unobservable value Zm(t) and the like. However, this is merely an example, and the observed value Xj(t) and the observed value Yk(t) used by the state prediction unit 15 to calculate the control value Wg(t) may be different from the observed value Xj(t) and the observed value Yk(t), respectively, which are used when the unobservable value acquiring unit 13 acquires the unobservable value Zm(t) and the like.
That is, the observed value Xj(t) used by the state prediction unit 15 is only required to be an observed value of the sensor 2-n and may be an observed value other than the room temperature. If Xj(t) is an observed value other than the room temperature, Xj(t+1) is an observation predicted value other than the room temperature.
In addition, the observed value Yk(t) used by the state prediction unit 15 is only required to be an observed value of the sensor 2-n and may be an observed value other than the number of people.
When calculating G future control values Wg(t+q) (g=1, . . . , G; q=1, . . . , Q) in the air conditioner 1, the state prediction unit 15 substitutes the observed value Xj(t;q) of the room temperature, the observed value Yk(t+q) of the number of people, the observation predicted value Xj(t+q+1), and the unobservable value Zm(t+q) into each equation of state.
The state prediction unit 15 obtains solutions of the equations of state as G control values Wg(t+q) (g=1, . . . , G) at a time t+q (step ST4 in
The state prediction unit 15 outputs, to the control value selecting unit 16, control values Wg(t), Wg(t+1), . . . , Wg(t+Q) at a plurality of times from the present to the future for each of the control methods as scheduling data Dc (c=1, . . . , C) of control values for the control methods.
The control value selecting unit 16 acquires the scheduling data Di to Dc for C control methods from the state prediction unit 15.
The control value selecting unit 16 selects the scheduling data Dc for any one of the control methods among the scheduling data Di to Dc for the C control methods.
That is, the control value selecting unit 16 selects the control values Wg(t), Wg(t+1), . . . , Wg(t+Q) for any one of the control methods (step ST5 in
Specifically, the control value selecting unit 16 compares predicted errors of the observation predicted value Xj(t) used when calculating the control value Wg(t−1) by each of the C control methods with each other.
That is, the control value selecting unit 16 compares the predicted errors of C observation predicted values Xj(t) acquired at the time (t−1) with each other. The predicted error of the observation predicted value Xj(t) is a difference between the observation predicted value Xj(t) and the observed value Xj(t) acquired at a time t.
The control value selecting unit 16 selects control values Wg(t), Wg(t+1), . . . , Wg(t+Q) with the smallest prediction error of the observation predicted value Xj(t) among the control values Wg(t), Wg(t+1), . . . , Wg(t+Q) for C control methods.
The control value selecting unit 16 outputs the selected control values Wg(t), Wg(t+1), . . . , Wg(t+Q) to each of the air conditioner 1 and the display data generating unit 17.
The air conditioner 1 acquires the control values Wg(t), Wg(t+1), . . . , Wg(t+Q) from the control value selecting unit 16 of the control value calculating unit 14.
The air conditioner 1 operates based on the control value Wg(t) at the time t. The air conditioner 1 operates based on the control value Wg(t+q) at a time t+q (q=1, . . . , Q).
The display data generating unit 17 acquires each of the observed value Xj(t) (j=1, . . . , J) of the room temperature and the observed value Yk(t) (k=1, . . . , K) of the number of people from the observed value acquiring unit 11.
In addition, the display data generating unit 17 acquires the observation predicted value Xj(t+q) (q=1, . . . , Q) from the observation predicted value acquiring unit 12.
The display data generating unit 17 acquires, from the unobservable value acquiring unit 13, each of the current unobservable value Zm(t) and the future unobservable value Zm(t+q) (g=1, . . . , G; q=1, . . . , Q).
In addition, the display data generating unit 17 acquires the control values Wg(t) and Wg(t+q) (g=1, . . . , G) from the control value selecting unit 16.
The display data generating unit 17 generates display data for displaying one or more of the unobservable value Zm(t) and the control values Wg(t), Wg(t+1), . . . , Wg(t+Q) (step ST6 in
In addition, the display data generating unit 17 generates display data for displaying one or more of the observed value Xj(t) of the room temperature, the observed value Yk(t) of the number of people, and the observation predicted value Xj(t+1) (step ST6 in
The display data generating unit 17 outputs the generated display data to the display device 4.
The display device 4 displays the observed value Xj(t) of the room temperature, the observed value Yk(t) of the number of people, the observation predicted value Xj(t+1), the unobservable value Zm(t), or the control values Wg(t), Wg(t+1), . . . , Wg(t+Q) on the display based on the display data output from the display data generating unit 17.
In the first embodiment described above, the apparatus control device 3 is configured to include the observed value acquiring unit 11 that acquires, from the sensor 2-n (n=1, . . . , N) that observes the environment in which the control target apparatus is installed, the observed value of the environment, the observation predicted value acquiring unit 12 that gives the observed value acquired by the observed value acquiring unit 11 to the first learning model 12a and acquires the observation predicted value that is the future observed value from the sensor 2-n from the first learning model 12a, and the unobservable value acquiring unit 13 that gives the observed value acquired by the observed value acquiring unit 11 to the second learning model 13a and acquires the unobservable value that is a value not directly observed by the sensor 2-n from the second learning model 13a. In addition, the apparatus control device 3 includes the control value calculating unit 14 that calculates the control value of the control target apparatus using the observed value acquired by the observed value acquiring unit 11, the observation predicted value acquired by the observation predicted value acquiring unit 12, and the unobservable value acquired by the unobservable value acquiring unit 13. Therefore, the apparatus control device 3 can acquire a control value that changes depending on the unobservable value that is a value not directly observed by the sensor 2-n.
In the apparatus control device 3 illustrated in
First, a user determines Δt that satisfies the following Equation (3) (step ST11 in
The state prediction unit 15 acquires observed values Xj(th−Δt), Xj(th), and Xj(tn+Δt) at certain times th−Δt, th, th+Δt from the observed value acquiring unit 11 (step ST12 in
Assuming that dX/dt is the right hand side of the following Equation (4), the state prediction unit 15 generates two equations related to the parameters P1j, P2k, and P3g included in the equation of state shown in Equation (1) (step ST13 in
The state prediction unit 15 obtains two equations at different times by repeating the processing of steps ST12 and ST13 a specified number of times.
The state prediction unit 15 determines each of the parameters P1j, P2k, and P3g on the basis of the two equations at different times (step ST14 in
The state prediction unit 15 calculates an unobservable value Zm(t) on the basis of the parameters P1j, P2k, and P3g (step ST15 in
Here, the state prediction unit 15 determines each of the parameters P1j, P2k, and P3g, and calculates the unobservable value Zm(t) on the basis of the parameters P1j, P2k, and P3g. However, this is merely an example, and the state prediction unit 15 may simply determine each of the parameters P1j, P2k, and P3g and output the unobservable value Zm(t) from the unobservable value acquiring unit 13.
The unobservable value Zm(t) based on the parameters P1j, P2k, and P3g can be obtained by the state prediction unit 15 performing a simulation, for example.
In a second embodiment, an apparatus control device 3 in which a control value calculating unit 18 includes a control value receiving unit 19 that receives a control value of a control target apparatus from the outside of the unit will be described.
The control value calculating unit 18 is implemented by, for example, a control value calculating circuit 26 illustrated in
The control value calculating unit 18 includes a state prediction unit 15, the control value receiving unit 19, and a control value selecting unit 20.
The control value receiving unit 19 receives control values W(t)′ and W(t+q)′ of an air conditioner 1 from the outside of the unit, and outputs control values W(t)′, W(t+1)′, . . . , W(t+Q)′ to the control value selecting unit 20.
The control value selecting unit 20 acquires control values Wg(t), Wg(t+1), . . . , Wg(t+Q) for C control methods from the state prediction unit 15, and acquires control values W(t)′, W(t+1)′, . . . , W(t+Q)′ from the control value receiving unit 19.
The control value selecting unit 20 selects any one control value among the control values Wg(t), Wg(t+1), . . . , Wg(t+Q) for C control methods and the control values W(t)′, W(t+1)′, . . . , W(t+Q)′.
The control value selecting unit 20 outputs the selected control value to each of the air conditioner 1 and a display data generating unit 17.
In
Each of the observed value acquiring circuit 21, the observation predicted value acquiring circuit 22, the unobservable value acquiring circuit 23, the control value calculating circuit 26, and the display data generating circuit 25 corresponds to, for example, a single circuit, a composite circuit, a programmed processor, a parallel programmed processor, ASIC, FPGA, or a combination thereof.
The components of the apparatus control device 3 are not limited to those implemented by dedicated hardware, and the apparatus control device 3 may be implemented by software, firmware, or a combination of software and firmware.
In a case where the apparatus control device 3 is implemented by software, firmware, or the like, a program for causing a computer to execute each processing procedure in each of the observed value acquiring unit 11, the observation predicted value acquiring unit 12, the unobservable value acquiring unit 13, the control value calculating unit 18, and the display data generating unit 17 is stored in the memory 31 illustrated in
In addition,
Next, an operation of the apparatus control system illustrated in
A user uses, for example, a man-machine interface (not illustrated) to perform an operation of giving control values W(t)′, W(t+1)′, . . . , W(t+Q)′ of the air conditioner 1 to the apparatus control device 3. The man-machine interface is, for example, a mouse or a keyboard.
The control value receiving unit 19 of the apparatus control device 3 receives the control values W(t)′, W(t+1)′, . . . , W(t+Q)′ of the air conditioner 1 output from the man-machine interface, and outputs the control values W(t)′, W(t+1)′, . . . , W(t+Q)′ to the control value selecting unit 20.
In the apparatus control system illustrated in
The control value selecting unit 20 acquires control values Wg(t), Wg(t+1), . . . , Wg(t+Q) for C control methods from the state prediction unit 15, and acquires control values W(t)′, W(t+1)′, . . . , W(t+Q)′ from the control value receiving unit 19.
The control value selecting unit 20 selects any one control value among the control values Wg(t), Wg(t+1), . . . , Wg(t+Q) for C control methods and the control values W(t)′, W(t+1)′, . . . , W(t+Q)′.
For example, the control value selecting unit 20 compares the predicted error of the observation predicted value Xj(t) used when the control value Wg(t−1) is calculated by each of the C control methods with the predicted error of the observation predicted value Xj(t) when the control value W(t−1)′ is used. That is, the control value selecting unit 20 compares (C+1) predicted errors with each other.
The control value selecting unit 20 selects a control value with the smallest predicted error among the control values Wg(t), Wg(t+1), . . . , Wg(t+Q) for C control methods and the control values W(t)′, W(t+1)′, . . . , W(t+Q)′.
The control value selecting unit 20 outputs the selected control value to each of the air conditioner 1 and the display data generating unit 17.
In the second embodiment described above, the apparatus control device 3 illustrated in
In a third embodiment, an apparatus control device 3 will be described in which a display data generating unit 41 stores an observation predicted value acquired by an observation predicted value acquiring unit 12, and generates display data for displaying information indicating that a difference is larger than a threshold when the difference between the stored observation predicted value and the observed value acquired by an observed value acquiring unit 11 is larger than the threshold.
The display data generating unit 41 is implemented by, for example, a display data generating circuit 27 illustrated in
Like the display data generating unit 17 illustrated in
In addition, like the display data generating unit 17 illustrated in
Unlike the display data generating unit 17 illustrated in
The display data generating unit 41 calculates a difference ΔXj(t) between the stored observation predicted value Xj(t) and the observed value Xj(t) acquired by the observed value acquiring unit 11.
When the difference ΔXj(t) is larger than a threshold Th, the display data generating unit 41 generates display data for displaying information indicating that the difference ΔXj(t) is larger than the threshold Th, and outputs the display data to the display device 4. The threshold Th may be stored in an internal memory of the display data generating unit 41 or may be provided from the outside of the apparatus control device 3.
In the apparatus control device 3 illustrated in
In
Each of the observed value acquiring circuit 21, the observation predicted value acquiring circuit 22, the unobservable value acquiring circuit 23, the control value calculating circuit 24, and the display data generating circuit 27 corresponds to, for example, a single circuit, a composite circuit, a programmed processor, a parallel programmed processor, ASIC, FPGA, or a combination thereof.
The components of the apparatus control device 3 are not limited to those implemented by dedicated hardware, and the apparatus control device 3 may be implemented by software, firmware, or a combination of software and firmware.
In a case where the apparatus control device 3 is implemented by software, firmware, or the like, a program for causing a computer to execute each processing procedure in each of the observed value acquiring unit 11, the observation predicted value acquiring unit 12, the unobservable value acquiring unit 13, the control value calculating unit 14, and the display data generating unit 41 is stored in the memory 31 illustrated in
In addition,
Next, an operation of the apparatus control system illustrated in
The display data generating unit 41 acquires the observation predicted value Xj(t) acquired at the time (t−1) by the observation predicted value acquiring unit 12, and stores the observation predicted value Xj(t).
The display data generating unit 41 acquires the observed value Xj(t) from the observed value acquiring unit 11.
In
A solid line indicates the observed value Xj(t), and a broken line indicates the observation predicted value Xj(t).
The display data generating unit 41 calculates a difference ΔXj(t) between the stored observation predicted value Xj(t) and the observed value Xj(t) as represented by the following equation (5).
The display data generating unit 41 compares the difference ΔXj(t) with the threshold Th.
When the difference ΔXj(t) is larger than the threshold Th, the display data generating unit 41 generates display data for displaying information indicating that the difference ΔXj(t) is larger than the threshold Th, and outputs the display data to the display device 4.
The display device 4 causes a display to display information indicating that the difference ΔXj(t) is larger than the threshold Th based on the display data. A user viewing the information displayed on the display can recognize that the predicted error is large.
In the apparatus control system illustrated in
In addition, when the difference ΔXj(t) is larger than the threshold Th, the display data generating unit 41 may generate display data for highlighting the control value Wg(t).
Note that it is possible to freely combine the embodiments, modify any component of each embodiment, or omit any component of each embodiment in the present disclosure.
The present disclosure is suitable for an apparatus control device and an apparatus control method.
1: air conditioner, 2-1 to 2-N: sensor, 3: apparatus control device, 4: display device, 11: observed value acquiring unit, 12: observation predicted value acquiring unit, 12a: first learning model, 13: unobservable value acquiring unit, 13a: second learning model, 14, 18: control value calculating unit, 15: state prediction unit, 16, 20: control value selecting unit, 17, 41: display data generating unit, 19: control value receiving unit, 21: observed value acquiring circuit, 22: observation predicted value acquiring circuit, 23: unobservable value acquiring circuit, 24, 26: control value calculating circuit, 25, 27: display data generating circuit, 31: memory, 32: processor
This application is a Continuation of PCT International Application No. PCT/JP2021/045729, filed on Dec. 13, 2021, which is hereby expressly incorporated by reference into the present application.
Number | Date | Country | |
---|---|---|---|
Parent | PCT/JP2021/045729 | Dec 2021 | WO |
Child | 18610055 | US |