Buildings appear responsible for over forty percent of global final energy consumption and, as a result, nearly one-third of direct and indirect CO2 emissions. Ensuring buildings are sustainable and energy-efficient may be key to efforts to tackle climate change. Existing energy management solutions, methods or approaches do not necessarily offer ways to evaluate energy consumption patterns real-time.
Estimated energy consumption or baseline in a building may be an estimated/expected energy value that is to be consumed in the building under normal working conditions as per its set-up. Estimating energy consumption of a building may require extensive experience in building energy management. It may also require analysis of much data like historical energy consumption, weather, occupancy, building layout, and so on. Having a framework that would auto-generate the estimated energy consumption of any building may help avoid a large manual effort.
The present solution, approach or method, may include an end-to-end automated data pipeline for data ingestion, storage, analysis, deployment, and ML model maintenance. The present solution, approach or method, which computes a baseline using machine learning methods, may help in the following ways. Accurate real time estimation may help evaluate the actual energy consumption, effectively identifying underlying root causes for an increase in actual consumption, as compared to the estimated energy, triangulating the time of day and place of such occurrences and correlating with dynamic factors like weather and occupancy. Accurately quantifying energy savings may be helpful. Forecasting energy consumption in the future, may enable planning for future energy needs. Energy saving calculations may be done by comparing actual consumption versus baseline predicted consumption based for a specific period. The present solution may offer a configurable machine learning model, which learns energy consumption patterns, i.e., a relationship between various factors and energy consumption for the desired baseline period and enables savings computation.
To generalize, the energy baseline of a building may be an estimated or expected energy usage by a building under normal working conditions as per its set-up. The present system may automate the energy baseline generation of any building using a machine learning (ML) model. The system may use historical energy consumption, weather, occupancy, building layout, and so on, to generate the baseline.
The system may include a generic or generalized energy baseline prediction model, which estimates a baseline even in the absence of historical energy consumption data from energy meters. This enables the model to function for new sites for which historical data do not exist. An agnostic model may work for both old and new buildings.
The system may have a site-specific prediction model. The system may also include site-specific models which learns both site-agnostic, an underlying relationship between energy consumption and weather patterns, occupant behavior, in addition to site-specific characteristics and geo location characteristics. This system may enable the model to function for old sites for which historical energy meter readings are absent.
Data quality checks and handling missing values may be done in ten second or less near real time. Complete data is important for the accurate baseline prediction. Missing or invalid data may reduce the accuracy of the model. Generally, in a machine learning model development, data quality improvement may be done manually. But the present system identifies missing, invalid data points from the incoming live stream of energy meter data, removes invalid data points and fills missing values using a combination of rule-based and statistical interpolation techniques. This data quality enrichment procedure runs in near real-time.
A digital twin of the building may be a digital representation of a physical asset. The system may employ a graph-based ontology (grammar) which serves as a digital twin representation of the building. The digital twin may capture characteristics of the building, relationships among various assets in the building like HVAC, pumps, and so on, their properties and operations in the building.
A separate digital twin instance of each of the buildings may be created using the generic ontology. The system may also maintain the historical changes in the building (like a new HVAC or addition of new lab space, and so forth) in the form of past versions of the digital twin. Such historical digital twin instances may be used for back testing baseline prediction models. This may give one an ability to compare new algorithms/models with previous algorithms/models and evaluate the effect of changed relationships on energy consumption in the building.
Auto retraining of an ML model may be noted. A baseline prediction ML model, which is the critical aspect of the system, may update itself on its own based on triggered events (e.g., new floor addition, building level changes, and so on). This approach may avoid manual intervention to retrain the ML model periodically. This may be enabled by a robust ML operating system (MLOps) pipeline to periodically train the underlying machine learning model on incremental data. MLOps may be a framework of software elements that helps in managing the code and deploy machine learning models throughout its lifecycle. The system has such a pipeline to manage the ML models.
Auto-alert generation and recommendation may be noted. The system may include automated generation of real time alerts when the energy consumption is higher than the baseline. The system may also have a recommendation module that highlights causes of the higher than normal/estimated energy consumption. It may also provide actionable insights that help a building management team to fix the faulty equipment that is causing high energy consumption.
The system may use real-time or live occupancy information as an external factor in the ML model. Realtime occupancy may be highly correlated with the energy consumption of the model. Thus, the present baseline prediction may be highly accurate.
The system may forecast estimated consumption in multiple forecast horizons (e.g., short term up to 7 days and long term up to 30 days) to facilitate planning ahead in time, for energy demand in the building.
The system may use specific ML algorithms that can capture non-linear and complex relationships between energy consumption and other parameters like building level, ontology features, weather conditions, occupancy, and so on, holistically. This may result in a highly accurate baseline prediction than the related art which generally uses algorithms that can capture only linear and non-complex relationships.
The following may reveal the advantages of the present solution, over other existing solutions: 1) Adaptability— A baseline estimation solution is adaptable and works for different types of buildings, different geolocations, different usage patterns, different occupancy, and so on; 2) Accuracy—Estimates are accurate, but not be overfit and thus generalized across buildings and the globe; 3) Continuous improvement—a self-learning model which improves on its own, after deployment, by learning continuously from new data over a period of time; 4) Scalability—Due to the above-mentioned need for maintaining model accuracy, adaptability across sites/geo-location and multiple types of meters, continuous improvement for multiple sites and meters is required in real time, which this disclosure provides 5) Robustness—it may work even in situations of unavailability of historical energy consumption data; and 6) Normalization—an ability to normalize energy consumption patterns over multiple building characteristics like occupancy, size, and geolocation. This may enable cross-buildings benchmarking.
The energy baselining approach or solution, part of a company's energy management as a Service offering for commercial buildings, may provide an ability to track actual consumption against an estimated consumption/baseline as per current operating conditions. This may provide an ability to compute and realize energy savings in the long run, and thereby reduce a carbon footprint. There may be an ability to identify a root cause of spikes in energy consumption as compared to expected consumption. There may be a further ability to calculate baseline consumption as per any baseline period. This approach may be used by energy managers and auditors to evaluate building energy conservation measures (ECM), compare different measures, and understand each ECM's efficacy. Additionally, this might be leveraged for energy savings calculation.
Near-term forecasting of energy consumption may result in an ability to benchmark each site against other sites, from an energy efficiency standpoint.
To reiterate, key aspects of the present solution that make it robust and unique may include the following.
The present solution may include an end-to-end automated data pipeline for data ingestion, storage, analysis, deployment, and ML model maintenance.
Data quality of incoming data may be assured, through interpolation methods for missing/anomalous data, outlier treatment, data sanity checks and cleansing logic implementation. These items as indicated in
The solution has a robust ontology definition, to model entire building's ontology in an rdf triples format, which may be updated via streams of change events to building ontology definitions. An ontology graph may be reviewed to extract information on relationships among various equipment, e.g., what load each meter is connected to, and so on.
One approach for the solution may be to use linear models to establish relationship between predictors like weather conditions, and load type with the energy consumption of a meter. However, the actual relationship between consumption and other factors may be stochastic in nature and highly non-linear, especially when we estimate at an hourly/15-minute frequency. Thus, a non-linear machine learning model, namely XGBoost regressor, which is an optimized implementation of gradient boosted decision trees, may have been used to get accurate predictions.
An energy consumption baseline may be estimated using learning from historical data and capturing multivariate, non-linear relationships among energy consumption and weather, occupancy, building size, type, layout, and so on.
MLOps strategies may have been adopted for an ML model maintenance in production, as per industry standard CRISP-DM.
The present approach may be a key component of building energy management solution. This may give an ability to estimate energy consumption, considering external weather parameters, occupancy, size of the building, load connected to various meters, and historical energy consumption patterns.
The predictions may be used to create real time alerting systems that are triggered when the actual consumption exceeds the baseline, beyond a preset threshold value. These alerts may help in diagnosing the unusual high energy consumption real time and be used to identify in a timely manner, faulty equipment, which may be consuming extremely high energy.
As a result of identifying and diagnosing real-time spikes in energy consumption the solution may make it possible to realize savings in the long run.
Forecasts of energy consumption may help plan for energy needs in future. The solution may enable a comparison of multiple sites, from an energy consumption standpoint, track energy efficiency for each of them, giving the facility managers better decision-making ability to make the building more energy efficient.
The solution is integrated into the product in the following manner:
The solution may be integrated with a BMS Supervisor and is to be provided as a service.
Energy meter reading may be collected from underlying BMS systems, data stored on cloud, including a history of readings from different buildings.
Cross building analytics may be performed and the solution may estimate energy consumption even for new buildings/sites which do not have historical data for training machine learning models.
This may help in building a timely maintenance as the deviation from estimated energy consumption is key for determining anomalies.
Organizations may leverage collective intelligence gathered from multiple sites.
The solution may have a software component with a stack level: insight (analytics)—data manipulation to gain info (trend and predictive analytics tool)
There may be a software type: connected/connectivity—offering available thru cloud or direct, remote connection (SaaS) or cover infrastructure enabling connected services (Sentience).
There may be an IOT stack level: insight (analytics)—data manipulation to gain information (trend and predictive analytics tool).
The solution may generate or capture data. The type of data and where it will reside may be energy meter readings from sites that are collected. The data may be stored in a cloud storage. Also, there may be weather data collected from an external API and there may be building ontology data.
Job names, schedules and job purposes relative to diagram 11 of cloud platform in
Symbol 24 represents enrichment job 1 having a schedule to run every hour and having a job purpose to calculate baseline consumption for every point (meter) at a 15 minute resolution. Symbol 25 represents an alarm generating job 1 having a schedule to run every 15 minutes, and having a purpose to read actual consumption (from an intermediate table), read baseline consumption from a final table, and read an alarm configuration from a configuration store, and generate an alarm publish to a streaming source. Symbol 26 may represent a training pipeline that has a schedule to be triggered every one-half month or a synchronously manually.
A closer look at platform 12 in
Symbol 31 represents actual meter reading storage with a 15 minute resolution, and having an output connected to symbol 27. Symbol 32 represents a weather API storage with a 15 minute resolution, and having an output connected to symbol 28. Symbol 33 represents a configuration store having an output connected to symbol 25.
An output from symbol 21 may go to a raw table 1 represented by symbol 26 which in turn has an output to symbol 24. An output from symbol 22 may go to a raw table 2 represented by symbol 27 which in turn has an output to symbol 24. An output from symbol 23 may go to a common data model/meta data storage represented by symbol 28.
An output from symbol 24 may go to an enriched and cleaned table 29 as represented by symbol 31. An output from symbol 31 may, as shown by a dashed line, go to symbol 26 for the training pipeline. An output from symbol 31 may go to a baseline prediction job 1 as represented by symbol 32. Another output from symbol 31 may go to symbol 25 of the alarms generating job 1.
An output from the training pipeline of symbol 26 may go to a model store represented by symbol 33. An output from symbol 33 may go to symbol 32. An output from symbol 32 may go to a final table represented by symbol 34. An output from symbol 34 may go to symbol 25 representing the alarms generating job 1. Another output from symbol 34 may go to a symbol 35 representing an alarms database. From symbol 25 representing alarms generating job 1 may be an output to a symbol 36 representing streaming (published) from platform 12.
An auto model for predicting baseline energy consumption may be noted.
Leveraging historical energy consumption data and additional temporal features may improve an existing energy baselining solutions. Accuracy improvement may be demonstrated due to following reasons. There may be an inclusion of non-linearity (XGBoost) into a baseline prediction model to capture non-linear relationships. There may be additional metric computation (RMSE ratings to balance accuracy and generation capability of the model (overfit versus underfit. There may be data cleansing that involves rule-based and statistical methods for missing value imputation.
The variables used herein and their descriptions may be noted. A weather variable may incorporate external temperature readings in C., heating degree days=(65−temp) in F., cooling degree days=(temp−65) in F., humidity in percent, dew point temperature, and atmospheric pressure in millibars (mb).
Historical meter readings may incorporate previous week same time energy (calculated by taking an energy meter reading 7 days prior to the same time instant), and previous day differential (t−1 day's one week differential, calculated by taking the difference between previous day's same time energy meter reading and 8 day's prior same time energy.
A date time may incorporate month (an integer value as per month, ranging from 1 to 12), time of the day (an integer value as per the hour, ranging from 0 to 23), day of the week (an integer value assigned starting with (Monday) to 6 (Sunday)), and weekend (binary value, 0 for day of week=[0,4] and 1 for day of week [5,6].
There may be further steps on auto-baselining as in the following: 1) Deciding on the frequency of re-training and training size on model performance during pilot deployment; 2) Testing the model on various geo locations; 3) Incorporating new features (e.g., occupancy, size of a subject building) in the model; 4) Exploring a possibility of a generic model that can predict a baseline when there are no historical data for the new meter and/or site; 5) Exploring how to incorporate dynamic changes to working conditions (e.g., an addition of a new lab); and 6) Identifying equipment which causes spikes in energy consumption and figuring out a cause. One would need an equipment's normal operating characteristics.
Case 2 of
Case 3 of
Observed values 106 and time interp values 107 may be noted in diagram 102. Time interpolation may be similar to linear interpolation when the values are equally spaced. Time based interpolation may have a mean absolute fractional error of 0.0705.
Standard interpolation techniques may be a good alternative for short gaps, but may not necessarily be applied for long continuous gaps (e.g., >10-12 hours). A rule based approach may be good at filling both short and long gaps provided that the underlying pattern is a repeating/periodic one as it is in the present cases.
An output from component 115 may go to an orchestrated experiment setup 116. A loop with an output from setup 116 may go to a model analysis device 117. An output from device 117 may go to setup 116. A source code component 118 may receive an input from setup 116. An output from component 118 may go to a source repo storage device 119. A C1:Build, test package pipeline components 120 may have an output to a CD:pipeline deployment device 121 input in the development environment 112 that may have an output to an automated training pipeline 122. An output from storage device 114 may go to pipeline 122. Also, a schedule trigger device 123 may be connected to pipeline 122. Pipeline 122 may provide, in a stated order, data extraction 131, data validation 132, data preparation 133, model training 134, and model evaluation 135. An output of the model evaluation 135 may go to a model registry 136. An output of registry 136 may be a trained model 137. The trained model 137 may go to a real time prediction pipeline 138. A pipeline 138 output may go to data tables (feature store) device 119, which in turn may have an output connected to pipeline 138.
Common data model use and meta data of buildings for analytics may be noted. Building ontology information from multiple sites in a graph rdf may be stored. It may provide a way to query the model to extract any required ontology information about sites. Analytics may give way to create a separate view for the meta data without exposing the underlying data model.
Information of graph representation 140 (viz., RDF triples) may be stored in delta tables 150. There may be seven rows for seven items. Five columns to indicate inserttime, contextID, subject, predicate, and object.
To recap, a mechanism for establishing and maintaining baseline energy consumption for a building, may incorporate a generalized energy consumption baseline prediction model, a—specific model of a building which learns site-specific relationship among energy consumption, weather patterns, occupant behavior, site structural characteristics and geo-location characteristics of the building, a framework that enables a machine learning model to retrain with updates, triggered events, and building changes, and a digital twin of the building which is a digital representation of the building that captures characteristics of the building, and relationship and change among various assets in the building. Energy consumption may be automatically compared with the generalized energy baseline prediction model.
The near real-time data quality enrichment procedure may identify missing or invalid data points from the incoming live stream of energy meter data, remove invalid data points and fill-in missing values using a combination of rule-based and statistical interpolation techniques.
Operational instances of the digital twin of the building may be used for back testing the generalized energy baseline prediction model, to give an ability to compare new algorithms or models with previous ones and evaluate the effect of changed relationships on energy consumption by the building.
A machine learning model may automatically update or retrain itself on its own, based on the triggered events with a machine learning operating system pipeline to periodically train the underlying machine learning model on incremental data, where the pipeline is a framework of software elements that manages a code and deploys the machine learning model throughout its lifecycle.
The machine learning model may incorporate algorithms that can capture non-linear and complex relationships between energy consumption and energy related parameters.
The mechanism may further incorporate automated generation of real time alerts when the energy consumption is higher than a baseline.
The mechanism may further incorporate a recommendation module that highlights causes of an energy consumption of the building higher than the baseline.
An energy consumption monitoring system may incorporate a first module configured to provide a generalized energy consumption baseline of a physical asset, a second module configured to measure energy consumption of the physical asset, and a third module configured to compare the energy consumption measured from the physical asset with the generalized energy consumption baseline of the physical asset. If the energy consumption measured exceeds the generalized energy consumption baseline, then an alert may emanate.
To provide the generalized energy consumption baseline of the physical asset may be automated using a machine learning model.
The generalized energy consumption baseline may be estimated in the absence of historical energy consumption data from energy meters of the physical asset, data of weather, occupancy, and building layout of the physical asset.
The physical asset may incorporate one or more buildings or structures.
The system may further incorporate a data quality enrichment procedure connected to the first module and run in a ten second or less near real time or real-time, and identify missing data points and invalid data points from an incoming live stream of energy meter data to the first module, remove the invalid data points, and fill-in missing values using a combination of rule-based and statistical interpolation techniques.
The system may further incorporate a digital twin of the physical asset. The digital twin may be a digital representation of the physical asset.
The digital twin may capture characteristics of the physical asset, relationships among various assets in the building like an HVAC and its properties, and operations in the physical asset.
An approach for evaluating energy consumption by a building, may incorporate generating an energy consumption baseline of a building, measuring energy consumption of the building, and comparing a measurement of the energy consumption of the building with the energy consumption baseline. If the measurement of the energy consumption exceeds the energy consumption baseline, then an alert may be issued.
An issued alert may be automatic.
Upon detecting the automatic issued alert on an indicator, highlighting on the indicator may occur to show actionable insights that help a building management team fix faulty equipment that causes the measurement of the energy consumption to exceed the energy consumption baseline.
The generating an energy consumption baseline of the building may be automated using a machine learning model.
Use of real-time or live occupancy information may be an external factor in the machine learning model. Real-time or live occupancy may be correlated with the energy consumption of the machine learning model.
Generating an energy consumption baseline of the building may be facilitated by an estimated predicted consumption in multiple forecast horizons varying from one day to sixty days to aid in planning for energy demand in the building.
In the present specification, some of the matter may be of a hypothetical or prophetic nature although stated in another manner or tense.
Although the present system and/or approach has been described with respect to at least one illustrative example, many variations and modifications will become apparent to those skilled in the art upon reading the specification. It is therefore the intention that the appended claims be interpreted as broadly as possible in view of the related art to include all such variations and modifications.