This application relates generally to a building system of a building. This application relates more particularly to systems for managing and processing data of the building system.
Various interactions between building systems, components of building systems, users, technicians, and/or devices managed by users or technicians can rely on timely generation and presentation of data relating to the status of one or more buildings or building equipment. The presentation of the data may be displayed on a user interface. User interfaces which may display information about the building, however, developing and maintaining these user interfaces may be complex and time consuming.
One or more aspects relate to building management systems and methods that automatically generates one or more user interfaces based on prompts received from the user. For example, a system can include at least one machine learning model configured using training data that includes at least one of unstructured data or structured data regarding items of equipment. The system can provide inputs, such as prompts, to the at least one machine learning model regarding an item of equipment, and generate, according to the inputs, a user interface providing information to a user based on the prompt. The machine learning model can include various machine learning model architectures (e.g., networks, backbones, algorithms, etc.), including but not limited to language models, LLMs, attention-based neural networks, transformer-based neural networks, generative pretrained transformer (GPT) models, bidirectional encoder representations from transformers (BERT) models, encoder/decoder models, sequence to sequence models, autoencoder models, generative adversarial networks (GANs), convolutional neural networks (CNNs), recurrent neural networks (RNNs), diffusion models (e.g., denoising diffusion probabilistic models (DDPMs)), or various combinations thereof.
One implementation is a method. The method includes receiving, by one or more processors via an AI chat interface, a prompt from a user. The method also includes dynamically determining, by the one or more processors using an AI model, a format and a content of an output user interface, wherein both the format and the content are determined dynamically by the AI model based on the prompt. The method may also include generating, by the one or more processors using the AI model, the output user interface in the determined format and including the determined content.
In some implementations, dynamically determining the format of the output user interface comprises selecting, by the one or more processors using the AI model, from among one of a plurality of predetermined output content formats based on the prompt. In some implementations, dynamically determining the format and the content of the output user interface includes predicting, by the one or more processors using the AI model, an intent of the user from the prompt and determining, by the one or more processors using the AI model, the format and the content of the output user interface using the predicted intent.
In some implementations, the output user interface comprises a predetermined interface with a plurality of populatable interface portions, and wherein generating the output user interface comprises populating, by the AI model, at least one of the one or more populatable interface portions using the dynamically determined format and content. In some implementations, the AI model is a generative AI model and dynamically determining the format and the content comprises dynamically generating, using the generative AI model, at least a portion of the format or the content of the output user interface using the generative AI model based on the prompt. In some implementations, the at least a portion of the format of the content of the output user interface comprises a new format or a new content generated by the generative AI model that was not preexisting, prior to receipt of the prompt from the user, in underlying data upon which the generative AI model was trained.
In some implementations, the output user interface comprises a history of one or more prompts previously received by the AI chat interface. In some implementations, the history of one or more prompts is dynamically generated responsive to the prompt in response to the AI model determining the one or more prompts to be potentially relevant to the prompt.
One implementation is a system for generating a response to a query using a generative artificial intelligence model. The system includes one or more memory devices having instructions stored thereon that, when executed by one or more processors, cause the one or more processors to perform operations. The operations include receiving, via an AI chat interface, a prompt from a user. The operations include dynamically determining, using an AI model, a format and a content of an output user interface, wherein both the format and the content are determined dynamically by the AI model based on the prompt. The operations may also include generating, using the AI model, the output user interface in the determined format and including the determined content.
One implementation is one or more non-transitory computer-readable media. The one or more non-transitory computer-readable media stores program instructions that when executed cause the one or more processors to perform operations. The operations include receiving, via an AI chat interface, a prompt from a user. The operations include dynamically determining, using an AI model, a format and a content of an output user interface, wherein both the format and the content are determined dynamically by the AI model based on the prompt. The operations may also include generating, using the AI model, the output user interface in the determined format and including the determined content.
Various objects, aspects, features, and advantages of the disclosure will become more apparent and better understood by referring to the detailed description taken in conjunction with the accompanying drawings, in which like reference characters identify corresponding elements throughout. In the drawings, like reference numbers generally indicate identical, functionally similar, and/or structurally similar elements.
Referring generally to the FIGURES, systems and methods in accordance with the present disclosure can implement various systems to precisely generate data relating to operations to be performed for managing building systems and components and/or items of equipment, including heating, ventilation, cooling, and/or refrigeration (HVAC-R) systems and components. For example, various systems described herein can be implemented to more precisely receive, via an AI chat interface, a prompt from a user; dynamically determine, using an AI model, a format and a content of an output user interface, wherein both the format and the content are determined dynamically by the AI model based on the prompt; and generating, using the AI model, the output user interface in the determined format and including the determined content.
Current methods for creating user interfaces with a three-tier architecture require the use of web frameworks (e.g., Angular, ReactJS, Next.JS, etc.) and various web programing languages (e.g., JavaScript/typescript, HTML, and CSS). Typically, creating user interfaces using current methods requires teams of UI/UX programmers to meet the requirements. Maintaining such UI/UX personnel to change and manage user interfaces can be costly, complex, and time consuming. For example, creating user interfaces with the three-tier design could require the coupling of the user interface and server code. Therefore, any required adjustments to a user interface would also require changes to the corresponding source code which makes maintaining user interfaces complex and time consuming. Therefore, systems and methods for automatically generating user interfaces using an AI model, such as a generative AI model may be desired.
The systems and methods described herein utilize a generative AI models, such as generative LLMs, to ingest user inputs from prompts entered by the user and develop an output user interface based on the prompts. The systems and methods described herein generate a user interface while eliminating complex UI work and reduce the amount of time used to generate user interfaces. Further, the prompts entered by the user will enable the generative AI models to generate integration layer code dynamically to gather any required user data. The systems and methods described herein provide multiple technical benefits including: 1) eliminating a complicated user interface design process; 2) providing a dashboard interface which may allow the user to render the result (e.g., output user interface) from a server on the dashboard; 3) minimizing routing on the server or user interface generating application side; 4) minimizing integration code in the server or user interface generating application side; and 5) minimizing contracts between the user interface generating application side and the user interface side.
AI and/or machine learning (ML) systems, including but not limited to LLMs, can be used to generate text data, user interfaces, and data of other modalities in a more responsive manner to real-time conditions, including generating strings of text data that may not be provided in the same manner in existing documents, yet may still meet criteria for useful text information, such as relevance, style, and coherence. For example, LLMs can predict text data based at least on inputted prompts and by being configured (e.g., trained, modified, updated, fine-tuned) according to training data representative of the text data to predict or otherwise generate.
However, various considerations may limit the ability of such systems to precisely generate appropriate data for specific conditions. For example, due to the predictive nature of the generated data, some LLMs may generate text data that is incorrect, imprecise, or not relevant to the specific conditions. Using the LLMs may require a user to manually vary the content and/or syntax of inputs provided to the LLMs (e.g., vary inputted prompts) until the output of the LLMs meets various objective or subjective criteria of the user. The LLMs can have token limits for sizes of inputted text during training and/or runtime/inference operations (and relaxing or increasing such limits may require increased computational processing, API calls to LLM services, and/or memory usage), limiting the ability of the LLMs to be effectively configured or operated using large amounts of raw data or otherwise unstructured data. In some instances, relatively large LLMs, such as LLMs having billions or trillions of parameters, may be less agile in responding to novel queries or applications. In addition, various LLMs may lack transparency, such as to be unable to provide to a user a conceptual/semantic-level explanation of a given output was generated and/or selected relative to other possible outputs.
Systems and methods in accordance with the present disclosure can use machine learning models, including LLMs and other generative AI systems, to capture data, including but not limited to unstructured knowledge from various data sources, and process the data to accurately generate outputs, such as completions responsive to prompts, including in structured data formats for various applications and use cases. A completion may be described as the output of the generative AI model based on a prompt or request submitted by the user. The system can implement various automated and/or expert-based thresholds and data quality management processes to improve the accuracy and quality of generated outputs and update training of the machine learning models accordingly. The system can enable real-time messaging and/or conversational interfaces for users to provide field data regarding equipment to the system (including presenting targeted queries to users that are expected to elicit relevant responses for efficiently receiving useful response information from users) and guide users, such as service technicians, through relevant service, diagnostic, troubleshooting, and/or repair processes.
For example, the system can provide a platform for fault detection and servicing processes in which a machine learning model is configured based on connecting or relating unstructured data and/or semantic data, such as human feedback and written/spoken reports, with time-series product data regarding items of equipment, so that the machine learning model can more accurately detect causes of alarms or other events that may trigger service responses. For instance, responsive to an alarm for a chiller, the system can more accurately detect a cause of the alarm, and generate a prescription (e.g., for a service technician) for responding to the alarm or present the alarm information to the user through a user interface. The system can generate a report, responsive to predicting the fault condition, that identifies the one or more actions
In some instances, significant computational resources (or human user resources) can be required to process data relating to equipment operation, such as time-series product data and/or sensor data, to detect or predict faults and/or causes of faults. In addition, it can be resource-intensive to label such data with identifiers of faults or causes of faults, which can make it difficult to generate machine learning training data from such data. Systems and methods in accordance with the present disclosure can leverage the efficiency of language models (e.g., GPT-based models or other pre-trained LLMs) in extracting semantic information (e.g., semantic information identifying faults, causes of faults, and other accurate expert knowledge regarding equipment servicing) from the unstructured data in order to use both the unstructured data and the data relating to equipment operation to generate more accurate outputs regarding equipment servicing. As such, by implementing language models using various operations and processes described herein, building management and equipment servicing systems can take advantage of the causal/semantic associations between the unstructured data and the data relating to equipment operation, and the language models can allow these systems to more efficiently extract these relationships in order to more accurately predict targeted, useful information for servicing applications at inference-time/runtime. While various implementations are described as being implemented using generative AI models such as transformers and/or GANs, in some embodiments, various features described herein can be implemented using non-generative AI models or even without using AI/machine learning, and all such modifications fall within the scope of the present disclosure.
The system can enable a generative AI-based service wizard interface. For example, the interface can include user interface and/or user experience features configured to provide a question/answer-based input/output format, such as a conversational interface, that directs users through providing targeted information for accurately generating predictions of root cause, presenting solutions, or presenting instructions for repairing or inspecting the equipment to identify information that the system can use to detect root causes or other issues. The system can use the interface to present information regarding parts and/or tools to service the equipment, as well as instructions for how to use the parts and/or tools to service the equipment.
In various implementations, the systems can include a plurality of machine learning models that may be configured using integrated or disparate data sources. This can facilitate more integrated user experiences or more specialized (and/or lower computational usage for) data processing and output generation. Outputs from one or more first systems, such as one or more first algorithms or machine learning models, can be provided at least as part of inputs to one or more second systems, such as one or more second algorithms or machine learning models. For example, a first language model can be configured to process unstructured inputs (e.g., text, speech, images, etc.) into a structure output format compatible for use by a second system, such as a root cause prediction algorithm or equipment configuration model.
The system can be used to automate interventions for equipment operation, servicing, fault detection and diagnostics (FDD), and alerting operations. For example, by being configured to perform operations such as root cause prediction, the system can monitor data regarding equipment to predict events associated with faults and trigger responses such as alerts, service scheduling, and initiating FDD or modifications to configuration of the equipment. The system can present to a technician or manager of the equipment a report regarding the intervention (e.g., action taken responsive to predicting a fault or root cause condition) and requesting feedback regarding the accuracy of the intervention, which can be used to update the machine learning models to more accurately generate interventions.
While various implementations discussed herein discuss building/equipment servicing as an application, it should be understood that the present disclosure could be applied to any of a variety of building use cases, and all such use cases/implementations are contemplated within the scope of the present disclosure.
For example, the system 100 can be implemented for operations associated with any of a variety of building management systems (BMSs) or equipment or components thereof. A BMS can include a system of devices that can control, monitor, and manage equipment in or around a building or building area. The BMS can include, for example, a HVAC system, a security system, a lighting system, a fire alerting system, any other system that is capable of managing building functions or devices, or any combination thereof. The BMS can include or be coupled with items of equipment, for example and without limitation, such as heaters, chillers, boilers, air handling units, sensors, actuators, refrigeration systems, fans, blowers, heat exchangers, energy storage devices, condensers, valves, or various combinations thereof.
The items of equipment can operate in accordance with various qualitative and quantitative parameters, variables, setpoints, and/or thresholds or other criteria, for example. In some instances, the system 100 and/or the items of equipment can include or be coupled with one or more controllers for controlling parameters of the items of equipment, such as to receive control commands for controlling operation of the items of equipment via one or more wired, wireless, and/or user interfaces of controller.
Various components of the system 100 or portions thereof can be implemented by one or more processors coupled with or more memory devices (memory). The processors can be a general purpose or specific purpose processors, an application specific integrated circuit (ASIC), one or more field programmable gate arrays (FPGAs), a group of processing components, or other suitable processing components. The processors may be configured to execute computer code and/or instructions stored in the memories or received from other computer readable media (e.g., CDROM, network storage, a remote server, etc.). The processors can be configured in various computer architectures, such as graphics processing units (GPUs), distributed computing architectures, cloud server architectures, client-server architectures, or various combinations thereof. One or more first processors can be implemented by a first device, such as an edge device, and one or more second processors can be implemented by a second device, such as a server or other device that is communicatively coupled with the first device and may have greater processor and/or memory resources.
The memories can include one or more devices (e.g., memory units, memory devices, storage devices, etc.) for storing data and/or computer code for completing and/or facilitating the various processes described in the present disclosure. The memories can include random access memory (RAM), read-only memory (ROM), hard drive storage, temporary storage, non-volatile memory, flash memory, optical memory, or any other suitable memory for storing software objects and/or computer instructions. The memories can include database components, object code components, script components, or any other type of information structure for supporting the various activities and information structures described in the present disclosure. The memories can be communicably connected to the processors and can include computer code for executing (e.g., by the processors) one or more processes described herein.
The system 100 can include or be coupled with one or more first models 104. The first model 104 can include one or more neural networks, including neural networks configured as generative models. For example, the first model 104 can predict or generate new data (e.g., artificial data; synthetic data; data not explicitly represented in data used for configuring the first model 104). The first model 104 can generate any of a variety of modalities of data, such as text, speech, audio, images, and/or video data. The neural network can include a plurality of nodes, which may be arranged in layers for providing outputs of one or more nodes of one layer as inputs to one or more nodes of another layer. The neural network can include one or more input layers, one or more hidden layers, and one or more output layers. Each node can include or be associated with parameters such as weights, biases, and/or thresholds, representing how the node can perform computations to process inputs to generate outputs. The parameters of the nodes can be configured by various learning or training operations, such as unsupervised learning, weakly supervised learning, semi-supervised learning, or supervised learning.
The first model 104 can include, for example and without limitation, one or more language models, LLMs, attention-based neural networks, transformer-based neural networks, generative pretrained transformer (GPT) models, bidirectional encoder representations from transformers (BERT) models, encoder/decoder models, sequence to sequence models, autoencoder models, generative adversarial networks (GANs), convolutional neural networks (CNNs), recurrent neural networks (RNNs), diffusion models (e.g., denoising diffusion probabilistic models (DDPMs)), or various combinations thereof.
For example, the first model 104 can include at least one GPT model. The GPT model can receive an input sequence, and can parse the input sequence to determine a sequence of tokens (e.g., words or other semantic units of the input sequence, such as by using Byte Pair Encoding tokenization). The GPT model can include or be coupled with a vocabulary of tokens, which can be represented as a one-hot encoding vector, where each token of the vocabulary has a corresponding index in the encoding vector; as such, the GPT model can convert the input sequence into a modified input sequence, such as by applying an embedding matrix to the token tokens of the input sequence (e.g., using a neural network embedding function), and/or applying positional encoding (e.g., sin-cosine positional encoding) to the tokens of the input sequence. The GPT model can process the modified input sequence to determine a next token in the sequence (e.g., to append to the end of the sequence), such as by determining probability scores indicating the likelihood of one or more candidate tokens being the next token, and selecting the next token according to the probability scores (e.g., selecting the candidate token having the highest probability scores as the next token). For example, the GPT model can apply various attention and/or transformer based operations or networks to the modified input sequence to identify relationships between tokens for detecting the next token to form the output sequence.
The first model 104 can include at least one diffusion model, which can be used to generate image and/or video data. For example, the diffusional model can include a denoising neural network and/or a denoising diffusion probabilistic model neural network. The denoising neural network can be configured by applying noise to one or more training data elements (e.g., images, video frames) to generate noised data, providing the noised data as input to a candidate denoising neural network, causing the candidate denoising neural network to modify the noised data according to a denoising schedule, evaluating a convergence condition based on comparing the modified noised data with the training data instances, and modifying the candidate denoising neural network according to the convergence condition (e.g., modifying weights and/or biases of one or more layers of the neural network). In some implementations, the first model 104 includes a plurality of generative models, such as GPT and diffusion models, that can be trained separately or jointly to facilitate generating multi-modal outputs, such as technical documents (e.g., service guides) that include both text and image/video information.
In some implementations, the first model 104 can be configured using various unsupervised and/or supervised training operations. The first model 104 can be configured using training data from various domain-agnostic and/or domain-specific data sources, including but not limited to various forms of text, speech, audio, image, and/or video data, or various combinations thereof. The training data can include a plurality of training data elements (e.g., training data instances). Each training data element can be arranged in structured or unstructured formats; for example, the training data element can include an example output mapped to an example input, such as a query representing a service request or one or more portions of a service request, and a response representing data provided responsive to the query. The training data can include data that is not separated into input and output subsets (e.g., for configuring the first model 104 to perform clustering, classification, or other unsupervised ML operations). The training data can include human-labeled information, including but not limited to feedback regarding outputs of the models 104, 116. This can allow the system 100 to generate more human-like outputs.
In some implementations, the training data includes data relating to building management systems. For example, the training data can include examples of HVAC-R data, such as operating manuals, technical data sheets, configuration settings, operating setpoints, diagnostic guides, troubleshooting guides, user reports, technician reports. In some implementations, the training data used to configure the first model 104 includes at least some publicly accessible data, such as data retrievable via the Internet.
Referring further to
The second model 116 can be similar to the first model 104. For example, the second model 116 can have a similar or identical backbone or neural network architecture as the first model 104. In some implementations, the first model 104 and the second model 116 each include generative AI machine learning models, such as LLMs (e.g., GPT-based LLMs) and/or diffusion models. The second model 116 can be configured using processes analogous to those described for configuring the first model 104.
In some implementations, the model updater 108 can perform operations on at least one of the first model 104 or the second model 116 via one or more interfaces, such as application programming interfaces (APIs). For example, the models 104, 116 can be operated and maintained by one or more systems separate from the system 100. The model updater 108 can provide training data to the first model 104, via the API, to determine the second model 116 based on the first model 104 and the training data. The model updater 108 can control various training parameters or hyperparameters (e.g., learning rates, etc.) by providing instructions via the API to manage configuring the second model 116 using the first model 104.
The model updater 108 can determine the second model 116 using data from one or more data sources 112. For example, the system 100 can determine the second model 116 by modifying the first model 104 using data from the one or more data sources 112. The data sources 112 can include or be coupled with any of a variety of integrated or disparate databases, data warehouses, digital twin data structures (e.g., digital twins of items of equipment or building management systems or portions thereof), data lakes, data repositories, documentation records, or various combinations thereof. In some implementations, the data sources 112 include HVAC-R data in any of text, speech, audio, image, or video data, or various combinations thereof, such as data associated with HVAC-R components and procedures including but not limited to installation, operation, configuration, repair, servicing, diagnostics, and/or troubleshooting of HVAC-R components and systems. Various data described below with reference to data sources 112 may be provided in the same or different data elements, and may be updated at various points. The data sources 112 can include or be coupled with items of equipment (e.g., where the items of equipment output data for the data sources 112, such as sensor data, etc.). The data sources 112 can include various online and/or social media sources, such as blog posts or data submitted to applications maintained by entities that manage the buildings. The system 100 can determine relations between data from different sources, such as by using timeseries information and identifiers of the sites or buildings at which items of equipment are present to detect relationships between various different data relating to the items of equipment (e.g., to train the models 104, 116 using both timeseries data (e.g., sensor data; outputs of algorithms or models, etc.) regarding a given item of equipment and freeform natural language reports regarding the given item of equipment).
The data sources 112 can include unstructured data or structured data (e.g., data that is labeled with or assigned to one or more predetermined fields or identifiers, or is in a predetermined format, such as a database or tabular format). The unstructured data can include one or more data elements that are not in a predetermined format (e.g., are not assigned to fields, or labeled with or assigned with identifiers, that are indicative of a characteristic of the one or more data elements). The data sources 112 can include semi-structured data, such as data assigned to one or more fields that may not specify at least some characteristics of the data, such as data represented in a report having one or more fields to which freeform data is assigned (e.g., a report having a field labeled “describe the item of equipment” in which text or user input describing the item of equipment is provided). The data sources 112 can include data that is incomplete,
For example, using the first model 104 and/or second model 116 to process the data can allow the system 100 to extract useful information from data in a variety of formats, including unstructured/freeform formats, which can allow service technicians to input information in less burdensome formats. The data can be of any of a plurality of formats (e.g., text, speech, audio, image, video, etc.), including multi-modal formats. For example, the data may be received from service technicians in forms such as text (e.g., laptop/desktop or mobile application text entry), audio, and/or video (e.g., dictating findings while capturing video).
The data sources 112 can include engineering data regarding one or more items of equipment. The engineering data can include manuals, such as installation manuals, instruction manuals, or operating procedure guides. The engineering data can include specifications or other information regarding operation of items of equipment. The engineering data can include engineering drawings, process flow diagrams, refrigeration cycle parameters (e.g., temperatures, pressures), or various other information relating to structures and functions of items of equipment.
In some implementations, the data sources 112 can include operational data regarding one or more items of equipment. The operational data can represent detected information regarding items of equipment, such as sensor data, logged data, user reports, or technician reports. The operational data can include, for example, service tickets generated responsive to requests for service, work orders, data from digital twin data structures maintained by an entity of the item of equipment, outputs or other information from equipment operation models (e.g., chiller vibration models), or various combinations thereof. Logged data, user reports, service tickets, billing records, time sheets, and various other such data can provide temporal information, such as how long service operations may take, or durations of time between service operations, which can allow the system 100 to predict resources to use for performing service as well as when to request service.
The data sources 112 can include, for instance, warranty data. The warranty data can include warranty documents or agreements that indicate conditions under which various entities associated with items of equipment are to provide service, repair, or other actions corresponding to items of equipment, such as actions corresponding to service requests.
The data sources 112 can include service data. The service data can include data from any of various service providers, such as service reports. The service data can indicate service procedures performed, including associated service procedures with initial service requests and/or sensor data related conditions to trigger service and/or sensor data measured during service processes.
In some implementations, the data sources 112 can include parts data, including but not limited to parts usage and sales data. For example, the data sources 112 can indicate various parts associated with installation or repair of items of equipment. The data sources 112 can indicate tools for performing service and/or installing parts.
The system 100 can include, with the data of the data sources 112, labels to facilitate cross-reference between items of data that may relate to common items of equipment, sites, service technicians, customers, or various combinations thereof. For example, data from disparate sources may be labeled with time data, which can allow the system 100 (e.g., by configuring the models 104, 116) to increase a likelihood of associating information from the disparate sources due to the information being detected or recorded (e.g., as service reports) at the same time or near in time.
For example, the data sources 112 can include data that can be particular to specific or similar items of equipment, buildings, equipment configurations, environmental states, or various combinations thereof. In some implementations, the data includes labels or identifiers of such information, such as to indicate locations, weather conditions, timing information, uses of the items of equipment or the buildings or sites at which the items of equipment are present, etc. This can enable the models 104, 116 to detect patterns of usage (e.g., spikes; troughs; seasonal or other temporal patterns) or other information that may be useful for determining causes of issues or causes of service requests, or predict future issues, such as to allow the models 104, 116 to be trained using information indicative of causes of issues across multiple items of equipment (which may have the same or similar causes even if the data regarding the items of equipment is not identical). For example, an item of equipment may be at a site that is a museum; by relating site usage or occupancy data with data regarding the item of equipment, such as sensor data and service reports, the system 100 can configure the models 104, 116 to determine a high likelihood of issues occurring before events associated with high usage (e.g., gala, major exhibit opening), and can generate recommendations to perform diagnostics or servicing prior to the events.
Referring further to
For example, the model updater 108 can identify one or more parameters (e.g., weights and/or biases) of one or more layers of the first model 104, and maintain (e.g., freeze, maintain as the identified values while updating) the values of the one or more parameters of the one or more layers. In some implementations, the model updater 108 can modify the one or more layers, such as to add, remove, or change an output layer of the one or more layers, or to not maintain the values of the one or more parameters. The model updater 108 can select at least a subset of the identified one or parameters to maintain according to various criteria, such as user input or other instructions indicative of an extent to which the first model 104 is to be modified to determine the second model 116. In some implementations, the model updater 108 can modify the first model 104 so that an output layer of the first model 104 corresponds to output to be determined for applications 120.
Responsive to selecting the one or more parameters to maintain, the model updater 108 can apply, as input to the second model 116 (e.g., to a candidate second model 116, such as the modified first model 104, such as the first model 104 having the identified parameters maintained as the identified values), training data from the data sources 112. For example, the model updater 108 can apply the training data as input to the second model 116 to cause the second model 116 to generate one or more candidate outputs.
The model updater 108 can evaluate a convergence condition to modify the candidate second model 116 based at least on the one or more candidate outputs and the training data applied as input to the candidate second model 116. For example, the model updater 108 can evaluate an objective function of the convergence condition, such as a loss function (e.g., L1 loss, L2 loss, root mean square error, cross-entropy or log loss, etc.) based on the one or more candidate outputs and the training data; this evaluation can indicate how closely the candidate outputs generated by the candidate second model 116 correspond to the ground truth represented by the training data. The model updater 108 can use any of a variety of optimization algorithms (e.g., gradient descent, stochastic descent, Adam optimization, etc.) to modify one or more parameters (e.g., weights or biases of the layer(s) of the candidate second model 116 that are not frozen) of the candidate second model 116 according to the evaluation of the objective function. In some implementations, the model updater 108 can use various hyperparameters to evaluate the convergence condition and/or perform the configuration of the candidate second model 116 to determine the second model 116, including but not limited to hyperparameters such as learning rates, numbers of iterations or epochs of training, etc.
As described further herein with respect to applications 120, in some implementations, the model updater 108 can select the training data from the data of the data sources 112 to apply as the input based at least on a particular application of the plurality of applications 120 for which the second model 116 is to be used for. For example, the model updater 108 can select data from the parts data source 112 for the product recommendation generator application 120, or select various combinations of data from the data sources 112 (e.g., engineering data, operational data, and service data) for the service recommendation generator application 120. The model updater 108 can apply various combinations of data from various data sources 112 to facilitate configuring the second model 116 for one or more applications 120.
In some implementations, the system 100 can perform at least one of conditioning, classifier-based guidance, or classifier-free guidance to configure the second model 116 using the data from the data sources 112. For example, the system 100 can use classifiers associated with the data, such as identifiers of the item of equipment, a type of the item of equipment, a type of entity operating the item of equipment, a site at which the item of equipment is provided, or a history of issues at the site, to condition the training of the second model 116. For example, the system 100 combine (e.g., concatenate) various such classifiers with the data for inputting to the second model 116 during training, for at least a subset of the data used to configure the second model 116, which can enable the second model 116 to be responsive to analogous information for runtime/inference time operations.
Referring further to
The applications 120 can include any of a variety of desktop, web-based/browser-based, or mobile applications. For example, the applications 120 can be implemented by enterprise management software systems, employee or other user applications (e.g., applications that relate to BMS functionality such as temperature control, user preferences, conference room scheduling, etc.), equipment portals that provide data regarding items of equipment, or various combinations thereof.
The applications 120 can include user interfaces, dashboards, wizards, checklists, conversational interfaces, chatbots, configuration tools, or various combinations thereof. The applications 120 can receive an input, such as a prompt (e.g., from a user), provide the prompt to the second model 116 to cause the second model 116 to generate an output, such as a completion in response to the prompt, and present an indication of the output. The applications 120 can receive inputs and/or present outputs in any of a variety of presentation modalities, such as text, speech, audio, image, and/or video modalities. For example, the applications 120 can receive unstructured or freeform inputs from a user, such as a building manager or service technician, and generate customized user interfaces based on the freeform inputs from the user. This can allow, for example, building personnel to: automatically, and flexibly, generate custom user interfaces without having to spend individually code and design each user interface; to receive inputs as dictations in order to generate user interfaces; receive inputs in any form or a variety of forms, and use the second model 116 (which can be trained to cross-reference metadata in different portions of inputs and relate together data elements) to generate output user interfaces (e.g., the second model 116, having been configured with data that includes time information, can use timestamps of input from dictation and timestamps of when an image is taken, and place the image in the report in a target position or label based on time correlation).
In some implementations, the applications 620 include at least one virtual assistant (e.g., virtual assistance for building management) application 620. The virtual assistant application can provide various services to support building operators, such as presenting information about the building status, modeling potential states of the building based on inputs from the user, identifying building faults and creating service requests, receiving queries regarding actions to perform to service items of equipment, and presenting responses indicating actions to perform to service items of equipment. The virtual assistant application can receive information regarding multiple buildings, a building, a portion of a building, or a piece of equipment for a building, such as sensor data, text descriptions, or camera images, and process the received information using the second model 616 to generate corresponding user interfaces about the building.
For example, the virtual assistant application 120 can be implemented in a UI/UX wizard configuration, such as to provide a sequence of requests for information from the user (the sequence may include requests that are at least one of predetermined or dynamically generated responsive to inputs from the user for previous requests). For example, the virtual assistant application 120 can provide one or more requests for information from users such as service technicians, facility managers, or other occupants, and provide the received responses to at least one of the second model 116 to generate a user interface which provides information about the building or building equipment. The virtual assistant application 120 can use requests for information such as for unstructured text by which the user describes characteristics of the item of equipment relating to the issue; answers expected to correspond to different scenarios indicative of the issue; and/or image and/or video input (e.g., images of problems, equipment, spaces, etc.) that can provide more context around the issue and/or configurations.
The virtual assistant application 120 can include a plurality of applications 120 (e.g., variations of interfaces or customizations of interfaces) for a plurality of respective user types. For example, the virtual assistant application 120 can include a first application 120 for a customer user, and a second application 120 for a building manager user. The virtual assistant applications 120 can allow for updating and other communications between the first and second applications 120 as well as the second model 116. Using one or more of the first application 120 and the second application 120, the system 100 can manage continuous/real-time conversations for one or more users, and evaluate the users' engagement with the information provided (e.g., did the user, customer, service technician, etc., follow the provided steps for responding to the issue or performing service, did the user discontinue providing inputs to the virtual assistant application 120, etc.), such as to enable the system 100 to update the information generated by the second model 116 for the virtual assistant application 120 according to the engagement. In some implementations, the system 100 can use the second model 116 to detect sentiment of the user of the virtual assistant application 120, and update the second model 116 according to the detected sentiment, such as to improve the experience provided by the virtual assistant application 120.
The applications 120 can include, in some implementations, at least one diagnostics and troubleshooting application 120. The diagnostics and troubleshooting application 120 can receive inputs including at least one of a service request or information regarding the item of equipment to be serviced, such as information identified by a service technician. The diagnostics and troubleshooting application 120 can provide the inputs to a corresponding second model 116 to cause the second model 116 to generate outputs such as indications of potential items to be checked regarding the item of equipment, modifications or fixes to make to perform the service, or values or ranges of values of parameters of the item of equipment that may be indicative of specific issues to for the service technician to address or repair. The diagnostics and troubleshooting application 120 may also receive inputs regarding any alarms or faults associated with the building or building equipment associated with the building. The diagnostics and troubleshooting application can provide the inputs to a corresponding second model 116 and cause the second model 116 to generate outputs such as user interface which may display information about the building including any active alarms and/or faults.
The applications 120 can at least one service recommendation generator application 120. The service recommendation generator application 120 can receive inputs such as a service request or information regarding the item of equipment to be serviced, and provide the inputs to the second model 116 to cause the second model 116 to generate outputs for presenting service recommendations, such as actions to perform to address the service request.
In some implementations, the applications 120 can include a user interface generation application 120. The user interface generator application 120 can process inputs such as information regarding the building or building equipment, using one or more second models 116 (e.g., models trained using parts data from the data sources 112), to generate a user interface which displays the information regarding the building or building equipment in a format requested by the user.
Referring further to
The feedback repository 124 can include feedback received from users regarding output presented by the applications 120. For example, for at least a subset of outputs presented by the applications 120, the applications 120 can present one or more user input elements for receiving feedback regarding the outputs. The user input elements can include, for example, indications of binary feedback regarding the outputs (e.g., good/bad feedback; feedback indicating the outputs do or do not meet the user's criteria, such as criteria regarding technical accuracy or precision); indications of multiple levels of feedback (e.g., scoring the outputs on a predetermined scale, such as a 1-5 scale or 1-10 scale); freeform feedback (e.g., text or audio feedback); or various combinations thereof.
The system 100 can store and/or maintain feedback in the feedback repository 124. In some implementations, the system 100 stores the feedback with one or more data elements associated with the feedback, including but not limited to the outputs for which the feedback was received, the second model(s) 116 used to generate the outputs, and/or input information used by the second models 116 to generate the outputs (e.g., service request information; information captured by the user regarding the item of equipment).
The feedback trainer 128 can update the one or more second models 116 using the feedback. The feedback trainer 128 can be similar to the model updater 108. In some implementations, the feedback trainer 128 is implemented by the model updater 108; for example, the model updater 108 can include or be coupled with the feedback trainer 128. The feedback trainer 128 can perform various configuration operations (e.g., retraining, fine-tuning, transfer learning, etc.) on the second models 116 using the feedback from the feedback repository 124. In some implementations, the feedback trainer 128 identifies one or more first parameters of the second model 116 to maintain as having predetermined values (e.g., freeze the weights and/or biases of one or more first layers of the second model 116), and performs a training process, such as a fine tuning process, to configure parameters of one or more second parameters of the second model 116 using the feedback (e.g., one or more second layers of the second model 116, such as output layers or output heads of the second model 116).
In some implementations, the system 100 may not include and/or use the model updater 108 (or the feedback trainer 128) to determine the second models 116. For example, the system 100 can include or be coupled with an output processor (e.g., an output processor similar or identical to accuracy checker 316 described with reference to
Referring further to
The system 100 can be used to automate operations for scheduling, provisioning, and deploying service technicians and resources for service technicians to perform service operations. For example, the system 100 can use at least one of the first model 104 or the second model 116 to determine, based on processing information regarding service operations for items of equipment relative to completion criteria for the service operation, particular characteristics of service operations such as experience parameters of scheduled service technicians, identifiers of parts provided for the service operations, geographical data, types of customers, types of problems, or information content provided to the service technicians to facilitate the service operation, where such characteristics correspond to the completion criteria being satisfied (e.g., where such characteristics correspond to an increase in likelihood of the completion criteria being satisfied relative to other characteristics for service technicians, parts, information content, etc.). For example, the system 100 can determine, for a given item of equipment, particular parts to include on a truck to be sent to the site of the item of equipment. As such, the system 100, responsive to processing inputs at runtime such as service requests, can automatically and more accurately identify service technicians and parts to direct to the item of equipment for the service operations. The system 100 can use timing information to perform batch scheduling for multiple service operations and/or multiple technicians for the same or multiple service operations. The system 100 can perform batch scheduling for multiple trucks for multiple items of equipment, such as to schedule a first one or more parts having a greater likelihood for satisfying the completion criteria for a first item of equipment on a first truck, and a second one or more parts having a greater likelihood for satisfying the completion criteria for a second item of equipment on a second truck.
The system 200 can include at least one data repository 204, which can be similar to the data sources 112 described with reference to
The data repository 204 can include a product database 212, which can be similar or identical to the parts data of the data sources 112. The product database 212 can include, for example, data regarding products available from various vendors, specifications or parameters regarding products, and indications of products used for various service operations. The products database 212 can include data such as events or alarms associated with products; logs of product operation; and/or time series data regarding product operation, such as longitudinal data values of operation of products and/or building equipment.
The data repository 204 can include an operations database 216, which can be similar or identical to the operations data of the data sources 112. For example, the operations database 216 can include data such as manuals regarding parts, products, and/or items of equipment; customer service data; and or reports, such as operation or service logs.
In some implementations, the data repository 204 can include an output database 220, which can include data of outputs that may be generated by various machine learning models and/or algorithms. For example, the output database 220 can include values of pre-calculated predictions and/or insights, such as parameters regarding operation items of equipment, such as setpoints, changes in setpoints, flow rates, control schemes, identifications of error conditions, or various combinations thereof.
As depicted in
In some implementations, the prompt management system 228 includes a pre-processor 232. The pre-processor 232 can perform various operations to prepare the data from the data repository 204 for prompt generation. For example, the pre-processor 232 can perform any of various filtering, compression, tokenizing, or combining (e.g., combining data from various databases of the data repository 204) operations.
The prompt management system 228 can include a prompt generator 236. The prompt generator 236 can generate, from data of the data repository 204, one or more training data elements that include a prompt and a completion corresponding to the prompt. In some implementations, the prompt generator 236 receives user input indicative of prompt and completion portions of data. For example, the user input can indicate template portions representing prompts of structured data, such as predefined fields or forms of documents, and corresponding completions provided for the documents. The user input can assign prompts to unstructured data. In some implementations, the prompt generator 236 automatically determines prompts and completions from data of the data repository 204, such as by using any of various natural language processing algorithms to detect prompts and completions from data. In some implementations, the system 200 does not identify distinct prompts and completions from data of the data repository 204.
Referring further to
The training management system 240 can include a training manager 244. The training manager 244 can incorporate features of at least one of the model updater 108 or the feedback trainer 128 described with reference to
In some implementations, the training management system 240 includes a prompts database 248. For example, the training management system 240 can store one or more training data elements from the prompt management system 228, such as to facilitate asynchronous and/or batched training processes.
The training manager 244 can control the training of machine learning models using information or instructions maintained in a model tuning database 256. For example, the training manager 244 can store, in the model tuning database 256, various parameters or hyperparameters for models and/or model training.
In some implementations, the training manager 244 stores a record of training operations in a jobs database 252. For example, the training manager 244 can maintain data such as a queue of training jobs, parameters or hyperparameters to be used for training jobs, or information regarding performance of training.
Referring further to
The model system 260 can include a model configuration processor 264. The model configuration processor 264 can incorporate features of the model updater 108 and/or the feedback trainer 128 described with reference to
The client device 304 can be a device of a user, such as a technician or building manager. The client device 304 can include any of various wireless or wired communication interfaces to communicate data with the model system 260, such as to provide requests to the model system 260 indicative of data for the machine learning models 268 to generate, and to receive outputs from the model system 260. The client device 304 can include various user input and output devices to facilitate receiving and presenting inputs and outputs.
In some implementations, the system 200 provides data to the client device 304 for the client device 304 to operate the at least one application session 308. The application session 308 can include a session corresponding to any of the applications 120 described with reference to
In some implementations, the application session 308 maintains a session state regarding the application session 308. The session state can include one or more prompts received by the application session 308, and can include one or more completions received by the application session 308 from the model system 260. The session state can include one or more items of feedback received regarding the completions, such as feedback indicating accuracy of the completion.
The system 200 can include or be coupled with one or more session inputs 340 or sources thereof. The session inputs 340 can include, for example and without limitation, location-related inputs, such as identifiers of an entity managing an item of equipment or a building or building management system, a jurisdiction (e.g., city, state, country, etc.), a language, or a policy or configuration associated with operation of the item of equipment, building, or building management system. The session inputs 340 can indicate an identifier of the user of the application session 308. The session inputs 340 can include data regarding items of equipment or building management systems, including but not limited to operation data or sensor data. The session inputs 340 can include information from one or more applications, algorithms, simulations, neural networks, machine learning models, or various combinations thereof, such as to provide analyses, predictions, or other information regarding items of equipment. The session inputs 340 can data from or analogous to the data of the data repository 204.
In some implementations, the model system 260 includes at least one sessions database 312. The sessions database 312 can maintain records of application session 308 implemented by client devices 304. For example, the sessions database 312 can include records of prompts provided to the machine learning models 268 and completions generated by the machine learning models 268. As described further with reference to
As depicted in
In some implementations, the system 200 includes an accuracy checker 316. The accuracy checker 316 can include one or more rules, heuristics, logic, policies, algorithms, functions, machine learning models, neural networks, scripts, or various combinations thereof to perform operations including evaluating performance criteria regarding the completions determined by the model system 260. For example, the accuracy checker 316 can include at least one completion listener 320. The completion listener 320 can receive the completions determined by the model system 260 (e.g., responsive to the completions being generated by the machine learning model 268 and/or by retrieving the completions from the sessions database 312).
The accuracy checker 316 can include at least one completion evaluator 324. The completion evaluator 324 can evaluate the completions (e.g., as received or retrieved by the completion listener 320) according to various criteria. In some implementations, the completion evaluator 324 evaluates the completions by comparing the completions with corresponding data from the data repository 204. For example, the completion evaluator 324 can identify data of the data repository 204 having similar text as the prompts and/or completions (e.g., using any of various natural language processing algorithms), and determine whether the data of the completions is within a range of expected data represented by the data of the data repository 204.
In some implementations, the accuracy checker 316 can store an output from evaluating the completion (e.g., an indication of whether the completion satisfies the criteria) in an evaluation database 328. For example, the accuracy checker 316 can assign the output (which may indicate at least one of a binary indication of whether the completion satisfied the criteria or an indication of a portion of the completion that did not satisfy the criteria) to the completion for storage in the evaluation database 328, which can facilitate further training of the machine learning models 268 using the completions and output.
The accuracy checker 316 can include or be coupled with at least one post-processor 336. The post-processor 336 can perform various operations to evaluate, validate, and/or modify the completions generated by the model system 260. In some implementations, the post-processor 336 includes or is coupled with data filters 500, validation system 600, and/or expert system 700 described with reference to
Referring further to
The system 200 can include or be coupled with operations data 348. The operations data 348 can be part of or analogous to one or more data sources of the data repository 204. The operations data 348 can include, for example and without limitation, data regarding real-world operations of building management systems and/or items of equipment, such as changes in building policies, building states, ticket or repair data, results of servicing or other operations, performance indices, or various combinations thereof. The operations data 348 can be retrieved by the application session 308, such as to condition or modify prompts and/or requests for prompts on operations data 348.
As depicted in
For example, the role-specific models 360 can include one or more of an author model 360, an editor model 360, a validator model 360, or various combinations thereof. The author model 360 can be used to generate an initial or candidate completion, such as to receive the prompt (e.g., via pre-processor 332) and generate the initial completion responsive to the prompt. The editor model 360 and/or validator model 360 can apply any of various criteria, such as accuracy checking criteria, to the initial completion, to validate or modify (e.g., revise) the initial completion. For example, the editor model 360 and/or validator model 360 can be coupled with the external systems 344 to query the external systems 344 using the initial completion (e.g., to detect a difference between the initial completion and one or more expected values or ranges of values for the initial completion), and at least one of output an alert or modify the initial completion (e.g., directly or by identifying at least a portion of the initial completion for the author model 360 to regenerate). In some implementations, at least one of the editor model 360 or the validator model 360 are tuned with different hyperparameters from the author model 360, or can adjust the hyperparameter(s) of the author model 360, such as to facilitate modifying the initial completion using a model having a higher threshold for confidence of outputted results responsive to the at least one of the editor model 360 or the validator model 360 determining that the initial completion does not satisfy one or more criteria. In some implementations, the at least one of the editor model 360 or the validator model 360 is tuned to have a different (e.g., lower) risk threshold than the author model 360, which can allow the author model 360 to generate completions that may fall into a greater domain/range of possible values, while the at least one of the editor model 360 or the validator model 360 can refine the completions (e.g., limit refinement to specific portions that do not meet the thresholds) generated by the author model 360 to fall within appropriate thresholds (e.g., rather than limiting the threshold for the author model 360).
For example, responsive to the validator model 360 determining that the initial completion includes a value (e.g., setpoint to meet a target value of a performance index) that is outside of a range of values validated by a simulation for an item of equipment, the validator model 360 can cause the author model 360 to regenerate at least a portion of the initial completion that includes the value; such regeneration may include increasing a confidence threshold for the author model 360. The validator model 360 can query the author model 360 for a confidence level associated with the initial completion, and cause the author model 360 to regenerate the initial completion and/or generate additional completions responsive to the confidence level not satisfying a threshold. The validator model 360 can query the author model 360 regarding portions (e.g., granular portions) of the initial completion, such as to request the author model 360 to divide the initial completion into portions, and separately evaluate each of the portions. The validator model 360 can convert the initial completion into a vector, and use the vector as a key to perform a vector concept lookup to evaluate the initial completion against one or more results retrieved using the key.
The feedback system 400 can receive feedback (e.g., from the client device 304) in various formats. For example, the feedback can include any of text, speech, audio, image, and/or video data. The feedback can be associated (e.g., in a data structure generated by the application session 308) with the outputs of the machine learning models 268 for which the feedback is provided. The feedback can be received or extracted from various forms of data, including external data sources such as manuals, service reports, or Wikipedia-type documentation.
In some implementations, the feedback system 400 includes a pre-processor 404. The pre-processor 404 can perform any of various operations to modify the feedback for further processing. For example, the pre-processor 404 can incorporate features of, or be implemented by, the pre-processor 232, such as to perform operations including filtering, compression, tokenizing, or translation operations (e.g., translation into a common language of the data of the data repository 204).
The feedback system 400 can include a bias checker 408. The bias checker 408 can evaluate the feedback using various bias criteria, and control inclusion of the feedback in a feedback database 416 (e.g., a feedback database 416 of the data repository 204 as depicted in
The feedback system 400 can include a feedback encoder 412. The feedback encoder 412 can process the feedback (e.g., responsive to bias checking by the bias checker 408) for inclusion in the feedback database 416. For example, the feedback encoder 412 can encode the feedback as values corresponding to outputs scoring determined by the model system 260 while generating completions (e.g., where the feedback indicates that the completion presented via the application session 308 was acceptable, the feedback encoder 412 can encode the feedback by associating the feedback with the completion and assigning a relatively high score to the completion).
As indicated by the dashed arrows in
For example, the data filters 500 can be used to evaluate data relative to thresholds relating to data including, for example and without limitation, acceptable data ranges, setpoints, temperatures, pressures, flow rates (e.g., mass flow rates), or vibration rates for an item of equipment. The threshold can include any of various thresholds, such as one or more of minimum, maximum, absolute, relative, fixed band, and/or floating band thresholds.
The data filters 500 can enable the system 200 to detect when data, such as prompts, completions, or other inputs and/or outputs of the system 200, collide with thresholds that represent realistic behavior or operation or other limits of items of equipment. For example, the thresholds of the data filters 500 can correspond to values of data that are within feasible or recommended operating ranges. In some implementations, the system 200 determines or receives the thresholds using models or simulations of items of equipment, such as plant or equipment simulators, chiller models, HVAC-R models, refrigeration cycle models, etc. The system 200 can receive the thresholds as user input (e.g., from experts, technicians, or other users). The thresholds of the data filters 500 can be based on information from various data sources. The thresholds can include, for example and without limitation, thresholds based on information such as equipment limitations, safety margins, physics, expert teaching, etc. For example, the data filters 500 can include thresholds determined from various models, functions, or data structures (e.g., tables) representing physical properties and processes, such as physics of psychometrics, thermodynamics, and/or fluid dynamics information.
The system 200 can determine the thresholds using the feedback system 400 and/or the client device 304, such as by providing a request for feedback that includes a request for a corresponding threshold associated with the completion and/or prompt presented by the application session 308. For example, the system 200 can use the feedback to identify realistic thresholds, such as by using feedback regarding data generated by the machine learning models 268 for ranges, setpoints, and/or start-up or operating sequences regarding items of equipment (and which can thus be validated by human experts). In some implementations, the system 200 selectively requests feedback indicative of thresholds based on an identifier of a user of the application session 308, such as to selectively request feedback from users having predetermined levels of expertise and/or assign weights to feedback according to criteria such as levels of expertise.
In some implementations, one or more data filters 500 correspond to a given setup. For example, the setup can represent a configuration of a corresponding item of equipment (e.g., configuration of a chiller, etc.). The data filters 500 can represent various thresholds or conditions with respect to values for the configuration, such as feasible or recommendation operating ranges for the values. In some implementations, one or more data filters 500 correspond to a given situation. For example, the situation can represent at least one of an operating mode or a condition of a corresponding item of equipment.
The system 200 can perform various actions responsive to the processing of data by the data filters 500. In some implementations, the system 200 can pass data to a destination without modifying the data (e.g., retaining a value of the data prior to evaluation by the data filter 500) responsive to the data satisfying the criteria of the respective data filter(s) 500. In some implementations, the system 200 can at least one of (i) modify the data or (ii) output an alert responsive to the data not satisfying the criteria of the respective data filter(s) 500. For example, the system 200 can modify the data by modifying one or more values of the data to be within the criteria of the data filters 500.
In some implementations, the system 200 modifies the data by causing the machine learning models 268 to regenerate the completion corresponding to the data (e.g., for up to a predetermined threshold number of regeneration attempts before triggering the alert). This can enable the data filters 500 and the system 200 selectively trigger alerts responsive to determining that the data (e.g., the collision between the data and the thresholds of the data filters 500) may not be repairable by the machine learning model 268 aspects of the system 200.
The system 200 can output the alert to the client device 304. The system 200 can assign a flag corresponding to the alert to at least one of the prompt (e.g., in prompts database 224) or the completion having the data that triggered the alert.
For example, the validation system 600 can receive data such as data retrieved from the data repository 204, prompts outputted by the prompt management system 228, completions outputted by the model system 260, indications of accuracy outputted by the accuracy checker 316, etc., and provide the received data to at least one of an expert system or a user interface. In some implementations, the validation system 600 receives a given item of data prior to the given item of data being processed by the model system 260, such as to validate inputs to the machine learning models 268 prior to the inputs being processed by the machine learning models 268 to generate outputs, such as completions.
In some implementations, the validation system 600 validates data by at least one of (i) assigning a label (e.g., a flag, etc.) to the data indicating that the data is validated or (ii) passing the data to a destination without modifying the data. For example, responsive to receiving at least one of a user input (e.g., from a human validator/supervisor/expert) that the data is valid or an indication from an expert system that the data is valid, the validation system 600 can assign the label and/or provide the data to the destination.
The validation system 600 can selectively provide data from the system 200 to the validation interface responsive to operation of the data filters 500. This can enable the validation system 600 to trigger validation of the data responsive to collision of the data with the criteria of the data filters 500. For example, responsive to the data filters 500 determining that an item of data does not satisfy a corresponding criteria, the data filters 500 can provide the item of data to the validation system 600. The data filters 500 can assign various labels to the item of data, such as indications of the values of the thresholds that the data filters 500 used to determine that the item of data did not satisfy the thresholds. Responsive to receiving the item of data from the data filters 500, the validation system 600 can provide the item of data to the validation interface (e.g., to a user interface of client device 304 and/or application session 308; for comparison with a model, simulation, algorithm, or other operation of an expert system) for validation. In some implementations, the validation system 600 can receive an indication that the item of data is valid (e.g., even if the item of data did not satisfy the criteria of the data filters 500) and can provide the indication to the data filters 500 to cause the data filters 500 to at least partially modify the respective thresholds according to the indication.
In some implementations, the validation system 600 selectively retrieves data for validation where (i) the data is determined or outputted prior to use by the machine learning models 268, such as data from the data repository 204 or the prompt management system 228, or (ii) the data does not satisfy a respective data filter 500 that processes the data. This can enable the system 200, the data filters 500, and the validation system 600 to update the machine learning models 268 and other machine learning aspects (e.g., generative AI aspects) of the system 200 to more accurately generate data and completions (e.g., enabling the data filters 500 to generate alerts that are received by the human experts/expert systems that may be repairable by adjustments to one or more components of the system 200).
In some implementations, the expert system 700 retrieves data to be provided to the application session 308, such as completions generated by the machine learning models 268. The expert system 700 can present the data via the expert session 708, such as to request feedback regarding the data from the client device 704. For example, the expert system 700 can receive feedback regarding the data for modifying or validating the data (e.g., editing or validating completions). In some implementations, the expert system 700 requests at least one of an identifier or a credential of a user of the client device 704 prior to providing the data to the client device 704 and/or requesting feedback regarding the data from the expert session 708. For example, the expert system 700 can request the feedback responsive to determining that the at least one of the identifier or the credential satisfies a target value for the data. This can allow the expert system 708 to selectively identify experts to use for monitoring and validating the data.
In some implementations, the expert system 700 facilitates a communication session regarding the data, between the application session 308 and the expert session 708. For example, the expert session 700, responsive to detecting presentation of the data via the application session 308, can request feedback regarding the data (e.g., user input via the application session 308 for feedback regarding the data), and provide the feedback to the client device 704 to present via the expert session 708. The expert session 708 can receive expert feedback regarding at least one of the data or the feedback from the user to provide to the application session 308. In some implementations, the expert system 700 can facilitate any of various real-time or asynchronous messaging protocols between the application session 308 and expert session 708 regarding the data, such as any of text, speech, audio, image, and/or video communications or combinations thereof. This can allow the expert system 700 to provide a platform for a user receiving the data (e.g., customer or field technician) to receive expert feedback from a user of the client device 704 (e.g., expert technician). In some implementations, the expert system 700 stores a record of one or more messages or other communications between the sessions 308, 708 in the data repository 204 to facilitate further configuration of the machine learning models 268 based on the interactions between the users of the sessions 308, 708.
Referring further to
For example, in some implementations, various data discussed herein may be stored in, retrieved from, or processed in the context of building data platforms and/or digital twins; processed at (e.g., processed using models executed at) a cloud or other off-premises computing system/device or group of systems/devices, an edge or other on-premises system/device or group of systems/devices, or a hybrid thereof in which some processing occurs off-premises and some occurs on-premises; and/or implemented using one or more gateways for communication and data management amongst various such systems/devices. In some such implementations, the building data platforms and/or digital twins may be provided within an infrastructure such as those described in U.S. patent application Ser. No. 17/134,661 filed Dec. 28, 2020, Ser. No. 18/080,360, filed Dec. 13, 2022, Ser. No. 17/537,046 filed Nov. 29, 2021, and Ser. No. 18/096,965, filed Jan. 13, 2023, and Indian patent application No. 202341008712, filed Feb. 10, 2023, the disclosures of which are incorporated herein by reference in their entireties.
As described above, systems and methods in accordance with the present disclosure can use machine learning models, including LLMs and other generative AI models, to ingest data regarding building management systems and equipment in various unstructured and structured formats, and generate completions and other outputs targeted to provide useful information to users. Various systems and methods described herein can use machine learning models to support applications for presenting data with high accuracy and relevance.
At 805, a prompt can be received. The prompt can be received using a user interface implemented by an application session of a client device. The prompt can be received in any of various data formats, such as text, audio, speech, image, and/or video formats. The prompt can be indicative of an item of equipment, such as a condition of the equipment (e.g., an error detected or fault condition) or a building management system or component thereof. The prompt can indicate a request for a service to perform for the item of equipment. The prompt can indicate one or more characteristics of the item of equipment. In some implementations, the application session provides a conversational interface or chatbot for receiving the prompt, and can present queries via the application to request information for the prompt. For example, the application session can determine that the prompt indicates a type of equipment, and can request information regarding expected issues regarding the equipment (e.g., via iterative generation of completions and communication with machine learning models).
At 810, the prompt may be validated. For example, criteria such as one or more rules, heuristics, models, algorithms, thresholds, policies, or various combinations thereof can be evaluated using the prompt. The criteria can be evaluated to determine whether the prompt is appropriate for the item of equipment. In some implementations, the prompt can be evaluated by a pre-processor that may be separate from at least one of the application session or the machine learning models. In some implementations, the prompt can be evaluated using any one or more accuracy checkers, data filters, simulations regarding operation of the item of equipment, or expert validation systems; the evaluation can be used to update the criteria (e.g., responsive to an expert determining that the prompt is valid even if the prompt includes information that does not satisfy the criteria, the criteria can be updated to be capable of being satisfied by the information of the prompt). In some implementations, the prompt is modified according to the evaluation; for example, a request can be presented via the application session for an updated version of the prompt, or the pre-processor can modify the prompt to make the prompt satisfy the one or more criteria. The prompt can be converted into a vector to perform a lookup in a vector database of expected prompts or information of prompts to validate the prompt.
At 815, at least one completion is generated using the prompt (e.g., responsive to validating the prompt). The completion can be generated using one or more machine learning models, including generative machine learning models. For example, the completion can be generated using a neural network comprising at least one transformer, such as a GPT model. The completion can be generated using image/video generation models, such as GAN and/or diffusion models. The completion can be generated based on the one or more machine learning models being configured (e.g., trained, updated, fine-tuned, etc.) using training data examples representative of information for items of equipment, including but not limited to unstructured data or semi-structured data such as service technician reports, operating manuals, technical data sheets, etc. Prompts can be iteratively received and completions iteratively generated responsive to the prompts as part of an asynchronous and/or conversational communication session.
In some implementations, generating the prompt comprises using a plurality of machine learning models, which may be configured in similar or different manners, such as by using different training data, model architectures, parameter tuning or hyperparameter fine tuning, or various combinations thereof. In some implementations, the machine learning models are configured in a manner representative of various roles, such as author, editor, validation, external data comparison, etc. roles. For example, a first machine learning model can operate as an author model, such as to have relatively fewer/lesser criteria for generating an initial completion responsive to the prompt, such as to require relatively lower confidence levels or risk criteria. A second machine learning model can be configured to have relatively greater/higher criteria, such as to receive the initial completion, process the initial completion to detect one or more data elements (e.g., tokens or combinations of tokens) that do not satisfy criteria of the second machine learning model, and output an alert or cause the first machine learning model to modify the initial completion responsive to the valuation. For example, the editor model can identify a phrase in the initial completion that does not satisfy an expected value (e.g., expected accuracy criteria determined by evaluating the prompt using a simulation), and can cause the first machine learning model to provide a natural language explanation of factors according to which the initial completion was determined, such as to present such explanations via the application session. The machine learning models can evaluate the completions according to bias criteria. The machine learning models can store the completions and prompts as data elements for further configuration of the machine learning models (e.g., positive/negative examples corresponding to the prompts).
At 820, the completion can be validated. The completion can be validated using various processes described for the machine learning models, such as by comparing the completion to any of various thresholds or outputs of databases or simulations. For example, the machine learning models can configure calls to databases or simulations for the item of equipment indicated by the prompt to validate the completion relative to outputs retrieved from the databases or simulations. The completion can be validated using accuracy checkers, bias checkers, data filters, or expert systems.
At 825, the completion is presented via the application session. For example, the completion can be presented as any of text, speech, audio, image, and/or video data to represent the completion, such as to provide an answer to a query represented by the prompt regarding an item of equipment or building management system. The completion can be presented via iterative generation of completions responsive to iterative receipt of prompts. The completion can be present with a user input element indicative of a request for feedback regarding the completion, such as to enable the prompt and completion to be used for updating the machine learning models.
At 830, the machine learning model(s) used to generate the completion can be updated according to at least one of the prompt, the completion, or the feedback. For example, a training data element for updating the model can include the prompt, the completion, and the feedback, such as to represent whether the completion appropriately satisfied a user's request for information regarding the item of equipment. The machine learning models can be updated according to indications of accuracy determined by operations of the system such as accuracy checking, or responsive to evaluation of completions by experts (e.g., responsive to selective presentation and/or batch presentation of prompts and completions to experts).
At 905, a prompt can be received from a user. The prompt can be received using a user interface implemented by an application session of a client device. The prompt can be received in any of various data formats, such as text, audio, speech, image, and/or video formats. The prompt can indicate a request for a user interface about a building, a portion of the building, or one or more pieces of building equipment to be generated by the application service. The prompt can indicate one or more characteristics the user desires the user interface to include. For example, the prompt may indicate that the user interface displays the building information in a certain format (e.g., graphs, charts, maps, text box, list, pictures, etc.). The prompt may also include a description of the kind of content the user desires to include in the generated user interface. The kind of content requested by the user may be any type of building information. For example, a user may desire to include content regarding any active alarms in the building, the occupancy status across the building, or any predicted or simulated states of the building. In some implementations, the application session provides a conversational interface or chatbot for receiving the prompt, and can present queries via the application to request information for the prompt. For example, the application session can determine that the prompt indicates a type of equipment, and can request information regarding the status of the equipment (e.g., via iterative generation of completions and communication with machine learning models). The format may indicate a certain way in which the user wants the content they request to be arranged and displayed.
At 910, the format and content of an output user interface is dynamically determined using an AI model and based on the prompt received at 905. For example, the user may enter a prompt which states “show alarms by hour and device.” Based on this prompt, it may be determined that the content requested relates to any building device alarms and the format can include any format which shows the alarms by hours and device (e.g., a table, a graph, etc.). In some embodiments, dynamically determining the format of the output user interface may include selecting one format from among multiple predetermined output content formats based on the prompt. For example, if the prompt includes any reference to time, the pre-defined formats may include types of graphs with time on one axis (e.g., a line graph, bar graph, etc.). In some embodiments, dynamically determining the format and the content of the output user interface may include predicting an intent of the user from the prompt and determining the format and the content of the output user interface based on the predicted intent using the AI model.
The format and content of the output can be generated using one or more machine learning models, including generative machine learning models. For example, the format and content can be generated using a neural network comprising at least one transformer, such as GPT model. The format and content can be generated using image/video generation models, such as GAN and/or diffusion models. The format and content can be generated based on the one or more machine learning models being configured (e.g., trained, updated, fine-tuned, etc.) using training data examples representative of information for items of equipment, including but not limited to unstructured data or semi-structured data such as service technician reports, operating manuals, technical data sheets, etc. Prompts can be iteratively received and user interfaces iteratively generated responsive to the prompts as part of an asynchronous and/or conversational communication session.
At 915, the output user interface is generated in the determined format and including the determined content using an AI model. In some embodiments, the output user interface includes a pre-determined interface with multiple populatable interface portions which may be populated by the AI model based on the dynamically determined format and content. For example, the pre-determined interface my be a table or graph which may include multiple populatable interface portions such as table cells which may be populated by the AI model based on the determined content and format. In some embodiments, the AI model is a generative AI model or LLM model. Further, in some embodiments, dynamically determining the format and content of the user interface includes dynamically generating at least a portion of the format or the content of the output user interface using the generative AI model or LLM model. In some embodiments, the determined format and content may be newly generated by the generative AI model that was not preexisting in the underlying data used to create and train the AI model prior to receiving the prompt from the user. In some embodiments, the output user interface includes a history of one or more prompts the user or other users have previously entered into the chatbot interface. The history of the one or more prompts may be related to the prompt entered by the user at 905. In some embodiments, the history of one or more prompts is dynamically generated by the AI model in response to the AI model determining that the history of one or more prompts is potentially relevant to the prompt entered by the user. For example, if the user entered a prompt requesting to create a user interface showing the current active alarms in a building, previously entered prompts relating to active alarms may be determined by the AI model and shown as a the history of one or more prompts which may be relevant to the prompt entered by the user.
The example user interface 1000 also includes a history portion 1004 which is configured to display one or more previously entered prompts which may be relevant to the prompt entered by the user in the AI chat interface 1002. As described above, the history of one or more prompts is dynamically generated by the AI model in response to the AI model determining that the history of one or more prompts is potentially relevant to the prompt entered by the user. In some embodiments, the example user interface 1000 includes an output user interface 1006 which is generated by an AI model as described above with respect to method 900.
The construction and arrangement of the systems and methods as shown in the various exemplary embodiments are illustrative only. Although only a few embodiments have been described in detail in this disclosure, many modifications are possible (e.g., variations in sizes, dimensions, structures, shapes and proportions of the various elements, values of parameters, mounting arrangements, use of materials, colors, orientations, etc.). For example, the position of elements may be reversed or otherwise varied and the nature or number of discrete elements or positions may be altered or varied. Accordingly, all such modifications are intended to be included within the scope of the present disclosure. The order or sequence of any process or method steps may be varied or re-sequenced according to alternative embodiments. Other substitutions, modifications, changes, and omissions may be made in the design, operating conditions and arrangement of the exemplary embodiments without departing from the scope of the present disclosure.
The present disclosure contemplates methods, systems and program products on any machine-readable media for accomplishing various operations. The embodiments of the present disclosure may be implemented using existing computer processors, or by a special purpose computer processor for an appropriate system, incorporated for this or another purpose, or by a hardwired system. Embodiments within the scope of the present disclosure include program products comprising machine-readable media for carrying or having machine-executable instructions or data structures stored thereon. Such machine-readable media can be any available media that can be accessed by a general purpose or special purpose computer or other machine with a processor. By way of example, such machine-readable media can comprise RAM, ROM, EPROM, EEPROM, CD-ROM or other optical disk storage, magnetic disk storage or other magnetic storage devices, or any other medium which can be used to carry or store desired program code in the form of machine-executable instructions or data structures and which can be accessed by a general purpose or special purpose computer or other machine with a processor. When information is transferred or provided over a network or another communications connection (either hardwired, wireless, or a combination of hardwired or wireless) to a machine, the machine properly views the connection as a machine-readable medium. Thus, any such connection is properly termed a machine-readable medium. Combinations of the above are also included within the scope of machine-readable media. Machine-executable instructions include, for example, instructions and data which cause a general purpose computer, special purpose computer, or special purpose processing machines to perform a certain function or group of functions.
Although the figures show a specific order of method steps, the order of the steps may differ from what is depicted. Also two or more steps may be performed concurrently or with partial concurrence. Such variation will depend on the software and hardware systems chosen and on designer choice. All such variations are within the scope of the disclosure. Likewise, software implementations could be accomplished with standard programming techniques with rule based logic and other logic to accomplish the various connection steps, processing steps, comparison steps and decision steps.
In various implementations, the steps and operations described herein may be performed on one processor or in a combination of two or more processors. For example, in some implementations, the various operations could be performed in a central server or set of central servers configured to receive data from one or more devices (e.g., edge computing devices/controllers) and perform the operations. In some implementations, the operations may be performed by one or more local controllers or computing devices (e.g., edge devices), such as controllers dedicated to and/or located within a particular building or portion of a building. In some implementations, the operations may be performed by a combination of one or more central or offsite computing devices/servers and one or more local controllers/computing devices. All such implementations are contemplated within the scope of the present disclosure. Further, unless otherwise indicated, when the present disclosure refers to one or more computer-readable storage media and/or one or more controllers, such computer-readable storage media and/or one or more controllers may be implemented as one or more central servers, one or more local controllers or computing devices (e.g., edge devices), any combination thereof, or any other combination of storage media and/or controllers regardless of the location of such devices.
This application claims the benefit of and priority to U.S. Provisional Patent Application No. 63/470,756, filed on Jun. 2, 2023, the entirety of which is incorporated by reference herein.
Number | Date | Country | |
---|---|---|---|
63470756 | Jun 2023 | US |