PREDICTIVE SUSTAINABILITY ANALYTICS FOR SOFTWARE DEPLOYMENTS

Information

  • Patent Application
  • 20250005595
  • Publication Number
    20250005595
  • Date Filed
    June 28, 2023
    a year ago
  • Date Published
    January 02, 2025
    3 days ago
Abstract
A greenhouse gas (GHG) emissions estimation system is designed to train and utilize an artificial intelligence (AI) model to estimate the GHG emissions associated with operating a software application that is deployed (or is intended to be deployed) within a virtualized environment. For example, a software developer may utilize the GHG emissions estimation system as part of a cost/benefit analysis of different software solutions at an early stage of application design and development. Since there may be a variety of possible infrastructural options for a given software application, the GHG emissions estimation system enables the developer to estimate the GHG emissions associated with each infrastructural option, which can be used when determining which solution should be developed and deployed.
Description
BACKGROUND

This disclosure relates generally to determining predictive sustainability analytics for software deployments, and more specifically to estimating a greenhouse gas (GHG) emission footprint for software solutions that are deployed (or are intended to be deployed) in virtualized environments.


This section is intended to introduce the reader to various aspects of art that may be related to various aspects of the present disclosure, which are described and/or claimed below. This discussion is believed to be helpful in providing the reader with background information to facilitate a better understanding of the various aspects of the present disclosure. Accordingly, it should be understood that these statements are to be read in this light, and not as admissions of prior art.


Many organizations are interested in determining the climate impact of various operations. For example, a company may operate a computer-implemented system that monitors energy utilization of a technical system deployed in production, and that estimates a GHG emission footprint of the deployed technical system as a function of energy utilization. In another example, when a future system deployment has a simple hardware configuration (e.g., a single, standalone server), a company may estimate the GHG emission footprint of the future system deployment as a function of the manufacturer-supplied energy utilization specifications of the hardware.


However, it is increasingly common for certain technical systems to be deployed within virtualized environments. For example, a virtualized environment can be implemented using hardware servers disposed within a remote data center or an on-premises data center. Rather than executing software applications on dedicated data center hardware, a software application may instead execute within a virtualized environment that is hosted on the hardware of the data center, in which various computing (e.g., processor, memory, networking) resources of the data center are dynamically allotted to one or more virtual machines of the virtualized environment to execute the software application. It is presently recognized that, for organizations that deploy software applications in virtualized environments, there is not a solution to estimate the GHG emission footprint of the technical systems deployed within the virtualized environment.


BRIEF DESCRIPTION

Certain embodiments commensurate in scope with the originally claimed subject matter are summarized below. These embodiments are not intended to limit the scope of the claimed subject matter, but rather these embodiments are intended only to provide a brief summary of possible forms of the subject matter. Indeed, the subject matter may encompass a variety of forms that may be similar to or different from the embodiments set forth below.


In accordance with an embodiment of the present disclosure, a computing system includes at least one memory configured to store an artificial intelligence (AI) model, and includes at least one processor configured to execute stored instructions to perform actions. The actions include receiving a request to determine estimated greenhouse gas (GHG) emissions of an application when deployed in a virtualization environment, wherein the request indicates at least one application file of the application. The actions include analyzing the at least one application file of the application to determine infrastructural components and operational parameters of the application. The actions also include generating, using the AI model, the estimated GHG emissions of the application when deployed in the virtualization environment, based on the infrastructural components and operational parameters of the application. The actions further include providing a response to the request that includes at least the estimated GHG emissions of the application when deployed in the virtualization environment.


In accordance with an embodiment of the present disclosure, a computer-implemented method includes receiving a request to determine estimated GHG emissions of an application when deployed in a virtualization environment, wherein the request indicates at least one application file of the application. The method includes analyzing the at least one application file of the application to determine infrastructural components and operational parameters of the application. The method also includes generating, using an AI model, the estimated GHG emissions of the application when deployed in the virtualization environment, based on the infrastructural components and operational parameters of the application. The method further includes providing a response to the request that includes at least the estimated GHG emissions of the application when deployed in the virtualization environment.


In accordance with an embodiment of the present disclosure, a non-transitory, computer-readable medium storing instructions executable by a processor of a computing system. The instructions comprising instructions to receive a request to determine estimated GHG emissions of an application when deployed in a virtualization environment, wherein the request indicates at least one application file of the application. The instructions comprise instructions to analyze the at least one application file of the application to determine infrastructural components and operational parameters of the application. The instructions comprise instructions to generate, using an artificial intelligence (AI) model, the estimated GHG emissions of the application when deployed in the virtualization environment, based on the infrastructural components and operational parameters of the application. The instructions comprise instructions to provide a response to the request that includes at least the estimated GHG emissions of the application when deployed in the virtualization environment.


In accordance with an embodiment of the present disclosure, a computing system includes at least one memory configured to store an artificial intelligence (AI) model and at least one processor configured to execute stored instructions to perform actions. The actions comprise training the AI model to encode relationships between respective infrastructural components and operational parameters of a set of deployed applications and respective greenhouse gas (GHG) emissions of the set of deployed applications, yielding a trained AI model. The actions further include generating, using the trained AI model, estimated GHG emissions of an application when deployed in a virtualization environment, based on infrastructural components and operational parameters of the application.





DRAWINGS

These and other features, aspects, and advantages of the present invention will become better understood when the following detailed description is read with reference to the accompanying drawings in which like characters represent like parts throughout the drawings, wherein:



FIG. 1 is a diagram of an example GHG emissions estimation system implemented in a client-server architecture, in accordance with embodiments of the present technique;



FIG. 2 is a flow diagram of an example process for generating a training dataset for training an AI model of the GHG emissions estimation system, in accordance with embodiments of the present technique;



FIG. 3 is a flow diagram of an example process for training the AI model using the training dataset, in accordance with embodiments of the present technique; and



FIG. 4 is a flow diagram of an example process in which the GHG emissions estimation system uses the trained AI model in predictive mode to estimate GHG emissions of an application (when deployed), in accordance with embodiments of the present technique.





DETAILED DESCRIPTION

One or more specific embodiments will be described below. In an effort to provide a concise description of these embodiments, not all features of an actual implementation are described in the specification. It should be appreciated that in the development of any such actual implementation, as in any engineering or design project, numerous implementation-specific decisions must be made to achieve the developers' specific goals, such as compliance with system-related and business-related constraints, which may vary from one implementation to another. Moreover, it should be appreciated that such a development effort might be complex and time consuming, but would nevertheless be a routine undertaking of design, fabrication, and manufacture for those of ordinary skill having the benefit of this disclosure.


As noted above, there remains a need for a solution capable of estimating the GHG emission footprint of software applications that are deployed (or are intended to be deployed) within a virtualized environment. With this in mind, present embodiments are directed to a GHG emissions estimation system that is designed to estimate the GHG emissions associated with operating a software application that is deployed (or is intended to be deployed) within a virtualized environment. For example, a software developer may utilize the GHG emissions estimation system as part of a cost/benefit analysis of different software solutions. Since there may be a variety of possible infrastructural options for a given software application, the GHG emissions estimation system enables the developer to estimate the GHG emissions associated with each infrastructural option, which can be used when determining which solution should be developed and deployed.



FIG. 1 is a diagram of an embodiment of a GHG emissions estimation system 10 implemented in a client-server architecture 12. While the illustrated embodiment is implemented within the client-server architecture 12, it may be appreciated that, in other embodiments, the GHG emissions estimation system 10 may be implemented as an application that is locally stored and executed by a stand-alone client device. For the illustrated embodiment, the client-server architecture 12 includes at least one client device 14 that is communicatively coupled to at least one server device 16 (also referred to herein as simply “server”) via suitable network 17. In certain embodiments, the server 16 is communicatively coupled to a data center 18 (e.g., a remote or on-premises data center), while in other embodiments, the server 16 may be a hardware or virtual server of the data center 18.


In certain embodiments, the client device 14 may be a desktop computer, a laptop computer, a smart phone, or another suitable computing device. In certain embodiments, the server 16 may be a stand-alone server, or may be a virtualized server hosted by a remote or on-premises data center. The network 17 may include a wide-area network (WAN), a local-area network (LAN), a virtual private network (VPN), the internet, another suitable wired and/or wireless network, or any suitable combination thereof.


For the embodiment illustrated in FIG. 1, the client device 14 includes at least one processor 20 (e.g., a central processing unit (CPU), a graphic processing unit (GPU)), at least one memory 22 (e.g., random access memory (RAM), read-only memory (ROM)), and at least one storage 24 (e.g., a hard disk drive (HDD), a solid-state disk (SSD), flash memory). The storage 24 of the client device 14 may store suitable applications or apps that are executed by the processor 20 to provide desired functionality at the client device 14. For example, the storage 24 of the client device 14 may store a client web browser, or another suitable application, that can be loaded into memory 22 and executed by the processor 20 to enable the client device 14 to interact with services hosted by the server 16.


For the embodiment illustrated in FIG. 1, the server 16 includes at least one processor 26 (e.g., a CPU, a GPU), at least one memory 28 (e.g., RAM, ROM), and at least one storage 30 (e.g., a HDD, SSD, flash memory). For embodiments in which the server 16 is a virtualized server hosted by a remote or on-premises data center, the at least one processor 26, at least one memory 28, at least one storage 30, and so forth, may be infrastructural components or computing resources allocated to one or more virtual machines of a virtualized environment. The storage 30 includes a number of elements that may be loaded into the memory 28 and/or executed by the processor 26 to enable different functionality discussed herein. The client device 14 and/or the server 16 may include other suitable computing devices or components (e.g., networking devices or interfaces, input devices, output devices) in different implementations.


For the embodiment illustrated in FIG. 1, the server 16 hosts a software deployment platform 32 that is responsible for deploying applications 34 within a virtualization environment 36 that operates using the infrastructural components 37 (e.g., processors, memory, storage, networking components) of the data center 18. The deployed applications 34 may include, but are not limited to, web serving applications, user interface applications, cloud-based applications, e-commerce applications, and so forth, which are executed by the processor 26 to provide desired functionality to the client via the client device 14. The server 16 also hosts a software development platform 38 that stores source code 40 for the deployed applications 34 of the software deployment platform 32, and for applications that have not yet been deployed (e.g., applications still under development). The software development platform 38 stores one or more corresponding infrastructure-as-code (IAC) files 42 for the applications that are deployed or under development. As used herein, “source code” may refer to application instructions (e.g., human-readable instructions, machine-readable instructions, or both) for compiled programming languages, interpreted or scripted programming languages, or any combination thereof. The IAC files 42, also known as infrastructure-as-code templates, are industry standard for application deployment within a virtual environment, and are typically provided in a YAML format. The IAC files 42 are used by the software deployment platform 32 to define the various infrastructural components 37 of the data center 18 that an application is designed to use during operation within the virtualization environment 36. In certain embodiments, the server 16 may only include either the software deployment platform 32 or the software development platform 38. In some embodiments, the server 16 may lack the software development platform 38 and/or the software deployment platform 32, wherein the server 16 is communicatively coupled to one or more other computing systems hosting the software development platform 38 and/or the software deployment platform 32 to enable the functionality described herein.


For the embodiment illustrated in FIG. 1, the server 16 also hosts a training data repository 44 that stores a training dataset 46, which may be generated as discussed below with respect to FIG. 2. As discussed below with respect to FIG. 3, the training dataset 46 is used to train an AI model, yielding the trained AI model 48. As discussed below with respect to FIG. 4, the trained AI model 48 is subsequently used by the GHG emissions estimation system 10 in predictive mode to estimate the GHG emissions associated with operating an application within the virtualization environment 36. The AI model may be based on a supervised or unsupervised learning regression algorithm, such as a decision tree model, a random forest (RF) model, a support vector machine (SVM) model, a linear regression model, an artificial neural network (ANN) model, a convolutional neural network (CNN) model, a recurrent neural network (RNN) model, or another suitable model.


For the embodiment illustrated in FIG. 1, the server 16 also hosts an application analyzer 50. In certain embodiments, the application analyzer 50 is a software component having instructions that, when executed by the processor 26 of the server 16, applies a set of analysis rules 52 to analyze the source code 40 and/or the IAC files 42 associated with an application to determine infrastructural components and operational parameters of the application when deployed in the virtualization environment 36. For example, the analysis rules 52 indicate which infrastructural components and operational parameters should be determined for an application based on the resource requirements (e.g., number of virtual machines, number or type of processors, memory space, storage space, network bandwidth) defined within the IAC files of the application. In certain embodiments, the analysis rules 52 may additionally or alternatively indicate which infrastructural components and operational parameters should be determined for an application based on supporting systems (e.g., load balancers, web servers, database servers) that are used by the application during operation, as determined based on the contents of the source code and/or IAC files of the application. During operation, the application analyzer 50 may determine that a portion of the source code and/or IAC files of an application matches a condition defined by one of the analysis rules 52, and in response, may determine one or more infrastructural components and/or operational parameters of the application as defined by the matched analysis rule. In some embodiments, rather than being a rule-based application analyzer 50, the application analyzer 50 may instead be an AI-based application analyzer 50 having an AI model that is trained to receive the IAC files and/or source code of an application as inputs, and to provide an output that indicates the infrastructural components and operational parameters of the application.


For the embodiment illustrated in FIG. 1, the server 16 may receive a request 54 from the client device 14 to train and/or tune an AI model to yield the trained AI model 48. After training or tuning the AI model, the server 16 may provide a response 56 indicating results of the training or tuning of the AI model. For example, the response 56 may indicate an accuracy of the trained AI model with respect to the training dataset or a number of iterations (or amount of processing time) utilized during the training process. After the AI model is trained or tuned, the server 16 may receive a request 58 to estimate the GHG emissions associated with the operation of an application that has been deployed (or is intended to be deployed) in the virtualization environment 36. After using the trained AI model 48 to determine the GHG emissions associated with the operation of the application, the server 16 may provide a response 60 indicating the estimated GHG emissions. In some embodiments, the response 60 may also include one or more recommendations regarding changes that could be made to the source code 40 or the IAC files 42 of the application to reduce the estimated GHG emissions of the application during operation.



FIG. 2 is a flow diagram of an embodiment of process 70 whereby the GHG emissions estimation system 10 generates the training dataset 46. The process 70 may be at least partially stored as software instructions within the memory 28 and executed by the processor 26 of the server 16 of the GHG emissions estimation system 10. In other embodiments, the process 70 may include additional actions, fewer actions, repeated actions, actions taken in a different sequence, and so forth, in accordance with the present disclosure. The process 70 is discussed with reference to elements illustrated in FIG. 1.


For the embodiment illustrated in FIG. 2, the process 70 includes performing a number of actions with respect to each deployed application of a set of deployed applications, as indicated by block 72. In certain embodiments, the set of deployed applications includes all of the applications 34 deployed by the software deployment platform 32, while in some embodiments, the set of deployed applications includes only a subset of the applications 34 deployed by the software deployment platform 32. For the illustrated embodiment, the actions of block 72 include the processor 26 determining (block 74) infrastructural components and operational parameters 76 of the deployed application. The infrastructural components and operational parameters 76 indicate which of the infrastructural components 37 of the data center 18 are used by, and how these infrastructural components are configured for, the deployed application during operation within the virtualization environment 36. For example, in certain embodiments, the processor 26 may execute the application analyzer 50 to apply the analysis rules 52 to the IAC files 42 and/or source code 40 of the application to determine the infrastructural components and operational parameters 76 of the deployed application. In other embodiments, in block 74, the processor 26 may receive inputs from a user manually defining the infrastructural components and operational parameters of the deployed application.


For the embodiment illustrated in FIG. 2, the actions of block 72 of the process 70 include the processor 26 determining (block 78) measured or calculated GHG emissions 80 of the deployed application based on the infrastructural components and operational parameters 76. In certain embodiments, the processor 26 may receive inputs from a user manually defining the measured or calculated GHG emissions 80 of the deployed application. In some embodiments, at least a portion of the information used to calculate the GHG emissions 80 of the deployed application may be automatically retrieved by the processor 26. For example, in some embodiments, the processor 26 may receive or retrieve energy utilization data as tracked by the remote or on-premises data center 18 hosting the virtualization environment 36 using application and hardware metrics, and/or receive or retrieve data regarding the corresponding GHG emissions per unit of power consumed (e.g., per kilowatt hour (kWh)) based on the sources of energy (e.g., solar/wind, natural gas, diesel, coal) used by a utility to generate the energy consumed by the data center 18. In some embodiments, the processor 26 may use this data to automatically calculate the GHG emissions 80 of the deployed application, or at least a first approximation that may be subsequently modified based on user input.


For the embodiment illustrated in FIG. 2, the actions of block 72 of the process 70 include the processor 26 constructing (block 82) or generating an entry in the training dataset 46 for the deployed application. The entry includes the infrastructural components and the operational parameters 76 of the deployed application as inputs to be provided to the AI model during training, and includes the GHG emissions 80 of the deployed application as the desired output of the AI model in response to these inputs during training. Within block 72, the actions of blocks 74, 78, and 82 are repeated for each deployed application of the set, or a subset thereof, and the respective entries are combined to generate the training dataset 46. The training dataset 46 may be stored as a database table, spreadsheet, comma-separated value (CSV) file, or another suitable file or data format within the training data repository 44. In certain embodiments, the process 70 may be performed regularly, such as in response to updated GHG emission data becoming available for one or more deployed applications.


In an aspect, the GHG emission data may be specific to a virtual environment service provider. As such, when developing the training dataset, the processor 26 may receive infrastructural components and parameters associated with different virtual environment service providers and the corresponding GHG emission data for the infrastructural components and parameters associated with the different service providers. The processor 26 may generate associations among different infrastructural components and parameters across different service providers so as to be able to identify equivalent infrastructural components and parameters across different service providers.



FIG. 3 is a flow diagram of an embodiment of a process 90 whereby the processor 26 of the server 16 of the GHG emissions estimation system 10 trains the AI model using the training dataset 46 to “learn” or encode relationships between respective infrastructural components and operational parameters and respective greenhouse gas (GHG) emissions of the set of deployed applications 34. The process 90 may be stored as software instructions within the memory 28 and executed by the processor 26 of the server 16 of the GHG emissions estimation system 10. In other embodiments, the process 90 may include additional actions, fewer actions, repeated actions, actions taken in a different sequence, and so forth, in accordance with the present disclosure. The process 90 is discussed with reference to elements illustrated in FIGS. 1 and 2. The process 90 receives as input the training dataset 46 generated by the process 70 of FIG. 2.


For the embodiment illustrated in FIG. 3, the process 90 includes performing a number of actions with respect to each entry of the training dataset 46, as indicated by block 92. For the illustrated embodiment, the actions of block 92 include the processor 26 providing (block 94) the infrastructural components and operational parameters of the entry as inputs to the AI model. In response, the processor 26 receives (block 96) the estimated GHG emissions as output from the AI model. The actions of block 92 further include the processor 26 calculating (block 98) a difference between the estimated GHG emissions output by the AI model and measured or calculated GHG emissions of the entry.


For the embodiment illustrated in FIG. 3, the actions of block 92 of the process 90 include decision block 100, in which the processor 26 determines whether the calculated difference is greater than a predefined threshold value. For example, the predefined threshold value may be a maximum acceptable error threshold value that is defined in a configuration of the server 16 of the GHG emissions estimation system 10. When, in decision block 100, the processor 26 determines that the difference is greater than the predefined threshold value, the processor 26 responds by modifying (block 102) one or more parameters of the AI model, before returning to block 94. For example, modifying the parameters of the AI model may include adjusting one or more values (e.g., weights, coefficients) of the AI model, altering a structure of the AI model (e.g., adding/removing neural network layers, adding/removing neural network nodes), or any combination thereof. As such, for each entry of the training dataset, or a subset thereof, the processor 26 repeats the actions of blocks 94, 96, and 98 until the difference between the estimated GHG emissions output by the AI model and measured or calculated GHG emissions of the entry is less than or equal to the predefined threshold value, at which time the processor 26 proceeds (block 104) to the next entry of the training dataset 46. Once all of the or a sufficient number of entries have been processed in block 92, the parameters of the AI model are stored as the trained AI model 48, which is ready for predictive mode operation. In certain embodiments, the process 90 may be performed regularly, such as in response to the training dataset 46 being updated to include new GHG emission data for one or more of the deployed applications 34.



FIG. 4 is a flow diagram of an embodiment of a process 110 whereby the processor 26 of the server 16 of the GHG emissions estimation system 10 uses the trained AI model 48 to estimate GHG emissions of an application that is deployed (or is intended to be deployed) in the virtualization environment 36. The process 110 may be stored as software instructions within the memory 28 and executed by the processor 26 of the server 16 of the GHG emissions estimation system 10. In other embodiments, the process 110 may include additional actions, fewer actions, repeated actions, actions performed in a different sequence, and so forth, in accordance with the present disclosure. The process 110 is discussed with reference to elements illustrated in FIGS. 1-3. In certain embodiments, the process 110 may be executed in response to the request 58 from the client device 14 to estimate the GHG emissions of an application, as illustrated in FIG. 1, wherein the application to be evaluated is indicated, or the IAC files and/or source code of the application is included as part of the request 58.


For the embodiment illustrated in FIG. 4, the process 110 receives, as input, one or more application files 112. For the illustrated embodiment, the application files 112 include the source code 40 of the application and the IAC files 42 of the application. In some embodiments, the application files 112 may only include the IAC files 42 of the application. The process 110 begins with the processor 26 analyzing (block 114) the application files 112 to determine the infrastructural components and operational parameters 116 corresponding to the application(s) of the application files 112. The infrastructural components and operational parameters 76 indicate which of the infrastructural components 37 of the data center 18 will be used, and how these infrastructural components will be configured and used, by the application(s) during operation within the virtualization environment 36. For example, in certain embodiments, the processor 26 may execute the application analyzer 50 to apply the analysis rules 52 to the application files 112 (e.g., IAC files 42, and/or source code 40) to determine the infrastructural components and operational parameters 116 of the application(s). The process 110 continues with the processor 26 providing (block 118) the infrastructural components and operational parameters 116 of the application(s) as input to the trained AI model 48 and, in response, receiving as output from the trained AI model 48 the estimated GHG emissions 120 of the application(s).


In certain embodiments, the process 110 includes the processor 26 analyzing (block 122) the infrastructural components and operational parameters 116 of the application(s), and the estimated GHG emissions 120 of the application(s) to determine one or more recommendations for lowering the estimated GHG emissions 120. For example, in certain embodiments, the processor 26 may evaluate the relative contributions of each of the infrastructural components and operational parameters 116 to the estimated GHG emissions 120 to generate an ordered list of the infrastructural components and operational parameters 116 organized from largest contributions to smallest contributions with respect to the estimated GHG emissions, which can be used by a software developer to identify areas for improvement. In an example embodiment, the processor 26 may determine the relative contribution of each of the infrastructural components and operational parameters 116 to the estimated GHG emissions 120 by iteratively removing (or replacing) one of the infrastructural components and operational parameters 116 and providing the modified set of infrastructural components and operational parameters as input to the trained AI model 48 to determine the corresponding estimated GHG emissions after each modification. In some embodiments, the processor 26 may apply one or more optimization rules that compare the infrastructural components and operational parameters 116 of the application(s) to known values, and when a match is identified, generate recommendations regarding the match. For example, when an optimization rule is triggered by the infrastructural components and operational parameters 116 of the application(s), the optimization rule may generate a recommendation indicating that the matched infrastructural component or operational parameter is known to be a large contributor to the GHG emissions of the application, and offer an alternative infrastructural component and/or operational parameter that is known to be a less significant contributor to the estimated GHG emissions of the application.


For the embodiment illustrated in FIG. 4, the process 110 concludes with the processor 26 generating the response 60 indicating the estimated GHG emissions 120 of the application(s) when deployed in the virtualization environment 36. For embodiments in which the process 110 includes block 122, the response 60 may include the results of the additional analyses and/or recommendations regarding the estimated GHG emissions of the application(s) when deployed in the virtualization environment. In some embodiments, the GHG emissions are indicated in terms of volume or mass of carbon dioxide (CO2) equivalent per time period (e.g., hour, day, week, quarter) that the application(s) is deployed and operating within the virtualization environment (e.g., kilograms of CO2 per year). In certain cases, an application may be designed to scale within the virtualization environment in response to increased production traffic volume. For example, a scalable application may be hosted by a single virtual machine of the virtualization environment to support between 0 and 1000 active users, and may scale to additional virtual machines in response to additional active users. Since the infrastructural components and operational parameters of the application indicate the hardware (e.g., processors, memory, networking) components of the virtual machine, the scaling parameters (e.g., number of virtual machines per application instance, load balancers or other components to enable scaling), and the scaling conditions (e.g., a threshold number of active users per application instance), the GHG emissions estimation system 10 may determine the GHG emissions in terms of volume or mass of carbon dioxide (CO2) equivalent per time period that the application is deployed within the virtualization environment and per a predefined number of active users (e.g., kilograms of CO2 per year per thousand active users).


It may be appreciated that the process 110 of FIG. 4 may be performed at various times for various purposes. For example, in an embodiment, the process 110 may be executed as part of an application deployment process (e.g., a deployment pipeline of the software deployment platform 32) when a developed application is being prepared for deployment within the virtualization environment 36. In this manner, the GHG emissions of the application can be estimated prior to deployment. Additionally, the GHG emissions estimation system 10 may provide indications of infrastructural components and/or operational parameters of the application(s) that are the largest contributors to the estimated GHG emissions or recommendations for lowering the estimated GHG emissions, enabling the IAC files and/or source code of the application(s) to be modified to reduce the estimated GHG emissions prior to deployment.


In another example use case, the process 110 may be executed as part of an application development or design process. For example, in an early stage of an application design process, the application designers may be tasked with determining whether a first solution or a second solution should be developed based on a number of different cost/benefit factors including, for example, development cost, development time, and estimated GHG emissions. For this example, the IAC files of the first solution and the IAC files of the second solution may be defined without fully developing the source code, and the IAC files of each option may be provided as input to the process 110 to estimate the GHG emissions of each option when hypothetically deployed within the virtualization environment 36. As such, in certain embodiments, GHG emissions estimation system 10 may receive at least one request to estimate the GHG emissions for two applications being developed or deployed, and, in addition to providing the estimated GHG emissions of each option, may respond with a recommendation as to which of the two applications is more desirable to develop or deploy based on the GHG emissions. As such, the GHG emissions estimation system 10 enables the GHG emissions of different solutions to be estimated and considered for design and development decisions at an early phase of application design and development process. In an aspect, because the AI model may be trained on GHG emissions data from different virtual environment service providers, the GHG emissions estimates system 10 may provide not only recommendations on which of two applications is more desirable, but also provide recommendations on equivalent applications from other service providers that may achieve a similar performance while yielding more desirable GHG emissions.


It may be appreciated that the GHG emissions estimation system is not limited to the embodiments described above. For example, in certain embodiments, the AI model may instead be trained to directly learn or encode relationships between the IAC files of applications deployed within the virtualization environment and their corresponding measured or calculated GHG emissions. For such embodiments, the training dataset may include entries having the IAC files of a deployed application to be provided as input to the AI model during training, as well as the measured or calculated GHG emissions of the deployed application that is the corresponding desired output of the AI model during training. The trained AI model can then be provided with the IAC files of an application being developed or deployed as input, and in response, provide the estimated GHG emissions of the application when deployed within the virtualization environment.


The technical effects of the present disclosure include a GHG emissions estimation system that is designed to estimate the GHG emissions associated with operating a software application that is deployed (or is intended to be deployed) within a virtualized environment. For example, a software developer may utilize the GHG emissions estimation system as part of a cost/benefit analysis of different software solutions at an early stage of application design and development. Since there may be a variety of possible infrastructural options for a given software application, the GHG emissions estimation system enables the developer to estimate the GHG emissions associated with each infrastructural option, which can be used when determining which solution should be developed and deployed.


While only certain features of the invention have been illustrated and described herein, many modifications and changes will occur to those skilled in the art. It is, therefore, to be understood that the appended claims are intended to cover all such modifications and changes as fall within the true spirit of the invention.


The techniques presented and claimed herein are referenced and applied to material objects and concrete examples of a practical nature that demonstrably improve the present technical field and, as such, are not abstract, intangible or purely theoretical. Further, if any claims appended to the end of this specification contain one or more elements designated as “means for (perform) ing (a function) . . . ” or “step for (perform) ing (a function) . . . ”, it is intended that such elements are to be interpreted under 35 U.S.C. 112 (f). However, for any claims containing elements designated in any other manner, it is intended that such elements are not to be interpreted under 35 U.S.C. 112 (f).

Claims
  • 1. A computing system, comprising: at least one memory configured to store an artificial intelligence (AI) model; andat least one processor configured to execute stored instructions to perform actions comprising: receiving a request to determine estimated greenhouse gas (GHG) emissions of an application when deployed in a virtualization environment, wherein the request indicates at least one application file of the application;analyzing the at least one application file of the application to determine infrastructural components and operational parameters of the application;generating, using the AI model, the estimated GHG emissions of the application when deployed in the virtualization environment, based on the infrastructural components and operational parameters of the application; andproviding a response to the request that includes the estimated GHG emissions of the application when deployed in the virtualization environment.
  • 2. The computing system of claim 1, wherein the at least one application file comprises an infrastructure-as-code (IAC) file of the application.
  • 3. The computing system of claim 1, wherein the at least one application file comprises a source code file of the application.
  • 4. The computing system of claim 1, wherein the at least one memory is configured to store an application analyzer having a set of analysis rules, and wherein, to analyze the at least one application file of the application, the at least one processor is configured to execute stored instructions to perform actions comprising: applying each analysis rule of the set of analysis rules to the at least one application file, wherein, in response to a portion of the at least one application file matching a condition of at least one analysis rule, at least one of the infrastructural components and operational parameters of the application is determined.
  • 5. The computing system of claim 1, wherein the request is received from a deployment pipeline of a software deployment platform as part of a deployment process of the application.
  • 6. The computing system of claim 1, wherein the AI model is a decision tree model, a random forest (RF) model, a support vector machine (SVM) model, a linear regression model, an artificial neural network (ANN) model, a convolutional neural network (CNN) model, a recurrent neural network (RNN) model.
  • 7. The computing system of claim 1, wherein the at least one processor is configured to execute stored instructions to perform actions comprising: before providing the response, analyzing the infrastructural components and operational parameters of the application and the estimated GHG emissions of the application to determine one or more recommendations to lower the estimated GHG emissions of the application, wherein the one or more recommendations are included in the response.
  • 8. The computing system of claim 1, wherein the at least one processor is configured to execute stored instructions to perform actions comprising: receiving a second request to determine a second estimated GHG emissions of a second application when deployed in the virtualization environment, wherein the second request indicates at least one second application file of the second application;analyzing the at least one second application file of the second application to determine second infrastructural components and operational parameters of the second application;generating, using the AI model, the second estimated GHG emissions of the second application when deployed in the virtualization environment, based on the second infrastructural components and operational parameters of the second application;providing a second response to the second request that includes at least the second estimated GHG emissions of the second application when deployed in the virtualization environment; andcomparing the estimated GHG emissions of the application and the second estimated GHG emissions of the second application and providing a recommendation to develop or deploy the application or the second application based on the comparison.
  • 9. A computer-implemented method, comprising: receiving a request to determine estimated greenhouse gas (GHG) emissions of an application when deployed in a virtualization environment, wherein the request indicates at least one application file of the application;analyzing the at least one application file of the application to determine infrastructural components and operational parameters of the application;generating, using an artificial intelligence (AI) model, the estimated GHG emissions of the application when deployed in the virtualization environment, based on the infrastructural components and operational parameters of the application; andproviding a response to the request that includes at least the estimated GHG emissions of the application when deployed in the virtualization environment.
  • 10. The method of claim 9, wherein the at least one application file comprises an infrastructure-as-code (IAC) file of the application, a source code file of the application, or a combination thereof.
  • 11. The method of claim 9, wherein, analyzing the at least one application file of the application comprises: applying each analysis rule of a set of analysis rules to the at least one application file, wherein, in response to a portion of the at least one application file matching a condition of at least one analysis rule, at least one of the infrastructural components and operational parameters of the application is determined.
  • 12. The method of claim 9, wherein the request is received from a deployment pipeline of a software deployment platform as part of a deployment process of the application.
  • 13. The method of claim 9, wherein the AI model is a decision tree model, a random forest (RF) model, a support vector machine (SVM) model, a linear regression model, an artificial neural network (ANN) model, a convolutional neural network (CNN) model, a recurrent neural network (RNN) model.
  • 14. The method of claim 9, comprising: before providing the response, analyzing the infrastructural components and operational parameters of the application and the estimated GHG emissions of the application to determine one or more recommendations to lower the estimated GHG emissions of the application, wherein the one or more recommendations are included in the response.
  • 15. The method of claim 9, comprising: receiving a second request to determine a second estimated GHG emissions of a second application when deployed in the virtualization environment, wherein the second request indicates at least one second application file of the second application;analyzing the at least one second application file of the second application to determine second infrastructural components and operational parameters of the second application;generating, using the AI model, the second estimated GHG emissions of the second application when deployed in the virtualization environment, based on the second infrastructural components and operational parameters of the second application;providing a second response to the second request that includes at least the second estimated GHG emissions of the second application when deployed in the virtualization environment; andcomparing the estimated GHG emissions of the application and the second estimated GHG emissions of the second application and providing a recommendation to develop or deploy the application or the second application based on the comparison.
  • 16. A non-transitory, computer-readable medium storing instructions executable by a processor of a computing system, the instructions comprising instructions to: receive a request to determine estimated greenhouse gas (GHG) emissions of an application when deployed in a virtualization environment, wherein the request indicates at least one application file of the application;analyze the at least one application file of the application to determine infrastructural components and operational parameters of the application;generate, using an artificial intelligence (AI) model, the estimated GHG emissions of the application when deployed in the virtualization environment, based on the infrastructural components and operational parameters of the application; andprovide a response to the request that includes at least the estimated GHG emissions of the application when deployed in the virtualization environment.
  • 17. A computing system, comprising: at least one memory configured to store an artificial intelligence (AI) model; andat least one processor configured to execute stored instructions to perform actions comprising: training the AI model to encode relationships between respective infrastructural components and operational parameters of a set of deployed applications and respective greenhouse gas (GHG) emissions of the set of deployed applications, yielding a trained AI model; andgenerating, using the trained AI model, estimated GHG emissions of an application when deployed in a virtualization environment, based on infrastructural components and operational parameters of the application.
  • 18. The computing system of claim 17, wherein the at least one memory is configured to store a training dataset having a plurality of entries, each indicating the respective infrastructural components and operational parameters of a deployed application to be provided as input to the AI model during training, and each indicating the respective GHG emissions of the deployed application that is desired as output from the AI model in response to the input during training.
  • 19. The computing system of claim 18, wherein, to train the AI model, the at least one processor is configured to execute stored instructions to perform actions comprising: for each entry of the plurality of entries: (A) providing, as input to the AI model, the respective infrastructural components and operational parameters indicated for the deployed application by the entry, and in response receiving, as output, estimated GHG emissions of the deployed application,(B) calculating a difference between the estimated GHG emissions of the deployed application and the respective GHG emissions indicated for the deployed application by the entry, and(C) in response to determining that the difference is greater than a predefined threshold value, modifying one or more parameters of the AI model and returning to step (A); andstoring the one or more parameters as the trained AI model.
  • 20. The computing system of claim 18, wherein, to generate the training dataset, the at least one processor is configured to execute stored instructions to perform actions comprising: for each deployed application of a set of deployed applications: determining the respective infrastructural components and operational parameters of the deployed application;determining the respective GHG emissions of the deployed application; andgenerating an entry of the training dataset that includes the respective infrastructural components and operational parameters of the deployed application and includes the respective GHG emissions of the deployed application.