Deployment and lifecycle management of complex computing systems is challenging, and should be run as efficiently as possible to conserve time and financial resources, while preserving portability and reducing operator error.
Various embodiments of the invention are disclosed in the following detailed description and the accompanying drawings.
The invention can be implemented in numerous ways, including as a process; an apparatus; a system; a composition of matter; a computer program product embodied on a computer readable storage medium; and/or a processor, such as a processor configured to execute instructions stored on and/or provided by a memory coupled to the processor. In this specification, these implementations, or any other form that the invention may take, may be referred to as techniques. In general, the order of the steps of disclosed processes may be altered within the scope of the invention. Unless stated otherwise, a component such as a processor or a memory described as being configured to perform a task may be implemented as a general component that is temporarily configured to perform the task at a given time or a specific component that is manufactured to perform the task. As used herein, the term ‘processor’ refers to one or more devices, circuits, and/or processing cores configured to process data, such as computer program instructions.
A detailed description of one or more embodiments of the invention is provided below along with accompanying figures that illustrate the principles of the invention. The invention is described in connection with such embodiments, but the invention is not limited to any embodiment. The scope of the invention is limited only by the claims and the invention encompasses numerous alternatives, modifications and equivalents. Numerous specific details are set forth in the following description in order to provide a thorough understanding of the invention. These details are provided for the purpose of example and the invention may be practiced according to the claims without some or all of these specific details. For the purpose of clarity, technical material that is known in the technical fields related to the invention has not been described in detail so that the invention is not unnecessarily obscured.
An integrated system for simple, flexible packaging, distribution and reliable, consistent execution of complex applications, which preserves portability across multiple computing environments, is disclosed. A complex application may comprise multiple executable components and may require auxiliary workflows for effective use and management of the application. A “distributed system” as referred to herein includes any system that has an application and/or microservice running on a plurality of computing entities, containers, nodes, virtual machines, and/or physical machines. The term “distributed application” is used herein to describe a class of complex applications to run on a distributed system, for example a containerized application running on a plurality of containers. A “container” and/or “container instance” is referred to herein as a program that runs in an isolated environment from other programs, for example a Docker container and/or LxC Linux container, which provides portability for a simple application. An “orchestration framework” is referred to herein as a framework for orchestration, scaling, and/or scheduling of containers, for example Kubernetes, Mesos, and/or Docker Swarm. A “workflow” is referred to herein as a high-level program for orchestrating and performing actions related to the application.
Using workflows specified in a portable manner using YAML and/or tightly integrating a workflow engine with an application engine, artifact management, and/or a configuration database is disclosed. By improving the portability of a distributed application and associated lifecycle management workflows, flexibility and system operating efficiency will be improved, operator error will be reduced, and resource costs for storage/computing cycles/network bandwidth will be reduced.
Computer system 100, which includes various subsystems as described below, includes at least one microprocessor subsystem, also referred to as a processor or a central processing unit (“CPU”) 102. For example, processor 102 can be implemented by a single-chip processor or by multiple cores and/or processors. In some embodiments, processor 102 is a general purpose digital processor that controls the operation of the computer system 100. Using instructions retrieved from memory 110, the processor 102 controls the reception and manipulation of input data, and the output and display of data on output devices, for example display and graphics processing unit (GPU) 118. In one embodiment, server systems may not generally have a physical display, keyboard, and/or mouse attached physically.
Processor 102 is coupled bi-directionally with memory 110, which can include a first primary storage, typically a random-access memory (“RAM”), and a second primary storage area, typically a read-only memory (“ROM”). As is well known in the art, primary storage can be used as a general storage area and as scratch-pad memory, and can also be used to store input data and processed data. Primary storage can also store programming instructions and data, in the form of data objects and text objects, in addition to other data and instructions for processes operating on processor 102. Also as well known in the art, primary storage typically includes basic operating instructions, program code, data and objects used by the processor 102 to perform its functions, for example programmed instructions. For example, primary storage devices 110 can include any suitable computer-readable storage media, described below, depending on whether, for example, data access needs to be bi-directional or uni-directional. For example, processor 102 can also directly and very rapidly retrieve and store frequently needed data in a cache memory, not shown. The processor 102 may also include a coprocessor (not shown) as a supplemental processing component to aid the processor and/or memory 110.
A removable mass storage device 112 provides additional data storage capacity for the computer system 100, and is coupled either bi-directionally (read/write) or uni-directionally (read only) to processor 102. For example, storage 112 can also include computer-readable media such as flash memory, portable mass storage devices, holographic storage devices, magnetic devices, magneto-optical devices, optical devices, and other storage devices. A fixed mass storage 120 can also, for example, provide additional data storage capacity. One example of mass storage 120 is an eMMC or microSD device. In one embodiment, mass storage 120 is a solid-state drive connected by a bus 114. Mass storage 112, 120 generally store additional programming instructions, data, and the like that typically are not in active use by the processor 102. It will be appreciated that the information retained within mass storage 112, 120 can be incorporated, if needed, in standard fashion as part of primary storage 110, for example RAM, as virtual memory.
In addition to providing processor 102 access to storage subsystems, bus 114 can be used to provide access to other subsystems and devices as well. As shown, these can include a display monitor 118, a communication interface 116, a touch (or physical) keyboard 104, and one or more auxiliary input/output devices 106 including an audio interface, a sound card, microphone, audio port, audio recording device, audio card, speakers, a touch (or pointing) device, and/or other subsystems as needed. Besides a touch screen and/or capacitive touch interface, the auxiliary device 106 can be a mouse, stylus, track ball, or tablet, and is useful for interacting with a graphical user interface.
The communication interface 116 allows processor 102 to be coupled to another computer, computer network, or telecommunications network using a network connection as shown. For example, through the communication interface 116, the processor 102 can receive information, for example data objects or program instructions, from another network, or output information to another network in the course of performing method/process steps. Information, often represented as a sequence of instructions to be executed on a processor, can be received from and outputted to another network. An interface card or similar device and appropriate software implemented by, for example executed/performed on, processor 102 can be used to connect the computer system 100 to an external network and transfer data according to standard protocols. For example, various process embodiments disclosed herein can be executed on processor 102, or can be performed across a network such as the Internet, intranet networks, or local area networks, in conjunction with a remote processor that shares a portion of the processing. Throughout this specification “network” refers to any interconnection between computer components including the Internet, Bluetooth, WiFi, 3G, 4G, 4GLTE, GSM, Ethernet, TCP/IP, intranet, local-area network (“LAN”), home-area network (“HAN”), serial connection, parallel connection, wide-area network (“WAN”), Fibre Channel, PCI/PCI-X, AGP, VLbus, PCI Express, Expresscard, Infiniband, ACCESS.bus, Wireless LAN, HomePNA, Optical Fibre, G.hn, infrared network, satellite network, microwave network, cellular network, virtual private network (“VPN”), Universal Serial Bus (“USB”), FireWire, Serial ATA, 1-Wire, UNI/O, or any form of connecting homogenous, heterogeneous systems and/or groups of systems together. Additional mass storage devices, not shown, can also be connected to processor 102 through communication interface 116.
An auxiliary I/O device interface, not shown, can be used in conjunction with computer system 100. The auxiliary I/O device interface can include general and customized interfaces that allow the processor 102 to send and, more typically, receive data from other devices such as microphones, touch-sensitive displays, transducer card readers, tape readers, voice or handwriting recognizers, biometrics readers, cameras, portable mass storage devices, and other computers.
In addition, various embodiments disclosed herein further relate to computer storage products with a computer readable medium that includes program code for performing various computer-implemented operations. The computer-readable medium is any data storage device that can store data which can thereafter be read by a computer system. Examples of computer-readable media include, but are not limited to, all the media mentioned above: flash media such as NAND flash, eMMC, SD, compact flash; magnetic media such as hard disks, floppy disks, and magnetic tape; optical media such as CD-ROM disks; magneto-optical media such as optical disks; and specially configured hardware devices such as application-specific integrated circuits (“ASIC”s), programmable logic devices (“PLD”s), and ROM and RAM devices. Examples of program code include both machine code, as produced, for example, by a compiler, or files containing higher level code, for example a script, that can be executed using an interpreter.
The computer/server system shown in
Workflow definitions associated with a set of workflows may be used, for example to implement a state machine of an application, as described in U.S. patent application Ser. No. 15/695,322, entitled Execution Of Workflows In Distributed Systems, filed Sep. 5, 2017 which is incorporated herein by reference for all purposes. In one embodiment, a workflow designer may be constrained to express their designed workflow in a stylized and/or restrictive fashion, such as a declarative definition, rather than an imperative definition to foster efficiencies as described in U.S. patent application Ser. No. 15/695,322. In one embodiment, workflows are defined in YAML as a DSL (domain specific language). The workflow definitions are used in part to recognize if a common step exists between two or more workflows.
In one embodiment, a declarative workflow definition is mapped to a state machine representing the entire set of workflows. Steps in each workflow may be sequential and/or parallel. Data may be generated by a given step and consumed by a subsequent step, and this data is referred to herein as an “artifact”. Artifacts may be consumed by other steps in a workflow and/or stored persistently for future use. In one embodiment, each artifact and each step in a workflow may be identified by a content-based fingerprint, for example a hash. Data with the same content will have the same hash. Steps that perform the same operation may also have the same hash. For efficiency, the results at each point in a given workflow may be identified by a “cache hash”, referred to herein as a function of the hash of the input data and the hashes of the steps that have been performed up to that point. Note that the hashes of intermediate artifacts may not be needed to generate the cache hash.
The disclosed techniques are applicable to any workflow environment, for example that of data streams or logical execution plans in data lineage such as that in APACHE™ Spark, and/or workflows in distributed systems such as that in JENKINS™ Pipeline, APACHE™ Airflow, and/or ad hoc scripts. Without limitation an example for execution of workflows in distributed systems is described in detail. A distributed system comprises components that may independently fail and recover and where the components communicate with each other over communication channels that may not be completely reliable.
The modern use of containers, with operating-system virtualization, has made efficient deploying microservices, scalable distributed applications comprising multiple tiers of components, for example a simple three-tier web application may include three tiers of components: a front-end code, a business logic, and a back-end database. With distributed systems, software runs on multiple servers in different container/virtual machine/physical machine locations. Using declarative workflow definitions gives an inherent flexibility in how to accomplish a given step, and generally forces a designer to describe the step in a repeatable way.
By contrast to traditionally using Docker Compose to set up an application for a single host or ksonnet to setup an application on Kubernetes, a workflow approach encompasses a “lifecycle management” approach, wherein lifecycle management refers herein to any and/or all operations done to an application in order to maintain an application including:
Portability for a distributed application across execution environments is enabled by this simple lifecycle management for distributed applications across a plurality of containers. An application management service (212), for example implemented at least in part using Kubernetes CustomResourceDefinition (CRD), is used to store workflow definitions and use them to implement a state machine reflective of the distributed application.
An artifact is a collection of files/directories that may be used as input or output to a step in a workflow, wherein a “file” is a file system file and a “directory” is a file system directory. Artifacts may be “internal” and/or “intermediate”, in which case they may be referenced only within the execution of a particular workflow. Internal artifacts may be automatically garbage collected, for example garbage collected after 7 days. Artifacts may be exported within the system of
A “fixture” as referred to herein is a service needed to run a workflow. One example of a fixture is a MySQL service needed to run a test. A dynamic fixture may be created automatically by the system of
A persistent “volume” as referred to herein may be used to store data used by workflows. In one embodiment, anonymous volumes are volumes that exist only within the context of a specific executing workflow, as opposed to named volumes being volumes that exist independently of workflows. Anonymous volumes are generally created to provide temporary storage space for workflows. The content of anonymous volumes are typically not important to the execution of workflows. Named volumes may hold specific data for use by workflows. In some cases, the data on a named volume may remain the same or change between runs of a workflow.
A “secret” as referred to herein is an encrypted string that may be included in templates to avoid exposing secret in source repositories. The secret is then decrypted just prior to when it is used during the execution of a workflow. A set of one or more “configuration parameters” are stored in a database (224) and may be referenced by name from within templates. These configuration parameters avoid having to hardwire parameters in source repositories. As referred to herein, a “policy” template is used to specify automatically triggered workflows. Common “triggers” as referred to herein may include time based triggers, for example triggers based on cron, and triggers based on commits, pull requests, and/or merges to repositories.
For the system of
The system of
A configuration management database (224) (CMDB) stores the configuration and operational state of all system component and services including active workflows, fixtures, and so on. An operations API server (226) serves an API to handle YAML verification of workflow definitions, manages state needed by a UI for application management, handles events from the workflow executor (228), and provides miscellaneous and utility logic. In one embodiment, the CMDB (224) includes a configuration, operational, historical, and stats database.
The workflow executor (228) runs a workflow, manages workflow state transitions, and launches new steps. The workflow executor (228) may be part of a workflow execution platform. In one embodiment, one workflow executor (228) service is used per workflow and deployment, and the executor (228) exits after deployment starts. With the implemented state machine, a workflow executor (228) may restarts itself via the container orchestration service (210) and restart failed steps if interrupted, for example if the associated node restarts or reboots. The workflow executor (228) may schedule dynamic fixtures, reserve static fixtures, and/or schedules deployments.
An artifact manager (230) supports querying and searching for artifacts. The artifact manager (230) provides lifecycle management of artifacts, for example the abovementioned retention policies over 7-days, 6-months, and/or manual deletion only. The artifact manager (230) may query and/or find internal/external artifacts. The artifact manager (230) may serves and/or manage a table of artifacts. The artifact manager (230) may collects and catalog output artifacts.
A fixture manager (232) manages a table of static fixtures and persistent volumes. In one embodiment, the fixture manager (232) manages reservations for fixtures and volumes, for example some fixtures may be shared, while others cannot. The fixture manager (232) may allow the creation of fixtures/volumes.
A repository manager (234) or repo manager is a gateway to a repo such as git. It interfaces using an approval API, and may catch git state. The repository manager (234) may provide a query API for git state for example with commits, branches, commit data, and so on. A “branch” is referred to herein as a branch in a repo such as master or default, wherein each branch represents an alternate timeline of changes that may diverge and/or merge with other branches over time. An event trigger (236) triggers policies, triggers events from webhooks, polls repositories, updates the CMDB (224), and/or submits jobs via the operations API server (226).
An event platform (238) such as Kafka is used to collects events, provide statistics, provide a notification center, and manage workflow executor (228) events. In one embodiment, a statistics database (240) is separate from the CMDB (224) and talks with container advisors such as cAdvisor, and/or sends data to the events platform (238) like Kafka. In one embodiment, Prometheus is used as a statistics database to monitor volume stats, monitor file system and block device stats such as capacity and performance, monitor container orchestration data, for example Kubernetes data from cAdvisor, and monitor internal stats.
A notification center (242) processes events from the events platform (238) like Kafka, logs events to the CMDB (224), distribute events according to system/user preferences including UI and/or email notification. In one embodiment, an in-memory data structure store/database like Redis is used for caching and for workflow executor (228) notification.
A platform API server (244) serves as an interface to the container orchestration service (210) like Kubernetes. The platform API server (244) abstracts infrastructure services like Kubernetes, AWS (Amazon Web Services), GCP (Google Cloud Platform), and Microsoft Azure. The platform API server (244) may create and runs orchestration/Kubernetes specs, monitor the status of jobs and/or deployment, create volumes, modify volumes, and/or delete volumes.
A master manager (246) monitors the health of the container orchestration/Kubernetes master, terminates an unhealthy master, and starts a new orchestration/Kubernetes master in the event of a terminated master. Similarly, a node manager (248) monitors health of all nodes and/or minions. The node manager (248) may terminate/restart unhealthy nodes and/or minions. The node manager (248) may also monitor “spot” pricing of cloud instances, submit bids, and/or switch to “on-demand” instances when prices are high. The node manager (248) may also switch back to spot instances when prices drop. The node manager (248) may regularly monitor and report spot prices for each instance.
A pod executor (250) manages container orchestration units, for example Kubernetes pods. The pod executor (250) may initialize a container, in part by unpacking one or more artifacts, setting up environment variables, and/or replacing a user entry point with a wrapper entry point, which may perform additional setup, cleanup or coordination operations. The pod executor (250) may manager a user container wrapper, in part by setting up environment variables, invoking a user entry point, tarring output artifacts for saving, recording exit status, and informing a wait container that a step is complete. A pod executor may also include a wait container to collect live logs from the container orchestration service (210) like Kubernetes, upload logs and/or artifacts, and report status to a workflow executor (228).
The system includes a source repo (402), a binary repo (404), and an execution platform (406). Typically, a source code management system such as git may be used for the source repo (402) and an artifact management system such as Nexus as the binary repo (404). The execution platform (406) combines an engine for running workflows (228), an engine for running application objects (416), and a configuration database (224) to manage the application metadata and other state associated with the application.
The source repo (402) stores source code (412) for building binary objects such as executables, images, or other types of derived objects, some of which may be textual in nature such as “Makefile” and “myapp.c”. In some cases, the source repo (402) may also store non-source objects such as images like “myapp.png” or executables.
The binary repo (404) stores binary objects including derived objects. A derived object is an object derived from another object, often source code but sometimes other derived objects, by the application of an algorithm, computer program or other process. A derived object may itself be source code, often machine generated. An example of a derived object is “myappbuild:v5” in binary repo (404). The source repo (402) and binary repo (404) may be implemented using existing standard packaging technologies and distribution channels.
The execution platform (406) is a machine for running workflows and executable objects stored in the binary repo (404) based on application metadata (414) and other application state. The execution platform includes a workflow engine (228), an application engine (416), and a configuration database (224).
The execution platform (406) abstracts infrastructure and environmental dependencies, allowing the creation of highly abstract and portable workflows. In the example system, the execution platform (406) distinguishes between two types of executable objects: workflows and applications. A workflow is a source object (414) based on a high-level programming language such as YAML that may be run by the execution platform (228).
An application is generally a non-source object that may be run by the execution platform (406). Examples of non-source objects include executables compiled from source code. Workflows may be run or triggered in many ways including manual user action, programmatically by other workflows or applications, based on time for example with a chron job, or events both internally or externally generated.
In one embodiment, the source repo contains three main types of objects: application metadata (418), workflows (414) and source code (412). In the example system, the application metadata (418) references workflows, the workflows (414) contain directions for building the application from source code (412), storing the resulting executable objects in the binary repo (404), and deploying the application.
These actions may be performed by the execution platform (406) based on the application metadata (418), workflows (414) and source code (412) stored in the source repo (402). Additional instructions and configuration information may also be stored in a separate configuration database. The application itself may be a distributed application consisting of multiple executable objects that coordinate and communicate with each other.
The application metadata (418) defines a set of standard actions or object-oriented interface for interacting with the application. The set of standard actions may be directly presented to the user as an easily accessible menu of available actions or be used by other programs to automate use and management of the application. These standard actions may be mapped to user defined workflows.
Some actions may be specific to a particular application while others may be common across a class of similar applications. The latter may define an interface that other programs can use to programmatically access the application. One of the strengths of workflows is that they provide a high-level language that is easier to use than lower-level languages and provide flexible, powerful coordination of the tasks for managing complex applications.
The creation of an integrated system that combines packaging and distribution of application metadata, workflows, source code, and binary objects with an execution platform that may execute the workflows as well as run the applications allows the creation of portable complex applications that may be run and managed on any instance of such an integrated system. By contrast, traditionally this was not possible due to a wide range of issues including reliance on ad-hoc installation and management scripts that must be customized for each environment and was not portable across execution environments.
Virtualization, for example containerization, provides good encapsulation and portability of simple stand-alone application that may run in a single server. Complex applications such as distributed applications and applications that require access to external services may be difficult to configure and maintain. Traditionally scripting is used to help automate this process, but scripts must often deal with complex infrastructure and environmental variations and do not properly support the full set of required life-cycle management operations such as monitor, upgrades, loading data, deprovisioning and so on in a portable, consistent, and reliable manner. Complex applications thus are improved by using workflows for installation, configuration and continued management and/or by combining the workflow execution engine with the application execution engine as well as other system components already described for running the application and associated workflows.
In one embodiment, an execution platform (406) provides abstraction of the underlying infrastructure and integration of workflows (414) with the execution platform (406) enables specification of infrastructure independent workflows that may be packaged with the execution objects and application metadata. Together, they enable the creation, distribution and execution of portable complex applications. The packaged application may then be run consistently and reliably on the execution platform (406) specifically designed for this purpose across multiple execution environments.
A detailed description of how workflows are specified comprises in one embodiment describing the YAML format that serves as the DSL for workflows and policies.
Overview.
In one embodiment, workflows and policies are specified as YAML documents and checked into source control, for example git. Workflows and policies behave like code and may be treated as code. The workflows may call each other like functions. They may change and drift apart in different branches, and be made consistent again when merged back to the master branch. In one embodiment, a syntax and consistency checker is provided to validate the YAML code.
A simple example demonstrates some of the capabilities of the service.
Service that Runs a Container.
The following is a simple service that checks out source code. It uses the axscm:v1 docker image in the private get.applatix.io repository, and runs the axscm command to checkout the code to the /src directory inside the container. The repo and commit parameters are provided by the GUI session, and the code in /src is exposed as an artifact called code.
The next example describes a service that may perform a build. This service may need an input artifact from another step in the parent workflow. This artifact may be unpacked to the path /src inside the container before it begins execution. Finally, a supplied CMD may be executed to perform the build.
A Simple Workflow.
The following example is a workflow that performs a checkout followed by a build of the checked out source code. CMD is a build command that may be run to perform the build. The build step's code artifact is passed in as the %%code_artifact%% so that the build step can access the artifacts generated by the checkout step.
A Simple Deployment.
The following example is a deployment for Apache Web Server.
A Simple Policy.
The following example is a policy that triggers golang check and build on several events.
A Simple Project.
The following example is a project for golang
Service Template
A service template may have the following sections.
Common Name and Type.
Inputs
There may be two categories of inputs, including the parameters and the artifacts.
Outputs.
In one embodiment, artifact outputs are supported.
Container.
This section describes a container, for example a Docker container.
Steps
This section describes the steps in a workflow. In one embodiment, steps are mutually exclusive to the container section. A workflow only uses other containers and workflows, and may not define a container itself.
A workflow may contain any number of steps in an array. The steps may be executed in sequence, and if one step fails, the workflow may be considered failed and abort the steps following.
Each step may also contain multiple tasks, which may be run in parallel. If any one of them fails, the others in the same step may still execute to completion, but the overall step may be considered failed.
The steps section contains
Exception Handling.
In many cases, there is a need to specially handle errors and exceptions that occur during workflows. Two special flags are provided for this purpose: ignore_error: Ignore any errors/failures for the specified step. The step always succeeds. always_run: Always run this step before exiting the current sequence of steps.
Below, the cleanup step may always be run even if the job fails or is canceled during the setup or test phase. Furthermore, any errors during the cleanup step will be ignored and cannot fail the workflow.
Fixtures.
This section defines the fixtures that a service can launch or reserve. There are two types of fixtures: static and dynamic.
Below is an example of a test using a mongodb dynamic fixture. For simplicity the container definition of the mongodb-fixture or mogodb-loadgen service templates is not included.
Below is an example of a test requesting a static fixture, specifically requesting a Linux host running Ubuntu 16.04. The Linux category, and attributes os_vendor, os_version, hostname are all user defined attributes that have been input into the system via a GUI.
Parameter Special Values.
The parameters, for example for static fixtures, may have the following special values:
In one embodiment, a deployment template is the orchestration spec for a long running service. It includes the additional sections to provide the information about the route, scale, etc. In one embodiment, all the names may be compiled with the following pattern: {circumflex over ( )}([a-z0-9]([-a-z0-9]*[a-z0-9])?)$
Common Name and Type
Application.
Application is the group concept of deployments. With the same application, deployments may use an internal route to communicate with each other.
Deployment.
A unique deployment name is usually required.
External_Routes.
An external route is the route for deployment to be exposed to the Internet and/or other network.
Internal_Routes.
An internal route is the route for deployment to be exposed within the cluster.
Scale.
A scale section describes the number of replicas/instances for the deployment. A load balancer may be created automatically to load balance among instances when route is configured.
Containers.
This section describes containers to be included in the deployment. The template may be a container type service template. The container script may ensure it is long running, otherwise the container may be rescheduled every time it finishes. One or more containers in a single deployment instance may be supported.
Termination_Policy.
Both time and spending limit may be specified for the deployment by filling up the termination policy section.
A policy has the following sections.
Follow the steps to enable a policy in a branch:
Common Name and Type
Parameters
Parameters with “%%session.commit%%” or “%%session.repo%%” as default value may be fulfilled automatically, and do not need to necessarily be specified
Others may be specified in this section to fulfill the template parameters section
Notifications
Notification is made of two parts:
Multiple notification may be specified
When
when has different event types, and for many events, target branches are required and in regular expression format:
A project has the following sections.
Common Name and Type
Actions.
One or more entry points to a project. Each entry point may be a reference to an existing template. An action contains
Assets.
Project assets may be used by an app store and include icon and detail description of the project.
Categories.
List of categories under which the project may appear on an app store.
Publish.
Zero or more criteria for publishing a project to an app store. Only published projects are visible on the app store.
In step 502, an execution platform (406) is provided comprising a workflow engine (228) and an application engine (416). In one embodiment, the application is a containerized application and the workflow definition is packaged with an application definition (418), (414), (412). In one embodiment, the application definition comprises an application metadata (418). In one embodiment, the application metadata comprises at least one of the following: an application workflow (414) and an application source code (412). In one embodiment, the workflow engine (228) is computing platform agnostic, for example associated with a container technology.
In one embodiment, the workflow definition is computing platform agnostic. For example, the workflow definition may be defined in a portable manner using YAML and/or another portable language. In one embodiment, the workflow engine (228) is a service or other extension of the execution platform (406). In one embodiment, the service (228) is a Kubernetes service associated at least in part with a CustomResourceDefinition (CRD). In one embodiment, the workflow engine (228) is platform agnostic by nature of Kubernetes or another deployment regime.
In one embodiment, the application is a complex, multi-component, distributed application. In one embodiment, workflows provide at least one of the following: automated building, automated deployment, automated management of components, automated adding of new services. In one embodiment, workflows provide an automated scale up of the complex, multi-component, distributed application. In one embodiment, the automated scale up of the complex, multi-component, distributed application is based at least in part on demand.
In step 504, a workflow definition associated with an application is received. In one embodiment, the workflow definition is expressed in a higher level language. In one embodiment, the higher level language is a markup language. In one embodiment, the markup language is YAML.
In step 506, the workflow definition is used to run an application workflow (414) to perform an action with respect to the application, at least in part by causing an executable binary (404) to be run by the application engine (416). In one embodiment, the action is at least one of the following: a build action, a test action, a run action, a load data action, an initialize data action, a retention policy action, a scale up action, a scale down action, an update action, a change database schema action, a backup action, a restore action, and a lifecycle management action. In one embodiment, the action comprises causing a binary to be generated. In one embodiment, the action also comprises storing the binary in a binary repository (404).
In one embodiment, the system processes an artifact, for example including using the workflow definition to capture the artifact of a specified operation. In one embodiment, processing the artifact includes using the workflow definition to store the artifact of a specified operation. In one embodiment, processing the artifact includes using a second workflow definition with the artifact to perform a second operation, for example a workflow definition A may capture/store an artifact of operation X, and later a workflow definition B may use said artifact to do Y.
In one embodiment, a CMDB (224) is used and/or interfaced with. In one embodiment, the CMDB stores configuration information for a workflow, for example the configuration information may be used to customize workflows for a particular execution environment. In one embodiment, the CMDB stores operational information about a workflow, for example the operational information may comprise at least one of the following: workflow state, status, historical, and monitor information. In one embodiment, the operational information is generated by a workflow. In one embodiment, the operational information is used to modify the execution of a workflow.
Although the foregoing embodiments have been described in some detail for purposes of clarity of understanding, the invention is not limited to the details provided. There are many alternative ways of implementing the invention. The disclosed embodiments are illustrative and not restrictive.
This application is a continuation of U.S. patent application Ser. No. 15/943,562, which is a continuation in part of U.S. patent application Ser. No. 15/695,322, entitled “Execution Of Workflows In Distributed Systems”, filed Sep. 5, 2017, which claims the benefit of U.S. Provisional Patent Application No. 62/383,232, entitled “Execution Of Workflows In Distributed Systems”, filed Sep. 2, 2016, and which claims the benefit of U.S. Provisional Patent Application No. 62/482,636 entitled “Integrated System to Distribute and Execute Complex Applications”, filed Apr. 6, 2017. The entire contents of each of the aforementioned applications are incorporated herein by reference.
Number | Name | Date | Kind |
---|---|---|---|
7739243 | Erickson et al. | Jun 2010 | B2 |
8464213 | Stienstra et al. | Jun 2013 | B1 |
8856724 | Somani | Oct 2014 | B2 |
9063823 | Imrey et al. | Jun 2015 | B2 |
9201632 | Somani et al. | Dec 2015 | B2 |
9477522 | Harper et al. | Oct 2016 | B2 |
9524192 | van Velzen et al. | Dec 2016 | B2 |
9557988 | Binjrajka | Jan 2017 | B2 |
9696971 | Wierda et al. | Jul 2017 | B1 |
9710261 | Frankin | Jul 2017 | B2 |
9830135 | Tripathi et al. | Nov 2017 | B2 |
9904585 | Islam et al. | Feb 2018 | B1 |
9946517 | Talby et al. | Apr 2018 | B2 |
10282175 | Busjaeger | May 2019 | B2 |
10296327 | Govindaraju | May 2019 | B2 |
10514967 | McClory | Dec 2019 | B2 |
10698733 | Lee | Jun 2020 | B1 |
11265202 | Khazanchi | Mar 2022 | B2 |
20030023472 | Lee et al. | Jan 2003 | A1 |
20050149908 | Klianev | Jul 2005 | A1 |
20060015873 | Dettinger et al. | Jan 2006 | A1 |
20060048094 | Kipman et al. | Mar 2006 | A1 |
20060212857 | Neumann et al. | Sep 2006 | A1 |
20080229306 | Omori | Sep 2008 | A1 |
20090055825 | Smith et al. | Feb 2009 | A1 |
20100023950 | Fujii | Jan 2010 | A1 |
20100050156 | Bonanno | Feb 2010 | A1 |
20100240449 | Corem et al. | Sep 2010 | A1 |
20140067804 | Yoshizawa et al. | Mar 2014 | A1 |
20140123110 | Wan et al. | May 2014 | A1 |
20140282353 | Jubran | Sep 2014 | A1 |
20140282450 | Jubran et al. | Sep 2014 | A1 |
20150067687 | Turner | Mar 2015 | A1 |
20150100945 | Nadar et al. | Apr 2015 | A1 |
20160105487 | Chandra et al. | Apr 2016 | A1 |
20160124742 | Rangasamy et al. | May 2016 | A1 |
20160147529 | Coleman et al. | May 2016 | A1 |
20160202959 | Doubleday et al. | Jul 2016 | A1 |
20160239275 | Singh et al. | Aug 2016 | A1 |
20160328230 | Schneider | Nov 2016 | A1 |
20170097814 | Norskov et al. | Apr 2017 | A1 |
20170249141 | Parees et al. | Aug 2017 | A1 |
20170346683 | Li | Nov 2017 | A1 |
20180253296 | Brebner et al. | Sep 2018 | A1 |
20190303187 | Vu | Oct 2019 | A1 |
20200142675 | Jaeger | May 2020 | A1 |
Entry |
---|
Adrien Devresse et al.; “Nix based fully automated workflows and ecosystem to guarantee scientific result reproducibility across software environments and systems”; 2015 ACM; (Devresse_2015.pdf; pp. 1-7) (Year: 2015). |
Claus Pahl et al.; “Containers and Clusters for Edge Cloud Architectures—a Technology Review”; 2015 IEEE (Pahl_2015.pdf; pp. 1-8) (Year: 2015). |
IEEE Cloud Computing; “Cloud Tidbits Column”, IEEE, Sep. 2014 (Google_Kubernetes_2014.pdf; pp. 81-84) (Year: 2014). |
Baranowski et al., “Constructing workflows from script applications”, Scientific Programming 20 (2012/2013), IOS press (Baranowski_2012.pdf; pp. 359-377) (Year: 2013). |
Number | Date | Country | |
---|---|---|---|
20200326988 A1 | Oct 2020 | US |
Number | Date | Country | |
---|---|---|---|
62482636 | Apr 2017 | US | |
62383232 | Sep 2016 | US |
Number | Date | Country | |
---|---|---|---|
Parent | 15943562 | Apr 2018 | US |
Child | 16915673 | US |
Number | Date | Country | |
---|---|---|---|
Parent | 15695322 | Sep 2017 | US |
Child | 15943562 | US |