The present disclosure relates in general to information handling systems, and more particularly to methods and systems for optimally orchestrating workloads in a distributed system using shared layers.
As the value and use of information continues to increase, individuals and businesses seek additional ways to process and store information. One option available to users is information handling systems. An information handling system generally processes, compiles, stores, and/or communicates information or data for business, personal, or other purposes thereby allowing users to take advantage of the value of the information. Because technology and information handling needs and requirements vary between different users or applications, information handling systems may also vary regarding what information is handled, how the information is handled, how much information is processed, stored, or communicated, and how quickly and efficiently the information may be processed, stored, or communicated. The variations in information handling systems allow for information handling systems to be general or configured for a specific user or specific use such as financial transaction processing, airline reservations, enterprise data storage, or global communications. In addition, information handling systems may include a variety of hardware and software components that may be configured to process, store, and communicate information and may include one or more computer systems, data storage systems, and networking systems.
In a distributed computing system, the ecosystem may have a plurality of distributed computing endpoints, each endpoint capable of instantiating containers for executing workloads. Containerization is an increasingly popular method of packaging and distributing software, with benefits in security, portability, and scalability. Workloads running in containers may be easily moved, or “offloaded,” to machines better suited for the task.
In a distributed computing system, containers may be used to distribute workloads among various virtual or physical machines at a user's disposal. Existing orchestration solutions provide methods of placing (i.e., scheduling) workloads based on hardware requirements such as processor and/or memory hardware availability, for example. However, existing approaches do not take into account software components available at an information handling system when placing workloads.
In accordance with the teachings of the present disclosure, the disadvantages and problems associated with existing approaches to workload distribution may be reduced or eliminated.
In accordance with embodiments of the present disclosure, an information handling system may include a processor and a workload orchestrator comprising a program of instructions configured to, when read and executed by the processor, in a distributed ecosystem comprising a plurality of host systems, responsive to a request to place a workload for execution in the distributed ecosystem: determine required software layers for a container image for executing the workload; determine candidate host systems among the plurality of host systems for execution of the workload; determine for each candidate host system the required software layers for the container image that are absent from such host system; based on the required software layers for the container image that are absent from the candidate host systems, calculate a score for each candidate host system; and cause the container for the workload to be instantiated on a selected host system that has the best score among the candidate host systems.
In accordance with these and other embodiments of the present disclosure, a method may include, in a distributed ecosystem comprising a plurality of host systems, responsive to a request to place a workload for execution in the distributed ecosystem: determining required software layers for a container image for executing the workload; determining candidate host systems among the plurality of host systems for execution of the workload; determining for each candidate host system the required software layers for the container image that are absent from such host system; based on the required software layers for the container image that are absent from the candidate host systems, calculating a score for each candidate host system; and causing the container for the workload to be instantiated on a selected host system that has the best score among the candidate host systems.
In accordance with these and other embodiments of the present disclosure, an article of manufacture comprising a non-transitory computer-readable medium and computer-executable instructions carried on the computer-readable medium, the instructions readable by a processor, the instructions, when read and executed, for causing the processor to, in a distributed ecosystem comprising a plurality of host systems, responsive to a request to place a workload for execution in the distributed ecosystem: determine required software layers for a container image for executing the workload; determine candidate host systems among the plurality of host systems for execution of the workload; determine for each candidate host system the required software layers for the container image that are absent from such host system; based on the required software layers for the container image that are absent from the candidate host systems, calculate a score for each candidate host system; and cause the container for the workload to be instantiated on a selected host system that has the best score among the candidate host systems.
Technical advantages of the present disclosure may be readily apparent to one skilled in the art from the figures, description and claims included herein. The objects and advantages of the embodiments will be realized and achieved at least by the elements, features, and combinations particularly pointed out in the claims.
It is to be understood that both the foregoing general description and the following detailed description are examples and explanatory and are not restrictive of the claims set forth in this disclosure.
A more complete understanding of the present embodiments and advantages thereof may be acquired by referring to the following description taken in conjunction with the accompanying drawings, in which like reference numbers indicate like features, and wherein:
Preferred embodiments and their advantages are best understood by reference to
For the purposes of this disclosure, computer-readable media may include any instrumentality or aggregation of instrumentalities that may retain data and/or instructions for a period of time. Computer-readable media may include, without limitation, storage media such as a direct access storage device (e.g., a hard disk drive or floppy disk), a sequential access storage device (e.g., a tape disk drive), compact disk, CD-ROM, DVD, random access memory (RAM), read-only memory (ROM), electrically erasable programmable read-only memory (EEPROM), and/or flash memory; as well as communications media such as wires, optical fibers, microwaves, radio waves, and other electromagnetic and/or optical carriers; and/or any combination of the foregoing.
For the purposes of this disclosure, information handling resources may broadly refer to any component system, device or apparatus of an information handling system, including without limitation processors, service processors, basic input/output systems, buses, memories, I/O devices and/or interfaces, storage resources, network interfaces, motherboards, and/or any other components and/or elements of an information handling system.
A host system 102 may comprise an information handling system. In some embodiments, a host system 102 may comprise a server (e.g., embodied in a “sled” form factor). In these and other embodiments, a host system 102 may comprise a personal computer. In other embodiments, a host system 102 may be a portable computing device (e.g., a laptop, notebook, tablet, handheld, smart phone, personal digital assistant, etc.). As depicted in
As used herein, a host system 102 may sometimes be referred to herein as an “endpoint” of distributed ecosystem 100.
A processor 103 may include any system, device, or apparatus configured to interpret and/or execute program instructions and/or process data, and may include, without limitation, a microprocessor, microcontroller, digital signal processor (DSP), application specific integrated circuit (ASIC), or any other digital or analog circuitry configured to interpret and/or execute program instructions and/or process data. In some embodiments, processor 103 may interpret and/or execute program instructions and/or process data stored in a memory 104 and/or other computer-readable media accessible to processor 103.
A memory 104 may be communicatively coupled to a processor 103 and may include any system, device, or apparatus configured to retain program instructions and/or data for a period of time (e.g., computer-readable media). A memory 104 may include RAM, EEPROM, a PCMCIA card, flash memory, magnetic storage, opto-magnetic storage, or any suitable selection and/or array of volatile or non-volatile memory that retains data after power to host system 102 is turned off.
As shown in
A container 118 or containerized application may include instructions for a container runtime to virtualize a computer's operating system kernel, enabling users to install containers (e.g., isolated application environments) on a virtualized operating system. Containerized applications may be built based on a containerfile, which is an ordered list of instructions on how to set up an environment and install an application. Each command in such ordered list may become a subsequent layer in the final container image. Often, the first few instructions in this containerfile will install libraries that the application needs to run, forming layers that are effectively “independent” and thus good candidates for sharing across other container images.
At least one host system 102 in system 100 may have stored within its memory 104 a manager 120. A manager 120 may comprise software and/or firmware generally operable to manage containers 118 instantiated on each host system 102, including controlling migration of containers between host systems 102. Further, as described in greater detail below, a manager 120 may perform session management to maintain mappings of user sessions across endpoints.
At least one host system 102 in system 100 may have stored within its memory 104 a workload orchestrator 122. As described in greater detail below, workload orchestrator 122 may comprise software and/or firmware generally operable to optimize workload placement in container-based environments. Workload orchestrator 122 may inspect candidate endpoints that meet hardware requirements for a workload to be placed and may score the candidate endpoints according to the number of shared layers already available on the endpoint and the time expected to download and extract any unavailable layers. Such calculation incorporates metrics for endpoints' network conditions, observed layer extraction times, and potential for peer-to-peer layer transfer. Based on the scores for the candidate endpoints, workload orchestrator 122 may also select an endpoint for the workload from among the candidate endpoints.
A network interface 106 may include any suitable system, apparatus, or device operable to serve as an interface between an associated host system 102 and network 110. A network interface 106 may enable its associated host system 102 to communicate with network 110 using any suitable transmission protocol (e.g., TCP/IP) and/or standard (e.g., IEEE 802.11, Wi-Fi). In certain embodiments, a network interface 106 may include a physical network interface controller (NIC). In the same or alternative embodiments, a network interface 106 may be configured to communicate via wireless transmissions. In the same or alternative embodiments, a network interface 106 may provide physical access to a networking medium and/or provide a low-level addressing system (e.g., through the use of Media Access Control addresses). In some embodiments, a network interface 106 may be implemented as a local area network (“LAN”) on motherboard (“LOM”) interface. A network interface 106 may comprise one or more suitable network interface cards, including without limitation, mezzanine cards, network daughter cards, etc.
Network 110 may be a network and/or fabric configured to communicatively couple information handling systems to each other. In certain embodiments, network 110 may include a communication infrastructure, which provides physical connections, and a management layer, which organizes the physical connections of host systems 102 and other devices coupled to network 110. Network 110 may be implemented as, or may be a part of, a storage area network (SAN), personal area network (PAN), local area network (LAN), a metropolitan area network (MAN), a wide area network (WAN), a wireless local area network (WLAN), a virtual private network (VPN), an intranet, the Internet or any other appropriate architecture or system that facilitates the communication of signals, data and/or messages (generally referred to as data). Network 110 may transmit data using any storage and/or communication protocol, including without limitation, Fibre Channel, Fibre Channel over Ethernet (FCOE), Small Computer System Interface (SCSI), Internet SCSI (iSCSI), Frame Relay, Ethernet Asynchronous Transfer Mode (ATM), Internet protocol (IP), or other packet-based protocol, and/or any combination thereof. Network 110 and its various components may be implemented using hardware, software, or any combination thereof.
In addition to processor 103, memory 104, and network interface 106, a host system 102 may include one or more other information handling resources.
As shown in
At step 202, a new workload request may be received by workload orchestrator 122. At step 204, workload orchestrator 122 may, based on container image information present in container registry 124, determine the software layers present in a container image for executing the new workload.
At step 206, workload orchestrator 122 may determine candidate endpoints for execution of the workload. For example, based on hardware requirements for the requested workload, workload orchestrator 122 may identify the endpoints within distributed ecosystem 100 that satisfy such hardware requirements as the candidate endpoints.
At step 208, for each of the candidate endpoints, workload orchestrator 122 may determine which software layers are already present for containers executing on the endpoint, in order to further determine which layers of the workload are missing from each endpoint.
At step 210, for each layer missing on each endpoint, workload orchestrator 122 may determine the missing layer size. At step 212, based on the missing layer size, workload orchestrator 122 may determine estimated time to download the missing layer image from container registry 124 and estimated time to extract the missing layer image once downloaded. Such estimated times may be estimated based on missing layer size, and one or more parameters associated with the candidate endpoint to which the layer would need to be installed, including without limitation such endpoint's network conditions, observed layer extraction times, and peer-to-peer layer transfer.
At step 214, workload orchestrator 122 may determine for each candidate endpoint, based on estimated times for download and extraction of those layers that would be needed to execute the workload on such endpoint, an estimated time needed to launch the container for the new workload on such candidate endpoint. At step 216, based on the estimated times needed to launch the container on the candidate endpoints, workload orchestrator 122 may generate a score for each candidate endpoint.
At step 218, workload orchestrator 122 may distribute the workload to the candidate endpoint with the best score, including causing a container for the workload to be instantiated on the selected candidate endpoint, and download of software layers from container registry 124 needed to instantiate such container.
After completion of step 218, method 200 may end.
Although
Method 200 may be implemented using distributed ecosystem 100 or any other system operable to implement method 200. In certain embodiments, method 200 may be implemented partially or fully in software and/or firmware embodied in computer-readable media.
As used herein, when two or more elements are referred to as “coupled” to one another, such term indicates that such two or more elements are in electronic communication or mechanical communication, as applicable, whether connected indirectly or directly, with or without intervening elements.
This disclosure encompasses all changes, substitutions, variations, alterations, and modifications to the example embodiments herein that a person having ordinary skill in the art would comprehend. Similarly, where appropriate, the appended claims encompass all changes, substitutions, variations, alterations, and modifications to the example embodiments herein that a person having ordinary skill in the art would comprehend. Moreover, reference in the appended claims to an apparatus or system or a component of an apparatus or system being adapted to, arranged to, capable of, configured to, enabled to, operable to, or operative to perform a particular function encompasses that apparatus, system, or component, whether or not it or that particular function is activated, turned on, or unlocked, as long as that apparatus, system, or component is so adapted, arranged, capable, configured, enabled, operable, or operative. Accordingly, modifications, additions, or omissions may be made to the systems, apparatuses, and methods described herein without departing from the scope of the disclosure. For example, the components of the systems and apparatuses may be integrated or separated. Moreover, the operations of the systems and apparatuses disclosed herein may be performed by more, fewer, or other components and the methods described may include more, fewer, or other steps. Additionally, steps may be performed in any suitable order. As used in this document, “each” refers to each member of a set or each member of a subset of a set.
Although exemplary embodiments are illustrated in the figures and described above, the principles of the present disclosure may be implemented using any number of techniques, whether currently known or not. The present disclosure should in no way be limited to the exemplary implementations and techniques illustrated in the figures and described above.
Unless otherwise specifically noted, articles depicted in the figures are not necessarily drawn to scale.
All examples and conditional language recited herein are intended for pedagogical objects to aid the reader in understanding the disclosure and the concepts contributed by the inventor to furthering the art, and are construed as being without limitation to such specifically recited examples and conditions. Although embodiments of the present disclosure have been described in detail, it should be understood that various changes, substitutions, and alterations could be made hereto without departing from the spirit and scope of the disclosure.
Although specific advantages have been enumerated above, various embodiments may include some, none, or all of the enumerated advantages. Additionally, other technical advantages may become readily apparent to one of ordinary skill in the art after review of the foregoing figures and description.
To aid the Patent Office and any readers of any patent issued on this application in interpreting the claims appended hereto, applicants wish to note that they do not intend any of the appended claims or claim elements to invoke 35 U.S.C. § 112 (f) unless the words “means for” or “step for” are explicitly used in the particular claim.