The present disclosure generally relates to management of enterprise and cloud computing resources.
A datacenter is a dedicated space used to house computer systems and associated components, such as telecommunications and storage systems. Enterprise computing is the use of technology, information systems, and computers within an organization or business. An enterprise server is a computer server that includes programs required to collectively serve the requirements of an enterprise instead of an individual user, unit or specific application. Cloud computing is the delivery of different services or computing resources through the Internet, including data storage, servers, databases, networking, and software. Cloud-based storage makes it possible to save files to a remote database and retrieve them on demand. A cloud server is a virtual server (rather than a physical server) running in a cloud computing environment. It is built, hosted and delivered via a cloud computing platform via the internet, and can be accessed remotely. They are also known as virtual servers.
Many enterprises have spare capacity in their data-centers. Some large-scale enterprises offer cloud services by leveraging spare computing or storage capacity. However, offering cloud service from spare capacity is not viable at small scale, as spare server capacity may vary over time. Furthermore, enterprises may have security concerns as they let their servers be used by others.
Some embodiments of the disclosure provide a cloud-enterprise resource management system enables sharing of computing resources belonging to different datacenters by one or more clients of a resource pooling and sharing service. Each datacenter of includes a first partition of computing resources and a second partition of computing resources. The first partition is designated as reserved for use by an enterprise operating the datacenter. The second partition is designated as available for use by one or more clients of the resource pooling and sharing service. A workload manager in the datacenter predicts workload and transfers (i) a first computing resource from the first partition to the second partition wherein when the predicted workload is below a first threshold and (ii) a second computing resource from the second partition to the first partition when the predicted workload is above a second threshold.
By predicting workload in an enterprise datacenter, a computing device serving as the workload manager of the enterprise datacenter can participate in a resource pooling and sharing service and enables a cloud-enterprise resource management system. Based on the predicted workload, the computing device dynamically determines whether to offer spare resource to the resource pooling and sharing service or request release of resources from the resource pooling and sharing service. The resource utilization efficiency of the enterprise datacenter is improved.
In some embodiments, the first partition of computing resources (enterprise partition) is designated as reserved for use by an enterprise, and the second partition of computing resources (spare partition) is designated as available for use by one or more clients of a resource pooling and sharing service that coordinate sharing of computing resources belonging to one or more enterprises by one or more clients. In some embodiments, access to the first partition is controlled by a first firewall that allows access by the enterprise and denies access by clients of the resource pooling and sharing service, or any entity outside of the enterprise.
In some embodiments, the predicted workload being sufficiently below the capacity of the enterprise partition (e.g., below the first threshold) is used as an indication that there is excess computing capacity in the enterprise partition, and that one or more computing resources can be moved from the enterprise partition to the spare partition.
In some embodiments, the workload manager instructs a robot to physically disconnect the first computing resource from the first partition and to physically connect the first computing resource to the second partition. The workload manager may also communicate with a cloud coordinator of the resource pooling and sharing service to provide an identifier of the first computing resource and to specify a time frame at which the first computing resource becomes available for use by the clients of the resource pooling and sharing service. The cloud coordinator may in turn facilitate a client of the resource pooling and sharing service to access the first computing resource by using the provided identifier of the first computing resource at the specified time frame. The cloud-coordinator also credits an account of the enterprise and debits an account of the client based on usage of the first computing resource.
In some embodiments, the predicted workload being within a failure margin of the capacity of the enterprise partition or exceeding the capacity of the enterprise partition (e.g., above the second threshold) is used as an indication that there is insufficient computing capacity in the enterprise partition so that one or more computing resources are moved from the spare partition back to the enterprise partition.
In some embodiments, the workload manager instructs the robot to physically disconnect the second computing resource from the second partition and to physically connect the second computing resource to the first partition. The workload manager may also communicate with the cloud coordinator of the resource pooling and sharing service to request a return or release of a computing resource and to receive a reply that includes an identifier of the computing resource being returned or released. The identifier may identify a computing resource in the second partition that is not being used by any client of the resource pooling and sharing service.
The preceding Summary is intended to serve as a brief introduction to some embodiments of the disclosure. It is not meant to be an introduction or overview of all inventive subject matter disclosed in this document. The Detailed Description that follows and the Drawings that are referred to in the Detailed Description will further describe the embodiments described in the Summary as well as other embodiments. Accordingly, to understand all the embodiments described by this document, a Summary, Detailed Description and the Drawings are provided. Moreover, the claimed subject matter is not to be limited by the illustrative details in the Summary, Detailed Description, and the Drawings, but rather is to be defined by the appended claims, because the claimed subject matter can be embodied in other specific forms without departing from the spirit of the subject matter.
The drawings are of illustrative embodiments. They do not illustrate all embodiments. Other embodiments may be used in addition or instead. Details that may be apparent or unnecessary may be omitted to save space or for more effective illustration. Some embodiments may be practiced with additional components or steps and/or without all of the components or steps that are illustrated. When the same numeral appears in different drawings, it refers to the same or like components or steps.
In the following detailed description, numerous specific details are set forth by way of examples in order to provide a thorough understanding of the relevant teachings. However, it should be apparent that the present teachings may be practiced without such details. In other instances, well-known methods, procedures, components, and/or circuitry have been described at a relatively high-level, without detail, in order to avoid unnecessarily obscuring aspects of the present teachings.
Some embodiments of the disclosure provide a resource pooling and sharing service that enables a cloud-enterprise resource management system. The system allows multiple enterprises to be in a shared cloud service by pooling their excess server capacity. At the datacenter of each participating enterprise, the computing resources are divided into an enterprise partition (as an enterprise silo) and a spare partition (as a spare silo). The computing resources in the enterprise partition are resources reserved for use by the enterprise itself, while the computing resources in the spare partition can be made available for use as cloud servers.
In some embodiments, at each enterprise datacenter, an enterprise workload manager is implemented to allocate or assign computing resources into either the enterprise partition or the spare partition. The enterprise workload manager addresses security concerns by maintaining an air gap between the enterprise partition and the spare partition such that external clients of the spare servers cannot access the enterprise servers. The enterprise workload manager also makes resource allocation decisions between the enterprise partition and the spare partition (shifting boundaries between the two partitions) based on predictions of workload required by the enterprise. At the cloud, a cloud coordinator acts as the broker of all spare servers offered by all participating enterprises. The cloud coordinator matches requests for server capacity to the available spare server capacity in the participating enterprises. In some embodiments, the cloud coordinator also manages a credit system that keeps track of how much to bill the users of the spare servers and how much to credit the enterprises providing the spare servers.
For some embodiments,
In order to ensure security, each participating enterprise of the resource pooling and sharing service partitions its computing resources or servers into an enterprise partition and a spare partition. The computing resources or servers of the enterprise partition are designated as being reserved for use by the enterprise, while the computing resources or servers of the spare partition are designated as being available for use by the resource pooling and sharing service. Each participating enterprise may move computing resources from its enterprise partition into its spare partition and vice versa based on the predicted workload of the enterprise.
The resource pooling and sharing service of the cloud-enterprise resource management system 100 is implemented by a cloud coordinator 110 and enterprise workload managers at the participating enterprises. As illustrated, the enterprise A implements an enterprise workload manager 112, the enterprise B implements an enterprise workload manager 114, and the enterprise C implements an enterprise workload manager 116. The enterprise workload managers of enterprises A, B, and C determines which computing resources can be made available for the resource pooling and sharing service, and the cloud coordinator 110 in turn directs the clients W, X, Y, and Z to use the computing resources that are made available.
In some embodiments, the computing resources and servers of a participating enterprise may be located in one or more datacenters.
As illustrated, the enterprise partition 210 includes racks of computing resources or servers that are interconnected by switches and routers. The computing resources of the enterprise partition 210 are behind a firewall 215, which allows access to the computing resources of the enterprise partition 210 by the enterprise that operates or owns the datacenter. The firewall 215 allows only authenticated users of the enterprise, thereby preventing access of the enterprise partition 210 by entities outside of the enterprise, including clients of the resource pooling and sharing service. The spare partition 220 likewise includes racks of computing resources or servers that are interconnected by switches and routers. The computing resources of the spare partition 220 are behind a firewall 225 that allows access to the computing resources by the authenticated clients of the resource pooling and sharing service. The enterprise workload manager 230 of the enterprise is situated in the enterprise partition behind the firewall 215.
In some embodiments, the datacenter maintains a physical separation, or an “air gap” between the enterprise partition 210 and the spare partition 220. The “air gap” is for physically preventing the access of the computing resource in the enterprise partition by entities outside of the enterprise (e.g., clients of the resource pooling and sharing service). Specifically, no medium of communications exists between the hardware equipment (e.g., servers, racks, switches, routers) of the enterprise partition 210 and the hardware equipment of the spare partition 220 within the datacenter 200. In some embodiments, the enterprise partition 210 has no physical connection or contact with the hardware equipment of the spare partition 220. Consequently, one way to move a computing resource or server from the enterprise partition to the spare partition is to physically disconnect the computing resource from a hardware connection of the enterprise partition and to physically connect the computing resource to a hardware connection of the spare partition. Likewise, the only way to move a computing resource or server from the spare partition to the enterprise partition is to physically disconnect the computing resource from a hardware connection of the spare partition and to physically connect the computing resource to a hardware connection of the enterprise partition.
In the enterprise datacenter 200, a robot 250 is employed at the enterprise datacenter to physically connect and disconnect computing resources. For example, to transfer a computing resource 240 from the enterprise partition 210 to the spare partition 220, the enterprise workload manager 230 may instruct the robot 250 to perform the following operations: (1) unplug a cable between the computing resource 240 and a switch of the enterprise partition 210 and (2) plug-in a cable between the computing resource 240 and a switch of the spare partition 220.
In some other embodiments, a datacenter of an enterprise participating in the cloud-enterprise resource management system 100 does not use physical maneuvers to transfer computing resources between the enterprise partition and the spare partition. Instead, the computing resource being transferred remain in physical connections with both the enterprise partition and the spare partition, and the transfer is accomplished virtually in the electronic signaling domain, e.g., by changing the authentication requirement for the computing resource or imposing other software security measures to separate the enterprise partition and the spare partition.
The transfer of computing resources between the enterprise partition and the spare partition is initiated by the enterprise workload manager 230 based on its prediction of the enterprise's workload at the datacenter. In some embodiments, the predicted workload being sufficiently below the capacity of the enterprise partition (e.g., below a first threshold) is used as an indication that one or more computing resources can be moved from the enterprise partition to the spare partition. Conversely, the predicted workload being within a safety margin of the capacity of the enterprise partition or exceeding the capacity of the enterprise partition (e.g., above a second threshold) is used as an indication that one or more computing resources need to be moved from the spare partition back to the enterprise partition. The enterprise workload manager 230 may in turn (1) notify the cloud coordinator 110 that a computing resources is being made available for the resource pooling and sharing service or (2) request the cloud coordinator 110 to release a computing resource from the resource pooling and sharing service.
In some embodiments, if all of the computing resources in the spare partition 220 are being used by clients of the resource pooling and sharing service, the cloud coordinator 110 may inform the workload manager 230 that all of the computing resources in the spare partition 220 are being used. The cloud coordinator 110 in these instances may wait until at least one of the computing resources in the spare partition 220 becoming idle before notifying (at step 324) the workload manager 230 of the identity of the computing resource to be released back to the enterprise partition.
The workload predictor 411 predicts the workload to take place in the enterprise partition 210 at a future time frame. The workload predictor may make this prediction based on historical workload records, measurements or key performance indicators taken from the datacenter, as well as data provided by the enterprise operating the datacenter. The predicted workload is provided to the workload controller 413.
The workload controller 413 makes decisions regarding whether to move resources into or out of the spare partition based on the predicted workload provided by the workload predictor 411. The resource status table 412 maintains the status of the different computing resources in the datacenter. The status of a computing resource may include whether the computing resource is currently in the enterprise partition 210 or the spare partition 220, whether the computing resource is in active use by the enterprise, whether the computing resource is in active use by a client of the resource pooling and sharing service, etc. The workload controller 413 may use the content of the resource status table 412 to determine the current capacity of enterprise partition. Based on a comparison between the determined current enterprise partition capacity and the predicted workload, the workload controller 413 may (1) require additional capacity be added to the enterprise partition or (2) permit excess capacity be moved to the spare partition. The workload controller 413 may effectuate the corresponding transfer of computing resources by using the cloud interface 414 and the transfer interface 415.
The cloud interface 414 is used by the workload manager computing device 410 to communicate with the cloud coordinator 110 in order to (1) give the notification that a computing resource is being made available, (2) request the release a computing resource back to the enterprise partition, (3) identify the computer resources being transferred, and (4) determine the timing of when the transfer can take place. The communications between the cloud interface 414 and the cloud coordinator 110 are described by reference to
The transfer interface 415 controls the actual transfer of computing resources between the enterprise partition 210 and the spare partition 220. In some embodiments, the transfer interface 415 translates commands from the workload controller to specific instructions for controlling a robot (e.g., the robot 250) to physically disconnect and/or connect computing resources with the enterprise partition or the spare partition.
The enterprise interface 421 is used by the cloud coordinator to communicate with the datacenters of the enterprises that participate in the resource pooling and sharing service. The communications between the enterprise interface 421 and the participating enterprises are described by reference to
The service controller 423 determines which computing resource is to be provided to which client of the resource pooling and sharing service. The service controller 423 learns from the enterprise interface 421 which computing resources are available at which enterprise datacenter (i.e., in the spare partition), and which enterprise is requesting a release of a computing resource from the resource pooling and sharing service. The status of each computing resource is kept at the resource status table 422, and the service controller 423 uses the content of the resource status table 422 to determine whether to assign a computing resource to a client or to release a computing resource back to an enterprise.
The resource status table 422 maintains the status of the different computing resources of the different enterprise datacenters participating in the resource pooling and sharing service. The status of a computing resource may include the identifier of the datacenter that house the computing resource, indicia of whether the computing resource is available for clients of the resource pooling and sharing service, indicia of whether the computing resource is currently being used by a client of the resource pooling and sharing service, the identity of the client that is using the computing resource, etc.
The client interface 424 is used to communicate with various clients of the resource pooling and sharing service. The information being relayed to a client may include identities or addresses of the computing resources that are made available to the client, as well the authentication information that are salient for the client to access the computing resources in their corresponding enterprise datacenters.
The billing module 425 maintains a database of the accounts of the clients and the participating enterprises. The billing module 425 monitors the use of the computing resources in the participating enterprises by the clients of the resource pooling and sharing service. For example, the billing module may credit an account of an enterprise and debit an account of a client based on the monitored usage of a computing resource by the client.
The workload manager predicts (at block 510) a workload at a datacenter that includes a first partition of computing resources and a second partition of computing resources. In some embodiments, the first partition of computing resources (enterprise partition) is designated as reserved for use by an enterprise, and the second partition of computing resources (spare partition) is designated as available for use by one or more clients of a resource pooling and sharing service that coordinate sharing of computing resources belonging to one or more enterprises by one or more clients. In some embodiments, access to the first partition is controlled by a first firewall that allows access by the enterprise and denies access by clients of the resource pooling and sharing service, or any entity outside of the enterprise.
The workload manager determines (at block 520) whether the predicted workload is below a first threshold. If the predicted workload is below a first threshold, the process proceeds to block 525. Otherwise the process proceeds to block 530. In some embodiments, the predicted workload being sufficiently below the capacity of the enterprise partition (e.g., below the first threshold) is used as an indication that there is excess computing capacity in the enterprise partition, and that one or more computing resources can be moved from the enterprise partition to the spare partition.
At block 525, the workload manager transfers a first computing resource from the first partition to the second partition. The process then returns to block 510 to further predict workload at the datacenter for the enterprise. In some embodiments, the workload manager instructs a robot to physically disconnect the first computing resource from the first partition and to physically connect the first computing resource to the second partition. The workload manager may also communicate with a cloud coordinator of the resource pooling and sharing service to provide an identifier of the first computing resource and to specify a time frame at which the first computing resource becomes available for use by the clients of the resource pooling and sharing service. The cloud coordinator may in turn facilitate a client of the resource pooling and sharing service to access the first computing resource by using the provided identifier of the first computing resource at the specified time frame. The cloud coordinator also credits an account of the enterprise and debits an account of the client based on usage of the first computing resource.
The workload manager determines (at block 530) whether the predicted workload is above a second threshold. If the predicted workload is above the second threshold, the process proceeds to block 535. Otherwise the process returns to block 510. In some embodiments, the predicted workload being within a failure margin of the capacity of the enterprise partition or exceeding the capacity of the enterprise partition (e.g., above the second threshold) is used as an indication that there is insufficient computing capacity in the enterprise partition so that one or more computing resources are moved from the spare partition back to the enterprise partition.
At block 535, the workload manager transfers a second computing resource from the second partition to the first partition. The process then returns to block 510 to further predict workload at the datacenter for the enterprise. In some embodiments, the workload manager instructs the robot to physically disconnect the second computing resource from the second partition and to physically connect the second computing resource to the first partition. The workload manager may also communicate with the cloud coordinator of the resource pooling and sharing service to request a return or release of a computing resource and to receive a reply that includes an identifier of the computing resource being returned or released. The identifier may identify a computing resource in the second partition that is not being used by any client of the resource pooling and sharing service.
By predicting workload in an enterprise datacenter, a computing device serving as the workload manager of the enterprise datacenter can participate in a resource pooling and sharing service and enables a cloud-enterprise resource management system. Based on the predicted workload, the computing device dynamically determines whether to offer spare resource to the resource pooling and sharing service or request release of resources from the resource pooling and sharing service. The resource utilization efficiency of the enterprise datacenter is improved.
The present application may be a system, a method, and/or a computer program product at any possible technical detail level of integration. The computer program product may include a computer readable storage medium (or media) having computer readable program instructions thereon for causing a processor to carry out aspects of the present disclosure.
The computer readable storage medium can be a tangible device that can retain and store instructions for use by an instruction execution device. The computer readable storage medium may be, for example, but is not limited to, an electronic storage device, a magnetic storage device, an optical storage device, an electromagnetic storage device, a semiconductor storage device, or any suitable combination of the foregoing. A non-exhaustive list of more specific examples of the computer readable storage medium includes the following: a portable computer diskette, a hard disk, a random access memory (RAM), a read-only memory (ROM), an erasable programmable read-only memory (EPROM or Flash memory), a static random access memory (SRAM), a portable compact disc read-only memory (CD-ROM), a digital versatile disk (DVD), a memory stick, a floppy disk, a mechanically encoded device such as punch-cards or raised structures in a groove having instructions recorded thereon, and any suitable combination of the foregoing. A computer readable storage medium, as used herein, is not to be construed as being transitory signals per se, such as radio waves or other freely propagating electromagnetic waves, electromagnetic waves propagating through a waveguide or other transmission media (e.g., light pulses passing through a fiber-optic cable), or electrical signals transmitted through a wire.
Computer readable program instructions described herein can be downloaded to respective computing/processing devices from a computer readable storage medium or to an external computer or external storage device via a network, for example, the Internet, a local area network, a wide area network and/or a wireless network. The network may comprise copper transmission cables, optical transmission fibers, wireless transmission, routers, firewalls, switches, gateway computers and/or edge servers. A network adapter card or network interface in each computing/processing device receives computer readable program instructions from the network and forwards the computer readable program instructions for storage in a computer readable storage medium within the respective computing/processing device. Computer readable program instructions for carrying out operations of the present disclosure may be assembler instructions, instruction-set-architecture (ISA) instructions, machine instructions, machine dependent instructions, microcode, firmware instructions, state-setting data, configuration data for integrated circuitry, or either source code or object code written in any combination of one or more programming languages, including an object oriented programming language such as Smalltalk, C++, or the like, and procedural programming languages, such as the “C” programming language or similar programming languages. The computer readable program instructions may execute entirely on the user's computer, partly on the user's computer, as a stand-alone software package, partly on the user's computer and partly on a remote computer or entirely on the remote computer or server. In the latter scenario, the remote computer may be connected to the user's computer through any type of network, including a local area network (LAN) or a wide area network (WAN), or the connection may be made to an external computer (for example, through the Internet using an Internet Service Provider). In some embodiments, electronic circuitry including, for example, programmable logic circuitry, field-programmable gate arrays (FPGA), or programmable logic arrays (PLA) may execute the computer readable program instructions by utilizing state information of the computer readable program instructions to personalize the electronic circuitry, in order to perform aspects of the present disclosure.
Aspects of the present disclosure are described herein with reference to flowchart illustrations and/or block diagrams of methods, apparatus (systems), and computer program products according to embodiments of the disclosure. It will be understood that each block of the flowchart illustrations and/or block diagrams, and combinations of blocks in the flowchart illustrations and/or block diagrams, can be implemented by computer readable program instructions. These computer readable program instructions may be provided to a processor of a computer, or other programmable data processing apparatus to produce a machine, such that the instructions, which execute via the processor of the computer or other programmable data processing apparatus, create means for implementing the functions/acts specified in the flowchart and/or block diagram block or blocks. These computer readable program instructions may also be stored in a computer readable storage medium that can direct a computer, a programmable data processing apparatus, and/or other devices to function in a particular manner, such that the computer readable storage medium having instructions stored therein comprises an article of manufacture including instructions which implement aspects of the function/act specified in the flowchart and/or block diagram block or blocks.
The computer readable program instructions may also be loaded onto a computer, other programmable data processing apparatus, or other device to cause a series of operational steps to be performed on the computer, other programmable apparatus or other device to produce a computer implemented process, such that the instructions which execute on the computer, other programmable apparatus, or other device implement the functions/acts specified in the flowchart and/or block diagram block or blocks. The flowchart and block diagrams in the Figures (e.g.,
Data processing systems 600 and 650 are representative of any electronic device capable of executing machine-readable program instructions. Data processing systems 600 and 650 may be representative of a smart phone, a computer system, PDA, or other electronic devices. Examples of computing systems, environments, and/or configurations that may represented by data processing systems 600 and 650 include, but are not limited to, personal computer systems, server computer systems, thin clients, thick clients, hand-held or laptop devices, multiprocessor systems, microprocessor-based systems, network PCs, minicomputer systems, and distributed cloud computing environments that include any of the above systems or devices.
The data processing systems 600 and 650 may include a set of internal components 605 and a set of external components 655 illustrated in
The set of internal components 605 also includes a R/W drive or interface 632 to read from and write to one or more portable computer-readable tangible storage devices 686 such as a CD-ROM, DVD, memory stick, magnetic tape, magnetic disk, optical disk or semiconductor storage device. The instructions for executing the process 500 can be stored on one or more of the respective portable computer-readable tangible storage devices 686, read via the respective R/W drive or interface 632 and loaded into the respective hard drive 630.
The set of internal components 605 may also include network adapters (or switch port cards) or interfaces 636 such as a TCP/IP adapter cards, wireless Wi-Fi interface cards, or 3G or 4G wireless interface cards or other wired or wireless communication links. Instructions of processes or programs described above can be downloaded from an external computer (e.g., server) via a network (for example, the Internet, a local area network or other, wide area network) and respective network adapters or interfaces 636. From the network adapters (or switch port adaptors) or interfaces 636, the instructions and data of the described programs or processes are loaded into the respective hard drive 630. The network may comprise copper wires, optical fibers, wireless transmission, routers, firewalls, switches, gateway computers and/or edge servers.
The set of external components 655 can include a computer display monitor 670, a keyboard 680, and a computer mouse 684. The set of external components 655 can also include touch screens, virtual keyboards, touch pads, pointing devices, and other human interface devices. The set of internal components 605 also includes device drivers 640 to interface to computer display monitor 670, keyboard 680 and computer mouse 684. The device drivers 640, R/W drive or interface 632 and network adapter or interface 636 comprise hardware and software (stored in storage device 630 and/or ROM 624).
It is to be understood that although this disclosure includes a detailed description on cloud computing, implementation of the teachings recited herein are not limited to a cloud computing environment. Rather, embodiments of the present disclosure are capable of being implemented in conjunction with any other type of computing environment now known or later developed. Cloud computing is a model of service delivery for enabling convenient, on-demand network access to a shared pool of configurable computing resources (e.g., networks, network bandwidth, servers, processing, memory, storage, applications, virtual machines, and services) that can be rapidly provisioned and released with minimal management effort or interaction with a provider of the service. This cloud model may include at least five characteristics, at least three service models, and at least four deployment models.
On-demand self-service: a cloud consumer can unilaterally provision computing capabilities, such as server time and network storage, as needed—automatically without requiring human interaction with the service's provider.
Broad network access: capabilities are available over a network and accessed through standard mechanisms that promote use by heterogeneous thin or thick client platforms (e.g., mobile phones, laptops, and PDAs).
Resource pooling: the provider's computing resources are pooled to serve multiple consumers using a multi-tenant model, with different physical and virtual resources dynamically assigned and reassigned according to demand. There is a sense of location independence in that the consumer generally has no control or knowledge over the exact location of the provided resources but may be able to specify location at a higher level of abstraction (e.g., country, state, or datacenter).
Rapid elasticity: capabilities can be rapidly and elastically provisioned, in some cases automatically, to quickly scale out and rapidly released to quickly scale in. To the consumer, the capabilities available for provisioning often appear to be unlimited and can be purchased in any quantity at any time.
Measured service: cloud systems automatically control and optimize resource use by leveraging a metering capability at some level of abstraction appropriate to the type of service (e.g., storage, processing, bandwidth, and active user accounts). Resource usage can be monitored, controlled, and reported, providing transparency for both the provider and consumer of the utilized service.
Software as a Service (SaaS): the capability provided to the consumer is to use the provider's applications running on a cloud infrastructure. The applications are accessible from various client devices through a thin client interface such as a web browser (e.g., web-based e-mail). The consumer does not manage or control the underlying cloud infrastructure including network, servers, operating systems, storage, or even individual application capabilities, with the possible exception of limited user-specific application configuration settings.
Platform as a Service (PaaS): the capability provided to the consumer is to deploy onto the cloud infrastructure consumer-created or acquired applications created using programming languages and tools supported by the provider. The consumer does not manage or control the underlying cloud infrastructure including networks, servers, operating systems, or storage, but has control over the deployed applications and possibly application hosting environment configurations. Infrastructure as a Service (IaaS): the capability provided to the consumer is to provision processing, storage, networks, and other fundamental computing resources where the consumer is able to deploy and run arbitrary software, which can include operating systems and applications. The consumer does not manage or control the underlying cloud infrastructure but has control over operating systems, storage, deployed applications, and possibly limited control of select networking components (e.g., host firewalls).
Private cloud: the cloud infrastructure is operated solely for an organization. It may be managed by the organization or a third party and may exist on-premises or off-premises.
Community cloud: the cloud infrastructure is shared by several organizations and supports a specific community that has shared concerns (e.g., mission, security requirements, policy, and compliance considerations). It may be managed by the organizations or a third party and may exist on-premises or off-premises.
Public cloud: the cloud infrastructure is made available to the general public or a large industry group and is owned by an organization selling cloud services.
Hybrid cloud: the cloud infrastructure is a composition of two or more clouds (private, community, or public) that remain unique entities but are bound together by standardized or proprietary technology that enables data and application portability (e.g., cloud bursting for load-balancing between clouds).
A cloud-computing environment is service oriented with a focus on statelessness, low coupling, modularity, and semantic interoperability. At the heart of cloud computing is an infrastructure that includes a network of interconnected nodes.
Referring now to
Referring now to
Hardware and software layer 860 includes hardware and software components. Examples of hardware components include: mainframes 861; RISC (Reduced Instruction Set Computer) architecture based servers 862; servers 863; blade servers 864; storage devices 865; and networks and networking components 866. In some embodiments, software components include network application server software 867 and database software 868.
Virtualization layer 870 provides an abstraction layer from which the following examples of virtual entities may be provided: virtual servers 871; virtual storage 872; virtual networks 873, including virtual private networks; virtual applications and operating systems 874; and virtual clients 875.
In one example, management layer 880 may provide the functions described below. Resource provisioning 881 provides dynamic procurement of computing resources and other resources that are utilized to perform tasks within the cloud computing environment. Metering and Pricing 882 provide cost tracking as resources are utilized within the cloud computing environment, and billing or invoicing for consumption of these resources. In one example, these resources may include application software licenses. Security provides identity verification for cloud consumers and tasks, as well as protection for data and other resources. User portal 883 provides access to the cloud-computing environment for consumers and system administrators. Service level management 884 provides cloud computing resource allocation and management such that required service levels are met. Service Level Agreement (SLA) planning and fulfillment 885 provide pre-arrangement for, and procurement of, cloud computing resources for which a future requirement is anticipated in accordance with an SLA.
Workloads layer 890 provides examples of functionality for which the cloud computing environment may be utilized. Examples of workloads and functions which may be provided from this layer include: mapping and navigation 891; software development and lifecycle management 892; virtual classroom education delivery 893; data analytics processing 894; transaction processing 895; and workload 896. In some embodiments, the workload 896 performs some of the operations of the cloud coordinator 110.
The foregoing one or more embodiments implements a workload manager at an enterprise datacenter for a resource pooling and sharing service in a cloud-enterprise resource management system. The workload manager is implemented within a computer infrastructure by having one or more computing devices performing workload predictions for the datacenter and moving computing resources between an enterprise partition and a spare partition based on the predicted workload. The computer infrastructure is further used to communicate with a cloud coordinator of the resource pooling and sharing service, which may also be implemented within a computer infrastructure.
The descriptions of the various embodiments of the present disclosure have been presented for purposes of illustration, but are not intended to be exhaustive or limited to the embodiments disclosed. Many modifications and variations will be apparent to those of ordinary skill in the art without departing from the scope and spirit of the described embodiments. The terminology used herein was chosen to best explain the principles of the embodiments, the practical application or technical improvement over technologies found in the marketplace, or to enable others of ordinary skill in the art to understand the embodiments disclosed herein.
Number | Name | Date | Kind |
---|---|---|---|
9379994 | Wray | Jun 2016 | B2 |
9442771 | Morgan | Sep 2016 | B2 |
9699114 | Hintermeister | Jul 2017 | B1 |
10496541 | Yang et al. | Dec 2019 | B2 |
20090083467 | Giles | Mar 2009 | A1 |
20110145413 | Dawson et al. | Jun 2011 | A1 |
20150172204 | Anderson et al. | Jun 2015 | A1 |
Number | Date | Country |
---|---|---|
103746997 | Apr 2014 | CN |
104025055 | Sep 2014 | CN |
104794239 | Jul 2015 | CN |
106534338 | Mar 2017 | CN |
Entry |
---|
Mell, P. et al., “Recommendations of the National Institute of Standards and Technology”; NIST Special Publication 800-145 (2011); 7 pgs. |
Wan, Z. et al., “Cloud Migration: Layer Partition and Integration”; 2017 IEEE 1st International Conference on Edge Computing; IEEE Computer Society (2017); 8 pgs. |
Aatikainen, G. et al., “Cost Benefits of Flexible Hybrid Cloud Storage: Mitigating Volume Variation with Shorter Acquisition Cycle”; The Journal of Systems and Software (2016); vol. 122; pp. 180-201. |
Number | Date | Country | |
---|---|---|---|
20220138015 A1 | May 2022 | US |