“Cloud computing” is a model for enabling ubiquitous, convenient, on-demand network access to a shared pool of configurable computing resources (e.g., networks, servers, storage, applications, and services). The shared pool of configurable computing resources can be rapidly provisioned via virtualization and released with low management effort or service provider interaction, and then scaled accordingly. A cloud computing model can be composed of various characteristics (e.g., on-demand self-service, broad network access, resource pooling, rapid elasticity, measured service, etc), service models (e.g., Software as a Service (“SaaS”), Platform as a Service (“PaaS”), Infrastructure as a Service (“IaaS”), and deployment models (e.g., private cloud, community cloud, public cloud, hybrid cloud, etc.). An environment that implements the cloud computing model is often referred to as a cloud computing environment.
A cloud computing environment may include a number of data centers, each having computing resources such as processing power, memory, storage, bandwidth, and so forth. Some of the data centers are larger and may be referred to as origin data centers. Origin data centers may be distributed throughout the globe. The cloud computing environment may also have a larger number of smaller data centers, referred to as “edge data centers” also distributed through the globe. In general, for a given network location, a client entity (e.g., a client computing system or its user) is often a lot closer geographically and closer from a network perspective (in terms of lower latency) to an edge data center than to an origin data center.
At least one embodiment described herein relates to the improved performance of a cloud computing environment using an edge data center. A cloud computing environment includes larger origin data centers, and smaller, but more numerous, edge data centers. A management service receives requests for the cloud computing environment to host applications. In response, the management service allocates the application to run on an origin data center, evaluates the application by evaluating at least one application property specified by a provider of application code corresponding to the application or evaluating runtime performance of the application, and uses an edge server to improve performance of the application in response to evaluating the application. As examples only, a portion of application code may be offloaded to run on the edge data center, a portion of application data may be cached at the edge data center, and/or the edge server may add functionality to the application.
This Summary is not intended to identify key features or essential features of the claimed subject matter, nor is it intended to be used as an aid in determining the scope of the claimed subject matter.
In order to describe the manner in which the above-recited and other advantages and features can be obtained, a more particular description of various embodiments will be rendered by reference to the appended drawings. Understanding that these drawings depict only sample embodiments and are not therefore to be considered to be limiting of the scope of the invention, the embodiments will be described and explained with additional specificity and detail through the use of the accompanying drawings in which:
In accordance with embodiments described herein, a management service receives requests for the cloud computing environment to host applications. In response, the management service allocates the application to run on an origin data center, evaluates the application by evaluating at least one application property specified by a provider of application code corresponding to the application or evaluating runtime performance of the application, and uses an edge server to improve performance of the application in response to evaluating the application. As examples only, a portion of application code may be offloaded to run on the edge data center, a portion of application data may be cached at the edge data center, or the edge server may add functionality to the application. First, some introductory discussion regarding computing systems will be described with respect to
Computing systems are now increasingly taking a wide variety of forms. Computing systems may, for example, be handheld devices, appliances, laptop computers, desktop computers, mainframes, distributed computing systems, or even devices that have not conventionally been considered a computing system. In this description and in the claims, the term “computing system” is defined broadly as including any device or system (or combination thereof) that includes at least one physical and tangible processor, and a physical and tangible memory capable of having thereon computer-executable instructions that may be executed by the processor. The memory may take any form and may depend on the nature and form of the computing system. A computing system may be distributed over a network environment and may include multiple constituent computing systems.
As illustrated in
In the description that follows, embodiments are described with reference to acts that are performed by one or more computing systems. If such acts are implemented in software, one or more processors of the associated computing system that performs the act direct the operation of the computing system in response to having executed computer-executable instructions. An example of such an operation involves the manipulation of data. The computer-executable instructions (and the manipulated data) may be stored in the memory 104 of the computing system 100. Computing system 100 may also contain communication channels 108 that allow the computing system 100 to communicate with other message processors over, for example, network 110.
Embodiments described herein may comprise or utilize a special purpose or general-purpose computer including computer hardware, such as, for example, one or more processors and system memory, as discussed in greater detail below. Embodiments described herein also include physical and other computer-readable media for carrying or storing computer-executable instructions and/or data structures. Such computer-readable media can be any available media that can be accessed by a general purpose or special purpose computer system. Computer-readable media that store computer-executable instructions are physical storage media. Computer-readable media that carry computer-executable instructions are transmission media. Thus, by way of example, and not limitation, embodiments of the invention can comprise at least two distinctly different kinds of computer-readable media: computer storage media and transmission media.
Computer storage media includes RAM, ROM, EEPROM, CD-ROM or other optical disk storage, magnetic disk storage or other magnetic storage devices, or any other medium which can be used to store desired program code means in the form of computer-executable instructions or data structures and which can be accessed by a general purpose or special purpose computer.
A “network” is defined as one or more data links that enable the transport of electronic data between computer systems and/or modules and/or other electronic devices. When information is transferred or provided over a network or another communications connection (either hardwired, wireless, or a combination of hardwired or wireless) to a computer, the computer properly views the connection as a transmission medium. Transmissions media can include a network and/or data links which can be used to carry or desired program code means in the form of computer-executable instructions or data structures and which can be accessed by a general purpose or special purpose computer. Combinations of the above should also be included within the scope of computer-readable media.
Further, upon reaching various computer system components, program code means in the form of computer-executable instructions or data structures can be transferred automatically from transmission media to computer storage media (or vice versa). For example, computer-executable instructions or data structures received over a network or data link can be buffered in RAM within a network interface module (e.g., a “NIC”), and then eventually transferred to computer system RAM and/or to less volatile computer storage media at a computer system. Thus, it should be understood that computer storage media can be included in computer system components that also (or even primarily) utilize transmission media.
Computer-executable instructions comprise, for example, instructions and data which, when executed at a processor, cause a general purpose computer, special purpose computer, or special purpose processing device to perform a certain function or group of functions. The computer executable instructions may be, for example, binaries, intermediate format instructions such as assembly language, or even source code. Although the subject matter has been described in language specific to structural features and/or methodological acts, it is to be understood that the subject matter defined in the appended claims is not necessarily limited to the described features or acts described above. Rather, the described features and acts are disclosed as example forms of implementing the claims.
Those skilled in the art will appreciate that the invention may be practiced in network computing environments with many types of computer system configurations, including, personal computers, desktop computers, laptop computers, message processors, hand-held devices, multi-processor systems, microprocessor-based or programmable consumer electronics, network PCs, minicomputers, mainframe computers, mobile telephones, PDAs, pagers, routers, switches, and the like. The invention may also be practiced in distributed system environments where local and remote computer systems, which are linked (either by hardwired data links, wireless data links, or by a combination of hardwired and wireless data links) through a network, both perform tasks. In a distributed system environment, program modules may be located in both local and remote memory storage devices.
Each client 201 may, for example, be structured as described above for the computing system 100 of
Cloud computing environments may be distributed and may even be distributed internationally and/or have components possessed across multiple organizations. In this description and the following claims, “cloud computing” is defined as a model for enabling on-demand network access to a shared pool of configurable computing resources (e.g., networks, servers, storage, applications, and services). The definition of “cloud computing” is not limited to any of the other numerous advantages that can be obtained from such a model when properly deployed.
For instance, cloud computing is currently employed in the marketplace so as to offer ubiquitous and convenient on-demand access to the shared pool of configurable computing resources. Furthermore, the shared pool of configurable computing resources can be rapidly provisioned via virtualization and released with low management effort or service provider interaction, and then scaled accordingly.
A cloud computing model can be composed of various characteristics such as on-demand self-service, broad network access, resource pooling, rapid elasticity, measured service, and so forth. A cloud computing model may also come in the form of various service models such as, for example, Software as a Service (“SaaS”), Platform as a Service (“PaaS”), and Infrastructure as a Service (“IaaS”). The cloud computing model may also be deployed using different deployment models such as private cloud, community cloud, public cloud, hybrid cloud, and so forth. In this description and in the claims, a “cloud computing environment” is an environment in which cloud computing is employed.
The system 210 includes multiple data centers 211, each including corresponding computing resources, such as processing, memory, storage, bandwidth, and so forth. The data centers 211 include larger origin data centers 211A, 211B and 211C, though the ellipses 211D represent that there is no restriction as to the number of origin data centers within the data center group 211. Also, the data centers 211 include smaller edge data centers 211a through 211i, although the ellipses 211j represent that there is no restriction as to the number of edge data centers within the data center group 211. Each of the data centers 211 may include perhaps a very large number of host computing systems that may be each structured as described above for the computing system 100 of
The data centers 211 may be distributed geographically, and perhaps even throughout the world if the cloud computing environment 200 spans the globe. The origin data centers 211A through 211D have greater computing resources, and thus are more expensive, as compared to the edge data centers 211a through 211j. Thus, there are a smaller number of origin data centers distributed throughout the coverage of the cloud computing environment 200. The edge data centers 211 have lesser computing resource, and thus are less expensive. Thus, there is a larger number of edge data centers distributed throughout the coverage of the cloud computing environment 200. Thus, for a majority of clients 201, it is more likely that the client entity (e.g., the client machine itself or its user) is closer geographically and closer from a network perspective (in terms of latency) to an edge data center as compared to an origin data center.
The cloud computing environment 200 also includes services 212. In the illustrated example, the services 200 include five distinct services 212A, 212B, 212C, 212D and 212E, although the ellipses 212F represent that the principles described herein are not limited to the number of service in the system 210. A service coordination system 213 communicates with the data centers 211 and with the services 212 to thereby provide services requested by the clients 201, and other services (such as authentication, billing, and so forth) that may be prerequisites for the requested service.
One of the services 212 (e.g., service 212A) may be a management service that is described in further detail below, and that operates to deploy and operating an application in the cloud computing environment in a manner that performance of the application is enhanced.
The method 300 is performed in response to receiving a request for the cloud computing environment to host an application (act 301). The request may come with the application code itself, as well as a description of the structure and dependencies of the application and its constituent components. For example,
The management service then responds by allocating the application to run on an origin data center (act 302). For instance, suppose in the reference example, that the management service 212A responds to the request from the client 201A by allocating the application to run on the origin data center 211A.
Returning to
Returning to
In protocol translation, the application 410 is capable of interfacing over the channel 511 using a first set of protocols, whereas the client 503A is capable of interfacing over the channel 512 using a second set of protocols. Should the client entity 503 communicate over channel 512 using one of the second set of protocols that is not also in the first set of protocols, the component 801 performs protocol translation of the protocol from channel 512 into one of the first set of protocols for communication with the application 410 over channel 511. Thus, the component 801 may perform protocol translation allowing the application 410 to interface with client entities 503 that are not capable of directly interfacing with the application 410.
In compression functionality, the component 801 extracts compressed communications received from the application 410 over channel 511 or the client entity 503 over channel 512. Alternatively or in addition, the component 801 compresses communications transmitted to the application 410 over channel 511 or to the client entity 503 over channel 512. Thus, the component 801 may perform compression and/or extraction on behalf of the application 410 or the client entity 503.
In encryption functionality, the component 801 decrypts communications received from the application 410 over the channel 511 or the client entity 503 over the channel 512. Alternatively or in addition, the component 801 encrypts communications transmitted to the application 410 over channel 511, or to the client entity 503 over channel 512. Thus, the component 801 may perform encryption and/or decryption on behalf of the application 410 or the client entity 503.
In authentication functionality, the component 801 authenticates the client entity 503 or a third party to the application 410, or authenticates the application 410 or a third party to the client entity 503 of the application.
In load balancing functionality, the component 801 handles application requests associated with the application instead of the origin data server depending on a workload of the origin data server. For instance, if the application request would normally be handled by the origin data server 211A, but that origin data server is busy, the edge data server 502 may reroute that application request to another origin data server, or another edge data server.
For instance,
Origin data center 910(i) communicates with second tier data center 910(ii) using channel 911(i). Second tier data center 910(ii) communicates the next tier data center (data center 910(n) if “n” equals three, or 910(iii) (not shown) if “n” is greater than three) over channel 911(ii). This continues until the “n”th tier data center 910(n) communicates with the prior tier data center (data center 910(ii) if “n” equals three, or 910(n−1) (not shown) if “n” is greater than three) over channel 911(n−1). Mathematically stated, data center 910(k) communicates with the next tier data center 910(k+1) over channel 911(k), where “k” is any integer from 1 to n−1, inclusive. The “n′”th tier data center 910(n) communicates with client entity 503 over channel 911(n). In this example, the data centers become progressive smaller leading from the origin data center 910(i) to the edge data center 910(n)
Thus, a management service is described that operates in a cloud computing environment that allows an application to be hosted by an origin data center, while improving performance of the application using higher tier or edge data center.
The present invention may be embodied in other specific forms without departing from its spirit or essential characteristics. The described embodiments are to be considered in all respects only as illustrative and not restrictive. The scope of the invention is, therefore, indicated by the appended claims rather than by the foregoing description. All changes which come within the meaning and range of equivalency of the claims are to be embraced within their scope.