The present disclosure generally relates to CDN scale down.
A content delivery network or content distribution network (CDN) is a large distributed system typically deployed in multiple data centers across the Internet. The goal of a CDN is to serve content to end-users with high availability and high performance. CDNs serve a large fraction of the Internet content, including web objects (text, graphics and scripts), downloadable objects (media files, software, documents), applications (e-commerce, portals), live streaming media, on-demand streaming media, and social networks.
The present disclosure will be understood and appreciated more fully from the following detailed description, taken in conjunction with the drawings in which:
DESCRIPTION OF EXAMPLE EMBODIMENTS
There is provided in accordance with an embodiment of the present invention, a system for orchestration of a content delivery network (CDN), the system including a processor, and a memory to store data used by the processor, wherein the processor is operative to monitor a plurality of edge caches in the CDN, determine that a first edge cache of the plurality of edge caches should be shutdown, determine that any clients downloading content from the first edge cache should continue downloading the content from a second edge cache of the plurality of edge caches, instruct a network resource to perform an action so that client content requests addressed to the first edge cache are directed to the second edge cache, without the first edge cache needing to receive the client content requests, and trigger shutdown of the first edge cache.
Reference is now made to
The content delivery system 10 includes a content delivery network (CDN) operating in a network infrastructure 20. The content delivery network typically includes a CDN orchestration sub-system 14, a plurality of edge caches 16 and other CDN components such as a request router (not shown).
The network infrastructure 20 typically includes a network resource 18 and other network components well known in the art (not shown). The network infrastructure 18 may be implemented in a cloud environment with a virtualization layer above the real resources where the network resource 18 is a cloud orchestration function/system for composing of architecture, tools and processes used by humans to deliver a defined service, stitching of software and hardware components together to deliver a defined service, and connecting and automating of work flows when applicable to deliver a defined service.
The orchestration sub-system 14 typically includes a processor 22 and a memory 24 to store data used by the processor 22. The processor 22 is operative to: monitor the edge caches 16 in the CDN, for example, for under-utilization; determine that one of the edge caches 16 should be shutdown (edge cache 16-A in the example of
The processor 22 of the orchestration sub-system 14 is operative to notify a CDN resource 23 such as the CDN request router (not shown) that new content requests 26 should not be redirected to the edge cache 16-A, but instead to one of the other edge caches 16 (e.g. edge cache 16-B).
Reference is now made to
The processor 22 of the orchestration sub-system 14 is operative to instruct the network resource 18 (via an instruction 32) to perform an action resulting in a networking re-direct, typically cloud based, so that client content requests 26, addressed to the edge cache 16-A, are directed to the edge cache 16-B, without the edge cache 16-A needing to receive the content requests 26. In other words the re-direct of the content request 26 may be performed without having the edge cache 16-A being involved in the re-direct.
Three different methods of providing network based re-directs are described in more detail with reference to
Once the network re-direct is in place, so that the content request 26 to edge cache 16-A are redirected to the edge cache 16-B, the processor 22 is operative to trigger shutdown (block 34) of the edge cache 16-A.
It should be noted that a CDN Management System component may be involved as an intermediary agent in some of the above steps. The CDN Management System component typically performs device-level and CDN-level tasks such as monitoring, configuring, upgrading, and trouble-shooting. The tasks may be performed in combination with the orchestration sub-system 14. For example, when the orchestration sub-system 14 creates a new edge cache, the orchestration sub-system 14 may communicate with the CDN Management System to notify it of the new edge cache and to request the CDN Management System to recognize the new edge cache, to configure the new edge cache and incorporate the new edge cache in to the CDN.
Reference is now made to
The steps include: monitoring the edge caches 16 (
Reference is now made to
Each edge cache 16 is allocated its own virtual network interface (VNI) 46 having a virtual Internet Protocol address 48. In the example, of
A mapping between a hostname 50 of each cache 16 and the associated virtual IP address 48 of each edge cache 16 is managed by a domain name system (DNS) mapping server 52, typically in conjunction with the network resource 18. In particular, in the example of
The client 30 is typically operative to periodically retrieve the virtual Internet Protocol address 48 for the edge cache 16-A from the DNS mapping server 52. When the virtual Internet Protocol address 48 is retrieved, a time-to-live (TTL) is also retrieved (block 54). The TTL indicates when the virtual Internet Protocol address 48 should be retrieved again by the client 30. Periodic retrieval of the virtual Internet Protocol address 48 for the edge cache 16-A from the DNS mapping server 52 is performed as the virtual Internet Protocol address 48 for the edge cache 16-A may be updated from time to time.
The client 30 requests content based on the virtual Internet Protocol address 48 retrieved from the DNS mapping server 52. The content request 26 is directed by the network to the edge cache 16-A via the virtual network interface 46-A having the associated virtual Internet Protocol address 48-A.
Reference is now made to
The processor 22 is operative to instruct the network resource 18 (via the instruction 32) to create a new virtual network interface 56 (in addition to the virtual network interface 46-B) for the edge cache 16-B and re-allocate the virtual IP address 48-A (and possibly hostname) to the new virtual network interface 56, from the virtual network interface 46-A, so that content requests 26 from the client 30 to the edge cache 16-A are re-directed by the network infrastructure 20 to the edge cache 16-B.
From the viewpoint of the client 30 making use of the CDN, a scale down is transparent at the Hypertext Transfer Protocol (HTTP) layer and is basically seen as a Transmission Control Protocol (TCP) connection reset at the TCP/IP layers as will be explained in more detail below.
The cloud-based network infrastructure 20 ensures that routing (of packets destined to the reallocated IP address 48-A) adapts in a timely fashion so that no or few packets are lost during the IP address re-allocation as will now be described in more detail below.
Prior to the scale down decision being made, the client 30 issues its content/segment requests 26 for a given uniform resource locator (URL) pattern (of the form http://hostname/path/content-name) to a given IP address 48-A. The IP address 48-A had first been obtained by the client from the DNS mapping server 52 as the IP address mapped to the hostname of the edge cache 16-A that was contained in the URL. The IP address 48-A is routed by a cloud networking component (not shown) to the edge cache 16-A. The orchestration sub-system 14 sends the instruction 32 to the network resource 18, that in turn re-allocates the IP address 48-A to the newly created virtual network interface 56 tied to the edge cache 16-B. It should be noted that instead of creating the new virtual network interface 56, the processor 22 may be operative to instruct the network resource 18 to re-allocate the virtual IP 48-A to the virtual network interface 46-B allocated to the edge cache 16-B. After the virtual IP address 48-A is re-allocated to the virtual network interface 56, the client 30 may briefly lose IP connectivity and is also likely to experience a TCP connection reset since the edge cache 16-B is unlikely to have inherited the state information associated with the TCP connection that was established on the edge cache 16-A and that is required to maintain that TCP connection. The TCP connection re-establishment is requested by the client 30 to the same IP address, but now the TCP connection is established by the edge cache 16-B. The client 30 still issues segment requests 26 to the same uniform resource locator (URL) pattern and to the same IP address 48-A. The edge cache 16-B uses configuration information to acquire and serve the requested content segments 28. The above helps ensure that sessions affected by a scale down are lightly impacted akin to a brief IP connectivity interruption, thereby increasing the likelihood of no video interruption or a transient bandwidth reduction of the ABR adaptive bitrate.
It should be noted that the virtual network interface 46-A and the new virtual network interface 56 are typically disposed on different virtual machines.
Once all, or most, of the sessions have been offloaded away from the edge cache 16-A to the edge cache 16-B, the CDN orchestration sub-system 14 triggers shutdown of the edge cache 16-A.
Reference is now made to
The processor 22 is operative to instruct the network resource 18 (via the instruction 32) to change the mapping of the hostname of the edge cache 16-A so that the hostname is mapped to the virtual Internet Protocol address 48-B assigned to the virtual network interface 46-B of the edge caches 16-B. The network resource 18 instructs the DNS mapping server 52 to change the mapping of the hostname of the edge cache 16-A so that the hostname is mapped to the virtual Internet Protocol address 48-B. It should be noted that in this embodiment the network resource 18 and the DNS mapping server 52 may be the same entity such that the instruction 32 is sent directly by the orchestration sub-system 14 to the DNS mapping server 52.
When the cached DNS entry for the hostname of the edge cache 16-A times-out (in accordance with the previously received TTL (block 54 of
Once the new DNS resolution for the edge cache 16-A has been retrieved by the client 30, the virtual Internet Protocol address 48 mapped to the hostname of the edge cache 16-A is the virtual Internet Protocol address 48 of the edge cache 16-B (69.32.156.1). Therefore, the client 30 creates the content requests 26 for retrieving content from the edge cache 16-A using the virtual Internet Protocol address 48 of the edge cache 16-B with a TCP connection establishment on initial communication with the edge cache 16-B. This embodiment does not typically need any involvement of a cloud-based routing subsystem since the mapping between IP addresses and virtual machines does not change. The orchestration sub-system 14 typically waits a certain time, greater than the TTL, which is sufficiently long enough to ensure that all or most sessions that were using the edge cache 16-A have been redirected to the edge cache 16-B, and then trigger shutdown the edge cache 16-A.
Since the hostname of a scaled down cache instance continues to be used after the instance is shutdown, a suitable hostname management mechanism needs to be implemented to make sure the hostname of a scaled down cache instance is not re-allocated to a new cache instance in a future scale up (at least not until the hostname is no longer being used).
Reference is now made to
In this embodiment, each of the edge caches 16 is allocated to its own load balancer group 60.
In the example of
The load balancer group 60-A is typically established by the processor 22 instructing the network resource 18 to: create the load balancer group 60-A and allocate the edge cache 16-A to the load balancer group 60-A; and map the hostname 50 of the edge cache 16-A to the virtual Internet IP address 62-A of the load balancer group 60-A.
The load balancer group 60-B is typically established by the processor 22 instructing the network resource 18 to: create the load balancer group 60-B and allocate the edge cache 16-B to the load balancer group 60-B; and map the hostname 50 of the edge cache 16-B to the virtual Internet IP address 62-B of the load balancer group 60-B.
The network resource 18 is operative to configure each load-balancer group 60 to direct 100% of content requests to the edge cache 16 allocated to that load balancer group 60. The network resource 18 is operative to ensure that the DNS mapping server 52 is configured to return the load-balancer group 60 virtual IP address 62 for the hostname 50 of the edge cache 16 being requested. Therefore, when the client 30 creates the content request 26, the content request 26 is addressed to the virtual Internet Protocol (IP) address 62 of the load balancer group 60 to which the relevant edge cache 16 is allocated. In the example of
Reference is now made to
The processor is operative to instruct the network resource 18 to: allocate the edge cache 16-B to the load balancer group 60-A in addition to being allocated to the load balancer group 60-B; and configure the load balancer group 60-A to direct substantially all, generally 100%, of the arriving client content requests 26 to the edge cache 16-B in the load balancer group 60-A. As a result, all the subsequent content requests 26 (e.g. ABR segment requests) from the client 30 towards the hostname of the edge cache 16-A are now directed by the load-balancer 60-A to the edge cache 16-B. This embodiment does not require any changes at the DNS mapping server 52, since the mapping between the hostname and the virtual IP address of the edge cache 16-A does not change during shutdown processing.
The processor 22 is operative to instruct the network resource 18 to shutdown the edge cache 16-A.
The processor 22 is also operative to instruct the network resource 18 to shutdown the load balancer group 60-A once all sessions using the load balancer group 60-A are terminated.
The edge cache 16-A is typically shutdown prior to shutdown of the load balancer group 60-A. However, the edge cache 16-A may be shutdown after the shutdown of the load balancer group 60-A.
It should be noted that processor 22 generally does not know with certainty if and when all sessions using the load balancer group 60-A are actually terminated. Therefore, termination of the sessions using the load balancer group 60-A may be estimated with sufficient likelihood or checking for session activity.
It should be noted that all three embodiments described above may be implemented regardless of which CDN request redirection method is used to initially redirect the ABR session client 30 to the edge cache 16-A, for example, but not limited to, HTTP, DNS, application programming interface (API).
All three embodiments generally result in a TCP connection reset, from the viewpoint of the client 30, since the edge cache 16 is changed.
The above embodiments have been described assuming transport over HTTP over TCP. It will be appreciated by those ordinarily skilled in the art that other transport stacks may be used, for example, but not limited to, HTTP over Quick UDP Internet Connections (QUIC) over User Datagram Protocol (UDP).
In practice, some or all of these functions may be combined in a single physical component or, alternatively, implemented using multiple physical components. These physical components may comprise hard-wired or programmable devices, or a combination of the two. In some embodiments, at least some of the functions of the processing circuitry may be carried out by a programmable processor under the control of suitable software. This software may be downloaded to a device in electronic form, over a network, for example. Alternatively or additionally, the software may be stored in tangible, non-transitory computer-readable storage media, such as optical, magnetic, or electronic memory.
It is appreciated that software components may, if desired, be implemented in ROM (read only memory) form. The software components may, generally, be implemented in hardware, if desired, using conventional techniques. It is further appreciated that the software components may be instantiated, for example: as a computer program product or on a tangible medium. In some cases, it may be possible to instantiate the software components as a signal interpretable by an appropriate computer, although such an instantiation may be excluded in certain embodiments of the present invention.
It will be appreciated that various features of the invention which are, for clarity, described in the contexts of separate embodiments may also be provided in combination in a single embodiment. Conversely, various features of the invention which are, for brevity, described in the context of a single embodiment may also be provided separately or in any suitable sub-combination.
It will be appreciated by persons skilled in the art that the present invention is not limited by what has been particularly shown and described hereinabove. Rather the scope of the invention is defined by the appended claims and equivalents thereof.
Number | Date | Country | Kind |
---|---|---|---|
PCT/CN2014/093901 | Dec 2014 | CN | national |
1500612.5 | Jan 2015 | GB | national |
RELATED APPLICATION INFORMATION The present application claims priority from International Patent Application S/N PCT/CN2014/093901 of Cisco Technologies Inc. filed on 16 Dec. 2014.