The present invention relates generally to the field of data communication and cache server networks, and specifically to systems and methods for scheduling multicasting distribution of content files within content delivery networks.
For large size content, such as movies, content clients usually can tolerate some delay in exchange for better quality. A client may rather watch a high quality downloaded video at a future scheduled time than view a low quality streaming video instantaneously. For example, a mobile user can order a video in advance while he/she is in a cellular mobile network and download it at a later time while he/she is in a hotspot wireless LAN. This is known as a remote site downloading. As such, the mobile user can enjoy a high quality content at low cost.
In recent years, the use of content delivery network (CDN) technology has spread to the Internet to improve the downloading of web pages. A content delivery network (CDN) consists of cache servers at different geographic locations, i.e., network nodes with storage and transport capabilities. The basic premise of CDN technology is that the link between the cache server and the client has low cost and high bandwidth. If at the time a client requests a content file, the content file is stored in the cache of a nearby cache server, the downloading will be fast. Otherwise, the client may experience a longer delay. Thus, it is preferable for a client to download the content file from the most nearby cache server. The technology of finding the nearby cache server for a client is called request-routing. It is a procedure of redirecting a content request to a closer cache server. For example, modifying a URL from the original URL to a URL prefixed by the cache server. In another application of us, an extension of conventional request-routing with content timing is provided to redirect a request to a closer cache server based on the future availability of the requested content on the cache server.
Typically, a client can tolerate a delay for a large size content file up to the expected service time which the client designates as the time he/she wishes to retrieve the content file. Thus, even if the requested content file is not currently stored in a cache server close to the client, so long as the downloading system transfers the content file to the cache server prior to the expected service time, the client will not experience a delay. It is a goal of the industry to reduce these delays by properly scheduling the downloading of requested content files to the appropriate cache server for client retrieval.
Multicasting content delivery can be requested at different cache servers. Due to the availability of advanced content request information before content downloading, it is possible to optimize content distribution in a CDN through multicasting technologies. Typically, a downloading service requires a CDN to provide distribution of a content file to the cache server closest to where the client request for that content file is coming from. The content file must be stored on that cache server and ready for downloading to the client at a time no later than the expected service time designated by the client. As such, a need exists for improved systems and methods for scheduling the distribution of a content file to cache servers associated with requests for that content file.
Briefly, the invention concerns a method for scheduling the distribution of a content file within a cached network environment. The method comprises the steps of: receiving a request for content to be delivered at a service time, associating the content file with a particular cache server, dynamically establishing a multicasting tree of cache servers and delivering the requested content at the service time from the multicasting tree of cache servers.
Multicasting distribution can be implemented at either the transport layer or the application layer. Because there are a number of deficiencies associated with transport layer multicasting, only application layer multicasting is considered for the present invention. Transport layer multicasting requires a multicasting enabled transport network. The Internet does not typically have such a transport network. Additionally, even if there is a multicasting enabled transport network available, the transmission on all the branches of a multicasting tree must be simultaneous. This may not be possible if any of the network nodes (i.e., the cache servers) on the multicasting tree do not have transport or cache capacity at any period of the multicasting session. However, application layer multicasting can be more flexible on the transmission schedule from node to node on a multicasting tree. For a downloading service that has many downloading requests at different expected service times, the application layer multicasting could be more suitable. As used herein, application layer multicasting is defined as a store/forward action at each network node on the multicasting tree. Store implies caching on intermediate nodes and forward means transmission to multiple ports at same or different time.
Referring now to
Referring now to
The requests generated by clients A1, B1, and C1 (which for simplification of understanding will be called requests A1, B1, and C1) are associated to cache server, A, B, C, respectively, completing step 310. The associations of the requests A1, B1, and C1 with cache server A, B, and C are designated by lines 1, 2, and 3 respectively in
When the association of a request to a cache server is dynamically determined by request-routing technology, extended request-routing technology should be used. E. In this case, even when the requested content file is currently not available on a cache server, the request-routing can still associate the request to that cache server because the association is meant to deliver the content file to that cache server at a future time.
Requests A1, B1 and C1 are sent to the content server S in the order of B1, C1 and A1. The multicasting tree will initially have only one node, content server S. Because request B1 is the first request sent to content server S, step 320 will be performed for cache server B first. At step 320, the determination is made whether the cache server B is on the multicasting tree. If the answer is NO (which it is in this case), the system adds node B to the multicasting tree and continues to step 330. At step 330, the system checks for the existence of a closest upstream cache server and, in this case, finds upstream cache server C. This is done through either static hierarchy or request-routing. Request routing is illustrated. Request B1 is then associated to the cache server C, completing step 340. The association of request B1 to cache server C is shown in
Step 320 is then performed for cache server C. According to step 320, it is then determined whether cache server C is on the multicasting tree. If the answer is NO, which it is in this case, the system adds node C to the multicasting tree. The cache server C then finds its closest upstream node, which is the content server S completing step 330. The request C1 is then associated to the content server S in step 340, which is shown in
Step 320 is then performed for content server S. According to step 320, since the content server S is on the multicasting tree, the answer is YES and it goes to step 350. Since the current server is the content server, the answer at 350 is NO and it goes to process the next request.
Turning now to request C1, request C1 is generated in step 300 (subsequent to request B1), and associated with the cache server C in step 310. Since the node C was already added to the multicasting tree in performing the process for request B1, the answer at step 320 is YES and process continues to step 350. Since the service time of C1 (8 PM) is later than the service time of B1 (5 PM), the answer to step 350 is NO. The process then starts over and processes the next request.
Turning now to request A1, which was received subsequent to request C1, request A1 is generated in step 300 and associated with the cache server A in step 310. According to step 320 the determination is made whether the cache server A is on the multicasting tree. In this case, the answer is NO and the process continues to step 330. At this point, node A is first added to the multicasting tree and then cache server A finds its upstream cache server B, completing step 330. Request A1 is then associated to the cache server B in step 340. This association is shown in
The algorithm used in determining the distance between cache servers is not only based on the geographical distance but also other factors, such as cache capacity, load balance of network links, etc. For example, node A may find node C is its upstream node because the cost of caching the content from 5 PM to 7 PM at node B may be larger than the cost difference between link 7 and link 6.
While the invention has been described and illustrated in sufficient detail that those skilled in this art can readily make and use it, various alternatives, modifications, and improvements should become readily apparent without departing from the spirit and scope of the invention.
Filing Document | Filing Date | Country | Kind | 371c Date |
---|---|---|---|---|
PCT/US04/07652 | 3/12/2004 | WO | 9/11/2006 |