This application is related to the following copending application:
This application is hereby incorporated by reference as if fully set forth herein.
1. Field of the Invention
This invention relates to reducing the bandwidth and computing resources required to transmit a web page over a network.
2. Related Art
It is desirable to transmit web pages in such a way as to minimize the bandwidth and other computing resources required to transmit the web page from a server device to a client device.
One popular technique for minimizing bandwidth requirements and computing resources is to store the web pages at multiple mirroring servers throughout the network. By “pushing content to the edge of a network” (that is, caching the content and serving it from a mirroring server), it is possible to minimize the distance (that is, geographical distance or as measured by network topology) that information must travel before reaching a destination client device. Although redistribution of the load among multiple mirroring servers improves the quality of service by minimizing distance, it does not necessarily affect the bit rate at which a web page is transmitted.
Another technique for minimizing bandwidth requirements involves compression of the web page. Compression generally involves the use of a computer program such as gzip, glib or some other similar program. Since HTML is very compressible (for example, the use of glib can result in 73.8% compression of HTML), there is a significant minimization in the amount of bandwidth required for transmission of web pages. One drawback to this technique however, is that compressing each web page is a relatively inefficient use of computing resources. Every time a web page is sent to a user, the page must be compressed, regardless whether the page was compressed in the past. For example, every time an organizational home page is sent from a server to a client device, computing resources are devoted to compression of that home page. In a second example, every time a server sends similar pages (such two different instances of an order form from an on-line retailer), those pages must be separately compressed without regard for their similarity.
Accordingly, it would be desirable to provide a technique for serving relatively non-static content for delivery in a content delivery network.
In a first aspect of the invention, a template for a web page or for a set of web pages is identified and compressed at an originating server. A template can include either (1) an entire web page or (2) those elements in a web page that are relatively unchanging. Examples of templates include:
A library of these compressed templates is stored at the originating server and the mirroring servers so as to make it available for subsequent requests. Generally, compression of the template only occurs once. The template does not need to be compressed again, unless changes to the template itself are made (for example, changes to the style of the template). In this respect, the invention is very different from the prior art of HTML compression which requires compression of a web page every time the page is sent. By minimizing the number of times that compression takes place, fewer computational resources need be allocated toward compression operations. In a preferred embodiment, the compressed template is cached and served from a remote mirroring server. In other embodiments, the compressed template is cached and served from the originating server. When a client device requests a web page, the compressed template is sent from either the mirroring server (if the template is present) or from the originating server (if the template is not present at the mirroring server) by way of the mirroring server to the client device.
In a second aspect of the invention, delta information from a set of web pages is identified and compressed at the originating server. Unlike templates which include substantial similarities, delta information refers to a set of differences such as differences between instances of a particular web page. For example, in a web page of a completed order form, the web page may be divided into (1) a template, including the unchanging features of the page (or the entire page if there is no dynamic information in the page); and (2) delta information comprising specific personalized information pertaining to an individual order. In a preferred embodiment, the delta information is compressed at the originating server and sent from the originating server to the client. In a preferred embodiment, delta information is compressed each time is it transmitted. Given that the delta information is usually relatively small, the amount of computational resources devoted to its compression is also relatively small. This selective compression minimizes the bandwidth needed to transmit the page while simultaneously minimizing the computing resources allocated for compression. In other embodiments, the delta information may be compressed once and cached until such time that the compressed delta information is reused. This may be useful when the delta information includes advertising copy that may be repeated again.
In the following description, a preferred embodiment of the invention is described with regard to preferred process steps and data structures. Those skilled in the art would recognize after perusal of this application that embodiments of the invention can be implemented using one or more general purpose processors or special purpose processors or other circuits adapted to particular process steps and data structures described herein, and that implementation of the process steps and data structures described herein would not require undue experimentation or further invention.
Originating Server—as used herein, an “originating server” takes on the role of a server in a client-server relationship and is the original provider of content to a client device or to a mirroring server.
Mirroring server—as used herein, a “mirroring server” includes any device that takes on the role of a server in a client-server relationship and that receives requests from client devices and responds to those requests by sending content that originated (in whole or in part) at an originating server.
Template—as used herein, the term “template” refers to a selected portion of a web page that is relatively unchanging. If there is no difference between different instances of a web page, then the entire page may be a template.
Delta information—as used herein, the term “delta information” refers to a selected portion of a web page varies between instances of the web page.
A system for efficient compression using differential caching (shown by general character reference 100) includes one or more client devices 110 under the control of user 112, an originating server 120, a set of mirroring servers 130 and a communication network 140. A proxy encoder server 150 may be positioned “in front” of the originating server 120.
Each client device 110 includes a processor, an input element, a presentation element, a local memory and system software. Client devices 110 further include software 114 disposed for communicating with the communication network 140 and software 116 for integrating a web page.
In a preferred embodiment, each client device 110 includes a general-purpose computer, such as a laptop or workstation. However, a client device 110 can also include (either alone or in conjunction with a laptop or workstation), a hand-held calendar (such as a “Palm Pilot” or other hand-held device), a portable computer, a special purpose computer, a cellular telephone or other telephonic device, a web server acting as the agent for a user, or another device. In alternative embodiments, a client device 110 may also include any other device disposed for performing all or some of the functions described herein.
The software 116 for integrating a web page includes elements for decompressing a template, decompressing delta information, and integrating the template and delta information into a unified presentation for the user 112.
In some embodiments, the client device 110 also includes a decoder 118, preferably as a browser “plug-in”. However, the decoder 118 may also be situated in other locations downstream from the originating server 120, such as at a cache or firewall associated with an ISP or at the edge of an enterprise network. The decoder 118 specifies that it accepts delta encoding by adding information to the HTTP header In such embodiments, the decoder 118 may also perform the functions performed by software 114, such as integrating information.
Those embodiments of the invention that do not include a decoder 118 are known as “clientless”. In such instances, the server sees that the request does not come from a decoder 118 and serves deltas in the form of javascript instructions and a reference to the template. The scripting capabilities of the client's browser are directed toward applying the delta to the template and displaying the HTML page. In other embodiments, other scripting techniques may also be used.
The originating server 120 is under the control of an entity that provides web pages for users 112. Similar to the client devices 110, the originating server 120 includes a processor, an input element, a presentation element, system software and a local memory. However, unlike the client device 110, the originating server 120 includes a cache of the complete set of compressed web page templates 122, a cache of delta information 124, compression software 126 (for example gzip, glib or another comparable program) and web server software 128. The compression software 126 is used to compress both template information and delta information.
The compressed templates included in the cache of the complete set of compressed web page templates 122 are derived from web pages that include relatively unchanging elements, such as the backdrop in weather pages, blank charts used in stock pages, incomplete order form pages and similar web pages with relatively unchanging elements. If a web page does not include dynamic elements, the compressed template may include the entire web page. In a preferred embodiment, the template is compressed and cached with the complete set of compressed web page templates 122. It does not have to be compressed again unless the information itself is updated (for example, with stylistic changes).
The delta information in the cache of delta information 124 includes those portions of a set of web pages that are highly variable. This delta information can be derived from web pages that include elements that change frequently such as a weather forecast, the value of a stock, information to complete an order form and comparable information that is either very ephemeral or unique to a particular user 112. If the total number of sets of delta information 124 associated with a particular URL is relatively low (for example, a few different rotating banner advertisements) and that information is relatively small, the server can maintain a table mapping uncompressed deltas to compressed deltas. This saves computing resources because there is no need to re-compress the delta information every time it is served. Optimal benefits are obtained if the delta information is requested many times before the template changes.
The set of mirroring servers 130 is usually under the control of the same entity that controls the originating server 120. Similar to the originating server 120, the set of mirroring servers 130 includes a processor, system software, an input element, a presentation element, a local memory, a cache of compressed web page templates 132, compression software 134, and web server software 136. However, unlike the originating server 120, the cache of compressed web page templates 132 at the mirroring server 130 is not necessarily complete. Moreover, compression software 134 at the mirroring server 130 is generally used to compress delta information rather than templates as does the compression software 126. In a preferred embodiment, the mirroring servers are relatively more local to the client devices 110 than the originating server 120.
As noted supra, some embodiments of the system 100 also include a proxy encoder server 150. In these embodiments, the proxy encoder server 150 is coupled to the originating server 120, either as a separate server or as an encoder coupled to the originating server. The proxy encoder server 150 compares information stored locally or at the client to possibly fresher information from the mirroring server 130 or the originating server 120 and serves the compressed template and delta information to the client device 110.
The communication network 140 is disposed for transporting compressed templates, delta information and requests for web pages between the client devices 110, the originating server 120 and the mirroring server 130. In a preferred embodiment, the communication network 140 includes a packet switched network such as the Internet, as well as (in conjunction with or instead of) an intranet, an enterprise network, an extranet, a virtual private network, a virtual switched network, or a wireless network. In alternative embodiments, the communication network 140 may include any other set of communication links that couple the client devices 110 the originating server 120 and the mirroring servers 130.
A method 200 for efficient compression using differential caching is performed by the system 100, including a set of client devices 110, an originating server 120, a set of mirroring servers 130 and a communication network 140.
Although described serially and in a particular sequence, in a preferred embodiment the steps described herein can be performed concurrently or in parallel by the system elements, or could be performed in a different sequence or some combination thereof.
At a flow point 205, a user 112 is ready to request a web page from a mirroring server 130. In a preferred embodiment, the mirroring server 130 is relatively more proximate to the user 112 than the originating server 120.
At a step 210, the user 112 causes the client device 110 to generate a request message to the mirroring server 130 for an identified web page. In a preferred embodiment, each request for a web page is performed independently, even if a plurality of requests are to be performed substantially simultaneously.
In other embodiments, the client device 110 may request the web page from the originating server 120. Bypassing the mirroring server 130 may save bandwidth if the originating server 120 is closer to the user 112 than the mirroring server 130.
When the request is made, the client device 110 indicates whether it can receive delta information. If the client device 110 can receive delta information, the method proceeds with step 215.
At a step 215, the mirroring server 130 determines if a compressed template for the web page is available in the cache of compressed web page templates 132. If necessary, the mirroring server 130 obtains the compressed template from the originating server 120. If the compressed template is unavailable from the originating server 120 (this may be the case if the template has not been requested before, or if the template has been recently changed), a template is created and compression software 126 at the originating server 120 compresses the template. The newly compressed template is sent from the originating server 120 to the mirroring server 130 where it is stored in the cache of compressed web page templates 132.
At a step 220, the mirroring server 130 transmits the compressed template to the client device 110.
If the mirroring server 130 were bypassed in step 210, the compressed template is sent to the client device 110 from the originating server 120.
At a step 225, the client device 110 or the mirroring server 130 (depending upon the configuration of the system 100) transmits a message to the originating server 120 for the delta information.
At a step 230, the originating server 120 identifies the delta information in the cache of delta information 124 and compresses it using the compression computer program 126. Depending on the size and type of delta information 124, this compressed delta information may be stored at the originating server 120 or the mirroring server 130. This is particularly useful if there is a strong likelihood that the delta information will be requested again.
At a step 235 the originating server 120 transmits the compressed delta information to the client device 110.
At step 240 the client device 110 integrates the template and delta by performing the following substeps:
Given that the software 116 for integrating a web page is present at the client device 110, the resources required to decompress the delta information and template and integrate them do not contribute to the total resources required by the servers to compress and serve the web page.
In other embodiments, the system 100 does not include a decoder 118. These embodiments are known as “clientless”, because many of the functions normally performed by the decoder 118 are performed differently. In such embodiments, the originating server 120 identifies that the client can receive a delta (e.g. via a cookie), and serves a delta instead of the document. The delta contains a reference to the template, which can be served from either the mirroring servers 130 or from the originating server 120. In such embodiments, the template is compressed once and cached at the mirroring server 130.
The system 300 includes a set of data flows for sending and receiving information between the client devices 110, the originating server 120 and the mirroring server 130, using the communication network 140. It should be noted that it is not necessary to exhaust every data flow to achieve efficient differential caching.
A data flow 310 includes messages sets between the client device 110 under the control of the user 112 and the mirroring server 130.
Message set A from the client device 110 to the mirroring server 130 includes requests for web pages.
Message set B from the mirroring server 130 to the client device 110 includes the following:
A data flow 320 includes messages sets between the mirroring server 130 to the originating server 120.
Message set C from the mirroring server 130 to the originating server 120 includes the following:
Message set D from the originating server 120 to the mirroring server 130 includes the following:
A data flow 330 includes messages sets between the originating server 120 and the client device 110.
Message set E from the client device 110 to the originating server 120 includes the following:
Message set F from the originating server 120 to the client device 110 includes the following:
A method 400 for efficient compression using differential caching is performed by the system 100, including a set of client devices 110, an originating server 120, a set of mirroring servers 130, a proxy encoder server 150 and a communication network 140.
Similar to method 200, the steps described herein can be performed concurrently or in parallel by the system elements, or could be performed in a different sequence or some combination thereof.
At a flow point 405, a user 112 is ready to request a web page from a mirroring server 130. In a preferred embodiment, the mirroring server 130 is relatively more proximate to the user 112 than the originating server 120.
In a step 410, the user 112 requests the web page. In a preferred embodiment, this request is made using the decoder 118. The decoder 118 intercepts the request and redirects it to the proxy encoder server 150. The decoder 118 also informs the proxy encoder server 150 that the client device 110 can receive delta and template information, and provide delta encoding.
In other embodiments, the request can be made without using the decoder 118. In these other embodiments, the request goes directly to the proxy encoder server 150. The absence of a decoder 118 indicates that the client cannot provide delta encoding.
In a step 415, the proxy encoder server 150 obtains the web page or a template corresponding to the web page from the mirroring server or originating server. The proxy encoder server 150 compares the web page or template with information in the cache of compressed web page templates 132 (as noted supra, in such embodiments, these caches are local to the proxy encoder server 150). If there is not a corresponding template in the cache of compressed web page templates 132 (as may be the case if the web page was never requested before), then a new template is generated, compressed and cached. If there is a corresponding template in the cache of compressed web page templates 132 and that corresponding template needs to be updated, then updating is performed at this time and the updated template is compressed and cached. The compressed template may be tagged with information specifying a version number associated with the template.
In those embodiments that do not provide delta encoding, the compressed template may be stored at the client device 110. In such embodiments, the proxy encoder server 150 calculates the difference between the compressed template at the client device 110 and the fresh page and calculates a delta. The delta is an HTML page that includes a reference to a compressed template (preferably a Javascript) and some Javascript instructions. In this embodiment, the Javascript instructions tell the client device 110 how to transform the template into the updated web page.
In a step 420, the proxy encoder server 150 sends the compressed template to the client. This compressed template may be stored at the client device 110.
The following steps occur when the user 112 requests the same web page associated with the compressed and cached template:
In a step 425, the user 112 requests a web page using the decoder 118. In making the request, the decoder 118 also specifies what version of a compressed template for the web page has been received in the past. This request is directed to the proxy encoder server 150.
In a step 430, the proxy encoder server 150 receives the request and identifies a compressed template that is responsive to the request. The proxy encoder server 150 calculates the difference between the version of the compressed template and a presumably fresher version of the web page that is obtained from the originating server 120 or the mirroring server 130. This difference is the delta information.
In step 435, the proxy encoder server 150 sends delta information such as is responsive to the request. In some embodiments, the delta information is compressed before sending. In other embodiments, the delta information is compressed and cached.
In the “clientless” version, the proxy encoder server 150 does not know what version of the template is at the client. The proxy encoder server 150 makes this decision and instructs the client to use a specific version of the template. This is comparable to steps 310–335.
Although preferred embodiments are disclosed herein, many variations are possible which remain within the concept, scope and spirit of the invention; these variations would be clear to those skilled in the art after perusal of this application. One such variation includes caching (either at the originating server, mirroring server or proxy encoder server) a Huffman tree corresponding to a web page that has been requested in the past. In such alternative embodiments, the delta is calculated by comparing the Huffman tree to newer versions of the tree and computing a delta based upon those parts of the tree that have changed. This technique is preferable for web pages that are requested very frequently (or that change very rarely). In other embodiments, a Huffman tree corresponding to a template is generated and served separate from the delta information.
| Number | Name | Date | Kind |
|---|---|---|---|
| 5680634 | Estes | Oct 1997 | A |
| 5727129 | Barrett et al. | Mar 1998 | A |
| 5728129 | Sumners | Mar 1998 | A |
| 5774660 | Brendel et al. | Jun 1998 | A |
| 5802292 | Mogul | Sep 1998 | A |
| 5826258 | Gupta et al. | Oct 1998 | A |
| 5845088 | Lewis | Dec 1998 | A |
| 5859971 | Bittinger et al. | Jan 1999 | A |
| 5878223 | Becker et al. | Mar 1999 | A |
| 5878429 | Morris et al. | Mar 1999 | A |
| 5931904 | Banga et al. | Aug 1999 | A |
| 5944790 | Levy | Aug 1999 | A |
| 5978841 | Berger | Nov 1999 | A |
| 5983227 | Nazem et al. | Nov 1999 | A |
| 5987480 | Donohue et al. | Nov 1999 | A |
| 6003087 | Housel, III et al. | Dec 1999 | A |
| 6021426 | Douglis et al. | Feb 2000 | A |
| 6026413 | Challenger et al. | Feb 2000 | A |
| 6067565 | Horvitz | May 2000 | A |
| 6085193 | Mallkin et al. | Jul 2000 | A |
| 6098152 | Mounes-Toussi | Aug 2000 | A |
| 6112242 | Jois et al. | Aug 2000 | A |
| 6144990 | Brandt et al. | Nov 2000 | A |
| 6154767 | Altschuler et al. | Nov 2000 | A |
| 6216212 | Challenger et al. | Apr 2001 | B1 |
| 6256712 | Challenger et al. | Jul 2001 | B1 |
| 6260192 | Rosin et al. | Jul 2001 | B1 |
| 6314465 | Paul et al. | Nov 2001 | B1 |
| 6327628 | Anuff et al. | Dec 2001 | B1 |
| 6343318 | Hawkins et al. | Jan 2002 | B1 |
| 6374305 | Gupta et al. | Apr 2002 | B1 |
| 6377978 | Nguyen | Apr 2002 | B1 |
| 6397217 | Melbin | May 2002 | B1 |
| 6405252 | Gupta et al. | Jun 2002 | B1 |
| 6453319 | Mattis et al. | Sep 2002 | B1 |
| 6553413 | Leighton et al. | Apr 2003 | B1 |
| 6606525 | Muthuswamy et al. | Aug 2003 | B1 |
| 6623529 | Lakritz | Sep 2003 | B1 |
| 6629138 | Lambert et al. | Sep 2003 | B1 |
| 6704024 | Robotham | Mar 2004 | B2 |
| 6721780 | Kasriel et al. | Apr 2004 | B1 |
| 6728785 | Jungck | Apr 2004 | B1 |
| 6760746 | Schneider | Jul 2004 | B1 |
| 6862626 | Ryu | Mar 2005 | B1 |
| 6868453 | Watanabe | Mar 2005 | B1 |
| 6873877 | Tobias et al. | Mar 2005 | B1 |
| 6912591 | Lash | Jun 2005 | B2 |
| 6944817 | Danneels | Sep 2005 | B1 |
| 7058700 | Casalaina | Jun 2006 | B1 |
| 7092997 | Kasriel et al. | Aug 2006 | B1 |
| 7096418 | Singhal et al. | Aug 2006 | B1 |
| 20010037361 | Croy | Nov 2001 | A1 |
| 20010037400 | Raz et al. | Nov 2001 | A1 |
| 20010056460 | Sahota et al. | Dec 2001 | A1 |
| 20020032677 | Morgenthaler et al. | Mar 2002 | A1 |
| 20020048450 | Zetts | Apr 2002 | A1 |
| 20020056010 | Lincoln et al. | May 2002 | A1 |
| 20020138509 | Burrows et al. | Sep 2002 | A1 |
| 20020138511 | Psounls et al. | Sep 2002 | A1 |
| 20020161860 | Godlin et al. | Oct 2002 | A1 |
| 20030009563 | Douglis et al. | Jan 2003 | A1 |
| 20030110186 | Markowski et al. | Jun 2003 | A1 |
| 20030110296 | Kirssch et al. | Jun 2003 | A1 |
| 20030120752 | Corcoran | Jun 2003 | A1 |
| 20040128346 | Melamed et al. | Jul 2004 | A1 |
| 20040205165 | Melamed et al. | Oct 2004 | A1 |
| 20060168348 | Casalaina | Jul 2006 | A1 |
| Number | Date | Country |
|---|---|---|
| 010146356 | Apr 2003 | DE |
| 001662405 | May 2006 | EP |
| 02003132086 | May 2003 | JP |
| WO 0028433 | May 2000 | WO |
| WO 200254258 | Jan 2001 | WO |