The present invention relates to dynamic content caching systems and methods.
The following patent publications are believed to represent the current state of the art:
U.S. Pat. Nos. 6,351,767; 6,408,360; 6,757,708; 6,823,374; 7,096,418; 7,320,023; 7,343,412; and
U.S. Published Patent Application Nos.: 2002/0120710; 2003/0004998; 2004/0044731; 2005/0240732; 2009/0049243.
The present invention provides improved systems and methodologies for dynamic content caching.
There is thus provided in accordance with a preferred embodiment of the present invention a system for caching content including a server supplying at least one of static and non-static content elements, content distinguishing functionality operative to categorize elements of the non-static content as being either dynamic content elements or pseudodynamic content elements, and caching functionality operative to cache the pseudodynamic content elements.
In accordance with a preferred embodiment of the present invention the static content elements are content elements which are identified by at least one of the server and metadata associated with the content elements as being expected not to change, the non-static content elements are content elements which are not identified by the server and/or by metadata associated with the content elements as being static content elements, the pseudodynamic content elements are non-static content elements which, based on observation, are not expected to change, and the dynamic content elements are non-static content elements which are not pseudodynamic.
Preferably, the content distinguishing functionality is operative to distinguish between the dynamic content and the pseudodynamic content by distinguishing between content downloaded to disparate clients that has a changing byte content and content that has a static byte content.
Preferably, the caching functionality is operative to ascertain whether the content is cached upon receiving a content request from a client, the content request including a full URL. Preferably, the caching functionality is operative to provide the content to the client upon ascertaining that the content is cached. Alternatively, the caching functionality is operative to route the content request to a web server which hosts the full URL upon ascertaining that the content is not cached. Preferably, the client is operative to receive a response from the web server to the content request. Additionally or alternatively, the content distinguishing functionality is operative to receive a response from the web server to the content request.
Preferably, the content distinguishing functionality includes a URL database which stores a list of full URLs which were previously processed by the content distinguishing functionality. Preferably, for each previously processed full URL stored in the URL database, the database stores a caching state associated with that full URL.
Preferably, the content distinguishing functionality is also operative to receive the full URL and attributes of the client. Preferably, the attributes include source IP, session ID, user agent and screen size. Preferably, the content distinguishing functionality is also operative to receive a digest of the response and a timestamp of the response. Preferably, the content distinguishing functionality is also operative to calculate a digest of the response and a timestamp of the response. Preferably, the content distinguishing functionality is also operative to ascertain whether the full URL is stored in the URL database.
Preferably, the content distinguishing functionality is operative to store the full URL, the digest, the timestamp and the attributes in the URL database, and to set a stored caching state corresponding to the full URL to “learning” when the full URL is not stored in the URL database. Alternatively, the content distinguishing functionality is also operative to ascertain whether the stored caching state corresponding to the full URL is one of “learning”, “pseudodynamic” and “dynamic” when the full URL is stored in the URL database.
Preferably, the content distinguishing functionality is also operative to ascertain whether the digest is identical to a stored digest corresponding to the URL when the stored caching state is “learning”. Additionally, the content distinguishing functionality is also operative to store the attributes in the URL database when the digest is identical to a stored digest corresponding to the URL, and, responsive to a predefined learning time having elapsed since the stored caching state was initially set as “learning” and a predefined sufficient variety of client attributes having been stored over a predefined minimum number of responses associated with the URL, the content distinguishing functionality is also operative to set the caching state as “pseudodynamic”, and to store the response in the cache with a predefined caching expiration time. Alternatively, the caching state of the URL is set as “dynamic” and the timestamp is stored when the digest is not identical to a stored digest corresponding to the URL. Preferably, the variety of client attributes includes a minimum number of distinct IP addresses in combination with a minimum number of distinct user agents. Preferably, the predefined caching expiration time is shorter than the learning time.
Alternatively, the content distinguishing functionality is also operative to ascertain whether the digest is identical to a stored digest corresponding to the URL when the stored caching state is “pseudodynamic”. Preferably, the response is stored in the cache with a new caching expiration time associated therewith when the digest is identical to a stored digest corresponding to the URL. Alternatively, the caching state of the URL is set as “learning” and the digest, the timestamp and the attributes are stored when the digest is not identical to a stored digest corresponding to the URL. Preferably, the new caching expiration time is equal to the predefined caching expiration time. Alternatively, the new caching expiration time is not equal to the predefined caching expiration time.
Alternatively, the content distinguishing functionality is also operative to ascertain whether a predefined amount of refresh time has elapsed since a stored timestamp of the URL when the stored caching state is “dynamic”. Preferably, the caching state of the URL is set as “learning” and the digest, the timestamp and the attributes are stored in the entry when a predefined amount of refresh time has elapsed since a stored timestamp of the URL.
There is also provided in accordance with another preferred embodiment of the present invention content distinguishing functionality operative in a system for serving content including a server supplying at least one of static content and non-static content, the content distinguishing functionality being operative to categorize elements of the non-static content as being either dynamic content elements or pseudodynamic content elements, and caching functionality operative to cache the pseudodynamic content.
Preferably, the static content elements are content elements which are identified by at least one of the server and metadata associated with the content elements as being expected not to change, the non-static content elements are content elements which are not identified by the server and/or by metadata associated with the content elements as being static content elements, the pseudodynamic content elements are non-static content elements which, based on observation, are not expected to change, and the dynamic content elements are non-static content elements which are not pseudodynamic.
Preferably, the content distinguishing functionality is operative to distinguish between the dynamic content and the pseudodynamic content by distinguishing between content downloaded to disparate clients that has a changing byte content and content that has a static byte content.
There is further provided in accordance with yet another preferred embodiment of the present invention a method for caching content including supplying at least one of static and non-static content elements, categorizing elements of the non-static content as being either dynamic content elements or pseudodynamic content elements, and caching the pseudodynamic content.
In accordance with a preferred embodiment of the present invention the static content elements are content elements which are identified by at least one of the server and metadata associated with the content elements as being expected not to change, the non-static content elements are content elements which are not identified by the server and/or by metadata associated with the content elements as being static content elements, the pseudodynamic content elements are non-static content elements which, based on observation, are not expected to change, and the dynamic content elements are non-static content elements which are not pseudodynamic.
Preferably, the categorizing includes distinguishing between the dynamic content and the pseudodynamic content by distinguishing between content downloaded to disparate clients that has a changing byte content and content that has a static byte content.
Preferably, the categorizing also includes ascertaining whether the content is cached upon receiving a content request including a full URL from a client. Preferably, the categorizing includes providing the content to the client upon ascertaining that the content is cached. Alternatively, the categorizing includes routing the content request to a web server which hosts the full URL upon ascertaining that the content is not cached. Preferably, the client is operative to receive a response from the web server to the content request. Additionally or alternatively, the categorizing also includes receiving a response from the web server to the content request.
Preferably, the categorizing also includes storing a list of previously processed full URLs. Preferably, a caching state associated with the previously processed full URL is stored for each stored previously processed full URL.
Preferably, the categorizing also includes receiving the full URL and attributes of the client. Preferably, the attributes include source IP, session ID, user agent and screen size. Preferably, the categorizing also includes receiving a digest of the response and a timestamp of the response. Preferably, the categorizing also includes calculating a digest of the response and a timestamp of the response. Preferably, the categorizing also includes ascertaining whether the full URL is stored in the URL database.
Preferably, the categorizing includes storing the full URL, the digest, the timestamp and the attributes and setting a stored caching state corresponding to the full URL to “learning” when the full URL is not stored. Alternatively, the categorizing also includes ascertaining whether the stored caching state corresponding to the full URL is one of “learning”, “pseudodynamic” and “dynamic” when the full URL is stored.
Preferably, the categorizing also includes ascertaining whether the digest is identical to a stored digest corresponding to the URL when the stored caching state is “learning”. Additionally, the categorizing also includes storing the attributes when the digest is identical to a stored digest corresponding to the URL, and, responsive to a predefined learning time having elapsed since the stored caching state was initially set as “learning” and a predefined sufficient variety of client attributes having been stored over a predefined minimum number of responses associated with the URL, the categorizing also includes setting the caching state as “pseudodynamic”, and caching the response with a predefined caching expiration time. Alternatively, the caching state of the URL is set as “dynamic” and the timestamp is stored when the digest is not identical to a stored digest corresponding to the URL. Preferably, the variety of client attributes includes a minimum number of distinct IP addresses in combination with a minimum number of distinct user agents. Preferably, the predefined caching expiration time is shorter than the learning time.
Alternatively, the categorizing also includes ascertaining whether the digest is identical to a stored digest corresponding to the URL when the stored caching state is “pseudodynamic”. Preferably, the response is cached with a new caching expiration time associated therewith when the digest is identical to a stored digest corresponding to the URL. Alternatively, the caching state of the URL is set as “learning”, and the digest, the timestamp and the attributes are stored when the digest is not identical to a stored digest corresponding to the URL. Preferably, the new caching expiration time is equal to the predefined caching expiration time. Alternatively, the new caching expiration time is not equal to the predefined caching expiration time.
Alternatively, the categorizing also includes ascertaining whether a predefined amount of refresh time has elapsed since a stored timestamp of the URL when the stored caching state is “dynamic”. Preferably, the caching state of the URL is set as “learning”, and the digest, the timestamp and the attributes are stored in the entry when a predefined amount of refresh time has elapsed since a stored timestamp of the URL.
There is yet further provided in accordance with still another preferred embodiment of the present invention a content distinguishing method operative in a system for serving content including a server supplying at least one of static content and non-static content, the content distinguishing method including categorizing elements of the non-static content as being either dynamic content elements or pseudodynamic content elements, and caching functionality operative to cache the pseudodynamic content.
Preferably, the static content elements are content elements which are identified by at least one of the server and metadata associated with the content elements as being expected not to change, the non-static content elements are content elements which are not identified by the server and/or by metadata associated with the content elements as being static content elements, the pseudodynamic content elements are non-static content elements which, based on observation, are not expected to change, and the dynamic content elements are non-static content elements which are not pseudodynamic.
Preferably, the categorizing includes distinguishing between the dynamic content and the pseudodynamic content by distinguishing between content downloaded to disparate clients that has a changing byte content and content that has a static byte content.
The present invention will be understood and appreciated more fully from the following detailed description, taken in conjunction with the drawings in which:
Reference is now made to
As seen in
For the purposes of the present application, the term “static content” is defined as content which is identified by the web server providing the content and/or by metadata associated with the content as content that is not expected to change over at least a predetermined time and in response to a predetermined number of content requests.
For the purposes of the present application, the term “non-static content” is defined as content which is not identified by the web server providing the content and/or by metadata associated with the content as content that is static.
In accordance with a preferred embodiment of the present invention, the system includes, in addition to server 100, content distinguishing functionality 102 operative to distinguish between non-static content that is dynamic content and non-static content which is pseudodynamic content, and caching functionality 104 operative to cache the static content and the pseudodynamic content.
For the purposes of the present application, the term “pseudodynamic content” is defined as non-static content which, based on observation, is not expected to change over a predetermined time.
For the purposes of the present application, the term “dynamic content” is defined as non-static content which is not “pseudodynamic content”.
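The four definitions above can be read as a small decision rule. The following is a minimal sketch of that rule only; the names `ContentClass`, `classify` and `is_cacheable` and the two boolean inputs are hypothetical and introduced for illustration, and the sketch does not say how either input would be determined in practice.

```python
from enum import Enum


class ContentClass(Enum):
    STATIC = "static"
    PSEUDODYNAMIC = "pseudodynamic"
    DYNAMIC = "dynamic"


def classify(identified_as_static: bool, observed_unchanging: bool) -> ContentClass:
    """Map the two definitional criteria onto a content class.

    identified_as_static: the web server and/or associated metadata identifies
        the content as not expected to change (hypothetical input).
    observed_unchanging: based on observation, the content is not expected to
        change over a predetermined time (hypothetical input).
    """
    if identified_as_static:
        return ContentClass.STATIC
    # Everything below this point is non-static content.
    if observed_unchanging:
        return ContentClass.PSEUDODYNAMIC
    return ContentClass.DYNAMIC


def is_cacheable(content_class: ContentClass) -> bool:
    # Caching functionality 104 caches static and pseudodynamic content.
    return content_class in (ContentClass.STATIC, ContentClass.PSEUDODYNAMIC)
```

Under this reading, observation only matters for content that the server or metadata has not already identified as static.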
Reference is now made to
Turning to
In the present example, the requested web page is not cached and web server 100 serves the requested web page to the requesting internet access device 110. It is a particular feature of the present invention that the requested web page is also provided to content distinguishing functionality 102 (
In accordance with a preferred embodiment of the present invention, the content distinguishing functionality makes a determination based on earlier stored information as to whether the content is static, dynamic or pseudodynamic. Alternatively, the static content may be directly supplied to web caching proxy 112 and need not be supplied to the content distinguishing functionality. In accordance with a preferred embodiment of the present invention, the pseudodynamic content is stored in web caching proxy 112.
Turning to
In the present example, the requested web page is already cached and web caching proxy 112 serves the requested web page to the requesting internet access device 120.
Reference is now made to
As shown in
In a case where the requested full URL is not stored in the cache, or in a case where the content request includes a directive to avoid utilizing a cache, the request is routed to a web server (310) which hosts the requested full URL, and the response from the web server is preferably received, in parallel or sequentially, by both the client (312) and the CDF (314).
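The routing just described can be sketched as follows. This is an illustrative sketch under stated assumptions, not the described system: the cache is modeled as a plain dictionary keyed by full URL with expiration ignored, the directive to avoid utilizing a cache is assumed to be an HTTP `Cache-Control: no-cache` request header, and `fetch_from_origin` and `notify_cdf` are hypothetical callables standing in for the web server and the CDF.

```python
from typing import Callable, Dict, Mapping


def wants_cache_bypass(headers: Mapping[str, str]) -> bool:
    # Assumption: the client's directive to avoid the cache is expressed as
    # a Cache-Control: no-cache request header.
    for name, value in headers.items():
        if name.lower() == "cache-control" and "no-cache" in value.lower():
            return True
    return False


def handle_request(
    full_url: str,
    headers: Mapping[str, str],
    cache: Dict[str, bytes],
    fetch_from_origin: Callable[[str], bytes],
    notify_cdf: Callable[[str, Mapping[str, str], bytes], None],
) -> bytes:
    """Serve from the cache when possible, otherwise route to the web server."""
    if not wants_cache_bypass(headers):
        cached = cache.get(full_url)
        if cached is not None:
            return cached  # cache hit: the CDF is not involved
    # Cache miss or bypass: the web server hosting the full URL answers.
    response = fetch_from_origin(full_url)
    # The same response is also handed to the CDF (here sequentially).
    notify_cdf(full_url, headers, response)
    return response
```

In this sketch the CDF sees only responses that actually came from the web server, which matches the two cases named above: a cache miss and an explicit bypass.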
The CDF preferably includes a URL database which comprises a list of full URLs previously processed by the CDF. For each processed full URL the database preferably includes information pertaining to received responses to requests which comprised the full URL, and a caching state associated with the full URL.
For each response routed to the CDF from a web server, the CDF receives the full URL of the corresponding request and attributes of the client which made the request, such as for example, source IP, session ID, user agent and screen size. The CDF also receives or calculates a digest of the response and a timestamp of the response (316).
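As an illustration of the inputs listed above, the sketch below bundles the full URL, the client attributes, the digest and the timestamp into one record. The class and function names are hypothetical, and SHA-256 is an assumed choice of digest; the text does not name a particular digest function.

```python
import hashlib
import time
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class ClientAttributes:
    source_ip: str
    session_id: str
    user_agent: str
    screen_size: str


@dataclass(frozen=True)
class Observation:
    full_url: str
    attributes: ClientAttributes
    digest: str
    timestamp: float


def make_observation(
    full_url: str,
    body: bytes,
    attributes: ClientAttributes,
    digest: Optional[str] = None,
    timestamp: Optional[float] = None,
) -> Observation:
    """Build the record the CDF works with for one web server response."""
    # The CDF either receives the digest and timestamp or calculates them.
    if digest is None:
        # Assumption: SHA-256 over the response bytes serves as the digest.
        digest = hashlib.sha256(body).hexdigest()
    if timestamp is None:
        timestamp = time.time()
    return Observation(full_url, attributes, digest, timestamp)
```

Two responses with identical bytes yield identical digests regardless of which client requested them, which is what later digest comparisons rely on.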
The CDF then ascertains whether the full URL has been stored in the URL database (320). If the full URL has not been stored in the URL database, a new entry corresponding to the full URL is created in the URL database (322), the caching state of the full URL is set as “learning” (324), and the digest of the response, the timestamp of the response and the attributes of the client are stored in the new entry (326).
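The lookup-or-create step can be sketched as below. The URL database is modeled as an in-memory dictionary keyed by full URL, and `UrlEntry`, `lookup_or_create` and the `learning_started` field are hypothetical names introduced for illustration; a real URL database could be organized quite differently.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Tuple


@dataclass
class UrlEntry:
    state: str               # "learning", "pseudodynamic" or "dynamic"
    digest: str              # digest of the stored response
    timestamp: float         # timestamp of the stored response
    learning_started: float  # when the state was set as "learning"
    attributes: List[Dict[str, str]] = field(default_factory=list)


def lookup_or_create(
    url_db: Dict[str, UrlEntry],
    full_url: str,
    digest: str,
    timestamp: float,
    attrs: Dict[str, str],
) -> Tuple[UrlEntry, bool]:
    """Return (entry, created); a new entry starts in the "learning" state."""
    entry = url_db.get(full_url)
    if entry is not None:
        return entry, False  # known URL: the caller branches on entry.state
    entry = UrlEntry(
        state="learning",
        digest=digest,
        timestamp=timestamp,
        learning_started=timestamp,
        attributes=[attrs],
    )
    url_db[full_url] = entry
    return entry, True
```

Keying on the full URL means that two URLs differing only in their query string are learned independently.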
If the full URL has been stored in the URL database, the CDF then ascertains whether the caching state of the full URL is “learning” (328). If the caching state of the full URL is “learning”, the CDF then ascertains whether the digest of the response is identical to the stored digest (330).
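If the digest of the response is identical to the stored digest, the attributes of the client which made the request are added to the entry (332). The CDF then ascertains whether all of the following conditions exist: a predefined learning time has elapsed since the caching state of the full URL was initially set as “learning”; and a predefined sufficient variety of client attributes, for example a minimum number of distinct IP addresses in combination with a minimum number of distinct user agents, has been stored over a predefined minimum number of responses associated with the full URL.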
If the aforementioned conditions exist, the caching state of the full URL is set as “pseudodynamic” (338), and the response is stored in the cache with a predefined caching expiration time (340). The predefined caching expiration time is typically shorter than the learning time.
If the digest of the current response is not identical to the stored digest, the caching state of the full URL is set as “dynamic” (350), and the timestamp of the current response is saved in the entry (352).
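The handling of a response for a full URL in the “learning” state can be sketched as follows. The promotion conditions used are those stated earlier in this document: a predefined learning time having elapsed, and a sufficient variety of client attributes (a minimum number of distinct IP addresses combined with a minimum number of distinct user agents) over a minimum number of responses. All threshold values, the `UrlEntry` layout and the dictionary-based cache are assumptions made for illustration.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Tuple

# Assumed example values; the text only calls these "predefined".
LEARNING_TIME = 3600.0        # seconds
MIN_RESPONSES = 5
MIN_DISTINCT_IPS = 3
MIN_DISTINCT_USER_AGENTS = 2
CACHE_TTL = 600.0             # chosen shorter than LEARNING_TIME


@dataclass
class UrlEntry:
    state: str
    digest: str
    timestamp: float
    learning_started: float
    attributes: List[Dict[str, str]] = field(default_factory=list)


def on_learning_response(
    entry: UrlEntry,
    full_url: str,
    body: bytes,
    digest: str,
    now: float,
    attrs: Dict[str, str],
    cache: Dict[str, Tuple[bytes, float]],
) -> None:
    """Update an entry in the "learning" state with one more response."""
    if digest != entry.digest:
        # The bytes changed between responses: treat the URL as dynamic.
        entry.state = "dynamic"
        entry.timestamp = now
        return
    entry.attributes.append(attrs)
    enough_time = now - entry.learning_started >= LEARNING_TIME
    enough_responses = len(entry.attributes) >= MIN_RESPONSES
    distinct_ips = {a["source_ip"] for a in entry.attributes}
    distinct_agents = {a["user_agent"] for a in entry.attributes}
    enough_variety = (
        len(distinct_ips) >= MIN_DISTINCT_IPS
        and len(distinct_agents) >= MIN_DISTINCT_USER_AGENTS
    )
    if enough_time and enough_responses and enough_variety:
        entry.state = "pseudodynamic"
        # Cache value is (response body, absolute expiration time).
        cache[full_url] = (body, now + CACHE_TTL)
```

Requiring variety across clients guards against promoting content that merely looks stable because the same client fetched it repeatedly.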
If the state of the full URL is not “learning”, the CDF then ascertains whether the caching state of the full URL is “pseudodynamic” (360). If the state of the full URL is “pseudodynamic”, a response to the client was apparently not made available by the cache, for example, due to expiration of the predefined caching expiration time for this full URL, or due to the client's specific directive to avoid utilizing a cache. The CDF ascertains whether the digest of the response which corresponds to the full URL is identical to the stored digest (362). If the digest of the response is identical to the stored digest, the response is stored in the cache with a new predefined caching expiration time associated therewith (364). It is appreciated that the new predefined caching expiration time may be equal or not equal to the initial predefined caching expiration time. If the digest of the response which corresponds to the full URL is not identical to the stored digest, the caching state of the full URL is set as “learning” (366), and the digest of the response, the timestamp of the response and the attributes of the client are stored in the entry corresponding to the full URL (368).
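A minimal sketch of the “pseudodynamic” branch follows. The `UrlEntry` layout, the dictionary-based cache and the `NEW_CACHE_TTL` value are assumptions for illustration. The sketch also assumes that returning to “learning” replaces the previously stored digest, timestamp and attributes rather than appending to them, which the text does not state explicitly.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Tuple

NEW_CACHE_TTL = 600.0  # assumed value; may or may not equal the initial one


@dataclass
class UrlEntry:
    state: str
    digest: str
    timestamp: float
    learning_started: float
    attributes: List[Dict[str, str]] = field(default_factory=list)


def on_pseudodynamic_response(
    entry: UrlEntry,
    full_url: str,
    body: bytes,
    digest: str,
    now: float,
    attrs: Dict[str, str],
    cache: Dict[str, Tuple[bytes, float]],
    new_ttl: float = NEW_CACHE_TTL,
) -> None:
    """Handle a web server response for a URL whose state is "pseudodynamic"."""
    if digest == entry.digest:
        # Content unchanged: cache it again with a new expiration time.
        cache[full_url] = (body, now + new_ttl)
        return
    # Content changed: go back to "learning" with the new observation.
    entry.state = "learning"
    entry.digest = digest
    entry.timestamp = now
    entry.learning_started = now
    entry.attributes = [attrs]  # assumption: earlier attributes are discarded
```

The text does not say whether a stale cached copy is evicted when the digest changes, so this sketch leaves the cache untouched in that case.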
If the state of the full URL is not “pseudodynamic”, the CDF then ascertains whether the caching state of the full URL is “dynamic” (370). If the state of the full URL is “dynamic”, the CDF ascertains whether a predefined amount of refresh time has elapsed since the timestamp stored in the entry corresponding to the full URL (372). If so, the caching state of the full URL is set as “learning” (366), and the digest of the response, the timestamp of the response and the attributes of the client are stored in the entry (368).
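The refresh check for a “dynamic” full URL can be sketched as follows. `REFRESH_TIME` is an assumed example value and `UrlEntry` is a hypothetical record layout. The text does not describe any action when the refresh time has not yet elapsed, so the sketch leaves the entry unchanged in that case.

```python
from dataclasses import dataclass, field
from typing import Dict, List

REFRESH_TIME = 86400.0  # assumed example value, in seconds


@dataclass
class UrlEntry:
    state: str
    digest: str
    timestamp: float
    learning_started: float
    attributes: List[Dict[str, str]] = field(default_factory=list)


def on_dynamic_response(
    entry: UrlEntry,
    digest: str,
    now: float,
    attrs: Dict[str, str],
    refresh_time: float = REFRESH_TIME,
) -> bool:
    """Return True if the entry was moved back to the "learning" state."""
    if now - entry.timestamp < refresh_time:
        return False  # still within the refresh window: stay "dynamic"
    # Enough time has passed: give the URL another chance to be learned.
    entry.state = "learning"
    entry.digest = digest
    entry.timestamp = now
    entry.learning_started = now
    entry.attributes = [attrs]
    return True
```

The refresh time bounds how long a URL that changed once stays excluded from caching before it is observed again.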
It will be appreciated by persons skilled in the art that the present invention is not limited by what has been particularly shown and described hereinabove. Rather, the invention also includes various combinations and subcombinations of the features described hereinabove as well as modifications and variations thereof, which would occur to persons skilled in the art upon reading the foregoing and which are not in the prior art.
Number | Name | Date | Kind |
---|---|---|---|
6351767 | Batchelder et al. | Feb 2002 | B1 |
6408360 | Chamberlain et al. | Jun 2002 | B1 |
6453319 | Mattis et al. | Sep 2002 | B1 |
6584548 | Bourne et al. | Jun 2003 | B1 |
6757708 | Craig et al. | Jun 2004 | B1 |
6823374 | Kausik et al. | Nov 2004 | B2 |
6832222 | Zimowski | Dec 2004 | B1 |
6915307 | Mattis et al. | Jul 2005 | B1 |
6993591 | Klemm | Jan 2006 | B1 |
7020736 | Cherukuri | Mar 2006 | B1 |
7076500 | Gallant et al. | Jul 2006 | B2 |
7096418 | Singhal et al. | Aug 2006 | B1 |
7171443 | Tiemann et al. | Jan 2007 | B2 |
7320023 | Chintalapati et al. | Jan 2008 | B2 |
7330938 | Nenov et al. | Feb 2008 | B2 |
7343412 | Zimowski | Mar 2008 | B1 |
7460038 | Samuels et al. | Dec 2008 | B2 |
7532134 | Samuels et al. | May 2009 | B2 |
7584294 | Plamondon | Sep 2009 | B2 |
7619545 | Samuels et al. | Nov 2009 | B2 |
7681221 | Kondo | Mar 2010 | B2 |
7706266 | Plamondon | Apr 2010 | B2 |
7720936 | Plamondon | May 2010 | B2 |
7720954 | Raja et al. | May 2010 | B2 |
7760642 | Plamondon | Jul 2010 | B2 |
7774487 | Chaudhry | Aug 2010 | B2 |
7783757 | Plamondon | Aug 2010 | B2 |
7796510 | Plamondon | Sep 2010 | B2 |
7809818 | Plamondon | Oct 2010 | B2 |
7827237 | Plamondon | Nov 2010 | B2 |
7843912 | Harris et al. | Nov 2010 | B2 |
7844624 | Kinno | Nov 2010 | B2 |
7865585 | Samuels et al. | Jan 2011 | B2 |
7872597 | Samuels et al. | Jan 2011 | B2 |
7916047 | Samuels et al. | Mar 2011 | B2 |
7945698 | Bannoura et al. | May 2011 | B2 |
7962594 | Kasriel et al. | Jun 2011 | B2 |
7966414 | Cinghita et al. | Jun 2011 | B2 |
20020120710 | Chintalapati et al. | Aug 2002 | A1 |
20030004998 | Datta | Jan 2003 | A1 |
20030055792 | Kinoshita et al. | Mar 2003 | A1 |
20030152904 | Doty, Jr. | Aug 2003 | A1 |
20040044731 | Chen et al. | Mar 2004 | A1 |
20050240732 | Crick et al. | Oct 2005 | A1 |
20050278259 | Gunaseelan et al. | Dec 2005 | A1 |
20070005511 | Martinez | Jan 2007 | A1 |
20070192344 | Meier et al. | Aug 2007 | A1 |
20070206497 | Plamondon et al. | Sep 2007 | A1 |
20070206615 | Plamondon et al. | Sep 2007 | A1 |
20070206621 | Plamondon et al. | Sep 2007 | A1 |
20070214245 | Hamalainen | Sep 2007 | A1 |
20080034057 | Kumar et al. | Feb 2008 | A1 |
20080046371 | He et al. | Feb 2008 | A1 |
20080224906 | Plamondon | Sep 2008 | A1 |
20080228772 | Plamondon | Sep 2008 | A1 |
20080228864 | Plamondon | Sep 2008 | A1 |
20080228939 | Samuels et al. | Sep 2008 | A1 |
20080229017 | Plamondon | Sep 2008 | A1 |
20080229020 | Plamondon | Sep 2008 | A1 |
20080229024 | Plamondon | Sep 2008 | A1 |
20080229025 | Plamondon | Sep 2008 | A1 |
20080229137 | Samuels et al. | Sep 2008 | A1 |
20080288722 | Lecoq et al. | Nov 2008 | A1 |
20090049243 | Dubrovsky et al. | Feb 2009 | A1 |
20100199245 | Levy | Aug 2010 | A1 |
20100199345 | Nadir | Aug 2010 | A1 |
20100251347 | Roskind | Sep 2010 | A1 |
20110029641 | Fainberg et al. | Feb 2011 | A1 |
20110138012 | Tiemann et al. | Jun 2011 | A1 |
20120143770 | Pauker et al. | Jun 2012 | A1 |
Entry |
---|
Webopedia, “Message Digest”, Apr. 5, 2001, pp. 1-2, https://web.archive.org/web/20010405165043/http://webopedia.com/TERM/M/message_digest.html. |
Webopedia, “Byte”, Apr. 10, 2001, pp. 1-2, https://web.archive.org/web/20010410183831/http://www.webopedia.com/TERM/b/byte.html. |
Webopedia, “One-Way Hash Function”, Apr. 5, 2001, pp. 1-2, https://web.archive.org/web/20010405170900/http://webopedia.com/TERM/O/one-way_hash_function.html. |
Webopedia, “Hashing”, Apr. 11, 2001, pp. 1-2, https://web.archive.org/web/20010411005933/http://webopedia.com/term/H/hashing.html. |
Webopedia, “Message Digest”, Aug. 29, 1998, pp. 1-2, https://web.archive.org/web/20010405165043/http://webopedia.com/TERM/M/message_digest.html. |
An International Search Report and a Written Opinion both dated Sep. 4, 2012, which issued during the prosecution of Applicant's PCT/IL2012/000174. |
An International Search Report and a Written Opinion both dated Nov. 7, 2013, which issued during the prosecution of Applicant's PCT/IL2013/050528. |
An International Preliminary Report on Patentability dated Dec. 23, 2013, which issued during the prosecution of Applicant's PCT/IL2012/000174. |
Communication dated May 21, 2015 from the European Patent Office issued in corresponding European application No. 12803134.1. |
Number | Date | Country | |
---|---|---|---|
20120331228 A1 | Dec 2012 | US |