RSS, which stands for Really Simple Syndication, is one type of web content syndication format. RSS web feeds have become more and more popular on the web and numerous software applications with RSS support are being developed. Typically, when web pages and other applications want to consume RSS data, the web pages or applications use some type of script, such as Javascript, to parse through the RSS feed and provide the data. What makes this scenario particularly challenging is that RSS comes in a variety of versions and different formats, e.g. RSS 0.91, 0.92, 1.0, 2.0 and Atom. This makes developing script or code for RSS parsing non-trivial and error prone.
Various embodiments utilize a special object referred to as an rsshttp object to acquire an RSS feed, process the feed and expose an object model to a web page or application. The rsshttp object can parse through the feed's associated RSS data, normalize the feed data to a standard format, e.g. RSS 2.0, sanitize the feed data if necessary, and then present a standardized object model for interaction with web pages and applications. In at least some embodiments, the rsshttp object can be configured to work on an ad hoc basis, as by fetching and processing feeds when requested by the user, or on a scheduled basis in which feeds are fetched and processed on a scheduled basis. By using the object model, web pages and applications can access and meaningfully use associated feed data without having to understand the intricacies of the different feed formats.
Overview
Various embodiments utilize a special object to process RSS data. In this document, this special object is referred to as an rsshttp object and is configured to acquire an RSS feed, process the feed and expose an object model to a web page or application. It is to be appreciated and understood that while this object is referred to as an “rsshttp” object, such should not be construed to limit the object to applications only in connection with http. In one embodiment, the object can be implemented as a COM object.
The rsshttp object can parse through the feed's associated RSS data, normalize the feed data to a standard format, e.g. RSS 2.0, sanitize the feed data if necessary, and then present a standardized object model for interaction with web pages and applications. In at least some embodiments, the rsshttp object can be configured to work on an ad hoc basis, as by fetching and processing feeds when requested by the user, or on a scheduled basis in which feeds are fetched and processed on a scheduled basis. By using the object model, web pages and applications can access and meaningfully use associated feed data without having to understand the intricacies of the different feed formats, such as the XML that describes the feed data. Of course, web pages and applications that wish to interact with the RSS data outside of the object model can do so if they so choose.
Exemplary System Overview
In this example, the web browser 110 is configured to display one or more web pages 112 individual ones of which can support or contain Javascript, DHTML and the like. In addition, in at least some embodiments, browser 110 can comprise or otherwise make use of one or more rsshttp objects 114 and, optionally, one or more xmlhttp objects 116.
In this example, the rsshttp object is utilized to acquire RSS feeds, such as feeds that are required on an adhoc basis or feeds that are subscribed to by a user. In at least some embodiments, the rsshttp object can utilize xmlhttp object 116 as a means to acquire the feeds using known xml/http techniques.
When the rsshttp object acquires an RSS feed from, for example, servers 104, it can operate on the feed to perform a number of functions that permit applications and script to interact with the RSS data, without having to know anything about the underlying RSS format or the XML that describes the RSS data. More specifically, and as will be described in greater detail below, the rsshttp object processes the RSS feed to build an object model that is then exposed to applications and web pages.
As but one specific example of an rsshttp object, consider the section just below.
Exemplary RSSHTTP Object
In one embodiment, rsshttp object 202 includes functionality or components that include a feed retrieval component 204, an object model builder 206, an object model 208, a caching component 210 and an aggregation component 212.
In one embodiment, feed retrieval component 204 includes a feed subscription component 216 and an adhoc retrieval component 218. The feed retrieval component 204 is configured to enable feeds to be acquired and processed by the rsshttp object.
In this particular example, feed subscription component 216 enables a user to subscribe to a particular RSS feed. This can typically be done by a user, web site or application specifying an URL associated with the feed. Once the URL is specified, the subscription component 216 can subscribe to the feed and ensure that the most up-to-date information associated with that feed is available for consumption by the user. It can do this by regularly checking the RSS feed for new information on a scheduled basis. The schedule can be one that the rsshttp object sets, or one that is negotiated with an entity, such as a server, that provides the RSS feed.
In addition to feed subscription component 216, feed retrieval component 204 also includes, in this example, an adhoc retrieval component 218. In this embodiment, adhoc retrieval component 218 is operable to acquire RSS feeds on an adhoc basis. For example, if a user sees a feed that is of particular interest, they may click on an associated link at which time adhoc retrieval component 218 can take steps to acquire the feed specified by the user.
In one embodiment, object model builder 206 includes a parser component 220, a sanitizer component 222, a normalizer component 224 and a merger component 226.
In this particular example, parser component 220 is configured to parse the XML associated with RSS feeds that are acquired. Any suitable parsing component can be utilized, as will be appreciated by the skilled artisan. When the parser component operates on a feed, it parses through the XML elements identifying the particular elements that comprise the feed. Recall that RSS feeds can have many different formats. Accordingly, the parser is able to identify all of the different elements that comprise the feed. Note also that some of these elements may be elements that have been used to extend a feed's basic schema.
In this particular example, sanitizer component 222 is configured to sanitize the feed of any undesirable features or characteristics that the feed may have. For example, a feed may contain certain active or executable content that is undesirable to have. In this case, sanitizer component 222 sanitizes or removes the active or executable content.
Normalizer component 224 operates on the RSS data to normalize it to a standard or common format. In this example, the common or standard format is RSS 2.0. Accordingly, those other formats that vary from the RSS 2.0 format are operated upon to remove or map elements to the RSS 2.0 elements. As such, having the RSS data in a standardized format leads to predictability in handling and processing.
In this example, merger component 226 processes the RSS data and does such things as register and store new content in data store 214, along with other relevant state data. This can enable the object model to provide or fire events to entities that register for them. For example, an application may register for a notification when new content is received from a particular feed. In this instance, the merger component can look for any such new content so that object 202 can fire an event to the application.
The output of the object model builder 206, in at least some embodiments, is a normalized, sanitized object model 208 that can be exposed to application/web page 201. These entities can interact with the object model in lieu of the XML that defines the RSS feed thus alleviating the entities from having to understand the intricacies of the different RSS feeds. Of course, for those entities that wish to interact directly with the XML, they are free to do so. An exemplary object model is described below under the heading “Object Model”.
In one embodiment, caching component 210 is configured to perform caching duties that are designed to reduce the load that is experienced by servers that provide the RSS feeds. More specifically, the caching component can be configured to utilize conditional GET requests so that a request is not made unless it is necessary. For example, when requesting the feed data, the rsshttp object can s send a timestamp of the last time it received data for the feed to the server. The server can then respond with new data or quick respond with “no new data”. In addition, the caching component can be configured to ensure that requests are small so that the cache can be quickly updated.
In one embodiment, aggregation component 212 is configured to aggregate content. That is, typically RSS feeds provide only the most recent items. Yet, there are instances when having a complete set of items is desirable. Aggregation component 212 is configured to acquire these different and sometimes dated items and aggregate the items in data store 214 so that an application or web page can access all of the items.
Object Model
In this particular object model, a top level object feeds 302 is of the type feed. Underneath the feeds object 302 is an item object 304 of the type item, and underneath the item object 304 is an enclosure object 306 of the type object.
The individual objects of the object model have properties, methods and, in some instances, events that can be utilized to manage received web content. The above-described object model permits a hierarchical structure to be utilized to manage and interact with feeds without necessarily having to be knowledgeable of the underlying XML that describes the RSS feed.
Considering the object model further, consider item and enclosure objects 304, 306 respectively. Here, these objects very much reflect how RSS is structured itself. That is, each RSS feed has individual items inside of which can optionally appear an enclosure. Thus, the structure of the object model is configured to reflect the structure of the syndication format.
From an object model perspective, there are basically two different types of methods and properties on an item. A first type of method/property pertains to data which is read only, and a second type of method/property pertains to data which can be both read and written.
As an example of the first type of method property, consider the following. Each feed can have data associated with it that is represented in an XML structure. This data includes such things as the title, author, language and the like. Data such as this is treated by the object model as read only. This prevents applications from manipulating this data.
On the other hand, there is data that is treated as read/write data, such as the name of a particular feed. That is, the user may wish to personalize a particular feed for their particular user interface. In this case, the object model has properties that are read/write. For example, a user may wish to change the name of a feed from “New York Times” to “NYT”. In this situation, the name property may be readable and writable. The object model can also be extensible with “expando” properties which allow an application to add data/state to the feed dynamically. One example of this is storing foreign keys along side the rss items for easy matching of items with data in another database/store.
Starting first with the IFeed object, consider the following. Many of the properties associated with this object come from the RSS feed itself, e.g., Title, Url, Webmaster, SkipHours, SkipDays, ManagingEditor, Homepage, ImageURL and the like, as will be appreciated by the skilled artisan. In addition, there is another set of properties of interest, i.e. the Items property which is a collection that has all of the items that are part of a feed and the LocalEnclosurePath property which provides the actual directory to which all of the enclosures are written. Thus, for an application, the latter property makes it very easy to access the enclosures.
In addition, this object supports a small set of methods such as Download( ) which are used to manage particular feeds. Further, this object supports a method XML( ), which returns a feed's XML in a standard format. The XML data can be used for such things as creating a newspaper view of a feed.
Moving to the Item object, this object has a set of properties that represent regular RSS elements, e.g. Description, Url, Title, Author and the like. In addition, there is a Parent property that points back to the associated actual feed, and an Id property so that an application can identify items versus having to iterate over all items. In addition, there is an Enclosures property which is the collection of the item's enclosures of the type IEnclosure. Further, an IsRead property enables an application to indicate whether a particular item has been read.
Moving to the Enclosure object, consider the following. This object has properties that include a Type property (e.g. mp3) and Length property that describes the length of a particular enclosure for example in bytes. There is also the LocalAbsolutePath to a particular enclosure. The Download( ) method allows individual enclosures to be downloaded and used by applications.
By exposing the object model described above to applications and web pages, the applications and web pages can interact with the objects and hence data of the feed, without having to be knowledgeable of or deal with any of the underlying XML that describes the feed. Of course, applications and web pages that wish to deal with the underlying XML are still free to do so.
Exemplary Methods
In the illustrated methods, the flow diagrams are organized to illustrate which entities can perform the various acts. Accordingly, those acts that can be performed by an application or web page are designated as such. Similarly, those acts that can be performed by an rsshttp object or RSS source (server) are designated as such.
At step 500, an application or web page makes a request on the rsshttp object for an RSS feed. This request can be made via a call to a suitably exposed application program interface and can constitute any suitable type of request, such as an adhoc request, subscription request and the like.
The rsshttp object receives the request at 502 and makes a corresponding request, at 504, on an associated RSS source such as a server. The server receives the request at 506 and provides or sends RSS feed data to the rsshttp object at 508.
The rsshttp object then normalizes, sanitizes and merges the data into the store if it's not an ad-hoc feed.
The rsshttp object receives the RSS feed data at 510 and builds an object model at 512. Examples of how an object model as well as a specific instance of an object model are provided above. Once the object model is built, the rsshttp object exposes the object model to the application/web page at 514.
Once the object model is exposed, an application/web page can interact with and consume the RSS data. This can be done by making calls to various methods exposed by the object model. In this manner, an application or web page does not need to be concerned with the underlying XML that describes the feed. This is because the XML was abstracted away when the object model was built. In addition to the object model, in at least some embodiments, various error messages can be generated for the application or web page in the event of an error. Errors can include, for example, failed download, failed to normalize, failed to sanitize, invalid feed format and the like. Further, various statuses can be updated or saved in a suitable data store. For example, statuses associated with items such as “not updated”, “new”, “updated” and “removed” can be recorded.
At step 600, an application or web page makes a scheduling request on the rsshttp object. This request can be made via a call to a suitably exposed application program interface. In this example, an application or web page may wish to periodically receive feed updates. Thus, according to a schedule provided by the application or web page, or negotiated with the server, regular checks can be made. In at least some embodiments, the application or web page does not make the request every time. It simply sets up the request/schedule initially, then the rsshttp object will automatically, in the background, make the requests.
The rsshttp object thus receives the request at 602 and makes a corresponding request, at 604, on an associated RSS source such as a server, according to the schedule. The server receives the request at 606 and provides or sends RSS feed data to the rsshttp object at 608 if there is any data that meets the request.
The rsshttp object receives the RSS feed data at 610 and makes the RSS feed data available at 612. This step can be performed in a number of ways. For example, when new feed data is received responsive to the request, an event can be fired and the user can be notified. Notification can take place in any suitable way. For example, a user interface element in the user's browser may be activated to indicate that a new item has been received. Alternately or additionally, a notification can be sent to the user, such as an email or instant message, popup window or application/web page UI can update itself with the new item.
At step 700, an application or web page registers for an event with the rsshttp object. Any suitable event or type of event can be the subject of registration. For example, an application or web page may be interested in receiving notifications when new feed items are added or feed items are changed, deleted or read.
The rsshttp object receives the registration request at 702 and listens for the particular event at 704. This step can be implemented in any suitable way. For example, the rsshttp object may poll the RSS source or server at regular intervals to ascertain whether a particular event has occurred. Alternately or additionally, an RSS source or server may notify subscribers when certain events occur.
If an event occurs at step 706, the rsshttp object notifies the application or web page at step 708. Any suitable notification can be provided examples of which are described above. If an event does not occur, then step 706 returns to step 704 and listens for the event of interest.
Exemplary Implementation
In this example, a web page/user interface 802 allows a user to interact with the system. This interface can be provided by any suitable application. For example, in some embodiments, a browser interface can be used. System 800 also includes an rsshttp object 804, an optional xmlhttp object 806, an optional RSS platform component 808, a WinINET component 810 and a store database 812.
Here, rsshttp object 804 can use xmlhttp object 806 to access XML data in a manner which will be understood by the skilled artisan. Additionally, in this particular implementation, these objects can leverage an RSS platform 808 to acquire RSS feed data. An exemplary platform is described in U.S. patent application Ser. No. 11/158,398, filed on Jun. 21, 2005.
WinINET component 810 is utilized to make the network requests, as will be appreciated by the skilled artisan, and store database 812 is used to store all of the individual items and state information.
A typical operation utilizing this implementation example will occur as follows. In some instances, a piece of jscript code executing as part of an application will instantiate rsshttp object 804. The jscript code can then make a request on the rsshttp object using an URL associated with a particular RSS feed. The rsshttp object 804 can then make a request on the server. If xmlhttp object 806 and/or RSS platform are present, the rsshttp object can leverage these components to acquire the RSS feed data.
The server then gives back the RSS feed data to the rsshttp object 804. Now, the rsshttp object can do things such as sanitize the data, normalize the data, and merge the data in store database 812. This can include, by way of example and not limitation, updating state information associated with the RSS data. Once this is done, the rsshttp object can build an object model and expose the object model to the application or web page. In this example, store database 812 stores not only the state of individual feed items, but various subscription lists as well. This allows the rsshttp object to keep feed items fresh even when an application or web page 802 is not loaded.
For example, a web application can, per domain (url domain), subscribe to N number of feeds. This allows a web application to always have up-to-date data when it is launched.
Security
In at least some embodiments, the rsshttp object enforces a per domain RSS feed security model which means that a web page from a specific domain can only access a subset of the user's feed subscription for which it has received permission from the user. This makes it possible, for example, for a user to allow access to his family picture feed subscription to a new slideshow web page that displays images in a new and engaging way, while at the same time limit the access of this page to a feed of the user's recent credit card transactions.
Conclusion
Various embodiments utilize a special object referred to as an rsshttp object to acquire an RSS feed, process the feed and expose an object model to a web page or application. The rsshttp object can parse through the feed's associated RSS data, normalize the feed data to a standard format, e.g. RSS 2.0, sanitize the feed data if necessary, and then present a standardized object model for interaction with web pages and applications. In at least some embodiments, the rsshttp object can be configured to work on an ad hoc basis, as by fetching and processing feeds when requested by the user, or on a scheduled basis in which feeds are fetched and processed on a scheduled basis. By using the object model, web pages and applications can access and meaningfully use associated feed data without having to understand the intricacies of the different feed formats.
Although the invention has been described in language specific to structural features and/or methodological steps, it is to be understood that the invention defined in the appended claims is not necessarily limited to the specific features or steps described. Rather, the specific features and steps are disclosed as preferred forms of implementing the claimed invention.
Number | Name | Date | Kind |
---|---|---|---|
5819034 | Joseph et al. | Oct 1998 | A |
5926796 | Walker et al. | Jul 1999 | A |
5960411 | Hartman et al. | Sep 1999 | A |
6035336 | Lu et al. | Mar 2000 | A |
6057834 | Pickover | May 2000 | A |
6078924 | Ainsbury et al. | Jun 2000 | A |
6101510 | Stone et al. | Aug 2000 | A |
6708172 | Wong et al. | Mar 2004 | B1 |
6732102 | Khandekar et al. | May 2004 | B1 |
6874084 | Dobner et al. | Mar 2005 | B1 |
6917937 | Rubendall | Jul 2005 | B1 |
7092992 | Yu | Aug 2006 | B1 |
7103600 | Mullins | Sep 2006 | B2 |
7516399 | Hsu et al. | Apr 2009 | B2 |
7739602 | Feng et al. | Jun 2010 | B2 |
7865511 | Kahn et al. | Jan 2011 | B2 |
7979803 | Kim | Jul 2011 | B2 |
20010027472 | Guan | Oct 2001 | A1 |
20010050658 | Adams | Dec 2001 | A1 |
20010051907 | Kumar et al. | Dec 2001 | A1 |
20010056460 | Sahota et al. | Dec 2001 | A1 |
20020002571 | Manohar et al. | Jan 2002 | A1 |
20020019812 | Board et al. | Feb 2002 | A1 |
20020026462 | Shotton et al. | Feb 2002 | A1 |
20020035617 | Lynch et al. | Mar 2002 | A1 |
20020072951 | Lee et al. | Jun 2002 | A1 |
20020124074 | Levy et al. | Sep 2002 | A1 |
20020124172 | Manahan | Sep 2002 | A1 |
20020147746 | Lee | Oct 2002 | A1 |
20020156905 | Weissman | Oct 2002 | A1 |
20020184491 | Morgan et al. | Dec 2002 | A1 |
20030014406 | Faieta et al. | Jan 2003 | A1 |
20030028762 | Trilli et al. | Feb 2003 | A1 |
20030033179 | Katz et al. | Feb 2003 | A1 |
20030033606 | Puente et al. | Feb 2003 | A1 |
20030120671 | Kim et al. | Jun 2003 | A1 |
20030135553 | Pendakur | Jul 2003 | A1 |
20030149737 | Lambert et al. | Aug 2003 | A1 |
20030229716 | Holland | Dec 2003 | A1 |
20040024720 | Fairweather | Feb 2004 | A1 |
20040031052 | Wannamaker et al. | Feb 2004 | A1 |
20040073924 | Pendakur | Apr 2004 | A1 |
20040093296 | Phelan et al. | May 2004 | A1 |
20040117439 | Levett et al. | Jun 2004 | A1 |
20040225749 | Pavlik et al. | Nov 2004 | A1 |
20040230676 | Spivack et al. | Nov 2004 | A1 |
20040237120 | Lewin et al. | Nov 2004 | A1 |
20050027676 | Eichstaedt | Feb 2005 | A1 |
20050033657 | Herrington et al. | Feb 2005 | A1 |
20050039144 | Wada et al. | Feb 2005 | A1 |
20050091220 | Klemow | Apr 2005 | A1 |
20050108262 | Fawcett, Jr. et al. | May 2005 | A1 |
20050108633 | Sahota et al. | May 2005 | A1 |
20050119910 | Schneider | Jun 2005 | A1 |
20050165615 | Minar | Jul 2005 | A1 |
20050182645 | Ehlis et al. | Aug 2005 | A1 |
20050216439 | Kawakita | Sep 2005 | A1 |
20050216452 | Teague | Sep 2005 | A1 |
20050216837 | Washburn | Sep 2005 | A1 |
20050228887 | Wang et al. | Oct 2005 | A1 |
20050268100 | Gasparini et al. | Dec 2005 | A1 |
20050289147 | Kahn et al. | Dec 2005 | A1 |
20050289468 | Kahn et al. | Dec 2005 | A1 |
20060004703 | Spivack et al. | Jan 2006 | A1 |
20060041593 | Borthakur et al. | Feb 2006 | A1 |
20060053293 | Zager et al. | Mar 2006 | A1 |
20060095507 | Watson | May 2006 | A1 |
20060095860 | Wada et al. | May 2006 | A1 |
20060129917 | Volk et al. | Jun 2006 | A1 |
20060161845 | Kahn et al. | Jul 2006 | A1 |
20060173985 | Moore | Aug 2006 | A1 |
20060179060 | Shilo et al. | Aug 2006 | A1 |
20060184617 | Nicholas et al. | Aug 2006 | A1 |
20060200443 | Kahn et al. | Sep 2006 | A1 |
20060200740 | Kahn et al. | Sep 2006 | A1 |
20060206803 | Smith | Sep 2006 | A1 |
20060218403 | Sauve et al. | Sep 2006 | A1 |
20060230021 | Diab et al. | Oct 2006 | A1 |
20060230071 | Kass et al. | Oct 2006 | A1 |
20060242663 | Gogerty | Oct 2006 | A1 |
20060253459 | Kahn et al. | Nov 2006 | A1 |
20060253489 | Kahn et al. | Nov 2006 | A1 |
20060265518 | Owens et al. | Nov 2006 | A1 |
20060288011 | Gandhi et al. | Dec 2006 | A1 |
20060288329 | Gandhi | Dec 2006 | A1 |
20070011156 | Maron | Jan 2007 | A1 |
20070011665 | Gandhi et al. | Jan 2007 | A1 |
20070022174 | Issa | Jan 2007 | A1 |
20070033290 | Valen et al. | Feb 2007 | A1 |
20070050446 | Moore | Mar 2007 | A1 |
20070079321 | Ott | Apr 2007 | A1 |
20070100836 | Eichstaedt et al. | May 2007 | A1 |
20070101313 | Bodin et al. | May 2007 | A1 |
20070245251 | Kim | Oct 2007 | A1 |
20080195483 | Moora | Aug 2008 | A1 |
20090013266 | Gandhi | Jan 2009 | A1 |
20090019063 | Gandhi | Jan 2009 | A1 |
Number | Date | Country |
---|---|---|
1536483 | Oct 2004 | CN |
1124673 | Jul 2009 | CN |
1997023 | Dec 2008 | EP |
2008547117 | Dec 2008 | JP |
102003004225 | May 2003 | KR |
WO-0126018 | Apr 2001 | WO |
WO-0182139 | Nov 2001 | WO |
WO 2005089336 | Sep 2005 | WO |
WO-2007001882 | Jan 2007 | WO |
Number | Date | Country | |
---|---|---|---|
20070208759 A1 | Sep 2007 | US |