1. Field of the Invention
The present invention relates to methods and apparatus for automatically generating recommended links. More particularly, the present invention relates to the collection of data associated with web activities and the automatic generation of recommended links based upon the collected data.
2. Description of the Related Art
The Internet has recently become a popular information resource for even the most unsophisticated computer user. The popularity of the Internet as an information source is due, in part, to the vast amount of available information that can be downloaded by almost anyone having access to a computer and a connection. However, the enormous amount of information that is available on the Internet can make it difficult to locate specific information on a given topic.
While the amount of information accessible via the Internet is daunting, users often return to the same website on a regular basis. In order to reduce the time it takes to access a given website, a user may choose to create a readily-accessible link to the website by adding a “bookmark” to a bookmark list. A bookmark is a saved hyperlink to a website or web page. By adding a link to a website or web page to the user's bookmark list, the user may quickly and easily return to the website or web page via the saved link.
While a user always has the option to add a particular website or web page as a bookmark, most users generally do not have the time to regularly research additional websites other than those that they already frequently access. As a result, users tend to repeatedly use the same websites that they have previously visited. Since it is easier to return to websites that they have previously accessed, users often remain unaware of other similar or newly created websites that may be of interest to them. In view of the above, it would be beneficial to provide improved mechanisms for introducing websites or web pages that may be of interest to a user.
To achieve the foregoing and other objects of the invention, a variety of methods for providing recommended links to a user are described. In one aspect of the invention, a plurality of recommended links agents are executed. Each of the recommended links agents is adapted to identify a list of links that may be provided to the user as recommended links in an associated class of recommendations. The various recommended links agents may be executed at any appropriate time. For example, they may be run on a periodic basis (e.g., once an hour, once a day, once a week, etc.) or they may be executed on demand (e.g., when a particular host website is accessed, when a browser is opened, or upon request from a user). Once determined, the recommended links may be provided to a user in any suitable form or format. For example, the recommended links may be provided to the user via one (or more) of a web page accessed by the user, an e-mail message, as part of a list of bookmarks associated with the user, and/or as a feature of a toolbar, etc. In some embodiments, the recommended links are arranged in a plurality of different classes of recommendations.
A “recommended link” may take the form of any mechanism that is suitable for identifying (and preferably accessing) a specific recommended website or web page. By way of example, a recommended link may include, but is not limited to, a hypertext link, a URL representing a web page or website or any other mechanism.
In various independent aspects of the invention, recommended links agents may be arranged to recommend links based upon a wide variety of criteria and/or heuristics. In the following discussion, a variety of different agents are described. The agents may be used independently, or in conjunction with a system that obtains recommendations from multiple agents.
One type of recommended links agent is arranged to recommend links to websites that are believed to be similar to one or more websites that the user has previously visited (or is currently visiting). Such agents may operate using a variety of different heuristics. For example, in some implementations, the agent may be arranged to review the user's browsing history any identify websites that the user has previously visited. The agent then identifies other websites that are perceived to be similarly to the visited websites and presents a set of theses similar websites to the user as recommended links.
In another embodiment, a recommended links agent is arranged to recommend links to websites that the user has “frequently” visited within a specified period of time. In such a system the actual number of visits that a user would need to visit a site to be considered “frequent” may be widely varied and/or may be a function of the browsing habits of the user. For example, a “frequently” visited site for a heavy web user may require more visits than a “frequently” visited site for a light web user.
In accordance with another aspect of the invention, a list of recommended links that has been generated for use by one user may be provided to another user. Similarly, a list of bookmarks maintained by one user may be provided to another user as recommended links. This may be desirable, for instance, if a user wishes to provide access to his or her links to friends or relatives. Moreover, it may be desirable to provide a “link of the day,” which enables others to view a particular list of bookmarks of a particular user.
In accordance with another aspect of the invention, links that are recommended to the user may be filtered. For example, a link that has already been added to the user's list of bookmarks need not be recommended to the user and therefore may be filtered from a list of recommended links. In another example, a link that the user has previously declined to add to the user's list of bookmarks is not recommended to the user. In another example, an agent that is designed to recommend links referring to websites or web pages that are visited frequently by the user, those sites that are merely “link” sites or the user's home page may be identified and eliminated from the recommended list of links.
In accordance with yet another aspect of the invention, recommendations provided to a user may be time-segmented. For instance, web activities of the user (or a specific group of users) at a particular time may be used to generate a list of recommended links. As one example, the time specified may be morning, afternoon, evening, or late night. As another example, the time specified may be hourly, during the weekdays, during the weekend, during annual holidays, or during periodic sporting events such as the Olympics or the games of a particular baseball or soccer team. Moreover, a time period for which data is gathered (e.g., a period of weeks, months or years) will generally be used in order to retrieve the pertinent data, as well as limit the amount of data that is retrieved.
In accordance with yet another aspect of the invention, a user may wish to receive recommendations associated with a particular subject. In other words, the user may wish to receive recommendations that are content-based. For instance, a user may wish to receive recommendations related to news, movies, stocks, traffic, or sports. Similarly, a user may wish to receive notification of links referring to websites having (or not having) a particular adult content or rating. For instance, the user may be interested in websites that are not R-rated or X-rated.
In accordance with yet another aspect of the invention, web activities of individuals other than the user may be used to compile a list of recommended links. As one example, the web activities of a group of users with which the user identifies may be monitored in order to provide a suitable list of recommendations to the user.
As another example, a user may wish to be notified of bookmarks that the group of users or individuals in that group have selected. This group of users may, for example, be the user's family, the user's friends, co-workers, or those in club or association to which the user belongs.
In accordance with yet another aspect of the invention, web activities of similarly situated individuals (or bookmarks selected by those individuals) may be used to compile a list of recommended links for a particular user. Those who are similarly situated may, for example, be individuals within a particular geographic region, or those having a particular set of personal characteristics such as gender, age, employment status, race, etc., those with similar shopping behavior or similar browsing behaviors, and/or any of a wide variety of other similarities. A geographic region may include an entire state or city, or may simply be defined by a particular zip code or set of zip codes. Similarly, a group of similarly situated individuals may simply be individuals who access a number of the same Uniform Resource Locators (URLs), similar URLs or purchase some of the same products (or services).
In accordance with yet another aspect of the invention, those websites that are considered “movers and shakers” may be provided to the user as recommended links. A website may be considered a “mover and shaker,” for example, if it has gained popularity among a number of users. Similarly, a website may be considered a “mover and shaker” if it is accessed with a particular frequency during a particular period of time.
In accordance with yet another aspect of the invention, links that are bookmarked by the user or another group of individuals may be used to generate a list of recommended links. For instance, a user may be interested in bookmarks that have been created by family members or friends. These bookmarks may be bookmarks that have been selected from a list of recommended bookmarks, or they may be bookmarks that have been independently selected by the user.
As described above, there are a number of criteria that may be used to generate a list of recommended links. These criteria may be applied individually or in combination with one another. In accordance with one embodiment, the “intersection” of two different lists of recommended links may be identified in order to generate a recommended list of websites based upon multiple criteria. In accordance with another embodiment, each criterion may be used to generate a separate recommended link list. For instance, a recommended list of “morning links” and a recommended list of “evening links” may be generated for a single user.
In accordance with yet another aspect of the invention, each criterion or combination thereof may be selectable by a user. In accordance with one embodiment, each criterion may be used separately or in combination with other criteria to generate a list of recommended links via an agent implementing this criterion. Thus, the user may select the agent or agents that the user wishes to execute to generate his or her recommended list(s) of links. From a particular list of recommended links, the user may then select those links that are desired as bookmarks. These selected links may then be “transferred” to a list of bookmarks associated with the user and removed from the list of recommended links.
The embodiments of the invention may be implemented software, hardware, or a combination of hardware and software. The invention can also be embodied as computer readable code on a computer readable medium. In addition, data structures disclosed are also part of the invention.
The invention, together with further objects and advantages thereof, may best be understood by reference to the following description taken in conjunction with the accompanying drawings in which:
In the following description, numerous specific details are set forth in order to provide a thorough understanding of the present invention. It will be apparent, however, to one skilled in the art, that the present invention may be practiced without some or all of these specific details. In other instances, well known process steps have not been described in detail in order not to unnecessarily obscure the present invention.
One of the challenges facing Internet users is identifying content that might be of interest. There are a variety of ways that users may come across information of interest. One way that a user may learn of a website is through the personal recommendation of an acquaintance. Although recommended sites may be of particular interest to a user, in most situations, personal recommendations are made on an ad hoc basis and therefore somewhat limited in their usefulness. In one aspect, the present invention seeks to provide automated mechanisms for recommending sites that may be of interest to a user. Embodiments of the invention enable a set of websites, web pages, or resources (which may be represented by a URL, hyperlink, or other mapping technique) to be recommended to a user as recommended links.
There are a wide variety of categories and types of information that might be of interest to a user. Therefore, a wide variety of heuristics may be used to obtain the recommended links. For example, if a user has looked at several websites that can be categorized in a particular field or category of information, it may be useful to present the user with recommended links to other popular websites within that field. In another example, if a user regularly visits a particular website, it might be useful to provide direct links to those sites among the recommended links. Such recommendations might be time or context sensitive. For example, if a user regularly checks one set of specific sites on weekday mornings and another set of specific sites in the evenings, it might be most useful to include the sites that are normally checked in the morning among the recommended links when they are presented in the morning, but not in the evening.
In the illustrated embodiment, there are four classes of recommendations, each of which are represented by an associated folder. Each class of recommendations is associated with a particular “agent” that (as described in more detail below) is responsible for generating the associated recommendations. Of course, in other implementations, a wide variety of other classes of recommendations may be presented and/or any of the illustrated classes may be omitted. Alternatively, or in addition to the illustrated folders, some of the recommended links may be listed sequentially instead of hierarchically. Of course GUI widgets other than folders may be used to represent classes or groups of recommended links.
In the illustrated embodiment, the four classes of recommendations include: Related Websites 22; Related Categories 24; Frequently Visited Sites 26; and Movers & Shakers 28. Generally, the “Related Websites” Agent is arranged to analyze a user's browsing history to identify websites that are perceived to be related to websites the user has recently visited. This may be accomplished by tracking a history of websites that the user has visited and then identifying other sites that are believed to be “related” to the websites that have been visited. If a particular website is related to more than one of the sites that the user has recently visited, then it may be of interest to the user. The “Related Websites” Agent is arranged to analyze the sites that are related to sites that the user has visited and formulate recommended links based at least in part upon how many times a particular website is identified as being related to one of the sites that the user has previously visited.
As will be described in more detail below, there are currently a number of toolbars and other agents that are arranged to track an Internets user's browsing history. For example, some toolbars are arranged to transmit an identification of every page turn that a user makes while browsing the Internet to a browsing history database server. One such toolbar is the Alexa toolbar available from Alexa Internet Inc. (Alexa). There are also a number of services that seek to categorize websites and to identify related links. Some mechanisms for identifying related links are described in U.S. Pat. No. 6,691,163, entitled “Use of Web Usage Trail Data to Identify Related Links,” which is incorporated herein by reference in its entirety. One commercially available service that identifies related sites is provided by Alexa which categorizes related sites based upon the DMOZ.org categorization of websites.
The “Related Websites” agent is arranged to query a browse history database to identify each site that the user has visited during a designated time period (or other appropriate grouping). For each site the user visited, the “Related Websites” agent retrieves a set of “related” sites from a related sites database. The number of entries retrieved in the set of related sites may be widely varied based on the needs of a particular application. By way of example, in one specific implementation, a set of 10 related sites may be retrieved from the related sites database.
In accordance with one embodiment, each related site is scored by the “Related Websites” agent according to selected criteria. In the described embodiment, each related site first receives a “Relation-score.” A Relation-score may be obtained from Alexa. Second, the number of different sites visited by the user that refer to the same related site is ascertained. Third, the number of times the user has visited each visited site is ascertained from the history database. Then an appropriate scoring algorithm is used to rate the sites. For instance, a scoring algorithm that may be used by the Related Websites Agent is:
Score=sum(Relation-score*log2(1+visit-count)), where the sum is the sum over all visited sites that resulted in the recommended site, Relation-score is the relevancy score of the recommendation for the visited site returned by Alexa, and visit-count is the number of times that the user visited the site, and Score is the final score assigned to the recommended site. From these scores, the related websites may be ranked in order to provide a set of recommended related websites with the highest scores. Hyperlinks that link to the recommended websites are then created and referenced in the Related Websites folder 22.
The second class of recommendations illustrated in
In one particular implementation, each related category is scored by the “Related Categories” Agent according to three criteria. First, each related category receives a Relation-score. Again, such Relationship scores are available from Alexa. Second, the number of different sites visited by the user that are within the related category is ascertained. Third, the number of times the user has visited each visited site is ascertained from the history database. By way of example, one suitable scoring algorithm that may be used by the Related Categories Agent is:
Score=sum(Relation-score*log2(1+visit-count)), where the sum is the sum over all visited sites that were associated with the recommended category, the Relation-score is the relevancy score of the recommendation for the visited site returned by Alexa, visit-count is the number of times that the user visited the site, and Score is the final score assigned to the recommended category. From these scores, the related categories may be ranked in order to return a set of categories with the highest scores.
The third class of recommendations is associated with the “Frequently Visited Sites” folder 26. The “Frequently Visited Sites” folder 26 is arranged to provide a list of websites that the user has most frequently visited. Thus, the Frequently Visited Sites Agent is arranged to track the browsing history to identify the web pages or websites that the user has most frequently visited during a defined period of time or over a designated number of most recent visits (e.g., the 100 or 1000 most recent page turns or website visited). The most visited sites are presented as recommended links in the Frequently Visited Sites folder 26.
The fourth class of recommendations is associated with the “Movers and Shakers” folder 28. The Movers & Shakers folder 28 is arranged to provide a list of websites that are categorized as “movers and shakers.” Specifically, a website categorized as a “mover and shaker” is a website that is increasing (or decreasing) in popularity at a rapid rate. By way of example, one technique for generating a list of websites that are categorized as “Movers and Shakers” is disclosed in patent application Ser. No. 10/050,579, entitled “Web You Made,” filed on Jan. 5, 2002, which is incorporated herein by reference for all purposes. The techniques disclosed in the Web You Made application may be applied to the web as a whole in order to identify general websites that may be of interest to all users. Alternatively, the techniques may be applied to categories of websites related to the user's browsing history in order to identify interesting websites in categories that may be of particular interest to the user.
In the embodiment illustrated in
The described embodiments generally refer to recommended links as hyperlinks to recommended websites or web pages. However, these examples are merely illustrative. The list of recommended links may include URLs, hypertext links to accessible locations other than websites, and/or links created using any other link mapping or addressing technique. They may also reference categories of information (as in the Recommended Categories example) or groups of links, terms or other information that is believed to be of potential interest to the user. The order that recommendations are presented to the user may also be widely varied to accommodate any presentation scheme that is perceived to be of interest to the user.
The Recently Visited Domains folder 52 simply provides links to websites that have been most recently visited by the user. These recommended links may simply be a list of a specific number of links that were most recently visited (e.g. the 10 most recently visited websites) or a list of the links that were viewed over a designated time period (e.g., within the last 6 hours) or a combination of the two (e.g., the 10 most recently visited websites, so long as they were viewed in the last 48 hours). Of course the number of recently visited sites that are displayed and/or the designated time period from which the sites are defined as “recent” may be widely varied. In some implementations, the user may be given control over these variables.
The Movers & Shakers folder 28 and the Related Categories folder 24 operate the same as described above with respect to
The Most Visited Domains folder 58 presents links to the websites that the user has visited most frequently over the user's stored browsing history. These results may be provided by the Frequently Visited Sites Agent discussed above with respect to
The folders 54, 55 and 56, which are each labeled “Links,” present recommendations provided by the Related Websites Agent discussed above with respect to
In the embodiments illustrated in
In some embodiments, it may be desirable to allow the user to block certain recommended links. Blocking may be accomplished using a variety of different gestures. By way of example, the interface could be configured to block a particular link from appearing in the recommended links list by selecting (highlighting) a particular link and pressing the delete key. In other embodiments, a user may block a recommended link, by moving the recommended link from the recommended link list 106 to an icon or a container that contains a list of blocked links (not shown). In this manner, a user may permanently block or remove lo a particular link from appearing in the recommended link list 106. A user may later choose to modify the block list of links by deleting any entries from the block list of links through standard operations. A user may also wish to preemptively block a particular link that has not yet been recommended to the user by manually entering the link (or otherwise specifying that the link be added) into the list of blocked links.
Referring next to
Each Recommendation Agent 310 is arranged to generate a set of recommended links for a particular user based on a specific-set of heuristics. By way of example, to support the embodiment illustrated in
Workflow manager 308 coordinates the process by which recommendations are generated. In order to obtain the recommended links for a particular user, the workflow manager 308 may be arranged to call one or more of the agents 310 passing the information that the Agent needs to make its recommendations. Each Agent is arranged to generate a list of recommended links for each particular user. In some implementations, all of the users will be presented with the same classes of recommended links. In these cases, the workflow manager 308 may be configured to call all of the agents for every user. However, in other implementations, the classes of recommendations may be context sensitive and therefore selected by the system, or the user may be given control over the classes of recommendations that will be provided. In these embodiments, the workflow manager may only call selected agents 310.
The workflow manager 308 may be arranged to manage the order in which the agents are executed. In some instances, it may be desirable to control the order in which the Agents are executed in order to avoid duplicate processing. Specific ordering of the agents may also be desirable when an agent is dependent upon the processing or output of one or more other agents. This may be accomplished through the use of a tree or other suitable data structure to manage the execution order of multiple agents.
Once a list of recommended links has been generated by an agent or combination of agents, the list of recommended links is provided by the workflow manager 308 to the recommended links manager 316. The recommended links manager 316 stores the recommended links in a database 318. The recommended links manager 316 is also responsible for serving the recommended links at the appropriate time. In embodiments where the recommended links are served as part of a web page (as illustrated in
In the illustrated embodiment, the workflow manager 308 and recommended links manager 316 are illustrated as separate modules. However in alternative embodiments, the workflow manager 308 and recommended links manager 316 may also be implemented as a single unit.
The recommended links may be generated in real-time when the user accesses a central website or a particular feature of a website. Alternatively, these links may be generated in batch mode. For example, it may be desirable to generate a list of recommended links for various sets of users in different batches. Depending upon the criteria used to generate the list of recommended links, it may be desirable to generate or update a list of recommended links for some users every day, and generate or update a list of recommended links for other users every week.
Data may be obtained from one or more of the data sources directly by any of the agents 310a, 310b, . . . 310n using that data. Alternatively, the data may be obtained by the workflow manager 308 or the recommended links manager 316 to be transmitted to the appropriate agent(s). For instance, data that is used universally by multiple agents may be transmitted to those agents, while data specific to one or more agents may be retrieved directly by the agents using that specific data. In accordance with one embodiment, the recommended links manager 316 obtains the data from the history database 304(a) to be provided to each of the agents 310a, 310b, . . . 310n, while the appropriate agent or agents obtain recommended links directly from the recommendations database 306(b). The agents then process the data, as appropriate.
Agent processing may simply involve receiving recommended links or categories thereof from the recommendations database 304(b) and providing these to the user. For instance, the most popular websites (i.e., Movers and Shakers) may be obtained from the database without further processing. Alternatively, the processing may involve processing data obtained from the history database and/or the recommendations database prior to providing recommended links to the user. For instance, in order to identify the most visited domains (and to thereby generate a list of recommended links based on these domains), a list of website domains that a user has visited the most may be identified from the history database.
In order to generate a list of recommended links, most of the Agents will need to utilize information stored in one of the accessible databases. One database that has been frequently referenced in this application is a customer browsing history database 304(a), which stores data associated with the web activities of a number of users over time. One example of a system for generating and maintaining a history database 304(a) is disclosed in patent application Ser. No. 10/612,395, entitled “Server Architecture and Methods for Persistently Storing and Serving Event Data,” which is incorporated herein by reference.
In accordance with one embodiment, the data associated with the web activities of a user that is stored in the history database 304 is obtained via a toolbar that has been installed on the user's computer. By way of example, a toolbar capable of sending data back to a server, is disclosed in U.S. Pat. No. 6,282,548, entitled “Automatically Generate and Displaying Metadata as Supplemental Information Concurrently with the Web Page, There Being No Link Between Web Page and Metadata” and assigned to Alexa Internet, which is incorporated herein by reference. The data that is received from the toolbar may include, for example, a user identifier (e.g., account number of the user) and/or a toolbar identifier. associated with the toolbar, a URL being visited, and a timestamp. Activity may be tracked on a toolbar basis (in which case multiple people using the same computer could have their activity aggregated, or one user using two different computers could have two histories) or on a user basis (if the toolbar or a corresponding website supports log-on functionality to allow identification of a particular user using the computer). The data that is transmitted by the toolbar may be transmitted on a periodic basis or each time a website or web page is accessed via the toolbar.
While the toolbar is one way in which information associated with a user's web activity may be collected, it is important to note that other mechanisms for collecting data corresponding to a user's web activities are possible. For instance, data associated with a user's web activity may be captured via a server when a user accesses websites through the server.
The information that is stored in the history database 304 may be stored in a variety of formats. Exemplary tables that may be used to store history data associated with multiple users and multiple URLs will be described in further detail below with reference to
As set forth above, a recommendations database 304(b) may be accessed for use in generating a list of recommended or related links. An example of a system and method for generating a set of recommended links may be found in patent application Ser. No. 10/050,579, entitled “Web You Made,” filed on Jan. 5, 2002, which is incorporated herein by reference for all purposes. The Web You Made application discloses a technique for generating a list of recommended websites based at least in part upon a user's previously visited websites.
As suggested above, there are a wide variety of agents that may be used to generate the recommended links. A few have been discussed in some detail above, however any of a number of other specific agents could be provided to create recommended links that are perceived to be of interest to a user.
In one example, an Agent can be configured to recommend links that are related to web pages or websites stored in particular folders in a bookmark list. Much as a user may divide their bookmarks into one or more categories or separate them into one or more folders for organizational purposes, similarly one or more different recommended link lists may be presented to the user. Each folder or list of bookmarks presented in a bookmark list (as for example the bookmark list shown in
Some agents may be arranged to apply a geographic location limitation when generating a list of recommended links. For instance, in many instances a user may be particularly interested in businesses, events or organizations that are geographically close to the user. Therefore, an agent may use “geographical location” as one of the criteria when identifying related websites. It should be appreciated that the geographic location of a user may be ascertained from a number of sources. For example, the user's location may be available from registration information, billing or shipping information or the like. Alternatively, the user's general location can be automatically determined based on the IP address of the user (e.g., Akamai provides a mechanism for accurately mapping an IP address to a geographical location). Alternatively, the geographic region of interest to the user may be defined, for example, by entering a particular city, state, county, one or more zip codes, or a region have a radius of a specified number of miles around an identified center such as a particular landmark or address.
Another type of agent may restrict recommended links to those links that point to sites having particular content or are related to certain subject matter. For instance, links associated with a particular subject of interest to a user may be recommended to the user. The subject may, for example, be a category such as news, entertainment, movies, stocks, traffic, or sports. Alternatively, the subject may be defined by a rating (e.g., PG, R) of the content of the referenced site. Or, the subject may be a topic such as “bass fishing” that is very specific to the user. The subject of the websites that are being recommended may be identified by the top-level domain of the site or, alternatively, the content of the site based on keyword analysis or other prior categorization.
Still another type of agent may recommend links based on a status of the link. For instance, a website may achieve the status of a “mover and shaker” when it has reached a threshold level of popularity. The popularity of a website may be determined, for example, by the total number of hits received during a specified period of time or by the total number of unique users accessing the website during a specified period of time. Moreover, popularity may be ascertained by the number of times a particular user accesses the website during a specified period of time.
Often, a user (or a group of users) will typically visit certain particular websites or web pages at particular times of day or times of the year (e.g., morning, afternoon, evening, late night, weekday, weekend, hourly, at or around annual holidays, or during the time when specific sporting events are being held). Some agents may take the time of day or time of year into account when making recommendations. For example, some Agents may create separate lists of recommendations based on the time of day (e.g. one for the morning, one for the afternoon and one for the evening) or the time of year (e.g., a separate list during the Christmas holidays).
It is important to note that a user may be identified by one or more identifiers. For instance, a user may be identified by an IP address, user identifier (e.g., account number) and/or toolbar identifier. Since a user may have a toolbar installed on multiple, different computers, it may be desirable to uniquely identify each of these locations by a toolbar identifier. Thus, it is possible to track all web activity associated with the user via the user identifier (e.g., account number) associated with the user. Alternatively, it is possible to separately track web activity associated with a user at different locations via the corresponding toolbar identifier and/or IP address. In this manner, it is possible, for example, to separately track web activity associated with the user at work from a work computer and at home from a home computer. Accordingly, recommended links may be provided in accordance with the web activity of the user at these different locations.
Another agent may be arranged to recommend links based on an analysis of the browsing habits or bookmarks that are used by a group of users that a particular user is associated with. For instance, this group of users may be the user's family, a group of friends of the user, a group of friends of friends of the user, a company associated with the user, a club to which the user belongs, or an association to which the user belongs. For example, the user may define a list of friends, as well as other lists associated with different groups of users. A group-habit analyzing Agent may be arranged to recommend links based on an analysis of the websites or web pages that have been visited or bookmarked by others in the group. It may be desirable to provide only a subset of those websites that were either visited or bookmarked by at least one individual in the group. For instance, it may be desirable to establish a threshold number of visits or threshold visitation frequency by at least one individual in the group, a majority of the individuals in the group, or all individuals in the group. By way of example, one recommended links agent that provides recommendations based on the habits of a related group is described in Provisional Application No. 60/645,995 (A9XX-P001P), entitled: “Methods and Apparatus for Tracking Website Visitation Trends Among Discrete Sub-Populations” which is incorporated herein by reference.
Still other Agents may be configured to give the user control over some of the criteria that are used to generate the recommended links. This can give the user some ability to customize the nature of the recommendations provided. For example, when recommendations are based at least in part on historical browsing data, the user may be given the ability to edit the period of time over which the search history is analyzed and/or the total number of recent visits or page turns that are visited.
In other embodiments, certain agents may be configured to present a list of selectable criteria to the user. The list of selectable criteria may be generated by the entity performing the recommended link service, or may be customized by the user as described using the techniques below. From this list of criteria, the user may select those criteria to be applied to generate the list of recommended links. The user may select criteria, for example, by performing a drag-and-drop operation or by double-clicking on the criteria. Those criteria that are selected by the user are displayed in a list of selected criteria. In this example, the criteria that have been selected by the user include “singles dating sites,” “clubs in San Francisco,” “sites bookmarked in the last week by my family within 10 miles of me that relate to news sites,” and “sites bookmarked by my friends in the morning during weekdays that relate to traffic.”
If two or more criteria are added to the list of selected criteria, the user may be allowed to define how the criteria in the list are to be combined. For instance, a set of operators such as AND, OR, NOT, WITHIN, OUTSIDE, <, >, <>, /=, and = may be provided (not shown). The user may then select an appropriate operator to combine two or more different criteria for execution. In this manner, multiple criteria may be combined in a single statement for execution. In the absence of a user specifying one or more operators, a default operator (such as an “AND”) may be applied to the criteria in the list.
Once a list of recommended links and/or list(s) of bookmarks is generated, they can be shared or published for access by one or more users. Thus, a list of recommended links and/or list(s) of bookmarks may be presented to the user, as well as other individuals or groups of users. For instance, a user may wish to enable the recommended links or his or her list(s) of bookmarks (or portions thereof) to be viewable by friends or family. Moreover, users may be interested in viewing recommended links generated by the web activities of other similarly situated users or bookmarks that have been created by other similarly situated users. These similarly situated users may share a set of personal characteristics such as gender, age, employment status, race, etc or other characteristics such as geographic location. As another example, the user may have purchased one or more items, visited one or more URLs, or selected one or more bookmarks in common with at least one of the individuals. The set of characteristics may be pre-defined or may be selected by the user. In addition, a user's list of bookmarks may be published as the “list of the day.” Thus, one user's bookmarks may be presented as a list of recommended links to another individual, thereby enabling the individual to transfer any of these links to his or her list of bookmarks.
As described above, each of the agents includes one or more software modules that performs tasks in accordance with criteria that may be set by the user. The agents may be used separately or in combination with one another in order to generate a list of recommended links. These criteria may be selectable by a user, as well as configured with the desired values (e.g., distances, ages). In this manner, the user may control the quality of links that are presented to the user as recommended links.
It is important to note that the agents may be executed in batch mode. For instance, a set of agents may be executed for a single customer as set forth below with reference to
When an agent executes, it processes the pertinent data and reports a list of recommended sites to the recommended links manager at block 412. The recommended links manager receives the auto-generated recommendations from the agent at block 414. For each remaining agent at block 416, the workflow manager continues to initiate execution of the remaining agents at block 408.
When all agents have been executed for the customer, the auto-generated recommended links may be filtered for the customer by the recommended links manager and stored in the database for the customer's next visit at block 418. For instance, filtering may involve removing blocked bookmarks that are presented to the user as recommended bookmarks. One method of filtering auto-generated recommended links will be described in further detail below with reference to
If there are more customers for which agents are to be executed at block 422, the process repeats for the next customer at block 424 and the workflow manager continues to execute at block 404. When no customers remain, the process ends at block 426.
Upon completion of execution of an agent process, the workflow manager notifies the recommended links manager that the agent process has finished executing at block 440. The recommended links manager optionally filters auto-generated recommended links and stores the recommended links for the customer's next visit at block 442. One process of filtering recommended links will be described in further detail below with reference to
As described above with reference to block 450, the agent receives and processes the customer's data. This processing may simply involve receiving recommended links or categories thereof from another source and providing these to the user. Alternatively, the processing may involve processing data prior to providing recommendations to the user.
As described above, the history database may store data associated with multiple URLs and users. In accordance with one embodiment, the data is stored in URL tables and user tables. The URL tables support access to data using the URL as the primary key, while the user tables support access using a user identifier (e.g., account number, toolbar identifier and/or IP address) as the primary key.
The identity of a user may be established by at least one identifier. In accordance with one embodiment, a user may have an IP address and/or toolbar identifier. Moreover, a single user may have a different toolbar identifier for each computer on which a toolbar is installed. This is particularly desirable since a user may search for different websites at a home computer than at a work computer. As a result, it is possible to track activities of users at different locations or times of the day.
In order to summarize data for efficient retrieval, URL summary tables may be built or updated. For instance, each URL summary table may include data “summarized” over a particular time period. An exemplary URL summary table will be described in further detail below with reference to
In a URL summary table, data for a particular URL may be summarized over various time periods, such as per minute, hour, day, month or year. As described above, the URL 604 may be identified in the entry in a URL summary table. In the URL hourly summary table 614, data is summarized for each hour. For instance, each entry in the table may represent a different hour. In other words, the number of hits 606, number of unique users 608, and one or more user identifiers 610 (e.g., toolbar numbers, user identifiers and/or IP addresses) identifying the users who have accessed the URL together represents the web activity of users who accessed the URL during a specified hour long time period 612. Similarly, URL summary tables may be updated and maintained with summary data over periods of one or more days 616, or one or more months (or years) 618.
Data associated with a particular URL may be obtained from the appropriate URL or URL summary tables. Specifically, the URL may be used as a primary key. However, it may also be desirable to obtain data for a particular user (e.g., user identifier, toolbar identifier and/or IP address). Exemplary user and user summary tables will be described in further detail below with reference to
Similarly, user summary tables may be built or updated. For instance, each user summary table may include data “summarized” over a particular time period. An exemplary user summary table will be described in further detail below with reference to
In a user summary table, data for a particular user may be summarized over various time periods, such as per minute, hour, day, month or year. In the user hourly summary table 710, data is summarized for each hour. For instance, each entry in the table may represent a different hour. Similarly, in the user daily summary table 712, data is summarized for each day, while data is summarized for each month (or year) in the monthly (or annual) summary table 714. In this manner, data may be summarized for each user. Alternatively, in accordance with another embodiment, a user summary table associated with the particular time period (e.g., hour) may include all entries storing data representative of web activity occurring during that time period. In other words, the data may merely be re-organized rather than “summarized.”
Data in the user summary tables may be updated and maintained with summary data over periods of one or more hours 710, one or more days 712, or one or more months (or years) 714, for example. In each summary table, each entry summarizes the activity of a particular user over the specified time period. For instance, a single entry may identify a toolbar identifier, user identifier and/or IP address 716, a URL list 718 of one or more URLs accessed during the specified time period, and the applicable time period (or timestamp) 720. In this manner, the activity of a particular user over a specified time period may be easily accessed.
Data associated with a particular user may be obtained from the appropriate user or user summary tables. Specifically, the toolbar identifier (or user identifier) may be used as a primary key.
As described above, one or more identifiers (e.g., toolbar identifier, user identifier and/or IP address) may be used to identify a particular user or toolbar. Such an identifier may also be further linked to information associated with the user.
Information associated with the user may be obtained via the website during a registration process. For instance, personal information is generally collected when a new account is established. During the registration process, the website provider may obtain various consumer data such as socio-economic data and address information identifying a geographic region (e.g., zip code) within which the consumer lives or works. In addition, the consumer may enter a title, first name, last name, an electronic mail address, a password, and address information including a specific address and/or city, state and zip code. In addition, socioeconomic data including gender, race, occupation, salary, and education level may be obtained. In addition, at the time of registration, a user identifier (e.g., account number) may be assigned.
A geographical location 744 may also be specified by the user or ascertained from information such as the user's IP address (e.g., where a billing address or shipping address is not specified for the user), as described above.
Other information stored in a user record may include a link to the purchase history 746 of the user. In addition, the gender and/or race 748 may be specified by the user. Alternatively, the gender may be inferred from the name or title of the user. In addition, other information such as the user's age 750, employer (not shown), e-mail provider (not shown), school (not shown), and birthplace (not shown) may also be stored in the user record 730.
Various embodiments of the invention can also be embodied as computer readable code on a computer readable medium. The computer readable medium is any data storage device that can store data, which can thereafter be read by a computer system. Examples of the computer readable medium include read-only memory, random-access memory, CD-ROMs, magnetic tape, and optical data storage devices.
Although illustrative embodiments and applications of this invention are shown and described herein, many variations and modifications are possible which remain within the concept, scope, and spirit of the invention, and these variations would become clear to those of ordinary skill in the art after perusal of this application. For instance, embodiments of the present invention support the generation of lists of recommended links based upon data that satisfies specific criteria. Various exemplary criteria are set forth, which may be used individually or in combination with one another. However, it should be understood that the disclosed criteria are merely illustrative, and therefore the disclosed embodiments may be implemented with data retrieved and/or analyzed based upon other criteria, or combinations thereof. For instance, although some of the described embodiments refer to recommended links (e.g., URLs), a recommendation list of may instead reference one or more categories of recommended links. In such an arrangement, each category may include any number of recommended links. Accordingly, the present embodiments are to be considered as illustrative and not restrictive, and the invention is not to be limited to the details given herein, but may be modified within the scope and equivalents of the appended claims.
This application claims the priority of Provisional Application No. 60/610,161, filed Sep. 14th, 2004, which is incorporated herein by reference in its entirety.
Number | Date | Country | |
---|---|---|---|
60610161 | Sep 2004 | US |