The present application shares some common subject matter with co-pending and commonly assigned U.S. patent application Ser. No. 12/614,260, entitled “Determining At Least One Category Path for Identifying Input Text”, filed on even date herewith, the disclosure of which is hereby incorporated by reference in its entirety.
A user's web browsing history is a rich data source representing a user's implicit and explicit interests and intentions, and of completed, recurring, and ongoing tasks of varying complexity and abstraction, and is thus a valuable resource. As the web continues to become ever more essential and the key tool for information seeking and retrieval, various web browsing mechanisms that organize a user's web browsing history have been introduced. These web browsing mechanisms range from mechanisms that organize a user's web browsing history using a simple chronological list to mechanisms that organize a user's web browsing history through visitation features, such as, uniform resource locator (URL) domain and visit count.
Features of the present invention will become apparent to those skilled in the art from the following description with reference to the figures, in which:
For simplicity and illustrative purposes, the present invention is described by referring mainly to an example embodiment thereof. In the following description, numerous specific details are set forth in order to provide a thorough understanding of the present invention. It will be apparent however, to one of ordinary skill in the art, that the present invention may be practiced without limitation to these specific details. In other instances, well known methods and structures have not been described in detail so as not to unnecessarily obscure the present invention.
Disclosed herein are a method and apparatus for visually representing a hierarchy of predefined category nodes that identify one or more concepts. Also disclosed herein is a computer readable storage medium that contains a set of instructions for performing the method.
Through implementation of the method and apparatus discussed herein, the categories and concepts associated with a document access history may be visually represented, with the respective relevancies of the categories and concepts to the accessed documents. As such, “topics” rather than the actual documents that are included in the document access history may be revisited. In one regard, a user may have a richer interaction with the user's document access history as compared with conventional history tracking software.
With reference first to
The system 100 comprises a computing device, such as, a personal computer, a laptop computer, a tablet computer, a personal digital assistant, a cellular telephone, etc., configured with a visualization apparatus 101, a processor 130, an input source 140, a message store 150, and an output interface 160. The processor 130, which may comprise a microprocessor, a micro-controller, an application specific integrated circuit (ASIC), and the like, is configured to perform various processing functions. One of the processing functions includes invoking or implementing the visualization apparatus 101, and more particularly, the modules of the visualization apparatus 101, as discussed in greater detail herein below.
As shown in
According to an example, the visualization apparatus 101 comprises a hardware device, such as, a circuit or multiple circuits arranged on a board. In this example, the modules 102, 116, and 120 and the sub-modules contained therein, comprise circuit components or individual circuits. According to another example, the visualization apparatus 101 comprises software stored, for instance, in a volatile or non-volatile memory, such as dynamic random access memory (DRAM), electrically erasable programmable read-only memory (EEPROM), magnetoresistive random access memory (MRAM), flash memory, floppy disk, a compact disc read only memory (CD-ROM), a digital video disc read only memory (DVD-ROM) or other optical or magnetic media, and the like. In this example, the modules 102, 116, and 120 and the sub-modules contained therein, comprise software modules stored in the memory. According to a further example, the visualization apparatus 101 comprises a combination of hardware and software modules.
The visualization apparatus 101 may comprise a plug-in to a browser application, such as, a web browser, which allows access to webpages over an extranet, such as, the Internet or a file browser, which enables the user to browse through files stored locally on the user's visualization system 100 or through files stored externally, for instance, on a shared server. In addition, or alternatively, the visualization apparatus 101 may comprise a plug-in to a messaging application, which comprises any reasonably suitable application that enables communication over a network, such as, an intranet, the Internet, etc., through the system 100, for instance, an e-mail application, a chat messaging application, a text messaging application, etc. As a yet further example, the visualization apparatus 101 may comprise a standalone apparatus configured to interact with a messaging application, a browser application, or another type of application.
Generally speaking, the category path determining apparatus 102 is configured to receive as input, input text from a document, which may comprise a scanned document, a webpage (i.e., a hyper text markup language formatted, an extensible markup language (XML) formatted document, etc., document), a magazine article, an email message, a text message, a newspaper article, a handwritten note, an entry in a database, etc., and to automatically determine a category path that identifies the input text through use of machine-readable labels. In cases where the input text has already been labeled with a category path, then the category path determining apparatus 102 may simply look up and use that category path. For example, a category path may be encoded within a metadata tag within a web page. In cases where the input text is not already labeled, the category path determining apparatus 102 is configured to access and employ a labeled text data source in determining suitable categories for the input text and in determining the one or more category paths through a hierarchy of category nodes.
The labeled text data source may comprise a third-party database of articles, such as, Wikipedia™, Freebase™, IMDB™, and the like. The articles contained in the labeled text data sources are often assigned to one or more categories and sub-categories associated with the particular labeled text data sources. For instance, in the Wikipedia™ database, each of the articles is assigned a particular concept and there is a single article for each concept. In addition, the concepts are assigned to particular categories and sub-categories defined by the editors of the Wikipedia™ database. As discussed in greater detail herein below, the concepts and categories used in a labeled text data source, such as the Wikipedia™ database, may be leveraged in generating the one or more category paths to identify an input text.
A user may interface with the visualization apparatus 101 through the input source 140, which may comprise an interface device, such as, a keyboard, mouse, or other input device, to input the input text into the category path determining apparatus 102 or to direct the category path determining apparatus 102 to one or more documents of interest. A user may also use the input source 140 to instruct the category path determining module 102 to generate the at least one category path to identify a desired input text, which may include an entire document, to which the category path determining module 102 has access. In addition, a user may also use the input source 140 to navigate through one or more category paths determined for the input text. Furthermore, a user may use the input source 140 to select various elements, such as, category nodes, concepts, etc., to navigate through a displayed set of category paths and concepts, as discussed in greater detail herein below.
According to an embodiment, the category path determining apparatus 102 is configured to determine the one or more category paths for the selected input text through a hierarchy of category nodes from a weighted list of concept paths retrieved from one or more of the labeled text data sources. In this embodiment, the category path determining apparatus 102 may operate to communicate with the one or more labeled text data sources to retrieve the weighted list of category paths from the one or more labeled text data sources.
According to another embodiment, the category path determining apparatus 102 is configured to determine the one or more category paths for the selected input text by processing the input text and information contained in the labeled text data sources. A more detailed discussion of an example of this embodiment is discussed in greater detail in U.S. patent application Ser. No. 12/614,260. In addition,
As also discussed in that application for patent, some or all of the predefined category hierarchy may be manually defined. The category levels that are not manually defined may be computed from categorical information contained in the labeled text data source. Thus, for instance, a user may define a root node and one or more child nodes and may rely on the category levels contained in the labeled text data source for the remaining child nodes in the hierarchy of predefined category levels. According to a particular embodiment, a user may define the hierarchy of predefined category levels as a tree structure and may map the categories of the labeled text data source into the tree structure. According to another embodiment, the pre-processing module 104 may be configured to automatically map concepts from the labeled text data source into the hierarchy of predefined category levels. According to an additional embodiment, the relevance of each concept to each category may be recorded as the probability that another article that mentions that concept would appear in that category. According to yet another embodiment, categories may further be labeled as being useful for disambiguating concepts (see below) or as useful for display to an end user.
The category path determining apparatus 102 may output the determined at least one category path for identifying the input text through the output interface 160. The output interface 160 may provide an interface between the category path determining apparatus 102 and another component of the system 100, such as, the data store 150, upon which the determined at least one category path may be stored. In addition, or alternatively, the output interface 160 may provide an interface between the category path determining apparatus 102 and an external device, such as a display, a network connection, etc., such that the at least one category path may be communicated externally to the category path determining apparatus 102.
Various manners in which the modules 104-116 of the category path determining module 102 may operate in generating the category path of an input text to enable the input text to be identified by a computing device is further discussed in the U.S. patent application Ser. No. 12/614,260.
As further shown in
The modules 122-126 are further configured to construct and modify a graph that contains the hierarchy of category nodes and concepts. The modules 122-126 are configured to construct the graph to include the one or more determined visual characteristics. In addition, the output module 116 is configured to output the graph through the output interface 160, which may interface with a display on which the graph is displayed.
Various manners in which the modules 102 and 120 of the visualization apparatus 101 may operate in visually representing a hierarchy of category nodes that identify one or more selected concepts is discussed with respect to the method 200 depicted in
At step 202, accesses to documents are tracked, for instance, by the visualization apparatus 101. Thus, for instance, the visualization apparatus 101 may track the webpages that a user accesses, the emails that are sent to and/or received by the user, word processing documents that are accessed by the user, etc. The documents may also comprise text messages, news articles and publications, etc.
At step 204, concepts to which the documents are determined to be relevant are determined, for instance, by the visualization apparatus 101. As discussed above, each of the concepts may correspond to a machine-readable label assigned to an article in a labeled text data source, such as Wikipedia™, Freebase™, IMDB™, and the like. According to an embodiment, the concept determining module 108 is configured to determine the concepts for the accessed documents as discussed in greater detail herein below with respect to step 308 in
At step 206, one or more category paths are determined through a hierarchy of category nodes for one or more concepts, for instance, by the category path determining module 102. Various manners in which the one or more category paths may be generated are discussed in greater detail herein below with respect to step 310 in the method 300.
At step 302, the labeled text data source is pre-processed, for instance, by the pre-processing module 104. By way of a particular example, the pre-processing module 104 is configured to analyze the labeled text data source corpus, finding categories for each concept by mapping the labeled text data source categories into a category graph (such as, a manually constructed category tree, finding phrases related to each category by using the text of articles assigned to concepts in each category, finding phrases related to each concept by using the text anchor tags which point to that concept, and evaluating counts of occurrences to determine the probability that an occurrence of a particular phrase indicates the text is relevant to a particular category or a particular concept. In this way, the pre-processing module 104 creates dictionaries of probabilities that map concepts to categories, map anchor tags to categories, and map anchor tags to concepts. As discussed below, these dictionaries are used by the category determining module 106, the concept determining module 108, and the category path determining module 110.
At step 304, an input text is determined, for instance, by the category path determining apparatus 102. The category path determining apparatus 102 may determine the input text, for instance, through receipt of instructions from a user to initiate the method 220 on specified input text, which may include part of or an entire document. The category path determining apparatus 102 may also automatically determine the input text, for instance, as part of an algorithm configured to be executed as a user is browsing through one or more documents, or as part of an algorithm to send or receive textual content.
At step 306, one or more categories are determined from the category hierarchy that are most relevant to the input text, for instance, by the category determining module 106. The category determining module 106 may compare the input text with the text contained in a plurality of articles in the labeled text data source to determine which of the plurality of categories is most relevant to the input text. According to a particular example, category determining module 106 is configured to make this determination by looking up phrases from the input text in the dictionaries constructed by the pre-processing module 104 and then computing a probability for each category using the probabilities for each category given the presence of each matching phrase.
At step 308, one or more concepts are determined from the labeled text data source that are most relevant to the input text using the input text, information from the labeled data source, and the categories determined at step 306, for instance, by the concept determining module 108. The concept determining module 108 may compare the input text with the text contained in a plurality of articles in the labeled text data source to determine which of the plurality of concepts may plausibly be relevant to the input text. According to a particular example, the concept determining module 108 makes this determination by searching for phrases from the input text in the dictionaries constructed by the pre-processing module 104 and then computing a probability for each concept using the probabilities for each concept given the presence of each matching phrase and the category probabilities computed at step 226.
At step 310 category paths through the hierarchy of predefined category levels for the one or more plausible categories are determining for the input text determined at step 226 which terminate at any of the plausible concepts for the input text determined at step 228, for instance, by the category path determining module 112.
At step 312, a determination as to which of the plausible category paths are most relevant to the input text is made, for instance by the category path relevance determining module 114. According to an embodiment, the category path relevance determining module 114 computes metrics for each of the plurality of plausible category paths, in which the metrics are designed to identify a relevance level for each of the category paths with respect to the input text. For instance, the category path relevance determining module 114 weights each of the categories in the plausible category paths based upon the relevance of each of those categories to the input text. In one embodiment, relevance is measured by using the probabilities computed for each category by the category determining module 106, the probabilities for each concept computed by the concept determining module 108, and the prior probabilities computed by the pre-processing module 104.
At step 314, at least one category path for the one or more concepts determined to be the most relevant to the input text is generated, for instance, by the category path generating module 114. According to an example, the category path generating module 114 may generate a plurality of category paths through different categories to define the input text. In addition, the category path determining apparatus 102 may output the at least one category path generated for the input text through the output interface 160, as discussed above.
With reference back to
By way of a particular example, the relevance determining module 122 is configured to employ a scoring system for each of the category nodes, in which the scores assigned to the category nodes are a measure of their respective relevance levels with respect to the one or more concepts. In this example, the relevance determining module 122 is configured to set each of the category nodes to an initial predetermined score prior to assigning scores to each of the category nodes. For instance, the relevance determining module 122 may set each of the category nodes to have an initial value of 0. In addition, the relevance determining module 122 may increase the score of each of the category nodes based on the weighting assigned to each category path passing through that node. Thus, for instance, the category nodes contained in category paths having relatively higher weighting are assigned higher scores and are thus considered to have a higher relevance level to the one or more determined concepts then the category nodes having lower scores.
At step 210, the graph constructing module 124 constructs a graph that depicts a visual representation of the relevance levels determined for each of the category nodes in the hierarchy of category nodes. The relevance levels for the category nodes with respect to the one or more determined concepts may be visually represented through use of one or more graphical characteristics. By way of example, the graphical characteristics comprise one or more of font sizes, font colors, borders, bold-facing, underlining, different colors, shading, physical location, transparency, etc. The graphical characteristics may also include one or more of symbols, graphs, charts, etc., that may be placed adjacent to or in place of the text of the category nodes.
An example of a graph 400 constructed by the graph constructing module 124 is depicted in
With reference first to
As such, the “Culture” category node may have a higher relevance level as compared to the other category nodes contained in a particular hierarchy level. Thus, for instance, the category paths generated for a large number of (and/or the most recently accessed ones of) the accessed documents may include the “Culture” category node, which may have caused the relevance level of that category node be determined to be relatively high.
The tag cloud 420 is depicted as including a plurality of concepts determined at step 204 and the concepts in the tag cloud 420 are depicted as having one or more of various graphical characteristics, which may include, for instance, font sizes, font types, shading characteristics, placements in the tag cloud 420, colors, underlining, bolding, italicizing, etc. The various graphical characteristics of the concepts generally signify the relevance of the particular concepts to the selected category node. The relevance may be based upon one or more factors including the number of documents that the user has accessed corresponding to the concept, the time in which the user accessed a document corresponding to the concept, etc. The inclusion of and the various graphical characteristics of the concepts may thus vary depending upon the category node selected.
Also depicted in the graph 400 is a list of articles 430 that correspond to the concepts listed in the tag cloud 420. According to an example, the articles may be arranged in the articles list 430 based upon the chronological order in which the documents corresponding to the concepts contained in the tag cloud 420 have been accessed. Thus, the most recently visited webpages, for instance, may appear at the highest level in the articles list 430. In addition, the articles list 430 may include additional articles not shown at a particular time and the articles list 430 may be scrolled to find the additional articles.
Moreover, the list of articles 430 may contain a list of articles that the user has not previously visited, but may be considered to be relevant to the concepts contained in the tag cloud 420. For instance, articles relevant to the concepts contained in the tag cloud 420 that have recently been published may be identified, for instance, through key word matching, and displayed in the articles list 430.
At step 212, the visualization module 120 may output a constructed graph through the output module 116, to thereby cause the constructed graph to be displayed. In addition, at step 214, the visualization apparatus 101 tracks accesses with respect to at least one of the graph and documents. More particularly, the visualization apparatus 101 tracks when the one of the category nodes or lower-level category nodes has been selected. In addition or alternatively, the visualization apparatus 101 tracks when the a concept in the tag cloud 420 or an article listed in the articles list 430 have been selected. Moreover, the visualization apparatus 101 tracks when additional documents have been selected.
If the accesses include a selection of a category node, a concept, or an article, the visualization apparatus 101 modifies at least one of the category nodes and the tag cloud displayed in the graph displayed at step 212. Thus, for instance, if the user selects a category node, the visualization apparatus 101 may vary a graphical characteristic of the selected category node to denote that that category node has been selected.
However, if additional documents have been accessed, the visualization apparatus 101 is configured to repeat steps 204 to 212 based upon the additional documents. As such, for instance, the graph 400 may be continuously updated to include additionally accessed documents.
An example of an instance where the visualization apparatus 101 tracks a selection of lower-level category nodes is depicted in
In addition, the concepts contained in the tag cloud 420 have changed in the graph 400 depicted in
Moreover, the articles list 430 has been modified to show the articles that are most relevant to the concepts depicted in the tag cloud 420. In addition, the articles depicted therein may comprise those articles that have most recently been accessed by the user.
Turning now to
Although the hierarchy 410, the tag cloud 420, and the articles list 430 in the graph 400 have been depicted as having particular characteristics to distinguish some of the elements from each other, it should be understood that the elements may be caused to have other characteristics without departing from a scope of the invention. For instance, instead of the text of the category nodes and/or the concepts having different characteristics, additional distinguishing features, such as, symbols, graphs, etc., may be provided adjacent to the text, in which the additional distinguishing features may be configured to provide similar levels of differences between the text. In addition, the category nodes in the hierarchy 410 may also be depicted in one or more tag clouds, similar to the tag cloud 420.
Through implementation of the methods 200 and 300, the categories and concepts associated with a document access history may be visually represented, with the respective relevancies of the categories and concepts to the accessed documents. As such, “topics” may be revisited rather than the actual documents that are included in the document access history. In one regard, richer interaction with the documents in an access history as compared with conventional history tracking software is made available through implementation of the methods 200 and 300.
In addition, although the discussion above pertains to the visualization of category paths for use in browsing web documents, the same techniques are also applicable to other textual documents.
Some or all of the operations set forth in the methods 200 and 300 may be contained as utilities, programs, or subprograms, in any desired computer accessible medium. In addition, some or all of the methods 200 and 300 may be embodied by computer programs, which may exist in a variety of forms both active and inactive. For example, they may exist as software program(s) comprised of program instructions in source code, object code, executable code or other formats. Any of the above may be embodied on a computer readable medium.
Exemplary computer readable storage devices include conventional computer system random access memory (RAM), read only memory (ROM), electrically programmable read-only memory (EPROM), EEPROM, and magnetic or optical disks or tapes. Concrete examples of the foregoing include distribution of the programs on a CD ROM or via Internet download. It is therefore to be understood that any electronic device capable of executing the above-described functions may perform those functions enumerated above.
The computing apparatus 500 includes one or more processors 502. The processor(s) 502 may be used to execute some or all of the steps described in the methods 200 and 300. Commands and data from the processor(s) 502 are communicated over a communication bus 504. The computing apparatus 500 also includes a main memory 506, such as a random access memory (RAM), where the program code for the processor(s) 502, may be executed during runtime, and a secondary memory 508. The secondary memory 508 includes, for example, one or more hard disk drives 510 and/or a removable storage drive 512, representing a floppy diskette drive, a magnetic tape drive, a compact disk drive, etc., where a copy of the program code for the methods 200 and 300 may be stored.
The removable storage drive 510 reads from and/or writes to a removable storage unit 514 in a well-known manner. User input and output devices may include a keyboard 516, a mouse 518, and a display 520. A display adaptor 522 may interface with the communication bus 504 and the display 520 and may receive display data from the processor(s) 502 and convert the display data into display commands for the display 520. In addition, the processor(s) 402 may communicate over a network, for instance, the Internet, a local area network (LAN), etc., through a network adaptor 524.
It will be apparent to one of ordinary skill in the art that other known electronic components may be added or substituted in the computing apparatus 500. It should also be apparent that one or more of the components depicted in
What has been described and illustrated herein is a preferred embodiment of the invention along with some of its variations. The terms, descriptions and figures used herein are set forth by way of illustration only and are not meant as limitations. Those skilled in the art will recognize that many variations are possible within the scope of the invention, which is intended to be defined by the following claims—and their equivalents—in which all terms are meant in their broadest reasonable sense unless otherwise indicated.
Number | Name | Date | Kind |
---|---|---|---|
5835085 | Eick et al. | Nov 1998 | A |
5895470 | Pirolli et al. | Apr 1999 | A |
5918236 | Wical | Jun 1999 | A |
5924090 | Krellenstein | Jul 1999 | A |
5963208 | Dolan et al. | Oct 1999 | A |
5963965 | Vogel | Oct 1999 | A |
6031537 | Hugh | Feb 2000 | A |
6038560 | Wical | Mar 2000 | A |
6076088 | Paik et al. | Jun 2000 | A |
6144962 | Weinberg et al. | Nov 2000 | A |
6154213 | Rennison et al. | Nov 2000 | A |
6240410 | Wical | May 2001 | B1 |
6285999 | Page | Sep 2001 | B1 |
6295514 | Agrafiotis et al. | Sep 2001 | B1 |
6307573 | Barros | Oct 2001 | B1 |
6363378 | Conklin et al. | Mar 2002 | B1 |
6373484 | Orell et al. | Apr 2002 | B1 |
6377287 | Hao et al. | Apr 2002 | B1 |
6434556 | Levin et al. | Aug 2002 | B1 |
6510406 | Marchisio | Jan 2003 | B1 |
6556225 | MacPhail | Apr 2003 | B1 |
6571249 | Garrecht et al. | May 2003 | B1 |
6574632 | Fox et al. | Jun 2003 | B2 |
6583794 | Wattenberg | Jun 2003 | B1 |
6628313 | Minakuchi et al. | Sep 2003 | B1 |
6646652 | Card et al. | Nov 2003 | B2 |
6675159 | Lin et al. | Jan 2004 | B1 |
6704729 | Klein et al. | Mar 2004 | B1 |
6775659 | Clifton-Bligh | Aug 2004 | B2 |
6795826 | Flinn et al. | Sep 2004 | B2 |
6839680 | Liu et al. | Jan 2005 | B1 |
6868525 | Szabo | Mar 2005 | B1 |
6879332 | Decombe | Apr 2005 | B2 |
6928436 | Baudel | Aug 2005 | B2 |
6931604 | Lane | Aug 2005 | B2 |
6944830 | Card et al. | Sep 2005 | B2 |
6961731 | Holbrook | Nov 2005 | B2 |
6976020 | Anthony et al. | Dec 2005 | B2 |
7036093 | Decombe | Apr 2006 | B2 |
7054870 | Holbrook | May 2006 | B2 |
7106329 | Miller et al. | Sep 2006 | B1 |
7113954 | Vogel | Sep 2006 | B2 |
7224362 | Kincaid et al. | May 2007 | B2 |
7251637 | Caid et al. | Jul 2007 | B1 |
7257593 | Mazner et al. | Aug 2007 | B2 |
7268791 | Jannink | Sep 2007 | B1 |
7271804 | Evans | Sep 2007 | B2 |
7281002 | Farrell | Oct 2007 | B2 |
7290223 | Decombe | Oct 2007 | B2 |
7292243 | Burke | Nov 2007 | B1 |
7373612 | Risch et al. | May 2008 | B2 |
7421665 | Molesky et al. | Sep 2008 | B2 |
7424484 | Ma et al. | Sep 2008 | B2 |
7475072 | Ershov | Jan 2009 | B1 |
7493315 | Holbrook | Feb 2009 | B2 |
7512612 | Akella et al. | Mar 2009 | B1 |
7555476 | Holbrook | Jun 2009 | B2 |
7565627 | Brill et al. | Jul 2009 | B2 |
7694212 | Fikes et al. | Apr 2010 | B2 |
7698651 | Cabanes et al. | Apr 2010 | B2 |
7752534 | Blanchard et al. | Jul 2010 | B2 |
7756845 | Madani et al. | Jul 2010 | B2 |
7774335 | Scofield et al. | Aug 2010 | B1 |
7853552 | Beringer et al. | Dec 2010 | B2 |
7877393 | Green et al. | Jan 2011 | B2 |
7912849 | Ohrn et al. | Mar 2011 | B2 |
7949629 | Pollara | May 2011 | B2 |
8001152 | Solan | Aug 2011 | B1 |
8010342 | Uchimoto et al. | Aug 2011 | B2 |
8024324 | Amitay et al. | Sep 2011 | B2 |
8037066 | Majko | Oct 2011 | B2 |
8082511 | Sobotka et al. | Dec 2011 | B2 |
8112715 | Cragun et al. | Feb 2012 | B2 |
8140541 | Koss | Mar 2012 | B2 |
8150382 | Kim et al. | Apr 2012 | B2 |
8156131 | Marsden et al. | Apr 2012 | B2 |
8209337 | Park | Jun 2012 | B2 |
8219916 | Bonforte et al. | Jul 2012 | B2 |
8224851 | Banda et al. | Jul 2012 | B2 |
8229957 | Gehrking et al. | Jul 2012 | B2 |
8234305 | Seligmann et al. | Jul 2012 | B2 |
8239358 | Soubbotin | Aug 2012 | B1 |
8266153 | Hui et al. | Sep 2012 | B2 |
8275399 | Karmarkar et al. | Sep 2012 | B2 |
8280783 | Brownell et al. | Oct 2012 | B1 |
8296660 | Macadaan et al. | Oct 2012 | B2 |
8347231 | Stephens | Jan 2013 | B2 |
8364718 | Kirby | Jan 2013 | B2 |
8417695 | Zhong et al. | Apr 2013 | B2 |
8438487 | Lin-Hendel | May 2013 | B1 |
8520002 | Stambaugh | Aug 2013 | B2 |
20020022956 | Ukrainczyk et al. | Feb 2002 | A1 |
20020055919 | Mikheev | May 2002 | A1 |
20020078090 | Hwang et al. | Jun 2002 | A1 |
20020107827 | Benitez-Jimenez et al. | Aug 2002 | A1 |
20020129015 | Caudill et al. | Sep 2002 | A1 |
20020154175 | Abello et al. | Oct 2002 | A1 |
20030028495 | Pallante | Feb 2003 | A1 |
20030028564 | Sanfilippo | Feb 2003 | A1 |
20030050915 | Allemang et al. | Mar 2003 | A1 |
20030069908 | Anthony et al. | Apr 2003 | A1 |
20030085931 | Card et al. | May 2003 | A1 |
20030101449 | Bentolila et al. | May 2003 | A1 |
20030115189 | Srinivasa et al. | Jun 2003 | A1 |
20030167163 | Glover et al. | Sep 2003 | A1 |
20030167324 | Farnham et al. | Sep 2003 | A1 |
20030217335 | Chung et al. | Nov 2003 | A1 |
20040090472 | Risch et al. | May 2004 | A1 |
20040172600 | Evans | Sep 2004 | A1 |
20040243938 | Weise et al. | Dec 2004 | A1 |
20050108001 | Aarskog | May 2005 | A1 |
20050114786 | Decombe | May 2005 | A1 |
20050144158 | Capper et al. | Jun 2005 | A1 |
20050154690 | Nitta et al. | Jul 2005 | A1 |
20050171760 | Tinkler | Aug 2005 | A1 |
20050182764 | Evans | Aug 2005 | A1 |
20050220351 | Vanderwende et al. | Oct 2005 | A1 |
20060047649 | Liang | Mar 2006 | A1 |
20060053172 | Gardner et al. | Mar 2006 | A1 |
20060053174 | Gardner et al. | Mar 2006 | A1 |
20060074836 | Gardner et al. | Apr 2006 | A1 |
20060106793 | Liang | May 2006 | A1 |
20060143175 | Ukrainczyk et al. | Jun 2006 | A1 |
20060242147 | Gehrking et al. | Oct 2006 | A1 |
20070033221 | Copperman et al. | Feb 2007 | A1 |
20070078832 | Ott et al. | Apr 2007 | A1 |
20070100824 | Richardson et al. | May 2007 | A1 |
20070150487 | Christian et al. | Jun 2007 | A1 |
20070174247 | Xu et al. | Jul 2007 | A1 |
20080007556 | Evans | Jan 2008 | A1 |
20080010615 | Curtis et al. | Jan 2008 | A1 |
20080016071 | Frieden | Jan 2008 | A1 |
20080033932 | DeLong et al. | Feb 2008 | A1 |
20080059897 | Dilorenzo | Mar 2008 | A1 |
20080104061 | Rezaei | May 2008 | A1 |
20080126176 | Iguchi | May 2008 | A1 |
20080162275 | Logan et al. | Jul 2008 | A1 |
20080231644 | Lempel et al. | Sep 2008 | A1 |
20080243815 | Chan et al. | Oct 2008 | A1 |
20080270380 | Ohrn et al. | Oct 2008 | A1 |
20080282186 | Basavaraju | Nov 2008 | A1 |
20080306913 | Newman et al. | Dec 2008 | A1 |
20090006371 | Denoue et al. | Jan 2009 | A1 |
20090037400 | Cragun et al. | Feb 2009 | A1 |
20090119614 | Tienvieri et al. | May 2009 | A1 |
20090125381 | Delepet | May 2009 | A1 |
20090125382 | Delepet | May 2009 | A1 |
20090144240 | Singh et al. | Jun 2009 | A1 |
20090177957 | Bouillet et al. | Jul 2009 | A1 |
20090182727 | Majko | Jul 2009 | A1 |
20090281900 | Rezaei et al. | Nov 2009 | A1 |
20090299725 | Grigsby et al. | Dec 2009 | A1 |
20090300009 | Rezaei et al. | Dec 2009 | A1 |
20100030552 | Chen et al. | Feb 2010 | A1 |
20100114879 | Zhong et al. | May 2010 | A1 |
20100131899 | Hubert | May 2010 | A1 |
20100161631 | Yu et al. | Jun 2010 | A1 |
20100229082 | Karmarkar et al. | Sep 2010 | A1 |
20100235307 | Sweeney et al. | Sep 2010 | A1 |
20100318892 | Teevan et al. | Dec 2010 | A1 |
20110010307 | Bates et al. | Jan 2011 | A1 |
20110173216 | Newman et al. | Jul 2011 | A1 |
20110214050 | Stambaugh | Sep 2011 | A1 |
Entry |
---|
Fenstermacher et al.; “Mining Client-Side Activity for Personalization” Department of Management Information Systems, Eller College of Business and Public Administration, University of Arizona, Jun. 2002. |
Baoyao, Zhou; “Intelligent Web Usage Mining” Nanyang Technological University, Division of Information Systems, School of Computer Engineering, 2004. |
Baynote Inc.: “The Collective Intelligence Platform,” Online, http://www.baynote.com/technology/platform/ 2010. |
Hottolink Inc.; “Recognize” Online, http://www.hottolink.co.jp/english/reconize.html, 2009. |
Hongjun Lu et al: “Extending a Web Browser with Client-Side Mining,” Hong Kong University of Science and Technology Department of Computer Science, Clear Water Bay, Kowloon, Hong Kong, China, 2003. |
Shankar et al.; “Personalized Web Search Based on Client Side Ontology”, CS 498: B.Tech Project, 10. IIT Kanpur, India 2010. |
Sendhikumar et al.; “Personalized ontology for web search personalization” 1. Anna University, Chennai, India , 2008. |
Shahabi et al.; “Yoda an Accurate and Scalable Web based Recommendation System?” University of Southern California, Los Angeles, Sep., 2001. |
Claypool et al.; “Implicit Interest Indicators”, 14. Worcester Polytechnic Institute, Worcester, Computer Science Department Worcester Polytechnic Institute Worcester, MA 01609, USA., 2001. |
Shahabi et al.; “A Framework for Efficient and Anonymous Web Usage Mining Based on Client-Side Tracking”, 15. University of Southern California, Los Angeles, 2002. |
Chattertrap; Online http://www.chattertrap.com; Jul. 20, 2010. |
HSNW: SRI defense technology spawns civilian application: published Jun. 29, 2010. |
Anthony Ha: Facebook investor backs Chattertrap, a personal assistant for content, Jun. 28, 2010. |
Janik, J. et al., “Wikipedia in action: Ontological Knowledge in Text Categorization”,University of Georgia, Computer Science Department Technical report No. UGA-CS-TR-07-001. |
Schonhofen, P. “Identifying document topics using the Wikipedia category network”, Proceedings of the 2006 IEEE/ACM International Conference on Web Intelligence. |
Medelyan, O. et al., “Topic Indexing with Wikipedia”, Proc of Wikipedia and AI workshop at the AAAI-2008 Conference, 2008. |
Milne, D. et al., “Learning to Link with Wikipedia”, CIKM'08, Oct. 26-30, 2008, ACM 2008. |
Gabrilovich, E. et al., “Overcoming the Brittleness Bottleneck usingWikipedia: Enhancing Text Categorization with Encyclopedic Knowledge”, AAAI 2006. |
Banerjee, S., “Boosting Inductive Transfer for Text Classification using Wikipedia”, HP Laboratories, India, HPL-2008-42, Apr. 2008. |
Medelyan, O. et al.,“Mining Meaning From Wikipedia”, Sep. 2008. |
Zemanta, http://www.zemanta.com/. |
IndyCar description from Wikipedia, http://en.wikipedia.org/wiki/IndyCar, Jun. 2009. |
Readability, http://code.google.com/p/arc90labs-readability/, Jun. 2009. |
Banerjee, S. et al., “Clustering Short Texts using Wikipedia”, HP Laboratories, India, HPL-2008-41, Apr. 2008. |
Bederson, B. et al., “Ordered and Quantum Treemaps: Making Effective Use of 2D Space to Display Hierarchies” ,ACM Transactions on Graphics, Oct. 2002. |
Chernov, S. et al., “Extracting Semantic Relationships between Wikipedia Categories”, in 1st Workshop on Semantic Wikis:, Jun. 2006. |
C. Harrison. Clusterball, http://www.chrisharrison.net/projects/clusterball/, Jun. 2009. |
Mihalcea, R. et al., Wikify! linking documents to encyclopedic knowledge. In CIKM '07, Lisboa, Portugul, Nov. 2007. ACM. |
F. Nyffenegger. WikiMindMap.http://www.wikimindmap.org/, Jun. 2009. |
Syed, Z. S. et al., “Wikipedia as an Ontology for Describing Documents”, in Proc. of the Second International Conference on Weblogs and Social Media. AAAI Press, Mar. 2008. |
Thom, J. A. et al., Use of Wikipedia categories in entity ranking. In 12th Australasian Document Computing Symposium, Melborne, Australia, Dec. 2007. |
Wang, P. et al., “Improving text classi cation by using encyclopedia knowledge”, in ICDM '07: 2007. |
Zesch, T. et al., “Analysis of the Wikipedia Category Graph for NLP Applications”, (NAACL-HLT), 2007. |
Clusters, http://infosthetics.com/archives/2007/06/clusterball—wikipedia—data—visualization.html. |
Treemap, http://infosthetics.com/archives/2006/03/itunes—wikipedia—news—treemap—data—visualization.html. |
Tagcloud, http://infosthetics.com/archives/2006/03/itunes—wikipedia—news—treemap—data—visualization.html. |
Andrejko et al., (2007) “User Characteristics Acquisition from Logs with Semantics,” in Kelemenová, A., Koláø, D., Meduna, A., Zendulka, J., eds.: ISIM '07: Information Systems and Formal Models, Slezská universita v Opavi, 2007, 8 pages. |
Huang, A. et al., “Clustering Documents using aWikipedia-based Concept Representation”, Proceedings of the 13th Pacific-Asia Conference on Advances in Knowledge Discovery and Data Mining, Apr. 27-30, 2009, Bangkok, Thailand. |
Why Wubat? Website User Behavior & Analysis Tool, Wubat, Online, http://www.wubat.com/ (download date Aug. 12, 2010.). |
Chris Harrison, http://www.chrisharrison.net/projects/wikiviz/index.html. (download date Oct. 29, 2009). |
Mindmap, http://www.wikimindmap.org/ (download date Oct. 29, 2009). |
Number | Date | Country | |
---|---|---|---|
20110113385 A1 | May 2011 | US |