Embodiments of the present invention relate to a system for the extraction of relevant conceptual associations from “groupings” of content. A conceptual association is defined as the connection or link relating concepts, keywords, or abstract ideas. The term “relevant” refers to the fact that such associations are commonly accepted as meaningful according to some criterion. For example, one measure of the relevance of associations among concepts is how frequently they are used in the same paragraph, within the same document, or in the same context. Finally, a “grouping” of content is defined as a collection of content nodes that are related to each other in some functional fashion, thus resulting in a topical bias. Groupings typically originate from relationships that exist between content nodes, either at a semantic level, or because links between them are explicitly defined. For example, a grouping strategy could be defined that associates Web pages that have common terms in the anchor text of hyperlinks pointing to them, e.g. the group of all pages for which the term “diabetes” appears in one or more of the in-link anchors. Often metadata provided with the content set can be used to extract such groupings, as in invention U.S. patent application Ser. No. 11/125,329, filed May 10, 2005 and entitled “Method and Apparatus for Distributed Community Finding”, where link structure among the content can be used to group content nodes into communities or groups. In a given set, comprising billions of content, there could be millions of such implied groupings and each content may belong to multiple such groupings.
Entity association maps are conventionally generated either manually based on existing taxonomies and encyclopedias or via a hybrid approach where pre-built taxonomies are enriched, either manually or through supervised aggregation. Examples of the former approach are Wikipedia, DMOZ's Open Directory Project or Google's Knol. These taxonomies rely on a substantial collaborative effort in order to span a significant number of topics with enough depth. Therefore, approaches of the second type have been introduced in order to either reduce the cost of building such taxonomies, or to expand their coverage.
In general, given the pace at which information is currently generated, it has become increasingly challenging to create and maintain manually created taxonomies. This is especially true on the World Wide Web, where not only the number of pages online has been increasing at a very rapid pace, but also the percentage of pages with content that quickly varies over time (e.g. news, blogs, personal pages within social networks) started assuming a predominant role. At the same time, the importance of correctly categorizing and organizing, not simply the content of such pages, but the information they contain has reached an unprecedented commercial value.
Embodiments of the present invention are directed at methods and systems that substantially obviate one or more of the above and other problems associated with conventional techniques for creating conceptual association maps (or taxonomies) and scaling them to a very large body of content, like the pages on the World Wide Web.
One aspect of the invention is a method for clustering structured and unstructured content into topically focused groups of documents and extracting relevant concepts from such content nodes. Another aspect of the invention is a method for determining the relevance of conceptual associations from various patterns of co-occurrence of conceptual entities across different content nodes.
Embodiments of the present invention are capable of extracting and evaluating what are the concepts (i.e., basic informational units) in the given set of content, in an unsupervised fashion, without the limitations typically imposed by manual or partially supervised approaches.
Embodiments of the present invention are simultaneously capable of extracting and evaluating the relevance of the associations among the concepts derived by the invention on a very large scale, in an unsupervised fashion, without the limitations typically imposed by manual or partially supervised approaches.
According to one embodiment of the present invention, a computerized system for extracting a conceptual association map on a large scale from structured and unstructured content and for scoring such associations based on their relevance is provided.
The accompanying drawings, which are incorporated into and constitute a part of this specification, illustrate one or more embodiments of the present invention and, together with the detailed description, serve to explain the principles and implementations of the invention.
In the drawings:
Embodiments of the present invention are described herein in the context of generating a conceptual association graph from structured content. Those of ordinary skill in the art will realize that the following detailed description of the present invention is illustrative only and is not intended to be in any way limiting. Other embodiments of the present invention will readily suggest themselves to such skilled persons having the benefit of this disclosure. Reference will now be made in detail to implementations of the present invention as illustrated in the accompanying drawings. The same reference indicators will be used throughout the drawings and the following detailed description to refer to the same or like parts.
According to one embodiment of the present invention, three separate modules are provided as shown in
The first module, the content grouping module (102), groups content nodes based on their patterns of association. According to one embodiment of the present invention, the system generates groups whose members are densely connected with other members of the same cluster, while being sparsely (or loosely) connected with content nodes that are not part of the cluster. The definition of “link” or “connection” between two content nodes is specific to each of the embodiments of this invention, but it is not limited to any of the types described in such embodiments.
Community finding algorithms are discussed in U.S. patent application Ser. No. 11/125,329, entitled “Method and Apparatus for Distributed Community Finding” the entirety of which is hereby incorporated by reference; and “Finding and evaluating community structure in networks,” M. E. J. Newman and M. Girvan, Phys. Rev. E 69, 026113 (2004). According to one embodiment of the present invention, links between content nodes are “hyperlink-style” connections, indicating that a referrer page is pointing to the content (or a section of the content) of a referred-to page.
According to another embodiment of the present invention, connections are “citation-style” links between a content node and a collection of content nodes that are cited as pertinent to the referrer node. Examples of content nodes associated by links of this type are scientific publications, encyclopedias and intellectual property publications (e.g., patents and patent applications).
According to another embodiment of the present invention, links between content nodes are generated from metadata entities, e.g. category tagging, descriptive keywords (as found in web pages as well as scientific articles), by grouping together all the nodes that are tagged with the same keyword or category.
According to another embodiment of the present invention, the grouping of content nodes is based on anchor text of hyperlinks found in web pages. Pages with common relevant terms in the anchor text of links pointing to them (in-links) are grouped together.
The second module is a concept extractor module (103) whose function is to tag content nodes with descriptive concepts and keywords. The concept extractor module (103) takes advantage of the statistical bias introduced by the topically focused clustering to better target the concept selection.
The statistics extracted from each cluster provide a “view” of the content nodes within a certain grouping. The membership of each content node to a specific grouping is not exclusive, meaning that the same node can be associated to many groups, each providing a different view of the same content. According to one embodiment of the present invention, different content nodes in a grouping might be assigned different weights so a view is generated by aggregating weighted statistics of such nodes. An example of a weighting mechanism is the use of global or topical pagerank of web pages.
Given a specific “view” for a content node, a tagger sub-module (0304) selects appropriate descriptive concepts and keywords for that node (0305). In this step, the topical focus spawning from the grouping stage aids in the process of selecting meaningful conceptual tags for each content node. Conceptual tags can have weights according to score of each n-gram in the view and statistics of that tag on the content node.
According to one embodiment of the present invention, the last module, the conceptual map builder (104), comprises a system for finding and scoring conceptual associations based on patterns of co-occurrence of concepts in tagged content nodes, and is illustrated in
According to one embodiment of the present invention, the pre-processing module (0602) also normalizes the contribution of content nodes generated from the same “source,” in order to prevent certain sources from becoming dominant in the definition of a conceptual association. A “source” is generically defined as the entity that provided the content nodes. For example, when content nodes are pages from the World Wide Web, the source is the domain that hosts the web pages.
According to one embodiment of the present invention, the score of a link between two concepts c1 and c2 is given by the probability that a content node is tagged with concept c2 given that it has been tagged with concept c1, estimated over all the content nodes tagged with concept c2.
According to another embodiment of the present invention, the score of a conceptual association is defined as described above but it also weighed by the probability that a given content node should be tagged with a given concept. In other words, this embodiment covers the case where a probability is assigned to the event that a content node is tagged with a certain concept.
According to another embodiment of the present invention, the computation of the association score between two concepts is weighed through a global measure of relevance of the content nodes that are tagged with such concepts. For example, when content nodes are from the World Wide Web, pages that are found to be more relevant according to some criterion (e.g. page rank) play a greater role in the weight definition than pages with lower relevance.
According to another embodiment of the present invention, the association scores are also determined by the positional information of an extracted concept within a concept node. In this version, both the absolute position of a concept within the content node, as well the relative position of two associated concepts in the text play a role in the definition of the association score. For example, concepts appearing in certain sections of the content node that are deemed more relevant (e.g. title, abstract or description) are assigned a larger weight. At the same time, associations between concepts that co-occur within a certain window of words can be scored higher due to their proximity.
According to another embodiment of the present invention, the concept associations found through the method are fed back to the concept extraction module, in order to improve the tagging of each concept node (reference numeral 106 of
According to another embodiment of the present invention, syntactic or part of speech tagging of tokens in each content node is used as additional tagging of the previously extracted conceptual associations.
The computer platform 701 may include a data bus 704 or other communication mechanism for communicating information across and among various parts of the computer platform 701, and a processor 705 coupled with bus 701 for processing information and performing other computational and control tasks. Computer platform 701 also includes a volatile storage 706, such as a random access memory (RAM) or other dynamic storage device, coupled to bus 704 for storing various information as well as instructions to be executed by processor 705. The volatile storage 706 also may be used for storing temporary variables or other intermediate information during execution of instructions by processor 705. Computer platform 701 may further include a read only memory (ROM or EPROM) 707 or other static storage device coupled to bus 704 for storing static information and instructions for processor 705, such as basic input-output system (BIOS), as well as various system configuration parameters. A persistent storage device 708, such as a magnetic disk, optical disk, or solid-state flash memory device is provided and coupled to bus 901 for storing information and instructions.
Computer platform 701 may be coupled via bus 704 to a display 709, such as a cathode ray tube (CRT), plasma display, or a liquid crystal display (LCD), for displaying information to a system administrator or user of the computer platform 701. An input device 710, including alphanumeric and other keys, is coupled to bus 701 for communicating information and command selections to processor 705. Another type of user input device is cursor control device 711, such as a mouse, a trackball, or cursor direction keys for communicating direction information and command selections to processor 704 and for controlling cursor movement on display 709.
An external storage device 712 may be connected to the computer platform 701 via bus 704 to provide an extra or removable storage capacity for the computer platform 701. In an embodiment of the computer system 700, the external removable storage device 712 may be used to facilitate exchange of data with other computer systems.
Embodiments of the present invention are related to the use of computer system 700 for implementing the techniques described herein. According to one embodiment of the present invention, the system may reside on a machine such as computer platform 701. According to one embodiment of the present invention, the techniques described herein are performed by computer system 700 in response to processor 705 executing one or more sequences of one or more instructions contained in the volatile memory 706. Such instructions may be read into volatile memory 706 from another computer-readable medium, such as persistent storage device 708. Execution of the sequences of instructions contained in the volatile memory 706 causes processor 705 to perform the process steps described herein. In alternative embodiments, hard-wired circuitry may be used in place of or in combination with software instructions to implement embodiment of the present invention. Thus, embodiments of the present invention are not limited to any specific combination of hardware circuitry and software.
It should be noted that embodiments of the present invention are illustrated and discussed herein as having various modules which perform particular functions and interact with one another. It should be understood that these modules are merely segregated based on their function for the sake of description and represent computer hardware and/or executable software code which is stored on a computer-readable medium for execution on appropriate computing hardware. The various functions of the different modules and units can be combined or segregated as hardware and/or software stored on a computer-readable medium as above as modules in any manner, and can be used separately or in combination.
The term “computer-readable medium” as used herein refers to any medium that participates in providing instructions to processor 705 for execution. The computer-readable medium is just one example of a machine-readable medium, which may carry instructions for implementing any of the methods and/or techniques described herein. Such a medium may take many forms, including but not limited to, non-volatile media, volatile media, and transmission media. Non-volatile media includes, for example, optical or magnetic disks, such as storage device 708. Volatile media includes dynamic memory, such as volatile storage 706. Transmission media includes coaxial cables, copper wire and fiber optics, including the wires that comprise data bus 704. Transmission media can also take the form of acoustic or light waves, such as those generated during radio-wave and infra-red data communications.
Common forms of computer-readable media include, for example, a floppy disk, a flexible disk, hard disk, magnetic tape, or any other magnetic medium, a CD-ROM, any other optical medium, punchcards, papertape, any other physical medium with patterns of holes, a RAM, a PROM, an EPROM, a FLASH-EPROM, a flash drive, a memory card, any other memory chip or cartridge, a carrier wave as described hereinafter, or any other medium from which a computer can read.
Various forms of computer readable media may be involved in carrying one or more sequences of one or more instructions to processor 705 for execution. For example, the instructions may initially be carried on a magnetic disk from a remote computer. Alternatively, a remote computer can load the instructions into its dynamic memory and send the instructions over a telephone line using a modem. A modem local to computer system 700 can receive the data on the telephone line and use an infrared transmitter to convert the data to an infra-red signal. An infra-red detector can receive the data carried in the infra-red signal and appropriate circuitry can place the data on the data bus 704. The bus 704 carries the data to the volatile storage 706, from which processor 705 retrieves and executes the instructions. The instructions received by the volatile memory 706 may optionally be stored on persistent storage device 708 either before or after execution by processor 705. The instructions may also be downloaded into the computer platform 701 via Internet using a variety of network data communication protocols well known in the art.
The computer platform 701 also includes a communication interface, such as network interface card 713 coupled to the data bus 704. Communication interface 713 provides a two-way data communication coupling to a network link 714 that is connected to a local network 715. For example, communication interface 713 may be an integrated services digital network (ISDN) card or a modem to provide a data communication connection to a corresponding type of telephone line. As another example, communication interface 713 may be a local area network interface card (LAN NIC) to provide a data communication connection to a compatible LAN. Wireless links, such as well-known 802.11 a, 802.11 b, 802.11 g and Bluetooth may also used for network implementation. In any such implementation, communication interface 713 sends and receives electrical, electromagnetic or optical signals that carry digital data streams representing various types of information.
Network link 713 provides data communication through one or more networks to other network resources. For example, network link 714 may provide a connection through local network 715 to a host computer 716, or a network storage/server 717. Additionally or alternatively, the network link 713 may connect through gateway/firewall 717 to the wide-area or global network 718, such as an Internet. Thus, the computer platform 701 can access network resources located anywhere on the Internet 718, such as a remote network storage/server 719. On the other hand, the computer platform 701 may also be accessed by clients located anywhere on the local area network 715 and/or the Internet 718. The network clients 720 and 721 may themselves be implemented based on the computer platform similar to the platform 701.
Local network 715 and the Internet 718 both use electrical, electromagnetic or optical signals that carry digital data streams. The signals through the various networks and the signals on network link 714 and through communication interface 713, which carry the digital data to and from computer platform 701, are exemplary forms of carrier waves transporting the information.
Computer platform 701 can send messages and receive data, including program code, through the variety of network(s) including Internet 718 and LAN 715, network link 714 and communication interface 713. In the Internet example, when the system 701 acts as a network server, it might transmit a requested code or data for an application program running on client(s) 720 and/or 721 through Internet 718, gateway/firewall 717, local area network 715 and communication interface 713. Similarly, it may receive code from other network resources.
The received code may be executed by processor 705 as it is received, and/or stored in persistent or volatile storage devices 708 and 706, respectively, or other non-volatile storage for later execution. In this manner, computer system 701 may obtain application code in the form of a carrier wave.
Finally, it should be understood that processes and techniques described herein are not inherently related to any particular apparatus and may be implemented by any suitable combination of components. Further, various types of general purpose devices may be used in accordance with the teachings described herein. It may also prove advantageous to construct specialized apparatus to perform the method steps described herein. The present invention has been described in relation to particular examples, which are intended in all respects to be illustrative rather than restrictive. Those skilled in the art will appreciate that many different combinations of hardware, software, and firmware will be suitable for practicing the present invention. For example, the described software may be implemented in a wide variety of programming or scripting languages, such as Assembler, C/C++, perl, shell, PHP, Java, etc.
Moreover, other implementations of the present invention will be apparent to those skilled in the art from consideration of the specification and practice of the present invention disclosed herein. Various aspects and/or components of the described embodiments may be used singly or in any combination in the online behavioral targeting system. It is intended that the specification and examples be considered as exemplary only, with a true scope and spirit of the present invention being indicated by the following claims.
In the interest of clarity, not all of the routine features of the implementations described herein are shown and described. It will, of course, be appreciated that in the development of any such actual implementation, numerous implementation-specific decisions must be made in order to achieve the developer's specific goals, such as compliance with application- and business-related constraints, and that these specific goals will vary from one implementation to another and from one developer to another. Moreover, it will be appreciated that such a development effort might be complex and time-consuming, but would nevertheless be a routine undertaking of engineering for those of ordinary skill in the art having the benefit of this disclosure.
According to one embodiment of the present invention, the components, process steps, and/or data structures may be implemented using various types of operating systems (OS), computing platforms, firmware, computer programs, computer languages, and/or general-purpose machines. The method can be run as a programmed process running on processing circuitry. The processing circuitry can take the form of numerous combinations of processors and operating systems, connections and networks, data stores, or a stand-alone device. The process can be implemented as instructions executed by such hardware, hardware alone, or any combination thereof. The software may be stored on a program storage device readable by a machine.
While embodiments and applications of this invention have been shown and described, it would be apparent to those skilled in the art having the benefit of this disclosure that many more modifications than mentioned above are possible without departing from the inventive concepts herein. The invention, therefore, is not to be restricted except in the spirit of the appended claims.
This application is a non-provisional of and claims priority to U.S. Provisional Application No. 61/252,632, filed Oct. 16, 2009 and entitled “Generating A Conceptual Association Graph From Structured Content”, the entirety of which is hereby incorporated by reference. This application is also related to commonly assigned U.S. patent application Ser. No. 11/125,329, filed May 10, 2005 and entitled “Method and Apparatus for Distributed Community Finding”, the entirety of which is hereby incorporated by reference.
Number | Name | Date | Kind |
---|---|---|---|
5581764 | Fitzgerald et al. | Dec 1996 | A |
5721910 | Unger et al. | Feb 1998 | A |
5956708 | Dyko et al. | Sep 1999 | A |
6038560 | Wical | Mar 2000 | A |
6098064 | Pirolli et al. | Aug 2000 | A |
6233575 | Agrawal et al. | May 2001 | B1 |
6242273 | Goodwin et al. | Jun 2001 | B1 |
6339767 | Rivette et al. | Jan 2002 | B1 |
6397682 | Kumar et al. | Jun 2002 | B2 |
6446061 | Doerre et al. | Sep 2002 | B1 |
6544357 | Hehmann et al. | Apr 2003 | B1 |
6549896 | Candan et al. | Apr 2003 | B1 |
6665837 | Dean et al. | Dec 2003 | B1 |
6816884 | Summers | Nov 2004 | B1 |
6826553 | DaCosta et al. | Nov 2004 | B1 |
6886129 | Raghavan et al. | Apr 2005 | B1 |
7031308 | Garcia-Luna-Aceves et al. | Apr 2006 | B2 |
7051023 | Kapur et al. | May 2006 | B2 |
7092953 | Haynes | Aug 2006 | B1 |
7181438 | Szabo | Feb 2007 | B1 |
7269253 | Wu et al. | Sep 2007 | B1 |
7483711 | Burchfiel | Jan 2009 | B2 |
7590589 | Hoffberg | Sep 2009 | B2 |
7613851 | Rice et al. | Nov 2009 | B2 |
7660855 | Arning et al. | Feb 2010 | B2 |
7680796 | Yeh et al. | Mar 2010 | B2 |
7689493 | Sullivan et al. | Mar 2010 | B1 |
7716060 | Germeraad et al. | May 2010 | B2 |
7725467 | Yamamoto et al. | May 2010 | B2 |
7725475 | Alspector et al. | May 2010 | B1 |
7725525 | Work | May 2010 | B2 |
7730063 | Eder | Jun 2010 | B2 |
7805536 | Kompella et al. | Sep 2010 | B1 |
7818191 | Lutnick et al. | Oct 2010 | B2 |
7822745 | Fayyad et al. | Oct 2010 | B2 |
7831586 | Reitter et al. | Nov 2010 | B2 |
7885987 | Lee | Feb 2011 | B1 |
7958120 | Muntz et al. | Jun 2011 | B2 |
7984029 | Alspector et al. | Jul 2011 | B2 |
7996753 | Chan et al. | Aug 2011 | B1 |
8024372 | Harik et al. | Sep 2011 | B2 |
8050965 | Hellevik et al. | Nov 2011 | B2 |
8301617 | Muntz et al. | Oct 2012 | B2 |
8370362 | Szabo | Feb 2013 | B2 |
8380721 | Attaran Rezaei et al. | Feb 2013 | B2 |
8412575 | Labio et al. | Apr 2013 | B2 |
8417695 | Zhong et al. | Apr 2013 | B2 |
8468118 | Kim et al. | Jun 2013 | B2 |
8825654 | Muntz et al. | Sep 2014 | B2 |
8825657 | Rezaei et al. | Sep 2014 | B2 |
8838605 | Muntz et al. | Sep 2014 | B2 |
8843434 | Rezaei et al. | Sep 2014 | B2 |
20010037324 | Agrawal et al. | Nov 2001 | A1 |
20010049688 | Fratkina et al. | Dec 2001 | A1 |
20020016782 | Cooper | Feb 2002 | A1 |
20020049792 | Wilcox et al. | Apr 2002 | A1 |
20020080180 | Mander et al. | Jun 2002 | A1 |
20020087884 | Shacham et al. | Jul 2002 | A1 |
20020091846 | Garcia-Luna-Aceves et al. | Jul 2002 | A1 |
20020143742 | Nonomura et al. | Oct 2002 | A1 |
20030046307 | Rivette et al. | Mar 2003 | A1 |
20030115191 | Copperman et al. | Jun 2003 | A1 |
20030187881 | Murata et al. | Oct 2003 | A1 |
20030217139 | Burbeck et al. | Nov 2003 | A1 |
20030217140 | Burbeck et al. | Nov 2003 | A1 |
20030220866 | Pisaris-Henderson et al. | Nov 2003 | A1 |
20030227479 | Mizrahi et al. | Dec 2003 | A1 |
20040024739 | Copperman et al. | Feb 2004 | A1 |
20040064438 | Kostoff | Apr 2004 | A1 |
20040080524 | Yeh et al. | Apr 2004 | A1 |
20040085797 | Mei et al. | May 2004 | A1 |
20040093328 | Damle | May 2004 | A1 |
20040122803 | Dom et al. | Jun 2004 | A1 |
20040133555 | Toong et al. | Jul 2004 | A1 |
20040170328 | Ladwig et al. | Sep 2004 | A1 |
20040267638 | Giunta | Dec 2004 | A1 |
20050010556 | Phelan | Jan 2005 | A1 |
20050021461 | Flake et al. | Jan 2005 | A1 |
20050021531 | Wen et al. | Jan 2005 | A1 |
20050033742 | Kamvar et al. | Feb 2005 | A1 |
20050055321 | Fratkina et al. | Mar 2005 | A1 |
20050064618 | Brown et al. | Mar 2005 | A1 |
20050065980 | Hyatt et al. | Mar 2005 | A1 |
20050086260 | Canright et al. | Apr 2005 | A1 |
20050097204 | Horowitz et al. | May 2005 | A1 |
20050113691 | Liebschner | May 2005 | A1 |
20050114198 | Koningstein et al. | May 2005 | A1 |
20050114763 | Nonomura et al. | May 2005 | A1 |
20050117593 | Shand | Jun 2005 | A1 |
20050138070 | Huberman et al. | Jun 2005 | A1 |
20050144065 | Calabria et al. | Jun 2005 | A1 |
20050144162 | Liang | Jun 2005 | A1 |
20050160107 | Liang | Jul 2005 | A1 |
20050182755 | Tran | Aug 2005 | A1 |
20050203838 | Zhang et al. | Sep 2005 | A1 |
20050210008 | Tran et al. | Sep 2005 | A1 |
20050210027 | Aggarwal et al. | Sep 2005 | A1 |
20050222900 | Fuloria et al. | Oct 2005 | A1 |
20050256905 | Gruhl et al. | Nov 2005 | A1 |
20050256949 | Gruhl et al. | Nov 2005 | A1 |
20050283461 | Sell et al. | Dec 2005 | A1 |
20060004703 | Spivack et al. | Jan 2006 | A1 |
20060041548 | Parsons et al. | Feb 2006 | A1 |
20060047649 | Liang | Mar 2006 | A1 |
20060080422 | Huberman et al. | Apr 2006 | A1 |
20060085408 | Morsa | Apr 2006 | A1 |
20060106793 | Liang | May 2006 | A1 |
20060106847 | Eckardt, III et al. | May 2006 | A1 |
20060112105 | Adamic et al. | May 2006 | A1 |
20060112111 | Tseng et al. | May 2006 | A1 |
20060168065 | Martin | Jul 2006 | A1 |
20060171331 | Previdi et al. | Aug 2006 | A1 |
20060184464 | Tseng et al. | Aug 2006 | A1 |
20060209727 | Jennings, III et al. | Sep 2006 | A1 |
20060212350 | Ellis et al. | Sep 2006 | A1 |
20060218035 | Park et al. | Sep 2006 | A1 |
20060235841 | Betz et al. | Oct 2006 | A1 |
20060242017 | Libes et al. | Oct 2006 | A1 |
20060271564 | Muntz et al. | Nov 2006 | A1 |
20060294155 | Patterson | Dec 2006 | A1 |
20070025364 | Kodialam et al. | Feb 2007 | A1 |
20070033103 | Collins et al. | Feb 2007 | A1 |
20070038614 | Guha | Feb 2007 | A1 |
20070061300 | Ramer et al. | Mar 2007 | A1 |
20070143329 | Vigen | Jun 2007 | A1 |
20070174255 | Sravanapudi et al. | Jul 2007 | A1 |
20070198506 | Rezaei et al. | Aug 2007 | A1 |
20070203903 | Attaran Rezaei et al. | Aug 2007 | A1 |
20070239534 | Liu et al. | Oct 2007 | A1 |
20070245035 | Attaran Rezaei et al. | Oct 2007 | A1 |
20070300152 | Baugher | Dec 2007 | A1 |
20080033932 | DeLong et al. | Feb 2008 | A1 |
20080065483 | Ball | Mar 2008 | A1 |
20080086592 | Stephani | Apr 2008 | A1 |
20080104061 | Rezaei | May 2008 | A1 |
20080140491 | Jain et al. | Jun 2008 | A1 |
20080232809 | Beshai et al. | Sep 2008 | A1 |
20080256061 | Chang et al. | Oct 2008 | A1 |
20080301033 | Singh et al. | Dec 2008 | A1 |
20090043648 | Mahdian et al. | Feb 2009 | A1 |
20090046678 | Lee et al. | Feb 2009 | A1 |
20090086663 | Ho et al. | Apr 2009 | A1 |
20090157855 | Adam et al. | Jun 2009 | A1 |
20090168768 | Chiabaut et al. | Jul 2009 | A1 |
20090265475 | Fujita | Oct 2009 | A1 |
20090281900 | Rezaei et al. | Nov 2009 | A1 |
20090296719 | Maier et al. | Dec 2009 | A1 |
20090300009 | Rezaei et al. | Dec 2009 | A1 |
20100114879 | Zhong et al. | May 2010 | A1 |
20110276563 | Sandoval et al. | Nov 2011 | A1 |
20130046797 | Muntz et al. | Feb 2013 | A1 |
20130046842 | Muntz et al. | Feb 2013 | A1 |
20130198191 | Hernandez et al. | Aug 2013 | A1 |
20140040184 | Benissan | Feb 2014 | A1 |
20140067535 | Rezaei et al. | Mar 2014 | A1 |
20140351237 | Rezaei | Nov 2014 | A1 |
Number | Date | Country |
---|---|---|
101278257 | Oct 2008 | CN |
1891509 | Feb 2008 | EP |
WO2006121575 | Nov 2006 | WO |
WO2007084616 | Jul 2007 | WO |
WO2007084778 | Jul 2007 | WO |
WO2007100923 | Sep 2007 | WO |
Entry |
---|
Bagrow, J. P. et al., “A Local Method for Detecting Communities,” Physical Review E, vol. 72, No. 4, 2005, 046108, pp. 1-16. |
Bollobas, B., “Random Graphs,” Second Edition, Cambridge University Press, United Kingdom, 2001, Chapter 1 and 2, pp. 1-59. |
Boscolo, R., et al., “Functionality Encoded in Topology? Discovering Macroscopic Regulatory Modules from Large-Scale Protein-DNA Interaction Networks,” Reprint arXiv:q-bio/0501039, Jan. 2005. |
Chung, F. et. al., “The Small World Phenomenon in Hybrid Power Law Graphs,” Lect. Notes Phys., vol. 650, 2004, pp. 89-104. |
Clauset, A. et al., “Finding Community Structure in Very Large Networks,” Physical Review, vol. 70, No. 6, Dec. 2004, 066111. |
Erdos, P. et al., “On the Evolution of Random Graphs,” Publication of the Mathematical Institute of the Hungarian Academy of Sciences, vol. 5, 1960, pp. 343-347. |
Erdos, P. et al., “On the Strength of Connectedness of a Random Graph,” Acta Mathematica Scientia Hungarica, vol. 12, Nos. 1-2, 1961, pp. 261-267. |
Flake, G. W. et al., “Self-Organization and Identification of Web Communities,” IEEE Computer Society, vol. 35, Issue 3, Mar. 2002, pp. 66-71. |
Franceschetti, M. et. al., “Navigation in Small World Networks, a Scale-Free Continuum Model,” Jul. 2003, pp. 1-9. |
Garton, L., et al., “Studying On-Line Social Networks,” Doing Internet Research, Edited by S. Jones, Thousand Oaks, Calif., 1999, pp. 75-105. |
Girvan, M. et al., “Community Structure in Social and Biological Networks,” PNAS, vol. 99, No. 12, Jun. 2002, pp. 7821-7826. |
Golbeck, “Accuracy of Metrics for Inferring Trust and Reputation in Semantic Web-based Social Networks,” Apr. 2004, pp. 1-11. |
Gonzalez-Barahona, J. M. et al., “Community Structure of Modules in the Apache Project,” MSR '05: Proceedings of the 2005 International Workshop on Mining Software Repositories, 2005. |
Kernighan, B. W. et al., “An Efficient Heuristic Procedure for Partitioning Graphs,” The Bell System Technical Journal 49, Feb. 1970, pp. 291-307. |
Lyons, R. et al., “Probability on Trees and Networks,” Apr. 17, 2005, pp. 1-53. |
Manku, G. S. et. al., “Know thy Neighbor: the Power of Lookahead in Randomized P2P Networks,” STOC'04, Jun. 2004, pp. 1-10. |
Newman, M. E. J., “Coauthorship Networks and Patterns of Scientific Collaboration,” PNAS, vol. 101, Apr. 2004, pp. 5200-5205. |
Newman, M. E. J., “Fast Algorithm for Detecting Community Structure in Networks,” Physical Review E 69, Jun. 2004, pp. 066133.1-066133.5. |
Newman, M.E.J. et al. “Finding and Evaluating Community Structure in Networks,” Physical Review E, vol. 69, 026113, 2004, pp. 1-16. |
Owczarek, A. L. et. al., “Generalised Percolation Probabilities for the Self-Dual Potts Model,” J. Phys. A: Math. Gen. 20, Apr. 1987, pp. 5263-5271. |
Page, L. et al., “The PageRank Citation Ranking: Bringing Order to the Web,” Technical Report, Stanford InfoLab, 1999. |
Patch, K., “Simple Search Lightens Net load,” TRN, Sep. 2004, pp. 1-3. |
Pons, P. et al., “Computing Communities in Large Networks Using Random Walks,” ArXiv Condensed Matter e-prints, Dec. 2004. |
Pothen, A. et al., “Partitioning Sparse Matrices with Eigenvectors of Graphs,” Society of Industrial and Applied Mathematics, SIAM Journal on Matrix Analysis and Applications, vol. 11, No. 3, Jul. 1990, pp. 430-452. |
Radicchi, F. et al., “Defining and Identifying Communities in Networks,” PNAS, vol. 101, No. 9, Mar. 2004, pp. 2658-2663. |
Rives, A. W. et al., “Modular Organization of Cellular Networks,” PNAS, vol. 100, No. 3, Feb. 2003, pp. 1128-1133. |
Sarshar, N. et al., “Scalable Percolation Search in Power Law Networks,” Jun. 2004, Department of Electrical Engineering, University of California, Los Angeles, pp. 1-13. |
Scott, J. “Social Network Analysis: A Handbook,” 2nd Edition, Sage Publications, London, 2000, Chapter 4, pp. 63-81. |
Wasserman, S., “Social Network Analysis: Methods and Applications,” Cambridge University Press, Cambridge, England, 1994, pp. 17-20 and Chapter 7, pp. 249-290. |
Wellman, B. et al., “Computer Networks as Social Networks: Collaborative Work, Telework, and Virtual Community,” Annual Reviews Sociology, vol. 22, No. 1, 1996, pp. 213-238. |
Wu, F. et al., “Finding Communities in Linear Time: a Physics Approach,” The European Physical Journal B, vol. 38, No. 2, 2004, pp. 331-338. |
Zachary, W. W. “An Information Flow Model for Conflict and Fission in Small Groups,” Journal of Anthropological Research, vol. 33, No. 4, Winter, 1977, pp. 452-473. |
Chinese Office Action (First) in Chinese Patent Application No. 200680025190.6, dated Sep. 11, 2009. |
Chinese Office Action (Second) in Chinese Patent Application No. 200680025190.6, dated Jun. 3, 2010. |
International Search Report and Written Opinion for PCT Application No. PCT/US2006/14160, mailed Sep. 4, 2007. |
International Search Report and Written Opinion for PCT Application No. PCT/US2007/001335, mailed Feb. 12, 2008. |
International Search Report and Written Opinion for PCT Application No. PCT/US2007/001694, mailed Feb. 14, 2008. |
International Search Report and Written Opinion for PCT Application No. PCT/US2007/005504, mailed Jul. 29, 2008. |
Office Action in U.S. Appl. No. 11/125,329, mailed Mar. 30, 2010. |
Office Action in U.S. Appl. No. 11/125,329, mailed Jun. 24, 2009. |
Notice of Allowance in U.S. Appl. No. 11/125,329, mailed Feb. 2, 2011. |
Office Action in U.S. Appl. No. 11/624,674, mailed Mar. 15, 2010. |
Office Action in U.S. Appl. No. 11/624,674, mailed Sep. 21, 2010. |
Office Action in U.S. Appl. No. 11/625,279, mailed Mar. 18, 2009. |
Office Action in U.S. Appl. No. 11/625,279, mailed May 13, 2010. |
Office Action in U.S. Appl. No. 11/625,279, mailed Oct. 25, 2010. |
Office Action in U.S. Appl. No. 11/625,279, mailed Dec. 8, 2009. |
Office Action in U.S. Appl. No. 11/680,599, mailed Jan. 5, 2011. |
Office Action in U.S. Appl. No. 11/680,599, mailed Apr. 29, 2009. |
Office Action in U.S. Appl. No. 11/680,599, mailed May 21, 2010. |
Office Action in U.S. Appl. No. 11/923,546, mailed Feb. 26, 2010. |
Office Action in U.S. Appl. No. 11/923,546, mailed Aug. 2, 2010. |
Office Action in U.S. Appl. No. 11/923,546, mailed Dec. 22, 2010. |
Office Action in U.S. Appl. No. 11/923,546, mailed Jun. 3, 2011. |
Office Action in U.S. Appl. No. 12/130,171, mailed Apr. 1, 2011. |
U.S. Appl. No. 13/098,870, filed May 2, 2011. |
Chao, I. et al., “Tag Mechanisms Evaluated for Coordination in Open Multi-Agent Systems,” ESAW 2007, LNAI 4995, Springer-Verlag Berlin Heidelberg, 2008, pp. 254-269. |
Franceschetti et. al., “Closing the Gap in the Capacity of Wireless Networks Via Percolation Theory,” Oct. 26, 2004, IEEE, pp. 1-10. |
Kini et. al., “Fast and efficient randomized flooding on lattice sensor networks”, Nov. 19, 2004, Drexler University, pp. 1-33. |
Weikum, G. et al., “Towards Self-Organizing Query Routing and Processing for Peer-to-Peer Web Search,” DELIS-TR-0287, 2005, 19 pages. |
Notice of Allowance in U.S. Appl. No. 13/098,870, mailed Jun. 28, 2012. |
Office Action in U.S. Appl. No. 11/624,674, mailed Dec. 20, 2011. |
Office Action in U.S. Appl. No. 11/625,279, mailed Sep. 16, 2011. |
Office Action in U.S. Appl. No. 11/625,279, mailed Apr. 17, 2012. |
Office Action in U.S. Appl. No. 11/680,599, mailed Aug. 29, 2011. |
Office Action in U.S. Appl. No. 11/680,599, mailed Apr. 30, 2012. |
Office Action in U.S. Appl. No. 11/923,546, mailed Aug. 27, 2012. |
Office Action in U.S. Appl. No. 12/130,171, mailed Oct. 19, 2011. |
Office Action in U.S. Appl. No. 12/130,171, mailed Aug. 15, 2012. |
Office Action in U.S. Appl. No. 12/436,748, mailed Sep. 16, 2011. |
Office Action in U.S. Appl. No. 12/436,748, mailed May 2, 2012. |
Office Action in U.S. Appl. No. 12/476,205, mailed Oct. 12, 2011. |
Office Action in U.S. Appl. No. 12/476,205, mailed Jul. 3, 2012. |
Office Action in U.S. Appl. No. 12/610,202, mailed May 22, 2012. |
Calado, P., et al., “Combining Link-Based and Content—Based Methods for Web Document Classifications,” CIKM'03, pp. 394-401, ACM, 2003. |
Jiang, et al., “Monotone Percolation and the Topology Control of Wireless Networks”, California Institute of Technology, Electrical Engineering Dept, 0-7803-8968-9/05, 2005, pp. 327-338. |
Kong et al., “Collaborative Spam Filtering Using E-Mail Networks”, Aug. 2006, IEEE, pp. 67-73. |
Newman, et al., “Scaling and percolation in the small-world network model”, Sante Fe Institute, May 6, 1999, pp. 1-12. |
Newman, M.E.J., “Random Graphs as Models of Networks”, SFI Working Paper: 2002-02-005, 2002, pp. 1-36. |
Silverberg, et al., “A Percolation Model of Innovation in Complex Technology Spaces”, Sep. 2002, MERIT—Infonomics Research Memorandum Series, pp. 1-24. |
Wang, G., “Web Search with Personalization and Knowledge”, 2002 Proceedings of the IEEE Fourth International Symposium on Multimedia Software Engineering (MSE '02). |
Zou et al., “Email Virus Propagation Modeling and Analysis”, Univ. of Mass., Dept. of Electrical and Computer Engineering, Dept of Computer Science, 2004, TR-CSE-03-04, pp. 1-17. |
Amendment and Response to Office Action in U.S. Appl. No. 11/125,329, filed Dec. 22, 2009. |
Amendment and Response to Office Action in U.S. Appl. No. 11/125,329, filed Sep. 30, 2010. |
Preliminary Amendment in U.S. Appl. No. 13/098,870, filed Dec. 19, 2011. |
Office Action in U.S. Appl. No. 13/660,940, mailed Jan. 6, 2014. |
Amendment and Response to Office Action in U.S. Appl. No. 13/660,940, filed Apr. 7, 2014. |
Notice of Allowance in U.S. Appl. No. 13/660,940, mailed Jun. 13, 2014. |
Office Action in U.S. Appl. No. 13/660,955, mailed Jan. 8, 2014. |
Amendment and Response to Office Action in U.S. Appl. No. 13/660,955, filed Apr. 7, 2014. |
Notice of Allowance in U.S. Appl. No. 13/660,955, mailed Jun. 16, 2014. |
Response to Restriction Requirement and Amendment in U.S. Appl. No. 11/624,674, mailed Sep. 29, 2009. |
Amendment and Response to Office Action in U.S. Appl. No. 11/624,674, mailed Jun. 15, 2010. |
After Final Response to Office Action and Request for Reconsideration in U.S. Appl. No. 11/624,674, mailed Nov. 19, 2010. |
Advisory Action in U.S. Appl. No. 11/624,674, mailed Feb. 15, 2011. |
Amendment and Response to Office Action in U.S. Appl. No. 11/624,674, mailed Oct. 21, 2011. |
Proposed Amendment and Response to Office Action in U.S. Appl. No. 11/624,674, mailed May 16, 2012. |
Amendment and Response to Office Action in U.S. Appl. No. 11/624,674, mailed May 21, 2012. |
Notice of Allowance in U.S. Appl. No. 11/624,674, mailed Oct. 12, 2012. |
Preliminary Amendment in U.S. Appl. No. 11/625,279, filed May 31, 2007. |
Amendment and Response to Office Action in U.S. Appl. No. 11/625,279, filed Sep. 18, 2009. |
Amendment and Response to Office Action in U.S. Appl. No. 11/625,279, filed Mar. 2, 2010. |
Amendment and Response to Office Action in U.S. Appl. No. 11/625,279, filed Aug. 13, 2010. |
Amendment and Response to Office Action in U.S. Appl. No. 11/625,279, filed Apr. 15, 2011. |
Amendment and Response to Office Action in U.S. Appl. No. 11/625,279, filed Mar. 16, 2012. |
Amendment and Response to Office Action in U.S. Appl. No. 11/625,279, filed Oct. 17, 2012. |
Office Action in U.S. Appl. No. 11/625,279, mailed Feb. 12, 2013. |
Amendment and Response to Office Action in U.S. Appl. No. 11/625,279, filed Aug. 12, 2013. |
Final Office Action in U.S. Appl. No. 11/625,279, mailed Oct. 3, 2013. |
Amendment and Response to Office Action in U.S. Appl. No. 11/625,279, filed Jan. 3, 2014. |
Notice of Allowance in U.S. Appl. No. 11/625,279, mailed Apr. 16, 2014. |
U.S. Appl. No. 14/457,693, filed Aug. 12, 2014. |
Amendment and Response to Office Action in U.S. Appl. No. 11/680,599, filed Oct. 29, 2009. |
Amendment and Response to Office Action in U.S. Appl. No. 11/680,599, filed Aug. 23, 2010. |
Amendment and Response to Office Action in U.S. Appl. No. 11/680,599, filed Aug. 3, 2011. |
Amendment and Response to Office Action in U.S. Appl. No. 11/680,599, filed Feb. 29, 2012. |
Amendment and Response to Office Action in U.S. Appl. No. 11/680,599, filed Oct. 30, 2012. |
Office Action in U.S. Appl. No. 11/680,599, mailed Jul. 3, 2013. |
Amendment and Response to Office Action in U.S. Appl. No. 11/680,599, filed Jan. 3, 2014. |
Notice of Allowance in U.S. Appl. No. 11/680,599, mailed Apr. 30, 2014. |
Amendment and Response to Office Action in U.S. Appl. No. 11/923,546, filed Jun. 28, 2010. |
Amendment and Response to Office Action in U.S. Appl. No. 11/923,546, filed Dec. 2, 2010. |
Amendment and Response to Office Action in U.S. Appl. No. 11/923,546, filed Apr. 28, 2011. |
Amendment and Response to Office Action in U.S. Appl. No. 11/923,546, filed Apr. 5, 2012. |
Amendment and Response to Office Action in U.S. Appl. No. 11/923,546, filed Nov. 6, 2013. |
Office Action in U.S. Appl. No. 11/923,546, mailed Jun. 6, 2013. |
Amendment and Response to Office Action in U.S. Appl. No. 11/923,546, filed Feb. 27, 2013. |
Office Action in U.S. Appl. No. 11/923,546, mailed Aug. 14, 2014. |
Amendment and Response to Office Action in U.S. Appl. No. 12/130,171, filed Aug. 1, 2011. |
Amendment and Response to Office Action in U.S. Appl. No. 12/130,171, filed Apr. 19, 2012. |
Amendment and Response to Office Action in U.S. Appl. No. 12/130,171, filed Feb. 15, 2013. |
Office Action in U.S. Appl. No. 12/130,171, mailed May 28, 2013. |
Amendment and Response to Office Action in U.S. Appl. No. 12/130,171, filed Nov. 27, 2013. |
Office Action in U.S. Appl. No. 12/130,171, mailed Mar. 26, 2014. |
Amendment and Response to Office Action in U.S. Appl. No. 12/130,171, filed Jul. 25, 2014. |
Office Action in U.S. Appl. No. 12/130,171, mailed Sep. 25, 2014. |
Amendment and Response to Office Action in U.S. Appl. No. 12/436,748, filed Mar. 16, 2012. |
Amendment and Response to Office Action in U.S. Appl. No. 12/476,205, filed Apr. 12, 2012. |
Amendment and Response to Office Action in U.S. Appl. No. 12/476,205, filed Aug. 5, 2013. |
Amendment and Response to Office Action in U.S. Appl. No. 12/610,202, filed Oct. 18, 2012. |
Notice of Allowance in U.S. Appl. No. 12/610,202, mailed Dec. 10, 2012. |
Japanese Office Action in Application No. 2008-511129 dated Jul. 26, 2011. |
Applicant Initiated Interview Summary in U.S. Appl. No. 11/923,546, mailed Oct. 23, 2014. |
Applicant Statement of the Substance of Interview (Summary of Interview with Examiner) in U.S. Appl. No. 11/923,546, mailed Nov. 24, 2014. |
Office Action in U.S. Appl. No. 14/017,123, mailed Oct. 23, 2014. |
Office Action in U.S. Appl. No. 12/476,205, mailed Apr. 16, 2015. |
Office Action in U.S. Appl. No. 14/017,123, mailed Apr. 17, 2015. |
Office Action in U.S. Appl. No. 12/130,171, mailed May 30, 2015. |
Office Action in U.S. Appl. No. 11/923,546, mailed Jun. 9, 2015. |
Number | Date | Country | |
---|---|---|---|
20110113032 A1 | May 2011 | US |
Number | Date | Country | |
---|---|---|---|
61252632 | Oct 2009 | US |