A graph (G) is typically a structure consisting of a set of vertices (V) and a set of edges (E) used to represent relationships between the vertices. For instance, a participation graph can be used to relate entities one to another based upon common conditions, properties, parameters, characteristics, activities, connections, and so forth. Such graphs may be employed for analysis of corresponding data sets in many scenarios. An example of such a scenario is in the context of online ad auctions for ad space that is associated with resources from a service provider. In the ad auction context, an auction participation graph is a structure that contains vertices representing advertisers who compete in the auctions and edges that connect two vertices when advertisers represented by the vertices are related by participating in auctions together. The auction participation graph contains a wealth of information that has many applications including advertiser flighting, suggestion of keywords to advertisers, micro-market analysis, and so forth.
Often, analysis of a graph of this type involves finding meaningful subsets or clusters within the graph. For the example auction participation graph, this may correspond to clustering the graph to identify submarkets in the advertiser graph that have relatively low interaction one to another. More generally, clustering may involve identifying clusters to optimize some objective function. Clustering a graph, though, can become quite difficult as the graph becomes larger (more vertices) and more dense (more edges between the vertices) due to the number of computations involved in the clustering. Thus, traditional algorithms used to directly cluster graphs may be unable to successfully perform clustering for larger and denser graphs that arise in some scenarios.
This Summary is provided to introduce a selection of concepts in a simplified form that are further described below in the Detailed Description. This Summary is not intended to identify key features or essential features of the claimed subject matter, nor is it intended to be used to limit the scope of the claimed subject matter.
Various embodiments provide techniques for graph clustering. In one or more embodiments, a participation graph is obtained that represents relationships between entities. An auxiliary graph is constructed based on the participation graph. The auxiliary graph may be constructed such that the auxiliary graph is less dense than the participation graph and is therefore computationally less complex to analyze. Clusters in the auxiliary graph are determined by solving an objective function defined for the auxiliary graph. Clusters determined for the auxiliary graph may then be utilized to ascertain clusters in the participation graph that solve a related objective function defined for the participation graph.
Various embodiments provide techniques for graph clustering. In one or more embodiments, a participation graph is obtained that represents relationships between entities. An auxiliary graph is constructed based on the participation graph. The auxiliary graph may be constructed such that the auxiliary graph is less dense than the participation graph and is therefore computationally less complex to analyze. Clusters in the auxiliary graph are determined by solving an objective function defined for the auxiliary graph. Clusters determined for the auxiliary graph may then be utilized to ascertain clusters in the participation graph that solve a related objective function defined for the participation graph.
In the discussion that follows, a section entitled “Operating Environment” describes but one environment in which the various embodiments can be employed. Following this, a section entitled “Graph Clustering Procedures” describes example techniques for graph clustering in accordance with one or more embodiments. Next, a section entitled “Graph Clustering Implementation Details” describes example algorithms and implementations for graph clustering in accordance with one or more embodiments. Last, a section entitled “Example System” is provided and describes an example system that can be used to implement one or more embodiments.
Service provider 102 can be embodied as any suitable computing device or combination of devices such as, by way of example and not limitation, a server, a server farm, a peer-to-peer network of devices, a desktop computer, and the like. One specific example of a computing device is shown and described below in relation to
Service manager module 112 represents functionality operable by service provider 102 to manage various resources 114 that may be made available over the network 118. Service manager module 112 may manage access to the resources 114, performance of the resources 114, configuration of user interfaces or data to provide the resources 114, and so on. For example, clients 124 may form resource requests 126 for communication to the service provider 102 to obtain corresponding resources 114. In response to receiving such requests, service provider 102 can provide various resources 114 via webpages 128 and/or other user interfaces that are communicated over the network 118 for output by the one or more clients 124.
Resources 114 can include any suitable combination of content and/or services typically made available over a network by one or more service providers. Content can include various combinations of text, video, ads, audio, multi-media streams, animations, images, and the like. Some examples of services include, but are not limited to, a search service, an email service to send and receive email, an instant messaging service to provide instant messages between clients, and a social networking service to facilitate connections and interactions between groups of users who share common interests and activities. Services may also include an advertisement service configured to enable advertisers 120 to place advertisements 122 for presentation to clients 104 in conjunction with resources 114.
For instance, at least some of the webpages 128 can be configured to include advertisements 122 provided by the advertisers 120. Advertisements 122 may be selected for inclusion in webpages through an advertisement service using any suitable techniques for selection and delivery of the ads. In one example, auctions may be conducted for space that is reserved in a webpage 128 for advertisements 122 from the advertisers 120.
The clustering tool 116 is configured to implement aspects of graph clustering techniques described herein. The clustering tool 116 may be configured to make use of various graph data 130 that may be collected, stored, and/or accessed via the service provider 102. Although the example graph data 130 of
Having considered an example operating environment, consider now a discussion of example graph clustering techniques in accordance with one or more embodiments.
Graph Clustering Procedures
The following discussion describes graph clustering techniques that may be implemented utilizing the environment, systems, and/or devices described above and below. Aspects of each of the procedures below may be implemented in hardware, firmware, software, or a combination thereof. The procedures are shown as a set of blocks that specify operations performed by one or more devices and are not necessarily limited to the orders shown for performing the operations by the respective blocks. In portions of the following discussion, reference may be made to the example environment 100 of
Step 202 obtains a participation graph that represents relationships between entities. One way this can occur is by operation of a clustering tool 116 to construct a graph using graph data 130. Additionally or alternatively, clustering tool 116 may be configured to receive a previously constructed graph as input from any suitable source. Clustering tool 116 may then operate to perform graph clustering techniques on the obtained graph using various algorithms.
As noted, a graph (G) as used herein refers to a structure consisting of a set of vertices (V) and a set of edges (E) used to represent relationships between the vertices. For instance, a participation graph can be used to relate entities one to another based upon common conditions, properties, parameters, characteristics, activities, connections, and so forth. In the context of online ad auctions for ad space that is associated with resources 114 from a service provider 102, an auction participation graph is a structure that contains vertices (A) representing advertisers who compete in the auctions and edges (E) that connect two vertices when advertisers represented by the vertices are related by participating in auctions together.
Step 204 constructs an auxiliary graph based on the participation graph. As noted, sometimes a participation graph (e.g., an auction participation graph) can be too dense to permit efficient analysis. Accordingly, to enable more efficient analysis, the clustering tool 116 may be implemented to construct an auxiliary graph corresponding to the participation graph that is easier to work, and involves less computations to analyze. Generally speaking, the auxiliary graph is constructed to contain fewer edges and thus is less dense than the corresponding participation graph. One example of an auxiliary graph that can be constructed is a bipartite graph, which is described in greater detail relation to the following figures. Briefly, though, the bipartite graph introduces vertices representing the relationships in the participation graph and connects entities to the relationship vertices rather than one to another. By so doing, the number of edges used to represent the data set can be reduced.
In at least some embodiments, additional modifications are applied to the bipartite graph to further optimize the graph clustering techniques. Example modifications are described below in the section entitled “Graph Clustering Implementation Examples.”
Step 206 determines clusters by solving an objective function defined for the auxiliary graph. For instance, given the auxiliary graph constructed in step 204, clusters may be determined in the auxiliary graph by solving an objective function The clustering tool 116 may be implemented to make use of various suitable objective functions that are constructed to optimize clustering. In general, suitable objective functions that may be employed (1) measure the number of edges that are removed to form a cluster and (2) penalize larger clusters. The cluster size penalty prevents the algorithm from returning the entire data set as a result of the clustering. Suitable objective functions of this type result in clusters that have a relatively large number of edges within the cluster (close relationships), have relatively few edges leading outside to other clusters (loosely tied to other clusters), and are reasonable in size. Clusters having these characteristics may be desirable because such clusters represent groups of entities who interact often and/or share many common properties. In the ad auction context, the clusters may correspond to isolated submarkets that are considered good candidates for various market analyses. Further details regarding suitable objective functions can be found in relation to the following figures.
Step 208 utilizes the clusters determined for the auxiliary graph to ascertain clusters that solve a related objective function defined for the participation graph. The clustering tool 116 may be configured to reconstruct clusters in the participation graph based upon the clusters obtained from clustering the auxiliary graph. This can involve identifying entities contained in the clusters from the auxiliary graph and forming edges between the entities to reconstruct a cluster that corresponds to a portion of the participation graph.
In particular, an objective function that is employed to cluster the auxiliary graph can be derived from a related objective function defined for the participation graph. Therefore, as described in greater detail below, by carefully designing the objective functions that are used for clustering, clusters that are derived for the auxiliary graph can be used directly to obtain clusters corresponding to the participation graph. In some embodiments, the auxiliary graph is modified to optimize clustering as discussed above and below. These modifications determine the relationship between the objective functions for the participation graph and auxiliary graph.
For instance, when a bipartite graph is employed as the basis for graph clustering techniques, the result of clustering is a set of vertices representing entities that are connected to relationship vertices. A corresponding cluster for the participation graph can be obtained by interconnecting the entities one to another in accordance with their shared connections to the relationship vertices. This involves a simple translation between the bipartite graph and the corresponding participation graph.
Step 302 obtains an undirected bipartite graph that connects entities to relationships of the entities using undirected edges between vertices that represent the entities and vertices that represent the relationships. One way this can occur is by operation of a clustering tool 116 to create a bipartite graph from a corresponding participation graph or data set. Additionally or alternatively, the clustering tool 116 is capable of generating or otherwise obtaining a bipartite graph for a data set directly from graph data 130.
Consider now
Similar graphs can be constructed to represent a variety of relationships. As noted, relationships that exist between entities can include common conditions, properties, parameters, characteristics, activities, connections, and so forth. By way of example and not limitation, graphs similar to participation graph 402 can be used to relate webpages to links shared by the webpages, advertisers to keywords or search terms, auction participants to locations, webpages to ad revenue, or otherwise represent a data set by showing relationships that exist between entities of the data set.
Generally speaking the bipartite graph 404 introduces vertices 410 representing the relationships in the participation graph and connects entities to the relationship vertices 410 using edges 412. Continuing the example of the auction participation graph, vertices 410 may represent auctions. Accordingly, a participant vertex 408 representing an auction participant in the bipartite graph 404 is connected by an edge 412 to a vertex 410 representing an auction when the participant participated in the auction. As illustrated in
The bipartite graph just described is a starting point for graph clustering techniques described herein. In practice, further modifications are made to produce an optimal graph that can be clustered effectively using an objective function of the type described above and below. Some examples of such modifications are described generally in the continuing discussion of procedure 300 of
Returning now to the discussion of
Step 306 assigns directional weights to the directed edges. In particular, clustering tool 116 may assign weights to edges formed in step 306 according to the direction of the edges. When a graph clustering algorithm is applied, the weights have the effect of penalizing removal of some edges and/or vertices from a cluster. For instance the weight may correspond to a cost to remove a vertex by cutting a corresponding edge to produce a cluster. Thus, directional weights applied to the directed edges can be used to make cuts in one direction more costly than cuts in the other direction. In at least some embodiments, a relatively higher weight is assigned to edges traveling from an entity to a relationship, and a relatively lower weight is assigned to edges traveling from a relationship to an entity. The effect is to make it more difficult to cut an entity from an adjacent relationship. For example, a weight of infinity can be assigned to edges traveling from an advertiser to an auction and a weight of one can be assigned to edges traveling from an auction to an advertiser. Of course, various suitable values for edge weights can be employed and can be adjusted to fine-tune graph clustering. Although some examples herein use infinite edges weights, in practice finite values can be employed for simplicity.
Step 308 constructs an auxiliary bipartite graph from the directed graph by connecting the vertices representing an entity with an edge to an arbitrary sink vertex. By so doing, a cost is created for every entity (e.g., advertiser) that is included in a cluster. This is because the edges connected to the arbitrary sink vertex are disconnected when clustering occurs. The edges to the sink vertex can also be weighted using adjustable weights. Edge weights assigned to edges connected to the sink vertex are denoted as a in the following discussion.
Following step 308, an auxiliary graph has been constructed that can be used as an input for graph clustering techniques. Effectively, a general clustering problem for a dense graph has been reduced to a problem of finding clusters within a relatively sparse auxiliary graph. In particular, by constructing the auxiliary bipartite graph in the manner described, the clustering problem is reduced to creating clusters that minimize the number of edges that are cut (e.g., removed) to form the clusters. In at least some embodiments, this can occur by operation of the clustering tool 116 to apply a minimum-cut algorithm to the auxiliary bipartite graph that is generated in the manner described above.
In particular, step 310 selects one of the entities as a target, then step 312 employs minimum-cut on the target to obtain vertices on the target side of the cut. In the example of an auction participation graph, the result obtained is a cluster having a set of advertisers that are connected by edges to auctions in which they participate. This is a subset of the bipartite graph. Now, a corresponding cluster for the auction participation graph can be derived by directly translating the resultant cluster from a bipartite graph form to a participation graph form. Details regarding the minimum-cut algorithm are addressed in the following section.
Having described example procedures involving graph clustering, consider now specific implementation examples that can be employed with one or more embodiments described herein.
Consider now a discussion of clustering algorithms and implementations that may be employed using the previously described devices and systems. In particular, consider the problem of finding clusters in an auction participation graph. More precisely, let A be the set of advertisers. Define a graph G=(A, E) where the set of edges is:
E={(a,b): a and b participated in an auction}
Although the following techniques are described in relation to an example in which advertisers and auctions are related by an auction participation graph, it should be apparent that the techniques may be easily applied to other graphs (G) consisting of a set of vertices (V) and a set of edges (E) used to represent relationships between the vertices. In other words, the graph clustering techniques described herein in are generally applicable to graphs of the form G=(V,E).
Informally, the problem of clustering G is to find a subset of vertices C⊂A such that the weight of edges crossing the cluster is low (such a cluster is said to have low conductance), and the weight of the edges in the cluster is high (such a cluster is said to have high density).
The graph G=(A,E) can unfortunately become too dense to efficiently cluster. To see this, consider an auction in which the advertisers a1, . . . an participated. Such an auction introduces
edges to the graph. Traffic for an example ad delivery system may contain 10 billion edges for just a 1% sample of one day's advertising inventory and corresponding auctions. Further, existing clustering algorithms have running times on the order of O(|V|2|E|) time steps for a graph G=(V,E). Therefore, reducing edges may substantially improve performance of graph clustering.
To this end, the participation graph defined above can be replaced with a corresponding bipartite graph Gb=A␣Q,Eb) where Q is the set of auctions, and thus (a,q)εEb if a participated in an auction q. The graph Gb is constructed so as to contain one edge between an auction and an advertiser and thus is sparse assuming a small set of advertisers participate in a particular auction. Accordingly, analysis of the graph Gb may be faster and less computationally taxing than analysis of the participation graph.
As described in detail in the following discussion, it can be shown that finding small clusters of low conductance in the bipartite graph Gb can be related to finding small low-conductance clusters in the original graph G. Strictly speaking, advertisers in a cluster derived for the bipartite graph Gb may be used directly to construct a corresponding cluster for the participation graph G. To do so, an objective function for the bipartite graph Gb is defined that when solved also solves a related objective function for the original graph G.
For the purposes of example, the graph clustering techniques are described in relation to clustering algorithms which find clusters H that minimize an objective function of the form ƒ(H)=(H,
For each cluster H ⊂G a cluster H′ ⊂Gb can be associated by setting H′⊂H∪Γ(H) where Γ(H) are the auctions adjacent to H. In particular, maps φ can be derived that take a cluster in G and associate the cluster to a cluster in Gb that has the same set of advertisers, namely, φ(H)∩A=H.
Now, assume that g(H) has the following compatibility property with respect to the maps φ:
g(H)=g(φ(H)).
Another way to state this is that g is only a function of the advertisers in H, or that the function g factors as g=g′(H∩A). Examples of such functions are provided later in the discussion.
Consider the following proposition regarding solving objective functions for the graph G using the graph Gb. For Proposition A, let G have edge weights given by
if the vertices a and b participated in an auction with n other vertices. Let Gb have edge weights given by
where q is an auction with n neighbors.
Suppose a clustering algorithm on Gb exists that satisfies the following properties:
Then if H⊂Gb is a cluster produced by the algorithm, then H∩A is a cluster in G that minimizes ƒ(H∩A)=eG(H∩A,
To establish a Proof of Proposition A, let H be the cluster produced by the algorithm and set {hacek over (H)}=H∩A to be the cluster induced by H in G. Then ƒ({hacek over (H)})=eG({hacek over (H)},{hacek over (
Expanding the definition of eG it can be found that:
Where w(q,{hacek over (H)}) (resp. w(q,{hacek over (
Now notice that w(q,{hacek over (H)})+w(q,{hacek over (
Meanwhile, e(H,
A fundamental connection can be drawn that minimizing eG({hacek over (H)},{hacek over (
These two minimization problems have approximately the same solution, i.e., maximize θq. Moreover, the approximation factor can be shown to be
where is an upper bound on the and is the mean of the random variable. Technically, the minimization problem for variance admits a solution that minimizes θq, but the same value of the objective function can be achieved by setting θq′=1−θq. Therefore, the cluster {hacek over (H)} is optimal with respect to the function ƒ({hacek over (H)}) to within a constant factor.
Clustering the Bipartite Auction Participation Graph
With the above in mind, the discussion turns to consideration of a suitable algorithm that efficiently produces a cluster in the bipartite auction participation graph that satisfies all the hypotheses of Proposition A. This algorithm enables clusters of advertisers to be found in the participation graph by clustering the bipartite auction participation graph. The algorithm is described below in relation to three sub-algorithms (Algorithm A, Algorithm B, and Algorithm C) that are used to derive the clustering algorithm.
First consider the following simpler problem. Given an undirected graph G=(V,E) and sεV, find a subset H⊃{s} of the vertices (containing s) that minimizes the objective function e(H,
For Algorithm A, an auxiliary graph G′=(V␣{t},E∪{(v,t):vεV}) is constructed, where t is a new arbitrary sink vertex and the weight of the edges are set to w(v,t)=α. Now using the Max-Flow algorithm, a minimum s-t cut in G′ is computed. H is defined to be the set of vertices on the s side of the cut. By definition of the minimum-cut this gives a set H with sεH, such that e(H,
In the application of Algorithm A above, clusters are obtained that optimize the objective function e(H,
For Algorithm B, consider again a bipartite graph of the form G=(A␣Q,E). To obtain an algorithm that finds a cluster H (with sεH) that minimizes the objective function of e(H,
For Algorithm C, note that the clusters produced by Algorithm B may not satisfy the property: if aεH∩A is an advertiser in the cluster, then Γ(a)⊂H. To remedy this, the weights of the edges in G can be adjusted directionally. In particular, set w(a,q)=∞ whenever (a,q)Σ E where a is an advertiser and q is an auction and w(q,a) has the
if n advertisers participated in the auction q in G. Call this graph with the new directed edge weights {hacek over (G)}. Now, applying Algorithm B to the {hacek over (G)} produces a cluster H that minimizes e(H,
if n advertisers participated in the corresponding auction. Vertices representing the advertisers are also illustrated as being connected to the sink vertex t with a weight of these edges set to α.
Now a second proposition regarding solving objective functions for the graph G using the graph Gb can be stated. For Proposition B, let H be the output of Algorithm C on input Gb=(A␣Q,Eb), then H∩A minimizes the objective function eG(H∩A,
To establish a Proof of Proposition B, the clustering Algorithm C satisfies all the hypothesis of Proposition A discussed above, and thus the conclusion follows.
Note that for the foregoing proof to be successful, an edge weight other than ∞ can be employed. It suffices to take w(a,q)>maxqdeg(q)=1 (because of the weight normalization). Thus, in practice graph clustering techniques described herein can be applied using non-infinite weights when directionally weighting the directed edges introduced into the graph Gb.
The computing device 602 includes one or more processors or processing units 604, one or more memory and/or storage components 606, one or more input/output (I/O) interfaces 608 for input/output (I/O) devices, and a bus 610 that allows the various components and devices to communicate one to another. The bus 610 represents one or more of several types of bus structures, including a memory bus or memory controller, a peripheral bus, an accelerated graphics port, and a processor or local bus using any of a variety of bus architectures. The bus 610 can include wired and/or wireless buses.
The memory/storage component 606 represents one or more computer storage media. The memory/storage component 606 may include volatile media (such as random access memory (RAM)) and/or nonvolatile media (such as read only memory (ROM), Flash memory, optical disks, magnetic disks, and so forth). The memory/storage component 606 may include fixed media (e.g., RAM, ROM, a fixed hard drive, etc.) as well as removable media (e.g., a Flash memory drive, a removable hard drive, an optical disk, and so forth).
The one or more input/output interfaces 608 allow a user to enter commands and information to computing device 600, and also allow information to be presented to the user and/or other components or devices using various input/output devices. Examples of input devices include a keyboard, a cursor control device (e.g., a mouse), a microphone, a scanner, and so forth. Examples of output devices include a display device (e.g., a monitor or projector), speakers, a printer, a network card, and so forth.
Various techniques may be described herein in the general context of software or program modules. Generally, software includes routines, programs, objects, components, data structures, and so forth that perform particular tasks or implement particular abstract data types. An implementation of these modules and techniques may be stored on or transmitted across some form of computer-readable media. The computer-readable media may include a variety of available medium or media that may be accessed by a computing device. By way of example, and not limitation, computer-readable media may comprise “computer-readable storage media.”
Software or program modules, including the graph clustering tool 116, applications 108, service manager module 112, operating system 110, and other program modules, may be embodied as one or more instructions stored on computer-readable storage media. The computing device 602 may be configured to implement particular functions corresponding to the software or program modules stored on computer-readable storage media. Such instructions may be executable by one or more articles of manufacture (for example, one or more computing device 602, and/or processors 604) to implement techniques for graph clustering, as well as other techniques. Such techniques include, but are not limited to, the example procedures described herein. Thus, computer-readable storage media may be configured to store instructions that, when executed by one or more devices described herein, cause various techniques for graph clustering.
The computer-readable storage media includes volatile and non-volatile, removable and non-removable media implemented in a method or technology suitable for storage of information such as computer readable instructions, data structures, program modules, or other data. The computer-readable storage media can include, but is not limited to, RAM, ROM, EEPROM, flash memory or other memory technology, CD-ROM, digital versatile disks (DVD) or other optical storage, hard disks, magnetic cassettes, magnetic tape, magnetic disk storage or other magnetic storage devices, or another tangible media or article of manufacture suitable to store the desired information and which may be accessed by a computer.
Although the invention has been described in language specific to structural features and/or methodological steps, it is to be understood that the invention defined in the appended claims is not necessarily limited to the specific features or steps described. Rather, the specific features and steps are disclosed as preferred forms of implementing the claimed invention.
Number | Name | Date | Kind |
---|---|---|---|
6044361 | Kalagnanam et al. | Mar 2000 | A |
6665653 | Heckerman et al. | Dec 2003 | B1 |
6714975 | Aggarwal et al. | Mar 2004 | B1 |
6990459 | Schneider | Jan 2006 | B2 |
7039931 | Whymark | May 2006 | B2 |
7546249 | Main | Jun 2009 | B2 |
7627878 | Pouliot et al. | Dec 2009 | B2 |
7738984 | Denton et al. | Jun 2010 | B2 |
8001004 | Protheroe et al. | Aug 2011 | B2 |
8082549 | Corley, Jr. et al. | Dec 2011 | B2 |
8224689 | Sandberg et al. | Jul 2012 | B1 |
8234139 | Wan et al. | Jul 2012 | B2 |
8352611 | Maddhuri et al. | Jan 2013 | B2 |
20020049608 | Hartsell et al. | Apr 2002 | A1 |
20020103688 | Schneider | Aug 2002 | A1 |
20030035429 | Mitra et al. | Feb 2003 | A1 |
20030154282 | Horvitz | Aug 2003 | A1 |
20030225635 | Renz et al. | Dec 2003 | A1 |
20030236745 | Hartsell et al. | Dec 2003 | A1 |
20040010592 | Carver et al. | Jan 2004 | A1 |
20040044571 | Bronnimann et al. | Mar 2004 | A1 |
20040257858 | Mansingh et al. | Dec 2004 | A1 |
20040267686 | Chayes et al. | Dec 2004 | A1 |
20050102398 | Zhang et al. | May 2005 | A1 |
20050144065 | Calabria et al. | Jun 2005 | A1 |
20050234935 | Barsness et al. | Oct 2005 | A1 |
20060049468 | Cheng et al. | Mar 2006 | A1 |
20060080171 | Jardins et al. | Apr 2006 | A1 |
20060089140 | Zhang | Apr 2006 | A1 |
20060253328 | Kohli et al. | Nov 2006 | A1 |
20070016473 | Anderson et al. | Jan 2007 | A1 |
20070016918 | Alcorn et al. | Jan 2007 | A1 |
20070027756 | Collins et al. | Feb 2007 | A1 |
20070083885 | Harding | Apr 2007 | A1 |
20070118432 | Vazirani et al. | May 2007 | A1 |
20070162882 | Ueda | Jul 2007 | A1 |
20070198350 | O'Kelley et al. | Aug 2007 | A1 |
20070214115 | Liu et al. | Sep 2007 | A1 |
20070255690 | Chang et al. | Nov 2007 | A1 |
20070256095 | Collins | Nov 2007 | A1 |
20080004990 | Flake et al. | Jan 2008 | A1 |
20080010304 | Vempala et al. | Jan 2008 | A1 |
20080016217 | Astley et al. | Jan 2008 | A1 |
20080071603 | Denton et al. | Mar 2008 | A1 |
20080080396 | Meijer et al. | Apr 2008 | A1 |
20080134193 | Corley et al. | Jun 2008 | A1 |
20080255937 | Chang et al. | Oct 2008 | A1 |
20080256034 | Chang et al. | Oct 2008 | A1 |
20080275899 | Baluja et al. | Nov 2008 | A1 |
20080313089 | Du Preez | Dec 2008 | A1 |
20090037267 | Duggal et al. | Feb 2009 | A1 |
20090070177 | Agarwal et al. | Mar 2009 | A1 |
20090106184 | Lang et al. | Apr 2009 | A1 |
20090144168 | Grouf et al. | Jun 2009 | A1 |
20090248478 | Duggal et al. | Oct 2009 | A1 |
20090254420 | Curd et al. | Oct 2009 | A1 |
20090313120 | Ketchum | Dec 2009 | A1 |
20100042496 | Wang et al. | Feb 2010 | A1 |
20100082393 | Vassilvitskii et al. | Apr 2010 | A1 |
20100082412 | Brower et al. | Apr 2010 | A1 |
20100106332 | Chassin et al. | Apr 2010 | A1 |
20100106641 | Chassin et al. | Apr 2010 | A1 |
20100121694 | Bharadwaj et al. | May 2010 | A1 |
20100131592 | Zhang et al. | May 2010 | A1 |
20100175111 | Schreiber et al. | Jul 2010 | A1 |
20100191558 | Chickering et al. | Jul 2010 | A1 |
20100279600 | Noh et al. | Nov 2010 | A1 |
20100299215 | Feldman et al. | Nov 2010 | A1 |
20110106922 | Bouillet et al. | May 2011 | A1 |
20110126207 | Wipfel et al. | May 2011 | A1 |
20110145512 | Adl-Tabatabai et al. | Jun 2011 | A1 |
20110187717 | Jagannath et al. | Aug 2011 | A1 |
20110208559 | Fontoura et al. | Aug 2011 | A1 |
20110231215 | Santos et al. | Sep 2011 | A1 |
20110238490 | Simard | Sep 2011 | A1 |
20110246312 | Meek | Oct 2011 | A1 |
20110251889 | Simard | Oct 2011 | A1 |
20110258045 | Chickering | Oct 2011 | A1 |
20110320606 | Madduri et al. | Dec 2011 | A1 |
20120265884 | Zhang et al. | Oct 2012 | A1 |
20130013357 | Liu et al. | Jan 2013 | A1 |
20130046924 | Adl-Tabatabai et al. | Feb 2013 | A1 |
20130046925 | Adl-Tabatabai et al. | Feb 2013 | A1 |
20130117062 | Rangarajan | May 2013 | A1 |
20130117454 | Rangarajan | May 2013 | A1 |
Entry |
---|
Buchbinder et al., “Online Primal-Dual Algorithms for Maximizing Ad-Auctions Revenue,” Oct. 2007, Springer Berlin Heidelberg, vol. 4698, pp. 255-260. |
Luxburg, “A tutorial on spectral clustering,” Aug. 2007, Springer Science+Business Media, pp. 401 and 412. |
“Final Office Action”, U.S. Appl. No. 12/751,533, (Oct. 25, 2012), 10 pages. |
“Restriction Requirement”, U.S. Appl. No. 13/288,650, (Dec. 10, 2012), 9 pages. |
“Final Office Action”, U.S. Appl. No. 12/732,031, (Jan. 30, 2013), 15 pages. |
“Non-Final Office Action”, U.S. Appl. No. 13/288,650, (Mar. 29, 2013), 17 pages. |
“Non-Final Office Action”, U.S. Appl. No. 13/348,465, Oct. 23, 2013, 15 pages. |
“Final Office Action”, U.S. Appl. No. 13/288,650, Sep. 30, 2013, 14 pages. |
“Advisory Action”, U.S. Appl. No. 13/288,650, Jan. 29, 2014, 7 pages. |
Schwaighoger, et al., “Scalable Clustering and Keyword Suggestion for Online Advertisements”, Retrieved at <<http://research.microsoft.com/pubs/81021/adClustering.pdf>>In the proceedings of ADKDD'09, Jun. 28, 2009, pp. 10. |
Carrasco, et al., “Clustering of bipartite advertiser-keyword graph”, Retrieved at <<http://www.soe.ucsc.edu/classes/ism293/Spring09/material/papers/lec6/clusteringBidTerms.pdf>>2003, pp. 8. |
Graph Partitioning, Retrieved at <<http://research.yahoo.com/node/2368>>Retrieved Date: Jan. 20, 2010, p. 1. |
Lang, et al., “Finding Dense and Isolated Submarkets in a Sponsored Search Graph”, Retrieved at <<http://www.google.co.in/url?sa=t&source=web&ct=res&cd=3&ved=0CBMQFjAC&url=http%3A%2Fresearch.yahoo.com%2Ffiles%2Fcikm636-langk.pdf&rct32 j&q=%22Finding+dense+and+isolated+submarkets+in+a+sponsored+search+spending+graph+by+Kevin+J.+Lang+and+Reid+Andersen%22&ei=EUpVS7yGGKK60gTMzbSsCg&usg=AFQjCNEVxq-U8y2Xd5671VbK—fxLfq0WIA>> In the proceedings of the sixteenth ACM conference on information and knowledge management, Nov. 6-10, 2007, pp. 613-622. |
“Display Inventory Allocation Optimization”, Retrieved from: <http://labs.yahoo.com/event/60>on Jan. 21, 2010, 1 page. |
“Final Office Action”, U.S. Appl. No. 12/757,634, (Oct. 11, 2012), 14 pages. |
“Final Office Action”, U.S. Appl. No. 12/761,961, (Oct. 19, 2012), 14 pages. |
“Non-Final Office Action”, U.S. Appl. No. 12/761,961, (Jun. 19, 2012), 12 pages. |
“Non-Final Office Action”, U.S. Appl. No. 12/732,031, (Sep. 7, 2012), 14 pages. |
“Non-Final Office Action”, U.S. Appl. No. 12/751,533, (Aug. 28, 2012), 9 pages. |
“Non-Final Office Action”, U.S. Appl. No. 12/757,634, (Jun. 19, 2012), 11 pages. |
“Restriction Requirement”, U.S. Appl. No. 12/751,533, (Jun. 28, 2012), 10 pages. |
Abrams, Zoe et al., “Optimal Delivery of Sponsored Search Advertisements Subject to Budget Constraints”, In EC '07: Proceedings of the 8th ACM conference on Electronic commerce, pp. 272-278, New York, NY , USA, 2007 . ACM., available at <http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.138.7165&rep=rep1&type=pdf>,(2007), pp. 272-278. |
Aggarwal, Gagan et al., “Truthful Auctions for Pricing Search Keywords”, EC 2006, Available at <http://delivery.acm.org/10.1145/1140000/1134708/p1-aggarwal.pdf?key1=1134708&key2=3637314621&coll=GUIDE&dl=GUIDE&CFID=74229622&CFTOKEN=92771433>,(Jun. 2006), pp. 1-7. |
Agrawal, Shipra et al., “A Dynamic Near-optimal Algorithm for Online Linear Programming”, arXiv:0911.2974v1, 2009, available at <http://arxiv.org/PS—cache/arxiv/pdf/0911/0911.2974v1.pdf>,(2009), pp. 1-18. |
Alaei, Saeed “Online Allocation of Display Advertisements Subject to Advanced Sales Contracts”, ADKDD 2009, Available at <http://research.yahoo.com/files/23ADKDD09.pdf>,(Jun. 2009), 9 pages. |
Anastasakos, Tasos et al., “A Collaborative Filtering Approach to Sponsored Search”, Retrieved at <<http://research.yahoo.com/files/YL-2009-006-query-ad-click-graph.pdf>>, Yahoo!Labs Technical Report No. YL-2009-006, (Aug. 14, 2009), 24 pages. |
Arora, Dushyant et al., “On the Benefit of Virtualization: Strategies for Flexible Server Allocation”, Proceedings: USENIX Workshop on Hot Topics in Management of Internet, Cloud, and Enterprise Networks and Services (Hot-ICE), (2011), 6 pages. |
Arora, Sanjeev et al., “The Multiplicative Weights Update Method: A Meta Algorithm and Applications”, Technical report, 2005, available at <http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.80.1997&rep=rep1&type=pdf>,(2005), 31 pages. |
Babaioff, Moshe et al., “Matroids, Secretary Problems, and Online Mechanisms”, Proceedings of the Eighteenth Annual ACM-SIAM Symposium on Discrete Algorithms, Available at <http://www.cs.cornell.edu/˜rdk/papers/matsec.pdf>,(2007), 10 pages. |
Bahmani, Bahman et al., “Improved Bounds for Online Stochastic Matching”, ESA, pp. 170-181, 2010, available at <http://www.stanford.edu/˜kapralov/papers/stochastic-final.pdf>,(2010) pp. 170-181. |
Balcan, Maria-Florina et al., “Mechanism Design via Machine Learning”, FOCS 2005, Available at <http://www.cs.tau.ac.il/˜mansour/papers/05-focs-aa.pdf>,(Oct. 2005), 10 pages. |
Borgs, Christian et al., “Multi-Unit Auctions with Budget-Constrained Bidders”, Proceedings of the 6th ACM Conference on Electronic Commerce, Available at <http://research.microsoft.com/en-us/um/people/borgs/Papers/budget.pdf>,(2005), pp. 1-11. |
Buchbinder, Niv “Online Primal-Dual Algorithms for Maximizing Ad-Auctions Revenue”, ESA 2007, Abstract, Available at <http://www.springerlink.com/content/52n2011t207301r5/fulltext.pdf?page=1>,(2007), 1 page. |
Buchbinder, Niv et al., “Online Primal-dual Algorithms for Maximizing Ad-auctions Revenue”, In ESA'07: Proceedings of the 15th annual European conference on Algorithms, pp. 253-264, Berlin, Heidelberg, 2007. Springer-Verlag, available at <http://www.math.tu-berlin.de/coga/teaching/wt07/online/pdf/ad-auction-paper.pdf>,(2007), pp. 253-264. |
Chakraborty, Tanmoy et al., “Selective Call Out and Real Time Bidding”, WINE, 2010, available at <http://arxiv.org/PS—cache/arxiv/pdf/1002/1002.3102v2.pdf>,(2010), pp. 1-24. |
Charles, Denis et al., “Fast Algorithms for Finding Matchings in Lopsided Bipartite Graphs with Applications to Display Ads”, EC '10: Proceedings of the 11th ACM conference on Electronic commerce, pp. 121-128, New York, NY, USA, 2010. ACM, available at <http://research.microsoft.com/en-us/um/people/nikdev/pubs/waterlevel.pdf>,(2010), 8 pages. |
Chickering, David M., et al., “Goal-Oriented Clustering”, Microsoft Research, Available at <http://research.microsoft.com/en-us/um/people/dmax/publications/goalclust.pdf>,(May 2000), 9 pages. |
Chickering, David M., et al., “Targeted Advertising with Inventory Management”, EC '00, Available at <http://research.microsoft.com/en-us/um/people/dmax/publications/ec00.pdf>, (Aug. 25, 2000), 5 pages. |
Chickering, David M., et al., “Targeted Internet Advertising Using Predictive Clustering and Linear Programming”, Microsoft Research, Available at <http://www.mapscapital.com/files/TargetedAdsPredictiveClusters.pdf>, 7 pages, 2004. |
Constantin, Florin et al., “An Online Mechanism for Ad Slot Reservations with Cancellations”, Proceedings of the twentieth Annual ACM-SIAM Symposium on Discrete Algorithms, Available at <http://www.siam.org/proceedings/soda/2009/SODA09—137—constantinf.pdf>,(2009), pp. 1265-1274. |
Davis, Jeff “Truviso 3.2”, Retrieved from: <http://www.truviso.com/blog/> on Feb. 24, 2010, (Sep. 4, 2009), 6 pages. |
Devanur, Nikhil R., et al., “The Adwords Problem: Online Keyword Matching with Budgeted Bidders Under Random Permutations”, In John Chuang, Lance Fortnow, and Pearl Pu, editors, ACM Conference on Electronic Commerce, pp. 71-78. ACM, 2009, available at <http://research.microsoft.com/en-us/um/people/nikdev/pubs/adwords.pdf>,(2009), 8 pages. |
Edelman, Benjamin et al., “Internet Advertising and the Generalized Second-Price Auction: Selling Billions of Dollars Worth of Keywords”, Available at <http://rwj.berkeley.edu/schwarz/publications/gsp051003.pdf>,(Oct. 3, 2005), pp. 1-21. |
Edelman, Benjamin et al., “Strategic Bidder Behavior in Sponsored Search Auctions”, Decision Support Systems, vol. 43, Issue 1, Available at <http://www.benedelman.org/publications/cycling-060703.pdf>,(Feb. 2007), pp. 1-12. |
Feldman, Jon et al., “Budget Optimization in Search-Based Advertising Auctions”, EC 2007, Available at <http://algo.research.googlepages.com/uniform.pdf>,(Jun. 2007), pp. 40-49. |
Feldman, Jon et al., “Offline Optimization for Online Ad Allocation”, Fifth Workshop on Ad Auctions 2009, Available at <http://www.eecs.harvard.edu/cs286r/papers/adauctions2009.pdf>,(2009), 8 pages. |
Feldman, Jon et al., “Online Stochastic Matching: Beating 1-1/e”, In FOCS '09: Proceedings of the 2009 50th Annual IEEE Symposium on Foundations of Computer Science, pp. 117-126, Washington, DC, USA, 2009. IEEE Computer Society, available at <http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=5438641>,(2009), pp. 117-126. |
Fleischer, Lisa K., “Approximating Fractional Multicommodity Flow Independent of the Number of Commodities”, Siam J. Discret. Math., 13(4):505-520, 2000, available at <http://researchweb.watson.ibm.com/people/I/Ikf/papers/Ipmulti.pdf>,(2000), 16 pages. |
Garg, Naveen et al., “Faster and Simpler Algorithms for Multicommodity Flow and other Fractional Packing Problems”, In FOCS '98: Proceedings of the 39th Annual Symposium on Foundations of Computer Science, p. 300, Washington, DC, USA, 1998. IEEE Computer Society, available at <http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.130.6527&rep=rep1&type=pdf >,(1998), 22 pages. |
Goel, Ashish et al., “Price Based Protocols for Fair Resource Allocation:Convergence Time Analysis and Extension to Leontief Utilities”, Proceedings of the nineteenth annual ACM-SIAM symposium on Discrete algorithms, available at <http://research.microsoft.com/en-us/people/hamidnz/fair-price-based-protocols-soda08.pdf>,(2008), 9 pages. |
Goel, Gagan et al., “Online Budgeted Matching in Random Input Models with Applications to Adwords”, Proceedings of the Nineteenth Annual ACM-SIAM Symposium on Discrete Algorithms, Available at <http://delivery.acm.org/10.1145/1350000/1347189/p982-goel.pdf?key1=1347189&key2=2167314621&coll=GUIDE&dl=GUIDE&CFID=74230087&CFTOKEN=71191607>,(2008), pp. 982-991. |
Hoeffding, Wassily “Probability Inequalities for Sums of Bounded Random Variables”, Journal of the American Statistical Association, vol. 58, No. 301, Abstract, Retrieved from: <http://www.jstor.org/pss/2282952> on Jan. 21, 2010,(Mar. 1963), 2 pages. |
Kalyanasundaram, Bala et al., “An Optimal Deterministic Algorithm for Online B-Matching”, Theor. Comput. Sci., 233(1-2):319-325, 2000, available at <http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.56.8916&rep=rep1&type=pdf>,(2000), 9 pages. |
Kamath, Anil et al., “Routing and Admission Control in General Topology Networks with Poisson Arrivals”, In SODA '96: Proceedings of the seventh annual ACM-SIAM symposium on Discrete algorithms, pp. 269-278, Philadelphia, PA USA, 1996. Society for Industrial and Applied Mathematics, available at <http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.31.5443&rep=rep1&type=pdf>,(1996), 10 pages. |
Karande, Chinmay et al., “Online Bipartite Matching with Unknown Distributions”, STOC '11, Jun. 6-8, 2011, San Jose, CA, retrieved from <http://portal.acm.org/citation.cfm?id=1993715&preflayout=flat> on Aug. 3, 2011,(Jun. 6, 2011), pp. 587-596. |
Karp, Richard M., “An Optimal Algorithm for On-Line Bipartite Matching”, Proceedings of the Twenty-Second Annual ACM Symposium on Theory of Computing, Available at <http://www.eecs.berkeley.edu/˜vazirani/pubs/online.pdf>,(1990), pp. 352-358. |
Kleinberg, Robert “A Multiple-Choice Secretary Algorithm with Applications to Online Auctions”, Proceedings of the Sixteenth Annual ACM-SIAM Symposium on Discrete Algorithms, http://delivery.acm.org/10.1145/1080000/1070519/p630-kleinberg.pdf?key1=1070519&key2=0877314621&coll=GUIDE&dl=GUIDE&CFID=72544299&CFTOKEN=88467143,(2005), pp, 630-631. |
Lahaie, Sebastien “An Analysis of Alternative Slot Auction Designs for Sponsored Search”, EC 2006, Available at <http://research.yahoo.com/files/fp185-lahaie.pdf>,(Jun. 2006), 10 pages. |
Mahdian, Mohammad “Allocating Online Advertisement Space with Unreliable Estimates”, ACM Conference on Electronic Commerce, Available at <http://www.stanford.edu/˜saberi/hamid-adwords.pdf>,(2007), pp. 1-11. |
Mahdian, Mohammad et al., “Online Bipartite Matching with Random Arrivals: An Approach Based on Strongly Factor-Revealing”, STOC, 2011, available at <http://mahdian.org/onlinematching.pdf>, (2011), 9 pages. |
Manshadi, Vahideh et al., “Online Stochastic Matching: Online Actions Based on Offline Statistics”, SODA, 2011, available at <http://arvix.org/pdf/1007.1673>,(Jul. 9, 2010), 15 pages. |
Mehta, Aranyak et al., “Adwords and Generalized Online Matching”, Journal of the ACM (JACM), vol. 54, Issue 5, Available at <http://www.stanford.edu/˜saberi/adwords.pdf>,(Oct. 2007), pp. 1-19. |
Motwani, Rajeev et al., “Fractional Matching via Balls-And-Bins”, APPROX-RANDOM, pp. 487-498, 2006, available at <http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.94.6903&rep=rep1&type=pdf>,(2006), 11 pages. |
Nisan, Noam “Algorithmic Game Theory”, Sponsored Search Auctions, Available at <http://www.stanford.edu/˜saberi/sponsored-search.pdf>,(Sep. 24, 2007), pp. 1-25. |
Nodelman, URI et al., “Continuous Time Bayesian Networks”, Stanford University, Available at <http://ai.stanford.edu/˜nodelman/papers/ctbn.pdf>,(Jun. 10, 2007), 11 pages. |
Plotkin, Serge A., et al., “Fast Approximation Algorithms for Fractional Packing and Covering Problems”, SFCS '91: Proceedings of the 32nd annual symposium on Foundations of computer science, pp. 495-504, Washington, DC, USA, 1991. IEEE Computer Society, available at <http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=185411>,(1991), pp. 495-504. |
Radovanovic, Ana et al., “Revenue Maximization Through ‘Smart’ Inventory Management in Reservation-Based Online Advertising”, SIGMETRICS Performance Evaluation Review, available at <http://www.anaradovanovic.com/AllocPoliciesMaMAsub.pdf>,(2010), 3 pages. |
Rangarajan, et al., “Online Resource Allocation Algorithms”, U.S. Appl. No. 13/288,650, (Nov. 3, 2011), 44 pages. |
Ranjan, Supranamaya et al., “High-Performance Resource Allocation and Request Redirection Algorithms for Web Clusters”, retrieved from http://ieeexplore.ieee.org/stamp/stamp.jsp?arnumber=04407691, Proceedings of IEEE Transactions on Parallel and Distributed Systems,(Sep. 2008), pp. 1186-1200. |
Regelson, Moira et al., “Predicting Click-Through Rate Using Keyword Clusters”, EC '06, Available at <http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.130.9999&rep=rep1&type=pdf>,(Jun. 11, 2006), 6 pages. |
Schwind, Michael et al., “Using Shadow Prices for Resource Allocation in a Combinatorial Grid With Proxy-Bidding Agents”, available at <http://www/wiiw.de/publikationen/UsingShadowPricesforResourceAI1596.pdf/˜saberi/adwords.pdf>,(2006), 8 pages. |
Varian, Hal R., “Position Auctions”, International Journal of Industrial Organization, vol. 25, Available at <http://people.ischool.berkeley.edu/˜hal/Papers/2006/position.pdf>,(2007),pp. 1163-1178. |
Vee, Erik et al., “Optimal Online Assignment with Forecasts”, EC '10: Proceedings of the 11th ACM conference on Electronic commerce, pp. 109-118, New York, NY, USA, 2010. ACM , available at <http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.155.161&rep=rep1&type=pdf>,(2010), 19 pages. |
Vickrey, William “Counterspeculation, Auctions, and Competitive Sealed Tenders”, Journal of Finance, vol. 16, Issue 1,(Mar. 1961), pp. 8-37. |
Young, Neal E., “Randomized Rounding without Solving the Linear Program”, SODA '95: Proceedings of the sixth annual ACM-SIAM symposium on Discrete algorithms, pp. 170-178, Philadelphia, PA, USA, 1995. Society for Industrial and Applied Mathematics, available at <http://arxiv.org/pdf/cs/0205036>,(1995), 10 pages. |
“Non-Final Office Action”, US. Appl. No. 12/757,634, Jun. 19, 2014, 7 pages. |
“Non-Final Office Action”, U.S. Appl. No. 13/288,650, Mar. 20, 2014, 13 pages. |
“Notice of Allowance”, U.S. Appl. No. 12/732,031, Jun. 3, 2014, 10 pages. |
“Notice of Allowance”, U.S. Appl. No. 12/751,533, May 9, 2014, 11 pages. |
“Final Office Action”, U.S. Appl. No. 13/348,465, Jul. 8, 2014, 18 pages. |
“Non-Final Office Action”, U.S. Appl. No. 12/732,031, Aug. 15, 2014, 6 pages. |
“Non-Final Office Action”, U.S. Appl. No. 12/751,533, Aug. 14, 2014, 6 pages. |
“Non-Final Office Action”, U.S. Appl. No. 12/761,961, Jul. 3, 2014, 13 pages. |
“Non-Final Office Action”, U.S. Appl. No. 13/288,650, Oct. 24, 2014, 16 pages. |
Number | Date | Country | |
---|---|---|---|
20110234594 A1 | Sep 2011 | US |