Advertising is typically the primary source of revenue for commercial search sites that provide search services to the public. When a user submits a search query to a commercial search site, an advertising service associated with the search site may decide whether to display one or more advertisements with the search results. Further, if advertisements are to be displayed, the advertising service also determines which ads to display from among available candidate ads, and how to rank or position the ads with the search results.
In some cases the ads are chosen based, at least in part, on an auction bidding process. In the auction bidding process, advertisers bid a certain amount to have their ads displayed with search results in response to queries containing one or more specified keywords. Thus, the amount of the bid may influence whether the ad is displayed and may also influence the rank or position of the ad. Additionally, various methods may be applied for charging the advertisers for the advertising service. For example, the advertisers may be charged based on the number of ad impressions displayed to users, may be charged when a user clicks on an ad displayed with the search results, and the like.
In such an advertising-based revenue model, it is desirable that the advertisements provide information that is useful to the user and relevant to the user's search query. For example, if the advertising service presents ads that a user finds useful, then the user will be more likely to click on the ads displayed, and also more likely to click on ads in the future. This can result in increased revenue for the advertising service, while also fulfilling the expectations of the advertisers. Accordingly, the advertising service may strive to ensure advertisement suitability by gauging the quality of advertisements submitted by advertisers.
To determine advertisement quality, a quality score may be used as a dynamic variable assigned to ads and keywords. The quality score may provide a measure as to how relevant a particular ad is to a particular keyword and/or to a user's search query. Thus, the quality score may influence whether an ad is displayed with search results, and the rank or position of the ad in the search results. Quality score may also be applied, at least in part, when determining the minimum value of bids accepted for particular keywords. For instance, the higher the quality score, the better the ad position and the lower the amount of the minimum accepted bid for a particular keyword. Consequently, being able to accurately estimate the quality score of an ad-keyword pair can provide benefits to the advertising service, the advertisers and the users of a search site.
This Summary is provided to introduce a selection of concepts in a simplified form that are further described below in the Detailed Description. This Summary is not intended to identify key or essential features of the claimed subject matter; nor is it to be used for determining or limiting the scope of the claimed subject matter.
Some implementations disclosed herein provide techniques for determining quality scores for advertisements and associated keywords. For example, some implementations herein provide a table-based approach for quality score estimation. In some implementations, training data is used to generate a table that can be used to determine quality scores. For example, a graph may be constructed based on the training data, and the graph may be cut into subgraphs to construct the table. Further, some implementations may employ a smoothing technique to smooth the table.
The detailed description is set forth with reference to the accompanying drawing figures. In the figures, the left-most digit(s) of a reference number identifies the figure in which the reference number first appears. The use of the same reference numbers in different figures indicates similar or identical items or features.
The technologies described herein generally relate to determining a quality score for an advertisement. For example, the quality score may be estimated for an advertisement or group of advertisements paired with a keyword (i.e., an ad-keyword pair) for use in an advertising service. Some implementations provide a lookup-type table approach for determining a quality score. A quality score table may be constructed using, as training data, historic data recorded for a plurality of ad-keyword pairs. For example, a directed graph may be constructed based on the training data. Next, a graph division or cutting technique may be applied to the graph to generate the quality score table. Further, in some implementations, the quality score table may be smoothed, such as by using a greedy smoothing technique, to achieve a monotonic increasing property. The smoothing can help limit the effects of any noisy training data and ensure that the results obtained during use of the quality score table remain consistent with the basic underlying assumptions and expectations.
As used herein the term “ad-keyword pair” may refer to a single advertisement or may refer to a group of advertisements (i.e., an ad group) that is paired with a bid keyword. For example, an ad group may include a one or more ads and one or more keywords that the advertiser would like the ads displayed in association with. Thus, depending on a desired implementation, quality scores may be determined for individual ads, for ad groups, or for both.
According to some implementations, for a particular ad-keyword pair, three advertisement parameters may be determined, notated herein as a triple (x, y, z). The ad parameter x may represent the relevance between the keyword and the ad group. For example, an ad typically includes one or two lines of text (referred to as “ad copy”), which along with a hyperlink to a landing page, are typically presented to a user with the search results. Accordingly, relevant ad copy should contain the bid keyword to be relevant. It is also desirable for the ad copy to include good grammatical structure, dynamic text, unique selling points, be focused toward an identified potential customer, and motivate the user to click on the ad. The ad parameter y may represent the relevance between the keyword and the landing page of the ad group. For example, landing page relevance may refer to the relevance of a webpage that a user is directed to when the user clicks on an ad (e.g., clicks on the hyperlink displayed with the ad copy). The landing page should be directly related to the ad and the bid keyword and/or the keywords contained in user's search query. The ad parameter z may represent the quality of the landing page. Landing page quality refers to the quality of the webpage that the user is directed to when the user clicks on an ad. For example, the landing page should adhere to certain editorial guidelines, be well organized, and make it easy for the user to purchase a product, sign up for a service, create an account, or the like. Further, the landing page should not contain a large amount of unrelated advertising, not contain misleading offers, spyware, or have functionality problems.
Various algorithms may be executed for determining the ad parameters (x, y, z) according to the considerations set forth above. In some implementations, the three ad parameters (x, y, z) may be quantified as integers, ranging from 1 to 10 so that there are ten possible values for each parameter, with 1 representing a low relevance or quality, and 10 representing a high relevance or quality. Implementations herein determine the quality score based on the given triple (x, y, z). According to some implementations, the quality score may be an integer, and may also range from 1 to 10. However, quality scores 9 and 10 are typically reserved, so the actual quality score may range in value from 1 to 8. Accordingly, given the three parameters (x, y, z) determined for a particular ad-keyword pair, implementations herein may use a quality score table to determine a quality score having a value between 1 and 8 to represent the particular ad keyword pair.
In some implementations, underlying quality score estimation represented by the quality score table is based on a rank correlation between the quality score and a ground truth. For example, the ground truth could be some historic performance indicator (PI) or other historic data that has been recorded for a plurality of ad-keyword pairs that have been in use in the ad service. Suitable historic PIs may include number of impressions, number of clicks, click through rate, or the like. The number of impressions is the number of times that an ad is displayed to users, such as in a search results pages. The number of clicks is the number of times that users click on the displayed ad. The measured click-through rate is the number of times the ad is actually clicked on in comparison with the number of impressions of the ad that have been presented. Further, each keyword is typically associated with plurality of ads or ad groups of a plurality of advertisers. These ad groups associated with the same keyword can be ranked according to the quality scores, and can also be ranked according to the ground truth.
Because the relationship between the quality score and the three ad parameters (x, y, z) can be complicated, a simple linear combination of the three ad parameters may typically be inadequate for consistently determining an accurate quality score. Accordingly, rather than utilizing a linear combination approach, such as an explicit function to map the triple (x, y, z) to a quality score, implementations herein construct a quality score table for mapping an ad-keyword pair to a quality score. For example, given the triple (x, y, z), implementations herein are able to determine a subgraph location for the ad-keyword pair and thereby, based on the identified subgraph, determine a quality score from the quality score table.
Some implementations herein may employ several steps to construct the quality score table. In a first step, a directed graph may be constructed based on the historic performance training data. Next, a quality score table is generated using an iterative graph cutting technique to generate a plurality of subgraphs. Each subgraph may correspond to different quality score value. Additionally, a smoothing operation may be used in some implementations to improve the consistency of the results achieved during application of the quality score table. Further, the table construction process herein utilizes a machine-learning-based approach. For example, the first two steps employ training data such as historical auction performance data and results. As mentioned above, the training data may include the historic logs of at least one performance indicator as training data, which may have been collected over a period of time, such as an immediately preceding week.
Following construction of the quality score table, the quality score table may be used to determine quality scores for any existing ad-keyword pairs for which the ad parameters (x, y, z) can be determined. Further, because the training data may change on a regular basis, the quality score table may be updated regularly and thereby the quality scores for current ad-keyword pairs may be dynamically and automatically updated on a regular basis without human involvement. Accordingly, the quality score table herein enables the quality scores to remain current and accurate over time as the ad service and/or an associated search service evolves.
In some implementations, the quality scores determined for a plurality of ad-keyword pairs may be used by the advertising service when determining when and where to use ads, how to rank ads, and the like. Additionally, the quality scores may be applied when determining an amount of a minimum acceptable bid for an ad-keyword pair of an advertiser. Furthermore, the advertising service may provide the quality score for a particular ad-keyword pair as feedback to the advertiser to enable the advertiser to improve the ad, and thereby improve the ad ranking and placement. By improving the quality scores of their ads, advertisers may improve the rankings and effectiveness of their ads, since users are more likely to click on ads of higher quality. Further, because payment by the advertisers to the advertising service may be based, at least in part, on whether users actually click on the ads, having ads of higher quality can also increase the revenue of the advertising service.
Advertising service 102 may include an advertiser interface component 108 that enables advertiser 104 to access and utilize advertising service 102. Advertiser interface component 108 may provide a series of webpages, or the like, that present a graphic user interface to advertiser 104 to enable advertiser 104 to submit one or more advertisements 110 to advertising service 102. For example, advertiser 104 may submit an advertisement 110 in an ad submission request 112 transmitted to advertising service 102 over network(s) 106. In some implementations, advertiser 104 may use the advertiser interface component 108 to create the advertisement 110, while in other implementations, the advertiser 104 may create the advertisement 110 independently and submit the advertisement 110 to the advertiser interface component 108 with the ad submission request 112.
The ad submission request 112 may further identify one or more keywords 114 that the advertiser 104 would like the advertisement 110 to be displayed in connection with. Additionally, in implementations in which the advertising service 102 uses an auction-type revenue model, the ad submission request 112 may also include a bid amount that the advertiser 104 is willing to pay the advertising service 102 for displaying the advertisement 110 in connection with the keyword 114. For example, the advertiser may pay an amount for each impression of the ad presented to a user (pay-per-impression), may pay for each click on the ad by a user (pay-per-click), or combinations thereof. Other payment models may also be used, such as pay-per-sale, pay-per-page-visit, pay-per-lead (e.g., filling out a form at the advertiser's website), or the like.
In the example illustrated, advertising service 102 may be associated with a search service 116. However, other implementations of advertising service 102 contemplated herein are not limited to use with a search service. One or more user devices 118 may be in communication with search service 116 through network(s) 106, which may include the same network type as that used for communication between advertiser 104 and advertising service 102, or a different network type. For example, the user device 118 may submit a search query 120 to search service 116 over network(s) 106. When the search service 116 receives the search query 120, the search service 116 may provide one or more query keywords 122 from the search query 120 to the advertising service 102. In response, an ad selection component 124 of the advertising service 102 may identify one or more selected ads 126 to be displayed with search results 128 that will be provided in response to the search query 120. The advertising service 102 may also include position or ranking information as ad rank 130 when there are multiple selected ads 126. The search service 116 may then assemble the search results 128 with the selected ads 126, such as in the form of a webpage, to provide search results 128 to the user device 118. The search results 128 may include the one or more selected ads 126 placed in the search results 128 in accordance with the ad rank 130 provided by the advertising service 102.
The user device 118 receives and displays the search results 128 to a user 132. In the case of a pay-per-impression agreement between the advertiser and the advertising service 102, the impression of a selected ad 126 to the user 132 can be recorded and the advertiser 104 charged accordingly. Further, the user 132 may choose whether or not to click on or otherwise select one of the selected ads 126 included in the search results 128. If the user 132 does click on a selected ad 126, this action can be detected by the search service 116. In the case of a pay-per-click agreement between the advertiser 104 and the advertising service 102, the click event can be recorded and the advertiser 104 charged accordingly.
When determining whether any ads 110 should be selected as selected ads 126, which ads 110 to select, and the ad rank 130 identifying a ranking or position of the selected ads 126, ad selection component 124 may employ quality scores 134, as determined by a quality score component 136. The quality score component 136 may be configured to determine quality scores 134 using a quality score table 138 for each advertisement-keyword pair 140. The quality score component 136 may construct the table during a table construction stage. The quality score component 136 may then subsequently use the quality score table 138 to determine quality scores 134 for the ad-keyword pairs 140. As mentioned above, in some implementations, the quality score component 136 may construct the quality score table 138 using historic data 142 collected over a period of time for a plurality of ad-keyword pairs 140. For example, historic data 142 may include historic performance data for the plurality of ad-keyword pairs and a plurality of corresponding advertisement parameters (x, y, z) for the plurality of ad-keyword pairs. Additional details of constructing and using the quality score table 138 are discussed below with reference to
As mentioned above, the quality score component 136 may periodically recalculate or reconstruct the quality score table 138 based on the latest available set of training data, and may then recalculate the quality scores 134 for the add-keyword pairs 140. Since the advertising service 102 is a dynamic system and because the quality score component 136 herein adopts a machine-learning approach to constructing the quality score table 138, the quality score component 136 is able to dynamically change and update the quality score table 138 as the advertising service 102 (and the search service 116) evolve. Accordingly, the quality scores 134 can be kept current and accurate through the periodic updating of the quality score table 138 and the quality scores 134. Additionally, in some implementations, advertising service 102 may provide feedback 144 to an advertiser 104 regarding the quality scores 134 determined for the advertiser's advertisements 110. For example, when the quality score 134 for an ad-keyword pair 140 has been determined by the quality score component 136, the advertising service 102 may provide the estimated quality score 134 to the advertiser 104 to enable the advertiser to attempt to improve the quality score 134.
At block 202, the quality score component 136 constructs a quality score table based on historic data for a plurality of ad-keyword pairs provided in a set of training data. For example, the historic data may include historic performance data for the plurality of ad-keyword pairs and a plurality of corresponding advertisement parameters (x, y, z) for the plurality of ad-keyword pairs. As described in detail below, the quality score component 136 may construct a preference graph from the training data and may cut the graph iteratively into a plurality of subgraphs, with each subgraph corresponding to different quality score value in the quality score table.
At block 204, the quality score component 136 selects an ad-keyword pair for determining a quality score. For example, the ad-keyword pair may have been in use by the advertising service for a period of time sufficient to establish historic data 142, such as one or more performance indicators, e.g., click through rate, number of clicks, number of impressions, and the like.
At block 206, the quality score component 136 determines the relevant ad parameters (x, y, z) of the selected ad-keyword pair. For example, the quality score component may use one or more algorithms to determine and assign values between 1 and 10 for each of the ad parameters (x, y, z). As mentioned above, the ad parameter x may represent the relevance between the keyword and the ad or ad group. For example, relevant ad copy should contain the bid keyword at least once to be relevant. It is also desirable for the ad copy to include good grammatical structure, dynamic text, unique selling points, be focused toward an identified potential customer, and motivate the user to click on the ad. The ad parameter y may represent the relevance between the keyword and the landing page of the ad or ad group. For example, the landing page should be directly related to the ad and the bid keyword. The ad parameter z may represent the quality of the landing page. For example, the landing page should adhere to certain editorial guidelines, be well organized, and should make it easy for the user to purchase a product, sign up for a service, create an account, or the like. Further, the landing page should not contain a large amount of unrelated advertising, should not contain misleading offers, spyware, or have functionality problems.
At block 208, based on the ad parameters (x, y, z) for the selected ad-keyword pair, the quality score component 136 determines the quality score from the quality score table. For example, the quality score component determines which subgraph of the plurality of subgraphs the selected ad-keyword pair belongs in, and from there is able to determine the quality score from the quality score table.
At block 210, the advertising service 102 utilizes the quality score in the advertisement service. For example, the ad service may apply the quality score during selection of advertisements, such as for use by a search service when providing search results in response to a search query. Additionally, the ad service may apply the quality score when determining minimum acceptable bids for the ad, the ad group or the advertiser.
At block 212, optionally, the advertising service 102 may provide the quality score for the ad-keyword pair to the advertiser 104 as feedback.
Example Search Results Page with Ads Ranked by Quality Score
In the illustrated example, search results page 300 may be displayed in a browser window 302, and may include a search menu 304 for selecting a resource to be searched, such as the “Web,” “images,” “videos,” “shopping,” “news,” “maps,” or “more,” along with an option to access email. Search results page 300 may further include a query entry window 306 for receiving the search query 120, and a results source menu 308 indicating a source of the results, e.g., the “Web,” “visual search,” “local,” “shopping,” “videos,” “images,” and “more.” Search results page 300 may further include a listing of related searches 310 and/or a search history 312. The search results page 300 may further include a presentation of search results 314 determined by the search service 116 to be relevant to the search query 120, such as a first-ranked search result 316, a second-ranked search result 318, and so forth.
According to some implementations herein, the search results page 300 may include one or more advertisements positioned or ranked based, at least in part, on a quality score 134 determined by the quality score component 136. In the illustrated example, an advertisement location 320 may immediately precede the search results 314, and may include one or more advertisements, such as a first-ranked ad 322 and a second-ranked ad 324. A location for additional advertisements 324 may be positioned to one side of search results 314, and may include a third-ranked ad 328, a fourth-ranked ad 330, and so forth. According to one possible method for determining ad rank 130, the ad rank 130 may be equal to the bid amount multiplied by the quality score. Thus as an example, when ad rank 130 is determined according to this method, if the bid amount for ads 322, 324, 328 and 330 was the same amount, then the rank of ads 322, 324, 328 and 330 would correspond to the quality score 134 for each ad. Thus, in this example, first-ranked ad 322 may have a higher quality score 134 than second-ranked ad 324, second-ranked ad 324 may have a higher quality score 134 than third-ranked ad 328, and so on.
When the user 132 clicks on one of the ads 322, 324, 328 or 330, the user's browser window 302 may be redirected to a landing page (not shown in
Advertising service component 506 may further include an auction component 508 and a history component 510. Auction component 508 may manage the auction portion of the advertising service. For example, the auction component may set minimum bids for particular ad-keyword pairs 140, may receive and manage the bids from advertisers, perform billing functions, and the like. History component 510 may maintain a history of historic data 142 for each ad keyword pair 140. For example, history component 510 may track the click-through rate, the number of impressions, the number of clicks, and other aspects and actions recorded with respect to each ad-keyword pair 140. The history component 510 may provide the historic data 142 for each ad-keyword pair 140 to quality score component 136 for use in generating quality score table 138 and determining quality scores 134. The history component 510 may also provide history information to auction component 508 for billing purposes, minimum bid determination, and the like.
Search service 116 may run on the same computing device(s) 502 as advertising service component 506, or on separate computing devices dedicated to the search service 116. Search service 116 may include a search engine 512, one or more search indexes 514 and a query response component 516. When the search query 120 is received by the search service 116, query response component 516 provides query keywords 122 from the search query 120 to the ad selection component 124 and receives back the selected ads 126 and the corresponding ad rank 130. Query response component 516 may then assemble the search results 128 in a search results page as described above with reference to
Advertiser computing device 504 may include one or more ad groups 522, as described above with reference to
The table construction technique herein utilizes a machine-learning-based approach. For example, the table construction may employ training data 602, such as logs of historical auction data and click results. The training data 602 may include one or more performance-based metrics or indicators from the historic data 142 as training data. The performance indicator may be used as a ground truth when constructing the graph G. The training data 602 may have been collected over a period of time, such as an immediately preceding week, or other period of time. Thus, suppose that there are K keywords that participated in ad auctions in the past week, and each keyword is associated with ni ads or ad groups. Then, (xji, yji, zji) may be used to indicate the ad parameters of the pair of i-th keyword and its j-th ad group, and gji may be used to indicate the ground truth of this pair. As mentioned above, the ground of truth may be based on historical performance of the ad-keyword pair, such as click through rate, number of clicks, number of impressions, or the like, recorded over the time period, such as during the past week.
As mentioned above, the three ad parameters (x, y, z) are based on ad-keyword features. For example, x can be a keyword-ad copy relevance score, y can be a keyword-landing page relevance score, and z can be a landing page quality score, each of which may be determined based on various features and considerations, as discussed above. Suitable algorithms for determining the three ad parameters (x, y, z) are well established by advertising services and thus are not further described herein. According to some implementations herein, the three ad parameters (x, y, z) may all be assigned integer values ranging between 1 and 10. Therefore, the three ad parameters (x, y, z) are each located in the group [1,10], and thus, there are at most 1000 different triples of (x, y, z). The 1000 different triples of (x, y, z) may be assigned to an index based on the following:
I(x,y,z)=100x+10y+z−111 EQ (1)
Accordingly the index of the triples may start at 0 and end at 999. Next, some implementations herein may construct the directed graph G having 1000 nodes (0-999). Thus, in the directed graph G, node i represents the triple (x, y, z) which has index i: I(x, y, z)=i. Furthermore, in the directed graph G, a weight Wi,j of an edge 604 from the i-th node to the j-th node indicates how many times the i-th triple wins over the j-th triple according to the ground truth for the corresponding ad-keyword pairs. Similarly, a weight wj,i of an edge 606 from the j-th node to the i-th node indicates how many times the j-th triple wins over the i-th triple according to the ground truth for the corresponding ad-keyword pairs. The weights of the edges may be initialized as follows:
w
i,j=0,∀0≦i,j≦999
Next, to apply the weights to the edges of the directed graph G, the quality score component 136 may traverse the training data 602 and determine weights of each of the edges in the graph. For example, if the i-th node represents the triple (xi, yi, zi) and the j-th node represents the triple (xj, yj, zj), and if click-through rate is used as the ground truth, then, during traversal of the training data, each time an ad-keyword pair corresponding to the triple (xi, yi, zi) has a higher click-through rate than an ad-keyword pair corresponding to the triple (xj, yj, zj), then a point may be added to the weight wi,j of the edge 604. On the other hand, if the ad-keyword pair corresponding to the triple (xj, yj, zj) has the higher click-through rate, then a point may be added to the weight wj,i of edge 606. Further, while click-through rate is used as an example of the ground truth in this example, other suitable performance indicators may alternatively be used as the ground truth. The following pseudocode sets forth example instructions that the quality score component 136 may execute to perform this operation.
Following determining the weights of the edges of the directed graph G, as described above, the quality score component 136 may generate a quality score table 138 using an iterative graph cutting operation, as illustrated in
In which jεG indicates that node j is in the graph. For each node i in the graph, if
IN(i)<OUT(i)
then that node will be will be put into the sub graph G1, else that node will be put into the sub graph G0. In this way, the graph is divided into two subgraphs G0 and G1. Next, using the same technique, the quality score component 136 can further cut the subgraph G1 into two sub graphs G11 and G10, and cut the subgraph G0 into two sub graphs G01 and G00. Finally, the quality score component 136 may similarly cut the four subgraphs G00, G01, G10 and G11 to generate eight subgraphs G000, G001, G010, G011, G100, G101, G110 and G111.
When using the quality score table to determine a quality score for a selected ad-keyword pair 140, the quality score component 136 determines the ad parameter triple (x, y, z) for the selected ad-keyword pair 140 and then determines in which subgraph G000, G001, G010, G011, G100, G101, G110 or G111 the triple (x, y, z) is located. For example, suppose that a selected ad-keyword pair 140 has an ad parameter x with a value of 8 representing the relevance between the keyword and the ad group, an ad parameter y with a value of 6 representing the relevance between the keyword and the landing page of the ad group, and an ad parameter z having a value of 9 representing the quality of the landing page. Thus, the parameter triple for the ad-keyword pair in this example is (8, 6, 9). Further, suppose that the quality score component 136 determines that this triple (8, 6, 9) is located in the subgraph G110. Accordingly, based on the quality score table 138, the quality score for the selected ad-keyword pair would be 7. Further, it should be noted that while implementations herein assign quality scores having values between 1 and 8 to ad-keyword pairs, in other implementations, the different values or ranges of values may be used for quality scores. For example, the directed graph G may be divided into any number of subgraphs, each of which may represent a different quality score value. Further, the ad parameters (x, y, z) may also be assigned different values or ranges of values, and thus directed graphs G of different sizes and number of nodes may be generated. Additionally, rather than merely relying on the three ad parameters (x, y, z), in some implementations, other ad parameters may be included or may replace the three ad parameters (x, y, z). Numerous other variations will be apparent to those of skill in the art in view of the disclosure herein.
The quality score table 138 constructed as described above may be used to determine quality scores 134. However, the quality score table 138 may benefit from smoothing to bolster consistency and to decrease any impact of noisy (irregular) training data. Accordingly, it may be desirable for the quality score table 138 to be smoothed so as to ensure a monotonic increasing property, i.e., to ensure that larger ad parameters (x, y, z) will generally lead to a larger quality score, such as:
x
1
≧x
2
,y
1
≧y
2
,z
1
≧z
2
=>QS(xi,yi,zi)≧QS(x2,y2,z2).
However, the raw quality score table 138 as constructed above may not always demonstrate this property in some cases. Accordingly, to ensure monotonicity, the quality score table 138 may be smoothed using a smoothing operation. While various smoothing algorithms may be implemented, an example of a suitable smoothing operation may be a greedy smoothing algorithm. In some implementations, instructions for performing the smoothing operation may be executed based on the following pseudocode. In some implementations, this algorithm is applied on the raw quality score table 138. That is, the algorithm may take in the raw quality score table as input, and output a smoothed quality score table.
Accordingly, in view of the foregoing, it may be seen that implementations herein provide for a machine-learning-based approach to determining quality scores for ad-keyword pairs. A quality score table can be constructed and updated automatically as the training data is updated. Since the training data is simply the historical data already gathered in an advertising service, the training data may be easily obtained from an actual ad platform and can reflect actual market dynamics. Furthermore, the quality score table according to implementations herein is able to capture more complex relationships between quality scores and the ad parameters (x, y, z) than would be possible using a linear function. Consequently, implementations herein can achieve a higher rank correlation with a ground truth than is possible with a linear function.
Additionally, various different performance indicators may be used as the ground truth, depending on the types of historic information collected or recorded by the ad service. For example, some implementations can simply change the way in which ad-keyword pairs are associated when constructing the directed graph and determining the edge weights. Thus, implementations herein may construct the directed graph based on, for example, click-through rate, number of impressions, number of clicks, number of leads (i.e., a user fills out a form or signs up for a service at a landing page), number of sales, or the like. Furthermore, other techniques may alternatively be used to construct the directed preference graph G, which may lead to even more accurate quality score estimation. For example, implementations herein are able to differentiate ad-keyword pairs and apply different weights to those pairs while changing the edge weights. Other implementations may also consider tied ad-keyword pairs in graph construction. Numerous other variations will be apparent to those of skill in the art in light of the disclosure herein.
At block 802, the quality score component 136 obtains historic data 142 for a plurality of ad-keyword pairs to be used as training data.
At block 804, the quality score component 136 constructs a directed graph based on the historic data. For example, the directed graph may include the ad parameter triples (x, y, z) as the nodes of the graph.
At block 806, the quality score component 136 determines weights of edges of the directed graph based on a ground truth determined from the historic data. For example, the ground truth may be a performance indicator, such as click-through rate, number of impressions, number of clicks, number of leads (i.e., a user fills out a form or signs up for a service at a landing page), number of sales, or the like
At block 808, the quality score component 136 cuts the directed graph into a plurality of subgraphs based on the weights of the edges. For example, the graph may be iteratively cut until a desired number of subgraphs are obtained.
At block 810, the quality score component 136 generates a quality score table containing a plurality of quality score values. Each quality score value in the table may correspond to a different one of the subgraphs in the final iteration.
At block 812, the quality score component 136 optionally performs smoothing of the quality score table using a smoothing function. For example, a greedy smoothing algorithm may be applied to help ensure a monotonic increasing property of the quality score table.
At block 814, the quality score component 136 employs the quality score table to determine a quality score for one or more selected ad-keyword pairs. For example, the quality score component 136 may match the advertisement parameter triple (x, y, z) for a selected ad-keyword pair with a corresponding triple in one of the subgraphs for determining the quality score of the selected ad keyword pair.
At block 816, the advertising component utilizes the quality score in an advertising service. For example, the advertising component may utilize the quality score for various decision making processes, such as when determining whether to display the advertisement, whether to include the advertisement in search results, where to rank the advertisement relative to other advertisements, determining a minimum bid amount, and the like.
The processor 902 may be a single processing unit or a number of processing units, all of which may include single or multiple computing units or multiple cores. The processor 902 can be implemented as one or more microprocessors, microcomputers, microcontrollers, digital signal processors, central processing units, state machines, logic circuitries, and/or any devices that manipulate signals based on operational instructions. Among other capabilities, the processor 902 can be configured to fetch and execute computer-readable instructions or processor-accessible instructions stored in the memory 904, mass storage devices 912, or other computer-readable storage media.
The computing device 900 may also include one or more communication interfaces 906 for exchanging data with other devices, such as via a network, direct connection, or the like, as discussed above. The communication interfaces 906 can facilitate communications within a wide variety of networks and protocol types, including wired networks (e.g., LAN, cable, etc.) and wireless networks (e.g., WLAN, cellular, satellite, etc.), the Internet and the like. Communication interfaces 906 can also provide communication with external storage (not shown), such as in a storage array, network attached storage, storage area network, or the like.
A display device 908, such as a monitor may be included in some implementations for displaying information to users. Other I/O devices 910 may be devices that receive various inputs from a user and provide various outputs to the user, and can include a keyboard, a remote controller, a mouse, a printer, audio input/output devices, and so forth.
Memory 904 and mass storage devices 912 are examples of computer-readable media for storing instructions which are executed by the processor 902 to perform the various functions described above. For example, memory 904 may generally include both volatile memory and non-volatile memory (e.g., RAM, ROM, or the like). Further, mass storage devices 912 may generally include hard disk drives, solid-state drives, removable media, including external and removable drives, memory cards, Flash memory, floppy disks, optical disks (e.g., CD, DVD), a storage array, a network attached storage, a storage area network, or the like. Both memory 904 and mass storage devices 912 may be non-transitory computer storage media, and may collectively be referred to as memory or computer-readable media herein.
Memory 904 and/or mass storage 912 are capable of storing computer-readable, processor-executable instructions as computer program code that can be executed by the processor 902 as a particular machine configured for carrying out the operations and functions described in the implementations herein. For example, memory 904 may include modules and components for determining and applying quality scores according to the implementations herein. In the illustrated example, memory 904 includes an advertising service component 916 that affords functionality for quality score determination. For example, advertising service component 916 may include advertiser interface component 108, ad selection component 124, quality scores 134, quality score component 136, quality score table 138, ad keyword pairs 140, and historic data 142. As described above, quality score component 136 may construct one or more quality score tables 138 and may use quality score table 138 to determine quality scores 134 for ad-keyword pairs 140. Memory 904 may also include one or more other modules 918, such as the auction component 508, the history component 510, and components of the search system 116, such as the query response component 516. Other modules 918 may also include an operating system, drivers, communication software, or the like. Memory 904 may also include other data 920 to carry out the functions described above. Further, while the quality score component 136 has been illustrated and described herein in the environment of an advertising service, other implementations of the quality score component 136 are not limited to use with an advertising service.
Although illustrated in
Computer storage media includes volatile and non-volatile, removable and non-removable media implemented in any method or technology for storage of information, such as computer readable instructions, data structures, program modules, or other data. Computer storage media includes, but is not limited to, RAM, ROM, EEPROM, flash memory or other memory technology, CD-ROM, digital versatile disks (DVD) or other optical storage, magnetic cassettes, magnetic tape, magnetic disk storage or other magnetic storage devices, or any other non-transmission medium that can be used to store information for access by a computing device.
In contrast, communication media may embody computer readable instructions, data structures, program modules, or other data in a modulated data signal, such as a carrier wave, or other transmission mechanism. As defined herein, computer storage media does not include communication media.
The example systems and computing devices described herein are merely examples suitable for some implementations and are not intended to suggest any limitation as to the scope of use or functionality of the environments, architectures and frameworks that can implement the processes, components and features described herein. Thus, implementations herein are operational with numerous environments or architectures, and may be implemented in general purpose and special-purpose computing systems, or other devices having processing capability. Generally, any of the functions described with reference to the figures can be implemented using software, hardware (e.g., fixed logic circuitry) or a combination of these implementations. The term “module,” “mechanism” or “component” as used herein generally represents software, hardware, or a combination of software and hardware that can be configured to implement prescribed functions. For instance, in the case of a software implementation, the term “module,” “mechanism” or “component” can represent program code (and/or declarative-type instructions) that performs specified tasks or operations when executed on a processing device or devices (e.g., CPUs or processors). The program code can be stored in one or more computer-readable memory devices or other computer-readable storage devices. Thus, the processes, components and modules described herein may be implemented by a computer program product.
Furthermore, this disclosure provides various example implementations, as described and as illustrated in the drawings. However, this disclosure is not limited to the implementations described and illustrated herein, but can extend to other implementations, as would be known or as would become known to those skilled in the art. Reference in the specification to “one implementation,” “this implementation,” “these implementations” or “some implementations” means that a particular feature, structure, or characteristic described is included in at least one implementation, and the appearances of these phrases in various places in the specification are not necessarily all referring to the same implementation.
Although the subject matter has been described in language specific to structural features and/or methodological acts, the subject matter defined in the appended claims is not limited to the specific features or acts described above. Rather, the specific features and acts described above are disclosed as example forms of implementing the claims. This disclosure is intended to cover any and all adaptations or variations of the disclosed implementations, and the following claims should not be construed to be limited to the specific implementations disclosed in the specification. Instead, the scope of this document is to be determined entirely by the following claims, along with the full range of equivalents to which such claims are entitled.