Claims
- 1. Using a computer system with access to electronically retrievable content in digital form, a method of determining presentations from a set of presentations to present alongside search results, the method comprising:
receiving a request from a user in the form of a search query string; canonicalizing one or more terms in the search query string to form a reformed search query string when the one or more terms meet canonicalization criteria; applying the reformed search query string as an input to a presentation processor; selecting, using the presentation processor, presentations to be included in the response to the request, wherein selection of presentations is done according to at least keywords associated with presentations in the set of presentations, further wherein at least one keyword is a canonicalization of at least one of the one or more terms meeting the canonicalization criteria.
- 2. The method of claim 1, wherein the set of presentations is a set of advertisements and selection of advertisements for presentation is determined by keywords assigned to advertisers.
- 3. The method of claim 1, wherein the set of presentations is a set of sponsored search results.
- 4. The method of claim 1, wherein one or more word-based canonicalizations are performed and one or more term-based canonicalizations are performed.
- 5. The method of claim 1, wherein the canonicalization criteria are determined by canonicalization rules stored as one or more rule base.
- 6. The method of claim 1, wherein canonicalizing includes inputting search query state information and using the search query state information in the canonicalization step.
- 7. The method of claim 1, further comprising:
selecting search results based on the reformed search query string; and presenting the search results along with the selected presentations.
- 8. The method of claim 1, wherein the reformed search query string is a first reformed search query string, the method further comprising:
canonicalizing one or more terms in the search query string to form a second reformed search query string distinct from the first reformed search query string when one or more terms meet a second set of canonicalization criteria; selecting search results based on the second reformed search query string; and presenting the search results selected using the second reformed search query string along with the selected presentations selected using the first reformed search query string.
- 9. The method of claim 1, wherein canonicalizing the one or more terms in the search query string comprises mapping plural forms to canonical singular forms.
- 10. The method of claim 9, wherein mapping plural forms to canonical singular forms uses a dictionary generated by the steps of:
collect a set of nouns as the dictionary; remove compound words from the dictionary; use an inflection process to convert singular nouns from the set of nouns to plural nouns; invert mapping of the inflection process to thereby converting plural nouns to singular nouns; check for a cyclic mappings, wherein the output of the inverted mapping step is compared to the singular nouns input to the inflection process; flagging singular nouns that do not pass the cyclic mapping check; remove nouns that do not have corresponding plurals in the set of nouns; remove ambiguous plural forms that can be mapped to multiple singular forms, such that at most one of the multiple singular forms remains in the dictionary; remove nouns that correspond to abbreviations; and consolidate transitive mappings into single nontransitive mappings.
- 11. The method of claim 10, further comprising:
mapping nouns to a preferred dialect; and mapping nouns to one of multiple forms when a short form and a long form of a noun appear.
- 12. Using a computer system with access to electronically retrievable content in digital form, a method of determining presentations from a set of presentations to present, the method comprising:
receiving a request from a user in the form of a query string; canonicalizing one or more terms in the query string to form a reformed query string when the one or more terms meet canonicalization criteria, wherein canonicalizing is performed using one or more canonicalization rules mapping terms from query strings to canonical forms used in the reformed query string, wherein at least one canonicalization rule maps terms to canonical forms according to heuristics determined at least in part by contents of query logs; applying the reformed search query string as an input to a presentation processor; selecting, using the presentation processor, presentations to be included in the response to the request.
- 13. The method of claim 12, wherein selection of presentations is done according to at least keywords associated with presentations in the set of presentations, further wherein at least one keyword is a canonicalization of at least one of the one or more terms meeting the canonicalization criteria.
- 14. The method of claim 12, wherein the set of presentations is a set of advertisements and selection of advertisements for presentation is determined by keywords assigned to advertisers.
- 15. The method of claim 12, wherein the set of presentations is a set of sponsored search results.
- 16. The method of claim 12, wherein mapping terms to canonical forms according to heuristics determined at least in part by contents of query logs comprises:
logging, in a query log, past query strings; logging, in a query log, subsequent selections among presentations presented in response to query strings; storing query-selection associations, wherein a query-selection association associates a query string with a subsequent selection made by a user from a set of presentations presented in response to the query string; and associating distinct terms with a common canonical form when query-selection associations are present for queries containing the distinct terms and a common selection of presentation.
- 17. The method of claim 12, wherein selection of presentations is done, in addition to keywords, according to one or more of user demographics, prior search history, time of day, and prior presentation history.
- 18. The method of claim 12, wherein the canonicalization criteria are determined by canonicalization rules stored as one or more rule base.
- 19. The method of claim 12, further comprising:
selecting search results based on the reformed search query string; and presenting the search results along with the selected presentations.
- 20. The method of claim 12, wherein the reformed search query string is a first reformed search query string, the method further comprising:
canonicalizing one or more terms in the search query string to form a second reformed search query string distinct from the first reformed search query string when one or more terms meet a second set of canonicalization criteria; selecting search results based on the second reformed search query string; and presenting the search results selected using the second reformed search query string along with the selected presentations selected using the first reformed search query string.
CROSS-REFERENCE TO RELATED APPLICATIONS
[0001] This application claims priority from co-pending U.S. Provisional Patent Application No. 60/460,657 filed Apr. 4, 2003, entitled “Canonicalization of Terms in a Keyword-Based Presentation System” which is hereby incorporated by reference, as if set forth in full in this document, for all purposes.
Provisional Applications (1)
|
Number |
Date |
Country |
|
60460657 |
Apr 2003 |
US |