Claims
- 1. A method of predicting a correct spelling of a potentially-misspelled search term within a multiple-term search query, the method comprising:
identifying at least one matching search term of the multiple-term search query, said matching search term being different from the potentially-misspelled search term; identifying at least one additional term that, based on an analysis of search query submissions of users, has occurred in combination with the matching search term relatively frequently; and comparing a spelling of the additional term to a spelling of the potentially-misspelled search term.
- 2. The method of claim 1, wherein identifying at least one additional term comprises taking into consideration a search field in which the potentially-misspelled search term was entered.
- 3. The method of claim 2, wherein identifying at least one additional term further comprises taking into consideration a search field in which the matching search term was entered.
- 4. The method of claim 1, further comprising automatically replacing the potentially-misspelled term with the additional term if the additional is similar in spelling to the potentially-misspelled term to within a defined threshold.
- 5. The method of claim 1, further comprising performing said analysis of search query submissions.
- 6. The method of claim 5, wherein performing the analysis comprises according a greater weight to a search query submission that, based on an action performed by a user with respect to a search result item, is deemed to have produced a successful result.
- 7. The method of claim 6, wherein said action is a purchase of a search result item.
- 8. A method of predicting correct spellings of potentially-misspelled search terms within multiple-term search queries, the method comprising:
analyzing search query submissions of users to identify search terms that are related to each other by virtue of a relative frequency with which such search terms occur in combination; generating a table reflective of a result of said analysis of search query submissions, said table indicating specific search terms that are related to each other; and using the table to predict correct spellings of non-matching search terms within multiple-term search queries.
- 9. The method as in claim 8, wherein analyzing the search query submissions comprises keeping track of specific search fields in which search terms were entered by users.
- 10. The method as in claim 8, further comprising updating the table over time to reflect new query submissions from users such that spelling predictions reflect current searching patterns of users.
- 11. The method as in claim 8, wherein generating the table comprises incorporating into the table term correlation data derived from a term co-occurrence analysis of a database to be searched.
- 12. The method as in claim 8, wherein using the table to predict correct spellings of non-matching search terms comprises:
receiving a multiple-term search query from a user; identifying a matching search term and a non-matching search term within the multiple-term search query; accessing the table to look up at least one additional term that is related to the matching search term; and comparing a spelling of the additional term to a spelling of the non-matching search term.
- 13. A method of predicting correct spellings of potentially-misspelled search terms within multiple-term search queries, the method comprising:
analyzing search query submissions of users to identify search terms that, based on co-occurrences of search terms within submitted search queries, are related to one another, wherein analyzing the search query submissions comprises according different weights to different search query submissions, said weights being dependent upon user actions performed with respect to associated query result items; storing a data set reflective of a result of said analysis of search query submissions, said data set indicating specific search terms that are related to each other; and using the data set to predict correct spellings of non-matching search terms within multiple-term search queries.
- 14. The method of claim 13, wherein according different weights comprises according, to a search query submission of a user, a weight that is dependent upon whether the user viewed a query result item.
- 15. The method of claim 13, wherein according different weights comprises according, to a search query submission of a user, a weight that is dependent upon whether the user purchased a query result item.
- 16. The method as in claim 13, wherein using the data set to predict correct spellings of non-matching search terms comprises:
receiving a multiple-term search query from a user; identifying a matching search term and a non-matching search term within the multiple-term search query; looking up from the data set at least one additional term that is related to the matching search term; and comparing a spelling of the additional term to a spelling of the non-matching search term.
- 17. The method as in claim 13, wherein the data set comprises search term correlation data for each of a plurality of search fields.
- 18. A method of correcting spelling errors in search queries, the method comprising:
receiving a multiple-term search query from a user; identifying a non-matching search term within the multiple-term term search query; identifying a matching search term within the multiple-term term search query; and selecting a replacement term that is a candidate correctly-spelled replacement for the non-matching search term, wherein the candidate correctly-spelled replacement term is selected based at least in part upon an identity of the matching search term.
- 19. The method as is claim 18, wherein selecting the candidate correctly-spelled replacement term comprises selecting a term that, based on an analysis of search query submissions of users, has appeared in combination with the matching search term relatively frequently.
- 20. The method as is claim 18, wherein selecting the candidate correctly-spelled replacement term comprises selecting a replacement term that, based on an analysis of a database to which the multiple-term search query is directed, has appeared in combination with the matching search term within the database relatively frequently.
- 21. A system for processing search queries, comprising:
a data set that links terms to sets of related terms, said data set reflecting a result of a search term co-occurrence analysis in which a weight accorded to a search query submission from a user is dependent upon a predicted degree of success of the search query submission; and a query processing module that uses the data set to predict correct spellings of search terms within multiple-term search queries.
- 22. The system of claim 21, wherein the weight accorded to the search query submission is dependent upon whether the user viewed a resulting query result item.
- 23. The system of claim 21, wherein the weight accorded to the search query submission is dependent upon whether the user purchased a resulting query result item.
- 24. The system of claim 21, wherein the weight accorded to the search query submission is dependent upon whether the user added a resulting query result item to a shopping cart.
- 25. A method of predicting correct spellings of a potentially-misspelled search terms within multiple-term search queries, the method comprising:
analyzing a database to identify terms that are related to one another by virtue of frequencies with which such terms occur in combination within the database; generating a data set that is reflective of a result of said analysis of the database, said data set indicating specific terms that are related to each other; and using the data set to predict correct spellings of non-matching search terms within multiple-term search queries submitted by users.
- 26. The method as in claim 25, wherein analyzing the database comprises determining frequencies with which specific terms co-occur within a specific database field.
- 27. The method as in claim 25, wherein analyzing the database comprises determining frequencies with which specific terms co-occur within database records.
- 28. The method as in claim 25, wherein the database is a products database of an online merchant, and wherein analyzing the database comprises according to a product a weight that is dependent upon a number of units of that product sold over a period of time..
- 29. The method as in claim 25, wherein using the data set to predict correct spellings of non-matching search terms comprises:
receiving a multiple-term search query from a user; identifying a matching search term and a non-matching search term within the multiple-term term search query; accessing the data set to look up at least one additional term that is related to the matching search term; and comparing a spelling of the additional term to a spelling of the non-matching search term.
- 30. A method of predicting a correct spelling of a non-matching search term within a multiple-term search query, the method comprising:
identifying a plurality of matching search terms within the multiple-term search query, each matching search term being different from the non-matching search term; for each matching search term, looking up a corresponding set of related terms, to thereby obtain multiple sets of related terms; combining the multiple sets of related terms to form a related terms set; and comparing spellings of individual terms in the related terms set to a spelling of the non-matching term.
- 31. The method of claim 30, wherein looking up a corresponding set of related terms comprises identifying related terms that, based on an analysis of search query submissions of users, has occurred in combination with a matching search term relatively frequently.
PRIORITY CLAIM
[0001] This application is a continuation of U.S. application Ser. No. 09/517,786, filed Mar. 2, 2000, which is a continuation of Appl. Ser. No. 09/115,662, filed Jul. 15, 1998, now U.S. Pat. No. 6,144,958.
Continuations (2)
|
Number |
Date |
Country |
Parent |
09517786 |
Mar 2000 |
US |
Child |
10114555 |
Apr 2002 |
US |
Parent |
09115662 |
Jul 1998 |
US |
Child |
09517786 |
Mar 2000 |
US |