Methods and systems to generate rules to identify data items

Information

  • Patent Application
  • 20070198501
  • Publication Number
    20070198501
  • Date Filed
    February 07, 2007
    17 years ago
  • Date Published
    August 23, 2007
    17 years ago
Abstract
There is provided a method and system generate rules and identify data items. The system receives an aspect that is used to describe a data item and a request for at least one candidate value to associate with the aspect. Next, the system identifies a string of text in a database based on the aspect, analyzes the string of text based on the aspect to identify at least one candidate value in the string of text and receives a selection identifying a candidate value. Next, the system generates a rule that includes the aspect-value pair that includes the aspect and the selected candidate value. Next, system associates the aspect-value pair to a first data item based on a publication of the rule. Next, the system receives a query, associates the aspect-value pair to the query, based on the rule, generates a second query that includes the aspect-value pair and identifies the first data item based on the aspect-value pair in the second query for an interface that includes the first data item.
Description

BRIEF DESCRIPTION OF THE DRAWINGS

An embodiment is illustrated by way of example and not limitation in the figures of the accompanying drawings, in which like references indicate similar elements and in which:



FIG. 1 is a network diagram depicting a system, according to one example embodiment, having a client-server architecture;



FIG. 2 is a block diagram illustrating modules and engines, according to an embodiment;



FIG. 3 is a block diagram illustrating authoring modules, according to an embodiment;



FIG. 4 is a block diagram illustrating an information storage and retrieval platform, according to an embodiment;



FIG. 5 is a diagram illustrating an example domain structure, according to one embodiment;



FIG. 6 is a table illustrating sell-side data and buy-side data, according to one embodiment;



FIG. 7 is a diagram illustrating a canonical matching concept, according to an embodiment;



FIG. 8A is a block diagram illustrating databases, according to an embodiment;



FIG. 8B is a block diagram illustrating additional databases, according to an embodiment;



FIG. 9A is a block diagram illustrating classification information, according to an embodiment;



FIG. 9B is a block diagram illustrating production classification information, according to an embodiment;



FIG. 9C is a block diagram illustrating preview classification information, according to an embodiment;



FIG. 10 is a diagram illustrating rules, according to an embodiment;



FIG. 11 is a block diagram illustrating data item information, according to an embodiment;



FIG. 12 is a block diagram illustrating a search index engine, according to an embodiment;



FIG. 13A is a block diagram illustrating a data item search information, according to an embodiment;



FIG. 13B is a block diagram illustrating a sample data item search information, according to an embodiment;



FIG. 14 is a block diagram illustrating query information, according to an embodiment;



FIG. 15 is a block diagram illustrating data item criteria, according to an embodiment;



FIG. 16 is a block diagram illustrating preview publish information, according to an embodiment;



FIG. 17 is a block diagram illustrating most popular query information, according to an embodiment;



FIG. 18 is a block diagram illustrating histogram information, according to an embodiment;



FIG. 19 is a flow chart illustrating a method to generate rules to identify data items, according to an embodiment;



FIG. 20 is a flowchart illustrating a method, according to an embodiment, to represent percentage of coverage for a subset of most popular queries;



FIG. 21 is a flowchart illustrating a method, according to an embodiment, to apply aspect rules to most popular queries;



FIG. 22 is a flowchart illustrating a method, according to an embodiment, to determine percentage coverage for most popular queries;



FIG. 23 is a flowchart illustrating a method, according to an embodiment, to represent percentage coverage associated with a domain;



FIG. 24 is a flowchart illustrating a method, according to an embodiment, to apply domain rules to data items;



FIG. 25 is a flowchart illustrating a method, according to an embodiment, to determine domains;



FIG. 26 is a flowchart illustrating a method, according to an embodiment, to represent percentage coverage for aspects;



FIG. 27 is a flowchart illustrating a method, according to an embodiment, to apply aspect rules to data items;



FIG. 28 is a flowchart illustrating a method, according to an embodiment, to determine percentage coverage for aspects;



FIG. 29 is a flowchart illustrating a method, according to an embodiment, to represent percentage coverage for aspect-value pairs;



FIG. 30 is a flowchart illustrating a method, according to an embodiment to determine percentage coverage for aspect-value pairs;



FIGS. 31-38 are diagrams illustrating user interfaces, according to an embodiment;



FIG. 39 is a block diagram illustrating marketplace applications, according to an embodiment;



FIG. 40 is a block diagram illustrating marketplace information, according to an embodiment; and



FIG. 41 is a block diagram of a machine, according to an example embodiment, including instructions to perform any one or more of the methodologies described herein.


Claims
  • 1. A method including: receiving an aspect that is used to describe a data item and a request for at least one candidate value to associate with the aspect;identifying a string of text in a database based on the aspect;analyzing the string of text based on the aspect to identify at least one candidate value in the string of text, the at least one candidate value to include a first candidate value that is selected;generating and publishing a rule that includes an aspect-value pair that includes the aspect and the first candidate value;associating the aspect-value pair to a first data item based on the published rule;receiving a first query;associating the aspect-value pair to a first query based on the rule; andidentifying the first data item based on the aspect-value pair for an interface that includes the first data item.
  • 2. The method of claim 1, wherein the aspect is utilized to describe the first data item that is offered by a seller on a network-based marketplace, wherein the first query is received from a user searching data items on the network-based marketplace.
  • 3. The method of claim 2, wherein the database is a sample of data items from a first database that is utilized by a plurality of buyers and a plurality of sellers that utilize the network-based marketplace, wherein the sample of data items is selected from a group of samples consisting of a current sample, a seasonal sample, and an historical sample.
  • 4. The method of claim 1, wherein the identifying the string of text includes one or more actions selected from the group of actions consisting of identifying the aspect in the string of text, identifying a synonym of the aspect in the string of text, identifying an acronym of the aspect in the string of text, and identifying a alternate spelling of the aspect in the string of text.
  • 5. The method of claim 1, wherein the analyzing the string of text includes removing stop-words from the string of text.
  • 6. The method of claim 1, wherein the rule includes a predicate clause to identify the candidate value in the first query and a condition clause that includes the aspect-value pair.
  • 7. The method of claim 1, wherein the rule includes a predicate clause to identify the first data item and a condition clause that includes the aspect-value pair.
  • 8. The method of claim 7, wherein the associating the aspect-value pair to the first data item includes concatenating the aspect-value pair to the first data item and wherein the associating the aspect-value pair to the first query includes: identifying a keyword in the first query as the value; andgenerating a transformed query that includes the aspect-value pair.
  • 9. The method of claim 8, wherein the identifying the first data item based on the first query includes utilizing the transformed query to identify the aspect-value pair in the first data item.
  • 10. A system including: a value generator module to receive an aspect that is used to describe a data item and a request for at least one candidate value to associate with the aspect;a string identifier module to identify a string of text in a database based on the aspect, the string identifier module to analyze the string of text based on the aspect to identify at least one candidate value in the string of text, the at least one candidate value to include a first candidate value that is selected;a rules editor to generate a rule that includes the aspect-value pair that includes the aspect and the first candidate value; andan information storage and retrieval platform, to associate the aspect-value pair to a first data item based on a publication of the rule, the information storage and retrieval platform to receive a first query, the information storage and retrieval platform to associate the aspect-value pair to the first query, based on the rule, the information storage and retrieval platform to identify the first data item based on the aspect-value pair for an interface that includes the first data item.
  • 11. The system of claim 10, wherein the aspect is utilized to describe the first data item that is offered by a seller on a network-based marketplace, wherein the first query is received from a user that searches for data items on the network-based marketplace.
  • 12. The system of claim 11, wherein the database is a sample of data items from a first database that is utilized by a plurality of buyers and a plurality of sellers that utilize the network-based marketplace, wherein the sample of data items is anyone from a group of samples including a current sample, a seasonal sample, and an historical sample.
  • 13. The system of claim 10, wherein the string identifier module identifies the aspect in the string of text, wherein the string identifier module identifies a synonym of the aspect in the string of text, wherein the string identifier module identifies an acronym of the aspect in the string of text, and wherein the string identifier module identifies an alternate spelling of the aspect in the string of text.
  • 14. The system of claim 10, wherein the string analyzer module removes stop-words from the string of text.
  • 15. The system of claim 10, wherein the rule includes a predicate clause to identify the value in the first query and a condition clause that includes the aspect-value pair.
  • 16. The system of claim 15, wherein the rule includes a predicate clause to identify the first data item and a condition clause that includes the aspect-value pair.
  • 17. The system of claim 16, wherein the information storage and retrieval platform concatenates the aspect-value pair to the data item, and wherein the information storage and retrieval platform identifies a keyword in the first query as the value and generates a transformed query that includes the aspect-value pair.
  • 18. The system of claim 17, wherein the information storage and retrieval platform utilizes the transformed query to identify the aspect-value pair in the first data item.
  • 19. A system including: a first means for receiving an aspect that is used to describe a data item and a request for at least one candidate value to associate with the aspect;a second means for identifying a string of text in a database based on the aspect, the second means for analyzing the string of text based on the aspect to identify at least one candidate value in the string of text, the at least one candidate value to include a first candidate value that is selected;a third means for generating a rule that includes the aspect-value pair that includes the aspect and the first candidate value; anda fourth means for associating the aspect-value pair to a first data item based on a publication of the rule, the third means for receiving a first query, the third means for associating the aspect-value to the first query based on the rule, the third means for identifying the first data item based on the aspect-value pair for an interface that includes the first data item.
  • 20. A tangible machine readable medium storing a set of instructions that, when executed by a machine, cause the machine to: receive an aspect that is used to describe a data item and a request for at least one candidate value to associate with the aspect;identify a string of text in a database based on the aspect;analyze the string of text based on the aspect to identify at least one candidate value in the string of text, the at least one candidate value to include a first candidate value that is selected;generate a rule that includes the aspect-value pair that includes the aspect and the first candidate value;associate the aspect-value pair to a first data item based on a publication of the rule;receive a first query;associate the aspect-value to the first query based on the rule; andidentify the first data item based on the aspect-value pair for an interface that includes the first data item.
Provisional Applications (3)
Number Date Country
60743256 Feb 2006 US
60781521 Mar 2006 US
60745347 Apr 2006 US