Claims
- 1. A method for generating an annotated database and syndicating content from that database comprising:
selecting a term from a corpus of documents according to rules; storing said term into a terms database; identifying a knowledgeable expert familiar with said term; sending said selected term to said expert; syndicating data objects representing said database to remote servers; and using said data objects to execute rules for linking to information from said database without requiring a connection to said database.
- 2. The method of claim 1, further comprising:
crawling said corpus of documents.
- 3. The method of claim 1, further comprising:
parsing all documents matching selected criteria in said corpus.
- 4. The method of claim 1, further comprising:
providing a term annotation by said expert.
- 5. The method of claim 4, further comprising:
linking said annotation to said term in said database.
- 6. The method of claim 1 wherein said rules comprise any of:
said term not previously existing in said database, unusually high frequency of said term, said term is an article, said term is an unusual part of speech.
- 7. The method of claim 1, wherein said rules comprise:
ranking an order chains of said terms by said terms normalized frequency, where said normalized frequency is a frequency of said chain divided by a frequency of said term.
- 8. The method of claim 7, wherein said normalized frequency is greater than or equal to a threshold value.
- 9. The method of claim 1, wherein said annotated term is sponsored by an advertiser automatically when a page containing said annotated term is viewed.
- 10. The method of claim 9, wherein said annotated term is pre-selected by providing said annotated term in said terms database.
- 11. The method of claim 4 wherein said annotated term links to a content window, said content window containing information related to said term.
- 12. The method of claim 11, wherein said information comprises such as:
definitions, related products or services, sponsorship information, information from content syndicators, translations and reference works, document archives, and other repositories of information said information accessible by selecting said term within said corpus of documents.
- 13. The method of claim 1 further comprising:
filtering terms from said corpus of documents by removing commonly used terms.
- 14. The method of claim 1 further comprising:
analyzing said term with a term editor utility interface, said interface capable of expanding or reducing extensions of a context said term is contained in.
- 15. The method of claim 1 further comprising:
syndicating to remote servers lexical data objects containing a representation of content in the term database
- 16. The method of claim 15, wherein said lexical data object contains information such as:
terms, term ID, dictionary, annotation content, meta data
- 17. The method of claim 15, further comprising:
a single lexical object is used by all processing engines on a single server or stored on a single server and accessed by processing engines on multiple servers
- 18. The method of claim 1 further comprising:
processing engine on a remote server utilizing said data object to match terms in unstructured text to terms in said database
- 19. The method of claim 18 further comprising:
a connection to the said database is not required at the time processing takes place
- 20. The method of claim 1 further comprising:
syndicating to remote servers template data objects containing linking rules specified in a template database
- 21. The method of claim 20, wherein said template data object contains information such as:
template names, template IDs, dictionary names, dictionary IDs, metadata criteria, filter names, tag definitions, run time mode selections, and additions to the page
- 22. The method of claim 20, further comprising:
a single template object is used by all processing engines on a single server or stored on a single server and accessed by processing engines on multiple servers
- 23. The method of claim 1 further comprising:
processing engine on a remote server utilizing said template data object to determine and implement linking rules
- 24. The method of claim 23 further comprising:
a connection to said template database is not required at the time processing takes place
- 25. A method for generating an annotated database and syndicating content from that database comprising:
selecting a term from a corpus of documents according to rules; storing said term into a terms database; identifying a knowledgeable expert familiar with said term; sending said selected term to said expert; syndicating data objects representing said database to remote servers; and using said data objects to execute rules for linking to information from said database without requiring a connection to said database.
- 26. The method of claim 25, further comprising:
crawling said corpus of documents.
- 27. The method of claim 25, further comprising:
parsing all documents matching selected criteria in said corpus.
- 28. The method of claim 25, further comprising:
providing a term annotation by said expert.
- 29. The method of claim 28, further comprising:
linking said annotation to said term in said database.
- 30. The method of claim 25 wherein said rules comprise any of:
said term not previously existing in said database, unusually high frequency of said term, said term is an article, said term is an unusual part of speech.
- 31. The method of claim 25, wherein said rules comprise:
ranking an order chains of said terms by said terms normalized frequency, where said normalized frequency is a frequency of said chain divided by a frequency of said term.
- 32. The method of claim 31, wherein said normalized frequency is greater than or equal to a threshold value.
- 33. The method of claim 25, wherein said annotated term is sponsored by an advertiser automatically when a page containing said annotated term is viewed.
- 34. The method of claim 33, wherein said annotated term is pre-selected by providing said annotated term in said terms database.
- 35. The method of claim 28 wherein said annotated term links to a content window, said content window containing information related to said term.
- 36. The method of claim 35, wherein said information comprises such as:
definitions, related products or services, sponsorship information, information from content syndicators, translations and reference works, document archives, and other repositories of information said information accessible by selecting said term within said corpus of documents.
- 37. The method of claim 25 further comprising:
filtering terms from said corpus of documents by removing commonly used terms.
- 38. The method of claim 25 further comprising:
analyzing said term with a term editor utility interface, said interface capable of expanding or reducing extensions of a context said term is contained in.
- 39. The method of claim 25 further comprising:
syndicating to remote servers lexical data objects containing a representation of content in the term database
- 40. The method of claim 39, wherein said lexical data object contains information such as:
terms, term ID, dictionary, annotation content, meta data
- 41. The method of claim 39, further comprising:
a single lexical object is used by all processing engines on a single server or stored on a single server and accessed by processing engines on multiple servers
- 42. The method of claim 25 further comprising:
processing engine on a remote server utilizing said data object to match terms in unstructured text to terms in said database
- 43. The method of claim 42 further comprising:
a connection to the said database is not required at the time processing takes place
- 44. The method of claim 25 further comprising:
syndicating to remote servers template data objects containing linking rules specified in a template database
- 45. The method of claim 44, wherein said template data object contains information such as:
template names, template IDs, dictionary names, dictionary IDs, metadata criteria, filter names, tag definitions, run time mode selections, and additions to the page
- 46. The method of claim 44, further comprising:
a single template object is used by all processing engines on a single server or stored on a single server and accessed by processing engines on multiple servers
- 47. The method of claim 25 further comprising:
processing engine on a remote server utilizing said template data object to determine and implement linking rules
- 48. The method of claim 47 further comprising:
a connection to said template database is not required at the time processing takes place.
CLAIM OF PRIORITY
[0001] Priority for the present application is claimed to U.S. Provisional Application 60/313,041 filed Aug. 16, 2001.
Provisional Applications (1)
|
Number |
Date |
Country |
|
60313041 |
Aug 2001 |
US |