Claims
- 1. A method for identifying documents that are relevant to an event of interest, comprising:
receiving, from a user, one or more example documents that define the event; obtaining documents in real time that correspond to information created in a plurality of media formats; determining relevance of the documents to the event based on the one or more example documents; and alerting the user when one or more of the documents are determined to be relevant.
- 2. The method of claim 1, wherein the one or more example documents include at least one of text documents, audio documents, and video documents.
- 3. The method of claim 1, wherein the one or more example documents include a total of at least approximately two thousand words.
- 4. The method of claim 1, wherein the information includes at least two of real time audio broadcasts, real time video broadcasts, and text streams or files.
- 5. The method of claim 1, further comprising:
building a statistical language model using the one or more example documents.
- 6. The method of claim 5, wherein the determining relevance of the documents includes:
finding similarities between words in the documents and words in the one or more example documents, and identifying one of the documents as relevant when the words in the document are similar to the words in the one or more example documents.
- 7. The method of claim 1, wherein the determining relevance of the documents includes:
determining similarities between the documents and the one or more example documents, and identifying one of the documents as relevant when the document is similar to at least one of the one or more example documents.
- 8. The method of claim 1, wherein the determining relevance of the documents includes:
generating scores for the documents, and determining that ones of the documents with scores above a threshold are relevant.
- 9. The method of claim 8, further comprising:
providing the ones of the documents to the user based on the scores.
- 10. The method of claim 8, wherein the generating scores includes:
determining scores based on degrees of similarities between the documents and the one or more example documents.
- 11. The method of claim 1, wherein the alerting the user includes at least one of:
placing a telephone call to the user, sending an e-mail to the user, sending a page to the user, sending an instant message to the user, and sending a facsimile to the user.
- 12. The method of claim 1, wherein the alerting the user includes:
sending an alert to the user after a predetermined number of the documents are determined to be relevant.
- 13. The method of claim 1, further comprising:
receiving, from the user, a request for additional information relating to the event, and sending the additional information to the user.
- 14. The method of claim 13, wherein the additional information includes the one or more documents that are determined to be relevant.
- 15. The method of claim 13, wherein the additional information includes the information, corresponding to the one or more documents that are determined to be relevant, in one of the media formats in which the information was created.
- 16. A system for identifying data that is relevant to an event of interest, comprising:
means for obtaining, from a user, a user profile that includes one or more example documents that define the event; means for receiving real-time data that corresponds to multimedia information; means for determining relevance of the data to the event based on the one or more example documents; and means for notifying the user when the data is determined to be relevant.
- 17. An event tracking system, comprising:
collection logic configured to:
receive data items that include textual representations of multimedia information; and tracking logic configured to:
obtain a user profile that includes one or more example documents that define an event for which a user desires data, determine relevance of the data items received by the collection logic to the event based on the user profile, and send an alert to the user when at least one of the data items is determined to be relevant.
- 18. The system of claim 17, wherein the one or more example documents include at least one of text documents, audio documents, and video documents.
- 19. The system of claim 17, wherein the one or more example documents collectively include at least approximately two thousand words.
- 20. The system of claim 17, wherein the multimedia information includes at least one of real time audio broadcasts, real time video broadcasts, text streams, and text files.
- 21. The system of claim 17, wherein the tracking logic is further configured to:
build a statistical language model using the one or more example documents.
- 22. The system of claim 21, wherein when determining relevance of the data items, the tracking logic is configured to:
determine similarities between words in the data items and words in the one or more example documents, and identify one of the data items as relevant when the words in the data item are similar to the words in the one or more example documents.
- 23. The system of claim 17, wherein when determining relevance of the data items, the tracking logic is configured to:
determine similarities between the data items and the one or more example documents, and identify one of the data items as relevant when the data item is similar to at least one of the one or more example documents.
- 24. The system of claim 17, wherein when determining relevance of the data items, the tracking logic is configured to:
generate scores for the data items, and determining that ones of the data items with scores greater than a threshold are relevant.
- 25. The system of claim 24, wherein the tracking logic is further configured to:
provide the ones of the data items to the user based on the scores.
- 26. The system of claim 24, wherein when generating scores, the tracking logic is configured to:
determine scores based on a degree of similarity between the data items and the one or more example documents.
- 27. The system of claim 17, wherein when sending an alert, the tracking logic is configured to cause at least one of a telephone call to be placed to the user, an e-mail to be sent to the user, a page to be sent to the user, an instant message to be sent to the user, and a facsimile to be sent to the user.
- 28. The system of claim 17, wherein when sending an alert, the tracking logic is configured to:
wait until a predetermined number of the data items are determined to be relevant before sending the alert to the user.
- 29. The system of claim 17, wherein the tracking logic is further configured to:
receive, from the user, a request for additional information relating to the event, and send the additional information to the user.
- 30. The system of claim 29, wherein the additional information includes the at least one data item that is determined to be relevant.
- 31. The system of claim 29, wherein the additional information includes the multimedia information corresponding to the at least one data item that is determined to be relevant.
- 32. A computer-readable medium that stores instructions which when executed by a processor cause the processor to perform a method for notifying a user of documents that are relevant to an event of interest, the computer-readable medium comprising:
instructions for obtaining at least one example document that defines the event; instructions for acquiring real-time documents corresponding to information created in a plurality of media formats; instructions for determining relevance of the real-time documents to the event based on the at least one example document; and instructions for notifying the user when one or more of the real-time documents are determined to be relevant.
- 33. An event tracking system, comprising:
one or more indexers configured to:
capture data, the data including at least one of audio data, video data, and text data, and transcribe the data when the data is the audio data or the video data to create text data; and alert logic configured to:
receive at least one example document that defines an event for which a user desires information, receive the text data from the one or more indexers, determine relevance of the text data to the event based on the at least one example document, and alert the user when the text data is determined to be relevant.
- 34. A method for notifying a user of documents that are relevant to an event of interest, comprising:
receiving one or more example documents that define the event; obtaining a plurality of types of media documents; using a model-based approach to determine relevance of the media documents to the event based on the one or more example documents; and alerting the user when one or more of the media documents are determined to be relevant.
RELATED APPLICATION
[0001] This application claims priority under 35 U.S.C. §119 based on U.S. Provisional Application Nos. 60/394,064 and 60/394,082, filed Jul. 3, 2002, and Provisional Application No. 60/419,214, filed Oct. 17, 2002, the disclosures of which are incorporated herein by reference.
[0002] This application is related to U.S. patent application, Ser. No. ______ (Docket No. 02-4039), entitled, “Systems and Methods for Providing Real-Time Alerting,” filed concurrently herewith and incorporated herein by reference.
GOVERNMENT CONTRACT
[0003] The U.S. Government may have a paid-up license in this invention and the right in limited circumstances to require the patent owner to license others on reasonable terms as provided for by the terms of Contract No. 2001*S651600*000 awarded by the Office of Advanced Information Technology.
Provisional Applications (3)
|
Number |
Date |
Country |
|
60394064 |
Jul 2002 |
US |
|
60394082 |
Jul 2002 |
US |
|
60419214 |
Oct 2002 |
US |