Claims
- 1. A method for facilitating perusal of an item of interest, comprising:
retrieving a textual representation of the item; presenting the textual representation to a user; obtaining an original form of the item; providing the item to the user in the original form; and visually synchronizing the providing of the item in the original form with the textual representation of the item.
- 2. The method of claim 1, wherein the retrieving a textual representation includes:
generating a request concerning the item of interest, sending the request to a server, and obtaining, from the server, the textual representation of the item.
- 3. The method of claim 2, wherein the obtaining the textual representation includes:
using the request, by the server, to retrieve metadata relating to the item from a metadata database, generating the textual representation of the item from the metadata, and receiving, from the server, the generated textual representation.
- 4. The method of claim 3, wherein the generating the textual representation includes: creating a HyperText Markup Language document from the metadata.
- 5. The method of claim 1, wherein the presenting the textual representation includes: providing the textual representation within a graphical user interface of a web browser.
- 6. The method of claim 1, wherein the obtaining an original form of the item includes:
accessing a database of original media to retrieve the item in the original form.
- 7. The method of claim 1, wherein the obtaining an original form of the item includes:
receiving input, from the user, regarding a desire for the item in the original form, initiating a media player, and using the media player to obtain the item in the original form.
- 8. The method of claim 7, wherein the receiving input from the user includes:
receiving selection of a portion of the textual representation.
- 9. The method of claim 8, wherein the using the media player includes:
determining, by the media player, the portion selected by the user, and retrieving the item in the original form corresponding to the determined portion.
- 10. The method of claim 9, wherein the determining the portion includes:
identifying time codes associated with a beginning and an ending of the selected portion.
- 11. The method of claim 9, wherein the portion selected by the user includes a starting position in the textual representation; and
wherein the determining the portion includes:
identifying time codes associated with the starting position in the textual representation.
- 12. The method of claim 1, wherein the textual representation includes time codes corresponding to when words in the textual representation were spoken.
- 13. The method of claim 12, wherein the visually synchronizing the providing of the item includes:
comparing times corresponding to the providing of the item in the original form to the time codes from the textual representation, and visually distinguishing words in the textual representation when the words are spoken during the providing of the item in the original form.
- 14. The method of claim 1, wherein the providing the item to the user includes:
permitting the user to control the providing of the item in the original form.
- 15. The method of claim 14, wherein the permitting the user to control the providing includes:
allowing the user to at least one of fast forward, speed up, slow down, and back up the providing of the item in the original form.
- 16. The method of claim 1, wherein the item is an audio file and the textual representation of the item includes a transcription of the audio file and at least one of a speaker identifier, a topic, and one or more word time codes.
- 17. The method of claim 1, wherein the item is a video file and the textual representation of the item includes a transcription of the video file and at least one of a speaker identifier, a topic, and one or more word time codes.
- 18. The method of claim 1, wherein the original form of the item includes a format in which the item was originally created.
- 19. A system for facilitating browsing of an item of interest, comprising:
means for obtaining a transcription of the item; means for providing the transcription to a user; means for retrieving the item in an original form; means for presenting the item to the user in the original form; and means for visually synchronizing the presenting of the item in the original form with the transcription of the item.
- 20. A system for aiding a user in browsing information of interest, comprising:
a memory configured to store instructions; and a processor configured to execute the instructions in memory to:
obtain a transcription of the information, provide the transcription to a user, retrieve the information in an original format, present the information to the user in the original format, and visually synchronize the presentation of the information in the original format with the transcription of the information.
- 21. The system of claim 20, wherein when obtaining a transcription, the processor is configured to:
generate a request concerning the information of interest, send the request to a server, and obtain, from the server, the transcription of the information.
- 22. The system of claim 20, wherein when providing the transcription, the processor is configured to:
present the transcription within a graphical user interface of a web browser.
- 23. The system of claim 20, wherein when retrieving the information in an original format, the processor is configured to:
obtain the information from a database of original media.
- 24. The system of claim 20, wherein when retrieving the information in an original format, the processor is configured to:
receive input, from the user, regarding a desire for the information in the original format, and initiate a media player to obtain the information in the original format.
- 25. The system of claim 20, wherein when retrieving the information in an original format, the processor is configured to:
receive input, from the user, regarding a desire for the information in the original format, receive selection of a portion of the transcription by the user, and obtain the information in the original format corresponding to the selected portion.
- 26. The system of claim 25, wherein when obtaining the information in the original format, the processor is configured to:
identify time codes associated with a beginning and an ending of the selected portion.
- 27. The system of claim 25, wherein the portion selected by the user includes a starting position in the transcription; and
wherein when obtaining the information in the original format, the processor is configured to:
identify time codes associated with the starting position in the transcription.
- 28. The system of claim 20, wherein the transcription includes time codes corresponding to when words in the transcription were spoken.
- 29. The system of claim 28, wherein when visually synchronizing the presentation of the information, the processor is configured to:
compare times corresponding to the presentation of the information in the original format to the time codes from the transcription, and visually distinguish words in the transcription when the words are played back during the presentation of the information in the original format.
- 30. The system of claim 20, wherein when presenting the information to the user, the processor is configured to:
permit the user to control the presentation of the information in the original format.
- 31. The system of claim 30, wherein when permitting the user to control the presentation of the information, the processor is configured to:
provide controls to the user to allow the user to at least one of fast forward, speed up, slow down, and back up the presentation of the information in the original format.
- 32. The system of claim 20, wherein the information is an audio file and the transcription of the information includes a transcription of the audio file and at least one of a speaker identifier, a topic, and one or more word time codes.
- 33. The system of claim 20, wherein the information is a video file and the transcription of the information includes a transcription of the video file and at least one of a speaker identifier, a topic, and one or more word time codes.
- 34. The system of claim 20, wherein the original format of the information includes a form in which the information was originally created.
- 35. A computer-readable medium that contains instructions for causing at least one processor to perform a method for facilitating browsing of audio and video information, comprising:
instructions for retrieving a textual representation of the information; instructions for presenting the textual representation to a user; instructions for obtaining the information in an original format; instructions for providing the information to the user in the original format; and instructions for visually synchronizing the providing of the information in the original format with the textual representation of the information.
- 36. A graphical user interface, comprising:
a transcription section that includes a transcription of non-text information; a speaker section that identifies boundaries between speakers in the transcription section; a topic section that includes one or more topics relating to the transcription; and a request media button that, when selected, causes:
retrieval of the non-text information to be initiated, playing of the non-text information, and the playing of the non-text information to be visually synchronized with the transcription in the transcription section.
- 37. The graphical user interface of claim 36, wherein the transcription visually distinguishes names of people, places, and organizations.
- 38. The graphical user interface of claim 36, wherein the speaker section further includes at least one of gender and names of the speakers.
- 39. The graphical user interface of claim 36, wherein the one or more topics relate to one or more main themes of the transcription.
- 40. The graphical user interface of claim 36, wherein the transcription includes time codes that identify when words in the transcription were spoken with regard to the non-text information.
- 41. The graphical user interface of claim 40, wherein the request media button causes words in the transcription to be visually distinguished in synchronism with the words in the non-text information being played.
- 42. The graphical user interface of claim 36, wherein the non-text information includes at least one of audio and video.
RELATED APPLICATIONS
[0001] This application claims priority under 35 U.S.C. § 119 based on U.S. Provisional Application Nos. 60/394,064 and 60/394,082, filed Jul. 3, 2002, and Provisional Application No. 60/419,214, filed Oct. 17, 2002, the disclosures of which are incorporated herein by reference.
[0002] This application is related to U.S. patent application Ser. No. ______ (Docket No. 02-4038), entitled, “Systems and Methods for Aiding Human Translation,” filed concurrently herewith and incorporated herein by reference.
GOVERNMENT CONTRACT
[0003] The U.S. Government may have a paid-up license in this invention and the right in limited circumstances to require the patent owner to license others on reasonable terms as provided for by the terms of Contract No. N66001-00-C-8008 awarded by the Defense Advanced Research Projects Agency (DARPA).
Provisional Applications (3)
|
Number |
Date |
Country |
|
60394064 |
Jul 2002 |
US |
|
60394082 |
Jul 2002 |
US |
|
60419214 |
Oct 2002 |
US |