Not Applicable
Not Applicable
The present invention relates to printing of electronic documents and more particularly to method and apparatus for augmenting the printing of electronic documents with features to enhance the experience of reading or using the hardcopy of an electronic document.
Increasingly, readers of documents are being called upon to assimilate vast quantities of information in a short period of time. To meet the demands placed upon them, readers find they must read documents “horizontally,” rather than “vertically,” i.e., they must scan, skim, and browse sections of interest in multiple documents rather than read and analyze a single document from beginning to end.
Documents are becoming more widely available in electronic form. Some documents are available electronically by virtue of their having been created using word processing software. Other electronic documents are accessible via the Internet. Yet others may become available in electronic form by virtue of being scanned in, copied, or faxed. Commonly assigned U.S. application Ser. No. 08/754,721, entitled AUTOMATIC AND TRANSPARENT DOCUMENT ARCHIVING, the contents of which are incorporated herein by reference for all purposes, details a system for generating electronic as well as hardcopy format of documents.
However, the mere availability of documents in electronic form does not assist the reader in confronting the challenges of assimilating information quickly. Indeed, many time-challenged readers still prefer paper documents because of their portability and the ease of flipping through pages.
Certain tools take advantage of the electronic form documents to assist harried readers. Tools exist to search for documents both on the Internet and locally. Once a document is identified and retrieved, automatic summarization techniques, such as the Reader's Helper™, described in a commonly owned copending U.S. patent application Ser. No. 08/995,616, entitled AUTOMATIC ADAPTIVE DOCUMENT HELP SYSTEM, the contents of which are incorporated herein by reference for all purposes, helps the reader to find as well as assimilate the information he or she wants more quickly. However, there is heretofore no automatic assistance available to the reader who desires to work with printed hardcopy of electronic documents.
What is needed is a document printing system that helps the reader print the information he or she wants more quickly. The document printing system should be easily personalizable, flexible and adaptive as well.
An automatic printing assistant application for documents in electronic form is provided by virtue of the present invention. In certain embodiments, an elongated thumbnail image of all or part of an electronically stored document is displayed. A section of the document of interest to the reader is emphasized. Movement of the emphasized area in the elongated thumbnail image assists the user with the selection of sections or pages of the document for printing. The operation of the assistant is personalizable for a particular user by setting of a sensitivity level and selection of relevant topics of interest. Some embodiments of the assistant are also capable of improved performance over time by both automatic and manual feedback. The assistant is usable with many popular electronic document formats.
In accordance with a first aspect of the present invention, a method for adaptively controlling printing of an electronically stored document includes a step of accepting user input indicating a user-specified concept of interest. A step of analyzing the electronically stored document to identify locations of discussion of the user-specified concept of interest may also be included. Embodiments can also include a step of displaying visual indications of the identified locations. In another step, user input indicating a print preference for certain locations is accepted. Finally, portions of the electronic document corresponding to the user's print preferences are printed.
In accordance with a second aspect of the present invention, the method for assisting a reader in printing an electronically stored document also includes a step of accepting user input indicating a print preference by emphasizing an area of interest to the user in a thumbnail image corresponding to a section of interest to the user in the document. The user can control printing by sliding the emphasized area through thumbnail image for the purposes of indicating sections of the electronically stored document to print.
In select embodiments in accordance with the present invention, the method includes a step of displaying an elongated thumbnail image of a portion of the electronically stored document in a viewing area of a display.
In certain embodiments in accordance with the present invention, the step of analyzing the electronically stored document to identify locations of discussion of the user-specified concept of interest may be realized by exploiting a probabilistic inference method, such as a Bayesian belief network or its equivalent to identify such locations.
Numerous benefits are achieved by way of the present invention over conventional techniques. In some embodiments, the present invention is more user friendly than conventional techniques. The present invention can provide a way for the user to obtain hardcopy of only those sections of a large document which contain concepts of interest. Some embodiments according to the invention are more robust than known techniques. These and other benefits are described throughout the present specification and more particularly below.
A further understanding of the nature and advantages of the inventions herein may be realized by reference to the remaining portions of the specification and the attached drawings.
The present invention provides a personalizable system for printing automatically annotated documents based upon concepts of interest to a particular user. An embodiment according to the present invention has been reduced to practice under the name Reader's Helper™.
Computer System Usable for Implementing the Present Invention
Annotated Document Printing User Interface
An elongated thumbnail image 214 of many pages, or all of an electronically stored document 502 of
The elongated thumbnail image 214 in
The annotations in elongated thumbnail image 214 make it very easy to find relevant text anywhere in document 502. Furthermore, elongated thumbnail image 214 provides a highly useful way of keeping track of a lengthy document. These features enable the user 504 to readily locate portions of the document for printing.
A series of concept indicators 206 permit the user to view which concepts of interest are to be noted in the document. Concepts are defined by means of a defined concepts list 806 shown in
A sensitivity control 208 permits the user to select the degree of sensitivity to apply in identifying potential locations of relevant discussion for printing. At low sensitivity, more locations will be denoted as being relevant, even though some may not be of any actual interest. At high sensitivity, most all denoted locations will in fact be relevant but some other relevant locations may be missed. After each concept name appearing by one of concept indicators 206 appears a percentage giving the relevance of the currently viewed document to the concept. These relevance levels offer a quick assessment of the relevance of the document to the selected concepts.
Miscellaneous printing options are found on a printing options button bar 216.
Printing Assistance
In a particular embodiment, user 504 uses a sensitivity control 208 to select the degree of sensitivity to apply in identifying potential locations of relevant discussion. At low sensitivity, more locations will be denoted as being relevant, even though some may not be of any actual interest. At high sensitivity, most all denoted locations will in fact be relevant but some other relevant locations may be missed.
Automatic Annotation Software
Annotation agent 508 adds the annotations to document 502 to prepare it for viewing by document browser 506 and printing by printing dialog 200. Processing by annotation agent 508 may be understood to be in three stages, a text processing stage 510, a content recognition stage 512, and a formatting stage 514. The input to text processing stage 510 is raw text. The output from text processing stage 510 and input to content recognition stage 512 is a parsed text stream, a text stream with formatting information such as special tags around particular words or phrases removed. The output from content recognition stage 512 and input to formatting stage 514 is an annotated text stream. The output of formatting stage 514 is a formatted text file which may be printed using print dialog 200 or viewed with document browser 506.
The processing of annotation agent 508 is preferably a run-time process. The annotations are not preferably pre-inserted into the text but are rather generated when user 504 requests document 502 for browsing. Thus, this is preferably a dynamic process. Annotation agent 508 may also, however, operate in the background as a batch process.
The annotation added by annotation agent 508 depends on concepts of interest selected by user 504. User 504 also inputs information used by annotation agent 508 to identify locations of discussion of concepts of interest in document 502. In a preferred embodiment, this information defines the structure of a Bayesian belief network. The concepts of interest and other user-specific information are maintained in a user profile file 516. User 504 employs a profile editor 518 to modify the contents of user profile file 516.
Text processing stage 510 includes a file I/O stage 612, an updating stage 614, and a language processing stage 616. File I/O stage reads the document file from network 602. Updating stage 614 maintains a history of recently visited documents in a history file 618. Language processing stage 616 parses the text of document 502 to generate the parsed text output of text processing stage 510.
Pattern identification stage 620 looks for keywords and key phrases of interest and locates relevant discussion of concepts based on the located keywords. The identification of keywords and the application of the keywords to locating relevant discussion is preferably accomplished by reference to a belief system. The belief system is preferably a Bayesian belief network.
The structure of Bayesian belief network 700 is only one possible structure applicable to the present invention. For example, one could employ a Bayesian belief network with more than two levels of hierarchy so that the presence of subconcepts is suggested by the presence of “subsubconcepts” and so on. In the preferred embodiment, presence of a keyword or key phrase always indicates presence of discussion of the subconcept but it is also possible to configure the belief network so that presence of a keyword or key phrase suggests discussion of the subconcept with a specified probability.
The primary source for the structure of Bayesian belief network 700 including the selection of concepts, keywords and key phrases, interconnections, and probabilities is user profile file 516. In a preferred embodiment, user profile file 516 is selectable for both editing and use from among profiles for many users.
The structure of belief system 700 is modifiable during use of the annotation system. The modifications may occur automatically in the background or may involve explicit user feedback input. The locations of concepts of interest determined by pattern identification stage 620 are monitored by profile updating stage 624. Profile updating stage 624 notes the proximity of other keywords and key phrases within each analyzed document to the locations of concepts of interest. If particular keywords and key phrases are always near a concept of interest, the structure and contents of belief system 700 are updated in the background without user input by profile updating stage 624. This could mean changing probability values, introducing a new connection between a subconcept and concept, or introducing a new keyword or key phrase.
User 504 may select a word or phrase in document 502 as being relevant to a particular concept even though the word or phrase has not yet defined to be a keyword or key phrase. Belief system 700 is then updated to include the new keyword or key phrase.
User 504 may also give feedback for an existing key word or key phrase, indicating the perceived relevance of the keyword or key phrase to the concept of interest. If the selected keyword or key phrase is indicated to be of high relevance to the concept of interest, the probability values connecting the subconcept indicated by the selected keywords or key phrases to the concept of interest increases. If, on the other hand, user 504 indicates the selected keywords or key phrases to be of little interest, the probability values connecting these keywords or key phrases to the concept decrease.
User Profile and Feedback Interfaces
A defined concepts list 806 lists all of the concepts which have already been added to the user profile. By selecting a concept add button 808, the user may add a new concept. By selecting a concept edit button 810, the user may modify the belief network as it pertains to the listed concept that is currently selected. By selecting a remove button 812, the user may delete a concept.
If a concept has been selected for editing, its name appears in a concept name box 813. The portion of the belief network pertaining to the selected concept is shown in a belief network display window 814. Belief network display window 814 shows the selected concept, the subconcepts which have been defined as relating to the selected concept and the percentage values associated with each relationship. The user may add a subconcept by selecting a subconcept add button 815. The user may edit a subconcept by selecting the subconcept in belief network display window 814 and then selecting a subconcept edit button 816. A subconcept remove button 818 permits the user to delete a subconcept from the belief network.
Selecting subconcept add button 815 causes a subconcept add window 820 to appear. Subconcept add window 820 includes a subconcept name box 822 for entering the name of a new subconcept. A slider control 824 permits the user to select the percentage value that defines the probability of the selected concept appearing given that the newly selected subconcept appears. A keyword list 826 lists the keywords and key phrases which indicate discussion of the subconcept. The user adds to the list by selecting a keyword add button 828 which causes display of a dialog box (not shown) for entering the new keyword or key phrase. The user deletes a keyword or key phrase by selecting it and then selecting a keyword delete button 830. Once the user has finished defining the new subconcept, he or she confirms the definition by selecting an OK button 832. Selection of a cancel button 834 dismisses subconcept add window 820 without affecting the belief network contents or structure. Selection of subconcept edit button 816 causes display of a window similar to subconcept add window 820 permitting redefinition of the selected subconcept.
By determining whether a background learning checkbox 836 has been selected, the user may enable or disable the operation of profile updating stage 624. A web autofetch check box 838 permits the user to select whether or not to enable an automatic web search process. When this web search process is enabled, whenever a particular keyword or key phrase is found frequently near where a defined concept is determined to be discussed, a web search tool such as AltaVista™ is employed to look on the World Wide Web for documents containing the keyword or key phrase. A threshold slider control 840 is provided to enable the user to set a threshold relevance level for this autofetching process.
Software Implementation
In a preferred embodiment, software to implement the present invention is written in the Jave™ computer programming language. Preferably, the software forms a part of a stand-alone browser program written in the Jave™ language. Alternatively, the code may be in the form of a so-called “plug-in” operating with a Jave™ -equipped web browser used to browse HTML documents including the special annotation tags explained above.
In the foregoing specification, the invention has been described with reference to specific exemplary embodiments thereof. For example, any probabilistic inference method may be substituted for a Bayesian belief network. It will, however, be evident that various modifications and changes may be made thereunto without departing from the broader spirit and scope of the invention as set forth in the appended claims and their full scope of equivalents.
The present application is a continuation application of and claims priority to U.S. Non-Provisional patent application Ser. No. 10/402,844, filed on Mar. 28, 2003, which is a continuation of U.S. Non-Provisional patent application Ser. No. 09/149,920, filed on Jun. 24, 2003, the entire contents of which are herein incorporated by reference for all purposes. The following related commonly-owned copending application is hereby incorporated by reference in its entirety for all purposes: U.S. Non-Provisional patent application Ser. No. 09/149,921, filed on Sep. 9, 1998 entitled, “AUTOMATIC ADAPTIVE DOCUMENT HELP FOR PAPER DOCUMENTS.” Further, this application incorporates by reference the following commonly owned copending U.S. patent application in its entirety for all purposes: U.S. Non-Provisional patent application Ser. No. 08/995,616, filed Dec. 22, 1997, entitled “AUTOMATIC ADAPTIVE DOCUMENT HELP SYSTEM.”
Number | Name | Date | Kind |
---|---|---|---|
4417239 | Demke et al. | Nov 1983 | A |
4823303 | Terasawa | Apr 1989 | A |
5153831 | Yianilos | Oct 1992 | A |
5297042 | Morita | Mar 1994 | A |
5299123 | Wang et al. | Mar 1994 | A |
5309359 | Katz et al. | May 1994 | A |
5339391 | Wroblewski et al. | Aug 1994 | A |
5349658 | O'Rourke et al. | Sep 1994 | A |
5384703 | Withgott et al. | Jan 1995 | A |
5404295 | Katz et al. | Apr 1995 | A |
5418948 | Turtle | May 1995 | A |
5442795 | Levine et al. | Aug 1995 | A |
5446891 | Kaplan et al. | Aug 1995 | A |
5479600 | Wroblewski et al. | Dec 1995 | A |
5481666 | Nguyen et al. | Jan 1996 | A |
5519857 | Kato et al. | May 1996 | A |
5523945 | Satoh et al. | Jun 1996 | A |
5530942 | Tzou et al. | Jun 1996 | A |
5533182 | Bates et al. | Jul 1996 | A |
5535382 | Ogawa | Jul 1996 | A |
5546502 | Hart et al. | Aug 1996 | A |
5596700 | Darnell et al. | Jan 1997 | A |
5598557 | Doner et al. | Jan 1997 | A |
5638543 | Pedersen et al. | Jun 1997 | A |
5680636 | Levine et al. | Oct 1997 | A |
5694559 | Hobson et al. | Dec 1997 | A |
5696963 | Ahn | Dec 1997 | A |
5721897 | Rubinstein | Feb 1998 | A |
5721902 | Schultz | Feb 1998 | A |
5737599 | Rowe et al. | Apr 1998 | A |
5748805 | Withgott et al. | May 1998 | A |
5751283 | Smith | May 1998 | A |
5751287 | Hahn et al. | May 1998 | A |
5754939 | Herz et al. | May 1998 | A |
D395297 | Cheng et al. | Jun 1998 | S |
5761655 | Hoffman | Jun 1998 | A |
5768578 | Kirk et al. | Jun 1998 | A |
5774118 | Hatakama | Jun 1998 | A |
5774888 | Light | Jun 1998 | A |
5778363 | Light | Jul 1998 | A |
5778397 | Kupiec et al. | Jul 1998 | A |
5781785 | Rowe et al. | Jul 1998 | A |
5784616 | Horvitz | Jul 1998 | A |
D398299 | Ballay et al. | Sep 1998 | S |
5806079 | Rivette et al. | Sep 1998 | A |
D400195 | Utesch | Oct 1998 | S |
5818446 | Bertram et al. | Oct 1998 | A |
5819301 | Rowe et al. | Oct 1998 | A |
D400520 | Baker et al. | Nov 1998 | S |
5832474 | Lopresti et al. | Nov 1998 | A |
5835922 | Shima et al. | Nov 1998 | A |
5838317 | Bolnick et al. | Nov 1998 | A |
5838323 | Rose et al. | Nov 1998 | A |
5845305 | Kujiraoka | Dec 1998 | A |
5847709 | Card et al. | Dec 1998 | A |
5857185 | Yamamura | Jan 1999 | A |
5860074 | Rowe et al. | Jan 1999 | A |
5870770 | Wolfe | Feb 1999 | A |
5873107 | Borovoy et al. | Feb 1999 | A |
5920859 | Li | Jul 1999 | A |
5926808 | Evans et al. | Jul 1999 | A |
5933841 | Schumacher et al. | Aug 1999 | A |
5943669 | Numata | Aug 1999 | A |
5943679 | Niles et al. | Aug 1999 | A |
5946678 | Aalbersberg | Aug 1999 | A |
5950187 | Tsuda | Sep 1999 | A |
5963940 | Liddy et al. | Oct 1999 | A |
5987454 | Hobbs | Nov 1999 | A |
5987460 | Niwa et al. | Nov 1999 | A |
6006218 | Breese et al. | Dec 1999 | A |
6006225 | Bowmann et al. | Dec 1999 | A |
D418826 | Pavely et al. | Jan 2000 | S |
D419144 | Baker et al. | Jan 2000 | S |
6012053 | Pant et al. | Jan 2000 | A |
6021403 | Horvitz et al. | Feb 2000 | A |
6026409 | Blumenthal | Feb 2000 | A |
6028601 | Machiraju et al. | Feb 2000 | A |
6041323 | Kubota | Mar 2000 | A |
6055542 | Nielsen et al. | Apr 2000 | A |
D424036 | Arora et al. | May 2000 | S |
D425039 | Shields | May 2000 | S |
D425497 | Eisenberg et al. | May 2000 | S |
6057839 | Advani et al. | May 2000 | A |
6058202 | Yamaguchi | May 2000 | A |
6070176 | Downs et al. | May 2000 | A |
6076088 | Paik et al. | Jun 2000 | A |
6094648 | Aalbersberg | Jul 2000 | A |
6101503 | Cooper et al. | Aug 2000 | A |
6122647 | Horowitz et al. | Sep 2000 | A |
6131092 | Masand | Oct 2000 | A |
D435561 | Pettigrew et al. | Dec 2000 | S |
6167368 | Wacholder | Dec 2000 | A |
6182067 | Presnell et al. | Jan 2001 | B1 |
6182090 | Peairs | Jan 2001 | B1 |
D439585 | Law et al. | Mar 2001 | S |
6205456 | Nakao | Mar 2001 | B1 |
6222547 | Schwuttke et al. | Apr 2001 | B1 |
6236768 | Rhodes et al. | May 2001 | B1 |
D445802 | Greminger | Jul 2001 | S |
6259458 | Theisen et al. | Jul 2001 | B1 |
6275229 | Weiner et al. | Aug 2001 | B1 |
D449050 | Graham | Oct 2001 | S |
6326957 | Nathan et al. | Dec 2001 | B1 |
6335730 | Gould | Jan 2002 | B1 |
6339437 | Nielsen | Jan 2002 | B1 |
6369811 | Graham et al. | Apr 2002 | B1 |
D456817 | Graham | May 2002 | S |
6397213 | Cullen et al. | May 2002 | B1 |
6457026 | Graham et al. | Sep 2002 | B1 |
6470307 | Turney | Oct 2002 | B1 |
6535890 | Sandifer | Mar 2003 | B2 |
6553373 | Boguraev et al. | Apr 2003 | B2 |
6564250 | Nguyen | May 2003 | B1 |
6582475 | Graham et al. | Jun 2003 | B2 |
6681370 | Gounares et al. | Jan 2004 | B2 |
6792430 | Kenyon et al. | Sep 2004 | B1 |
6839702 | Patel et al. | Jan 2005 | B1 |
6968332 | Milic-Frayling et al. | Nov 2005 | B1 |
7228492 | Graham | Jun 2007 | B1 |
20020042792 | Nishioka et al. | Apr 2002 | A1 |
20020065814 | Okamoto et al. | May 2002 | A1 |
20030051214 | Graham et al. | Mar 2003 | A1 |
20040059708 | Dean et al. | Mar 2004 | A1 |
20070180372 | Graham | Aug 2007 | A1 |
Number | Date | Country |
---|---|---|
378 848 | Jul 1990 | EP |
0459174 | Dec 1991 | EP |
737 927 | Oct 1996 | EP |
762 297 | Mar 1997 | EP |
802 492 | Oct 1997 | EP |
2156118 | Oct 1985 | GB |
2234609 | Feb 1991 | GB |
2137788 | Oct 1994 | GB |
2290898 | Jan 1996 | GB |
2332544 | Jun 1999 | GB |
02-001057 | Jan 1990 | JP |
05-81327 | Apr 1993 | JP |
05-081327 | Apr 1993 | JP |
06-203024 | Jul 1994 | JP |
07-146872 | Jun 1995 | JP |
08-255163 | Oct 1996 | JP |
8-297677 | Nov 1996 | JP |
11-213011 | Aug 1999 | JP |
WO 97-12328 | Apr 1997 | WO |
Number | Date | Country | |
---|---|---|---|
20070016856 A1 | Jan 2007 | US |
Number | Date | Country | |
---|---|---|---|
Parent | 10402844 | Mar 2003 | US |
Child | 11482529 | US | |
Parent | 09149920 | Sep 1998 | US |
Child | 10402844 | US |