Claims
- 1. A method of OCR output error detection, comprising the steps of:recognizing a plurality of characters in a document image; determining words from a sequence of said plurality of characters; determining regions of the document image that correspond to said words; correlating said words to said regions of said document image in a correlation table; determining a recognition confidence parameter for a plurality of words in said correlation table; defining a threshold level for said recognition confidence parameter; and displaying the regions of the document image containing a word having a recognition confidence parameter greater than said threshold level.
- 2. The method of claim 1, further comprising the steps of:receiving input that selects a region in the document image; determining a word from said correlation table that corresponds to said selected region; and displaying the word corresponding to said region.
- 3. The method of claim 2, wherein the step of displaying the word includes the step of displaying the word in a pop-up menu.
- 4. The method of claim 1, further comprising the steps of:determining a color for the regions having a recognition confidence parameter less than said threshold value; and displaying the regions of the document image having said color.
- 5. An apparatus for OCR output error detection, comprising:an OCR device for recognizing a plurality of characters in a document image; means for determining words from a sequence of said plurality of characters; means for determining regions of the document image that correspond to said words; means for correlating said words to said regions of said document image in a correlation table; means for determining a recognition confidence parameter for a plurality of words in said correlation table; means for defining a threshold level for said recognition confidence parameter; and a display for displaying the regions of the document image containing a word having a recognition confidence parameter greater than said threshold level.
- 6. The apparatus of claim 5, further comprising:a cursor control for receiving input that selects a region in the document image; and means for determining a word from said correlation table that corresponds to said selected region; wherein the display displays the word corresponding to said region.
- 7. The apparatus of claim 6, wherein the display displays the word corresponding to said region in a pop-up menu.
- 8. The apparatus of claim 5, further comprising:means for determining a color for the regions having a recognition confidence parameter less than said threshold value; wherein the display displays the regions of the document image having said color.
- 9. A computer readable medium having sequences of instructions for OCR output error detection, said sequences of instructions including sequences of instructions for performing the steps of:recognizing a plurality of characters in a document image; determining words from a sequence of said plurality of characters; determining regions of the document image that correspond to said words; correlating said words to said regions of said document image in a correlation table; determining a recognition confidence parameter for a plurality of words in said correlation table; defining a threshold level for said recognition confidence parameter; and displaying the regions of the document image containing a word having a recognition confidence parameter greater than said threshold level.
- 10. The computer readable medium of claim 9, wherein said sequences of instructions further include sequences of instructions for performing the steps of:receiving input that selects a region in the document image; determining a word from said correlation table that corresponds to said selected region; and displaying the word corresponding to said region.
- 11. The computer readable medium of claim 10, wherein the step of displaying the word includes the step of displaying the word in a pop-up menu.
- 12. The computer readable medium of claim 9, wherein said sequences of instructions further include the steps of:determining a color for the regions having a recognition confidence parameter less than said threshold value; and displaying the regions of the document image having said color.
Parent Case Info
This application is a divisional of patent application Ser. No. 08/900,547 filed Jul. 25, 1997, now abandoned.
US Referenced Citations (13)