MULTIPLE IMAGE INPUT FOR OPTICAL CHARACTER RECOGNITION PROCESSING SYSTEMS AND METHODS

Information

  • Patent Application
  • 20070211942
  • Publication Number
    20070211942
  • Date Filed
    November 15, 2006
  • Date Published
    September 13, 2007
Abstract
A method of processing an image includes receiving a digital version of the image, processing the digital version of the image through at least two binarization processes to thereby create a first binarization and a second binarization, and processing the first binarization through a first optical character recognition process to thereby create a first OCR output file. Processing the first binarization through a first optical character recognition process includes compiling first metrics associated with the first OCR output file. The method also includes processing the second binarization through the first optical character recognition process to thereby create a second OCR output file. Processing the second binarization through the first optical character recognition process includes compiling second metrics associated with the second OCR output file. The method also includes using the metrics, at least in part, to select a final OCR output file from among the OCR output files.
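The flow summarized in the abstract can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the implementation disclosed in the application: the names `OcrResult`, `compile_metrics`, `select_final_output`, `threshold_at`, and `fake_ocr` are hypothetical, the OCR engine is a stand-in stub, and the scoring rule (most dictionary words, then fewest unknown words) is only one plausible way of "using the metrics, at least in part".

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Set

Image = List[List[int]]    # grayscale rows, 0 = black .. 255 = white
Bitonal = List[List[int]]  # rows of 0/1, where 1 marks ink


@dataclass
class OcrResult:
    binarizer: str         # which binarization produced this output
    text: str              # the OCR output "file" (plain text in this sketch)
    chars: int             # number of characters recognized
    dictionary_words: int  # words found in the dictionary
    unknown_words: int     # words not found in the dictionary


def compile_metrics(name: str, text: str, dictionary: Set[str]) -> OcrResult:
    """Compile a few of the metrics listed in the claims for one OCR output."""
    words = [w.strip(".,;:!?").lower() for w in text.split()]
    words = [w for w in words if w]
    known = sum(1 for w in words if w in dictionary)
    chars = sum(len(w) for w in words)
    return OcrResult(name, text, chars, known, len(words) - known)


def select_final_output(
    image: Image,
    binarizers: Dict[str, Callable[[Image], Bitonal]],
    ocr: Callable[[Bitonal], str],
    dictionary: Set[str],
) -> OcrResult:
    """Run every binarization through the same OCR process and pick one output."""
    results = [
        compile_metrics(name, ocr(binarize(image)), dictionary)
        for name, binarize in binarizers.items()
    ]
    # Assumed scoring rule: most dictionary words, ties broken by fewest unknowns.
    return max(results, key=lambda r: (r.dictionary_words, -r.unknown_words))


def threshold_at(t: int) -> Callable[[Image], Bitonal]:
    """Hypothetical global-threshold binarizer factory."""
    return lambda img: [[1 if px < t else 0 for px in row] for row in img]


def fake_ocr(bitonal: Bitonal) -> str:
    """Stand-in for a real OCR engine: output quality depends on the ink count."""
    ink = sum(sum(row) for row in bitonal)
    return "the quick brown fox" if ink == 2 else "tbe qu1ck brown fox"


page = [[30, 200], [90, 220]]
best = select_final_output(
    page,
    {"global-128": threshold_at(128), "global-64": threshold_at(64)},
    fake_ocr,
    {"the", "quick", "brown", "fox"},
)
print(best.binarizer, best.dictionary_words, best.unknown_words)
```

In this toy run the threshold of 64 drops one of the two dark pixels, the stub engine returns a degraded string, and the metrics favor the output produced from the threshold of 128.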
Description

BRIEF DESCRIPTION OF THE DRAWINGS

A further understanding of the nature and advantages of the present invention may be realized by reference to the remaining portions of the specification and the drawings wherein like reference numerals are used throughout the several drawings to refer to similar components. Further, various components of the same type may be distinguished by following the reference label by a dash and a second label that distinguishes among the similar components. If only the first reference label is used in the specification, the description is applicable to any one of the similar components having the same first reference label irrespective of the second reference label.



FIG. 1A depicts an exemplary Optical Character Recognition (OCR) system according to embodiments of the invention.



FIG. 1B depicts a block diagram of an exemplary Binarization/OCR process according to embodiments of the invention, which process may be implemented in the system of FIG. 1A.



FIG. 2 depicts an exemplary OCR process according to embodiments of the invention, which process may be implemented in the system of FIG. 1A.


Claims
  • 1. A method of processing an image, comprising: receiving a digital version of the image; processing the digital version of the image through at least two binarization processes to thereby create a first binarization and a second binarization; processing the first binarization through a first optical character recognition process to thereby create a first OCR output file, wherein processing the first binarization through a first optical character recognition process comprises compiling first metrics associated with the first OCR output file; processing the second binarization through the first optical character recognition process to thereby create a second OCR output file, wherein processing the second binarization through the first optical character recognition process comprises compiling second metrics associated with the second OCR output file; and using the metrics, at least in part, to select a final OCR output file from among the OCR output files.
  • 2. The method of claim 1, further comprising: processing the first binarization through a second optical character recognition process to thereby create a third OCR output file, wherein processing the first binarization through a second optical character recognition process comprises compiling third metrics associated with the third OCR output file; and processing the second binarization through the second optical character recognition process to thereby create a fourth OCR output file, wherein processing the second binarization through the second optical character recognition process comprises compiling fourth metrics associated with the fourth OCR output file.
  • 3. The method of claim 1, wherein the binarization processes are selected from a group consisting of: clustering; global-thresholding; and adaptive thresholding.
  • 4. The method of claim 1, wherein the first and second optical character recognition processes comprise the same optical character recognition process.
  • 5. The method of claim 1, wherein the first and second optical character recognition processes comprise different optical character recognition processes.
  • 6. The method of claim 1, wherein the metrics associated with a particular output file comprise one or more selections from a group consisting of: a number of characters recognized in the particular output file; a number of dictionary words in the particular output file; a number of unknown words in the particular output file; a per-character confidence level in the particular output file; a per-word confidence level in the particular output file; a per-image confidence level in the particular output file; a degree of agreement between the particular output file and other output files; which binarization and OCR process produced the particular output file; and a measure of historical accuracy associated with the particular binarization/OCR combination that produced the particular output file.
  • 7. The method of claim 1, further comprising creating the digital version of the image from a physical version of the image.
  • 8. A method of optically recognizing characters in an image, comprising: creating multiple binarizations of the image using different binarization techniques; presenting each binarization to an optical character recognition (OCR) engine to produce an OCR output file for each binarization; developing metrics relating to each OCR output file; and using the metrics, at least in part, to select a final OCR output file from among the OCR output files.
  • 9. The method of claim 8, wherein the different binarization techniques comprise one or more selections from a group consisting of: clustering; global-thresholding; and adaptive thresholding.
  • 10. The method of claim 8, wherein presenting each binarization to an OCR engine comprises presenting each binarization to a different OCR engine.
  • 11. The method of claim 8, wherein the metrics relating to a particular OCR output file comprise one or more selections from a group consisting of: a number of characters recognized in the particular OCR output file; a number of dictionary words in the particular OCR output file; a number of unknown words in the particular OCR output file; a per-character confidence level in the particular output file; a per-word confidence level in the particular output file; a per-image confidence level in the particular output file; a degree of agreement between the particular OCR output file and other OCR output files; which binarization and OCR process produced the particular OCR output file; and a measure of historical accuracy associated with the particular binarization/OCR combination that produced the particular OCR output file.
  • 12. An optical character recognition system, comprising: at least two binarization processes configured to convert grayscale images to bitonal images; at least one optical character recognition process configured to process bitonal images into final output files having characters therein; a metrics generation process configured to analyze output files and produce metrics associated therewith; a voting process configured to select a final output file from among the output files based on the metrics; a storage arrangement configured to store final output files and serve the information therein to users; and at least one processor programmed to execute the at least two binarization processes, the at least one optical character recognition process, the metrics generation process, and the voting process.
  • 13. The optical character recognition system of claim 12, wherein the at least two binarization processes comprise at least one selection from a group consisting of: clustering; global-thresholding; and adaptive thresholding.
  • 14. The optical character recognition system of claim 12, wherein the metrics associated with a particular output file comprise one or more selections from a group consisting of: a number of characters recognized in the particular output file; a number of dictionary words in the particular output file; a number of unknown words in the particular output file; a per-character confidence level in the particular output file; a per-word confidence level in the particular output file; a per-image confidence level in the particular output file; a degree of agreement between the particular output file and other output files; which binarization and OCR process produced the particular output file; and a measure of historical accuracy associated with the particular binarization/OCR combination that produced the particular output file.
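Claims 3, 9, and 13 name three families of binarization: clustering, global thresholding, and adaptive thresholding. The claims do not specify particular algorithms, so the following is only a simplified textbook-style sketch of one member of each family, written in pure Python on small grayscale grids. The function names, the local-mean rule used for the adaptive case, and the two-means rule used for the clustering case are all assumptions made for illustration.

```python
from typing import List

Gray = List[List[int]]     # rows of grayscale values, 0 = black, 255 = white
Bitonal = List[List[int]]  # rows of 0/1, where 1 marks ink (a dark pixel)


def global_threshold(img: Gray, t: int = 128) -> Bitonal:
    """Global thresholding: one fixed cutoff for the whole image."""
    return [[1 if px < t else 0 for px in row] for row in img]


def adaptive_threshold(img: Gray, radius: int = 1, offset: int = 10) -> Bitonal:
    """Adaptive thresholding: compare each pixel with its local mean.

    A pixel counts as ink when it is darker than the mean of its
    (2*radius+1)-square neighborhood by more than `offset`.
    """
    h, w = len(img), len(img[0])
    out = []
    for y in range(h):
        row = []
        for x in range(w):
            window = [
                img[j][i]
                for j in range(max(0, y - radius), min(h, y + radius + 1))
                for i in range(max(0, x - radius), min(w, x + radius + 1))
            ]
            local_mean = sum(window) / len(window)
            row.append(1 if img[y][x] < local_mean - offset else 0)
        out.append(row)
    return out


def two_means_threshold(img: Gray, iterations: int = 20) -> Bitonal:
    """Clustering: split intensities into a dark and a light cluster (2-means)."""
    pixels = [px for row in img for px in row]
    dark, light = float(min(pixels)), float(max(pixels))
    for _ in range(iterations):
        cut = (dark + light) / 2
        d = [p for p in pixels if p < cut]
        l = [p for p in pixels if p >= cut]
        if not d or not l:  # uniform image: nothing to separate
            break
        dark, light = sum(d) / len(d), sum(l) / len(l)
    cut = (dark + light) / 2
    return [[1 if px < cut else 0 for px in row] for row in img]


# A page whose right side is shaded: background 200 on the left, 100 on the
# right, with one stroke on each side (120 on the left, 20 on the right).
line = [200, 120, 200, 200, 100, 100, 20, 100, 100]
page = [line[:] for _ in range(3)]

print(global_threshold(page)[0])               # shaded background becomes ink
print(adaptive_threshold(page, offset=40)[0])  # only the two strokes remain
print(two_means_threshold(page)[0])
```

On this shaded example the global and clustering variants both mark the dim background as ink, while the local-mean variant keeps only the two strokes. A different page could favor a different technique, which is the motivation for producing several binarizations and letting the metrics decide among the resulting OCR outputs.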
Provisional Applications (1)
Number Date Country
60780484 Mar 2006 US