Claims
- 1. A macro-detection method for segmenting and classifying a scanline of image data contained in an image page, the scanline of image data comprising a plurality of pixels, the method comprising the steps of: evaluating each pixel of the scanline of image data, and segmenting the scanline into at least one image run and at least one edge pixel, each image run being bordered by (i) an edge pixel at each end of the image run or (ii) an edge pixel at one end of the image run and a page edge of the image page at another end of the image run; determining at least one image type of each at least one image run; and performing cleanup procedures to determine an image type of each at least one edge pixel.
- 2. The macro-detection method of claim 1, further comprising the step of determining a confidence factor for the determined at least one image type of each at least one image run.
- 3. The macro-detection method of claim 2, wherein the step of determining at least one image type of each at least one image run comprises determining at least one image type of each at least one image run based on micro-detection results for the at least one image run and based on an image type and a confidence factor of an adjacent image run of an adjacent scanline.
- 4. The macro-detection method of claim 2, wherein the step of performing cleanup procedures further comprises re-determining at least one image type of an image run having a low confidence factor based on micro-detection results for the image run and based on an image type of a neighboring image run.
- 5. The macro-detection method of claim 1, wherein the step of performing cleanup procedures comprises determining an image type of a vertical edge based on micro-detection results for the vertical edge and based on an image type of at least one neighboring image run of the scanline.
- 6. The macro-detection method of claim 1, wherein the step of performing cleanup procedures comprises determining an image type of a horizontal edge based on micro-detection results for the horizontal edge and based on an image type of a neighboring segment of an adjacent scanline.
- 7. An apparatus that performs a macro-detection operation to segment and classify a scanline of image data contained in an image page, the scanline of image data comprising a plurality of pixels, the method comprising: a segmenting block that evaluates each pixel of the scanline of image data and segments the scanline into at least one image run and at least one edge pixel, each image run being bordered by (i) an edge pixel at each end of the image run or (ii) an edge pixel at one end of the image run and a page edge of the image page at another end of the image run; a classifying block that determines at least one image type of each at least one image run; and a cleanup procedures block that performs cleanup procedures to determine an image type of each at least one edge pixel.
- 8. The apparatus of claim 7, further comprising a confidence factor block that determines a confidence factor for the determined at least one image type of each at least one image run.
- 9. The apparatus of claim 8, wherein the classifying block determines at least one image type of each at least one image run based on micro-detection results for the at least one image run and based on an image type and a confidence factor of an adjacent image run of an adjacent scanline.
- 10. The apparatus of claim 8, wherein the cleanup procedures block redetermines at least one image type of an image run having a low confidence factor based on micro-detection results for the image run and based on an image type of at least one neighboring image run of the scanline.
- 11. The apparatus of claim 7, wherein the cleanup procedures block determines an image type of a vertical edge based on micro-detection results for the vertical edge and based on an image type of a neighboring left or right image run.
- 12. The apparatus of claim 7, wherein the cleanup procedures block determines an image type of a horizontal edge based on micro-detection results for the horizontal edge and based on an image type of a neighboring segment of an adjacent scanline.
- 13. A method for segmenting and classifying a scanline of image data, the scanline of image data comprising a plurality of pixels, the method comprising: evaluating each pixel of the scanline of image data to determine if any edge pixels are present in the scanline; segmenting the scanline into N+1 image runs when a number of edge pixels determined to be present is N; determining at least one image type of each image run based on micro-detection results; and performing cleanup procedures to determine an image type of each of the edge pixels determined to be present.
- 14. The method of claim 13, wherein the step of evaluating includes determining whether a pixel has an intensity different from an adjacent pixel by at least a predetermined amount.
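The segmentation recited in claims 13 and 14 can be illustrated with a minimal sketch. It is not the patented implementation. It assumes a scanline is a list of integer intensities and that an edge pixel is one whose intensity differs from its left neighbor by at least a threshold. The names `find_edge_pixels`, `segment_scanline`, and `threshold` are hypothetical and do not come from the claims. Runs are reported as half-open index ranges, and the N+1 count holds here only when the edge pixels are isolated and do not fall on a page edge.

```python
from typing import List, Tuple


def find_edge_pixels(scanline: List[int], threshold: int) -> List[int]:
    """Return indices of pixels whose intensity differs from the
    previous pixel by at least `threshold` (assumed edge criterion)."""
    return [
        i
        for i in range(1, len(scanline))
        if abs(scanline[i] - scanline[i - 1]) >= threshold
    ]


def segment_scanline(
    scanline: List[int], threshold: int
) -> Tuple[List[Tuple[int, int]], List[int]]:
    """Split a scanline into image runs separated by edge pixels.

    Returns (runs, edges). Each run is a half-open range (start, end)
    bordered by edge pixels or by the ends of the scanline (page edges).
    Empty runs, which arise when edge pixels are adjacent, are skipped.
    """
    edges = find_edge_pixels(scanline, threshold)
    runs: List[Tuple[int, int]] = []
    start = 0
    for e in edges:
        if e > start:
            runs.append((start, e))
        start = e + 1  # the edge pixel itself belongs to no run
    if start < len(scanline):
        runs.append((start, len(scanline)))
    return runs, edges


if __name__ == "__main__":
    line = [10, 10, 10, 200, 200, 200, 200, 30, 30, 30]
    runs, edges = segment_scanline(line, threshold=50)
    print(edges)  # [3, 7]
    print(runs)   # [(0, 3), (4, 7), (8, 10)]
```

In this example two edge pixels (N = 2) yield three image runs (N + 1). Classifying each run and assigning image types to the edge pixels during cleanup are separate steps and are not sketched here.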
Parent Case Info
This is a Division of application Ser. No. 08/686,803 filed Jul. 26, 1996, now U.S. Pat. No. 5,850,474. The disclosure of this prior application is hereby incorporated by reference herein in its entirety.
US Referenced Citations (5)
Non-Patent Literature Citations (3)
- “Automatic Document Segmentation,” IBM Technical Disclosure Bulletin, vol. 31, No. 7, Dec. 1988, pp. 73-74.
- “Rule-Based System for Chinese Newspaper Segmentation,” Signal Processing VI: Theories and Applications, vol. 1, No. Conf. 6, Aug. 1992, pp. 615-618.
- “A Tool Classifying Office Documents,” Proc. of the 1993 IEEE Int'l Conference on Tools with AI, No. 5, Nov. 1993, pp. 427-434.