Claims
- 1. A system for optically scanning a medium, said medium having thereon an unknown character, said system comprising:
- scanning means for scanning said medium, said scanning means providing as output a bit-mapped image of said medium; memory means coupled with said scanning means for storing said bit-processing image;
- processing means coupled with said memory means including means for parsing said bit-mapped image of said medium and providing as output a bit-mapped representation of said unknown character, means for identifying said unknown character and means for analyzing said unknown character based on the surrounding context of said medium;
- said means for analyzing said unknown character based on the surrounding context of said medium includes means for preparing a line of text for context analysis and means for resolving character ambiguities, said means for resolving character ambiguities comprising means for analyzing said line of text to determine spatial information about said line of text in said medium and means for creating attribute data for each character in a line of text.
- 2. A method for recognizing characters on a medium, said method comprising the steps of:
- scanning said medium to produce a bit-mapped image of said medium;
- parsing said bit-mapped image to isolate individual characters and providing as output of said parsing process a bit-mapped image of an unknown character;
- identifying said unknown character; and
- analyzing said unknown character based on the surrounding context of said medium;
- said step of analyzing sad unknown character based on the surrounding context of said medium further comprising the steps of:
- analyzing a line of text in said medium to determine spatial information about said line of text;
- creating attribute data for each character in said line of text; and resolving ambiguities for a character based on said spatial information about said line of text and sad attribute data for each character in a line of text.
- 3. A system for optically scanning a medium, said medium having thereon an unknown character, said system comprising:
- scanning means for scanning said medium, said scanning means providing as output a bit-mapped image of said medium;
- memory means coupled with said scanning means for storing said bit-mapped image;
- processing means coupled with said memory means including means for parsing said bit-mapped image of said medium and providing as output a bit-mapped representation of said unknown character, means for identifying said unknown character and means for analyzing said unknown character based on the surrounding context of said medium;
- said means for analyzing said unknown character based on the surrounding context of said medium includes means for preparing a line of text for context analysis and means for resolving character ambiguities;
- said means for preparing a line of text for context analysis includes means for creating a histogram of the distances between characters in said line of text, means for determining average heights of known characters in said line of text and means for assigning attribute data to each character in said line of text.
- 4. A system for optically scanning a medium as recited in claim 1, wherein said means for analyzing said unknown character based on the surrounding context of said medium is further comprised of a database of characteristic attributes for known characters.
- 5. As system for optically scanning a medium as recited in claim 4, wherein said means for resolving character ambiguities is further comprised of means for accessing said database of characteristic attributes for known characters to retrieve characteristic attributes of said unknown character, and means for resolving character ambiguities based on retrieved characteristic attributes.
- 6. The method as recited in claim 1 wherein said spatial information comprises information describing said line of text's skew, character spacing information, and heights of character information.
- 7. The method as recited in claim 2 wherein said spatial information comprises information describing said line of text's skew, character spacing information, and heights of character information.
Parent Case Info
This is a divisional of application Ser. No. 07/799,549, filed Dec. 27, 1991 now U.S. Pat. No. 5,278,918. which is a continuation of application Ser. No. 07/230,847, filed Aug. 10, 1988, now U.S. Pat. No. 5,131,053.
US Referenced Citations (15)
Foreign Referenced Citations (5)
Number |
Date |
Country |
0138079 |
Sep 1984 |
EPX |
0144202 |
Nov 1984 |
EPX |
1403816 |
Oct 1973 |
GBX |
2182796 |
Oct 1986 |
GBX |
WO8600445 |
Jan 1986 |
WOX |
Divisions (1)
|
Number |
Date |
Country |
Parent |
799549 |
Dec 1991 |
|
Continuations (1)
|
Number |
Date |
Country |
Parent |
230847 |
Aug 1988 |
|